Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

PREDICTED: Drosophila takahashii TATA box binding protein-related


LOCUS       XM_070218573            9043 bp    mRNA    linear   INV 09-DEC-2024
            factor 2 (Trf2), transcript variant X5, mRNA.
ACCESSION   XM_070218573
VERSION     XM_070218573.1
DBLINK      BioProject: PRJNA1194641
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..9043
                     /organism="Drosophila takahashii"
                     /mol_type="mRNA"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     gene            1..9043
                     /gene="Trf2"
                     /note="TATA box binding protein-related factor 2; Derived
                     by automated computational analysis using gene prediction
                     method: Gnomon. Supporting evidence includes similarity
                     to: 11 Proteins"
                     /db_xref="GeneID:108070355"
     CDS             540..5210
                     /gene="Trf2"
                     /codon_start=1
                     /product="uncharacterized protein Trf2 isoform X2"
                     /protein_id="XP_070074674.1"
                     /db_xref="GeneID:108070355"
                     /translation="MKENEKIPSNLLEIMGIVKNSSDSGQRTSTSTSDVNEENRVSSF
                     EPGRRLVLNEVSGEFAQERDTENTSSPSPRKEESNEESADSSDKENQYPNHRPSRADD
                     SRSPDSVAETENFKDSDVEIGANSVVLSESQIQHEQDESLVSDPLEEEEEKEHEEEVA
                     SGNESEEKDSPRQNSFEEGTSGDNNSDKDSPSKASNSSSSTPKRPYSGSDNRSSEERT
                     PKRRRQSVSDSEDNRRTLFTDSEDNPIKQRRLTYREQVEAYRRRVDAGSTPGRKSSSS
                     EEEPPPPSPPFRYSELYWEQRRASDADSEPDSAFKYRPNANYHRTSPRNYSDEEEEEE
                     IPESDTDQNSGGPLGYRVETRSTNYESSETSEDEARRRTVRRVVRRRRRPRNPEHREP
                     EYRGEEQSESSPLPSSESPDRQDNSDENESDEEEKRVIPSRRRTTDDQQPSFSGWGYL
                     TCRQDADYFSEPTDGKIIRKRKSPFEEESSNQSKRRPSEDTESEDSEKEDSEKVDSEK
                     EDSEKEDSDKEDSDKEDSDKEDSDKEDSDKEDSEKEDSEKEDSDKEDSDNGGSDTEKE
                     DRNTDKRDSDTDNDSNTEDPQDPRDSQDSASDNEQVPDSTGEPDQDNEQFPDSTEEPD
                     QDNEQFPDSTEEPDQDKELGQTAAWTITPDDPKGIQDFQDFVLKQVQLINEIIEVGHQ
                     VFQLGSIVCHGDFTFFQTLYERKVPIEEVVELVYRGILGVKEELRVALIARQDLSEPH
                     SKDIIRPGDKLAYLRLDLFNKFVSKVNQSERIVDQRLLELQEVRKQQGLLEQKQRVLE
                     RVRQQREQHRLQQLQVEREREERQEEEQRRLELERHLERWKEDFNRQLRLQREQEEIK
                     LRQEEENRKREEERRSNYDSYYCQLPKHKHREQMQNEMVSIPVANLNGGLKAASSGSG
                     GVVGVVTSSGVVSSAVLANAPRVYLTPSSTFMANRGQVAAGAGATGKIAGGSNVTSSV
                     GTSSATAGTVRYFSQFSKMQTAGGPSLQRKLANGDTIVLANGNKSLFLSSSGGVGGDK
                     ATNGNSLLTAKTELLEEEVMQPGTVIVDCDDDDDEDENENEDEDDEDGEDKDEKNNSH
                     TAKSSGVQQEQPLALTTATNNAAASDDEPELDIVINNVVCSFSVGCHLKLREIALQGS
                     NVEYRRENGMVTMKLRHPYTTASIWSSGRITCTGATSEAMAKVAARRYARTLGKLGFP
                     TRFLNFRIVNVLGTCSMPWAIKIVNFSERHRENASYEPELHPGVTYKMRDPDPKATLK
                     IFSTGSVTVTAASVSHVESAIQHIYPLVFEFRKQRSLEELQHLRQKQHKQAGGDPSEL
                     EKLIVAENKAAGLDDIFLNTTAAHTKPSPNERTPATSGILTSTIDSMQRLKQIENYSH
                     MMKLTQEERRHIPFQGEKVNPAAASTSAAAAASSGDNICANARRRATECWATKLQYKR
                     PRYNDPGTTGTVNASSTAASSSAASSSSSQAAPHLRTNPLKTAALANARMRGAKMPTV
                     LKPGTRLAPIGGGYLQHQQHLQQQQQLQQQQQKMRQTSFSPSEFDVDDLIEEEESNEL
                     DMQY"
     misc_feature    <1902..2465
                     /gene="Trf2"
                     /note="MSCRAMM family adhesin clumping factor ClfA;
                     Region: MSCRAMM_ClfA; NF033609"
                     /db_xref="CDD:468110"
     misc_feature    3894..4415
                     /gene="Trf2"
                     /note="TBP-like factors (TLF; also called TLP, TRF, TRP),
                     which are found in most metazoans. TLFs and TBPs have
                     well-conserved core domains; however, they only share
                     about 60% similarity. TLFs, like TBPs, interact with TFIIA
                     and TFIIB, which are part of the...; Region: TLF; cd04517"
                     /db_xref="CDD:239953"
     misc_feature    order(3909..3914,3918..3920,3996..4001,4011..4013,
                     4017..4019,4038..4040,4044..4046,4056..4058,4062..4064,
                     4068..4070,4074..4076,4164..4166,4179..4181,4185..4187,
                     4272..4277,4290..4292,4329..4331,4335..4337,4341..4343,
                     4353..4355)
                     /gene="Trf2"
                     /note="putative DNA interaction surface [nucleotide
                     binding]; other site"
                     /db_xref="CDD:239953"
     misc_feature    order(3954..3956,3975..3983,3987..3989,3999..4001,
                     4017..4019,4023..4025)
                     /gene="Trf2"
                     /note="putative TFIIA interaction surface [polypeptide
                     binding]; other site"
                     /db_xref="CDD:239953"
     misc_feature    order(4215..4220,4254..4268,4275..4277,4290..4292)
                     /gene="Trf2"
                     /note="putative TFIIB interaction surface [polypeptide
                     binding]; other site"
                     /db_xref="CDD:239953"
     polyA_site      9043
                     /gene="Trf2"
                     /experiment="COORDINATES: polyA evidence [ECO:0006239]"
ORIGIN      
        1 agccagcaag agagagggag agcagcagca gcagcgcggc ggcggccaca gcaacaaagc
       61 cacaaaagag agcgacggcg agagagacag agagagaggg ggagagaact agacgacgcg
      121 acgacctact acaaaaaaag gctctccaag gaagaaagga taaaagaaga aacaaccatt
      181 caaatattta aggactttta agagactgca aaggaacaac aaaaactaca agctaattta
      241 aataggagaa agaacggaag actagacata atcgtaatac agagtagttt agcattaggg
      301 aaccgacaac aacaaaaacc actgcaaaca ctaattctaa ttcaaaatta cactaaaaga
      361 gtattttttc agccctaaga gcaaacacac acacacatac acaccacaca cacaaacgaa
      421 agaagaagaa tttgtagaag aatcactttt agccttatat acaactacaa agaaaaaaaa
      481 aagaacgcaa gttagtatta agttaagatg ctagaaagag aggacgtagc aagaatccta
      541 tgaaagaaaa cgagaagatt ccaagtaatt tgcttgaaat catggggatt gttaaaaaca
      601 gcagcgattc cgggcaaagg acttcgactt cgacttctga cgtgaatgaa gaaaacaggg
      661 tatcttcatt cgagccaggg cgtcgtcttg tcttaaacga agtaagcgga gaattcgctc
      721 aagaaagaga caccgagaac acaagttccc cgtcgccacg caaggaagaa agcaacgaag
      781 aatcagcaga ttcttcggat aaggagaatc aatacccgaa tcatcgtcct tcaagggcag
      841 acgatagtag atctccggat tctgtcgcgg aaactgaaaa ctttaaggat tctgacgttg
      901 aaatcggcgc aaactcggta gtactttctg aatcccaaat tcagcacgaa caggacgaat
      961 cattagtatc tgatcctctc gaagaagaag aagaaaaaga acacgaagaa gaggtcgcta
     1021 gcggtaacga gtcggaagaa aaggatagtc cgaggcaaaa tagcttcgag gaaggaactt
     1081 caggcgataa taatagcgat aaagatagtc cgtcgaaagc ttcaaattct tcaagttcga
     1141 ctccaaagcg gccttattcg ggttcagaca atcgaagctc cgaagaaagg acccccaaga
     1201 ggcgacgtca gtcagtttca gatagcgagg acaaccgccg aactttattt actgactcgg
     1261 aagataaccc cattaaacag aggagattaa cttaccgtga acaagtagaa gcctatagac
     1321 gtagagttga cgccggaagt acaccaggac ggaagagttc atcttcagaa gaagaaccgc
     1381 cgccgccttc gccaccattt agatattcgg aattatattg ggagcagcgg cgtgcttcag
     1441 acgccgactc tgaacctgat tccgccttca aatatagacc taatgcgaat tatcaccgta
     1501 catcaccgcg aaattatagt gacgaagagg aagaggagga gatcccagaa agcgataccg
     1561 atcagaattc aggaggacca ctaggctata gggtagaaac ccgtagtacc aattacgagt
     1621 caagcgaaac gtcagaagac gaggctcgca ggaggaccgt gaggcgtgta gttcgccggc
     1681 gtcgtagacc acgcaatccc gaacatcgcg aacctgagta tcgcggggaa gagcaaagtg
     1741 agagttcgcc actaccgtcc agtgaatccc ccgatcgcca agacaacagc gacgaaaacg
     1801 agagcgacga agaggagaag cgggttatac cttcaaggcg tcgcactacc gacgatcagc
     1861 agcccagctt ctcaggctgg ggatatctaa cgtgtcgtca ggacgcagac tatttctcgg
     1921 aacccaccga cgggaagatc atcaggaaga ggaagtcgcc atttgaggaa gaatcctcga
     1981 atcaatcaaa gagacgacca tcagaagaca ccgagagcga agactctgag aaggaagact
     2041 ctgagaaggt agattctgag aaggaagatt ctgagaagga agattctgat aaagaagatt
     2101 ctgataaaga agattctgat aaagaagatt ctgataaaga agattctgat aaagaagatt
     2161 ctgagaaaga agattctgag aaagaagatt ctgataaaga agattcggac aacggaggtt
     2221 ctgataccga gaaggaagat cgaaataccg ataaacgcga ttccgatacc gataacgatt
     2281 ccaatactga agatccgcaa gatcccagag attctcaaga ttcggcttct gacaacgagc
     2341 aggttcccga ttctactggg gagcccgacc aagacaacga gcagtttccc gattctaccg
     2401 aggagcccga ccaagacaac gagcagtttc ccgattctac cgaggagccc gaccaagaca
     2461 aggaactagg acagactgcc gcttggacca tcacacccga tgatcccaag ggcattcaag
     2521 acttccaaga cttcgtccta aagcaggtgc aactgatcaa tgaaatcatc gaggtcggcc
     2581 atcaagtatt ccaactggga tcaatagttt gccatggtga ctttacgttc tttcaaacac
     2641 tttacgagcg gaaagttcca atagaagagg tcgttgagct agtctaccgg ggcattctcg
     2701 gtgtcaagga ggagctcagg gtggcattga tagcccgcca agatctaagc gagccacatt
     2761 cgaaagacat catcaggcca ggagataaac tagcctattt aagattggat ttgttcaata
     2821 aatttgtatc aaaagtcaat cagtccgaga ggatcgtgga tcaacgcctt cttgagctgc
     2881 aggaggtacg gaaacagcag ggactactgg agcagaagca gcgcgtcttg gagagagtgc
     2941 gccagcagcg ggaacagcac cgcctgcagc agcttcaggt ggagcgggag cgtgaggagc
     3001 gccaggagga ggagcagcgg cgcctggagc tcgaacgcca cttggagcgt tggaaggagg
     3061 acttcaatcg ccagctgcga ctgcagcgcg agcaggagga gatcaagttg aggcaagagg
     3121 aggagaacag gaagcgagaa gaggagcgca ggagcaacta cgacagctac tactgccaac
     3181 tgccgaagca caagcaccgc gagcagatgc aaaacgaaat ggtcagcatt ccagtggcta
     3241 atttgaacgg tggcctcaag gcggccagca gtggatccgg tggtgtggtc ggtgtggtca
     3301 cctcctccgg cgttgtctcc tcggccgtgc tggccaatgc gccgcgtgtc tatctgacgc
     3361 cctcctccac attcatggcc aatcgtggtc aggtggcggc gggggcgggg gccaccggaa
     3421 agattgcagg agggagcaac gtaacttcat ccgtcggcac atcttcggcc accgccggca
     3481 ctgtacgcta cttttcgcaa ttcagcaaaa tgcaaaccgc cggcggaccc agtttgcagc
     3541 gtaaactggc caacggcgac accattgtgc tggccaacgg caacaagagt ctgttcctca
     3601 gcagcagtgg aggcgttgga ggcgacaagg ccaccaatgg caatagtttg ctaacggcca
     3661 aaacggagct tctcgaggag gaagtcatgc agccgggaac tgtgattgtc gactgcgacg
     3721 acgacgacga cgaggatgag aatgagaatg aggatgagga cgatgaagat ggggaggata
     3781 aggatgagaa gaacaacagc cacactgcca aatccagtgg ggttcaacag gagcagccgt
     3841 tggctttgac caccgcaacc aacaatgcag ccgcctccga tgatgaaccc gaactggata
     3901 ttgttattaa taatgtggta tgctccttca gcgtgggctg ccacctcaag ctgcgcgaaa
     3961 tcgctctcca gggctcaaat gtcgagtacc gtcgtgagaa cggtatggtg acaatgaagc
     4021 tccgccatcc gtacacgacg gcatccattt ggtcctcggg caggatcacg tgcactggag
     4081 ccacctccga ggcaatggcc aaggttgcgg ctcgccgcta tgcgaggacc ctgggcaagc
     4141 tgggatttcc cacccgcttt ctcaacttcc ggatcgtgaa tgttctgggc acctgcagca
     4201 tgccctgggc catcaagatc gtcaatttct cggagcgcca tcgcgagaat gcgagctacg
     4261 agcccgagct gcacccgggc gtgacctaca agatgcgcga tcccgatccc aaggccacgc
     4321 tcaagatctt ctccaccggc agcgtgaccg tgacggcggc cagcgttagc cacgtggagt
     4381 cggccattca gcacatttac ccgctcgtct ttgagttccg caagcagcgt tcgctggagg
     4441 agctgcagca tctgcgccag aagcagcaca agcaagcggg cggtgatccc agcgaactgg
     4501 agaagctgat cgttgcggag aacaaggcgg ccggactgga tgatatcttc ctcaacacaa
     4561 cggcggcgca cacgaagccc tcgccgaatg agcgtacgcc ggccacatcg ggcatcctca
     4621 cgagcaccat cgatagcatg cagcgtttga agcagatcga gaattattct cacatgatga
     4681 agctgacgca ggaggagcgc cgacacatac ccttccaggg tgagaaggtt aacccggcgg
     4741 cggcttctac ttcggcggca gctgctgctt cgtcggggga taatatctgt gccaatgccc
     4801 gtcgccgggc caccgagtgc tgggccacca agctgcagta caagcgtccg cgttacaatg
     4861 atcccggcac cacgggcacc gtcaatgcct cgtcaacagc cgcctcctcc tcagctgcct
     4921 cgtcatcctc ctcacaggca gcgccacacc tgcgtactaa tcctctcaag acggccgctt
     4981 tggccaatgc ccgcatgcgt ggcgccaaaa tgccaacggt gctcaagccg ggcacgcgtc
     5041 tagcgcccat cggcggtggc tatctgcagc accagcagca tctgcagcag cagcagcagc
     5101 tgcaacagca gcagcagaag atgcgccaga ccagcttctc gcccagcgag ttcgatgtgg
     5161 acgatttgat cgaggaggag gagagcaacg agctggacat gcagtattga gctagccaag
     5221 cagctgccca agccaaaaaa acacatttaa acattaacat ttaaatgatg ataattcgat
     5281 ggaaaagaaa agcaaacaga gagcttttcc ccgaatttgt tggcactttg tttaagccat
     5341 ttttcatcac aatacaaaca cacacattta tttttgtgct cgccgcattt aaaacacgaa
     5401 ctcgaggaaa accggacgca ggagcgaaaa caaaaaaata tttaaaaaca aaaaaggatg
     5461 aaaggagaac attttaatcc aaagcgaaac aaaattccca gcagcaacac caactaaata
     5521 gcagcagcaa caacaaaacc aagagccaaa tatttaactt ataagattta cgtttaaacg
     5581 actttgtaac ttaagcgagc acatctgaaa caaactagca gagttagcat atagcgatga
     5641 tttcgaaccc gaaggattcc cagcagcagc agcaacctca agaacatcca gatggtaaac
     5701 acacagagag agcttaccac aacgtaaacc taagtgtaga cttaaagaaa gcgttatgaa
     5761 tacgtatatg ccagcgtttt acgtatgcaa ccaaaaaaaa cgaaatgtgc ggattgcttt
     5821 gaaggaggag gagttacaat ctattttttt tttactcggt ttaaatagtt aatttatgtg
     5881 agcaagacca ccgcctaagc aaatactgta aacccgaatg gatcacatac gacttatata
     5941 tatcaagata gcctttccct ttttttttgt tgtttgacca agaccaagtt cttccggagc
     6001 aaaccaaaat tctcaagagg aaaggagcag aggaggagac cagagaaaac actaggataa
     6061 tattaactga tttatttcat gtattttcac acacccattt aataattttt tttgttttcg
     6121 ctcgtaagtt ctagatgcta taagcgttgc ttcatgtgaa gcaaaacaaa caaaaacatt
     6181 tgaatacgat tataattata ttataatacc aacacacaca caaacaccta tatctggtca
     6241 ctaataacca attataacaa aactgaaaat acagcaagta attatgtttt aatttatgct
     6301 agtagttttt gggttgcgca gaagaacact caatcgaatt aggaataaac acctggatga
     6361 tagaagattg ttgtagggtt tatttttaaa gcaaccaggt gaatgcgcga cttgaaacta
     6421 ttatcagacg atgtatacct aatgcatata ttcgtatata tactataaaa tatattaatg
     6481 tttttttttt atgtaattcg cattattcgc ataatttaat gacaaattta gaggaagcaa
     6541 ctttaagcgt agatttacaa acttgttcct gtgttgttaa aatcagaaag taaagaactt
     6601 attttagtta acgaaactta acgaagcata cgcatacaca aacactcaca cacacacaca
     6661 cattaacaca agcatacgta tacatataca taatgttaag gcatccaaac aaagcaacaa
     6721 acaaacaaac aaaagagcga atcataagca ataatagaat gttttcgttg taaagaggta
     6781 aaaactaaac aaacaaatac tcacgttaaa cataaaaaat aaactctaga tgttttggcg
     6841 tgccttcacc cgtaaatagg aaaaacaaaa agcgagatac gatagccaat tgtacaatca
     6901 ttgaaatagt acttccagac cagaacgaac ccaaaactct ttgtacatag acccaaaatt
     6961 gatgttttcc aattgaacgt gaatattgcc tcatcaactt gaaatttctt ttgtgcttaa
     7021 cgtaaaaacc aactttcata tcgttcaaaa cttatgcata atcaaaatag ctaaaacaaa
     7081 caaaagagat ggggaatatt taagagacgt aatatcatat attttttttt atattttttt
     7141 tattttgtat tctgtctttc tttgataatt tatattttga tttgcataaa caaaatcgag
     7201 aaatgaaaat gcctttaggg aacaaataac tagcgcggaa aggaaagaga ggggagatag
     7261 gtttaggtat tgagaaagcg agagatagag agaaggatat tgaacagatc aattggagcc
     7321 gaaatgtatt tttttgtgta tgcgaatgtg gatagagata gtagggatag gaataggtct
     7381 cgagacagac ccgaacgtgt tacaattgca aaattaacta gccactaagc cacttagccc
     7441 taagttagct gagaaacgga atactaaatt gtaagccaaa atgcatatag caatctaaca
     7501 aaaaaaacaa aatctaatgg aacctaacga cgagaaccta aatgtaatct aaatgtacat
     7561 aaaatagttg ttaacaaatc acaaaaaaca aacgaaagca agaaaccaac aatagagtaa
     7621 agtcttcacg tctaactgcc ttcgaaaaaa caaaaagaaa gatacagaaa aagaaaaaga
     7681 aaggaaaatc ggaagtattc tattctaaac accgactgta aattgcttgt actttcacca
     7741 cattttcttt tttgttaaag caacgacagc aacaacaaca acattaatag aacagaacag
     7801 acggatcgaa aggcctaaat aagtgtattt ggtttcaaga actttaaaaa agcaaaacta
     7861 aatattaatg atagtataca gatatatttt tgattattat gtatttaatg agaacaatgt
     7921 aatgaaaatg atgcaagtat cgatatcgat atgataacta agagaaaaca aattgagaaa
     7981 tcaaatagca taattctgtt tttgttttgc cgtaaagact gcattatacc aacaattaat
     8041 tatatacaaa ataataataa taataatatt aataaatcaa taactgataa atggcgagta
     8101 agcgagagct ttgcattgag cagagaccaa aacggatgaa taaatatata tatatatatt
     8161 aaaacaacca gaaaaaatat tctaaaaacg aatacaaaaa aaaaaacaaa acaaacaaat
     8221 atgaaagaaa aacaaaaaac atggaaaaca aaaccaagaa acatttattt attagatcct
     8281 aagagaagtt tcgctagcgg agatcgagtc aggccagagc ggaccttaaa ggtaccccgc
     8341 atcccagaac gaaaagcaag atgaaattac cttaccaaaa cgtgggcgag tctacaaaag
     8401 tctataatag ggcgataggg agaatcggaa taattagaga actatcgtat gataaatcaa
     8461 ttaccaacta actggctaac ttatacccac taagatttat agttccccct aatccctgaa
     8521 actagttcta agtacataag ttgcaagcga aaaaaaaact aaaagttaga tttatacaca
     8581 aaaggatata tacatcatat atatatatat atgtggagaa ggaggaaggt ttactttggt
     8641 tttgagattg ccgcatcttc acatctcaag caatacgttt gtataccatt tgttattccg
     8701 aaacgcgacg gttttcgaaa tcgccagttt tgggcgattc gaaacatcga aacgtttcgt
     8761 agctaacact tggatatgaa tatttaacaa aaagccacac aaacattaac acgacaacaa
     8821 caacaacact catttgatgt atttagttta acgtttagcc cctagtttca agaaaaatca
     8881 cacacgaaac aagaaaacgc aaaattcgag gaacgtgaaa atgcctttcc tgaactgcag
     8941 tcaacactca aatatatggc aacaaaaaaa caacaaaaat gagaaaagaa acaaaccaac
     9001 caaatgaaag caaattgttt gaataaaaat caaattacat aaa