LOCUS       XM_022367794            2271 bp    mRNA    linear   INV 14-MAY-2021
DEFINITION  PREDICTED: Drosophila obscura hornerin (LOC111074833), mRNA.
ACCESSION   XM_022367794
VERSION     XM_022367794.2
DBLINK      BioProject: PRJNA728747
KEYWORDS    RefSeq.
SOURCE      Drosophila obscura
  ORGANISM  Drosophila obscura
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_024542752.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            On May 14, 2021 this sequence version replaced XM_022367794.1.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Drosophila obscura Annotation
                                           Release 101
            Annotation Version          :: 101
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 8.6
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..2271
                     /organism="Drosophila obscura"
                     /mol_type="mRNA"
                     /isolate="BZ-5 IFL"
                     /db_xref="taxon:7282"
                     /chromosome="Unknown"
                     /sex="male"
                     /tissue_type="whole fly"
                     /dev_stage="Adult fly"
                     /geo_loc_name="Serbia: Babin Zub"
                     /collection_date="2017"
     gene            1..2271
                     /gene="LOC111074833"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Gnomon. Supporting evidence
                     includes similarity to: 1 Protein, and 100% coverage of
                     the annotated genomic feature by RNAseq alignments"
                     /db_xref="GeneID:111074833"
     CDS             161..2119
                     /gene="LOC111074833"
                     /codon_start=1
                     /product="hornerin"
                     /protein_id="XP_022223486.2"
                     /db_xref="GeneID:111074833"
                     /translation="MSELAKLQEQLGLGQQQQGQQQQQSVELLECDIEFDEEQPQSSQ
                     QAGGIEQPTRANNQRSDALFIEAIKKVSKVHYMLEQAMVLLAAEEASEGAKSHSKKSK
                     KQHKKDKKKDKKKDKKKAKKQEKKRRQSAAAAVGEQPDVELDDEELPSTSRRRSDSIS
                     STSSSSSSSSSSSSSSSSSSSSSSEDESHYYAYGWGRKHWGGGWPYGHHGRDRSRSRS
                     RSRGRHGHGHGGHRRHDNGHGHGHGHGHGHGHGHGHRRHSHGVHHFGPHHFAHQHFGP
                     QHLGPYGFGPRDWAAACPAGFDRRAAWLAPWTEQSRFFRSAGLPWALEHSAARRRRSI
                     SRPPGRGDWRHQHGEHQRRGHGHGHRHGHKGGSHDRRVARSSSSSSSRSSSSSESEPE
                     LSKGGRGRKGRRRSSHHGKRSSSSSSTRRTDNERKHKEKQGGGRKQSRAEGHHGPHHG
                     PHHGPHHGPHHGPHHGPHHGPHHGPHHGPHHGPHHGRHSFPHGPNFGLHLGPDFAWHF
                     HGRYGGHPAVGLEGAPTYGPPTAPSYGPPPDWHGRRCSQFRSRSRSRCHKKKEERKGS
                     RSHSRSRSRSRSGGIKHGKDMEKKHQGHGKGHISNPVHGLHRGSIPNRGHGNHHRGSV
                     PHFEHHRSALHRGSVPLHGPESPWGWGA"
ORIGIN      
        1 atgccggccg gcaagcgttc gttaccagtt cgaagcttgc agcgacagcg aaagcaaagc
       61 caaaacaaca gttcacacac acacacacac acacactcgt acacagaagc agaaagccaa
      121 agcttaacgt acatctgatt ggagcaaggg agacagaaag atgagtgaat tagcgaaatt
      181 gcaggagcaa ctgggcctgg gccaacagca gcagggacag cagcagcaac agtcggtgga
      241 gctgctcgaa tgtgacatcg agttcgacga ggagcagccg cagtcctcgc agcaggcagg
      301 gggcatcgag cagcccacca gggccaacaa tcaacggtcg gacgcgctct tcatcgaggc
      361 catcaagaag gtatcgaagg tgcattacat gctggagcag gccatggtcc tgctggccgc
      421 ggaggaggcc tccgagggcg ccaagtccca ctcgaagaag tccaagaagc agcacaagaa
      481 ggacaagaag aaggacaaga agaaggacaa gaagaaggcc aagaagcagg agaagaagcg
      541 caggcagtcc gctgcggcgg ccgtcggtga gcagccggac gtggagctcg acgatgagga
      601 gctgccctcc accagccgtc ggcgctccga tagcatcagc tcgacgagct ccagctccag
      661 ttccagctca agctccagct cctcgagcag ctccagctcc agctcaagct cggaggacga
      721 gtcgcactac tatgcgtatg gctggggacg caagcactgg ggtggcggat ggccctatgg
      781 ccaccatgga cgcgatcgtt cccgatctcg cagtcgttct cgtggacgtc atggccacgg
      841 tcatggaggg catcgtcgtc acgacaacgg acatggacac ggccacggac atggacatgg
      901 ccacggacac ggccacggac atcgtcgtca tagccatggc gtccatcatt ttgggcccca
      961 tcacttcgca caccaacatt ttggacccca gcaccttggt ccttatggct ttgggccacg
     1021 cgactgggcg gccgcctgtc cggccggatt cgatcgccgt gccgcctggc tggcaccttg
     1081 gaccgagcag agccggtttt ttcggtcggc gggcctgccg tgggccctgg aacacagtgc
     1141 ggcaagaagg aggcgatcga tctctaggcc accaggccgt ggagattgga gacatcagca
     1201 cggtgagcat cagcgacgtg gacacggaca cggacacaga catggccaca agggtggttc
     1261 acatgaccgt cgagtagctc gttcgagctc cagctccagc tctcgctcga gttccagttc
     1321 ggagagcgag ccggaactga gcaagggagg acgcggacgc aagggtcgac gtcgcagcag
     1381 ccatcatggc aaacgctcct ccagttctag ctccaccagg cgtacggaca acgaaaggaa
     1441 gcacaaggag aagcagggtg gcggccgcaa gcagagtcgt gcagaagggc atcatggacc
     1501 gcatcacggt cctcatcatg gaccccatca cggaccccat cacggtcctc atcacggtcc
     1561 gcatcacggt ccgcatcacg gtccgcatca cggtccgcat catggaccac atcacggtcg
     1621 tcatagtttc cctcacggac ctaattttgg actccatctt ggccccgatt tcgcctggca
     1681 ttttcatgga cgctatggag gccatccggc tgtcggcctg gaaggagcac ccacctatgg
     1741 ccctccaacg gctcccagct atgggccacc gcctgactgg catggccgca gatgcagtca
     1801 attccgttcg cgttcgcgtt ctcgctgtca caaaaagaag gaagaacgaa agggttctcg
     1861 ctcccactct cgctctcgct cacgatcgcg ctcaggcggg ataaagcacg gcaaggacat
     1921 ggaaaagaaa catcagggac atggcaaggg acacatttca aatcctgtac acggcctcca
     1981 tcgtggatcc attccaaatc gtggacacgg caatcatcat cgtggatctg tgcctcattt
     2041 cgagcatcat agaagcgccc ttcatcgcgg ctccgttccc cttcacggtc ctgagagccc
     2101 ttggggctgg ggtgcctaag cgacagcggt gacagcggcc ctcccacaac aaaaaaatat
     2161 caatttatct tcaatttaac caacctcaaa catattcatt gatgttgtgc cccacaaact
     2221 attattatta aacgcgtata ccaaaaaaaa atatatatat tataatcctt a
//