LOCUS       XM_041591990            5270 bp    mRNA    linear   INV 14-MAY-2021
DEFINITION  PREDICTED: Drosophila obscura mucin-19 (LOC111066368), partial
            mRNA.
ACCESSION   XM_041591990
VERSION     XM_041591990.1
DBLINK      BioProject: PRJNA728747
KEYWORDS    RefSeq; corrected model.
SOURCE      Drosophila obscura
  ORGANISM  Drosophila obscura
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_024542752.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Drosophila obscura Annotation
                                           Release 101
            Annotation Version          :: 101
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 8.6
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
            
            ##RefSeq-Attributes-START##
            frameshifts :: corrected 1 indel
            ##RefSeq-Attributes-END##
            COMPLETENESS: incomplete on the 3' end.
PRIMARY     REFSEQ_SPAN         PRIMARY_IDENTIFIER PRIMARY_SPAN        COMP
            1-119               JAECWW010000165.1  3052859-3052977
            120-248             JAECWW010000165.1  3055772-3055900
            249-369             JAECWW010000165.1  3055994-3056114
            370-968             JAECWW010000165.1  3056145-3056743
            969-1445            JAECWW010000165.1  3057645-3058121
            1446-4727           JAECWW010000165.1  3058196-3061477
            4728-5270           JAECWW010000165.1  3061479-3062021
FEATURES             Location/Qualifiers
     source          1..5270
                     /organism="Drosophila obscura"
                     /mol_type="mRNA"
                     /isolate="BZ-5 IFL"
                     /db_xref="taxon:7282"
                     /chromosome="Unknown"
                     /sex="male"
                     /tissue_type="whole fly"
                     /dev_stage="Adult fly"
                     /geo_loc_name="Serbia: Babin Zub"
                     /collection_date="2017"
     gene            1..>5270
                     /gene="LOC111066368"
                     /note="The sequence of the model RefSeq transcript was
                     modified relative to its source genomic sequence to
                     represent the inferred CDS: deleted 1 base in 1 codon;
                     Derived by automated computational analysis using gene
                     prediction method: Gnomon. Supporting evidence includes
                     similarity to: 4 Proteins, and 99% coverage of the
                     annotated genomic feature by RNAseq alignments, including
                     17 samples with support for all annotated introns"
                     /db_xref="GeneID:111066368"
     CDS             51..>5270
                     /gene="LOC111066368"
                     /note="The sequence of the model RefSeq protein was
                     modified relative to its source genomic sequence to
                     represent the inferred CDS: deleted 1 base in 1 codon"
                     /codon_start=1
                     /product="LOW QUALITY PROTEIN: mucin-19"
                     /protein_id="XP_041447924.1"
                     /db_xref="GeneID:111066368"
                     /translation="MGWERRVKPRMIRMLGKRQKNKKTKYDAWEDSDFNDLDDASLKK
                     LLEEAYWYRNPGDRKNKSERFLQMLKKAEYDEEISYRAIKSCLTLNPITLATSASTSG
                     AAGIGSNSNNNTHHTNNRSGERHKQGGSLQDLVEAAHRSGSATSSAAAAAAAASATSA
                     TTQRFNCDYQLSGRANRRHQQQQQQQHNNNQNAPSASKKIKNWSASGRQREGGSLPSS
                     VNFESAATSMDAKLKQTMAESSTTSSQSQASTGKISAGSCSSLPSHLGETEAGTQFDM
                     DEDETGGGAASGSAGAGPGPGHIQQEYLLELSGDNRVIKKKKPGANQSMSYLSTKVLD
                     NIMQNYSPQQPESKDSAIGSEYQIGETMPFLGPVGVRPPGTPHIHLHQQHVQLQQQYP
                     TGAHFERGADDPTRKVDAGYVSLSDSYPACSSVYADTSTVSHANSRATTSSGDVFGKK
                     GGRAGPISTGAKSTSRNLDENGNALSDGYGTNNTPSSSSNNGNGNGGGGSSSNQFTAL
                     ITTTMGANAPASSTNRQNSVSTLLTTGSNTTTTAMAAAAVAASFNSQLQQVTVGNGTA
                     VAAGAAAGVGAGAVVSGASAGTGGQPTKTKSKKRSQQERNTTTVDAESVAGYRGKESV
                     EQLVKYIENDGNSGGQKKKERKKQNQKLKKSNSLEELRSCSKMEADDLKRESATTEMM
                     PQKKDANNGNASGSGKHNSASVADISKNCSKEQQQATTVQVQVRGAAAAQQRKGERRS
                     WGTEELQYLGDQDYRKPGLGKAAARSEPDIEAEPMPLSLPALSCMSEMDALNTVLSET
                     AEFHVVTKKKKPKKQRAVTMDDAAVAAAAHAGGNLQRMQQITKSASSNMMSQRAHYYT
                     AYTNSGGSSSTGSSNHHGQYPSKSAQQQQPQYQETQPQHHHHHHHHYHHHHHHGGSGS
                     AKKDSSRRKSTSSMPPSEKSDSSDLDSVHSLPIQTVKKKSLGFSSSGGSSSGGAATSN
                     KERAAQAATQRQVKKKTLTPAPISYADIARNKREALKNASGGIGNGSANASDPEPTEA
                     VAGGAAADKPTTGGKGSKSKVKPDFPELPVALSIPVAISTGTSTSTSTNTASNQVASS
                     SSNNSSSAGSISYSKSLNATPASSTSDVDASPELTPTCTTAPSSSLSSSQPSLQKSKS
                     VEHDATYSFNSSNLDQQYPALEKTVKRHSTTNVSLTAAACPSAGSGSGSGSAAGTATS
                     SIFNFAAATKLQMVDKSTGTTSLQTTSAKSKAKSKDLSYSSASSTSSSSGSSSGSSSK
                     KSMKLGHDEAATVPVPVPAPVPAPVPVAQKSTTATQTEGTKKMGHKMSHSSSKLTPEI
                     IAGRPAVIILNDDRDSGEEGLNNEFIFGDFNEDELKLFDDKKEEKEKEEKEIKKEKQK
                     ENEKSEKKVQPKPESQSKPQAEPKPKVQPTKSQAEPQPQPKPQAEPQPRPKLQAEPQP
                     KPKPLATQMDKQEPIDIDPEDDDDQIPEAEPEQIGEDKAEKKEAENTTATQSDVSSSS
                     YQQLILNDSGAASDVVNSSASLDMLSISGEAVQSSSPNSAAISTNSGSSSLINSSTSS
                     TPSSDSAQNSNSSSSVSISSSSGGNAPLGGLCSGHGSGAAYVANQSSDSGIYGAANTS
                     VKDHINQFLSRASSSSDEPLPNVPISMQQLETCNDIEAAIIAAARAAAAARSNNCSRR
                     NSQELEPPQSQSLSKSQQHQSPVAKSKIVPVYTTYNVGVQDYDEDLEELSFLADLRDA
                     EVDVEQEAEVKAEA"
     misc_feature    <4209..>4541
                     /gene="LOC111066368"
                     /note="transport protein TonB; Provisional; Region:
                     PRK10819"
                     /db_xref="CDD:236768"
ORIGIN      
        1 tcttgacgat acgtcaaagc tggaaggacc cacacgcagg tgcctgcagt atggggtggg
       61 agcgaagggt aaaacccagg atgatcagaa tgttgggaaa gcgacagaaa aataaaaaga
      121 caaaatacga tgcctgggag gatagtgact tcaatgatct cgatgatgcg tcattgaaga
      181 agctactcga ggaggcctat tggtaccgta atcctggcga caggaaaaac aaaagcgagc
      241 gctttctgca aatgctcaaa aaggctgagt acgacgaaga gatctcttat cgtgccatca
      301 aatcctgcct cacactcaac ccaatcacgc tagcgacgtc agcgtccaca tccggtgcag
      361 ccggcattgg cagcaacagc aacaacaaca cccaccacac aaacaatcga tcgggggagc
      421 gccacaagca gggcggctcc ttgcaggatc ttgttgaggc ggctcatcga agtgggtcgg
      481 ccaccagctc tgcagcggca gcagcagcag cggcgtcagc cacatcggcg acaacccaac
      541 gtttcaactg tgattaccag ctgagtggac gcgcgaaccg tcgccaccag caacagcaac
      601 agcagcagca caacaacaac caaaacgcac cctctgcctc caagaagatc aagaactggt
      661 ccgcgtcggg acggcagcgc gagggaggaa gccttccgag tagcgtcaac tttgagtcag
      721 ccgccacgtc catggatgcc aagctgaaac agaccatggc cgagtcatcg acgacatcct
      781 cccagtcaca ggcttcaact ggcaagatca gcgccggcag ctgcagcagc ctgcccagcc
      841 acctgggcga gactgaggcc ggcacccagt tcgatatgga cgaggatgag acgggcggtg
      901 gtgctgcgag tgggagtgct ggcgctggcc ctggccctgg ccacatccaa caagaatatc
      961 tgttggagct ttcgggtgac aatcgcgtaa tcaaaaagaa gaaaccaggc gccaatcagt
     1021 caatgtcgta tctcagcacc aaagtgctcg ataatatcat gcaaaactat agcccgcagc
     1081 agcccgaatc gaaggactcg gccattggca gtgagtatca aattggcgag acaatgccgt
     1141 ttttggggcc cgttggcgtt aggccacccg gcacgcccca catacatcta caccagcaac
     1201 acgtacagct gcagcaacaa tatccgacgg gtgcgcactt tgaacggggt gcagacgatc
     1261 ccacacgcaa agtggacgcc ggctatgtat cgctgagcga cagctatccg gcctgctcat
     1321 cggtctatgc cgatacatcg acggtgtccc atgcaaacag cagagccacc acctcttcgg
     1381 gggatgtgtt tggaaagaag ggcggacgtg cgggtccgat ttccacgggt gccaagtcca
     1441 catcgcgcaa ccttgatgag aacggcaacg ctctgagcga tggctatggc accaataaca
     1501 cgcccagcag cagcagcaac aatggaaatg gcaacggcgg tggcggcagc agctccaatc
     1561 agtttacggc cctgatcact accacgatgg gtgccaatgc accagcatcg tccacaaacc
     1621 gacagaacag cgtgtccacg ctgctcacaa ccggaagcaa caccaccacc acggcaatgg
     1681 cggccgctgc agtggctgcc tccttcaata gccagttaca gcaggttacc gttggcaacg
     1741 gcaccgcagt cgcagctgga gctgcagcgg gagtgggtgc cggagcagtt gtaagtggcg
     1801 ccagtgccgg cactggggga cagcccacca agaccaagtc aaagaagagg tcgcagcagg
     1861 agcgcaacac aaccactgtc gatgctgagt cggtggccgg gtatcgcggc aaggagtcgg
     1921 tggagcagct ggtcaagtac attgagaacg atggcaacag cggcgggcag aaaaagaagg
     1981 agcgcaagaa gcagaaccag aagctgaaga agagcaactc gctggaggag ctgcgcagct
     2041 gttccaaaat ggaggcggac gacctgaagc gcgagtcggc caccaccgag atgatgcccc
     2101 agaagaagga cgcaaacaat gggaatgcca gtggcagtgg caagcacaac tctgcttctg
     2161 tggcggacat aagcaagaac tgcagcaagg agcagcagca ggccacaact gtccaggtgc
     2221 aagtgcgtgg agcagccgct gcccagcagc ggaaaggcga acgccgctcc tggggcaccg
     2281 aggagctaca gtatctgggg gatcaggact accgcaagcc cggcttgggt aaggcggcgg
     2341 cccgctctga gcccgatatc gaggcggaac cgatgccact ctccctgcca gccctctcct
     2401 gcatgtcgga aatggatgcc ctgaacaccg tcctctcgga gacagccgaa ttccatgtgg
     2461 tgacaaagaa gaagaaacca aagaagcagc gagccgtcac catggacgat gcggccgttg
     2521 cggccgctgc ccatgcgggt ggcaatctgc agcgcatgca gcagatcacc aaatcggcct
     2581 cctccaacat gatgtcgcag cgggcgcact actacacggc gtacaccaac agcggcggca
     2641 gcagcagcac cggctccagc aaccatcacg gccaatatcc gagcaagtcg gcacagcagc
     2701 agcagccgca gtaccaggag acacagccac agcatcacca tcaccaccat catcattacc
     2761 atcatcatca ccatcatggt gggtcgggat cggccaaaaa ggacagctcg cgacgtaagt
     2821 ccacatcctc gatgccgccc tcggagaaat ccgattctag tgatctcgac tcggtccact
     2881 cgttgcccat tcagacggtc aagaagaaga gcctcggctt tagcagcagt ggcggcagca
     2941 gcagcggcgg cgccgccacc agcaacaagg aacgggcagc gcaggccgcc acccagcgtc
     3001 aggttaagaa aaagacactg acaccagctc ccatttccta tgcggacata gcccgcaaca
     3061 agcgggaggc cctgaagaat gccagtggcg gcatcggcaa tggcagcgca aatgccagcg
     3121 acccagagcc aacggaagca gtagctggcg gagcagcggc tgataagccc accaccggcg
     3181 gcaagggcag caaatctaag gtcaaacccg atttcccaga gctgccagta gctctatcca
     3241 taccggtagc catcagcacc ggcaccagca ctagcaccag caccaatacg gccagcaatc
     3301 aggtggccag tagttccagc aacaattcgt cgtcggcggg atcgatcagc tactcgaaga
     3361 gtctgaatgc gacgcctgcc agcagcacca gcgacgttga tgcctcgcca gagctcacac
     3421 ccacatgcac gactgcgccc agcagcagcc tgtcctcgtc acagccctcg ttgcaaaagt
     3481 ctaagagtgt ggagcacgat gccacctata gcttcaacag cagcaacctc gatcagcagt
     3541 atccagccct ggagaagact gtgaagaggc atagcacaac caatgtttcc ctgacagctg
     3601 ccgcctgtcc atccgctggc tctggctcgg gctccggatc ggctgcgggc acggcaacgt
     3661 cttcgatttt caattttgct gccgcaacta aattacaaat ggtagacaaa tcgacgggaa
     3721 ccacatcatt acagactacc tcagccaagt ccaaggccaa gtcaaaggat ctgtcataca
     3781 gcagcgcgag cagcaccagc agcagctcgg gcagtagctc gggcagcagc agcaagaagt
     3841 ccatgaagct tggtcacgat gaggcggcga cagtgcccgt tcccgtgcca gcgccagtgc
     3901 cagcgccagt gccagttgcc cagaagtcta cgacagccac ccagaccgag gggaccaaga
     3961 agatgggaca caagatgtcc catagcagca gcaagctgac acctgagatt attgctggtc
     4021 ggccggcggt gataatactc aacgatgacc gcgactcggg agaggaggga ctcaacaacg
     4081 agttcatctt tggcgatttc aacgaggatg agctgaagct cttcgacgac aaaaaagagg
     4141 agaaggaaaa ggaggagaaa gaaattaaaa aagagaaaca aaaggagaat gagaagagtg
     4201 agaagaaggt gcaaccgaaa ccagagtcac agtcgaagcc acaggcagag ccaaagccga
     4261 aggtacagcc cacaaagtca caggcagagc cacagccaca gccaaagcca caggcagagc
     4321 cacagccacg gccaaagctg caggcagagc cacagccaaa gccaaagcca ctagccacac
     4381 agatggacaa gcaagagcca atagatatag acccggagga cgatgacgat cagattccag
     4441 aagcagaacc agaacaaatt ggagaggaca aagccgagaa gaaggaggca gaaaatacca
     4501 cagccacaca gtcggatgtg agctccagta gctatcagca gctgatactg aatgactcgg
     4561 gcgccgcctc cgatgtggtc aacagttcgg ccagtttgga catgctatcg atctcaggcg
     4621 aggccgtgca gtcgtcgtcg ccaaactctg cggccatttc cacaaactcg ggctcatcca
     4681 gcctaatcaa ttcgtcaaca tcctcaacac cttcgtccga ttcggcacaa aattccaact
     4741 cgtcgtcatc tgtttcgata tcgtcgtcct ctggtggcaa cgctcccttg ggcggcttgt
     4801 gctccggaca cggttcgggt gcggcctatg tggcgaacca gtccagcgat agtggcatct
     4861 atggtgccgc caacacgtcc gtaaaggacc acatcaacca gttccttagc cgggccagca
     4921 gcagtagtga tgagccgctg ccgaatgtcc ccatctccat gcaacagttg gaaacctgca
     4981 atgacattga ggcggccatc atagctgctg cccgggccgc ggccgccgca cgcagcaaca
     5041 actgcagccg cagaaattcc caggaactgg agccgccaca gtcacagtcc ctgtccaagt
     5101 cgcagcagca tcagagccca gtggccaaat cgaaaattgt gcccgtgtat acgacctaca
     5161 atgttggcgt acaggactat gacgaggact tggaggagct gagcttcctg gccgacctcc
     5221 gagatgcgga ggtagacgtc gagcaggagg cggaagtgaa agcggaggcc