LOCUS XM_041591990 5270 bp mRNA linear INV 14-MAY-2021
DEFINITION PREDICTED: Drosophila obscura mucin-19 (LOC111066368), partial
mRNA.
ACCESSION XM_041591990
VERSION XM_041591990.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq; corrected model.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
##RefSeq-Attributes-START##
frameshifts :: corrected 1 indel
##RefSeq-Attributes-END##
COMPLETENESS: incomplete on the 3' end.
PRIMARY REFSEQ_SPAN PRIMARY_IDENTIFIER PRIMARY_SPAN COMP
1-119 JAECWW010000165.1 3052859-3052977
120-248 JAECWW010000165.1 3055772-3055900
249-369 JAECWW010000165.1 3055994-3056114
370-968 JAECWW010000165.1 3056145-3056743
969-1445 JAECWW010000165.1 3057645-3058121
1446-4727 JAECWW010000165.1 3058196-3061477
4728-5270 JAECWW010000165.1 3061479-3062021
FEATURES Location/Qualifiers
source 1..5270
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..>5270
/gene="LOC111066368"
/note="The sequence of the model RefSeq transcript was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon;
Derived by automated computational analysis using gene
prediction method: Gnomon. Supporting evidence includes
similarity to: 4 Proteins, and 99% coverage of the
annotated genomic feature by RNAseq alignments, including
17 samples with support for all annotated introns"
/db_xref="GeneID:111066368"
CDS 51..>5270
/gene="LOC111066368"
/note="The sequence of the model RefSeq protein was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon"
/codon_start=1
/product="LOW QUALITY PROTEIN: mucin-19"
/protein_id="XP_041447924.1"
/db_xref="GeneID:111066368"
/translation="MGWERRVKPRMIRMLGKRQKNKKTKYDAWEDSDFNDLDDASLKK
LLEEAYWYRNPGDRKNKSERFLQMLKKAEYDEEISYRAIKSCLTLNPITLATSASTSG
AAGIGSNSNNNTHHTNNRSGERHKQGGSLQDLVEAAHRSGSATSSAAAAAAAASATSA
TTQRFNCDYQLSGRANRRHQQQQQQQHNNNQNAPSASKKIKNWSASGRQREGGSLPSS
VNFESAATSMDAKLKQTMAESSTTSSQSQASTGKISAGSCSSLPSHLGETEAGTQFDM
DEDETGGGAASGSAGAGPGPGHIQQEYLLELSGDNRVIKKKKPGANQSMSYLSTKVLD
NIMQNYSPQQPESKDSAIGSEYQIGETMPFLGPVGVRPPGTPHIHLHQQHVQLQQQYP
TGAHFERGADDPTRKVDAGYVSLSDSYPACSSVYADTSTVSHANSRATTSSGDVFGKK
GGRAGPISTGAKSTSRNLDENGNALSDGYGTNNTPSSSSNNGNGNGGGGSSSNQFTAL
ITTTMGANAPASSTNRQNSVSTLLTTGSNTTTTAMAAAAVAASFNSQLQQVTVGNGTA
VAAGAAAGVGAGAVVSGASAGTGGQPTKTKSKKRSQQERNTTTVDAESVAGYRGKESV
EQLVKYIENDGNSGGQKKKERKKQNQKLKKSNSLEELRSCSKMEADDLKRESATTEMM
PQKKDANNGNASGSGKHNSASVADISKNCSKEQQQATTVQVQVRGAAAAQQRKGERRS
WGTEELQYLGDQDYRKPGLGKAAARSEPDIEAEPMPLSLPALSCMSEMDALNTVLSET
AEFHVVTKKKKPKKQRAVTMDDAAVAAAAHAGGNLQRMQQITKSASSNMMSQRAHYYT
AYTNSGGSSSTGSSNHHGQYPSKSAQQQQPQYQETQPQHHHHHHHHYHHHHHHGGSGS
AKKDSSRRKSTSSMPPSEKSDSSDLDSVHSLPIQTVKKKSLGFSSSGGSSSGGAATSN
KERAAQAATQRQVKKKTLTPAPISYADIARNKREALKNASGGIGNGSANASDPEPTEA
VAGGAAADKPTTGGKGSKSKVKPDFPELPVALSIPVAISTGTSTSTSTNTASNQVASS
SSNNSSSAGSISYSKSLNATPASSTSDVDASPELTPTCTTAPSSSLSSSQPSLQKSKS
VEHDATYSFNSSNLDQQYPALEKTVKRHSTTNVSLTAAACPSAGSGSGSGSAAGTATS
SIFNFAAATKLQMVDKSTGTTSLQTTSAKSKAKSKDLSYSSASSTSSSSGSSSGSSSK
KSMKLGHDEAATVPVPVPAPVPAPVPVAQKSTTATQTEGTKKMGHKMSHSSSKLTPEI
IAGRPAVIILNDDRDSGEEGLNNEFIFGDFNEDELKLFDDKKEEKEKEEKEIKKEKQK
ENEKSEKKVQPKPESQSKPQAEPKPKVQPTKSQAEPQPQPKPQAEPQPRPKLQAEPQP
KPKPLATQMDKQEPIDIDPEDDDDQIPEAEPEQIGEDKAEKKEAENTTATQSDVSSSS
YQQLILNDSGAASDVVNSSASLDMLSISGEAVQSSSPNSAAISTNSGSSSLINSSTSS
TPSSDSAQNSNSSSSVSISSSSGGNAPLGGLCSGHGSGAAYVANQSSDSGIYGAANTS
VKDHINQFLSRASSSSDEPLPNVPISMQQLETCNDIEAAIIAAARAAAAARSNNCSRR
NSQELEPPQSQSLSKSQQHQSPVAKSKIVPVYTTYNVGVQDYDEDLEELSFLADLRDA
EVDVEQEAEVKAEA"
misc_feature <4209..>4541
/gene="LOC111066368"
/note="transport protein TonB; Provisional; Region:
PRK10819"
/db_xref="CDD:236768"
ORIGIN
1 tcttgacgat acgtcaaagc tggaaggacc cacacgcagg tgcctgcagt atggggtggg
61 agcgaagggt aaaacccagg atgatcagaa tgttgggaaa gcgacagaaa aataaaaaga
121 caaaatacga tgcctgggag gatagtgact tcaatgatct cgatgatgcg tcattgaaga
181 agctactcga ggaggcctat tggtaccgta atcctggcga caggaaaaac aaaagcgagc
241 gctttctgca aatgctcaaa aaggctgagt acgacgaaga gatctcttat cgtgccatca
301 aatcctgcct cacactcaac ccaatcacgc tagcgacgtc agcgtccaca tccggtgcag
361 ccggcattgg cagcaacagc aacaacaaca cccaccacac aaacaatcga tcgggggagc
421 gccacaagca gggcggctcc ttgcaggatc ttgttgaggc ggctcatcga agtgggtcgg
481 ccaccagctc tgcagcggca gcagcagcag cggcgtcagc cacatcggcg acaacccaac
541 gtttcaactg tgattaccag ctgagtggac gcgcgaaccg tcgccaccag caacagcaac
601 agcagcagca caacaacaac caaaacgcac cctctgcctc caagaagatc aagaactggt
661 ccgcgtcggg acggcagcgc gagggaggaa gccttccgag tagcgtcaac tttgagtcag
721 ccgccacgtc catggatgcc aagctgaaac agaccatggc cgagtcatcg acgacatcct
781 cccagtcaca ggcttcaact ggcaagatca gcgccggcag ctgcagcagc ctgcccagcc
841 acctgggcga gactgaggcc ggcacccagt tcgatatgga cgaggatgag acgggcggtg
901 gtgctgcgag tgggagtgct ggcgctggcc ctggccctgg ccacatccaa caagaatatc
961 tgttggagct ttcgggtgac aatcgcgtaa tcaaaaagaa gaaaccaggc gccaatcagt
1021 caatgtcgta tctcagcacc aaagtgctcg ataatatcat gcaaaactat agcccgcagc
1081 agcccgaatc gaaggactcg gccattggca gtgagtatca aattggcgag acaatgccgt
1141 ttttggggcc cgttggcgtt aggccacccg gcacgcccca catacatcta caccagcaac
1201 acgtacagct gcagcaacaa tatccgacgg gtgcgcactt tgaacggggt gcagacgatc
1261 ccacacgcaa agtggacgcc ggctatgtat cgctgagcga cagctatccg gcctgctcat
1321 cggtctatgc cgatacatcg acggtgtccc atgcaaacag cagagccacc acctcttcgg
1381 gggatgtgtt tggaaagaag ggcggacgtg cgggtccgat ttccacgggt gccaagtcca
1441 catcgcgcaa ccttgatgag aacggcaacg ctctgagcga tggctatggc accaataaca
1501 cgcccagcag cagcagcaac aatggaaatg gcaacggcgg tggcggcagc agctccaatc
1561 agtttacggc cctgatcact accacgatgg gtgccaatgc accagcatcg tccacaaacc
1621 gacagaacag cgtgtccacg ctgctcacaa ccggaagcaa caccaccacc acggcaatgg
1681 cggccgctgc agtggctgcc tccttcaata gccagttaca gcaggttacc gttggcaacg
1741 gcaccgcagt cgcagctgga gctgcagcgg gagtgggtgc cggagcagtt gtaagtggcg
1801 ccagtgccgg cactggggga cagcccacca agaccaagtc aaagaagagg tcgcagcagg
1861 agcgcaacac aaccactgtc gatgctgagt cggtggccgg gtatcgcggc aaggagtcgg
1921 tggagcagct ggtcaagtac attgagaacg atggcaacag cggcgggcag aaaaagaagg
1981 agcgcaagaa gcagaaccag aagctgaaga agagcaactc gctggaggag ctgcgcagct
2041 gttccaaaat ggaggcggac gacctgaagc gcgagtcggc caccaccgag atgatgcccc
2101 agaagaagga cgcaaacaat gggaatgcca gtggcagtgg caagcacaac tctgcttctg
2161 tggcggacat aagcaagaac tgcagcaagg agcagcagca ggccacaact gtccaggtgc
2221 aagtgcgtgg agcagccgct gcccagcagc ggaaaggcga acgccgctcc tggggcaccg
2281 aggagctaca gtatctgggg gatcaggact accgcaagcc cggcttgggt aaggcggcgg
2341 cccgctctga gcccgatatc gaggcggaac cgatgccact ctccctgcca gccctctcct
2401 gcatgtcgga aatggatgcc ctgaacaccg tcctctcgga gacagccgaa ttccatgtgg
2461 tgacaaagaa gaagaaacca aagaagcagc gagccgtcac catggacgat gcggccgttg
2521 cggccgctgc ccatgcgggt ggcaatctgc agcgcatgca gcagatcacc aaatcggcct
2581 cctccaacat gatgtcgcag cgggcgcact actacacggc gtacaccaac agcggcggca
2641 gcagcagcac cggctccagc aaccatcacg gccaatatcc gagcaagtcg gcacagcagc
2701 agcagccgca gtaccaggag acacagccac agcatcacca tcaccaccat catcattacc
2761 atcatcatca ccatcatggt gggtcgggat cggccaaaaa ggacagctcg cgacgtaagt
2821 ccacatcctc gatgccgccc tcggagaaat ccgattctag tgatctcgac tcggtccact
2881 cgttgcccat tcagacggtc aagaagaaga gcctcggctt tagcagcagt ggcggcagca
2941 gcagcggcgg cgccgccacc agcaacaagg aacgggcagc gcaggccgcc acccagcgtc
3001 aggttaagaa aaagacactg acaccagctc ccatttccta tgcggacata gcccgcaaca
3061 agcgggaggc cctgaagaat gccagtggcg gcatcggcaa tggcagcgca aatgccagcg
3121 acccagagcc aacggaagca gtagctggcg gagcagcggc tgataagccc accaccggcg
3181 gcaagggcag caaatctaag gtcaaacccg atttcccaga gctgccagta gctctatcca
3241 taccggtagc catcagcacc ggcaccagca ctagcaccag caccaatacg gccagcaatc
3301 aggtggccag tagttccagc aacaattcgt cgtcggcggg atcgatcagc tactcgaaga
3361 gtctgaatgc gacgcctgcc agcagcacca gcgacgttga tgcctcgcca gagctcacac
3421 ccacatgcac gactgcgccc agcagcagcc tgtcctcgtc acagccctcg ttgcaaaagt
3481 ctaagagtgt ggagcacgat gccacctata gcttcaacag cagcaacctc gatcagcagt
3541 atccagccct ggagaagact gtgaagaggc atagcacaac caatgtttcc ctgacagctg
3601 ccgcctgtcc atccgctggc tctggctcgg gctccggatc ggctgcgggc acggcaacgt
3661 cttcgatttt caattttgct gccgcaacta aattacaaat ggtagacaaa tcgacgggaa
3721 ccacatcatt acagactacc tcagccaagt ccaaggccaa gtcaaaggat ctgtcataca
3781 gcagcgcgag cagcaccagc agcagctcgg gcagtagctc gggcagcagc agcaagaagt
3841 ccatgaagct tggtcacgat gaggcggcga cagtgcccgt tcccgtgcca gcgccagtgc
3901 cagcgccagt gccagttgcc cagaagtcta cgacagccac ccagaccgag gggaccaaga
3961 agatgggaca caagatgtcc catagcagca gcaagctgac acctgagatt attgctggtc
4021 ggccggcggt gataatactc aacgatgacc gcgactcggg agaggaggga ctcaacaacg
4081 agttcatctt tggcgatttc aacgaggatg agctgaagct cttcgacgac aaaaaagagg
4141 agaaggaaaa ggaggagaaa gaaattaaaa aagagaaaca aaaggagaat gagaagagtg
4201 agaagaaggt gcaaccgaaa ccagagtcac agtcgaagcc acaggcagag ccaaagccga
4261 aggtacagcc cacaaagtca caggcagagc cacagccaca gccaaagcca caggcagagc
4321 cacagccacg gccaaagctg caggcagagc cacagccaaa gccaaagcca ctagccacac
4381 agatggacaa gcaagagcca atagatatag acccggagga cgatgacgat cagattccag
4441 aagcagaacc agaacaaatt ggagaggaca aagccgagaa gaaggaggca gaaaatacca
4501 cagccacaca gtcggatgtg agctccagta gctatcagca gctgatactg aatgactcgg
4561 gcgccgcctc cgatgtggtc aacagttcgg ccagtttgga catgctatcg atctcaggcg
4621 aggccgtgca gtcgtcgtcg ccaaactctg cggccatttc cacaaactcg ggctcatcca
4681 gcctaatcaa ttcgtcaaca tcctcaacac cttcgtccga ttcggcacaa aattccaact
4741 cgtcgtcatc tgtttcgata tcgtcgtcct ctggtggcaa cgctcccttg ggcggcttgt
4801 gctccggaca cggttcgggt gcggcctatg tggcgaacca gtccagcgat agtggcatct
4861 atggtgccgc caacacgtcc gtaaaggacc acatcaacca gttccttagc cgggccagca
4921 gcagtagtga tgagccgctg ccgaatgtcc ccatctccat gcaacagttg gaaacctgca
4981 atgacattga ggcggccatc atagctgctg cccgggccgc ggccgccgca cgcagcaaca
5041 actgcagccg cagaaattcc caggaactgg agccgccaca gtcacagtcc ctgtccaagt
5101 cgcagcagca tcagagccca gtggccaaat cgaaaattgt gcccgtgtat acgacctaca
5161 atgttggcgt acaggactat gacgaggact tggaggagct gagcttcctg gccgacctcc
5221 gagatgcgga ggtagacgtc gagcaggagg cggaagtgaa agcggaggcc