PREDICTED: Drosophila obscura mucin-5AC (LOC111070715), mRNA.
LOCUS XM_041591973 2638 bp mRNA linear INV 14-MAY-2021
ACCESSION XM_041591973
VERSION XM_041591973.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq; includes ab initio.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
##RefSeq-Attributes-START##
ab initio :: 8% of CDS bases
##RefSeq-Attributes-END##
FEATURES Location/Qualifiers
source 1..2638
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..2638
/gene="LOC111070715"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 2 Proteins, and 92% coverage of
the annotated genomic feature by RNAseq alignments"
/db_xref="GeneID:111070715"
CDS 10..2259
/gene="LOC111070715"
/codon_start=1
/product="mucin-5AC"
/protein_id="XP_041447907.1"
/db_xref="GeneID:111070715"
/translation="MTTALCSIGLALCLLASIARGWDYSIPYDYAAPYYAAPYYAPPY
YAAPNYFAPPRYDISSCPPTGEESAVCAVAPGAQPATFPSPCEVDHFSRQTGVAWQII
YEDRCDKMEECPEVCNDEYSPVCATFQYSRRTFRNYCELLLVTCRTHHQWKLVHDRVC
DSRLPPLYSPRARTYNPYAMHAIQHSPSYGSYGSYRPQPLRRPQTSFVSPPPPYKAVA
YEPYEISWDNLPTEYEMKQTSPEAYVTTTQAPETTTQEAQTTETTQSTETTGAPAASG
RSAKVLSVNAVAMDNETTPGPSSTVAVTSEDTTETSRETSSTAAPTQYQPYPYYSPTS
TTEEASTTVKPQARAININAIAIEDETSTESSSTASSPSYETYPSFPVYNTTAPPEHA
TTTARPAARSIQINAVAETIDDKTTPEPSSTVDSSTEASSTVAPSSYETYPSYSASPS
YPSYSDSSAYPSYSPTATETTEEDSTTVKTVLKSLRINAVAETEEDVEKEETTPDPTT
EETSTVAPSNYKTYPSYSASPSYPSYSDSSAYPSYSPTATETTEEESTTVKTVLKSLR
INAVAETEEDVEKEETTPDPTTEETSTVAPSNYKTYPSYSASPSYPSYSDASAYPSYS
PTATETPEEDLLRRLPMVAETEEDVAEEETTADTTTEATSTAAPSSYETYPSYSASPS
YPSYSDSSAYPSYSPTATETPEEESTTVKTVLRSLKINAVAETEEDVAEEGNHRQAEL
YSRRGGGQY"
misc_feature 343..489
/gene="LOC111070715"
/note="Kazal type serine protease inhibitors; Region:
KAZAL; smart00280"
/db_xref="CDD:197624"
ORIGIN
1 gttcgcaaaa tgacaacagc cttgtgttcc atcggtctgg ccttgtgcct gctcgcctcc
61 atcgcccgtg gatgggacta ctcgatcccc tacgactacg ctgcacctta ctacgcggcc
121 ccctactacg cgccccccta ctacgcggct cccaattact ttgcaccgcc gcgttacgac
181 atatcgtcct gcccacccac cggagaggag agtgcggtct gtgccgtggc cccgggcgca
241 cagccagcga ccttccccag cccctgcgaa gtggatcact ttagccgcca gacgggcgta
301 gcatggcaga tcatctacga ggataggtgc gacaagatgg aggagtgccc ggaggtctgc
361 aatgatgaat atagtcccgt ctgcgcgaca ttccagtact ccagacgcac cttcaggaac
421 tactgcgagc tgctgctggt cacctgccgt acacatcatc agtggaaact tgtgcatgac
481 cgcgtctgtg acagcagatt gccacctttg tactcgccac gcgctcgaac ctacaatccc
541 tacgccatgc acgccatcca gcactcgccc tcgtatggat cctacggctc ctatcgacca
601 cagccattga gaaggccaca aacgtcgttt gtatccccgc caccgccgta caaggcggta
661 gcgtacgaac cctacgagat ctcttgggac aacctgccca ccgagtatga aatgaagcag
721 acctccccag aggcatatgt gacgactact caagcgccag agactactac ccaggaagcg
781 caaaccactg agacgacgca atccactgaa acaaccgggg ctccggctgc ctcagggaga
841 tctgcaaaag ttttgagcgt taatgctgtg gccatggata atgaaactac tccgggaccc
901 agctcgacgg tggcggtaac tagtgaggac accacagaaa catcgcggga aaccagttct
961 acagccgccc ctacacagta ccagccgtat ccatattata gtccgacctc tacaacagaa
1021 gaggcttcca ccacagtcaa gccccaagca cgagctataa atatcaatgc cattgcaata
1081 gaggatgaaa cttctacaga gtccagctcc acagcctctt caccgagcta cgagacatat
1141 ccatcctttc cagtatacaa cacgactgcc ccgccagaac acgctacaac aactgccaga
1201 cctgcggcac gatccataca gataaatgca gttgccgaaa ccatagacga taaaaccacc
1261 cccgaaccca gctctacagt cgattcatca acggaagcaa gctctacagt ggcaccttca
1321 agctatgaga cgtatccgtc atattcagcc tcaccatcct atccttccta ttcagactcc
1381 tcagcatatc cctcctatag cccgactgct actgagacca cagaggaaga ttctactact
1441 gtcaagaccg ttctaaaatc cctcagaata aatgcagtgg ctgagactga agaggatgtt
1501 gaaaaggaag aaacaacacc agatcccaca acggaagaaa cttctacagt ggcaccttcg
1561 aactataaga cttatccatc atactccgcc tcaccgtcct atccttccta ttcagactcc
1621 tcagcatatc cctcctatag cccgactgct actgagacca cagaggaaga gtctactact
1681 gtcaagaccg ttctaaaatc cctcagaata aatgcagtgg ctgagactga agaggatgtt
1741 gaaaaggaag aaacaacacc agatcccaca acggaagaaa cttctacagt ggcaccttcg
1801 aactataaga cttatccatc atactccgcc tcaccgtcct atccttccta ttcagacgcc
1861 tcagcatatc cctcctatag ccctactgct actgagaccc ccgaggaaga tctactacgg
1921 cgtctcccca tggtggctga gactgaagag gatgttgcag aggaggaaac caccgccgat
1981 acaacaacgg aagcaacttc tacagctgca ccttccagct atgagactta tccatcatac
2041 tccgcctcac cgtcctatcc ttcctactca gactcctcag catatccctc ctatagcccg
2101 actgctactg agaccccaga ggaagagtct actactgtca agaccgttct aagatccctc
2161 aaaataaatg cagtggctga gactgaagag gatgttgcag aggagggaaa ccaccgccaa
2221 gccgagctct acagtcgcag aggcggaggc cagtactgag aagagccggt cctacccatc
2281 ttatacaacc cctccagtca atagctctac cgatgcgccg aagattacct ctacatcgga
2341 ttcactatca tcgtttaggg tgaacgcggt ggctgtaagc gaggaccagg cacctgtaca
2401 ggtgtataat gcgacggcat cgcaggggtc actgtcatcc ggaaacgctc cgaataacac
2461 aatcgtggac tatgtgtctg gaaaagatgg cgagaacaag cagcacattc gggtgtacaa
2521 cactggcagc cagggtaatg tcattatatt taacattaat aataactaat cgagtatttt
2581 gaatttcatc aaagaactga attagataat ccgttttata agtcgattaa attgttaa