LOCUS XM_041591978 7215 bp mRNA linear INV 14-MAY-2021
DEFINITION PREDICTED: Drosophila obscura uncharacterized LOC111074799
(LOC111074799), mRNA.
ACCESSION XM_041591978
VERSION XM_041591978.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq; includes ab initio.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
##RefSeq-Attributes-START##
ab initio :: 1% of CDS bases
##RefSeq-Attributes-END##
FEATURES Location/Qualifiers
source 1..7215
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..7215
/gene="LOC111074799"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 2 Proteins, and 96% coverage of
the annotated genomic feature by RNAseq alignments"
/db_xref="GeneID:111074799"
CDS 75..6944
/gene="LOC111074799"
/codon_start=1
/product="uncharacterized protein LOC111074799"
/protein_id="XP_041447912.1"
/db_xref="GeneID:111074799"
/translation="MGETFYGNYARTGQEQLTESANMLHNNAPWLPPHQSPQQQHQQQ
HHPQQQHAQLQQQQQQQQQQQQQPVYASQMFHHQAPPPLGQQQPQYWPEEQQQQQHQQ
QQPSLNYNNYFAVQDGGQQVPPPMPLPQPELQQQPPQMYYQPPPQTTTPQHQMHPEIP
LDSFDNSSGSINNNNTNNTGSRSDGWGDWGDWNETSNNNNNNSQSQGHSNEAVMEPPP
SIVEDAFNIQGSQGNSWQAFANNNNNLELPQSAERQPQQPQHPLQPLQQHLQQLHLQQ
PPPEVGSEPEVDAIVPPRAFQNQPPAAGCAPVSGGAEVAPVAGGLVAMAPPSALPPII
ASPGGNPFKRSTGPSKRVNILAATQGAAPAPAAAPTPAPPQPATTLAPIPIAPVAAPS
AEAIFGLAAEPQPEHFNILTPPAVIPAPVVPGAIPAAASSIYGLPAPSQPQQQQQQLL
IPPPLVHESSPLEMGAYENMEVLAAPNDERAQYLQTSHLSEQPEEGEANWEADTDTGL
LPPPGLSRLVLGQPELEQQRLVTGTEQPPATAMNVVQALHMEERQADGEDTSEGEQPP
RVDDRNLYQTPPRRVVTGVETTAVSVREQREVVLDGENLEDREEPPPALAGVPAVVPE
LPPQLQQHHSLPDEAEDMHHNPPSQSAPVAASTAAAASHQQPHPHSQSHQQPLQQQQQ
QQQQQQRLEKKRPQAGQRMSRTGNASLDLESEESDEFLLSERERDRERDRRDLPDDRR
GRRGHYNNYDGETEDSVRGAGPAGSGESKPVCDSHRRSHEGNSQQHGRRRNPDTDLER
ERERDRDRERNWRRRSNKYHSGGEDQERYEHSRRYNNSNYDGESDGPDSSHMGADSEL
LLDGGSVGPGAGPSSIGGRSSKNGRERQRRSGAAPEDDYDYDDYERERGQYSRRSIKQ
SEKGRPSGGRRDRHEATTGDQRPQRRSNDDGTLERRRRHPHADQRSKRRNYAPYAGAG
YDPYGMYEQMSRNPQAYADMYAKFYGQMINSMTAAVAPYKAAAGAGAIGGVPVQQQHQ
QQQQMMTAGVQMMDVMRTTGKEGDLLTERERYTHAYINQANEFHRQQYNEGLYQQKLQ
QLQQQQQQQLQQLHQQQQELLNRSMADMEDGSASFYGGESFSSRRAIYGHHGLASARS
LSNLNGGDLNGTGRSSRCSGMYYAGSECGLDIRGADVAAGGSVTTTSVARPPRRRTPL
MYNRPHLVASYAMSLLLKVKPKYAGRGRLRNDVEVAAPRFRDGTSSLLRMYPGPLQGR
KLHKDKIISFCKEQIRLGPTRGCTMLYATQKKPQGTVEKYRASHALMWNLLILLLRQN
GYIADTDVGDLLLENQQEYPYNPAELDAASETDPEADPEPSDPAEPAVDSEAESETAA
TRAASQSSDEGQTAAPGAGADAGASGKAATLSEQEATDKFRSYVLRGNVEEALQWATD
NNLWTHAFFLALYEDRYALIDVAQKFLNRAIKANDPLQTLYQMKSCHTPACVSQLRDE
QWGDWRSHLSILVTNKSRQPEYDRSSVVALGDTLFQRGDIYAAHFCYLVAQEEFGRYD
SSATQLTTLTANVPRLILLGASHYKHFNEFASNEAIIMTEIYEYARSLFDQKFSIGNF
QHYKFLLATRILDYGQHFRCTNYLEQIAKHIELKPDSYDSDFIQRVCGLAERLRYHDP
ILINRVSFASPPIGNNSSTSSSSKDPAVPEDKAWLRQLRIMADVQQQQQQQQQQQQQE
LQQQQQNDIDQQFAEVNQQFRELNMQYDGGNLEDTLHLTKEQPPVPDVHQQQLPEQHM
QQQYYEPTPQTQMQLQPQVQHQEELQTDAYGQQTQQLQQQQKPPQMYYDPNPATAQHY
EQQQHIAASYGSHIEPAAEAYSAGGQTVDQAAAAAAASGYGYDYWSGTQQPPYGDEQQ
LQHMQQQKLKQQPNYRSGNSNNLNNNSLKGSKTAATKSGLEMERTKTLLRQMAASPAT
KPQTAATTRATKAEPATRAAEGAIEAAEASPQILATTMTTPAPAATKAFNLNDLQHQH
QQGPQGRPAISMPKSKSYGDEDDGAAAGSPAQAAASRSKPGAGAKQGSSGEIGAPGNQ
NAGWFGGLWNKLSLKPKNQMILPDDKNPTIVWDKERKCWTNTEGNVDEAESFKPPPKM
SDMGMGMGMGMPLGSPPPNMLGSMPTPHLLGGHEAVAAAPQQPQMYGNPHDYAAAAPT
PELYPATVPSPAPAPAIPPPAPASASVPAPGGAQPKLQSNMFKMQRNRTLKNSYVDVF
NPSGAPMSAPSENVLAPIMAPAALPQGGYFVPGGASASHQQ"
misc_feature 3837..5075
/gene="LOC111074799"
/note="Ancestral coatomer element 1 (ACE1) of COPII coat
complex assembly protein Sec16; Region: ACE1-Sec16-like;
cd09233"
/db_xref="CDD:187750"
misc_feature order(4311..4316,4320..4325,4329..4343,4347..4349,
4353..4355,4383..4385,4389..4391,4395..4400,4404..4409,
4419..4421,4437..4439,4458..4460,4488..4490,4503..4505,
4512..4514,4596..4601)
/gene="LOC111074799"
/note="homodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:187750"
misc_feature order(4842..4844,4908..4913,4917..4922,4929..4934,
4941..4943,5022..5027,5034..5039,5046..5048,5058..5060)
/gene="LOC111074799"
/note="heterodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:187750"
ORIGIN
1 cagagggatt agtggctggc aggggggacg aggatttaac agaggtgtgt gtgtcgtcca
61 ctgcggtgga gcaaatgggt gaaacattct atgggaatta tgcaagaact gggcaggaac
121 agctaaccga gagcgcgaac atgctgcaca acaatgctcc ctggctgcca ccccatcagt
181 cgccacagca gcaacaccag cagcaacacc atccgcagca gcagcatgca cagctgcagc
241 agcagcaaca gcagcagcag cagcagcagc agcagcctgt ctatgccagc caaatgttcc
301 atcatcaggc accaccacca cttggccagc agcagccaca atactggcca gaggagcagc
361 agcagcagca gcaccagcag cagcagccgt cgctaaatta caacaactac tttgcagtcc
421 aagatggggg tcagcaggtg ccaccgccta tgcctctgcc tcagccagag ctacagcaac
481 agcctcctca aatgtattat cagcctccac ctcaaacaac caccccgcaa catcagatgc
541 atcccgaaat ccctttggat tcctttgaca acagcagcgg cagcatcaat aataacaaca
601 ccaacaacac cggcagcagg agtgatggct ggggggattg gggcgattgg aatgagacga
661 gcaacaacaa taacaacaat agccagagcc agggccacag caacgaggcc gtgatggagc
721 caccgccaag catagtggag gatgctttca atattcaggg gtcgcagggc aacagctggc
781 aggcctttgc caacaataac aacaatttgg aacttccgca atccgcggaa cgccagccgc
841 agcaaccgca gcacccgttg cagccgctgc agcaacacct ccaacaactc catttgcaac
901 aaccgccgcc agaggtaggc tcggaacccg aagtggatgc gattgtgccg ccgcgagcct
961 ttcagaatca accacctgcg gcaggatgcg cacctgtctc cggaggggca gaagtggcgc
1021 cagtagccgg aggattggtg gcaatggccc caccatctgc cctgccaccg atcattgcca
1081 gtccgggtgg caaccccttt aaacgttcca ctggacccag caaacgtgtc aacatactgg
1141 ccgccacaca aggagctgct cctgctcctg ctgcagcacc tactcctgct cctccgcagc
1201 ctgccacgac actcgcacca ataccaatag ctcctgtggc agcaccatct gctgaggcaa
1261 tctttggtct ggccgcagag ccacagccgg aacatttcaa tatattgaca ccgccggcag
1321 tgattcctgc acctgttgtt ccgggagcga ttcccgcagc tgcatcctcc atttatggac
1381 tacccgcacc ctcacagcct cagcagcaac agcagcagct gctgattcca ccaccactcg
1441 tccacgagtc gtcaccattg gagatggggg cctatgagaa catggaggtg ctggccgcac
1501 ccaacgacga gagggcccag tatctgcaga ccagtcacct ctcggagcag cccgaggagg
1561 gggaggccaa ctgggaggcg gacacagaca cgggcctgct cccgccaccc ggtctgtcgc
1621 gtctggttct gggccagccg gagttggagc agcagcgact cgtcacgggc accgaacagc
1681 caccggccac ggccatgaat gttgtccagg ccctccacat ggaggagcgt caggccgatg
1741 gagaggacac ctccgaggga gagcagccgc cccgtgtcga cgatcgcaat ctgtatcaga
1801 caccgccacg tcgtgtggtt acgggtgtgg agacgacggc cgtgtctgtg cgggagcagc
1861 gtgaggttgt cctggatggc gagaatctag aggatcggga ggagccacca ccagccctgg
1921 ccggagttcc ggctgtcgtt ccggagctgc cgccacagct acagcaacat cacagcctgc
1981 ccgacgaggc cgaggacatg catcacaatc ccccatcaca gtcggctcca gtggcagcct
2041 ctaccgccgc cgccgcctct caccagcagc cgcacccaca ctcacaatca caccagcagc
2101 ccctgcagca gcagcagcag caacaacagc agcagcagag attggagaag aagcgtcccc
2161 aagccggcca gcgcatgtcg cggacaggaa acgcctcctt ggacctggag tccgaggagt
2221 cggatgagtt tctgctcagc gagcgagaac gcgacaggga gcgcgataga cgggacctgc
2281 ccgatgatcg gcgcggacgt cgtggccact acaacaacta cgacggcgag acggaggact
2341 ctgtgcgggg cgccggccca gccggatctg gggagagcaa accggtgtgc gatagccatc
2401 gccgcagcca cgaaggcaac agccagcagc atggacgacg ccgcaatccc gacacagacc
2461 tggaacggga gcgggaacgc gatcgggatc gggagcgcaa ctggcgccgg cgatcgaaca
2521 agtatcacag cggcggagag gatcaggagc gttacgagca ctcccgccgc tacaacaata
2581 gcaactacga cggagagtcc gatggtcccg actctagcca catgggcgcc gacagcgagt
2641 tgctgctaga tggaggcagc gtcggccctg gggccggacc cagcagcatt ggtggccgca
2701 gcagcaagaa cggcagagag cgccagaggc gcagtggtgc tgcacccgag gatgactacg
2761 actatgatga ctatgagcga gagcgcggcc aatactctcg acgctccatc aagcagtcgg
2821 agaagggccg gcctagtggt gggcggcgcg atcgacatga ggccaccact ggtgatcagc
2881 ggccgcagcg gcgcagcaac gatgatggca ccctggagcg gcgccgtcgt catccgcatg
2941 cggatcaacg ctccaagcgg cgcaactacg ctccgtacgc gggtgctggc tatgacccgt
3001 acggaatgta cgagcagatg tcgaggaatc cccaagccta tgcggacatg tacgctaagt
3061 tctatggcca aatgatcaac tcgatgacgg cagcggtggc tccgtacaag gcggcggctg
3121 gtgcgggtgc cattggtggt gttccggtac agcagcagca ccagcagcag cagcagatga
3181 tgactgcagg agtccaaatg atggatgtca tgcggacgac tggtaaggag ggtgacctgc
3241 ttacagaacg agaacggtac acacacgcat acataaatca agcaaacgaa ttccatcgtc
3301 agcaatataa tgaggggctc taccagcaaa agctccagca gctacagcaa caacagcaac
3361 agcagctcca gcagctacat cagcagcaac aggagctcct taaccgcagc atggcggaca
3421 tggaggatgg cagtgccagc ttctatggcg gtgaatcgtt tagcagccgg cgagctatct
3481 atgggcatca tgggctggcc agtgcccgat cgctgagcaa tctgaatggt ggggatctga
3541 atggaacagg acgaagttca cgttgcagcg gcatgtatta tgctggctct gaatgcggcc
3601 tcgatatcag aggtgccgat gtggctgcag gtggcagtgt aaccaccaca tcggtggccc
3661 gtccacccag gagacgcacc ccactgatgt acaaccgacc gcatttggtg gcctcgtacg
3721 cgatgagcct gctgctgaag gtgaagccaa agtacgccgg acgcggccgc ctgcgcaacg
3781 atgtggaggt ggcggcaccg cgcttccggg acggcacaag cagcctgctg cgcatgtatc
3841 cgggacccct gcagggccgc aagttgcaca aggacaagat catcagtttc tgcaaggaac
3901 agatacgcct cggccccacc cgcggctgta cgatgctgta tgccacgcag aagaagcccc
3961 agggcaccgt ggagaagtac cgggcctcgc acgccctcat gtggaacctg ctcatcctgc
4021 tgctgcgcca gaacgggtac attgccgaca cggatgtggg cgatctgctg ctggagaacc
4081 agcaggagta tccctacaat ccggccgagc tggatgcggc gtccgaaacg gatcccgagg
4141 ctgatccaga gccatcggat ccagcggagc cggctgtgga ttcggaagct gaaagcgaaa
4201 cggccgccac cagggccgcc agtcaatcct cggacgaggg ccagactgcc gctccagggg
4261 ctggcgctga tgctggcgcc tcggggaaag cagcgacgct atcggagcag gaggccacgg
4321 acaagttccg tagctatgtg ttgcgcggca atgtggagga ggccctgcag tgggccaccg
4381 acaataacct gtggacgcac gccttcttcc tggctctgta cgaggaccgc tatgccctga
4441 tagatgtggc ccagaagttc ctcaatcgcg ccatcaaggc caacgatccg ctgcagacgc
4501 tctaccagat gaagagctgc cacaccccgg cctgcgttag ccagctgcgg gacgagcagt
4561 ggggcgactg gcgctcccat ctctccattc tcgtgacgaa caagtcgcgc cagccggagt
4621 acgatcgcag ctcggtggtg gccctgggcg acacgctctt ccagcgcggt gacatctatg
4681 cggcccactt ctgctatctg gttgcccagg aggagtttgg acgctacgac agctcggcca
4741 cccagctgac gacgcttacg gccaatgttc ccagactcat cctgctgggc gcctcacact
4801 acaagcactt caacgagttt gccagcaacg aggccatcat catgacggag atctacgagt
4861 acgcccgctc gctgttcgac caaaagttca gcatcggcaa cttccagcac tacaagttcc
4921 tgctggccac gcgcatcctc gactatggac agcatttccg ctgcaccaac tacctggagc
4981 agattgccaa acacattgaa ctcaagccag acagctacga cagtgatttt attcagcgcg
5041 tatgcggcct ggccgagcgt ttgcgctacc acgatcccat cctgatcaat cgggtgtcgt
5101 ttgccagtcc gccgattggc aacaacagta gcaccagcag cagcagcaag gaccccgccg
5161 tgcccgagga caaggcctgg ctgcgtcagc tgcggatcat ggccgatgtg caacagcagc
5221 agcagcagca acagcaacag caacaacagg agctgcagca gcagcaacag aacgatatcg
5281 atcagcagtt tgcggaggtg aaccaacagt tccgggagct taatatgcaa tatgatggcg
5341 gcaatttgga ggatacccta catctaacca aggagcagcc acctgtgcct gatgtccacc
5401 agcaacagct gccagagcag catatgcagc agcaatacta cgagccaacg cctcagacgc
5461 aaatgcagct acagcctcag gtacagcatc aggaggagct acaaactgat gcgtatggcc
5521 agcagaccca acagttacaa cagcagcaaa aaccgccaca gatgtactat gatcccaatc
5581 cagcaacagc acagcactac gaacagcagc aacatattgc tgcctcatat ggcagccaca
5641 tagaaccagc cgccgaggca tattccgctg gcggccaaac agtagatcaa gcggcagcag
5701 cagcagcagc ttctggctat ggctacgact actggtcggg cacacagcag ccgccctacg
5761 gcgatgagca gcagctgcag cacatgcaac agcagaagct aaagcaacag ccaaactatc
5821 gcagtggcaa cagcaacaac ctaaacaaca attccttgaa aggctcaaag acagcagcca
5881 caaagtctgg cctggagatg gagaggacaa agacactact ccgccaaatg gcagcatccc
5941 ccgctactaa gccacagaca gcagccacaa cacgagccac aaaagcagag ccagcaacac
6001 gagcagcaga aggagcaatt gaggcagcag aagcttcacc acaaatatta gccacaacga
6061 tgacaacgcc agcgccagca gcaacaaagg cattcaattt gaatgatctg cagcatcagc
6121 atcaacaggg gccacagggg cgaccagcaa tctccatgcc aaagtccaag agctacggcg
6181 atgaggacga tggtgcggct gcgggttcac cagcccaagc agcggccagc agaagcaagc
6241 caggagcagg agccaaacag ggatcgagtg gagaaattgg tgcacctggc aatcagaatg
6301 ccggctggtt cgggggattg tggaacaagt tgtcgctgaa accgaagaac caaatgattc
6361 tgccggacga caagaatccc accattgtgt gggacaagga gcgcaagtgc tggaccaaca
6421 ccgagggcaa tgtggatgag gctgaaagct tcaagccgcc gccaaagatg agtgatatgg
6481 gcatgggcat gggcatgggc atgcccttgg gatcaccacc tccaaatatg ctgggcagca
6541 tgcccacacc tcatctgttg ggtggccacg aggctgtggc agcggcgcca cagcagccac
6601 aaatgtatgg gaatccgcat gactatgccg ctgccgcccc aacgcccgag ctgtatccag
6661 caacggtgcc atcgccagcc ccagccccag cgatacctcc accagcacca gcatcagcat
6721 cagtaccagc tcctggtggc gcccagccca agctgcagtc gaacatgttt aaaatgcagc
6781 gcaatcgcac tctaaagaac tcatatgtgg atgtgttcaa tccttcgggt gcgcccatgt
6841 cagcgccatc cgagaatgtc ctggccccca taatggcgcc ggctgcactg ccccaaggtg
6901 gctactttgt acccggcggc gcgtctgcat cgcatcagca gtaggcccac cgctactagc
6961 cccagccaaa accctctaat gccagcgaaa tctgaacttc gagctacgta aaaactacca
7021 gacactagac cagctgtgcg cccacacact cacacacaca cacacacaca cacataacca
7081 gaaaccagac gagaccatac acacgcacac acccattgat cgtgccaccc tgtcactcgc
7141 ctgcctgata catgaagcat agcatgcaac cacacaccac agtggtgaca cagggacggg
7201 attgggtggg gggcc