PREDICTED: Drosophila obscura uncharacterized LOC111072395
LOCUS XM_022364241 2785 bp mRNA linear INV 14-MAY-2021
(LOC111072395), transcript variant X3, mRNA.
ACCESSION XM_022364241
VERSION XM_022364241.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022364241.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..2785
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..2785
/gene="LOC111072395"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 2 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 11 samples with support for all annotated
introns"
/db_xref="GeneID:111072395"
CDS 412..2106
/gene="LOC111072395"
/codon_start=1
/product="uncharacterized protein LOC111072395"
/protein_id="XP_022219933.2"
/db_xref="GeneID:111072395"
/translation="METKNNPPPVTTSNYHHQRAPRTPESYFNVPESVALLNIVKSER
IQSAFQSNRKNHASVWEMVAEVLNRFSARKRTAKQCCNRYENLKKIYTQLKKNPERHV
RRNWPYMFLFKEIEEQRGECWGSNGKRLALISKNHNELSYYQRRRQAAELGVLLNKDN
LTPHQHSLLQSLSQSHAHSTDSSQSGASKLERFLPNHFVETQLNDVPCSNGNGLTVGL
PAGVYDDNPNPLQLHVAGAVAAAAAAAAVAAQKRSNELHLSNGGCVQLPPDEDEDDEP
QNAAFKDHHHGLGHGHGHGHSLGHGHGLGHGLHLGHGIGPVHGHGHMDDTAETPDYEK
DCNGALNLHHQNNNNNDNNISMKSEPLSEGEFNPDDIQLMQTNYNGTQNYYSPGIDQN
ILHPDVIVDTDNLSDCSSSTSLKKSKRKLSTSTDGDSTNYELIEYLKRREKRDEELLK
RMDAREERFMNLLDRTVIAIEKLAAGRTSVPETQLGTSTISQPIAPSPSAQAPPAATA
PAAAAEAATAATTVETAADSSRSSFTATASILHSSDSQTHRADQPRVVSEVVTQPP"
misc_feature 490..735
/gene="LOC111072395"
/note="Myb/SANT-like DNA-binding domain; Region:
Myb_DNA-bind_4; pfam13837"
/db_xref="CDD:463994"
ORIGIN
1 ttcagtttca gtttcgttgt tgtttggttt ggttagtttt tagttgacat caattggatg
61 gggaatagaa aagtgcatgg aataaccgtt accgttataa gcggtaaatg cagcgagccg
121 tttcattata tttgaaatgc aaaataccgt tagacagaaa caacaagaaa tagaggaacc
181 cttccccaac tccgtggcag aagccgccat tacaagttac aaaaggagca accattgaaa
241 agcgtgaagc acgtacacac acactagcaa acacacacag acaaagaaga taaagagcaa
301 aaatagagaa gaggagagat ctcagcgtta cgcctactca gcaaaaacgg gtgggcggca
361 ggaggaagga ggagcaacga aaaagcggct gactacaaca aaaattctca aatggagacg
421 aagaacaacc cacccccagt gacgacgagc aactatcacc accagcgggc gccgcgcaca
481 ccggaaagct atttcaatgt gccagagtcg gtggctctgc tgaatattgt caagtcggag
541 cgcatacaga gtgcgttcca gtcgaaccgg aagaaccatg ccagtgtgtg ggagatggtg
601 gccgaggtgc tgaatcggtt cagtgcccgc aagcggacgg ccaagcagtg ctgcaatcgg
661 tacgagaacc tcaagaagat ctacacgcag ctgaagaaga atccggagcg acacgtgcgt
721 cgcaactggc cgtacatgtt cctgttcaag gagatcgagg agcagcgcgg cgagtgctgg
781 ggctccaacg gcaagcgtct ggctctcatc tccaagaatc acaacgagct ctcctactac
841 cagcgccggc gacaggcggc cgagctcggc gtcctgctga acaaggacaa tctcacgccc
901 caccagcaca gcctgctgca gagcctcagc cagagccatg cccactccac ggactcgtcg
961 cagagcggtg ccagcaagct ggagcgcttc ctgcccaacc actttgttga gacgcagctg
1021 aacgacgtgc cctgctccaa cggcaacggc ctgacggtgg gcctgcccgc cggcgtctat
1081 gacgacaatc ccaatcccct tcagctgcat gtggccggcg ctgtggcggc tgccgcagct
1141 gccgctgccg tggccgccca aaagcggagc aatgaactgc acctgagcaa tggcggctgt
1201 gttcagctgc cgccggatga ggacgaggac gacgagccac agaatgctgc atttaaggat
1261 catcatcatg gacttggtca cggacacgga cacggccata gtctggggca cggccatggt
1321 ctggggcatg gtcttcatct gggacatggc attggtcctg ttcatggtca tgggcatatg
1381 gatgacaccg ccgaaacgcc cgactacgag aaggactgca atggagccct caacttgcac
1441 catcagaaca ataacaacaa cgacaacaac atatccatga aatcggagcc actttcggag
1501 ggggaattca atccggatga cattcagctg atgcagacca actacaatgg gacacagaac
1561 tattactcgc cgggcatcga ccagaacatc ctgcatccgg atgttatcgt ggacacggac
1621 aatctgtccg actgcagctc ctccacctcg ctgaagaaga gcaagcgcaa gctgtccacc
1681 tccaccgatg gcgacagcac caattacgag ctgatcgagt atttgaagag gcgcgagaag
1741 cgcgacgaag agctcctcaa acggatggat gcacgcgagg agcgattcat gaatcttctg
1801 gatcggacgg tgattgccat tgagaagctg gctgcgggca ggacgtcagt cccagagaca
1861 cagctgggca ccagtaccat cagtcagcca atagctcctt ctccctcagc acaggcacca
1921 ccagcagcaa cagcgccagc agcagcagca gaagcagcaa cagcagcaac aacagtagaa
1981 acagcagcag acagcagcag aagcagcttc accgctaccg ccagcatcct ccactcatct
2041 gatagccaaa cccaccgagc agaccagccc cgtgttgtca gtgaagtcgt cacccagcca
2101 ccgtagtcgc agtccgacat tggagcgaga tgaaccaagc agcgctgctg ctactgctgc
2161 tgttgttgtt ctggtacccg atgaaccgga tgtcgatgct caggcggagg cactgacagc
2221 gagtgctgca tccacggatg cgtaggttat atacttctta cttaataccc ccacaatctc
2281 caccaaaaaa aaaacaaacc actcgtgaac cggatggagg agacccggat cggagaatta
2341 ccaaagtacc aaagacgagc tacgtctcga attttaattg agatttgcaa cgtttacata
2401 gagccgggga cgcagcgcca catccccgtc atccatgaca gccaaaagtg cttaaggatt
2461 cctacgccac cgccccgaac ggaggaagtc ccccaaaacc ggaatattgc gaatcgcaat
2521 gtttgtgcca gaaataattt taaaaaacgt ttattttagt ctaagttcaa atcagaggag
2581 gagaaaaagt tctgcaactg caacgctctt ctcctgcttc tgctgcgggg tcggatcggc
2641 cgaattgcca tgatttataa gcaatcgaaa atttctatga taatctagtt gcgagctgaa
2701 atggggctca aaatgcattt gtgcaggaac ttatacacat aatcatctct ctctcccact
2761 accactacca ttctctctct atatc