PREDICTED: Drosophila obscura uncharacterized LOC111072395
LOCUS XM_022364242 2810 bp mRNA linear INV 14-MAY-2021
(LOC111072395), transcript variant X4, mRNA.
ACCESSION XM_022364242
VERSION XM_022364242.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022364242.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..2810
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..2810
/gene="LOC111072395"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 2 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 9 samples with support for all annotated
introns"
/db_xref="GeneID:111072395"
CDS 437..2131
/gene="LOC111072395"
/codon_start=1
/product="uncharacterized protein LOC111072395"
/protein_id="XP_022219934.2"
/db_xref="GeneID:111072395"
/translation="METKNNPPPVTTSNYHHQRAPRTPESYFNVPESVALLNIVKSER
IQSAFQSNRKNHASVWEMVAEVLNRFSARKRTAKQCCNRYENLKKIYTQLKKNPERHV
RRNWPYMFLFKEIEEQRGECWGSNGKRLALISKNHNELSYYQRRRQAAELGVLLNKDN
LTPHQHSLLQSLSQSHAHSTDSSQSGASKLERFLPNHFVETQLNDVPCSNGNGLTVGL
PAGVYDDNPNPLQLHVAGAVAAAAAAAAVAAQKRSNELHLSNGGCVQLPPDEDEDDEP
QNAAFKDHHHGLGHGHGHGHSLGHGHGLGHGLHLGHGIGPVHGHGHMDDTAETPDYEK
DCNGALNLHHQNNNNNDNNISMKSEPLSEGEFNPDDIQLMQTNYNGTQNYYSPGIDQN
ILHPDVIVDTDNLSDCSSSTSLKKSKRKLSTSTDGDSTNYELIEYLKRREKRDEELLK
RMDAREERFMNLLDRTVIAIEKLAAGRTSVPETQLGTSTISQPIAPSPSAQAPPAATA
PAAAAEAATAATTVETAADSSRSSFTATASILHSSDSQTHRADQPRVVSEVVTQPP"
misc_feature 515..760
/gene="LOC111072395"
/note="Myb/SANT-like DNA-binding domain; Region:
Myb_DNA-bind_4; pfam13837"
/db_xref="CDD:463994"
ORIGIN
1 acttgtgatc ggacggaccg agtggcgcgg cggcgtcttc gtccgcctaa cgaccgtagc
61 actatgtgaa tacatcaatt ggatggggaa tagaaaagtg catggaataa ccgttaccgt
121 tataagcggt aaatgcagcg agccgtttca ttatatttga aatgcaaaat accgttagac
181 agaaacaaca agaaatagag gaacccttcc ccaactccgt ggcagaagcc gccattacaa
241 gttacaaaag gagcaaccat tgaaaagcgt gaagcacgta cacacacact agcaaacaca
301 cacagacaaa gaagataaag agcaaaaata gagaagagga gagatctcag cgttacgcct
361 actcagcaaa aacgggtggg cggcaggagg aaggaggagc aacgaaaaag cggctgacta
421 caacaaaaat tctcaaatgg agacgaagaa caacccaccc ccagtgacga cgagcaacta
481 tcaccaccag cgggcgccgc gcacaccgga aagctatttc aatgtgccag agtcggtggc
541 tctgctgaat attgtcaagt cggagcgcat acagagtgcg ttccagtcga accggaagaa
601 ccatgccagt gtgtgggaga tggtggccga ggtgctgaat cggttcagtg cccgcaagcg
661 gacggccaag cagtgctgca atcggtacga gaacctcaag aagatctaca cgcagctgaa
721 gaagaatccg gagcgacacg tgcgtcgcaa ctggccgtac atgttcctgt tcaaggagat
781 cgaggagcag cgcggcgagt gctggggctc caacggcaag cgtctggctc tcatctccaa
841 gaatcacaac gagctctcct actaccagcg ccggcgacag gcggccgagc tcggcgtcct
901 gctgaacaag gacaatctca cgccccacca gcacagcctg ctgcagagcc tcagccagag
961 ccatgcccac tccacggact cgtcgcagag cggtgccagc aagctggagc gcttcctgcc
1021 caaccacttt gttgagacgc agctgaacga cgtgccctgc tccaacggca acggcctgac
1081 ggtgggcctg cccgccggcg tctatgacga caatcccaat ccccttcagc tgcatgtggc
1141 cggcgctgtg gcggctgccg cagctgccgc tgccgtggcc gcccaaaagc ggagcaatga
1201 actgcacctg agcaatggcg gctgtgttca gctgccgccg gatgaggacg aggacgacga
1261 gccacagaat gctgcattta aggatcatca tcatggactt ggtcacggac acggacacgg
1321 ccatagtctg gggcacggcc atggtctggg gcatggtctt catctgggac atggcattgg
1381 tcctgttcat ggtcatgggc atatggatga caccgccgaa acgcccgact acgagaagga
1441 ctgcaatgga gccctcaact tgcaccatca gaacaataac aacaacgaca acaacatatc
1501 catgaaatcg gagccacttt cggaggggga attcaatccg gatgacattc agctgatgca
1561 gaccaactac aatgggacac agaactatta ctcgccgggc atcgaccaga acatcctgca
1621 tccggatgtt atcgtggaca cggacaatct gtccgactgc agctcctcca cctcgctgaa
1681 gaagagcaag cgcaagctgt ccacctccac cgatggcgac agcaccaatt acgagctgat
1741 cgagtatttg aagaggcgcg agaagcgcga cgaagagctc ctcaaacgga tggatgcacg
1801 cgaggagcga ttcatgaatc ttctggatcg gacggtgatt gccattgaga agctggctgc
1861 gggcaggacg tcagtcccag agacacagct gggcaccagt accatcagtc agccaatagc
1921 tccttctccc tcagcacagg caccaccagc agcaacagcg ccagcagcag cagcagaagc
1981 agcaacagca gcaacaacag tagaaacagc agcagacagc agcagaagca gcttcaccgc
2041 taccgccagc atcctccact catctgatag ccaaacccac cgagcagacc agccccgtgt
2101 tgtcagtgaa gtcgtcaccc agccaccgta gtcgcagtcc gacattggag cgagatgaac
2161 caagcagcgc tgctgctact gctgctgttg ttgttctggt acccgatgaa ccggatgtcg
2221 atgctcaggc ggaggcactg acagcgagtg ctgcatccac ggatgcgtag gttatatact
2281 tcttacttaa tacccccaca atctccacca aaaaaaaaac aaaccactcg tgaaccggat
2341 ggaggagacc cggatcggag aattaccaaa gtaccaaaga cgagctacgt ctcgaatttt
2401 aattgagatt tgcaacgttt acatagagcc ggggacgcag cgccacatcc ccgtcatcca
2461 tgacagccaa aagtgcttaa ggattcctac gccaccgccc cgaacggagg aagtccccca
2521 aaaccggaat attgcgaatc gcaatgtttg tgccagaaat aattttaaaa aacgtttatt
2581 ttagtctaag ttcaaatcag aggaggagaa aaagttctgc aactgcaacg ctcttctcct
2641 gcttctgctg cggggtcgga tcggccgaat tgccatgatt tataagcaat cgaaaatttc
2701 tatgataatc tagttgcgag ctgaaatggg gctcaaaatg catttgtgca ggaacttata
2761 cacataatca tctctctctc ccactaccac taccattctc tctctatatc