LOCUS XM_041591909 2983 bp mRNA linear INV 14-MAY-2021
DEFINITION PREDICTED: Drosophila obscura uncharacterized LOC111066363
(LOC111066363), transcript variant X2, mRNA.
ACCESSION XM_041591909
VERSION XM_041591909.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..2983
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..2983
/gene="LOC111066363"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 4 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 1 sample with support for all annotated introns"
/db_xref="GeneID:111066363"
CDS 402..2939
/gene="LOC111066363"
/codon_start=1
/product="uncharacterized protein LOC111066363 isoform X1"
/protein_id="XP_041447843.1"
/db_xref="GeneID:111066363"
/translation="MSDKYQDKKIETSVAVTLHEDAQGASCSFGNGDGAAGAAGGAAG
AAGADEAAGAHGGAAAADGGAAGGHTGAAGADRGAAGAHGGVADADGGATGTEVGAVA
VAVPSVVVVQGGDVDADRDRESHSDRSAARDELGTQRVYKKTSPNCVLTLYLPTREMT
LTGDKAAVLRGILFVDPKAIQGYRVYAQLTLTFRYGREDEEVMGLRFCNEAIMSLHQI
WPRLQEPAPESLSPLQEALMKRLGAGAHAFTLVLNAHSPPSVQLVPAKRYYGAPIGTS
YDVRCFIADKTDEKFHRRASVKMGVRVIYRTDVFQHSSTEHFATCSGCHHQAGAKTSP
PAAPSSATLPVNRETSEQQTQTPTHSLGSGTKAKGKSERGDSFPKLRLSPKAFRFSGR
FGRSKSEIEKCPPDSFHNYSKSFQEHHCDCMLAPFSGAGVSGGIGSITLGPDGGPQGS
VDKPFLLHDGRVGLKASLDKGWYTHGEDVNVTIEIRNDSRKTVRKVRVCAIQHVDVCM
FNNGKFKNVVADSEQISPPVDRTVSPGATVNSVVVLRPQRGQTKNWIALEDSLQRSTE
PDEITGEIAASAIRPPHYIVPNVQQSGQQSLPSPAMAMPSADMPPQQGNATTTSTEDR
NIFAIYVSYYIKVKLTLSSMGGELSLKLPFVLVHVDETRRPGFASATLGELRIEMEKL
ALHDAPTGRRANRRDPPMGAESGDGPSTSAAAAAAQSTASGGGGGGGASSSGGHLDRI
ESADDEFHIHIPIPAAAGQKNAPQKRRLQRSETLARDLEESDEHLTEPHDDRAEHIVQ
IHLSQEPTKVERSQQTDDEQQQEQKSELERAQAPGAETGAVPKSTDV"
misc_feature 1776..2381
/gene="LOC111066363"
/note="Arrestin (or S-antigen), C-terminal domain; Region:
Arrestin_C; smart01017"
/db_xref="CDD:214976"
ORIGIN
1 ttccttccca ctttgttgcg gtattaattt cagcccccaa tcccttccac atttagaggc
61 aaacggacac tctcctgaat aggtacgagt atgagatgag ttcacatgag tgtgaaggaa
121 agtgggaagg agtaggaact acataagctt gtgacaagga aaggaaacaa agtaacgtac
181 gagcttaacg actgcgacca taacacagcc gcgacagcag cagcagcggc agcggcagcg
241 gtggcagcgg caggagtgga gatgttgaag tgccggacac ggacacgaat acggacacgg
301 acaggaaact cacccgcaaa ctgcaaattg caaattgcca aattgccaca aggaactctc
361 agaccagcag aagaagccaa gcctttcgat tgagtggaaa gatgtcggac aagtatcagg
421 acaagaagat cgaaacatct gtggccgtga ccctgcacga ggatgcccag ggggccagct
481 gcagctttgg caacggtgac ggggccgccg gagccgccgg aggagctgca ggagcagctg
541 gtgccgatga agccgcagga gcccatggcg gcgccgcagc agccgacgga ggagccgctg
601 gcggccacac tggagccgca ggtgccgaca gaggagccgc aggagcccac ggaggagtcg
661 cagatgccga cggaggagcc acgggtaccg aggtaggagc ggtggctgtt gcagttccat
721 ccgtggtggt ggtgcaaggt ggggatgtgg atgcggatcg cgacagggaa tcgcacagcg
781 acaggtcggc cgccagggac gagcttggaa cgcagcgcgt ctacaagaag acatcgccga
841 actgcgtgct tacgttgtat ttgccgacgc gcgagatgac gctcaccggg gacaaggcgg
901 cggtgctgcg tggcattctc ttcgtggacc caaaggccat ccagggatat cgtgtgtatg
961 cccagctaac gctgaccttt cgctatggcc gcgaggatga ggaggttatg ggcctgcgtt
1021 tctgcaacga ggccatcatg tcgctgcatc agatctggcc acgcctccag gagccagccc
1081 cggaatcgct tagccccttg caggaggccc ttatgaagcg tctaggagct ggcgcccacg
1141 ccttcacttt agtgcttaac gctcactcgc caccgagcgt tcagttggtt ccggccaagc
1201 gctattatgg ggctccaatc ggcaccagct acgatgtgcg ttgcttcatt gctgataaga
1261 cggacgagaa gttccatcga cgggcctcgg tcaagatggg ggttcgggtc atctaccgca
1321 cggacgtctt tcagcattcg agcacggagc actttgccac ctgctctggg tgccatcatc
1381 aggccggagc caagacgtcg ccaccggccg caccatccag cgccacgctg ccggtgaaca
1441 gggagaccag cgagcagcag acccaaacac cgacccattc cctgggctcc ggtacgaagg
1501 cgaagggcaa atccgaacgg ggtgactcgt tcccaaagtt gcgcctgtcg ccgaaagcat
1561 tccggttcag cggccgcttt ggccgcagca agagcgagat cgagaaatgc ccaccggact
1621 cgttccacaa ctacagcaag tccttccagg agcatcactg cgactgcatg ttggcgccat
1681 ttagcggtgc aggcgtatcc gggggcattg gcagcatcac cctcggcccc gacggcggac
1741 cccagggctc ggtggacaag ccgtttttgc tgcatgacgg ccgcgtcgga ctaaaggcca
1801 gcctggacaa gggctggtac acccatggcg aggatgtcaa tgtgaccatt gagatacgca
1861 acgacagccg caagacggtc cgaaaagtga gggtgtgcgc cattcagcat gtggacgtct
1921 gtatgttcaa caatgggaag tttaagaacg ttgtagctga ctcggagcaa atctcgccgc
1981 cggtggaccg taccgtaagc ccgggtgcca cggtcaattc ggtggtggtg ttgcggcccc
2041 agcgcggcca aacgaagaac tggatcgcgc tggaggactc gctgcagcga agtaccgagc
2101 cggacgagat cactggggag atcgccgcct cggcaatacg tcctccccac tatattgtgc
2161 cgaatgtgca acagtccggg caacagtcgc tgcccagtcc ggccatggcc atgcccagtg
2221 ccgatatgcc accacagcag ggaaacgcca ccaccacgtc gacggaggat cggaacatct
2281 tcgccatcta tgtgtcttac tacatcaagg taaagctcac tctgagcagc atgggtggcg
2341 aattgtcgtt aaagctcccc ttcgtactgg tgcacgtgga tgagaccagg cgcccgggct
2401 tcgcatcggc caccctcggc gagctacgca tcgagatgga aaagctggcg ctccacgacg
2461 cacccaccgg gcgacgtgcc aaccgccggg atccccccat gggggctgaa agcggggatg
2521 ggccatccac cagtgccgct gctgcggccg cccaatccac cgcgtccggc ggaggtggag
2581 gcggtggcgc cagctcgtct ggaggtcacc tcgaccgcat cgagtcagcg gacgatgagt
2641 tccatataca catccccatt ccagcagctg ccgggcaaaa gaacgcccca caaaagcggc
2701 gactgcagcg cagcgagaca ctggccagag atctcgagga gagcgacgaa cacctgaccg
2761 agccgcatga cgaccgggcg gagcacattg tccagattca cctctcgcag gagcccacca
2821 aggtggagcg atcccagcaa accgacgacg agcagcagca ggagcagaag tccgagctgg
2881 agcgcgctca agcaccaggc gccgagactg gggccgtgcc caagtctaca gatgtctgat
2941 tctgattgtg gtcccagatg tttgccaagc gcaagagaaa gcc