PREDICTED: Drosophila obscura uncharacterized LOC111066225
LOCUS XM_041591923 5697 bp mRNA linear INV 14-MAY-2021
(LOC111066225), transcript variant X2, mRNA.
ACCESSION XM_041591923
VERSION XM_041591923.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..5697
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..5697
/gene="LOC111066225"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 1 Protein, and 99% coverage of the
annotated genomic feature by RNAseq alignments"
/db_xref="GeneID:111066225"
CDS 372..5681
/gene="LOC111066225"
/codon_start=1
/product="uncharacterized protein LOC111066225 isoform X2"
/protein_id="XP_041447857.1"
/db_xref="GeneID:111066225"
/translation="MDTVAIKTALPHAPLAAQQQQQQQQQQQQQPQNAQTATINAKNN
IGAFFQTCKVLWHLDAFRRSFRGLNQHVCGGQDCIFCALKELFQQLQTSSEPALCPEP
LRRALASGPLAGRRFPLGCLGDAAECFELLLHRVHSHISSDDGDSCESSACIAHRRFA
MRVIEQSVCKCGANSEQLPFTQMVHYVSASALTSQKSLALQSHQQLSFGQLLRAAGNM
GDIRDCPNTCGAKIGIRRALLNRPDVVSIGIVWDSERPAADQVHAVLKAVGTSLRLAD
VFHQVSEQRWAQQAQHELVGIVSYYGKHYTTFFFHTKLKVWVYFDDANVKEVGPSWDG
VVDKCSRGRYQPLLLLYAVPQPPSSAAGMDQLPVAGSLPSPAASMTAAGLAATVVRRA
VTPSPEKPSLGSTRRAITPTPLRTPTNDYQNLSVIQKSIFPSNGAGTDETDAYISRKT
VEHVLSAQYQNLSVIQDKINAVPGSVATSNADKDGGFLNRKPIEAMLSAQLARRQHLQ
LQRSHSAESSSGHASNGSSPPSDGLTMPEHLNQPRRRDSGNWSGDRNSASSASSTTLD
NPYLYMVAKRGVAAVPQSPTRHGLPYDPGYDSFSLSSTDSYPPKHILNPQLAKIPEAA
MGMGMGMGMGMGAGVGVGAGNVQGAGVGVGAGVPVGHGLALSGDCEKLCHEADQLLEK
SRIVEESHDLETALVLCNAAAGRARAAMDAPYSNPHTMTFARMKHNTCVMRARSLHRR
ILVEKGAESEAMPELKHMREGSTSSVKHVRQNSKDRTLEKEQLQMQLQLQQQLQQQLV
PSAVSKNIEIYATLPKRKSPLKALAAAASGSGSVGMDDNAIEYEQQVVASSSSPASSQ
KPERERESRSLFGRKDKDKDKEKEKRSRSEDRNKLTREFSLTETLLVNAKDTLKKHKE
DKDEKKEKDKSGKKQHKIRRKLLMGGLIRRKNRSMPDLTEAVDDAAAPISGPAHHHHH
LQLHHPCAPTAGHLMGSSVDDSAVGLGKHALAMSGYISEGHFDYPLGTGPCTSNVGVG
SGSGSNPNPNLERSKLMRKSFHGSGRQLTMPVPKVAPPPPMRTSSALTPHQQQQQQQQ
QFHHEANLSNLSAMSSNTSISEDSCQTIITTCAQVHSEQSPLKDMQMLGMAQGQEELP
LPLPPMELPPYPSPPHSVCHSRQASEDFPPPPPPELDLEPLNQQLSQLQQLEAANKQQ
RQQQLDSLEGTTSILAQLQQRQHLLKLRKEQTGAGSDATWLKELQAKQAADLRTMQRK
LEASSPSSVRDLAHRFEQTATIRSYASQELLAQPRTQMNGLPSQAKLDADEVDCAGPV
RASLPLPLPLPHQLMLPKPKYEMSQSQIAEEIREVELLNSMVQQTLNQSAAAGAPPKR
VKKKSVSFCDQVILVATAGNEEDDDFIPNPILERVLRTAQHPNEKVTAQMIQQQQQNM
LRLQTEPQPRQQHQQQQQQQQQQQQQQQQQQQQQQQQLQLQHQQQQQQPSPMLGRPAA
TDMQRYVQRMQQQQQQQQQQQQQQQEMYRQQQQQQQHRTSMSGIVSQQHLQLQQQQQQ
QQQQHQIDAQYTSMPRPLHHPHLLPQTYSVQLQKHQQHQQQLQQQHQQQPSPQLSLGY
FSNGQAAASEQVPQPQPLPSPYQRVPLPHGYQPTGGPYYPLPNQQQQQLIIANGKPAQ
KKVSFEPGTKGEADCLPPPPPPPQTKLALQNGLPESASPGAGPGSAPAAITAIPTRVY
NNAIVKASAKAVECNLCRKRHVIAPAVYCTNCEYYLQMLNQRR"
misc_feature 498..1364
/gene="LOC111066225"
/note="Peptidase C19 contains ubiquitinyl hydrolases. They
are intracellular peptidases that remove ubiquitin
molecules from polyubiquinated peptides by cleavage of
isopeptide bonds. They hydrolyse bonds involving the
carboxyl group of the C-terminal Gly...; Region:
Peptidase_C19; cl02553"
/db_xref="CDD:470612"
misc_feature <5589..>5657
/gene="LOC111066225"
/note="protein kinase C conserved region 1 (C1 domain)
superfamily; Region: C1; cl00040"
/db_xref="CDD:412127"
ORIGIN
1 acaacagcaa caaaaacagc aacaacagca acaaaaagtt tagttaagtg aacgcaaaca
61 ggtgcaaaac gaagcacgag aagaagaacc aaaaaaagtg taagagctgg aagagagaag
121 aaccagaagt ggaggaaaac aacaacagca aaaaaaaaat agaagcagca gcagcagttg
181 cccaagaagt ggaagcaaaa gggttgtcga tactcgcaac acttcccttt cgggcttcct
241 ctggcggaag ttgcaagtgg caacaacagc aacaacagca acaacagcaa caacaacaac
301 aacaacaacc atgtgctgat ctcaaagcgc ggggaatagc cttactgttc ggttgccaaa
361 actcattcaa tatggatacg gtggccatta agacagcact gccgcatgcc ccattggctg
421 cccaacagca gcagcagcaa cagcaacagc agcagcagca accacagaat gcccaaacag
481 caacaatcaa tgccaaaaac aacattgggg cattttttca gacctgcaag gtcttgtggc
541 acttggacgc ctttcgacgc tcgttccggg gccttaacca gcatgtatgc ggcggccaag
601 actgtatatt ctgcgctcta aaggaactct tccagcagct gcagaccagc tcggagcccg
661 cactctgtcc ggaacccttg cggcgagcct tggccagtgg acccctggcc ggacgacgat
721 ttccgctggg ctgcctgggc gatgctgccg aatgctttga gctgctgctg catcgcgtcc
781 actcgcacat ctcgtccgat gatggggact cgtgcgagtc gagtgcctgc attgcccatc
841 gccgctttgc catgcgcgtc atcgagcaga gtgtctgcaa gtgtggggcc aactccgagc
901 agcttccctt cacgcagatg gtgcactatg tgtccgcctc ggcccttaca tcgcagaaga
961 gcctggccct gcagagccac cagcagctga gctttggcca gctgctgaga gctgccggca
1021 acatgggcga catacgcgac tgtccgaaca cctgtggagc caagatcggg atacgccgag
1081 cgctgctcaa tcgtcccgat gtcgtctcca ttggcatcgt ttgggactcg gaacgacccg
1141 ccgccgacca ggtgcatgcc gtgctcaagg cggtcggaac gagtctgcgc ctggcggacg
1201 tcttccatca ggtcagcgag cagcgctggg cccagcaggc gcagcacgag ctggtgggga
1261 tcgtctccta ctacggcaag cactacacca ccttcttctt ccacaccaag ctgaaggtgt
1321 gggtgtactt cgacgacgcc aacgtcaagg aggtgggtcc cagctgggac ggagtggtgg
1381 acaagtgcag tcgcggccgc taccagccgc tgctgctgct gtacgcggtg ccccagccgc
1441 ccagctccgc ggcgggcatg gatcagctgc ccgtcgccgg atcgctgccc tcgccggctg
1501 cctcaatgac ggcggcggga ctggccgcga cggtggtgcg ccgggcggtg acacccagcc
1561 cggagaagcc ctcgctgggc agcactcgac gggccattac gcccacgccc ctgcgcacgc
1621 ccaccaatga ctaccagaat ctgagtgtca tccagaagag cattttcccg tcgaatgggg
1681 cgggcaccga tgagacggat gcctacatca gccgcaagac ggtggagcac gtgctgagcg
1741 cacagtacca gaacctgagc gtcatccagg acaagatcaa cgccgtgccc ggctcggtgg
1801 ccacgtccaa tgccgacaag gatgggggct tcctcaatcg gaagcccatc gaggcgatgc
1861 tgagcgccca gctggcgcgc cggcagcacc tccagctgca gcgcagccac agcgccgagt
1921 cgagctccgg ccacgcctcc aatggcagct cgccgcccag cgacgggctg accatgcccg
1981 agcacctgaa ccagccccgg agacgggact cgggcaactg gtcgggggat cgcaacagcg
2041 ccagctcggc cagctccacc accctggaca atccctacct gtacatggtg gccaagcggg
2101 gcgtggcggc ggtgccgcag agccccacgc ggcacggcct gccctacgat cccggctacg
2161 actcgttctc gctgagctcc acggactcct atccgcccaa gcacatcctc aatccgcagc
2221 tggccaagat acccgaggca gccatgggaa tgggaatggg catgggcatg ggcatgggcg
2281 ccggagtcgg agttggggcg ggaaacgttc aaggagcagg agtcggagtc ggagcaggag
2341 tcccagtcgg acatggcctg gccctgtccg gggactgtga gaagctctgc cacgaggcgg
2401 accagctgct ggagaagtcg cgcatcgtgg aggagtcgca cgacctggag acggccctgg
2461 tgctgtgcaa tgcggccgcc ggccgggccc gggccgccat ggatgcgccc tacagcaacc
2521 cgcacacgat gacctttgcc cggatgaagc acaacacgtg cgtgatgcgg gcccgcagcc
2581 tgcaccggcg catcctcgtg gagaagggcg ccgaatcgga ggccatgccc gagctgaagc
2641 acatgcggga gggcagcacc agcagcgtga agcacgtccg ccagaacagc aaggaccgca
2701 ccctggagaa ggagcagctc cagatgcagc tccagctgca gcagcagctg cagcagcagc
2761 tcgtgccaag cgccgtctcc aagaacatcg agatctacgc gacgctgccc aagcgaaaga
2821 gcccgctcaa ggcgctggcc gctgccgcct ctggctctgg ttcagtgggc atggatgaca
2881 atgccatcga gtacgagcag caggtggtgg ccagcagcag cagtccggcc agcagccaga
2941 agccggagcg ggagcgggag agccgctcgc tgttcgggcg caaggacaag gacaaagaca
3001 aggagaagga gaagcgcagc cggagcgagg accggaacaa gctcaccagg gagttctcgc
3061 tgacggagac gctgctggtc aatgccaagg acacgctgaa gaagcacaag gaggacaagg
3121 acgagaagaa ggagaaggac aagagcggca agaagcagca caagatacgc cgcaagctgc
3181 tcatgggcgg cctcatccgc cgcaagaacc gctccatgcc cgatctgacc gaggccgtgg
3241 acgacgccgc agcgccgatc agcggcccag cccatcatca ccatcacctg cagctgcacc
3301 atccctgcgc cccgaccgct ggccacctca tgggcagctc cgtggacgac agcgccgtgg
3361 gcctgggcaa gcacgccctg gccatgagcg gctacatctc cgagggtcac tttgactatc
3421 cgctgggcac tgggccgtgc accagtaatg tgggagtggg cagtgggagc ggcagcaatc
3481 ccaatcccaa tctggagcga agcaaactga tgcggaagag cttccacggc agtggccgcc
3541 agctgaccat gcccgtgccc aaggtggcac cgccgccgcc catgcgcacc agctccgccc
3601 tgacgcccca ccagcagcag cagcaacagc agcagcaatt ccatcacgag gccaatctgt
3661 cgaacctctc cgccatgagc tcaaacacct cgatcagtga ggactcctgc cagacgatca
3721 tcacgacctg tgcccaggtg cactccgagc agagcccgct caaggacatg caaatgctgg
3781 gcatggccca gggccaggag gagctgccgc tgccgctgcc gccgatggag ctgccgccgt
3841 acccgagtcc gccgcactcc gtttgccact cgcggcaggc cagcgaggac ttcccgccac
3901 cgccgccacc cgagctcgat ctggagccgc tgaaccagca gctcagccag ctgcagcagc
3961 tggaggcggc caacaagcag cagcgccagc agcaactcga ctcactggag ggcaccacca
4021 gcatcctggc ccagctgcag cagcgccagc atctgctcaa gctgcgcaag gagcagacgg
4081 gagccggcag cgacgccacc tggctgaagg agctgcaggc caagcaggcc gccgatctgc
4141 gcacgatgca gcgcaagctg gaggcctcct cacccagcag cgtgcgcgat ctggcgcacc
4201 gcttcgagca gacggccacc attcggtcgt acgcgtcgca ggagctcctg gcccagccac
4261 gcacccagat gaacggcctg cccagccagg cgaagctcga cgctgatgag gttgactgtg
4321 cgggcccagt gcgtgcctct ctcccgctgc cactgccgct gcctcatcag ctgatgctgc
4381 ccaagcccaa gtacgagatg tcgcagtcgc agatcgccga ggagatccgc gaggtggagc
4441 tgctcaactc gatggtgcag cagacgctca accagagtgc cgccgctgga gccccaccca
4501 agcgggtgaa gaagaagagc gtgtcgttct gcgaccaggt gatcctggtg gcaacggccg
4561 gcaatgaaga ggacgacgac ttcataccca acccgatact ggagcgcgta ctgcgcaccg
4621 cacagcaccc gaacgagaag gtaacggccc agatgatcca gcagcagcag cagaacatgc
4681 tccgactgca gacggagccg cagccccgac aacagcacca gcagcagcag cagcagcaac
4741 agcagcagca gcagcaacag cagcagcagc agcaacagca gcagcaacaa ttgcagcttc
4801 aacatcaaca gcagcagcag cagccttctc ccatgctggg aagaccagcg gcaacggata
4861 tgcagcgata tgtgcagaga atgcaacaac aacagcagca acaacagcag cagcaacaac
4921 agcagcagga gatgtaccgc cagcagcagc aacagcagca gcatcgcacc agcatgagtg
4981 gcatcgtcag ccagcaacac ctgcaactgc aacagcagca gcagcagcag cagcagcaac
5041 accagatcga tgcacagtac accagcatgc cccggccatt gcatcatccc catctactgc
5101 cacagaccta ctcagtgcag ctgcagaaac accagcagca ccagcaacag ctgcaacagc
5161 agcaccaaca gcagccatcg ccgcaactga gcctgggata cttcagcaat ggccaggcag
5221 cggccagcga gcaagtgcca cagccacagc cactgccttc gccttatcag cgagtgccac
5281 tgccacatgg ctaccagccc acaggcggac cctactaccc tttgccaaac caacagcagc
5341 aacagctgat tattgccaat ggaaaacctg cccagaagaa ggtcagcttc gagccaggga
5401 ccaagggcga agccgactgc ctgccaccgc caccaccgcc gccacaaacg aagctggccc
5461 tgcagaatgg tctgcccgag tcggccagcc caggagctgg ccctggctcg gcaccggcgg
5521 ccatcacagc catacccacc cgcgtctaca acaatgccat cgtgaaggcc tcggccaagg
5581 ccgtggagtg caatctgtgc cgcaagcggc acgtcattgc gccagccgtc tactgcacca
5641 actgcgagta ctatctgcag atgctgaacc agcgcagatg atgcactccc agtccca