PREDICTED: Drosophila obscura uncharacterized LOC111066225
LOCUS XM_041591922 5976 bp mRNA linear INV 14-MAY-2021
(LOC111066225), transcript variant X1, mRNA.
ACCESSION XM_041591922
VERSION XM_041591922.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..5976
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..5976
/gene="LOC111066225"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 1 Protein, and 99% coverage of the
annotated genomic feature by RNAseq alignments"
/db_xref="GeneID:111066225"
CDS 486..5960
/gene="LOC111066225"
/codon_start=1
/product="uncharacterized protein LOC111066225 isoform X1"
/protein_id="XP_041447856.1"
/db_xref="GeneID:111066225"
/translation="MIKFRYKRKEPTNVVAQGCAGAAAVPASPQSIAAPSASIATTLQ
HQQLNQLMGSNGGGTLPRSSPRKLMMMEAGEGGHKAATLTRQSKQKASVSALAKVASS
VELAVLGYNNNNNHSPSNGGGISNGIAERDRCINAVIELFQQLQTSSEPALCPEPLRR
ALASGPLAGRRFPLGCLGDAAECFELLLHRVHSHISSDDGDSCESSACIAHRRFAMRV
IEQSVCKCGANSEQLPFTQMVHYVSASALTSQKSLALQSHQQLSFGQLLRAAGNMGDI
RDCPNTCGAKIGIRRALLNRPDVVSIGIVWDSERPAADQVHAVLKAVGTSLRLADVFH
QVSEQRWAQQAQHELVGIVSYYGKHYTTFFFHTKLKVWVYFDDANVKEVGPSWDGVVD
KCSRGRYQPLLLLYAVPQPPSSAAGMDQLPVAGSLPSPAASMTAAGLAATVVRRAVTP
SPEKPSLGSTRRAITPTPLRTPTNDYQNLSVIQKSIFPSNGAGTDETDAYISRKTVEH
VLSAQYQNLSVIQDKINAVPGSVATSNADKDGGFLNRKPIEAMLSAQLARRQHLQLQR
SHSAESSSGHASNGSSPPSDGLTMPEHLNQPRRRDSGNWSGDRNSASSASSTTLDNPY
LYMVAKRGVAAVPQSPTRHGLPYDPGYDSFSLSSTDSYPPKHILNPQLAKIPEAAMGM
GMGMGMGMGAGVGVGAGNVQGAGVGVGAGVPVGHGLALSGDCEKLCHEADQLLEKSRI
VEESHDLETALVLCNAAAGRARAAMDAPYSNPHTMTFARMKHNTCVMRARSLHRRILV
EKGAESEAMPELKHMREGSTSSVKHVRQNSKDRTLEKEQLQMQLQLQQQLQQQLVPSA
VSKNIEIYATLPKRKSPLKALAAAASGSGSVGMDDNAIEYEQQVVASSSSPASSQKPE
RERESRSLFGRKDKDKDKEKEKRSRSEDRNKLTREFSLTETLLVNAKDTLKKHKEDKD
EKKEKDKSGKKQHKIRRKLLMGGLIRRKNRSMPDLTEAVDDAAAPISGPAHHHHHLQL
HHPCAPTAGHLMGSSVDDSAVGLGKHALAMSGYISEGHFDYPLGTGPCTSNVGVGSGS
GSNPNPNLERSKLMRKSFHGSGRQLTMPVPKVAPPPPMRTSSALTPHQQQQQQQQQFH
HEANLSNLSAMSSNTSISEDSCQTIITTCAQVHSEQSPLKDMQMLGMAQGQEELPLPL
PPMELPPYPSPPHSVCHSRQASEDFPPPPPPELDLEPLNQQLSQLQQLEAANKQQRQQ
QLDSLEGTTSILAQLQQRQHLLKLRKEQTGAGSDATWLKELQAKQAADLRTMQRKLEA
SSPSSVRDLAHRFEQTATIRSYASQELLAQPRTQMNGLPSQAKLDADEVDCAGPVRAS
LPLPLPLPHQLMLPKPKYEMSQSQIAEEIREVELLNSMVQQTLNQSAAAGAPPKRVKK
KSVSFCDQVILVATAGNEEDDDFIPNPILERVLRTAQHPNEKVTAQMIQQQQQNMLRL
QTEPQPRQQHQQQQQQQQQQQQQQQQQQQQQQQQLQLQHQQQQQQPSPMLGRPAATDM
QRYVQRMQQQQQQQQQQQQQQQEMYRQQQQQQQHRTSMSGIVSQQHLQLQQQQQQQQQ
QHQIDAQYTSMPRPLHHPHLLPQTYSVQLQKHQQHQQQLQQQHQQQPSPQLSLGYFSN
GQAAASEQVPQPQPLPSPYQRVPLPHGYQPTGGPYYPLPNQQQQQLIIANGKPAQKKV
SFEPGTKGEADCLPPPPPPPQTKLALQNGLPESASPGAGPGSAPAAITAIPTRVYNNA
IVKASAKAVECNLCRKRHVIAPAVYCTNCEYYLQMLNQRR"
misc_feature 1020..1703
/gene="LOC111066225"
/note="Peptidase C19 contains ubiquitinyl hydrolases. They
are intracellular peptidases that remove ubiquitin
molecules from polyubiquinated peptides by cleavage of
isopeptide bonds. They hydrolyse bonds involving the
carboxyl group of the C-terminal Gly...; Region:
Peptidase_C19; cd02257"
/db_xref="CDD:239072"
misc_feature <5868..>5936
/gene="LOC111066225"
/note="protein kinase C conserved region 1 (C1 domain)
superfamily; Region: C1; cl00040"
/db_xref="CDD:412127"
ORIGIN
1 gcccaaacaa gtttcagtcg cgagacagtg cggacgtgtt tgatccaacc gaaccatcca
61 gccaccagaa acagtaaacg agaatcgaaa aaaaattgcg cgcgcggcaa aataaaaaaa
121 aaacgctgat ttcaatcata tctcgttttg tttttctttt ttcgagtttt gagtgtgtgt
181 gtgtgtgtgt gcccgttgtc cgtgcctgac aaaagttgaa gagttcgaca aaagttgaag
241 aaacaacaaa aagaacaaca acaacaacga caacaacgaa gttgtttgac tggcaaatgt
301 gcaaaagtaa ctttttagct ggaaaaccat ttgggatttg gataactcag cgagaatcga
361 ggaacgagaa acgagcaacg agcctggaat acacatgtac atgaataccc ctattagaat
421 gatgccggct gaaatgccca actgaaacgc aacgaaaccc aatcggagcg agcgcaggga
481 gcaacatgat caagtttcgg tacaagcgaa aggagcccac caatgtggtg gcacagggct
541 gcgctggagc agcagcagtg ccggcatcac cccagtccat agcagccccc tcagccagca
601 tagccacaac gctgcagcat cagcagctga accagctgat gggcagcaat ggcggcggca
661 ccctgccccg cagcagtccg cgcaagctga tgatgatgga ggcgggcgag ggtggccaca
721 aggcggccac actgacgcgc cagagcaagc agaaggcgtc ggtgagtgct ctggccaagg
781 tggccagcag cgtggagctg gcggtgctcg gctacaacaa caacaacaac cacagcccct
841 cgaatggcgg aggaatcagc aatggaatcg cggaaagaga tcgctgcatc aatgcggtca
901 ttgaactctt ccagcagctg cagaccagct cggagcccgc actctgtccg gaacccttgc
961 ggcgagcctt ggccagtgga cccctggccg gacgacgatt tccgctgggc tgcctgggcg
1021 atgctgccga atgctttgag ctgctgctgc atcgcgtcca ctcgcacatc tcgtccgatg
1081 atggggactc gtgcgagtcg agtgcctgca ttgcccatcg ccgctttgcc atgcgcgtca
1141 tcgagcagag tgtctgcaag tgtggggcca actccgagca gcttcccttc acgcagatgg
1201 tgcactatgt gtccgcctcg gcccttacat cgcagaagag cctggccctg cagagccacc
1261 agcagctgag ctttggccag ctgctgagag ctgccggcaa catgggcgac atacgcgact
1321 gtccgaacac ctgtggagcc aagatcggga tacgccgagc gctgctcaat cgtcccgatg
1381 tcgtctccat tggcatcgtt tgggactcgg aacgacccgc cgccgaccag gtgcatgccg
1441 tgctcaaggc ggtcggaacg agtctgcgcc tggcggacgt cttccatcag gtcagcgagc
1501 agcgctgggc ccagcaggcg cagcacgagc tggtggggat cgtctcctac tacggcaagc
1561 actacaccac cttcttcttc cacaccaagc tgaaggtgtg ggtgtacttc gacgacgcca
1621 acgtcaagga ggtgggtccc agctgggacg gagtggtgga caagtgcagt cgcggccgct
1681 accagccgct gctgctgctg tacgcggtgc cccagccgcc cagctccgcg gcgggcatgg
1741 atcagctgcc cgtcgccgga tcgctgccct cgccggctgc ctcaatgacg gcggcgggac
1801 tggccgcgac ggtggtgcgc cgggcggtga cacccagccc ggagaagccc tcgctgggca
1861 gcactcgacg ggccattacg cccacgcccc tgcgcacgcc caccaatgac taccagaatc
1921 tgagtgtcat ccagaagagc attttcccgt cgaatggggc gggcaccgat gagacggatg
1981 cctacatcag ccgcaagacg gtggagcacg tgctgagcgc acagtaccag aacctgagcg
2041 tcatccagga caagatcaac gccgtgcccg gctcggtggc cacgtccaat gccgacaagg
2101 atgggggctt cctcaatcgg aagcccatcg aggcgatgct gagcgcccag ctggcgcgcc
2161 ggcagcacct ccagctgcag cgcagccaca gcgccgagtc gagctccggc cacgcctcca
2221 atggcagctc gccgcccagc gacgggctga ccatgcccga gcacctgaac cagccccgga
2281 gacgggactc gggcaactgg tcgggggatc gcaacagcgc cagctcggcc agctccacca
2341 ccctggacaa tccctacctg tacatggtgg ccaagcgggg cgtggcggcg gtgccgcaga
2401 gccccacgcg gcacggcctg ccctacgatc ccggctacga ctcgttctcg ctgagctcca
2461 cggactccta tccgcccaag cacatcctca atccgcagct ggccaagata cccgaggcag
2521 ccatgggaat gggaatgggc atgggcatgg gcatgggcgc cggagtcgga gttggggcgg
2581 gaaacgttca aggagcagga gtcggagtcg gagcaggagt cccagtcgga catggcctgg
2641 ccctgtccgg ggactgtgag aagctctgcc acgaggcgga ccagctgctg gagaagtcgc
2701 gcatcgtgga ggagtcgcac gacctggaga cggccctggt gctgtgcaat gcggccgccg
2761 gccgggcccg ggccgccatg gatgcgccct acagcaaccc gcacacgatg acctttgccc
2821 ggatgaagca caacacgtgc gtgatgcggg cccgcagcct gcaccggcgc atcctcgtgg
2881 agaagggcgc cgaatcggag gccatgcccg agctgaagca catgcgggag ggcagcacca
2941 gcagcgtgaa gcacgtccgc cagaacagca aggaccgcac cctggagaag gagcagctcc
3001 agatgcagct ccagctgcag cagcagctgc agcagcagct cgtgccaagc gccgtctcca
3061 agaacatcga gatctacgcg acgctgccca agcgaaagag cccgctcaag gcgctggccg
3121 ctgccgcctc tggctctggt tcagtgggca tggatgacaa tgccatcgag tacgagcagc
3181 aggtggtggc cagcagcagc agtccggcca gcagccagaa gccggagcgg gagcgggaga
3241 gccgctcgct gttcgggcgc aaggacaagg acaaagacaa ggagaaggag aagcgcagcc
3301 ggagcgagga ccggaacaag ctcaccaggg agttctcgct gacggagacg ctgctggtca
3361 atgccaagga cacgctgaag aagcacaagg aggacaagga cgagaagaag gagaaggaca
3421 agagcggcaa gaagcagcac aagatacgcc gcaagctgct catgggcggc ctcatccgcc
3481 gcaagaaccg ctccatgccc gatctgaccg aggccgtgga cgacgccgca gcgccgatca
3541 gcggcccagc ccatcatcac catcacctgc agctgcacca tccctgcgcc ccgaccgctg
3601 gccacctcat gggcagctcc gtggacgaca gcgccgtggg cctgggcaag cacgccctgg
3661 ccatgagcgg ctacatctcc gagggtcact ttgactatcc gctgggcact gggccgtgca
3721 ccagtaatgt gggagtgggc agtgggagcg gcagcaatcc caatcccaat ctggagcgaa
3781 gcaaactgat gcggaagagc ttccacggca gtggccgcca gctgaccatg cccgtgccca
3841 aggtggcacc gccgccgccc atgcgcacca gctccgccct gacgccccac cagcagcagc
3901 agcaacagca gcagcaattc catcacgagg ccaatctgtc gaacctctcc gccatgagct
3961 caaacacctc gatcagtgag gactcctgcc agacgatcat cacgacctgt gcccaggtgc
4021 actccgagca gagcccgctc aaggacatgc aaatgctggg catggcccag ggccaggagg
4081 agctgccgct gccgctgccg ccgatggagc tgccgccgta cccgagtccg ccgcactccg
4141 tttgccactc gcggcaggcc agcgaggact tcccgccacc gccgccaccc gagctcgatc
4201 tggagccgct gaaccagcag ctcagccagc tgcagcagct ggaggcggcc aacaagcagc
4261 agcgccagca gcaactcgac tcactggagg gcaccaccag catcctggcc cagctgcagc
4321 agcgccagca tctgctcaag ctgcgcaagg agcagacggg agccggcagc gacgccacct
4381 ggctgaagga gctgcaggcc aagcaggccg ccgatctgcg cacgatgcag cgcaagctgg
4441 aggcctcctc acccagcagc gtgcgcgatc tggcgcaccg cttcgagcag acggccacca
4501 ttcggtcgta cgcgtcgcag gagctcctgg cccagccacg cacccagatg aacggcctgc
4561 ccagccaggc gaagctcgac gctgatgagg ttgactgtgc gggcccagtg cgtgcctctc
4621 tcccgctgcc actgccgctg cctcatcagc tgatgctgcc caagcccaag tacgagatgt
4681 cgcagtcgca gatcgccgag gagatccgcg aggtggagct gctcaactcg atggtgcagc
4741 agacgctcaa ccagagtgcc gccgctggag ccccacccaa gcgggtgaag aagaagagcg
4801 tgtcgttctg cgaccaggtg atcctggtgg caacggccgg caatgaagag gacgacgact
4861 tcatacccaa cccgatactg gagcgcgtac tgcgcaccgc acagcacccg aacgagaagg
4921 taacggccca gatgatccag cagcagcagc agaacatgct ccgactgcag acggagccgc
4981 agccccgaca acagcaccag cagcagcagc agcagcaaca gcagcagcag cagcaacagc
5041 agcagcagca gcaacagcag cagcaacaat tgcagcttca acatcaacag cagcagcagc
5101 agccttctcc catgctggga agaccagcgg caacggatat gcagcgatat gtgcagagaa
5161 tgcaacaaca acagcagcaa caacagcagc agcaacaaca gcagcaggag atgtaccgcc
5221 agcagcagca acagcagcag catcgcacca gcatgagtgg catcgtcagc cagcaacacc
5281 tgcaactgca acagcagcag cagcagcagc agcagcaaca ccagatcgat gcacagtaca
5341 ccagcatgcc ccggccattg catcatcccc atctactgcc acagacctac tcagtgcagc
5401 tgcagaaaca ccagcagcac cagcaacagc tgcaacagca gcaccaacag cagccatcgc
5461 cgcaactgag cctgggatac ttcagcaatg gccaggcagc ggccagcgag caagtgccac
5521 agccacagcc actgccttcg ccttatcagc gagtgccact gccacatggc taccagccca
5581 caggcggacc ctactaccct ttgccaaacc aacagcagca acagctgatt attgccaatg
5641 gaaaacctgc ccagaagaag gtcagcttcg agccagggac caagggcgaa gccgactgcc
5701 tgccaccgcc accaccgccg ccacaaacga agctggccct gcagaatggt ctgcccgagt
5761 cggccagccc aggagctggc cctggctcgg caccggcggc catcacagcc atacccaccc
5821 gcgtctacaa caatgccatc gtgaaggcct cggccaaggc cgtggagtgc aatctgtgcc
5881 gcaagcggca cgtcattgcg ccagccgtct actgcaccaa ctgcgagtac tatctgcaga
5941 tgctgaacca gcgcagatga tgcactccca gtccca