LOCUS XM_022361371 4694 bp mRNA linear INV 14-MAY-2021
DEFINITION PREDICTED: Drosophila obscura ubiquitin carboxyl-terminal hydrolase
7 (LOC111070661), transcript variant X3, mRNA.
ACCESSION XM_022361371
VERSION XM_022361371.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022361371.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..4694
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..4694
/gene="LOC111070661"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 2 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 17 samples with support for all annotated
introns"
/db_xref="GeneID:111070661"
CDS 570..4016
/gene="LOC111070661"
/codon_start=1
/product="ubiquitin carboxyl-terminal hydrolase 7"
/protein_id="XP_022217063.2"
/db_xref="GeneID:111070661"
/translation="MEIETEQSIEAMDTQDTQEVEIITSDLHQQQQQQQQQQQQQQQL
QNQNQQQNSPPQLPKFKNLIQPQLHAVGGVTQLPSENGNVPPQQLLADSSSSSFANEQ
DTMSLDDENKEDQFRSETTFAYTVDNVGQLKSQRLSPPVYVRMLPWKIMVIPNDRALG
FFLQCNGENDSPTWSCNAIAELRLKCHKAESQPFTRARIKHLFYSKENDYGYSNFITW
QELQDSEKSYIHNNSITLEVHVIADAPHGVLWDSKKHTGYVGLKNQGATCYMNSLLQT
LYFTNSLRLAVYRIPTEADDSTKSVGLSLQRVFHELQFGDRPVGTKKLTKSFGWETLD
SFMQHDVQEFLRVLLDKLESKMKGTCLEGTIPGLFEGKMSSYIKCKNVDYNSTRYETF
YDIQLNIKDKKNIYESFQDYVASETLEGDNKYDAGVHGLQEASKGVIFTSFPPVLHLH
LMRFQYDPITDSSIKYNDRFEFYEQINLDNYLAEGEKTTADYVLHAVLVHSGDNHGGH
YVVFINPKADGRWFKFDDDVVSSCRKQEAIELNYGGMDDEISFHAKCSNAYMLVYIRQ
SELDRVLGDIPENEISSDLVERLDLEKRIEMARRKERSEANLYISVHVILEEYFEGQQ
KRRLFDLEKTHQRPFKLKQNQTVDELVDMFVKSFGVSRDRMRIWNMCTAQTQKFLYFD
FEAEGSRTIEQIPTSQKPWVIFLELAPQDSSEPLAPFNPKSDVLLFLKYYDAKNKRLN
YIGCTHQPQSRRLIDLCPEVNRKLGFDLDTEMTVFDEYGDKKIANLNDPIESVLYLPP
DNLQGHILIFEREMIDIKLDLPTVEEYFLDLVYRIEIIFSDKCNPNEPDFVLELSNRY
NYDQLTNAVAERLNTDPQKLQFFMCINNYKETAGNAVPYTFKGTVKDLVSYTKQSTPK
RIFYQRLSLSIHELDNKKQFKCIWVSNDLKEEKELVLYPNKNDNVKGLLEEAAKKITF
QENSRKKLRLMKVSNHKIVAHCKDDIPLDTLLKTNDSAIATTQSAQKIYRVEEVPTDE
MQLAENEYLLPVAHFSKELYNSFGVPFLTKARHGEPYGVLKQRIQKRLNVPDKEWENY
KFTVINTGITTEVNDNDVINLENFRTSTSGQLPFLGLDHINKSRKRSSLNFSEKAIKI
YN"
misc_feature 924..3983
/gene="LOC111070661"
/note="Peptidase C19 contains ubiquitinyl hydrolases. They
are intracellular peptidases that remove ubiquitin
molecules from polyubiquitinated peptides by cleavage of
isopeptide bonds. They hydrolyse bonds involving the
carboxyl group of the C-terminal Gly...; Region:
Peptidase_C19; cl02553"
/db_xref="CDD:470612"
misc_feature order(1356..1358,1371..1373,2091..2093,2145..2147)
/gene="LOC111070661"
/note="active site"
/db_xref="CDD:239124"
ORIGIN
1 tatatgtttt cggcaacaat atttttctaa gaaaaggcga cgtgcaaagc acacactgaa
61 aagtcttacc cacacatgta aaacagtgta cgccgtgaat taagaagtgt gtaggttggt
121 tgaaaagaaa gcacacacgc acacgcgcat acacacaaca cacatacaaa actatatgcg
181 aaaatctgca agtttatctt gcggctgtcc acacccccct cccggcccca taacccttca
241 atccgacaaa cacacagcga gcgtgcggac cacgagcgcc ccgagccaaa agaaaatata
301 caatagaaat acaataaata aaattgattg aaatttcgat aaccagcaaa ggagaacacc
361 ttttcgatcg aactactgca acacagtgca agggctctct ccaaggaccc cccagcgaca
421 cacaaattcg agcccgaagg ggcatatgta cattcagcag cgcaaactcg aaaaaaaggg
481 cagcgcaaca gaagaggagg aaacagcgat accgtcgaaa ccgagaggcg acaccgcctg
541 tttctagcac cattcaagtc ccgaataaga tggaaattga aacagaacaa tccatcgaag
601 caatggacac ccaggacacc caagaggtcg agataatcac tagcgatctt caccagcagc
661 agcaacaaca gcagcagcaa caacaacagc agcagcagct ccagaatcaa aatcaacaac
721 agaattcgcc gccacagctg ccgaaattca aaaatctgat acagccacaa ttgcacgcgg
781 ttggtggagt cacccagctg cccagcgaga acggcaacgt gccgccacaa cagctgctgg
841 ccgacagcag ctcctcatcg tttgccaacg aacaggacac aatgtcgttg gacgacgaga
901 acaaggagga tcagttccgt tcggagacca cattcgccta tacggtcgat aatgtgggac
961 aattgaaatc acagcgtctg tcgccgccag tgtatgtacg gatgctgcca tggaagataa
1021 tggtcatacc gaacgaccgg gcattgggct tctttctgca gtgcaacggt gagaatgatt
1081 cgcccacctg gtcgtgcaat gccattgccg aattgcgtct caagtgccac aaggcagagt
1141 ctcagccgtt cacgcgtgcc cgcatcaagc atttgttcta ctcgaaggag aacgactatg
1201 gctactcgaa cttcattacc tggcaggagc tgcaggattc ggagaagagc tatatacaca
1261 acaacagcat taccttagag gtgcatgtca tagccgatgc accgcatggc gtactctggg
1321 actcgaagaa gcacaccggc tatgtcggcc taaagaatca gggggccacc tgctacatga
1381 actccctgct gcagacgctc tattttacca attccctgcg cctggccgtc tatcgcatac
1441 ccacagaggc cgacgatagc accaagtcgg tgggcctctc actgcagcgt gttttccacg
1501 agctccagtt cggcgatcgt ccggtgggca ccaagaagct aaccaaatcc tttggctggg
1561 agacactcga ctcgttcatg cagcacgatg tccaggagtt cctgcgcgtt ctgctcgaca
1621 agctcgagtc aaagatgaag ggcacctgcc tcgagggcac cataccgggc ctctttgagg
1681 gcaaaatgtc atcgtatatc aaatgcaaga atgtggacta taacagcaca cgctatgaga
1741 ccttctacga tatccagcta aatatcaaag acaaaaagaa catttacgaa tcatttcagg
1801 actatgtcgc ctccgagacc ctcgagggtg acaacaaata cgatgctggc gtccatggct
1861 tgcaggaggc cagcaagggt gtcatcttca cgtcattccc gcccgttctg cacttgcatt
1921 tgatgcgttt ccaatacgat cccattacgg acagctcgat caagtacaac gatcgcttcg
1981 agttctacga acaaatcaat ctcgataact atctggccga gggtgagaag acaacagcgg
2041 actatgtcct gcatgccgtg ctggtgcatt cgggcgataa tcatggcggg cactatgtgg
2101 tctttatcaa tccaaaggcc gatgggcgct ggttcaagtt tgatgacgat gtggtctcca
2161 gttgccgcaa acaggaggcc attgagctga attatggcgg catggatgat gagatctcgt
2221 tccatgccaa atgcagcaat gcctatatgc tggtgtacat caggcaatcg gaactggatc
2281 gtgtgctggg cgatatacca gagaatgaga tatccagcga tctggtggag cgtctcgatc
2341 tggagaaacg catcgagatg gcgcgtcgca aggagcgcag cgaggccaat ttgtatatct
2401 cggtgcatgt catactcgag gagtactttg agggccaaca gaagcgtcgt ctcttcgact
2461 tggagaagac ccatcagcgt ccgtttaagc tcaagcagaa tcagaccgtt gacgagctgg
2521 tcgatatgtt tgtgaagagc tttggcgtct cgcgcgatcg tatgcgcatt tggaacatgt
2581 gcaccgccca gacgcaaaag tttctctact ttgactttga ggcggaggga tcgcgcacca
2641 tcgaacagat acccacatca cagaagccgt gggtgatctt tctcgagctg gcaccacagg
2701 acagcagcga accgttggca cccttcaatc ccaaatcaga tgtgctgctc ttcctcaagt
2761 attacgatgc caagaacaag cgtctcaatt atattggatg cacacatcag ccgcagagtc
2821 gacgcctcat cgatctgtgt ccggaggtga accgcaagct gggcttcgat ctggacaccg
2881 agatgaccgt ctttgatgag tatggcgaca aaaagatcgc caatctgaat gatcccattg
2941 agagtgtgct ctatctgccg cccgataatc tgcaggggca catactcatc tttgagcgtg
3001 agatgattga cattaaactg gatctgccaa cggtcgagga atactttctg gatctggtct
3061 atcgcattga gattatattc agcgacaagt gtaatcccaa tgagccagac tttgtgctgg
3121 aattgtccaa tcgctataat tacgatcaac taaccaatgc ggtggccgag cgcctcaaca
3181 cggatccaca gaagttgcag ttctttatgt gcatcaacaa ctacaaggag acggcgggca
3241 atgcagtgcc ctatacattc aagggaacgg tcaaggatct ggtctcgtac accaagcaaa
3301 gcacaccgaa gcgcatattt tatcagcgtc tgtcgttgag catccatgag ctggacaaca
3361 agaagcaatt caagtgcatt tgggtgtcca acgatctgaa ggaggagaag gagcttgtac
3421 tctatccgaa caagaatgac aacgttaagg ggctgctgga ggaggcggcc aagaagataa
3481 ccttccagga gaatagccgc aagaagctgc gcctgatgaa ggtcagcaat cataagattg
3541 tcgcccactg caaggacgat ataccgctgg atacgctgct caagacgaat gactcggcga
3601 tcgccaccac ccagagtgca cagaagatct atcgcgttga ggaggtgccc acagatgaga
3661 tgcagctggc cgagaatgaa tatctgttgc cggtggccca tttcagcaag gagctgtaca
3721 actcgtttgg cgtacccttc ctcaccaagg cgcgtcatgg ggagccctat ggggtcctca
3781 agcagcgcat acagaagcgt cttaatgtcc cggacaagga gtgggagaac tacaagttca
3841 ccgtgatcaa tacgggcatc accactgagg tgaacgacaa cgatgtcatc aatttggaga
3901 atttccgcac ctcgactagc ggacagttgc ccttccttgg tctcgatcac atcaacaagt
3961 cacgcaagcg tagttcgctg aatttctccg agaaggcgat caaaatttac aattaaattg
4021 aacacacaca cacacataca cacacacaag aaacaagaaa gcagatggag cagatggatg
4081 gtggttctgc gagtggaggt ggaggcaagc gaggctagga ggaggctgct gtaactggga
4141 ctgggtcgaa aattgggcga gacgggaagc aaagacgtga cgacatccat ccatcgatcg
4201 acacgcgccc gccgcccgaa aacacaaatt ttaagcaaat tgtccttaaa attgtatgcc
4261 tcctactcca actcccactc cctcaaaacg ggcccatcac cccccgccca cttcttctga
4321 acccaaaagt agaaattgac gtggattttt ttttaataat attatttcta agtcataaaa
4381 tttatttatt aatcgcctcc ggtcaatttt taatatggtt gtccgcccga tcctacaccc
4441 agagtttgcc ttccctcccc ccgctgcggc tccggcctaa actccaggtc cggcccgagc
4501 ctcacggcac taagtccatg agaaagctaa aaaaaaaaaa tttataaatt tactgactta
4561 ttgcgcttgc attagtgact tcaaattcta ggctaaggct tttaaagtgt gtttatttaa
4621 ttaatttttt ttttcctcta attttctaaa tacgatattc ctaaacgcgg taattgtttt
4681 gaagtcggaa cgaa