LOCUS XM_022361383 2243 bp mRNA linear INV 14-MAY-2021
DEFINITION PREDICTED: Drosophila obscura protein DDI1 homolog 2
(LOC111070671), transcript variant X1, mRNA.
ACCESSION XM_022361383
VERSION XM_022361383.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022361383.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..2243
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..2243
/gene="LOC111070671"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 2 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 17 samples with support for all annotated
introns"
/db_xref="GeneID:111070671"
CDS 344..1801
/gene="LOC111070671"
/codon_start=1
/product="protein DDI1 homolog 2"
/protein_id="XP_022217075.2"
/db_xref="GeneID:111070671"
/translation="MKITVTASDDRLFCLDVSHDLELENLKALCAMEIGADVDQIIVR
LNGRELTNNKHSLQQCGVNDGDFIMLERRRPNNRAGGANNPVISGLDFSSIAVPGSSA
TTSGGSPPSMPNPSLGGVAGWGGLGAGNSLQQQQQQQQQMQNLTDIPMTDDFNVNFED
DPATVRQLLLSSPDTLALLREYNTRLAEALDSGDPDTFARALREHVTERKRRNDQRVR
MLTADPFDEETQRLIAEEIKQKNIQDNMAAAIEYNPEIFGMVTMLYINCKVNGVPVKA
FVDSGAQTTIMSKDCAERCHVNRLIDTRWNGVAKGVGTQPILGRIHMVQLQIENDHLT
SSFTVLGQQPMDMLLGLDMLKRHQCLIDLQRNLLIIGTTGTTTPFLPESELPVGARLT
GNPDEQTEQQAIAEAMEQSRKEGASTSASASAGQGANTIQPMDRFTEQDVSDLMALGF
PRRDVLIVLRQCGGNKQVASSLLLSRNEAAGGGLS"
misc_feature 344..559
/gene="LOC111070671"
/note="ubiquitin-like (Ubl) domain found in the eukaryotic
Ddi1 family; Region: Ubl_Ddi1_like; cd01796"
/db_xref="CDD:340494"
misc_feature 1082..1441
/gene="LOC111070671"
/note="retropepsin-like domain of DNA damage inducible
protein; Region: RP_DDI; cd05479"
/db_xref="CDD:133146"
misc_feature 1178..1186
/gene="LOC111070671"
/note="catalytic motif [active]"
/db_xref="CDD:133146"
misc_feature 1178..1180
/gene="LOC111070671"
/note="Catalytic residue [active]"
/db_xref="CDD:133146"
misc_feature 1658..1765
/gene="LOC111070671"
/note="UBA domain-like superfamily; Region: UBA_like_SF;
cl21463"
/db_xref="CDD:473871"
ORIGIN
1 agggttttca aaaaaaaaaa aatgctcccg agtgtgtttt gtgaataata tccattcatc
61 gatatcaatc tgaatgcaac ccattacatc gcatcgcatc gcagcggcgt ttatgttagg
121 taccaaaaca acaaagcaat agcacacaaa gagaaaacgc aatctaccga gagcatttat
181 aaattattgc ggtccattca agcaggcagt cgtcccccaa cagagaatcc ttccaaaaca
241 gtgtttgtgc gcaatcaatc gacccagacc agacccacgg agtttagtga aacaccccac
301 accatacatc ccatcaaatc aagccccagc ggtctctcgc actatgaaaa tcacagtgac
361 tgcctccgat gatcggcttt tctgcttgga cgtgtcccac gatctcgagc tggagaacct
421 caaagccctc tgtgcgatgg agatcggggc cgatgtcgat caaattatcg tccgcttgaa
481 tggccgggag ctgacgaaca acaagcattc gctgcagcag tgcggcgtca acgatgggga
541 ctttatcatg ctggagcgaa gaaggcccaa caatcgcgca gggggtgcca acaatccggt
601 aataagtgga ctggacttta gcagtatcgc tgtgccggga tccagtgcaa cgaccagtgg
661 cggtagccca ccctcaatgc ccaaccccag cctgggagga gtggcaggat ggggtggact
721 gggagcaggc aacagtttgc agcagcagca gcaacaacag cagcagatgc agaacctaac
781 agacatccca atgacggacg atttcaatgt gaacttcgaa gacgatccgg cgacggtgag
841 gcagttgtta ttgtccagcc cggatacact ggccctgctg agggagtata atacgcggct
901 agcggaggcc ctcgattcgg gggatccgga cacgtttgcc cgggctctca gggagcatgt
961 gacggagcgg aagcggcgca acgatcagcg ggtgcgaatg ctgacagcgg atccgttcga
1021 tgaggagacg cagcgtctga ttgctgagga gataaagcag aagaacattc aggacaatat
1081 ggcggcggcc attgaataca atccggagat cttcggcatg gtcacgatgc tctacatcaa
1141 ctgcaaggta aacggtgtgc cggtgaaggc cttcgttgac tctggagccc agacgacaat
1201 catgagcaag gactgtgccg aaaggtgtca cgtgaaccgg ctgattgaca cgcgctggaa
1261 tggtgtcgcc aagggtgtgg gcactcagcc cattctgggc agaatccaca tggttcagct
1321 gcagatcgag aacgatcatt tgacgtccag cttcaccgtt cttggccaac agcccatgga
1381 catgttgctg ggtctggata tgctcaagcg tcatcagtgt ctcatcgatc tgcagaggaa
1441 tcttctgatt attggcacca ccggaacgac aacaccattt ctacccgaaa gcgagctacc
1501 ggttggggcc cggctcacgg gcaatccaga cgagcagacc gaacagcagg cgattgccga
1561 ggccatggag cagagtcgga aggagggtgc ctccacatcc gcatccgcct cggcagggca
1621 gggcgcgaat accatccagc caatggatcg atttaccgag caggatgtca gcgatctcat
1681 ggccctcggc tttccgcgcc gcgatgttct cattgtgctg aggcagtgtg gtggcaataa
1741 gcaggtggcg agctcacttc tgctcagccg caacgaggca gccggtggcg gacttagctg
1801 aactcgggct ggagccccct gtccccccat acccaagctc tgcccatccc accacaaacg
1861 ccacagacac ctcctcctac tccgcctcct cctacgcacc acaattcact cctatgggat
1921 caatcgtaac aaaaacaaca tgaaaaccaa cacaaggata tgatgacgag caatcacagc
1981 cagctagcca gccagacaga cagacacgac agacagccag ccactatatt ccgtttcaac
2041 gaaatactct gagcaatacc aaaacgaaaa tggaaaatgg ataaaattaa atttgtaatt
2101 tagggaaatt tttgtaaact aattatgcct cttttttata tgaatcgtta tacacaatag
2161 ctaaatagta ggatgagtat gcatttgggc gacgcgcgac accctcctcc aaaacacaca
2221 cgagaggacg cgaatagtag ggt
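
The misc_feature locations above are given in mRNA nucleotide coordinates, while the domain annotations describe regions of the protein. The sketch below shows one way to convert those nucleotide ranges into 1-based residue positions. It assumes a single contiguous CDS at 344..1801 with /codon_start=1, as stated in the record. The function name `nt_range_to_residues` and the `FEATURES` dictionary are hypothetical names introduced for illustration and are not part of the record or of any NCBI tool.

```python
# Coordinates copied from the CDS feature of this record (1-based, inclusive).
CDS_START, CDS_END = 344, 1801


def nt_range_to_residues(start, end, cds_start=CDS_START, cds_end=CDS_END):
    """Map an mRNA nucleotide range to 1-based protein residue positions.

    Assumes the CDS is contiguous on the mRNA and begins in frame
    (/codon_start=1), so every three nucleotides encode one residue.
    """
    if not (cds_start <= start <= end <= cds_end):
        raise ValueError("range must lie inside the CDS")
    first = (start - cds_start) // 3 + 1
    last = (end - cds_start) // 3 + 1
    return first, last


# misc_feature ranges copied from the record; the labels are the Region
# names or notes given in each feature.
FEATURES = {
    "Ubl_Ddi1_like": (344, 559),
    "RP_DDI": (1082, 1441),
    "catalytic motif": (1178, 1186),
    "Catalytic residue": (1178, 1180),
    "UBA_like_SF": (1658, 1765),
}

# The CDS length includes the stop codon, so the protein is one residue
# shorter than the codon count.
cds_length = CDS_END - CDS_START + 1
codon_count = cds_length // 3
protein_length = codon_count - 1

if __name__ == "__main__":
    print(f"CDS: {cds_length} nt, {codon_count} codons, {protein_length} residues")
    for label, (start, end) in FEATURES.items():
        first, last = nt_range_to_residues(start, end)
        print(f"{label}: nt {start}..{end} -> residues {first}..{last}")
```

Under these assumptions the CDS spans 1458 nt (486 codons including the stop codon), which is consistent with the 485-residue /translation. The catalytic residue at 1178..1180 maps to residue 279, the aspartate in the "VDSGAQ" stretch of the translation.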