PREDICTED: Drosophila obscura eukaryotic translation initiation
LOCUS XM_022361394 1325 bp mRNA linear INV 14-MAY-2021
factor 4H (LOC111070678), mRNA.
ACCESSION XM_022361394
VERSION XM_022361394.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq; corrected model.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022361394.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
##RefSeq-Attributes-START##
frameshifts :: corrected 1 indel
##RefSeq-Attributes-END##
PRIMARY REFSEQ_SPAN PRIMARY_IDENTIFIER PRIMARY_SPAN COMP
1-106 JAECWW010000165.1 454148-454253
107-412 JAECWW010000165.1 455221-455526
413-662 JAECWW010000165.1 455635-455884
663-711 JAECWW010000165.1 455886-455934
712-1029 JAECWW010000165.1 456510-456827
1030-1325 JAECWW010000165.1 456902-457197
FEATURES Location/Qualifiers
source 1..1325
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..1325
/gene="LOC111070678"
/note="The sequence of the model RefSeq transcript was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon;
Derived by automated computational analysis using gene
prediction method: Gnomon. Supporting evidence includes
similarity to: 8 Proteins, and 91% coverage of the
annotated genomic feature by RNAseq alignments, including
17 samples with support for all annotated introns"
/db_xref="GeneID:111070678"
CDS 75..1196
/gene="LOC111070678"
/note="The sequence of the model RefSeq protein was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon"
/codon_start=1
/product="LOW QUALITY PROTEIN: eukaryotic translation
initiation factor 4H"
/protein_id="XP_022217086.2"
/db_xref="GeneID:111070678"
/translation="MAGRGGYEHARGGFGGERQMKVLPTEPPFIAFVGNLPQGLVQGD
VMKIFQDFEVKNVRLVKDRETDMFKGFCYVEFETLENLERALEFDGRIKLDDLSAPLR
IDIADRRKNDRPGGGGVGGGNGGMTRDGGRDGFQKRGPPRQGGTSQSYSRGGPGTGGN
GGGGREGGGGSGNRADSRGSYNDNYGGHSDRSRGGGAGAGSGSGSGMNRGYNDRPANR
GRYGNFNNDDRFERNQDRDRGQREGSYGNQSRDGDRYNNFSRHRDRERTHYNPNQQSE
RPSGGLGGGGGGLGGSGGLGSSGNLGGGGGGGGGGGASIGAIDDTERPRLQLKPRTIA
APINAVAETKQSASIFGNAKPREEKLKELEQNVSHNGDI"
misc_feature 144..404
/gene="LOC111070678"
/note="RNA recognition motif (RRM) found in eukaryotic
translation initiation factor 4H (eIF-4H) and similar
proteins; Region: RRM_eIF4H; cd12401"
/db_xref="CDD:409835"
ORIGIN
1 ggccgggcga caacttgggt tgtcaggtca atctgcggat tgcagcgaat tcgctcattt
61 gaaaaaaata aaacatggct ggaagaggtg gttatgaaca cgctaggggg ggatttggcg
121 gcgaacgcca gatgaaggta ctgccaacgg aaccgccctt tatcgccttt gtgggcaacc
181 tgccccaggg cctcgtccag ggcgatgtga tgaaaatatt tcaagacttt gaggtgaaga
241 acgtgaggct ggtaaaggat cgggaaacgg acatgttcaa aggcttttgc tacgtggaat
301 tcgaaacgct ggagaatctg gagcgggcac tcgagttcga tggacgcatc aaactggacg
361 atctgtcggc tccgctgcgc atcgatatag ctgaccggag gaaaaatgat cggcccggtg
421 gcggcggtgt tggcggtggc aatggaggca tgacccgtga tggtggaaga gacggtttcc
481 agaagcgtgg accgccccgt cagggcggta ccagccaatc ctatagccgg ggcggtcccg
541 gcactggcgg caacggcggc ggtggtcgcg agggtggcgg cggctctggt aatcgcgccg
601 atagtagagg ttcgtacaat gataattatg gtggtcacag tgatagaagt cgaggcggcg
661 gcgcgggcgc aggctccggc tccggcagtg gcatgaatag aggatacaat gatcggccgg
721 cgaatcgtgg tcggtacgga aactttaaca atgatgatcg gtttgagcgg aaccaggacc
781 gtgaccgggg gcagcgcgag ggcagttatg gcaaccagtc gcgcgacggc gaccgctaca
841 acaacttcag caggcaccgt gaccgcgagc gtacccacta caatcccaac cagcagagtg
901 aacgtcctag cggcggtctt ggtggaggcg gcggcggcct tgggggcagc ggcggtcttg
961 gcagcagcgg caacctcggt ggtggtggtg gcggcggagg cggtggtggt gcatcgattg
1021 gcgccattga cgacacggag aggccacgcc tgcagctgaa gccacggacc attgctgcgc
1081 caatcaatgc ggtggccgaa accaagcagt cggcatccat tttcggcaat gccaagccgc
1141 gcgaggagaa attgaaggag ctggagcaga atgtctctca caatggcgac atctaaagtg
1201 cagggaatcg atccgcaata aagcctgata tgatattttg ctggtgagag tgagacaggg
1261 agagagagag agagagaaaa agagggagag cgacagatga ggaagaagga gaagcgtcat
1321 gcgtc