PREDICTED: Drosophila obscura T-cell acute lymphocytic leukemia
LOCUS XM_022364372 1231 bp mRNA linear INV 14-MAY-2021
protein 1 (LOC111072469), mRNA.
ACCESSION XM_022364372
VERSION XM_022364372.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022364372.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..1231
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..1231
/gene="LOC111072469"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 3 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 12 samples with support for all annotated
introns"
/db_xref="GeneID:111072469"
CDS 1..1071
/gene="LOC111072469"
/codon_start=1
/product="T-cell acute lymphocytic leukemia protein 1"
/protein_id="XP_022220064.2"
/db_xref="GeneID:111072469"
/translation="MAWLPSSSNGSDNADERSTASSASASSAAGSSGANGSPRPAAGA
APATASAGARQPPALLRHATPYLQPKTESISDGDGDLSDFSLNDTEEDEEDLRDYIVL
NGNQADGCRSLSNSPRSGSGCGNPFASSRTSPCRNGQLSGSATNGNGNGASAGTGGGG
GVRKVFTNTRERWRQQNVSGAFAELRKLVPTHPPDKKLSKNEILRSAIKYIKLLTGIL
EWQQRQDGHGHEQNNNDNHMANGHAPAPDTAGEVLSAFRHIKCERLDGHQRNGTGNDL
LMIAPEALIKTELQQQASTYPEPPPAHPTASMAVAVAEARVLAALKPTTNTSSGGRSS
KRRIKLGLGLPTDLSVVKRRRT"
misc_feature 487..666
/gene="LOC111072469"
/note="basic helix-loop-helix (bHLH) domain found in
Drosophila melanogaster helix loop helix protein 3B
(dHLH3B) and similar proteins; Region:
bHLH_TS_dHLH3B_like; cd19708"
/db_xref="CDD:381551"
misc_feature order(490..492,502..504,508..516,520..522,535..537,
589..591,595..600)
/gene="LOC111072469"
/note="putative DNA binding site [nucleotide binding];
other site"
/db_xref="CDD:381551"
misc_feature order(529..534,541..546,553..555,562..564,601..603,
610..615,622..624,628..636,640..645,652..654,661..666)
/gene="LOC111072469"
/note="putative dimer interface [polypeptide binding];
other site"
/db_xref="CDD:381551"
ORIGIN
1 atggcctggc taccgagcag ctcgaacgga tcggacaacg cggatgaacg gagtacggcg
61 tccagcgcca gcgccagctc ggccgccggc agcagcggag ccaacggctc gcccaggcca
121 gcagcaggag cagcaccagc taccgcttcg gctggggcac gccagccccc ggccctgctg
181 cggcatgcca cgccctacct gcagcccaag acggaatcca tttcggacgg agatggtgac
241 ctgtcggact tctcgctgaa cgacaccgag gaggatgagg aggatctgcg cgactacatc
301 gtgctcaacg gcaatcaggc ggacggctgt cgctccctgt ccaactcccc gcgcagcggc
361 agtggctgtg gcaatccctt cgctagctcc cgcacctcgc cctgtcgcaa cggacagctg
421 tcgggcagcg caacaaatgg caacggtaat ggagcctccg cagggactgg cggtggcggt
481 ggcgtccgta aggtgttcac caacacccgc gagcgctggc gccagcaaaa cgtgtccggg
541 gcctttgcgg agctgcgcaa gctggtgccg actcatccgc cggacaagaa actgtccaag
601 aacgagatcc tgcgttcggc catcaagtac atcaagctgc tgaccggcat cctcgagtgg
661 cagcagcggc aggacgggca cggccacgag cagaacaata acgacaacca catggccaat
721 ggccatgcac cagcgccgga caccgctggt gaagttctgt ccgctttccg gcacatcaag
781 tgcgagcgcc tggacggaca tcagcgcaac ggaactggca acgatctact catgattgcc
841 cccgaagcac tcatcaaaac tgagctgcag cagcaggcca gcacctaccc cgaaccccca
901 ccggcgcatc caaccgcatc catggctgtg gcggtggccg aggcccgggt tcttgccgca
961 ctgaagccga ccacaaatac ctcgtccggc ggcaggagca gcaaacggcg gattaagctg
1021 ggcctgggcc tgcccaccga cctgagtgtg gtcaagcgac gcaggacctg agagccaagg
1081 caggggacag ggcacagcct aagaggctga gacgcccaag ccaatcgttt cggagaagat
1141 tggaagcaac catcagatac ataatctaat cgtatacaca aatacataac aatccttttt
1201 ttttttgtta aataaattcg agtgcgacta t