PREDICTED: Drosophila obscura TATA element modulatory factor
LOCUS XM_022367792 3359 bp mRNA linear INV 14-MAY-2021
(LOC111074831), transcript variant X1, mRNA.
ACCESSION XM_022367792
VERSION XM_022367792.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022367792.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..3359
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..3359
/gene="LOC111074831"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 2 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 17 samples with support for all annotated
introns"
/db_xref="GeneID:111074831"
CDS 222..3191
/gene="LOC111074831"
/codon_start=1
/product="TATA element modulatory factor"
/protein_id="XP_022223484.2"
/db_xref="GeneID:111074831"
/translation="MSWFEKAKTVLIEALDIQDDDRDKAAEKDGVTAGSGGGGGSGGS
SDTGATSSGAALVSQTTSSDTPTFFDNPHANEMVTIPPRASDPAVTTPTPTSTSAAMG
MGNPYAKRHSATSDSVDLLSSPSPTSPEGDAASAMENSESSLELITATTASEMEMSPD
TEPSLPGSIVIIASNETSDNAEGDGEDDDDDDDGTQLERHLEDHLNDSTKTMKALVFS
DTQAAANVTGADSDSTQSFEDIQMQMSHKDKAGGKGQGQDGEGEAKAEVVDQSYSSTS
SDIEIISNPNGDSSTNSTTTRTSPQKFKELGSSSSRSGAGTGTGLTGQVLKPKGHHRE
PSEISLLSEDSQSELDKLVNRISELNQVIEAREQRLLQSERQNAELLERNQELRASVE
AAANSANSPDAAEAVQRLSALEKKFQTSIRERDALRIQIKSLKDELLNKIPKDELAEC
NEMIAALQSEGEKLSKEILQQSIIIKKLRAKEKTSDTLLKKNGEQISLLSSESERLKK
SLAAKEEMERTQIEAVGRMTHEKRRVDEENADCRSRVEDLQSKLSALQSSFDGVRGDL
VKRTRVEQDSLRAENQEYVQQLGDMREKLRHAEHSLAKREQQLREENRQLLRRLEAAE
LRAESSTQELSVTTTPLIRQIESLQKTLNQRTASWNKEEQLLIQRADDAQLQLRSVQQ
LESVQNEKQELLRTRCSLLEAKLSSALAEVESARMALSQLEHDASLKESSHSRKLATL
QEQLQAHQERIVGLEEQQQCQQRQQQQKADAEAERETLQRQPMPSLLTVEAVKASSEM
QPHKSPVHSPHSALRQHSPPLSLADESGSTEDAMMGGIIDWQTDDLDCASNSGRNQSG
IIQGVHLSFMAGNTTTLEHLQSLLKQRDGELTHLQWELSRLQAERGVLDGEISHLTIE
LETMKEKMQSYEAMEKCYEDLQHRYDALLQMYGEKVERTEELELDLVELKSAYKLQID
ELLANPPPNLQRPAKHT"
misc_feature <1230..>2564
/gene="LOC111074831"
/note="Chromosome segregation ATPase Smc [Cell cycle
control, cell division, chromosome partitioning]; Region:
Smc; COG1196"
/db_xref="CDD:440809"
misc_feature 2802..3143
/gene="LOC111074831"
/note="TATA element modulatory factor 1 TATA binding;
Region: TMF_TATA_bd; pfam12325"
/db_xref="CDD:432481"
ORIGIN
1 catcgatagt tgagggtccg tatgagctgg aaatacaatt tttttgtatt atgtgttatt
61 gaaatcaact aattgtgcga gaaaacaatt gggatttgca tgtgcttggt tagtggaccc
121 gtcagaccga tccctgcgag tccccccgat ccctagcccc accccaccac caagagcaga
181 caaaacaaaa ttttgcagtt cagaatcagg cacgcagcga gatgagttgg ttcgagaagg
241 ccaagacggt gctcatcgag gccctcgaca tccaggatga cgacagggac aaggccgccg
301 agaaggatgg cgttactgcg gggtcgggcg gtggcggtgg aagcggagga tcatcggaca
361 cgggcgccac gagctctggc gccgcactcg tctcgcagac gacatcctcg gacacgccta
421 cattcttcga caatccacac gccaacgaga tggtaaccat tccacccaga gccagcgatc
481 cggccgtaac cacgcccaca cccacatcca catccgcagc aatgggcatg ggtaatccgt
541 atgccaagcg acattcagcc acatcggact ccgtcgatct gctctcctcg ccatcgccca
601 catcgcccga gggcgacgcc gcctccgcga tggagaactc tgaatcttcg ctggagctaa
661 tcacggccac gacggccagc gagatggaga tgtcgccgga cacggagccc tcgctgcccg
721 gcagcatcgt gatcatagcc agcaacgaga cctccgacaa tgccgaaggc gatggcgaag
781 acgacgacga cgatgacgat ggcacccagc tggagcggca cctggaggac catctaaatg
841 attccacaaa gaccatgaaa gccctggtgt ttagcgacac ccaggcggcc gccaatgtga
901 cgggcgccga ttcagactcc acgcagagct tcgaggacat ccaaatgcag atgagccaca
961 aggacaaggc cggtggaaag ggacaaggac aagatggcga gggcgaggcc aaagcagagg
1021 tggtcgacca gagctactcg tccacctcct cggacattga gatcatttcc aatccgaacg
1081 gggactccag caccaacagc acaacgacgc gcacgagtcc gcaaaagttc aaggagctgg
1141 gcagcagcag cagccggtct ggggctggga caggcacagg gctgactgga caggtgctca
1201 agcccaaggg acaccatcgc gagccctcag agatatcgct gctgtcggag gactcgcaat
1261 cggagctgga taagctggtg aatcgcatta gcgagctgaa ccaggtgatt gaggcgcgcg
1321 agcagcgctt gctgcagtcg gagcgacaga acgccgagct gctggagcgc aaccaggagc
1381 ttcgcgcctc cgtggaggca gcggcgaaca gcgccaacag tccggatgcc gcggaggccg
1441 tacagcggtt gtcggcgctg gagaagaagt tccagacgag catacgggag cgggacgcgc
1501 tgcgcatcca gatcaagagc ctcaaggacg agctgctcaa caagataccc aaggacgagc
1561 tggccgagtg caacgagatg atcgcagcgc tgcagtcgga gggcgagaag ctctccaagg
1621 agattctcca gcagtcgatc atcatcaaga agctgcgcgc caaggagaag acctcggaca
1681 cgctcctcaa gaaaaacggc gagcagatct cgctgctgtc cagcgaatcg gagcggctca
1741 agaagtccct ggccgccaag gaggaaatgg agcgcacgca gatcgaggcg gtgggccgaa
1801 tgacccacga gaagcgacgc gtcgatgagg agaacgccga ctgccgcagt cgtgtcgagg
1861 atctgcagtc gaaactgtcg gcccttcagt ccagctttga cggcgtccgg ggcgatctgg
1921 ttaagcggac gcgcgtggag caggacagcc tcagggccga gaatcaggag tacgtccagc
1981 agctgggcga catgcgggag aagctgcgcc acgccgagca cagtctggcc aagcgggagc
2041 agcagctgcg ggaggagaac cgccagctgc tgcgacgctt ggaggctgcc gaactgcgag
2101 cggagagctc cacgcaggag ctgagcgtca ccaccacccc gctgatccgc cagatcgagt
2161 cgctgcagaa gaccctcaac cagcgcaccg cctcctggaa caaggaggag cagctgctga
2221 tccagagggc cgacgatgcc cagctgcagc tgcgctcggt gcagcagctc gagtcggtgc
2281 agaacgagaa gcaggagctg ctgcgcacgc ggtgcagcct tctcgaggcg aagctctcca
2341 gcgctctggc ggaggtggag agtgccagga tggccctcag ccagctggag cacgatgcca
2401 gcctcaagga gagctcacac agcagaaagt tggccacgct gcaggagcag ctgcaggcgc
2461 atcaggagag gattgtgggc ctagaggagc agcagcagtg ccagcagcgc cagcaacagc
2521 agaaggccga tgccgaggct gagcgggaaa ccttgcagcg acagcccatg cccagcctgc
2581 ttaccgtaga ggccgtcaag gccagcagtg aaatgcagcc gcataaatct ccggttcatt
2641 ccccacactc tgcgctgcgc cagcactcgc ctccgctcag tctggccgat gagagcggct
2701 ccacggagga tgcgatgatg ggcggcatca ttgactggca gacggacgac ctggactgtg
2761 cctccaactc gggccgcaac cagtcgggca tcatccaggg cgtccacctg agcttcatgg
2821 cgggcaacac cacaacgctg gagcatctgc agtcgctgct gaagcagcgt gacggcgagc
2881 tcacacacct ccaatgggag ctgtcgcgcc tgcaggccga gcgcggtgtg ctcgacggag
2941 agatatccca tttgacgatt gaactggaga cgatgaagga gaaaatgcag tcatacgagg
3001 ccatggagaa gtgctacgag gacctgcagc atcgctacga tgccctgctc cagatgtacg
3061 gcgagaaggt ggagcgcacc gaggagctgg aactggatct ggtcgaactg aagtccgcct
3121 acaagctgca gatcgacgag ctgctggcca atccaccccc gaacctgcag aggccagcga
3181 agcacacatg attgccggaa cgcagaagca tctactgtgt ctcgctgccc aatctgacat
3241 gcctgcggcg atcctagtcc cgtttcccac taccaattac cattcccgat gggccattat
3301 ccttatccta ccatactctg gctaacgtcg ggggaatgag agcagcgcag ggcagggca