PREDICTED: Drosophila obscura homeotic protein female sterile
LOCUS XM_041591942 6182 bp mRNA linear INV 14-MAY-2021
(LOC111072429), transcript variant X2, mRNA.
ACCESSION XM_041591942
VERSION XM_041591942.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..6182
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..6182
/gene="LOC111072429"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 9 Proteins, and 99% coverage of
the annotated genomic feature by RNAseq alignments,
including 8 samples with support for all annotated
introns"
/db_xref="GeneID:111072429"
CDS 170..6028
/gene="LOC111072429"
/codon_start=1
/product="homeotic protein female sterile isoform X2"
/protein_id="XP_041447876.1"
/db_xref="GeneID:111072429"
/translation="MSSNEPPPRYEPPVEPVNGIVQPAVVPPVDRPGRNTNQLQFLIK
TVMKMIWKHHFAWPFQQPVDAKKLNLPDYHKIIKQPMDMGTIKKRLENNYYWSAKETI
HDFNTMFNNCYVYNKPGEDVVVMAQTLEKVFLQKIETMPKEELELEPATAKGGKKKAR
APATPKSGSGGGGGTGSSTGAAVANSNTGSSNSGSINKVAPSAQQLPIMGSGLAAQST
GAGGHGSATGSGTTSRPVSAMGGTVSSTAGGGGAPSIPPISTMPPHTVPGSTNTTTTA
TTMVDAIIASLRNPSQAAYPGATQAAANSSSLLDGSSSAAASAAATTAAATAAAAAAA
AAAAAAAAASSSGGGAGAPGTVSTAGGVTIPSVNTTTNVHSSYVNSTAGVGVGVDAVI
PPQQPAKMKKGVKRKADTTTPTANAFESPYAQMDSKSAKIATRRESNRQDLTFQGSGY
AMSPSMGVVPGLAAGVAASVAAAKSKEKLSDALKSCNEILKELFSKKHSGYAWPFYKP
VDAEMLGLHDYHDIIKKPMDLGTVKRKMDNREYKSAPEFAADVRLIFTNCYKYNPPDH
DVVAMGRKLQDVFEMRYANIPDEPVSNAAHHHAHTHGHGHGHGHGQSHGHGHGHGHGG
YSSASLKHDGSDSSSEDSSGTENESNSDEERSAKLKMLESKLLGLQEEIRKLSEEASA
KKKAKKKLKEKKKSIGGGSGSGSASHHAHASAGGGAGGPSASHGGVSVPASVVALSAG
GAAGPNLSALLSGSLVGASGAGGGVPNVSALHSLAHDMTLAMNQLTGNAAATGAGFTA
GIPPPGPAPGGKGANMASAMAAAGAGPGASGGTGGSSKSKSKGQRGKGSSSNAGAGGA
SSGVTVNAGSGIGGAGAASSAGSGGGSGTTGPSGNSSKRGKGSSSAGGGGGGSSGSAA
NTTSATGAGARGSSKKKPSQVMNFDSEEEDTAKPMSYDEKRQLSLDINKLPGDKLGRV
VHIIQNREPSLRDSNPDEIEIDFETLKPSTLRELESYVASCLRKKTHKKPSGKSKDEQ
MAEKKQELEKRLQDVTGQLGASKKNAKKDDSTSNKVEAVQPANPVSSSSSSSDSSSSS
SSDSSSSDSSDSEAGDGDERPPRKKKPRDSNGSNVNNPSLSVVLGGTLPSGGLTPTLL
HMGLDHVLNANSPTSQMANLLGNPNPLTAAAMLNNNNKTTLQVGGSFGGASAGGNNSM
LHGQVGATAHHKNGQNDHLMTSKMPPGGGAAGAASLPPHSYAGSVPTQAASLGGGGIR
IASNLHKQPNLPGGDLVENNAALTAALTSTVNSNSGSGAGVGVAAGLSNTSSGTASGH
THNTNTMGGQGDGLPGASLASGLKQIPQFDDPVEQSLASLEFSAAAASTGKSGLTDNF
LMAQQGHLMQQPGLQQQPPFGHHQQQQQQQQQQQHHPMDYMTELLTKGTENVPGMNGN
LLNFNLDMAAAAAYQQKQHHGQHTQQQQQQQQQQQQQQQQQQASHNNGFNVADFGMPG
FDNLTATAFLDLEHTLQQQHMQLQQQQQQQQQHHHQQQSLQQLHAHQQQLANKMLLIK
PIESMMPSPPDKLKVMLKVSQSPSDLKLHSSQAAAAQSMLAQSFKAAEQNLKNASSWS
SLASASSPQNTSSSSSSSKPKPAMDSFQQFRNKAKERDRLKLLEAAEKEKKHQKEAAE
KEQQRKHHKSSSSSSAAQAASSTNISTGNAPQSGAGQISSDRERSSSAVDATRMGSGG
GGNGGLLLPGNGNNSSNSANSNGPGSAGSGSGGPVSGGLASGGGPNSGGCIIGPLANT
GSNSNSGVGSGATSSNSNSSVGGVVGSGGPGSNSQGSSGGGGTMGSGLLEYGPQVVVL
TQAAANAQAQHVSTTMTAQTVLASSPLEAMESGRKSVHDTQSQISRVEDIKASPGQGQ
SSPAQQSPQDRAAAKRAEQRRAEQERRRREALAGQIDMNMQSDLMAAFEETL"
misc_feature 269..589
/gene="LOC111072429"
/note="Bromodomain, Brdt_like subfamily, repeat I. Human
Brdt is a testis-specific member of the BET subfamily of
bromodomain proteins; the first bromodomain in Brdt has
been shown to be essential for male germ cell
differentiation. Bromodomains are 110 amino...; Region:
Bromo_Brdt_I_like; cd05497"
/db_xref="CDD:99929"
misc_feature order(356..358,377..379,386..388,503..505,515..517,
533..535)
/gene="LOC111072429"
/note="acetyllysine binding site [active]"
/db_xref="CDD:99929"
misc_feature 1616..1918
/gene="LOC111072429"
/note="Bromodomain, Brdt_like subfamily, repeat II. Human
Brdt is a testis-specific member of the BET subfamily of
bromodomain proteins; the first bromodomain in Brdt has
been shown to be essential for male germ cell
differentiation. Bromodomains are 110 amino...; Region:
Bromo_Brdt_II_like; cd05498"
/db_xref="CDD:99930"
misc_feature order(1694..1696,1715..1717,1724..1726,1841..1843,
1853..1855,1871..1873)
/gene="LOC111072429"
/note="acetyllysine binding site [active]"
/db_xref="CDD:99930"
misc_feature 3011..3205
/gene="LOC111072429"
/note="Bromodomain extra-terminal - transcription
regulation; Region: BET; pfam17035"
/db_xref="CDD:435704"
misc_feature <5963..6025
/gene="LOC111072429"
/note="C-terminal domain of bromodomain protein 4; Region:
BRD4_CDT; pfam17105"
/db_xref="CDD:465349"
ORIGIN
1 agcagcagca gcagtacaac cagcatccac ttgaggatca gataccttaa aagcaaccgc
61 aacaggagca gggacaaaag caacaaccag aagcctcatc atcagccaac aggaggagca
121 gcagtagccg cagcaccact gtgtcaccat catcaccatc atcaacaaca tgtcgtccaa
181 cgaaccacca cctcgctacg agccgcctgt ggagccggtc aatggcattg tgcagccagc
241 ggttgtgcct ccagttgatc gacccggccg caatacaaat caattgcaat tcttgatcaa
301 aaccgtaatg aagatgatat ggaaacacca ttttgcgtgg cccttccagc agcccgtcga
361 cgcgaaaaag ctcaacctgc ccgactacca caagatcatc aagcagccga tggacatggg
421 aacgatcaag aagcgactgg agaacaacta ttactggtcg gcgaaggaga ccatccatga
481 cttcaatacc atgttcaaca actgctatgt ctacaacaag cccggcgagg atgtggtcgt
541 gatggcccag acgctggaga aggtattctt gcagaagatc gagacgatgc ccaaggagga
601 gcttgaactg gagccggcca cggccaaggg tggcaagaag aaggcacgtg ccccggccac
661 gcccaagtct ggatcaggag gcggaggtgg aacgggatca agcacgggtg cagctgtcgc
721 caatagcaac actggcagca gtaacagcgg atctatcaat aaggtagcac catcggccca
781 acagttgccc atcatgggca gcggcctggc ggcacagtca acgggtgcag gaggccatgg
841 atcagcgacg gggtcgggga caacatcccg acccgtatca gcgatgggcg gaacggtttc
901 atcgacggcc ggcggcggcg gtgcaccgtc cataccaccg attagcacaa tgcctccgca
961 cacagtcccc ggcagcacca acaccacgac gacggccacg acaatggtcg acgccattat
1021 agccagcctt cggaatccca gccaggccgc atatccgggc gctacccagg cggcggctaa
1081 cagctcctcc ctactggacg gcagcagcag tgcggcagca tcggcggcgg caacaacagc
1141 agcggcgacg gcagcagcag cggcagcagc agcagcagcg gcggcggcag cagcagcatc
1201 atcatctggc ggtggagcgg gagctcccgg tacagtcagt acggcgggtg gtgtgacaat
1261 accatccgtc aataccacca ccaatgtgca ctcctcctac gtgaacagca cggcgggcgt
1321 cggagttggt gtggatgcgg ttataccgcc ccaacagccg gcgaaaatga agaagggtgt
1381 caaacggaag gcggacacaa cgacgccgac ggccaatgca tttgagtcgc cctacgccca
1441 aatggactcc aagtcggcca agatagcgac gcgaagggaa tcgaatcgtc aggatcttac
1501 attccagggc tcgggatatg cgatgtcgcc atcgatgggt gtcgtgcctg gattggcggc
1561 gggtgtggcc gctagtgtgg cggcggccaa gtcaaaggag aagctgtcgg atgcactaaa
1621 gtcatgcaac gagatcctca aggagctgtt cagcaagaag cactcgggct atgcctggcc
1681 cttctacaag ccggtggatg cggaaatgct gggcctccat gactaccacg acatcatcaa
1741 aaagcccatg gacctgggca ccgtcaagcg gaaaatggac aatcgggagt acaagagcgc
1801 accggagttt gcggccgatg tgcgattaat attcaccaac tgctacaagt acaatccgcc
1861 agatcatgat gttgtggcca tgggccggaa gctgcaagac gtcttcgaaa tgcgctacgc
1921 caacataccc gacgaaccgg tctctaatgc ggcccatcat catgcacata cgcacggaca
1981 tgggcacggc catggacacg gccagagtca cggtcacggg catgggcatg ggcacggtgg
2041 ctattcgagt gcttccctca agcacgatgg cagcgactcg tccagcgagg actccagcgg
2101 cacagagaac gagtcgaact cggacgagga gcgcagcgcc aagctgaaaa tgctcgagtc
2161 caagctgctc ggtctccagg aggagatccg aaagctctcc gaggaggcct ccgcaaagaa
2221 gaaggccaag aaaaaactca aagagaagaa gaagagcatc ggtgggggct ctggttccgg
2281 ctcggcctct catcatgccc atgcctcggc tggtggtggc gcaggtggtc cgtccgccag
2341 ccatggtggc gtctctgttc cggccagcgt cgttgcccta tcggcgggcg gagcagcggg
2401 acccaaccta tcggccttac tcagcggttc gctggtgggc gcgagcggag cgggcggcgg
2461 agtgcccaac gtctcggccc ttcattcact cgcccacgac atgacccttg ccatgaacca
2521 actgacgggc aatgcggcag cgaccggagc tggctttacg gccggcatac cgccaccggg
2581 accagcgcca ggcggcaagg gggccaacat ggccagcgcc atggctgcgg cgggagcggg
2641 accaggggct tctgggggaa ccggcggcag cagcaaatcg aagagcaagg gccagcgcgg
2701 taaaggaagc tcgagcaatg cgggagcggg cggagcatcc tctggagtga cggtcaacgc
2761 gggcagcgga atcggcggtg ctggcgcagc gtccagcgct ggcagtggcg gtggcagtgg
2821 gactactggc cccagcggca attcctcaaa gcggggcaag ggtagtagta gtgccggcgg
2881 tggtggtggc ggcagcagtg gcagtgcggc caataccacc agtgcgaccg gtgccggagc
2941 gcgcggtagc agcaagaaga agcccagcca ggtgatgaac tttgactccg aggaggagga
3001 cacggccaag cccatgtcgt acgatgagaa gcgacaactc tcactggaca tcaacaagtt
3061 gccaggcgac aagctggggc gagtggtgca cattatccag aatcgggagc catcgctgcg
3121 ggactccaat ccggacgaga tagagatcga ctttgagacg ctgaagccgt cgacgctgcg
3181 cgagcttgaa agctatgtgg cgtcgtgttt gcgcaaaaaa acacataaaa agccttccgg
3241 caaatcgaag gatgagcaaa tggcggaaaa gaagcaggag ctggagaagc ggctgcagga
3301 cgtcaccggc cagctgggtg caagcaagaa aaacgccaag aaagatgatt ccacctcaaa
3361 caaagtggaa gctgtgcagc cagcgaatcc cgtgtcgtcg agttccagtt ccagcgattc
3421 atcgtcctcg agttcgagtg atagcagttc gagtgactcg agcgacagtg aagcaggtga
3481 tggagacgaa cgacctccgc gcaaaaagaa gccacgcgac tcgaatggca gcaatgtaaa
3541 taatcccagc ctcagcgttg ttttgggagg aaccctgccg agtggcggct tgacgccgac
3601 gctccttcac atgggcctgg accacgtgct caacgcaaac tcacccacat cacaaatggc
3661 caatctgttg ggcaatccga atcccttgac ggcggctgcc atgctgaaca ataacaacaa
3721 gacgacgttg caggtcggcg gcagcttcgg cggcgcctcc gcgggcggca acaacagcat
3781 gctgcacggc caggttggcg caacggcgca ccacaagaac ggacaaaacg accacctaat
3841 gactagcaaa atgccgcctg gcggtggggc tgccggagcc gcttccctgc cgccccacag
3901 ctatgccggt agcgtgccca cccaggccgc aagtctgggc ggagggggca tacggatagc
3961 cagtaatctg cacaagcaac ccaatctgcc cggcggcgac ctggtcgaga ataacgcggc
4021 gctgaccgct gctttgacgt cgaccgtcaa cagcaacagc ggcagcggcg ctggtgtggg
4081 agtggccgcc ggcctgagca acaccagcag tggcaccgcc tcgggccaca cacacaacac
4141 caacacaatg ggtggccagg gggatggctt gccgggtgcc tcgctggcat cgggcctcaa
4201 gcaaatcccg cagttcgacg atcccgttga gcagtcgctg gcctccctgg agttcagtgc
4261 ggcagcggcc tccacgggca agtcgggctt aaccgataac tttttgatgg cccagcaggg
4321 acacctaatg cagcaacctg gtctgcagca gcagccgccc tttggacacc accaacagca
4381 gcagcaacag cagcaacagc agcagcacca ccctatggac tacatgacgg aactgttgac
4441 aaagggtaca gagaacgtgc ccggtatgaa cggaaacctt ttgaacttca acctcgacat
4501 ggcggctgcg gcggcctacc aacagaagca gcaccatggc cagcacaccc agcagcagca
4561 gcagcaacaa cagcagcagc agcagcaaca gcagcagcaa caggcgtcgc acaacaatgg
4621 cttcaatgtg gcggactttg gcatgccagg attcgataat ctgactgcaa cagcattcct
4681 cgacctggag cacacactcc agcagcagca catgcagctg caacagcagc agcagcaaca
4741 gcaacaacac caccaccaac agcagtccct tcagcagctg catgcccacc agcagcagct
4801 ggccaacaaa atgctcctca tcaaaccgat cgagtcgatg atgcccagtc cgccggacaa
4861 gctgaaggta atgctgaagg tatcccaatc gccctccgac ctgaagctgc actcaagcca
4921 ggcggcggcc gcccagagta tgctggctca atccttcaag gcggccgagc agaacctgaa
4981 gaatgccagc tcgtggtcat cgctggcatc agccagctca ccgcagaaca cttcgagcag
5041 ttcgagcagc agcaagccca agccggcgat ggactcattc cagcaattcc gcaacaaagc
5101 caaagaacgc gaccgcctca agctgttgga ggcggccgag aaggagaaga agcatcaaaa
5161 ggaggctgcc gagaaggagc agcagcgcaa gcatcacaag tcatcctcgt cctcatccgc
5221 cgcacaggca gcctcctcaa caaacatctc caccggaaat gcgcctcagt ctggggcagg
5281 acagataagc agtgataggg aaaggtccag cagcgctgtg gatgccaccc gcatgggcag
5341 cggtgggggc ggcaacggcg gtctactcct acccggcaat ggcaacaaca gcagcaactc
5401 tgccaacagc aacggacccg gcagcgcggg tagcggcagt gggggccccg tcagtggtgg
5461 cctagccagt ggcggcggac ccaacagtgg cggctgcatc atcggaccat tggcaaatac
5521 aggcagcaac agcaacagcg gcgttggcag cggggccaca agcagcaaca gcaatagcag
5581 tgtgggcgga gtggtcggga gcggggggcc cggctcaaat agccaaggca gtagcggtgg
5641 gggcgggacg atgggaagcg gactcctaga atacggcccg caggtggtgg tcctcactca
5701 ggcggcggcg aatgcccagg cccagcacgt atcgactaca atgacggcgc agaccgtcct
5761 ggcttcctcg cccctggagg ccatggagag tggcaggaaa agcgtgcatg acacccagtc
5821 ccagatatcg cgtgtcgagg acatcaaggc atcgcccggc cagggccaga gttcgccggc
5881 acagcaatcg ccgcaagatc gggcagcggc caagcgtgcg gagcaacgaa gagcggagca
5941 ggagagacgc agacgcgaag cgttggctgg ccaaatcgac atgaacatgc agagtgatct
6001 aatggcagcg tttgaggaga cgttgtagga caaggccttc ggaggacagc aacggacata
6061 tacgcatccc agaaccagca tcctcattgc tgctccgaga atgccgtcgt cgctttcaaa
6121 gcggcgggcg gcgaggacgc caaagcctct gccaacacag acaacaatgg agagcagcac
6181 ca