Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

PREDICTED: Drosophila takahashii RNA polymerase II subunit RpII215


LOCUS       XM_017155674            6149 bp    mRNA    linear   INV 09-DEC-2024
            (Polr2A), mRNA.
ACCESSION   XM_017155674
VERSION     XM_017155674.3
DBLINK      BioProject: PRJNA1194641
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            On Dec 9, 2024 this sequence version replaced XM_017155674.2.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..6149
                     /organism="Drosophila takahashii"
                     /mol_type="mRNA"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     gene            1..6149
                     /gene="Polr2A"
                     /note="RNA polymerase II subunit RpII215; Derived by
                     automated computational analysis using gene prediction
                     method: Gnomon. Supporting evidence includes similarity
                     to: 8 Proteins"
                     /db_xref="GeneID:108066911"
     CDS             355..6024
                     /gene="Polr2A"
                     /codon_start=1
                     /product="DNA-directed RNA polymerase II subunit RPB1"
                     /protein_id="XP_017011163.1"
                     /db_xref="GeneID:108066911"
                     /translation="MSTTTDSKAPLRQVKRVQFGILSPDEIRRMSVTEGGVQFAETME
                     GGRPKLGGLMDPRQGVIDRTSRCQTCAGNMTECPGHFGHIDLAKPVFHIGFITKTIKI
                     LRCVCFYCSKMLVSPHNPKIKEIVMKSKGQPRKRLAYVYDLCKGKTICEGGEDMDLTK
                     ENQQPDPNKKPGHGGCGHYQPSIRRTGLDLTAEWKHPNEDSQEKKIVVSAERVWEILK
                     HITDEECFILGMDPKYARPDWMIVTVLPVPPLAVRPAVVMFGAAKNQDDLTHKLSDII
                     KANNELRKNEASGAAAHVIQENIKMLQFHVATLVDNDMPGMPRAMQKSGKPLKAIKAR
                     LKGKEGRIRGNLMGKRVDFSARTVITPDPNLRIDQVGVPRSIAQNLTFPELVTPFNID
                     RMQELVRRGNSQYPGAKYIVRDNGERIDLRFHPKSSDLHLQCGYKVERHLRDDDLVIF
                     NRQPTLHKMSMMGHRVKVLPWSTFRMNLSCTSPYNADFDGDEMNLHVPQSMETRAEVE
                     NIHITPRQIITPQANKPVMGIVQDTLTAVRKMTKRDVFITREQVMNLLMFLPTWDGKM
                     PQPCILKPRPLWTGKQIFSLIIPGNVNMIRTHSTHPDEEDDGPYKWISPGDTKVMVEH
                     GELIMGILCKKTLGTSAGSLLHICFLELGHDIAGRFYGNIQTVINNWLLLEGHSIGIG
                     DTIADPQTYNEIQQAIKKAKDDVINVIQKAHNMELEPTPGNTLRQTFENKVNRILNDA
                     RDKTGGSAKKSLTEYNNLKAMVVSGSKGSNINISQVIACVGQQNVEGKRIPYGFRKRT
                     LPHFIKDDYGPESRGFVENSYLAGLTPSEFYFHAMGGREGLIDTAVKTAETGYIQRRL
                     IKAMESVMVNYDGTVRNSVGQLIQLRYGEDGLCGELVEFQNMPTVKLSNKAFEKRFKF
                     DWSNERYMRKVFTDDVIKEMTDSSDAIQELEAEWDRLVADRDSLRQIFPNGDSKVVLP
                     CNLQRMIWNVQKIFHINKRLPTDLSPMRVIKGVKGLLERCVIVTGNDRISKQANENAT
                     LLFQCLIRSTLCTKYVSEEFRLSAEAFEWLIGEIETRFQQAQANPGEMVGALAAQSLG
                     EPATQMTLNTFHFAGVSSKNVTLGVPRLKEIINISKKPKAPSLTVFLTGGAARDAEKA
                     KNVLCRLEHTTLRKVTANTAIYYDPDPQRTVISEDQEFVNVYYEMPDFDPTRISPWLL
                     RIELDRKRMTDKKLTMEQIAEKINVGFGEDLNCIFNDDNADKLVLRIRIMNNEENKFQ
                     DEDEAVDKMEDDMFLRCIEANMLSDMTLQGIEAIGKVYMHLPQTDSKKRIVITETGEF
                     KAIGEWLLETDGTSMMKVLSERDVDPIRTSSNDICEIFQVLGIEAVRKSVEKEMNAVL
                     QFYGLYVNYRHLALLCDVMTAKGHLMAITRHGINRQDTGALMRCSFEETVDVLMDAAA
                     HAETDPMRGVSENIIMGQLPKMGTGCFDLLLDAEKCRFGIEIPNTLGSSMLGGAAMFI
                     GGGSTPSMTPPMTPWVNCNTPRYFSPPGHVSAMTPGGPSFSPSAASDASGMSPSWSPA
                     HPGSSPSSPGPSMSPYFPASPSVSPSYSPTSPNYTASSPGGASPNYSPSSPNYSPTSP
                     LYAAASPRYASTTPNFNPQSTGYSPSSSGYSPTSPVYSPTPQFQSSPSFAGSGSNLYS
                     PGNAYSPSSSNYSPNSPSYSPTSPSYSPSSPSYSPTSPCYSPTSPSYSPTSPNYTPVT
                     PSYSPTSPNYSASPHYSPASPAYSQTGVKYSPTSPTYSPPSPSYDGSPGSPQYTPGSP
                     QYSPASPKYSPTSPLYSPSSPQHSPSNQYSPTGSTYSATSPRYSPNMSIYSPSSTKYS
                     PTSPTYTPTARNYSPTSPMYSPTAPSHYSPTSPAYSPSSPTFEESDD"
     misc_feature    397..2958
                     /gene="Polr2A"
                     /note="Largest subunit (Rpb1) of eukaryotic RNA polymerase
                     II (RNAP II), N-terminal domain; Region: RNAP_II_RPB1_N;
                     cd02733"
                     /db_xref="CDD:259848"
     misc_feature    order(418..420,430..432,439..444,583..591,595..597,
                     628..630,1078..1080,1084..1086,1090..1092,1105..1110,
                     1282..1284,1345..1347,1354..1359,1366..1368,1387..1389,
                     1396..1398,1402..1425,1438..1446,1480..1482,1489..1494,
                     1699..1701,1705..1707,1711..1713,1723..1728,1732..1737,
                     1768..1773,1777..1779,1786..1788,1813..1824,1828..1830,
                     1834..1836,1840..1842,1849..1854,1861..1863,1870..1875,
                     1882..1887,1900..1902,1945..1950,1957..1959,2368..2373,
                     2635..2637,2650..2655,2668..2670,2677..2682,2707..2712,
                     2800..2805,2809..2817,2824..2829,2836..2841,2845..2853,
                     2860..2865,2869..2874,2911..2916,2923..2925,2935..2937)
                     /gene="Polr2A"
                     /note="RPB1 - RPB2 interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:259848"
     misc_feature    order(418..459,472..474,589..666,1063..1122,1234..1254,
                     1261..1287,1342..1392,1396..1410)
                     /gene="Polr2A"
                     /note="clamp; other site"
                     /db_xref="CDD:259848"
     misc_feature    order(472..486,577..579,1114..1116,1120..1122,1138..1155,
                     1159..1167,1171..1176,1183..1185,1195..1197,1243..1245,
                     1252..1257,1264..1269,1294..1311,1318..1335,1339..1341,
                     1354..1356,1369..1371,1582..1584,1591..1593,1603..1614,
                     1618..1626)
                     /gene="Polr2A"
                     /note="RPB1-TFIIB interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:259848"
     misc_feature    order(553..555,562..564,583..585,592..594)
                     /gene="Polr2A"
                     /note="Zn-binding [ion binding]; other site"
                     /db_xref="CDD:259848"
     misc_feature    order(652..657,1366..1368,1381..1383,1402..1404,
                     1420..1422,1708..1716,1813..1815,1819..1821,1825..1827,
                     2902..2907)
                     /gene="Polr2A"
                     /note="active site region [active]"
                     /db_xref="CDD:259848"
     misc_feature    3022..3567
                     /gene="Polr2A"
                     /note="RNA polymerase Rpb1, domain 6; Region:
                     RNA_pol_Rpb1_6; pfam04992"
                     /db_xref="CDD:461511"
     misc_feature    3502..4758
                     /gene="Polr2A"
                     /note="Largest subunit (Rpb1) of Eukaryotic RNA polymerase
                     II (RNAP II), C-terminal domain; Region: RNAP_II_Rpb1_C;
                     cd02584"
                     /db_xref="CDD:132720"
     misc_feature    order(3559..3564,3574..3576,3583..3585,4732..4734,
                     4741..4752,4756..4758)
                     /gene="Polr2A"
                     /note="Rpb1 - Rpb6 interaction site [polypeptide binding];
                     other site"
                     /db_xref="CDD:132720"
     misc_feature    order(3574..3666,3670..3762,3778..3822,4228..4230,
                     4258..4287,4291..4293,4306..4491,4498..4557,4561..4605)
                     /gene="Polr2A"
                     /note="cleft [active]"
                     /db_xref="CDD:132720"
     misc_feature    order(3586..3588,3595..3597,4198..4200,4210..4212,
                     4648..4650,4678..4686,4702..4707,4711..4713,4717..4719,
                     4732..4734)
                     /gene="Polr2A"
                     /note="Rpb1 - Rpb2 interaction site [polypeptide binding];
                     other site"
                     /db_xref="CDD:132720"
     misc_feature    order(3721..3723,4576..4578,4627..4632,4639..4641)
                     /gene="Polr2A"
                     /note="DNA binding site [nucleotide binding]"
                     /db_xref="CDD:132720"
     misc_feature    order(4372..4374,4390..4395,4399..4401,4429..4449,
                     4453..4455,4513..4515,4546..4551,4555..4557)
                     /gene="Polr2A"
                     /note="Rpb1 - Rpb5 interaction site [polypeptide binding];
                     other site"
                     /db_xref="CDD:132720"
     misc_feature    4603..4728
                     /gene="Polr2A"
                     /note="clamp [active]"
                     /db_xref="CDD:132720"
     misc_feature    <4777..5946
                     /gene="Polr2A"
                     /note="Herpes virus major outer envelope glycoprotein
                     (BLLF1); Region: Herpes_BLLF1; pfam05109"
                     /db_xref="CDD:282904"
ORIGIN      
        1 actcgccagt ggtatttccg acacttttcg gtcgcgttgt ccgcgttttc acgtcgcatc
       61 gcgctagaaa aaagggaaat tcgcactcgc aaccgcctgt agagagaccg aatgttgtag
      121 aatgcacagc agcagacagt gattgtttac ccagtcttgt tttcgcccac gccccgccga
      181 tccgtgccgt gcaatctctc cttctccccc ctcggcggcc ccccgaatcg cacgaggtga
      241 aaagtgtgac gacgcagcag aaagcagccc cactttctgg gatctcgagt cgccagtggt
      301 tctgcctcca attccgatct cctagtggcc attggccagt gaccagtgac caggatgagc
      361 accaccacgg actcgaaggc gccgctgcgc caggtgaagc gcgtacagtt cggcattttg
      421 tcacctgatg aaattcgccg catgtccgtc accgagggtg gcgtccagtt cgcggaaacg
      481 atggagggcg gccgaccgaa gctgggcggc ctgatggatc cccgccaggg cgtcatcgac
      541 aggacgtcgc ggtgccagac gtgcgccggc aacatgaccg aatgccccgg ccactttggg
      601 cacatcgatc tggccaagcc ggtgttccac atcggtttca tcaccaagac aatcaagatt
      661 ctgcgctgcg tttgcttcta ctgctccaag atgctcgtct cgccccacaa tcccaagatc
      721 aaggagattg tgatgaagtc caagggacag ccgcgcaagc gattggctta cgtctacgat
      781 ttgtgcaagg gcaagacgat ttgcgagggc ggtgaggaca tggatctgac caaggagaac
      841 cagcagccgg atccgaacaa gaaacccggt cacggcggct gcggtcacta tcagccttcg
      901 attcgacgca ctggactcga tctcaccgca gaatggaagc atcccaatga ggattcgcag
      961 gagaagaaga ttgtggtgtc ggcggagagg gtttgggaga tcctcaagca catcaccgac
     1021 gaggagtgct ttattctggg aatggacccc aaatacgccc gtcccgattg gatgatcgtc
     1081 accgtgctgc ctgttccccc attggccgtt cgtccggcgg tcgtgatgtt tggtgcggcc
     1141 aagaatcagg acgatttgac ccacaagcta tcggacatca tcaaggcgaa caatgagctg
     1201 cgcaagaacg aggccagcgg agcagcggcg catgttatac aggagaacat caagatgctg
     1261 cagttccatg tggccacgct ggtcgacaac gatatgcccg gcatgccgag ggccatgcaa
     1321 aagtcgggga aacccctaaa agccatcaag gcgcgtctca agggcaagga gggcagaatt
     1381 cgtggcaatt tgatgggcaa acgtgtcgat ttctccgccc gcacagtcat cacacccgat
     1441 cccaatttgc gcatcgatca ggtgggcgtt ccacgctcca ttgcccagaa tctgaccttc
     1501 cccgagctgg tcaccccctt caatatcgat cgcatgcagg agttggttcg ccgaggcaat
     1561 tcccaatacc caggagccaa gtacattgtg cgcgacaatg gagagcgcat cgatctgcgc
     1621 ttccatccca aatcctcgga tctacatctg cagtgcggct acaaggtgga gcggcatttg
     1681 cgtgacgacg atctggtcat tttcaatcgc cagccgacgc tgcacaagat gagtatgatg
     1741 gggcacaggg tgaaggtgct gccctggtcg acgtttcgca tgaacctctc ctgtacatcg
     1801 ccctacaatg cggatttcga cggtgacgag atgaatctcc atgtgccgca gtccatggaa
     1861 acgcgcgccg aggtggagaa tattcacatc acccccaggc agattattac gccgcaggca
     1921 aacaaacccg tcatgggcat cgtgcaagac accctgacgg cggtgcgaaa gatgaccaag
     1981 agggatgtct ttatcacaag ggagcaggtg atgaatctac tgatgttcct gcctacatgg
     2041 gatggcaaaa tgccgcaacc gtgcattcta aaaccacgtc ccttgtggac cggcaagcag
     2101 atcttctcgc tgatcatccc cggcaacgtg aacatgatac gcacgcactc cacgcatccc
     2161 gacgaggagg atgacgggcc gtacaagtgg atctcgcccg gcgacaccaa ggtaatggtc
     2221 gagcacggtg aacttatcat gggaattctt tgcaaaaaga cgctgggaac ctcggcggga
     2281 tcgctgctgc atatttgctt ccttgaattg ggacacgata tagccggtcg gttctacggc
     2341 aacattcaga ccgtgatcaa caattggctg ctgctcgagg ggcacagcat tggtatcggt
     2401 gacaccattg ccgatccgca gacctacaac gagatccagc aggccatcaa gaaggccaag
     2461 gacgatgtga taaacgtcat ccagaaggcc cacaacatgg aactcgagcc cacgcctggt
     2521 aatacgctgc gccagacatt cgagaacaag gtgaaccgca tcctgaacga tgctcgtgac
     2581 aaaaccggtg gctcggccaa gaaatccctc accgaataca acaatctaaa ggccatggtg
     2641 gtgtccggat ccaagggatc caacattaat atctcacagg tcattgcctg tgtgggccag
     2701 cagaacgtgg agggcaagcg gatcccgtac ggcttccgca agcgcactct tccccacttc
     2761 atcaaggacg attatggtcc ggagtcgcgt ggcttcgtgg agaactcgta tctggccggc
     2821 ctaacgccct cggagttcta tttccacgct atgggcggtc gcgagggtct cattgatacc
     2881 gctgtaaaga cagccgaaac tggctacatt cagcgacgtc tcatcaaagc tatggagtcg
     2941 gtgatggtca actatgacgg cacggtgcgt aattcggtgg gccagctgat tcagctgcgt
     3001 tacggcgagg atggtctctg cggcgagttg gtcgagttcc agaacatgcc aacggtcaag
     3061 ctatcgaaca aggctttcga gaagcgcttc aagttcgact ggagcaacga gcgatatatg
     3121 cgaaaggttt tcaccgacga cgtgatcaaa gagatgaccg acagcagcga tgccatccag
     3181 gaattggagg ccgaatggga tcgcctagtc gccgatcgcg atagtttgcg acaaatcttt
     3241 cccaacggcg attcaaaggt tgtgctgcct tgcaatctgc aaaggatgat ttggaatgta
     3301 cagaagatct ttcacatcaa caagcgactg cccacggatc tctcgcccat gcgggtgatc
     3361 aagggagtca agggattgct tgaacgatgc gttattgtaa ctggcaacga tcgcatatcg
     3421 aagcaggcga atgagaatgc cacgctgctg tttcagtgcc taattcgctc gactctttgc
     3481 acgaaatatg tgtccgagga gttccgcttg tcggccgagg cttttgagtg gcttattgga
     3541 gagatcgaga cgcgtttcca gcaagcgcag gcaaatcccg gcgagatggt gggcgccttg
     3601 gccgcccaga gtttgggcga gcccgccact cagatgacac tgaatacctt ccattttgcc
     3661 ggtgtgtcgt cgaagaacgt gacattggga gtgccgcgtc tcaaggagat tatcaatatc
     3721 tccaagaagc ccaaggcgcc gtcgctaacc gttttcctaa cgggcggtgc tgctcgcgat
     3781 gcggagaagg ccaagaatgt gctgtgccgg ctggagcata cgacgctgcg caaggtgact
     3841 gcgaatacgg ctatttacta cgatccagat ccgcagagaa cggtgatttc ggaggatcag
     3901 gagtttgtga acgtgtacta cgaaatgccc gactttgatc ccacacgcat ctcgccctgg
     3961 ctgctgcgca tcgagttgga tcgcaagcgg atgacggaca agaagctgac catggagcag
     4021 attgccgaga agatcaacgt gggctttggc gaggatctca attgcatatt caatgatgac
     4081 aatgcggata agctggtgct gcgcatcagg atcatgaaca acgaggagaa caagttccag
     4141 gatgaggatg aggcggtgga taagatggag gacgacatgt tcctgcgctg catcgaggcc
     4201 aatatgcttt cggatatgac actgcagggc atcgaggcga ttggcaaggt gtacatgcat
     4261 ttgccgcaaa ccgacagcaa gaaacgcatc gtgataacgg aaacgggcga gtttaaggcg
     4321 attggcgagt ggctgctcga aacggacggc acgtcgatga tgaaagtgct ctccgaacgc
     4381 gacgtcgatc ccattcgcac ctcgtccaac gatatctgcg agatcttcca ggtgctgggc
     4441 atcgaggcgg tgcgcaagtc ggtcgagaag gagatgaatg cggtgctgca gttctacggc
     4501 ctgtacgtga actatcgcca cttggctctg ttgtgcgatg tgatgaccgc caagggccat
     4561 ctgatggcca tcacgcgtca tggtatcaat aggcaggaca ctggtgccct catgagatgc
     4621 tcctttgagg agacggtgga tgtgctgatg gatgccgccg cccatgccga aacggatccc
     4681 atgagaggcg tttcggagaa cattatcatg ggtcagctgc ccaaaatggg caccggctgc
     4741 ttcgatctgc tgctggatgc cgagaagtgc cgcttcggca tcgagatacc gaacacactg
     4801 ggcagcagca tgctgggcgg cgccgccatg ttcatcggcg gcggctccac gccgagcatg
     4861 acgccaccga tgacgccgtg ggtcaactgc aacacgccgc gctacttctc gccaccggga
     4921 catgtaagtg cgatgacacc tggcggaccc agtttctcgc cctcggcggc ctcggatgcc
     4981 tctggcatgt cgcccagctg gtcgccagcc catccgggct cctcgcccag ttcaccgggc
     5041 ccctcgatgt cgccctactt cccggcctcg cccagcgttt cgccctctta ctcgccaacg
     5101 agtccgaatt atacggcctc gtcgcctggc ggagcgtcac cgaattactc gccctcgagt
     5161 cccaactact cgccgacatc gccgctttat gcggctgcaa gtccgcgtta cgcctcgacg
     5221 acgcccaact tcaatcccca gtcgacgggc tattcgcctt cttcgtcggg ctactcgccc
     5281 acgtcccctg tttactcgcc cacaccgcag ttccagtcga gtccgtcgtt tgcgggcagc
     5341 ggcagcaact tgtactcgcc gggcaatgcg tactcgccca gctcgtccaa ttactcaccc
     5401 aactcgccat cctattcgcc gacatcaccg tcctactcgc cgtcgagtcc ctcgtactcg
     5461 cccacgtcgc cttgctactc gcccacatcg ccttcgtact cgccaacgag tccgaactac
     5521 acgcccgtca cgccctcgta ctcgccgacc agtccgaact actcggcgtc gccgcattac
     5581 tccccagcct cgccggccta ctcgcagacg ggggtaaagt actcgccgac atcgcccacg
     5641 tactcgccgc cctcaccgtc gtacgatggc tcgccgggat caccgcaata cacgccggga
     5701 tcgccgcagt actcgccggc ctcgcccaag tactcgccga cctcgccact gtactcgccc
     5761 agctcgccgc agcactcgcc gtcgaaccag tacagtccca ctggatcgac gtactcggcg
     5821 acgagtccgc ggtactcgcc gaacatgtcc atctactcgc cgagcagcac caagtactcg
     5881 cccacctcac cgacctacac gccgacggcc cgcaactact cgcccacatc gcctatgtac
     5941 tcgccgacgg cgccatcgca ctacagtcca acgagtccgg cctactcgcc cagcagtccc
     6001 acgttcgagg agagcgacga ctgaggaagg gaggacgggg gcagctcccc cggcacgtcg
     6061 cgatccccgg ggcggtcgag tcgtagttaa gatcgaccat gtagagggga aaaagggtga
     6121 atttaatgat ataaagtccg ctgataggt