Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

PREDICTED: Drosophila takahashii SET domain containing 2 (Set2),


LOCUS       XM_017155304            6922 bp    mRNA    linear   INV 09-DEC-2024
            mRNA.
ACCESSION   XM_017155304
VERSION     XM_017155304.3
DBLINK      BioProject: PRJNA1194641
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            On Dec 9, 2024 this sequence version replaced XM_017155304.2.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..6922
                     /organism="Drosophila takahashii"
                     /mol_type="mRNA"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     gene            1..6922
                     /gene="Set2"
                     /note="SET domain containing 2; Derived by automated
                     computational analysis using gene prediction method:
                     Gnomon. Supporting evidence includes similarity to: 2
                     Proteins"
                     /db_xref="GeneID:108066687"
     CDS             407..6640
                     /gene="Set2"
                     /codon_start=1
                     /product="histone-lysine N-methyltransferase Set2"
                     /protein_id="XP_017010793.2"
                     /db_xref="GeneID:108066687"
                     /translation="MEESSSSPAVASRGRGRGRPPKQVAPSQPEPPTATATPPPPDAG
                     EDQCRRSSRKKIIKFDVRDLLNKNRKAHKIQIEARIDSNPGCSSSSSASASRLFSMFD
                     SNQQATSKLPPPPTPPPPIALEIFAKPRATQSLIVAQVSSSEPSARKRGRPRKSQPTV
                     VSSDSDTNSTGTGTSSTSTSGQDGQDGQEEVNRKPKSKLRVSLKRLNLSRRQESSDSG
                     SGSGSGSTFSSPEVEVEVEVEPPALQDDNAMDEQPEPEQQQEQQMVMNEEEGNSDSDS
                     QIIFIEIETESPKRQEEQEEQEPNLDEIMVEVLSGPPSLWSEEDQEEEEPTASSPAPR
                     RSRRSAQPRGGGGGGSSSSQQGKTLEETFAEIAAESSKQILAAEEEVVEEEEESHVLI
                     DLIEDSLSQPANVEEKQNQEFVIEFVEETEKETEKETETQPVVSESESPVPAKELPNQ
                     EAPKPEAAEEIPEKLEPPTEPKDSIESAEEPQPNDEANSNSNSISMAADPKSTSKRIP
                     MNDDIDTKLATMQVSESDTPEEKPALSAREKPPDEAEGLKSPPETETEQQEQKKAETA
                     EPEKEVKSSPAAEERESKPLEKSSSPPEAIILKQKDEPEAIEATNSSLAMEELDKETA
                     KSPTEADTIKKPVTDHSSCKPTKETAKEKGFSPGFVECDAMFKAMDKANAQLRLEEKS
                     KKRQKKVPALGQKKPLGGRGGGKTTTSPPHSPKLERNSSPSSDSNQVSKLMKRSKAKK
                     KLNPRRSTICEEAKERLAASSPVSSSNSSDSSAKRRNLPEQPAAKLDLRRNTICEDHR
                     STPVPLTKRRFSMHPKASANPLHDTLLRSATSSASAAASAAGKKRGRKLGSSRQNSLD
                     SSCSASKKDALLETESSESTAELEATNPLSDIAKFIEDGVNLLKRDYKVEEEEEEQQQ
                     QEDEFAQRVANMETPATTPSPSPTQSNPEDPGNPGKLLLQENNSSSGGVRRSHRIKQK
                     PQGQRASQGRGVASAGLAPISMDEQLAELASIEAINEQFLRSEGLNTFQPLRENYYRC
                     ARQVSQENAEMQCDCFLTGDEEAQGHLSCGAGCINRMLMIECGPLCTNGQRCTNKRFQ
                     QHQCWPCRVFRTEKKGCGITAELQMPPGEFIMEYVGEVIDSEEFERRQHLYSKDRKRH
                     YYFMALRGEAIIDATSKGNISRYINHSCDPNAETQKWTVNGELRIGFFSVKPIQPGEE
                     ITFDYQYQRYGRDAQRCYCEAANCRGWIGGEPDSDEGEQLDVSSDSEAEAEAEEVEVE
                     DLDADAEPEKTSKGKTGRSSKIKVKLPLSQAASRKRKPKPKDREYKAGRWLKPTREKA
                     TRKPKVSKFHAMLEDPDVLEELSLLSRSGLKNQLDTLRFSRCLVRAKLLPTRLQLLGV
                     LTRGELPCRRLFLDYHGLRLLHAWISESGSDGQLRMALLDTLESLPIPNRTMLNDSRV
                     YQSVQLWSNSLAEASEASEEQHKRMVALLEKWQALPEIFRIPKRERIEQMKEHEREAD
                     RQQKHVHASTALEDQRERIGESSSNDRFRQDRFRRDTNSSRMTGSKPTRMSGNNTICT
                     ITTQPKGSNGSSDGMARNDNRRRAEMGNPAAEPRRTLSKELRRSLFERKVALDEAEKR
                     VCSEDWREHELRCEIFGADLNTDPKQLPFYQNADTGEWFNSEDMPVPAPQRTELLSQA
                     LLSPETPDNGGSGQAAPPAVEYKLPAGVDPLPPAWHWRMTSDGDIYYYNLRERISQWE
                     PPSPEQRLQTLVEEDATQQQQQPLHELQIDPALLATELIQVDLDYVGSLSSKSLAQFV
                     EAKVRERRELRRSRLVSVRVISPRRDEDRLYNQLESRKYKENKEKIRRRKEFFRRRKI
                     DAASAQPNSSTNPDDAAADSSNSLPIQAYLYSSDEDAPADAAPAGAAEPPLADGQTAA
                     QAEELDSLNLAPSTSHAALAALGKTTVVGSQAAAQSAGASGKRKLPMPPNAAAAVKKH
                     RQEHRSKKSKSSHSLLTTTSGREAHEKFRFEISGHVADFLRPYRKDSCQLGRITSDED
                     YKFLIKRLSHHITTKEVRYCDVTGNPLSCTESVKHKSYDFINQYMRKKGRVYRKPAES
                     TIF"
     misc_feature    <1715..2239
                     /gene="Set2"
                     /note="Prolipoprotein diacylglyceryl transferase; Region:
                     LGT; cl00478"
                     /db_xref="CDD:469786"
     misc_feature    3524..3682
                     /gene="Set2"
                     /note="associated with SET domains; Region: AWS;
                     smart00570"
                     /db_xref="CDD:197795"
     misc_feature    3680..4105
                     /gene="Set2"
                     /note="SET domain (including post-SET domain) found in SET
                     domain-containing protein 2 (SETD2) and similar proteins;
                     Region: SET_SETD2; cd19172"
                     /db_xref="CDD:380949"
     misc_feature    order(3713..3721,3770..3772,3800..3802,3812..3814,
                     3824..3826,3839..3859,3908..3922,3941..3949,3992..3994,
                     4025..4039,4043..4057,4061..4075,4100..4102)
                     /gene="Set2"
                     /note="active site"
                     /db_xref="CDD:380949"
     misc_feature    order(3713..3721,3839..3850,3908..3922,4031..4033,
                     4061..4075,4100..4102)
                     /gene="Set2"
                     /note="SAM binding site; other site"
                     /db_xref="CDD:380949"
     misc_feature    order(3770..3772,3800..3802,3812..3814,3845..3859,
                     3941..3949,3992..3994,4025..4039)
                     /gene="Set2"
                     /note="polypeptide substrate binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:380949"
     misc_feature    order(3926..3928,4067..4069,4073..4075,4088..4090)
                     /gene="Set2"
                     /note="Zn binding site [ion binding]; other site"
                     /db_xref="CDD:380949"
     misc_feature    5501..5590
                     /gene="Set2"
                     /note="WW domain; Region: WW; pfam00397"
                     /db_xref="CDD:459800"
     misc_feature    6350..6610
                     /gene="Set2"
                     /note="SRI (Set2 Rpb1 interacting) domain; Region: SRI;
                     pfam08236"
                     /db_xref="CDD:462404"
     polyA_site      6922
                     /gene="Set2"
                     /experiment="COORDINATES: polyA evidence [ECO:0006239]"
ORIGIN      
        1 agaataaaaa ctcattcgat tgcagacttg tagtgtagtg tagtgcagac atgctgtttt
       61 cctaaccgcc taatgaatgc ccccaatatt ctagctaaac aagcgcatac agaagacgaa
      121 gaagaaggag aagaagggca gagcagagca gagaaggaaa atcgaaggga acaataacaa
      181 ttaatcgaaa atcgaatgcg aatgcaagtg cagttagaaa gttagaggct gttctaactg
      241 ttaactgtaa atacccgcta actttgcgat ggccccgcga aaattgttca attgaattgg
      301 ccgtgtgtga aaaacaaaag ccgtacggct gctcctcctt ctgctttctg cttttaccac
      361 ccacctactg cgcctactgc cagactgccc aacaaaccca gcagagatgg aggaatcctc
      421 atcctcgccc gctgttgcgt cacgtggccg cggacgcgga aggccgccga aacaagtcgc
      481 tcccagtcag cccgagccgc ccactgccac tgccacgccc ccgcccccgg atgctggcga
      541 ggatcagtgc cgtcgcagtt cgcgcaagaa gatcatcaag ttcgatgtgc gggatttgct
      601 caacaagaac cgaaaggcgc acaaaattca gatcgaagcc cgcatcgact cgaatcccgg
      661 atgcagctcc tcctcctccg cctccgcgag ccgtctgttc tccatgttcg atagtaacca
      721 gcaggcaacc agcaagctac cgccacctcc cactcctcct cctcccatcg ccttggagat
      781 cttcgccaag ccgcgggcca cccaaagtct gattgtggcc caggtgagca gcagcgaacc
      841 cagcgcccgg aagcggggac gtcccaggaa gagccagccc accgtcgtat catccgattc
      901 ggacaccaat tccacgggca cgggcacgag ctccacgagc accagtggcc aggatggcca
      961 ggatggccag gaggaggtca accggaagcc caagtccaag ctgcgcgtct cgctgaagcg
     1021 cctgaatctc tcccgccgcc aggagtcgtc cgattccggc tctggatcgg gatcgggttc
     1081 caccttctcc tcgccggagg tggaagtgga ggtggaggtg gagccgccgg cactgcagga
     1141 tgacaacgct atggacgagc agccggagcc ggagcagcag caggagcagc agatggtgat
     1201 gaacgaggag gagggtaact ccgattccga ttcgcagatc atattcattg aaatcgaaac
     1261 ggagagtccc aaaaggcagg aggagcagga ggagcaggaa ccgaatctcg acgagatcat
     1321 ggtggaggtg ctcagtgggc cgcccagcct gtggtcggag gaggatcagg aggaggagga
     1381 acccaccgca tcctcgccag ctccacggcg cagcaggcga tcggcacagc cgcgcggcgg
     1441 cggcggcggc gggagcagca gcagccagca gggcaagacg ctcgaggaga cgttcgccga
     1501 gatagccgcc gagagcagca agcagatcct ggcagcggag gaggaggtgg ttgaggagga
     1561 ggaggagtcc catgtgctca tcgatctcat cgaggacagt ctcagtcagc cggcgaatgt
     1621 cgaggagaag cagaatcagg agtttgtcat tgaatttgtt gaggaaacgg aaaaggaaac
     1681 ggaaaaggaa acggagaccc agccagtggt ttccgaatcg gaatcgcctg ttccagcgaa
     1741 agaactgccc aaccaggagg caccaaagcc agaagcagcc gaagagatcc ctgagaaact
     1801 ggaaccacct acggaaccaa aagattcgat agaaagtgcg gaggagccgc agcccaatga
     1861 tgaagcgaat tctaattcga attcgatttc aatggcagct gatccaaagt cgacgagtaa
     1921 acgaattccg atgaacgacg atatcgacac caagttggcc acgatgcaag tgtcagaatc
     1981 tgacacccca gaagaaaaac cagcactgtc agccagggaa aagcctccag atgaagcgga
     2041 aggcttaaaa tccccgccag aaactgaaac agaacagcag gagcagaaga aagctgagac
     2101 agctgagcca gagaaagaag ttaaatcctc accagcagct gaggaaaggg aatctaagcc
     2161 actggagaaa tcctcctccc ctccagaggc gatcattctg aagcaaaaag acgagccaga
     2221 agcaatagaa gccaccaatt cttcattagc gatggaggag ctcgataaag aaactgccaa
     2281 aagtccaaca gaagccgaca caatcaagaa gcccgtaacc gatcattcct cctgcaagcc
     2341 gactaaggag acggccaagg agaagggatt ctcgcccggc ttcgttgagt gcgatgccat
     2401 gttcaaggcc atggacaagg cgaatgccca gctgcgtctc gaggagaaga gcaagaagag
     2461 gcagaagaaa gtgcccgctc ttggccaaaa gaagccgctg ggaggcagag gaggaggcaa
     2521 gaccaccaca tcaccacctc attcgcccaa gctggagagg aacagctcgc cgagctccga
     2581 ttccaaccaa gtgagcaagc tgatgaagcg ttcgaaggcc aaaaagaagc tcaatccgcg
     2641 acgcagcacc atttgtgagg aggccaagga gaggctggcc gccagttcgc ccgtttcctc
     2701 ctcgaattca tcggattcct ccgcgaagcg aaggaatctt cccgagcagc cagcagccaa
     2761 gttggatctg cgccgcaaca ccatctgcga ggatcaccgg agcactccgg tgccgctgac
     2821 caagcgccgc ttctccatgc atcccaaggc ctccgcgaat cccctgcacg acaccctcct
     2881 gcgatcggcg acttcatctg catctgcagc tgcatctgcc gccggcaaga agcgtggtcg
     2941 aaagctgggt tcgagtcgcc agaactcgct ggactccagt tgcagtgcct cgaagaagga
     3001 tgccctcttg gaaacggagt cctccgagtc cacagccgag ctggaggcaa ccaatccgct
     3061 cagcgacata gccaagttta tcgaagacgg cgttaatctt ctcaagcgcg actacaaagt
     3121 ggaggaggag gaggaggagc agcagcagca ggaggacgag tttgcgcaac gagtggccaa
     3181 tatggagacg ccggccacca caccctcgcc ctcgcccacg caatccaatc cagaggatcc
     3241 cggcaatccc ggcaagctgc tgcttcagga aaataatagt agcagcggcg gagtgcgtcg
     3301 ctcgcatcgc atcaagcaga agccccaggg gcagcgggcc agccagggaa gaggggtggc
     3361 cagcgccggg ctggcaccga tcagcatgga cgagcagctg gccgagttgg ccagcatcga
     3421 ggcgataaac gagcagttcc tccgcagcga gggactcaac acgtttcagc cgctaaggga
     3481 gaactactac cgatgcgcca ggcaggtcag ccaggaaaac gccgagatgc agtgcgactg
     3541 cttcctcacc ggcgacgagg aggcccaggg ccatctgagc tgcggcgccg gctgcatcaa
     3601 tcgcatgctg atgatcgagt gcggcccgct gtgcaccaac ggccagcgct gcacgaacaa
     3661 gcgcttccag cagcaccagt gctggccctg ccgcgtcttt cgcaccgaga agaagggctg
     3721 cggcatcacc gcggagctgc aaatgccgcc cggcgagttc atcatggagt acgtcggcga
     3781 ggtgatcgac agcgaggagt tcgagcggcg gcagcacctc tattcgaagg accgcaagcg
     3841 gcactactac ttcatggcgc tgcgcggcga ggcgatcatc gatgccacct ccaaggggaa
     3901 tatatcgcgg tacatcaacc acagctgcga tccgaatgcc gagacccaga agtggacggt
     3961 caacggcgag ctgcgcatcg gcttcttcag cgtgaagccc attcagccgg gcgaggagat
     4021 caccttcgac tatcagtatc agcggtacgg acgcgatgcg cagcgctgct actgcgaggc
     4081 ggccaactgc aggggctgga ttggcggcga gccggactcc gatgagggtg agcagctgga
     4141 cgtgagcagc gacagcgagg cggaggcgga ggccgaggag gtggaggtgg aggacctcga
     4201 tgcggatgcg gagcccgaga agacgtcgaa ggggaagacg ggcaggtcta gcaaaatcaa
     4261 ggtgaagctg ccgctctcgc aggctgccag tcgcaagcgg aagcccaagc ccaaggatcg
     4321 ggaatacaag gccggccgct ggctgaagcc gacgagggag aaggcgacgc ggaagccgaa
     4381 ggtcagcaag ttccatgcca tgctcgagga tcccgatgtg ctcgaggagc tgtcgctgct
     4441 cagccgcagc ggtctgaaga accagctgga cacgctgcgc ttctcgcgct gcctggtccg
     4501 cgccaagctg ctaccaacgc gcctccagct gctcggagtt ctaacgcgcg gcgagctgcc
     4561 ctgccgccgg ctcttcctcg actaccacgg cctgcggctg ctgcacgcct ggatcagcga
     4621 gagcggcagc gatggccagc tgcgaatggc cctgctcgac accctcgagt cgctgcccat
     4681 tccgaatcgc accatgctga acgacagccg cgtctaccag agcgtccagc tgtggagcaa
     4741 cagcttggcg gaggcgtcgg aggcgtcgga ggagcagcac aaacggatgg tggccctgct
     4801 ggagaagtgg caagcgctgc ccgagatctt tcgcataccc aagcgcgagc gcatcgagca
     4861 gatgaaggag cacgagcggg aggccgaccg ccagcagaag cacgtccatg cgagcaccgc
     4921 tctggaggat cagcgcgagc ggattggcga gagcagcagc aacgatcgct tccggcagga
     4981 tcgctttcgc cgggacacga acagcagtcg catgactggc agcaagccga caaggatgag
     5041 cggcaacaat acgatatgca cgatcaccac gcagccgaag ggcagcaatg gatcctcgga
     5101 cggcatggcc aggaacgata atcgccgacg ggcggagatg ggtaatcccg ccgcggagcc
     5161 acgacgcacg ctgtccaagg agctcaggcg cagtctgttc gagcggaagg ttgccctcga
     5221 tgaggccgag aaacgggttt gcagcgagga ctggcgggag cacgagctgc gctgcgagat
     5281 ctttggcgcc gatctcaaca cggatcccaa gcagctgccc ttctatcaga acgcggacac
     5341 gggcgaatgg ttcaacagcg aggacatgcc agtgccagcg ccgcagcgca cagagctcct
     5401 ctcgcaggcg ctgctctcgc ccgaaacgcc ggacaacggc ggcagcggac aggcagctcc
     5461 gccggccgtc gagtacaagc tgccggccgg cgtggatccc ctgccgcccg cctggcactg
     5521 gcggatgacc agcgacggcg acatctacta ctacaatctg cgcgaacgca tctcgcaatg
     5581 ggagccgccc agtcccgagc agcgcctcca gactttggtc gaggaggatg cgacgcagca
     5641 gcagcagcag ccgcttcacg agctccagat cgatcccgcg ctcctggcca ccgagctcat
     5701 ccaggtggac ctcgattacg tgggctcgct cagctccaag tccctggccc agtttgtcga
     5761 ggccaaggtg cgggagcggc gcgagctgcg acgcagtcgc ttggtctccg tccgtgtgat
     5821 tagtccgcga cgagacgagg atcgcctgta caatcagctg gagtcgcgaa agtacaagga
     5881 gaacaaggag aagatccgcc gtcgcaagga gttcttccgt cgccgcaaga tcgacgctgc
     5941 ttctgctcag ccgaattcct cgacgaatcc agacgatgct gccgccgata gcagcaattc
     6001 actgcccatt caggcctacc tgtactcatc cgacgaggat gctcccgcgg atgctgctcc
     6061 agctggagct gcggagccgc ccctcgccga tgggcagact gcggcccagg cggaggagct
     6121 ggattcactt aacttggcgc cgagcaccag tcatgcggct ctggccgctc tcggcaagac
     6181 cacagtcgtc ggcagtcagg cggcggcaca atcagccgga gcaagtggca agcgcaagct
     6241 gccaatgcca ccgaatgcgg cggcggcggt gaagaagcat cgccaggagc accgcagcaa
     6301 gaagagcaaa agctcgcaca gccttttgac gaccaccagc ggacgcgagg cccacgagaa
     6361 gttccgcttc gaaataagcg ggcatgtcgc cgactttctg cgaccctatc gcaaggacag
     6421 ctgccaactg ggtcggatca ccagcgacga ggactacaag ttcttaatta agaggctcag
     6481 ccatcatatt accaccaagg aagtgcgcta ttgcgacgtc accggcaatc ccttgtcctg
     6541 cacggagtcg gtcaagcaca agtcctacga cttcatcaac cagtacatgc gaaagaaggg
     6601 ccgcgtctat cggaaaccag cggaaagcac tatattctag cacacccaat tcaacggcgg
     6661 gaggacatgt ccttcttttt cttcttcact taatacccaa ccaaattgcg catgaactaa
     6721 gaactaaaaa cgtataataa ctataactat aactagaatt ttattttaag tgcaaacctt
     6781 agataagaac aaacccctta gtatgcgatc taccctcgaa gcgggtagcg tcttaagtga
     6841 aaaccactag aacattatga atcttaaaga gagagttaga gaattattaa atacacggaa
     6901 aataaatgga aaatcacaca ta