Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

PREDICTED: Drosophila takahashii histone-lysine N-methyltransferase


LOCUS       XM_070218560            4104 bp    mRNA    linear   INV 09-DEC-2024
            SETD1B-A (LOC108060961), transcript variant X1, mRNA.
ACCESSION   XM_070218560
VERSION     XM_070218560.1
DBLINK      BioProject: PRJNA1194641
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..4104
                     /organism="Drosophila takahashii"
                     /mol_type="mRNA"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     gene            1..4104
                     /gene="LOC108060961"
                     /note="histone-lysine N-methyltransferase SETD1B-A;
                     Derived by automated computational analysis using gene
                     prediction method: Gnomon. Supporting evidence includes
                     similarity to: 2 Proteins"
                     /db_xref="GeneID:108060961"
     CDS             181..4011
                     /gene="LOC108060961"
                     /codon_start=1
                     /product="histone-lysine N-methyltransferase SETD1B-A
                     isoform X1"
                     /protein_id="XP_070074661.1"
                     /db_xref="GeneID:108060961"
                     /translation="MSEPAVPAEELCSTISNADISSSSSMSLASTAIDLIHTGLPLLE
                     QSAADLESLPEKQDAEAKQEPKAMEEAVLETIAEPPTDLAADVTADSPLEPPSEPLAE
                     PLPEPLPKTLQEPLLKCLIEPLPQPVQEPLPKPLQESLPQPLPEPLVKPVPEPLPENT
                     TEPLQEPLAEKQPQPIPEPLSNPLPKALPEPLPEPPLPEPLPEPLPKPLPEPLPEPQE
                     GRPETPAGNPENQAQAESPPSIMDISISAQMSPDAPVFYPMGSCLARLLTNGGGGGGD
                     SGQDSPSVPRISPPRSAFEYGTGPYIGPGGDIPRSYQFIDPTATGERNFNSFGLGMDD
                     QEMDEQQQAAQLAETPGVSWVPYYFGSPRTRTTGEDPLSPETGAARSRNNSRSHCECR
                     GEASEVGTSTEFPSFESFAGQLDEIGLARDEFMATLRNLLIRANDLLAPLLQGPFQHM
                     TPIQMAEYTSAILETPPLHPSLFMPANVPRPWTGLTGLSLPYDIHPHSLTPEDLARIQ
                     ALPKPVDAATQTEFRCMCMLLSQAGSAPASGDLRPPTTSYFPGQAMNGPRMRVPPQPY
                     ANPGTSQNGQAQPQPQQPYGFWGRPMPMAMHPHQVPQHRMPRQAAFQHPPPPQRHPHP
                     QAHPQPHPRPGMQPGHPMAHHKMGGAVRYPEAGNYRGYRLPGPYSPRNQASRYGAIGS
                     DRPAENNGNINGRNHYQNSGYRGNPLYARTALGNNQQQMASVAPNIGHLGNPIGNPVQ
                     PMDLGMQLRLNLSAEEANASRQVASDSSRTESSATENTSYHSERRVSEESSYVEAEDE
                     DGDSEEVDSEGEEDEVEGGEDEDDDDEEEEDENENDEEEDEDEDEEDDEQPRQLSAPD
                     SRSYCFGEVEFMNYVMSETPMLPNPGQQPGAAAYMEHGLPPTPRSNVPCAQVTPHHLP
                     THFPGYRAAREQPSQVMPPMNQMPPMAQMSQMPQMSQVPPMTSMPGGSYPPTGIPDGQ
                     FKEPPPGSQDHYYNNQQYNAMYGQPRPNFDYNASMMGQESAPVAGPSMYMRPPPPPTA
                     APPPQGPPPSYPMRQERPMAPILFEVGGNRSHSTGSPMMMQNVMSAHGHGRGRGRGPS
                     RGPSRGPHGHHNAYGQMNQMNQNHGMPFNGQMNQMGQMPPHMRNHQVNHQVNHMGQPM
                     GRGMGMGLMPLPVQMLNPNRMAQPNVQQGMPMNRSTMSPRQYANHNPNAMPSYQPQPE
                     MNKMPPMAEGGAAAGSSTPRVNFAANVANKPRGGTPRNQVGTPRPGAPVPAPVEVATG
                     QQKPIQPSYASMLQ"
     misc_feature    <418..876
                     /gene="LOC108060961"
                     /note="large tegument protein UL36; Provisional; Region:
                     PHA03247"
                     /db_xref="CDD:223021"
     misc_feature    <556..717
                     /gene="LOC108060961"
                     /note="Procyclic acidic repetitive protein (PARP); Region:
                     Trypan_PARP; pfam05887"
                     /db_xref="CDD:368653"
ORIGIN      
        1 gggaggcagt tagattggag ctgcgctagc agcaacatca aaaatatttt aagaaaaaaa
       61 gtacaaaaaa agttttggaa aaaaaacgaa cattttcaat aatttataac gcgttgtttt
      121 gagatacctt taggataaaa gtttttggaa accggtcagg aggtcctgtg ccccctcaga
      181 atgtccgaac cggcagtgcc cgccgaagag ctgtgctcca ccatttcgaa cgcagacatt
      241 tctagctcct ccagcatgtc cctcgcatcc acggccatcg acctcatcca caccggtctg
      301 ccgcttctcg aacagagcgc agccgacctg gagtcgctgc cggagaagca ggatgcggag
      361 gccaagcagg aaccgaaagc aatggaagaa gctgttttgg aaacgatagc agaaccacca
      421 acagatttag ctgcggatgt aacagctgat tccccattgg aaccaccatc ggaacccctt
      481 gcggaaccac tacctgaacc cctaccgaaa actctacagg aacccctgct gaaatgctta
      541 atagaacccc taccacaacc ggtacaggaa cccctaccga aacctcttca ggaatcacta
      601 ccacaacctc taccggaacc actagtgaaa cctgtaccgg aacctctgcc ggaaaataca
      661 acagaacctc tgcaggaacc tctagcggaa aaacaaccac agcccatacc agaaccccta
      721 tcaaatcctt taccaaaagc actaccggaa cccctaccgg aaccacccct accggaaccc
      781 ctaccagaac ctctaccaaa accactacca gaaccactac cagaacctca agaaggtcga
      841 ccagagacgc ccgctggcaa tccggagaac caggcccaag ccgagtcgcc tccgagcatc
      901 atggacatct cgatcagcgc tcagatgtcc ccggacgccc ccgtcttcta tcccatgggc
      961 tcctgtctcg cccgcctgct gaccaacgga ggaggaggag gaggagactc gggacaggac
     1021 agtccaagcg ttccgcggat ctcgccgcca cggagcgcct tcgaatacgg aactggtccg
     1081 tatatcggcc caggaggcga catcccgcgc agctatcagt tcatcgatcc aacggccacc
     1141 ggcgagcgga acttcaacag tttcggactg ggcatggatg accaggagat ggacgaacag
     1201 cagcaggcgg cgcagctggc ggaaacacca ggcgtatcgt gggtgcccta ctatttcggc
     1261 agtccgcgaa cgaggacaac gggcgaggat cctttgtcgc cggaaacggg agcagcgagg
     1321 agcaggaaca acagcaggag ccactgcgaa tgcagaggcg aggcatccga ggtcgggaca
     1381 tccaccgagt tcccatcctt cgagtccttt gccgggcaac tggacgaaat cggtttggcc
     1441 cgcgacgagt tcatggcaac gctgcgcaat ctgctcatcc gggcgaatga cctgttggcg
     1501 ccgctactcc agggtccctt ccagcacatg acgcccatcc agatggccga gtataccagc
     1561 gccattctgg agacgcctcc cctgcatccg agtctcttca tgccggcgaa tgtgccacgt
     1621 ccctggacgg ggctcactgg tctcagcctg ccctacgaca tccatccgca cagcctgacg
     1681 cccgaggact tggcccgcat ccaggcgctg cccaagcccg tggatgccgc cacccagacg
     1741 gagttccgct gcatgtgcat gctactctcg caggcaggat ctgcgccagc atccggcgac
     1801 ctgcgacctc caacgacctc ttacttccca ggacaagcga tgaatggacc acgaatgcgg
     1861 gttcctccgc agccatatgc aaacccggga acttcacaga acggacaggc acagccgcaa
     1921 cctcagcagc cctacggttt ctggggcaga cccatgccga tggcgatgca tccgcaccaa
     1981 gttccgcagc atcgcatgcc ccgacaggcc gccttccagc atccaccgcc gccgcaacgg
     2041 catccacatc cgcaagcgca tccgcagccg catccgcgcc cgggaatgca gcccggtcat
     2101 ccgatggcgc accacaagat gggcggagca gtgcgctacc cggaggctgg taactatcgt
     2161 ggctatcggc tgcctggtcc ctacagtccg cggaatcagg cgagtcgcta cggcgccatc
     2221 ggcagtgatc gtcccgccga gaataatggt aatattaatg gaaggaatca ttaccagaac
     2281 tccggctacc gtggtaatcc actctatgcg agaacagctc tcggcaacaa tcaacagcag
     2341 atggcttcgg ttgctccgaa tatcggtcat ctcggtaatc ccatcggcaa tcctgtccag
     2401 cccatggatc tgggcatgca gctgagattg aatctgagtg cggaggaggc gaatgccagc
     2461 agacaggtgg cctccgactc gtcgcgcacc gagtcctcgg ccaccgagaa caccagctat
     2521 cactcggagc gccgggtgag cgaggagagc agctacgtgg aggcggagga cgaggatggg
     2581 gacagcgaag aggtggactc cgagggggag gaggacgagg tcgagggcgg cgaggacgag
     2641 gatgacgatg atgaggagga agaggatgag aatgagaatg acgaagaaga ggatgaggat
     2701 gaggatgagg aggatgatga gcagccccgt cagctgtctg ctcccgattc ccgttcctac
     2761 tgcttcggtg aggtggagtt catgaactac gtcatgtccg agacgcccat gctgcccaat
     2821 cccggccagc agccaggagc cgccgcctat atggagcacg gcctgccgcc aactccacgc
     2881 tcgaatgtgc cctgtgccca ggtgacgccc caccacctgc ccacccactt tccaggctat
     2941 cgtgctgctc gggagcagcc ttcccaggtc atgcctccaa tgaaccagat gcctcccatg
     3001 gctcaaatgt ctcaaatgcc tcaaatgtcc caagtgcctc ctatgacctc catgccagga
     3061 ggcagttacc cacctaccgg cataccagat ggccagttca aggagccacc gccaggatca
     3121 caggatcatt actacaacaa ccagcaatac aatgccatgt acggccaacc gcgacccaat
     3181 ttcgactaca atgcgagcat gatgggtcag gaatcagctc cggtggcagg acccagcatg
     3241 tacatgagac caccacctcc ccccacagca gcaccaccac cacaaggacc tccaccttcg
     3301 tatcccatgc gacaggagcg cccaatggcc cccattctct tcgaggtggg cggcaatcgc
     3361 agtcatagca ctggttcgcc catgatgatg cagaacgtga tgagtgccca cggacatgga
     3421 cgcggtcgtg gtcgtggtcc cagtcgtggt cctagtcgag gtccccatgg ccaccacaat
     3481 gcctatggcc agatgaatca gatgaatcag aatcatggaa tgcccttcaa tgggcagatg
     3541 aaccaaatgg gacagatgcc gccgcacatg cggaaccatc aagtgaacca ccaggtcaac
     3601 cacatgggcc aaccaatggg caggggaatg ggcatgggcc tgatgcccct gcccgtccag
     3661 atgctcaatc cgaatcgcat ggcccagccg aatgtgcagc agggaatgcc catgaatcgc
     3721 tcgaccatga gtcctcgcca gtatgccaat cacaatccca atgcgatgcc cagctatcag
     3781 cctcagccgg agatgaacaa gatgccaccg atggcggagg gaggagctgc agctggatcc
     3841 tccacacctc gtgtgaattt cgcggccaat gtggcgaaca agccacgtgg tggaactcca
     3901 cgtaaccagg tgggcactcc gagaccggga gctcctgttc ctgcgccggt ggaagtggct
     3961 actggacagc agaaacccat ccaaccttcc tatgcctcca tgctgcagta aaagtgggat
     4021 tttaatagtt ccttccatat gtcctttttt ttctcttatt tttttttggt caaataaatc
     4081 gcgcacaatc attccatatt cata