Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

PREDICTED: Drosophila takahashii histone-lysine N-methyltransferase


LOCUS       XM_070218561            4012 bp    mRNA    linear   INV 09-DEC-2024
            SETD1B-A (LOC108060961), transcript variant X3, mRNA.
ACCESSION   XM_070218561
VERSION     XM_070218561.1
DBLINK      BioProject: PRJNA1194641
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..4012
                     /organism="Drosophila takahashii"
                     /mol_type="mRNA"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     gene            1..4012
                     /gene="LOC108060961"
                     /note="histone-lysine N-methyltransferase SETD1B-A;
                     Derived by automated computational analysis using gene
                     prediction method: Gnomon. Supporting evidence includes
                     similarity to: 2 Proteins"
                     /db_xref="GeneID:108060961"
     CDS             163..3918
                     /gene="LOC108060961"
                     /codon_start=1
                     /product="histone-lysine N-methyltransferase SETD1B-A
                     isoform X2"
                     /protein_id="XP_070074662.1"
                     /db_xref="GeneID:108060961"
                     /translation="MSLASTAIDLIHTGLPLLEQSAADLESLPEKQDAEAKQEPKAME
                     EAVLETIAEPPTDLAADVTADSPLEPPSEPLAEPLPEPLPKTLQEPLLKCLIEPLPQP
                     VQEPLPKPLQESLPQPLPEPLVKPVPEPLPENTTEPLQEPLAEKQPQPIPEPLSNPLP
                     KALPEPLPEPPLPEPLPEPLPKPLPEPLPEPQEGRPETPAGNPENQAQAESPPSIMDI
                     SISAQMSPDAPVFYPMGSCLARLLTNGGGGGGDSGQDSPSVPRISPPRSAFEYGTGPY
                     IGPGGDIPRSYQFIDPTATGERNFNSFGLGMDDQEMDEQQQAAQLAETPGVSWVPYYF
                     GSPRTRTTGEDPLSPETGAARSRNNSRSHCECRGEASEVGTSTEFPSFESFAGQLDEI
                     GLARDEFMATLRNLLIRANDLLAPLLQGPFQHMTPIQMAEYTSAILETPPLHPSLFMP
                     ANVPRPWTGLTGLSLPYDIHPHSLTPEDLARIQALPKPVDAATQTEFRCMCMLLSQAG
                     SAPASGDLRPPTTSYFPGQAMNGPRMRVPPQPYANPGTSQNGQAQPQPQQPYGFWGRP
                     MPMAMHPHQVPQHRMPRQAAFQHPPPPQRHPHPQAHPQPHPRPGMQPGHPMAHHKMGG
                     AVRYPEAGNYRGYRLPGPYSPRNQASRYGAIGSDRPAENNGNINGRNHYQNSGYRGNP
                     LYARTALGNNQQQMASVAPNIGHLGNPIGNPVQPMDLGMQLRLNLSAEEANASRQVAS
                     DSSRTESSATENTSYHSERRVSEESSYVEAEDEDGDSEEVDSEGEEDEVEGGEDEDDD
                     DEEEEDENENDEEEDEDEDEEDDEQPRQLSAPDSRSYCFGEVEFMNYVMSETPMLPNP
                     GQQPGAAAYMEHGLPPTPRSNVPCAQVTPHHLPTHFPGYRAAREQPSQVMPPMNQMPP
                     MAQMSQMPQMSQVPPMTSMPGGSYPPTGIPDGQFKEPPPGSQDHYYNNQQYNAMYGQP
                     RPNFDYNASMMGQESAPVAGPSMYMRPPPPPTAAPPPQGPPPSYPMRQERPMAPILFE
                     VGGNRSHSTGSPMMMQNVMSAHGHGRGRGRGPSRGPSRGPHGHHNAYGQMNQMNQNHG
                     MPFNGQMNQMGQMPPHMRNHQVNHQVNHMGQPMGRGMGMGLMPLPVQMLNPNRMAQPN
                     VQQGMPMNRSTMSPRQYANHNPNAMPSYQPQPEMNKMPPMAEGGAAAGSSTPRVNFAA
                     NVANKPRGGTPRNQVGTPRPGAPVPAPVEVATGQQKPIQPSYASMLQ"
     misc_feature    <325..783
                     /gene="LOC108060961"
                     /note="large tegument protein UL36; Provisional; Region:
                     PHA03247"
                     /db_xref="CDD:223021"
     misc_feature    <463..624
                     /gene="LOC108060961"
                     /note="Procyclic acidic repetitive protein (PARP); Region:
                     Trypan_PARP; pfam05887"
                     /db_xref="CDD:368653"
ORIGIN      
        1 agttagattg gagctgcgct agcagcaaca tcaaaaatat tttaagaaaa aaagtacaaa
       61 aaaagttttg gaaaaaaaac gaacattttc aataatttat aacgcgttgt tttgagatac
      121 ctttaggata aaagtttttg gaaaccggtc agctcctcca gcatgtccct cgcatccacg
      181 gccatcgacc tcatccacac cggtctgccg cttctcgaac agagcgcagc cgacctggag
      241 tcgctgccgg agaagcagga tgcggaggcc aagcaggaac cgaaagcaat ggaagaagct
      301 gttttggaaa cgatagcaga accaccaaca gatttagctg cggatgtaac agctgattcc
      361 ccattggaac caccatcgga accccttgcg gaaccactac ctgaacccct accgaaaact
      421 ctacaggaac ccctgctgaa atgcttaata gaacccctac cacaaccggt acaggaaccc
      481 ctaccgaaac ctcttcagga atcactacca caacctctac cggaaccact agtgaaacct
      541 gtaccggaac ctctgccgga aaatacaaca gaacctctgc aggaacctct agcggaaaaa
      601 caaccacagc ccataccaga acccctatca aatcctttac caaaagcact accggaaccc
      661 ctaccggaac cacccctacc ggaaccccta ccagaacctc taccaaaacc actaccagaa
      721 ccactaccag aacctcaaga aggtcgacca gagacgcccg ctggcaatcc ggagaaccag
      781 gcccaagccg agtcgcctcc gagcatcatg gacatctcga tcagcgctca gatgtccccg
      841 gacgcccccg tcttctatcc catgggctcc tgtctcgccc gcctgctgac caacggagga
      901 ggaggaggag gagactcggg acaggacagt ccaagcgttc cgcggatctc gccgccacgg
      961 agcgccttcg aatacggaac tggtccgtat atcggcccag gaggcgacat cccgcgcagc
     1021 tatcagttca tcgatccaac ggccaccggc gagcggaact tcaacagttt cggactgggc
     1081 atggatgacc aggagatgga cgaacagcag caggcggcgc agctggcgga aacaccaggc
     1141 gtatcgtggg tgccctacta tttcggcagt ccgcgaacga ggacaacggg cgaggatcct
     1201 ttgtcgccgg aaacgggagc agcgaggagc aggaacaaca gcaggagcca ctgcgaatgc
     1261 agaggcgagg catccgaggt cgggacatcc accgagttcc catccttcga gtcctttgcc
     1321 gggcaactgg acgaaatcgg tttggcccgc gacgagttca tggcaacgct gcgcaatctg
     1381 ctcatccggg cgaatgacct gttggcgccg ctactccagg gtcccttcca gcacatgacg
     1441 cccatccaga tggccgagta taccagcgcc attctggaga cgcctcccct gcatccgagt
     1501 ctcttcatgc cggcgaatgt gccacgtccc tggacggggc tcactggtct cagcctgccc
     1561 tacgacatcc atccgcacag cctgacgccc gaggacttgg cccgcatcca ggcgctgccc
     1621 aagcccgtgg atgccgccac ccagacggag ttccgctgca tgtgcatgct actctcgcag
     1681 gcaggatctg cgccagcatc cggcgacctg cgacctccaa cgacctctta cttcccagga
     1741 caagcgatga atggaccacg aatgcgggtt cctccgcagc catatgcaaa cccgggaact
     1801 tcacagaacg gacaggcaca gccgcaacct cagcagccct acggtttctg gggcagaccc
     1861 atgccgatgg cgatgcatcc gcaccaagtt ccgcagcatc gcatgccccg acaggccgcc
     1921 ttccagcatc caccgccgcc gcaacggcat ccacatccgc aagcgcatcc gcagccgcat
     1981 ccgcgcccgg gaatgcagcc cggtcatccg atggcgcacc acaagatggg cggagcagtg
     2041 cgctacccgg aggctggtaa ctatcgtggc tatcggctgc ctggtcccta cagtccgcgg
     2101 aatcaggcga gtcgctacgg cgccatcggc agtgatcgtc ccgccgagaa taatggtaat
     2161 attaatggaa ggaatcatta ccagaactcc ggctaccgtg gtaatccact ctatgcgaga
     2221 acagctctcg gcaacaatca acagcagatg gcttcggttg ctccgaatat cggtcatctc
     2281 ggtaatccca tcggcaatcc tgtccagccc atggatctgg gcatgcagct gagattgaat
     2341 ctgagtgcgg aggaggcgaa tgccagcaga caggtggcct ccgactcgtc gcgcaccgag
     2401 tcctcggcca ccgagaacac cagctatcac tcggagcgcc gggtgagcga ggagagcagc
     2461 tacgtggagg cggaggacga ggatggggac agcgaagagg tggactccga gggggaggag
     2521 gacgaggtcg agggcggcga ggacgaggat gacgatgatg aggaggaaga ggatgagaat
     2581 gagaatgacg aagaagagga tgaggatgag gatgaggagg atgatgagca gccccgtcag
     2641 ctgtctgctc ccgattcccg ttcctactgc ttcggtgagg tggagttcat gaactacgtc
     2701 atgtccgaga cgcccatgct gcccaatccc ggccagcagc caggagccgc cgcctatatg
     2761 gagcacggcc tgccgccaac tccacgctcg aatgtgccct gtgcccaggt gacgccccac
     2821 cacctgccca cccactttcc aggctatcgt gctgctcggg agcagccttc ccaggtcatg
     2881 cctccaatga accagatgcc tcccatggct caaatgtctc aaatgcctca aatgtcccaa
     2941 gtgcctccta tgacctccat gccaggaggc agttacccac ctaccggcat accagatggc
     3001 cagttcaagg agccaccgcc aggatcacag gatcattact acaacaacca gcaatacaat
     3061 gccatgtacg gccaaccgcg acccaatttc gactacaatg cgagcatgat gggtcaggaa
     3121 tcagctccgg tggcaggacc cagcatgtac atgagaccac cacctccccc cacagcagca
     3181 ccaccaccac aaggacctcc accttcgtat cccatgcgac aggagcgccc aatggccccc
     3241 attctcttcg aggtgggcgg caatcgcagt catagcactg gttcgcccat gatgatgcag
     3301 aacgtgatga gtgcccacgg acatggacgc ggtcgtggtc gtggtcccag tcgtggtcct
     3361 agtcgaggtc cccatggcca ccacaatgcc tatggccaga tgaatcagat gaatcagaat
     3421 catggaatgc ccttcaatgg gcagatgaac caaatgggac agatgccgcc gcacatgcgg
     3481 aaccatcaag tgaaccacca ggtcaaccac atgggccaac caatgggcag gggaatgggc
     3541 atgggcctga tgcccctgcc cgtccagatg ctcaatccga atcgcatggc ccagccgaat
     3601 gtgcagcagg gaatgcccat gaatcgctcg accatgagtc ctcgccagta tgccaatcac
     3661 aatcccaatg cgatgcccag ctatcagcct cagccggaga tgaacaagat gccaccgatg
     3721 gcggagggag gagctgcagc tggatcctcc acacctcgtg tgaatttcgc ggccaatgtg
     3781 gcgaacaagc cacgtggtgg aactccacgt aaccaggtgg gcactccgag accgggagct
     3841 cctgttcctg cgccggtgga agtggctact ggacagcaga aacccatcca accttcctat
     3901 gcctccatgc tgcagtaaaa gtgggatttt aatagttcct tccatatgtc cttttttttc
     3961 tcttattttt ttttggtcaa ataaatcgcg cacaatcatt ccatattcat at