Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

PREDICTED: Drosophila takahashii histone-lysine N-methyltransferase


LOCUS       XM_044396137            4084 bp    mRNA    linear   INV 09-DEC-2024
            SETD1B-A (LOC108060961), transcript variant X2, mRNA.
ACCESSION   XM_044396137
VERSION     XM_044396137.2
DBLINK      BioProject: PRJNA1194641
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            On Dec 9, 2024 this sequence version replaced XM_044396137.1.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..4084
                     /organism="Drosophila takahashii"
                     /mol_type="mRNA"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     gene            1..4084
                     /gene="LOC108060961"
                     /note="histone-lysine N-methyltransferase SETD1B-A;
                     Derived by automated computational analysis using gene
                     prediction method: Gnomon. Supporting evidence includes
                     similarity to: 2 Proteins"
                     /db_xref="GeneID:108060961"
     CDS             160..3990
                     /gene="LOC108060961"
                     /codon_start=1
                     /product="histone-lysine N-methyltransferase SETD1B-A
                     isoform X1"
                     /protein_id="XP_044252072.1"
                     /db_xref="GeneID:108060961"
                     /translation="MSEPAVPAEELCSTISNADISSSSSMSLASTAIDLIHTGLPLLE
                     QSAADLESLPEKQDAEAKQEPKAMEEAVLETIAEPPTDLAADVTADSPLEPPSEPLAE
                     PLPEPLPKTLQEPLLKCLIEPLPQPVQEPLPKPLQESLPQPLPEPLVKPVPEPLPENT
                     TEPLQEPLAEKQPQPIPEPLSNPLPKALPEPLPEPPLPEPLPEPLPKPLPEPLPEPQE
                     GRPETPAGNPENQAQAESPPSIMDISISAQMSPDAPVFYPMGSCLARLLTNGGGGGGD
                     SGQDSPSVPRISPPRSAFEYGTGPYIGPGGDIPRSYQFIDPTATGERNFNSFGLGMDD
                     QEMDEQQQAAQLAETPGVSWVPYYFGSPRTRTTGEDPLSPETGAARSRNNSRSHCECR
                     GEASEVGTSTEFPSFESFAGQLDEIGLARDEFMATLRNLLIRANDLLAPLLQGPFQHM
                     TPIQMAEYTSAILETPPLHPSLFMPANVPRPWTGLTGLSLPYDIHPHSLTPEDLARIQ
                     ALPKPVDAATQTEFRCMCMLLSQAGSAPASGDLRPPTTSYFPGQAMNGPRMRVPPQPY
                     ANPGTSQNGQAQPQPQQPYGFWGRPMPMAMHPHQVPQHRMPRQAAFQHPPPPQRHPHP
                     QAHPQPHPRPGMQPGHPMAHHKMGGAVRYPEAGNYRGYRLPGPYSPRNQASRYGAIGS
                     DRPAENNGNINGRNHYQNSGYRGNPLYARTALGNNQQQMASVAPNIGHLGNPIGNPVQ
                     PMDLGMQLRLNLSAEEANASRQVASDSSRTESSATENTSYHSERRVSEESSYVEAEDE
                     DGDSEEVDSEGEEDEVEGGEDEDDDDEEEEDENENDEEEDEDEDEEDDEQPRQLSAPD
                     SRSYCFGEVEFMNYVMSETPMLPNPGQQPGAAAYMEHGLPPTPRSNVPCAQVTPHHLP
                     THFPGYRAAREQPSQVMPPMNQMPPMAQMSQMPQMSQVPPMTSMPGGSYPPTGIPDGQ
                     FKEPPPGSQDHYYNNQQYNAMYGQPRPNFDYNASMMGQESAPVAGPSMYMRPPPPPTA
                     APPPQGPPPSYPMRQERPMAPILFEVGGNRSHSTGSPMMMQNVMSAHGHGRGRGRGPS
                     RGPSRGPHGHHNAYGQMNQMNQNHGMPFNGQMNQMGQMPPHMRNHQVNHQVNHMGQPM
                     GRGMGMGLMPLPVQMLNPNRMAQPNVQQGMPMNRSTMSPRQYANHNPNAMPSYQPQPE
                     MNKMPPMAEGGAAAGSSTPRVNFAANVANKPRGGTPRNQVGTPRPGAPVPAPVEVATG
                     QQKPIQPSYASMLQ"
     misc_feature    <397..855
                     /gene="LOC108060961"
                     /note="large tegument protein UL36; Provisional; Region:
                     PHA03247"
                     /db_xref="CDD:223021"
     misc_feature    <535..696
                     /gene="LOC108060961"
                     /note="Procyclic acidic repetitive protein (PARP); Region:
                     Trypan_PARP; pfam05887"
                     /db_xref="CDD:368653"
ORIGIN      
        1 ggaggcagtt agattggagc tgcgctagca gcaacatcaa aaatatttta agaaaaaaag
       61 tacaaaaaaa gttttggaaa aaaaacgaac attttcaata atttataacg cgttgttttg
      121 agataccttt aggataaaag tttttggaaa ccggtcagaa tgtccgaacc ggcagtgccc
      181 gccgaagagc tgtgctccac catttcgaac gcagacattt ctagctcctc cagcatgtcc
      241 ctcgcatcca cggccatcga cctcatccac accggtctgc cgcttctcga acagagcgca
      301 gccgacctgg agtcgctgcc ggagaagcag gatgcggagg ccaagcagga accgaaagca
      361 atggaagaag ctgttttgga aacgatagca gaaccaccaa cagatttagc tgcggatgta
      421 acagctgatt ccccattgga accaccatcg gaaccccttg cggaaccact acctgaaccc
      481 ctaccgaaaa ctctacagga acccctgctg aaatgcttaa tagaacccct accacaaccg
      541 gtacaggaac ccctaccgaa acctcttcag gaatcactac cacaacctct accggaacca
      601 ctagtgaaac ctgtaccgga acctctgccg gaaaatacaa cagaacctct gcaggaacct
      661 ctagcggaaa aacaaccaca gcccatacca gaacccctat caaatccttt accaaaagca
      721 ctaccggaac ccctaccgga accaccccta ccggaacccc taccagaacc tctaccaaaa
      781 ccactaccag aaccactacc agaacctcaa gaaggtcgac cagagacgcc cgctggcaat
      841 ccggagaacc aggcccaagc cgagtcgcct ccgagcatca tggacatctc gatcagcgct
      901 cagatgtccc cggacgcccc cgtcttctat cccatgggct cctgtctcgc ccgcctgctg
      961 accaacggag gaggaggagg aggagactcg ggacaggaca gtccaagcgt tccgcggatc
     1021 tcgccgccac ggagcgcctt cgaatacgga actggtccgt atatcggccc aggaggcgac
     1081 atcccgcgca gctatcagtt catcgatcca acggccaccg gcgagcggaa cttcaacagt
     1141 ttcggactgg gcatggatga ccaggagatg gacgaacagc agcaggcggc gcagctggcg
     1201 gaaacaccag gcgtatcgtg ggtgccctac tatttcggca gtccgcgaac gaggacaacg
     1261 ggcgaggatc ctttgtcgcc ggaaacggga gcagcgagga gcaggaacaa cagcaggagc
     1321 cactgcgaat gcagaggcga ggcatccgag gtcgggacat ccaccgagtt cccatccttc
     1381 gagtcctttg ccgggcaact ggacgaaatc ggtttggccc gcgacgagtt catggcaacg
     1441 ctgcgcaatc tgctcatccg ggcgaatgac ctgttggcgc cgctactcca gggtcccttc
     1501 cagcacatga cgcccatcca gatggccgag tataccagcg ccattctgga gacgcctccc
     1561 ctgcatccga gtctcttcat gccggcgaat gtgccacgtc cctggacggg gctcactggt
     1621 ctcagcctgc cctacgacat ccatccgcac agcctgacgc ccgaggactt ggcccgcatc
     1681 caggcgctgc ccaagcccgt ggatgccgcc acccagacgg agttccgctg catgtgcatg
     1741 ctactctcgc aggcaggatc tgcgccagca tccggcgacc tgcgacctcc aacgacctct
     1801 tacttcccag gacaagcgat gaatggacca cgaatgcggg ttcctccgca gccatatgca
     1861 aacccgggaa cttcacagaa cggacaggca cagccgcaac ctcagcagcc ctacggtttc
     1921 tggggcagac ccatgccgat ggcgatgcat ccgcaccaag ttccgcagca tcgcatgccc
     1981 cgacaggccg ccttccagca tccaccgccg ccgcaacggc atccacatcc gcaagcgcat
     2041 ccgcagccgc atccgcgccc gggaatgcag cccggtcatc cgatggcgca ccacaagatg
     2101 ggcggagcag tgcgctaccc ggaggctggt aactatcgtg gctatcggct gcctggtccc
     2161 tacagtccgc ggaatcaggc gagtcgctac ggcgccatcg gcagtgatcg tcccgccgag
     2221 aataatggta atattaatgg aaggaatcat taccagaact ccggctaccg tggtaatcca
     2281 ctctatgcga gaacagctct cggcaacaat caacagcaga tggcttcggt tgctccgaat
     2341 atcggtcatc tcggtaatcc catcggcaat cctgtccagc ccatggatct gggcatgcag
     2401 ctgagattga atctgagtgc ggaggaggcg aatgccagca gacaggtggc ctccgactcg
     2461 tcgcgcaccg agtcctcggc caccgagaac accagctatc actcggagcg ccgggtgagc
     2521 gaggagagca gctacgtgga ggcggaggac gaggatgggg acagcgaaga ggtggactcc
     2581 gagggggagg aggacgaggt cgagggcggc gaggacgagg atgacgatga tgaggaggaa
     2641 gaggatgaga atgagaatga cgaagaagag gatgaggatg aggatgagga ggatgatgag
     2701 cagccccgtc agctgtctgc tcccgattcc cgttcctact gcttcggtga ggtggagttc
     2761 atgaactacg tcatgtccga gacgcccatg ctgcccaatc ccggccagca gccaggagcc
     2821 gccgcctata tggagcacgg cctgccgcca actccacgct cgaatgtgcc ctgtgcccag
     2881 gtgacgcccc accacctgcc cacccacttt ccaggctatc gtgctgctcg ggagcagcct
     2941 tcccaggtca tgcctccaat gaaccagatg cctcccatgg ctcaaatgtc tcaaatgcct
     3001 caaatgtccc aagtgcctcc tatgacctcc atgccaggag gcagttaccc acctaccggc
     3061 ataccagatg gccagttcaa ggagccaccg ccaggatcac aggatcatta ctacaacaac
     3121 cagcaataca atgccatgta cggccaaccg cgacccaatt tcgactacaa tgcgagcatg
     3181 atgggtcagg aatcagctcc ggtggcagga cccagcatgt acatgagacc accacctccc
     3241 cccacagcag caccaccacc acaaggacct ccaccttcgt atcccatgcg acaggagcgc
     3301 ccaatggccc ccattctctt cgaggtgggc ggcaatcgca gtcatagcac tggttcgccc
     3361 atgatgatgc agaacgtgat gagtgcccac ggacatggac gcggtcgtgg tcgtggtccc
     3421 agtcgtggtc ctagtcgagg tccccatggc caccacaatg cctatggcca gatgaatcag
     3481 atgaatcaga atcatggaat gcccttcaat gggcagatga accaaatggg acagatgccg
     3541 ccgcacatgc ggaaccatca agtgaaccac caggtcaacc acatgggcca accaatgggc
     3601 aggggaatgg gcatgggcct gatgcccctg cccgtccaga tgctcaatcc gaatcgcatg
     3661 gcccagccga atgtgcagca gggaatgccc atgaatcgct cgaccatgag tcctcgccag
     3721 tatgccaatc acaatcccaa tgcgatgccc agctatcagc ctcagccgga gatgaacaag
     3781 atgccaccga tggcggaggg aggagctgca gctggatcct ccacacctcg tgtgaatttc
     3841 gcggccaatg tggcgaacaa gccacgtggt ggaactccac gtaaccaggt gggcactccg
     3901 agaccgggag ctcctgttcc tgcgccggtg gaagtggcta ctggacagca gaaacccatc
     3961 caaccttcct atgcctccat gctgcagtaa aagtgggatt ttaatagttc cttccatatg
     4021 tccttttttt tctcttattt ttttttggtc aaataaatcg cgcacaatca ttccatattc
     4081 atat