Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]
LOCUS XM_070218560 4104 bp mRNA linear INV 09-DEC-2024 SETD1B-A (LOC108060961), transcript variant X1, mRNA. ACCESSION XM_070218560 VERSION XM_070218560.1 DBLINK BioProject: PRJNA1194641 KEYWORDS RefSeq. SOURCE Drosophila takahashii ORGANISM Drosophila takahashii Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora. COMMENT MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NC_091683) annotated using gene prediction method: Gnomon. Also see: Documentation of NCBI's Annotation Process ##Genome-Annotation-Data-START## Annotation Provider :: NCBI RefSeq Annotation Status :: Full annotation Annotation Name :: GCF_030179915.1-RS_2024_12 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 10.3 Annotation Method :: Gnomon; cmsearch; tRNAscan-SE Features Annotated :: Gene; mRNA; CDS; ncRNA Annotation Date :: 12/07/2024 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..4104 /organism="Drosophila takahashii" /mol_type="mRNA" /strain="IR98-3 E-12201" /db_xref="taxon:29030" /chromosome="X" /sex="female" /tissue_type="Whole fly" /dev_stage="Adult fly" /collected_by="Originally obtained from EHIME-Fly" gene 1..4104 /gene="LOC108060961" /note="histone-lysine N-methyltransferase SETD1B-A; Derived by automated computational analysis using gene prediction method: Gnomon. Supporting evidence includes similarity to: 2 Proteins" /db_xref="GeneID:108060961" CDS 181..4011 /gene="LOC108060961" /codon_start=1 /product="histone-lysine N-methyltransferase SETD1B-A isoform X1" /protein_id="XP_070074661.1" /db_xref="GeneID:108060961" /translation="MSEPAVPAEELCSTISNADISSSSSMSLASTAIDLIHTGLPLLE QSAADLESLPEKQDAEAKQEPKAMEEAVLETIAEPPTDLAADVTADSPLEPPSEPLAE PLPEPLPKTLQEPLLKCLIEPLPQPVQEPLPKPLQESLPQPLPEPLVKPVPEPLPENT TEPLQEPLAEKQPQPIPEPLSNPLPKALPEPLPEPPLPEPLPEPLPKPLPEPLPEPQE GRPETPAGNPENQAQAESPPSIMDISISAQMSPDAPVFYPMGSCLARLLTNGGGGGGD SGQDSPSVPRISPPRSAFEYGTGPYIGPGGDIPRSYQFIDPTATGERNFNSFGLGMDD QEMDEQQQAAQLAETPGVSWVPYYFGSPRTRTTGEDPLSPETGAARSRNNSRSHCECR GEASEVGTSTEFPSFESFAGQLDEIGLARDEFMATLRNLLIRANDLLAPLLQGPFQHM TPIQMAEYTSAILETPPLHPSLFMPANVPRPWTGLTGLSLPYDIHPHSLTPEDLARIQ ALPKPVDAATQTEFRCMCMLLSQAGSAPASGDLRPPTTSYFPGQAMNGPRMRVPPQPY ANPGTSQNGQAQPQPQQPYGFWGRPMPMAMHPHQVPQHRMPRQAAFQHPPPPQRHPHP QAHPQPHPRPGMQPGHPMAHHKMGGAVRYPEAGNYRGYRLPGPYSPRNQASRYGAIGS DRPAENNGNINGRNHYQNSGYRGNPLYARTALGNNQQQMASVAPNIGHLGNPIGNPVQ PMDLGMQLRLNLSAEEANASRQVASDSSRTESSATENTSYHSERRVSEESSYVEAEDE DGDSEEVDSEGEEDEVEGGEDEDDDDEEEEDENENDEEEDEDEDEEDDEQPRQLSAPD SRSYCFGEVEFMNYVMSETPMLPNPGQQPGAAAYMEHGLPPTPRSNVPCAQVTPHHLP THFPGYRAAREQPSQVMPPMNQMPPMAQMSQMPQMSQVPPMTSMPGGSYPPTGIPDGQ FKEPPPGSQDHYYNNQQYNAMYGQPRPNFDYNASMMGQESAPVAGPSMYMRPPPPPTA APPPQGPPPSYPMRQERPMAPILFEVGGNRSHSTGSPMMMQNVMSAHGHGRGRGRGPS RGPSRGPHGHHNAYGQMNQMNQNHGMPFNGQMNQMGQMPPHMRNHQVNHQVNHMGQPM GRGMGMGLMPLPVQMLNPNRMAQPNVQQGMPMNRSTMSPRQYANHNPNAMPSYQPQPE MNKMPPMAEGGAAAGSSTPRVNFAANVANKPRGGTPRNQVGTPRPGAPVPAPVEVATG QQKPIQPSYASMLQ" misc_feature <418..876 /gene="LOC108060961" /note="large tegument protein UL36; Provisional; Region: PHA03247" /db_xref="CDD:223021" misc_feature <556..717 /gene="LOC108060961" /note="Procyclic acidic repetitive protein (PARP); Region: Trypan_PARP; pfam05887" /db_xref="CDD:368653" ORIGIN 1 gggaggcagt tagattggag ctgcgctagc agcaacatca aaaatatttt aagaaaaaaa 61 gtacaaaaaa agttttggaa aaaaaacgaa cattttcaat aatttataac gcgttgtttt 121 gagatacctt taggataaaa gtttttggaa accggtcagg aggtcctgtg ccccctcaga 181 atgtccgaac cggcagtgcc cgccgaagag ctgtgctcca ccatttcgaa cgcagacatt 241 tctagctcct ccagcatgtc cctcgcatcc acggccatcg acctcatcca caccggtctg 301 ccgcttctcg aacagagcgc agccgacctg gagtcgctgc cggagaagca ggatgcggag 361 gccaagcagg aaccgaaagc aatggaagaa gctgttttgg aaacgatagc agaaccacca 421 acagatttag ctgcggatgt aacagctgat tccccattgg aaccaccatc ggaacccctt 481 gcggaaccac tacctgaacc cctaccgaaa actctacagg aacccctgct gaaatgctta 541 atagaacccc taccacaacc ggtacaggaa cccctaccga aacctcttca ggaatcacta 601 ccacaacctc taccggaacc actagtgaaa cctgtaccgg aacctctgcc ggaaaataca 661 acagaacctc tgcaggaacc tctagcggaa aaacaaccac agcccatacc agaaccccta 721 tcaaatcctt taccaaaagc actaccggaa cccctaccgg aaccacccct accggaaccc 781 ctaccagaac ctctaccaaa accactacca gaaccactac cagaacctca agaaggtcga 841 ccagagacgc ccgctggcaa tccggagaac caggcccaag ccgagtcgcc tccgagcatc 901 atggacatct cgatcagcgc tcagatgtcc ccggacgccc ccgtcttcta tcccatgggc 961 tcctgtctcg cccgcctgct gaccaacgga ggaggaggag gaggagactc gggacaggac 1021 agtccaagcg ttccgcggat ctcgccgcca cggagcgcct tcgaatacgg aactggtccg 1081 tatatcggcc caggaggcga catcccgcgc agctatcagt tcatcgatcc aacggccacc 1141 ggcgagcgga acttcaacag tttcggactg ggcatggatg accaggagat ggacgaacag 1201 cagcaggcgg cgcagctggc ggaaacacca ggcgtatcgt gggtgcccta ctatttcggc 1261 agtccgcgaa cgaggacaac gggcgaggat cctttgtcgc cggaaacggg agcagcgagg 1321 agcaggaaca acagcaggag ccactgcgaa tgcagaggcg aggcatccga ggtcgggaca 1381 tccaccgagt tcccatcctt cgagtccttt gccgggcaac tggacgaaat cggtttggcc 1441 cgcgacgagt tcatggcaac gctgcgcaat ctgctcatcc gggcgaatga cctgttggcg 1501 ccgctactcc agggtccctt ccagcacatg acgcccatcc agatggccga gtataccagc 1561 gccattctgg agacgcctcc cctgcatccg agtctcttca tgccggcgaa tgtgccacgt 1621 ccctggacgg ggctcactgg tctcagcctg ccctacgaca tccatccgca cagcctgacg 1681 cccgaggact tggcccgcat ccaggcgctg cccaagcccg tggatgccgc cacccagacg 1741 gagttccgct gcatgtgcat gctactctcg caggcaggat ctgcgccagc atccggcgac 1801 ctgcgacctc caacgacctc ttacttccca ggacaagcga tgaatggacc acgaatgcgg 1861 gttcctccgc agccatatgc aaacccggga acttcacaga acggacaggc acagccgcaa 1921 cctcagcagc cctacggttt ctggggcaga cccatgccga tggcgatgca tccgcaccaa 1981 gttccgcagc atcgcatgcc ccgacaggcc gccttccagc atccaccgcc gccgcaacgg 2041 catccacatc cgcaagcgca tccgcagccg catccgcgcc cgggaatgca gcccggtcat 2101 ccgatggcgc accacaagat gggcggagca gtgcgctacc cggaggctgg taactatcgt 2161 ggctatcggc tgcctggtcc ctacagtccg cggaatcagg cgagtcgcta cggcgccatc 2221 ggcagtgatc gtcccgccga gaataatggt aatattaatg gaaggaatca ttaccagaac 2281 tccggctacc gtggtaatcc actctatgcg agaacagctc tcggcaacaa tcaacagcag 2341 atggcttcgg ttgctccgaa tatcggtcat ctcggtaatc ccatcggcaa tcctgtccag 2401 cccatggatc tgggcatgca gctgagattg aatctgagtg cggaggaggc gaatgccagc 2461 agacaggtgg cctccgactc gtcgcgcacc gagtcctcgg ccaccgagaa caccagctat 2521 cactcggagc gccgggtgag cgaggagagc agctacgtgg aggcggagga cgaggatggg 2581 gacagcgaag aggtggactc cgagggggag gaggacgagg tcgagggcgg cgaggacgag 2641 gatgacgatg atgaggagga agaggatgag aatgagaatg acgaagaaga ggatgaggat 2701 gaggatgagg aggatgatga gcagccccgt cagctgtctg ctcccgattc ccgttcctac 2761 tgcttcggtg aggtggagtt catgaactac gtcatgtccg agacgcccat gctgcccaat 2821 cccggccagc agccaggagc cgccgcctat atggagcacg gcctgccgcc aactccacgc 2881 tcgaatgtgc cctgtgccca ggtgacgccc caccacctgc ccacccactt tccaggctat 2941 cgtgctgctc gggagcagcc ttcccaggtc atgcctccaa tgaaccagat gcctcccatg 3001 gctcaaatgt ctcaaatgcc tcaaatgtcc caagtgcctc ctatgacctc catgccagga 3061 ggcagttacc cacctaccgg cataccagat ggccagttca aggagccacc gccaggatca 3121 caggatcatt actacaacaa ccagcaatac aatgccatgt acggccaacc gcgacccaat 3181 ttcgactaca atgcgagcat gatgggtcag gaatcagctc cggtggcagg acccagcatg 3241 tacatgagac caccacctcc ccccacagca gcaccaccac cacaaggacc tccaccttcg 3301 tatcccatgc gacaggagcg cccaatggcc cccattctct tcgaggtggg cggcaatcgc 3361 agtcatagca ctggttcgcc catgatgatg cagaacgtga tgagtgccca cggacatgga 3421 cgcggtcgtg gtcgtggtcc cagtcgtggt cctagtcgag gtccccatgg ccaccacaat 3481 gcctatggcc agatgaatca gatgaatcag aatcatggaa tgcccttcaa tgggcagatg 3541 aaccaaatgg gacagatgcc gccgcacatg cggaaccatc aagtgaacca ccaggtcaac 3601 cacatgggcc aaccaatggg caggggaatg ggcatgggcc tgatgcccct gcccgtccag 3661 atgctcaatc cgaatcgcat ggcccagccg aatgtgcagc agggaatgcc catgaatcgc 3721 tcgaccatga gtcctcgcca gtatgccaat cacaatccca atgcgatgcc cagctatcag 3781 cctcagccgg agatgaacaa gatgccaccg atggcggagg gaggagctgc agctggatcc 3841 tccacacctc gtgtgaattt cgcggccaat gtggcgaaca agccacgtgg tggaactcca 3901 cgtaaccagg tgggcactcc gagaccggga gctcctgttc ctgcgccggt ggaagtggct 3961 actggacagc agaaacccat ccaaccttcc tatgcctcca tgctgcagta aaagtgggat 4021 tttaatagtt ccttccatat gtcctttttt ttctcttatt tttttttggt caaataaatc 4081 gcgcacaatc attccatatt cata