LOCUS       XM_044396137            4084 bp    mRNA    linear   INV 09-DEC-2024
DEFINITION  SETD1B-A (LOC108060961), transcript variant X2, mRNA.
ACCESSION   XM_044396137
VERSION     XM_044396137.2
DBLINK      BioProject: PRJNA1194641
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process

            On Dec 9, 2024 this sequence version replaced XM_044396137.1.

            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..4084
                     /organism="Drosophila takahashii"
                     /mol_type="mRNA"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     gene            1..4084
                     /gene="LOC108060961"
                     /note="histone-lysine N-methyltransferase SETD1B-A;
                     Derived by automated computational analysis using gene
                     prediction method: Gnomon. Supporting evidence includes
                     similarity to: 2 Proteins"
                     /db_xref="GeneID:108060961"
     CDS             160..3990
                     /gene="LOC108060961"
                     /codon_start=1
                     /product="histone-lysine N-methyltransferase SETD1B-A
                     isoform X1"
                     /protein_id="XP_044252072.1"
                     /db_xref="GeneID:108060961"
                     /translation="MSEPAVPAEELCSTISNADISSSSSMSLASTAIDLIHTGLPLLE
                     QSAADLESLPEKQDAEAKQEPKAMEEAVLETIAEPPTDLAADVTADSPLEPPSEPLAE
                     PLPEPLPKTLQEPLLKCLIEPLPQPVQEPLPKPLQESLPQPLPEPLVKPVPEPLPENT
                     TEPLQEPLAEKQPQPIPEPLSNPLPKALPEPLPEPPLPEPLPEPLPKPLPEPLPEPQE
                     GRPETPAGNPENQAQAESPPSIMDISISAQMSPDAPVFYPMGSCLARLLTNGGGGGGD
                     SGQDSPSVPRISPPRSAFEYGTGPYIGPGGDIPRSYQFIDPTATGERNFNSFGLGMDD
                     QEMDEQQQAAQLAETPGVSWVPYYFGSPRTRTTGEDPLSPETGAARSRNNSRSHCECR
                     GEASEVGTSTEFPSFESFAGQLDEIGLARDEFMATLRNLLIRANDLLAPLLQGPFQHM
                     TPIQMAEYTSAILETPPLHPSLFMPANVPRPWTGLTGLSLPYDIHPHSLTPEDLARIQ
                     ALPKPVDAATQTEFRCMCMLLSQAGSAPASGDLRPPTTSYFPGQAMNGPRMRVPPQPY
                     ANPGTSQNGQAQPQPQQPYGFWGRPMPMAMHPHQVPQHRMPRQAAFQHPPPPQRHPHP
                     QAHPQPHPRPGMQPGHPMAHHKMGGAVRYPEAGNYRGYRLPGPYSPRNQASRYGAIGS
                     DRPAENNGNINGRNHYQNSGYRGNPLYARTALGNNQQQMASVAPNIGHLGNPIGNPVQ
                     PMDLGMQLRLNLSAEEANASRQVASDSSRTESSATENTSYHSERRVSEESSYVEAEDE
                     DGDSEEVDSEGEEDEVEGGEDEDDDDEEEEDENENDEEEDEDEDEEDDEQPRQLSAPD
                     SRSYCFGEVEFMNYVMSETPMLPNPGQQPGAAAYMEHGLPPTPRSNVPCAQVTPHHLP
                     THFPGYRAAREQPSQVMPPMNQMPPMAQMSQMPQMSQVPPMTSMPGGSYPPTGIPDGQ
                     FKEPPPGSQDHYYNNQQYNAMYGQPRPNFDYNASMMGQESAPVAGPSMYMRPPPPPTA
                     APPPQGPPPSYPMRQERPMAPILFEVGGNRSHSTGSPMMMQNVMSAHGHGRGRGRGPS
                     RGPSRGPHGHHNAYGQMNQMNQNHGMPFNGQMNQMGQMPPHMRNHQVNHQVNHMGQPM
                     GRGMGMGLMPLPVQMLNPNRMAQPNVQQGMPMNRSTMSPRQYANHNPNAMPSYQPQPE
                     MNKMPPMAEGGAAAGSSTPRVNFAANVANKPRGGTPRNQVGTPRPGAPVPAPVEVATG
                     QQKPIQPSYASMLQ"
     misc_feature    <397..855
                     /gene="LOC108060961"
                     /note="large tegument protein UL36; Provisional; Region:
                     PHA03247"
                     /db_xref="CDD:223021"
     misc_feature    <535..696
                     /gene="LOC108060961"
                     /note="Procyclic acidic repetitive protein (PARP); Region:
                     Trypan_PARP; pfam05887"
                     /db_xref="CDD:368653"
ORIGIN
        1 ggaggcagtt agattggagc tgcgctagca gcaacatcaa aaatatttta agaaaaaaag
       61 tacaaaaaaa gttttggaaa aaaaacgaac attttcaata atttataacg cgttgttttg
      121 agataccttt aggataaaag tttttggaaa ccggtcagaa tgtccgaacc ggcagtgccc
      181 gccgaagagc tgtgctccac catttcgaac gcagacattt ctagctcctc cagcatgtcc
      241 ctcgcatcca cggccatcga cctcatccac accggtctgc cgcttctcga acagagcgca
      301 gccgacctgg agtcgctgcc ggagaagcag gatgcggagg ccaagcagga accgaaagca
      361 atggaagaag ctgttttgga aacgatagca gaaccaccaa cagatttagc tgcggatgta
      421 acagctgatt ccccattgga accaccatcg gaaccccttg cggaaccact acctgaaccc
      481 ctaccgaaaa ctctacagga acccctgctg aaatgcttaa tagaacccct accacaaccg
      541 gtacaggaac ccctaccgaa acctcttcag gaatcactac cacaacctct accggaacca
      601 ctagtgaaac ctgtaccgga acctctgccg gaaaatacaa cagaacctct gcaggaacct
      661 ctagcggaaa aacaaccaca gcccatacca gaacccctat caaatccttt accaaaagca
      721 ctaccggaac ccctaccgga accaccccta ccggaacccc taccagaacc tctaccaaaa
      781 ccactaccag aaccactacc agaacctcaa gaaggtcgac cagagacgcc cgctggcaat
      841 ccggagaacc aggcccaagc cgagtcgcct ccgagcatca tggacatctc gatcagcgct
      901 cagatgtccc cggacgcccc cgtcttctat cccatgggct cctgtctcgc ccgcctgctg
      961 accaacggag gaggaggagg aggagactcg ggacaggaca gtccaagcgt tccgcggatc
     1021 tcgccgccac ggagcgcctt cgaatacgga actggtccgt atatcggccc aggaggcgac
     1081 atcccgcgca gctatcagtt catcgatcca acggccaccg gcgagcggaa cttcaacagt
     1141 ttcggactgg gcatggatga ccaggagatg gacgaacagc agcaggcggc gcagctggcg
     1201 gaaacaccag gcgtatcgtg ggtgccctac tatttcggca gtccgcgaac gaggacaacg
     1261 ggcgaggatc ctttgtcgcc ggaaacggga gcagcgagga gcaggaacaa cagcaggagc
     1321 cactgcgaat gcagaggcga ggcatccgag gtcgggacat ccaccgagtt cccatccttc
     1381 gagtcctttg ccgggcaact ggacgaaatc ggtttggccc gcgacgagtt catggcaacg
     1441 ctgcgcaatc tgctcatccg ggcgaatgac ctgttggcgc cgctactcca gggtcccttc
     1501 cagcacatga cgcccatcca gatggccgag tataccagcg ccattctgga gacgcctccc
     1561 ctgcatccga gtctcttcat gccggcgaat gtgccacgtc cctggacggg gctcactggt
     1621 ctcagcctgc cctacgacat ccatccgcac agcctgacgc ccgaggactt ggcccgcatc
     1681 caggcgctgc ccaagcccgt ggatgccgcc acccagacgg agttccgctg catgtgcatg
     1741 ctactctcgc aggcaggatc tgcgccagca tccggcgacc tgcgacctcc aacgacctct
     1801 tacttcccag gacaagcgat gaatggacca cgaatgcggg ttcctccgca gccatatgca
     1861 aacccgggaa cttcacagaa cggacaggca cagccgcaac ctcagcagcc ctacggtttc
     1921 tggggcagac ccatgccgat ggcgatgcat ccgcaccaag ttccgcagca tcgcatgccc
     1981 cgacaggccg ccttccagca tccaccgccg ccgcaacggc atccacatcc gcaagcgcat
     2041 ccgcagccgc atccgcgccc gggaatgcag cccggtcatc cgatggcgca ccacaagatg
     2101 ggcggagcag tgcgctaccc ggaggctggt aactatcgtg gctatcggct gcctggtccc
     2161 tacagtccgc ggaatcaggc gagtcgctac ggcgccatcg gcagtgatcg tcccgccgag
     2221 aataatggta atattaatgg aaggaatcat taccagaact ccggctaccg tggtaatcca
     2281 ctctatgcga gaacagctct cggcaacaat caacagcaga tggcttcggt tgctccgaat
     2341 atcggtcatc tcggtaatcc catcggcaat cctgtccagc ccatggatct gggcatgcag
     2401 ctgagattga atctgagtgc ggaggaggcg aatgccagca gacaggtggc ctccgactcg
     2461 tcgcgcaccg agtcctcggc caccgagaac accagctatc actcggagcg ccgggtgagc
     2521 gaggagagca gctacgtgga ggcggaggac gaggatgggg acagcgaaga ggtggactcc
     2581 gagggggagg aggacgaggt cgagggcggc gaggacgagg atgacgatga tgaggaggaa
     2641 gaggatgaga atgagaatga cgaagaagag gatgaggatg aggatgagga ggatgatgag
     2701 cagccccgtc agctgtctgc tcccgattcc cgttcctact gcttcggtga ggtggagttc
     2761 atgaactacg tcatgtccga gacgcccatg ctgcccaatc ccggccagca gccaggagcc
     2821 gccgcctata tggagcacgg cctgccgcca actccacgct cgaatgtgcc ctgtgcccag
     2881 gtgacgcccc accacctgcc cacccacttt ccaggctatc gtgctgctcg ggagcagcct
     2941 tcccaggtca tgcctccaat gaaccagatg cctcccatgg ctcaaatgtc tcaaatgcct
     3001 caaatgtccc aagtgcctcc tatgacctcc atgccaggag gcagttaccc acctaccggc
     3061 ataccagatg gccagttcaa ggagccaccg ccaggatcac aggatcatta ctacaacaac
     3121 cagcaataca atgccatgta cggccaaccg cgacccaatt tcgactacaa tgcgagcatg
     3181 atgggtcagg aatcagctcc ggtggcagga cccagcatgt acatgagacc accacctccc
     3241 cccacagcag caccaccacc acaaggacct ccaccttcgt atcccatgcg acaggagcgc
     3301 ccaatggccc ccattctctt cgaggtgggc ggcaatcgca gtcatagcac tggttcgccc
     3361 atgatgatgc agaacgtgat gagtgcccac ggacatggac gcggtcgtgg tcgtggtccc
     3421 agtcgtggtc ctagtcgagg tccccatggc caccacaatg cctatggcca gatgaatcag
     3481 atgaatcaga atcatggaat gcccttcaat gggcagatga accaaatggg acagatgccg
     3541 ccgcacatgc ggaaccatca agtgaaccac caggtcaacc acatgggcca accaatgggc
     3601 aggggaatgg gcatgggcct gatgcccctg cccgtccaga tgctcaatcc gaatcgcatg
     3661 gcccagccga atgtgcagca gggaatgccc atgaatcgct cgaccatgag tcctcgccag
     3721 tatgccaatc acaatcccaa tgcgatgccc agctatcagc ctcagccgga gatgaacaag
     3781 atgccaccga tggcggaggg aggagctgca gctggatcct ccacacctcg tgtgaatttc
     3841 gcggccaatg tggcgaacaa gccacgtggt ggaactccac gtaaccaggt gggcactccg
     3901 agaccgggag ctcctgttcc tgcgccggtg gaagtggcta ctggacagca gaaacccatc
     3961 caaccttcct atgcctccat gctgcagtaa aagtgggatt ttaatagttc cttccatatg
     4021 tccttttttt tctcttattt ttttttggtc aaataaatcg cgcacaatca ttccatattc
     4081 atat