Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]
LOCUS XM_070218510 5163 bp mRNA linear INV 09-DEC-2024 (LOC123003142), mRNA. ACCESSION XM_070218510 VERSION XM_070218510.1 DBLINK BioProject: PRJNA1194641 KEYWORDS RefSeq; includes ab initio. SOURCE Drosophila takahashii ORGANISM Drosophila takahashii Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora. COMMENT MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NC_091683) annotated using gene prediction method: Gnomon. Also see: Documentation of NCBI's Annotation Process ##Genome-Annotation-Data-START## Annotation Provider :: NCBI RefSeq Annotation Status :: Full annotation Annotation Name :: GCF_030179915.1-RS_2024_12 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 10.3 Annotation Method :: Gnomon; cmsearch; tRNAscan-SE Features Annotated :: Gene; mRNA; CDS; ncRNA Annotation Date :: 12/07/2024 ##Genome-Annotation-Data-END## ##RefSeq-Attributes-START## ab initio :: 100% of CDS bases ##RefSeq-Attributes-END## FEATURES Location/Qualifiers source 1..5163 /organism="Drosophila takahashii" /mol_type="mRNA" /strain="IR98-3 E-12201" /db_xref="taxon:29030" /chromosome="X" /sex="female" /tissue_type="Whole fly" /dev_stage="Adult fly" /collected_by="Originally obtained from EHIME-Fly" gene 1..5163 /gene="LOC123003142" /note="uncharacterized LOC123003142; Derived by automated computational analysis using gene prediction method: Gnomon. Supporting evidence includes similarity to: 12 Proteins" /db_xref="GeneID:123003142" CDS 1..5163 /gene="LOC123003142" /codon_start=1 /product="uncharacterized protein" /protein_id="XP_070074611.1" /db_xref="GeneID:123003142" /translation="MCPTPLDVLKQKRSNIRRSISRIGTAVAKTEKVEKPDAALTPAE LTCRQTILEAYFKQIMAVQTEIEALDSTEEQLNYRAELEDMFIAIKVTIKDQLGDDVH NSTPLAHDTLLRPGPPSSLKLPTLALPTFAGEYSEYKNFITTFQQVVDQQEALSSIEK FNHLINCLRGAALETIKAFQVTPENYTKALLRLKSRYDNPTIVFLDNIASLFKLPTVT TENSVQLRSLVDNASALFNSMRSLGSEAQIAQAMLITIVMEKVDKRTRQLWNESLTFA TLPSWNACLTLIERHCQHLESSIHHFGNTNTAQNTSGRMRQPQAPRMGHSFTCSTQTC PICSGRDHRIFRCPRLQTMTPAQRLEAVRTKSLCTNCLGKAHHPSNCPSLNRCRICSR LHHTMLHFDPSGQESRSPRRPSPPNAPPQHHVAEAVTHAHVDTHHDQVILATAIILVQ DAVGNYVPGRALLDSCSQVNFMSEDFAQQLRLPRSKGHIDIRSIGDSFTRIKHRTSTV VKSRFGDVELPVECCITSRIAYQPDAEIDISSWNLPDNAPLADERFYQSRHIDLLLGT ETFFTALSIGQIKLGPHLPMLQKTIFGWVVSGRTGNQGHAAHAHSFLSSEESINLNLE RLWRIEEIFTTPLTLTPAQAKCESIFQKSVCQVADGRLMIRLPFKDDPNLLGRSLETA RLRFASLERRLSRNANLRAEYTKFMDEYERLGHMELVTSPLLNEPHYYIPHHCVFKLN STSTKLRVVFDASCPTSTYKSLNDILLVGPTIQPDLFTLLLRFRTHRYVITADVVKMY RQVLIDPAHRKFQYILWRSHIDQEIRTYQLNTVTYGTASAPYLAIRSLSHLADQHASS HQIGAEAIKSCFYVDDFLGGADTLDQLNQIRTEVIAILTAGKMELAKWHSNHQDFVND TTVKELNLDDYATTSTLGLKWDQREDVFKFSFNCSTAPERITKRTILSVASSLFDPLG LVSPVIMVAKILLQEIWLLRLQWDESVPMTLQQAWLSFVSSLQHLDSLLIPRFCLQPH SDELQLHGFSDASIRGYGCCIYARTRLADGHVEVHLIASKSRVAPTKKKTLPQLELCG AHLLARLYNQIKAAFLQHTPDTFLWTDSQLVLHWIRQHSATLSTFVGNRISDIQEFTR DCHWRFVPTRHNPADMISRGCNLAELTKSTWFHGPTFLSNPPTLWPPNVHGNLDVDVV SSEKRKSAFLASTPPSRNNVLEALHNKESHQSGLRLVAWMLRFCDRCQKKKPFTTGPL SPLELRRAMLCVAWNLQQRYFIEEITLLQRGAPVRSHLKFLSPFLQVTDGFNLLFVGG RLELASIPDTHKHPILLPAKDVAVVNYVRHLHLRNYHAGPKVLVGLMRLEFWVVNARD VARRVVRNCVHCVRYRPKLLQQLMGNLPVERLTLSRPFARCGIDFCGPVHTYLRVRGK MPVKSYIAIFVCFATKAAHIEIVGELSTDSFLGALKRMIARRGLPSDIFCDNATNFVG ASNKLQDLKDFFFSTKTQQDITSYCTNEFVTFHFIPPRAPHFGGLWEAAVKSAKNLLC RTLKDARLTFEELSTVAAETEAILNSRPLSPHSSDPSDLAVLTPGHFLTGDSLRALPD WPVKDDQLNCSQRWKRVSAIKQHIWRRWSTEYLHELQAKSNWSTGSSNIAVNEMVLIH EDHVPPHRWLLGRIEAAIPGKDNHVRVADVRTSKGIIRRPIHKLARLPINLK" misc_feature 391..807 /gene="LOC123003142" /note="Protein of unknown function (DUF1759); Region: DUF1759; pfam03564" /db_xref="CDD:281552" misc_feature 886..>1185 /gene="LOC123003142" /note="Arginine methyltransferase-interacting protein, contains RING Zn-finger [Posttranslational modification, protein turnover, chaperones / Intracellular trafficking and secretion]; Region: AIR1; COG5082" /db_xref="CDD:227414" misc_feature 1339..1821 /gene="LOC123003142" /note="Cellular and retroviral pepsin-like aspartate proteases; Region: pepsin_retropepsin_like; cl11403" /db_xref="CDD:472175" misc_feature order(1390..1392,1396..1398,1402..1404,1471..1479) /gene="LOC123003142" /note="inhibitor binding site [active]" /db_xref="CDD:133136" misc_feature 1390..1398 /gene="LOC123003142" /note="catalytic motif [active]" /db_xref="CDD:133136" misc_feature 1390..1392 /gene="LOC123003142" /note="Catalytic residue [active]" /db_xref="CDD:133136" misc_feature order(1471..1482,1492..1503) /gene="LOC123003142" /note="Active site flap [active]" /db_xref="CDD:133136" misc_feature 2185..2820 /gene="LOC123003142" /note="Reverse transcriptase (RTs) in retrotransposons. This subfamily represents the RT domain of a multifunctional enzyme. C-terminal to the RT domain is a domain homologous to aspartic proteinases (corresponding to Merops family A17) encoded by...; Region: RT_pepA17; cd01644" /db_xref="CDD:238822" misc_feature order(2377..2394,2497..2502,2605..2607,2611..2616, 2791..2796) /gene="LOC123003142" /note="putative active site [active]" /db_xref="CDD:238822" misc_feature order(2377..2394,2497..2499,2611..2613) /gene="LOC123003142" /note="putative NTP binding site [chemical binding]; other site" /db_xref="CDD:238822" misc_feature 2500..2502 /gene="LOC123003142" /note="putative nucleic acid binding site [nucleotide binding]; other site" /db_xref="CDD:238822" misc_feature 2866..3348 /gene="LOC123003142" /note="Pao retrotransposon peptidase; Region: Peptidase_A17; pfam05380" /db_xref="CDD:461634" misc_feature 4042..4188 /gene="LOC123003142" /note="Integrase zinc binding domain; Region: Integrase_H2C2; pfam17921" /db_xref="CDD:465569" misc_feature 4867..5145 /gene="LOC123003142" /note="Family of unknown function (DUF5641); Region: DUF5641; pfam18701" /db_xref="CDD:465838" ORIGIN 1 atgtgtccca caccgctgga cgtattaaaa cagaagcgat ccaacatacg caggagcatt 61 tcccgcattg gaacagcagt cgccaagacg gaaaaggttg agaagccaga tgccgcgctg 121 acgccagccg aactcacgtg tcggcaaact attttggagg catatttcaa acaaatcatg 181 gccgtccaaa ccgaaatcga ggcactcgat tcaacagagg aacagctaaa ctaccgggcc 241 gagctggaag atatgttcat cgctataaag gtcaccataa aggatcagct tggagacgac 301 gtgcacaatt ctacgccgct tgcccacgac acgctcctca ggcctggacc accatcatct 361 ctcaagctgc caactcttgc gctcccaacc tttgctggcg agtactcgga atacaaaaac 421 tttattacta cgttccagca agttgtagac cagcaggaag cgctatcgtc gatcgagaag 481 ttcaatcacc tcataaactg tctcagaggc gccgccttag aaaccatcaa ggcgtttcaa 541 gtcactccag agaattacac caaggcctta cttcgcctaa agagcaggta cgataacccc 601 accatcgtat tcctggacaa catagcgtca ctcttcaaat tgccaactgt gaccactgaa 661 aatagtgtac agttgcgcag cctggttgac aacgcatcag cgttgttcaa ctcgatgcga 721 tctttaggat ccgaagcgca gatagctcaa gcgatgttaa ttactattgt catggaaaag 781 gtagacaaga gaactcgaca gctctggaat gaatctctca cgttcgccac attaccctcg 841 tggaatgcct gcttaacctt gattgagcgg cactgtcaac acttggagtc ctcaatacat 901 cacttcggca atacgaatac ggctcaaaat acatctggca gaatgcggca accgcaagct 961 ccacgcatgg gccacagctt tacgtgctcc acgcaaacgt gtcctatatg ctcaggcagg 1021 gatcatcgga ttttcagatg tccgcgactt caaaccatga cgcctgccca acgcctagaa 1081 gctgtccgga caaaatcgct gtgcacaaac tgcctaggaa aggctcatca tccatcaaac 1141 tgtccatctt taaataggtg cagaatatgc tcccggctgc atcacactat gcttcacttc 1201 gatccgtcag gacaggaatc aaggagtcct cgacggccaa gtccgcctaa cgcgccacct 1261 cagcaccatg tcgctgaggc cgtcactcac gcacacgtgg atacccacca cgaccaagtc 1321 attttagcta cagccataat cctagtgcag gatgctgtgg gcaactacgt accaggacgt 1381 gccctactgg actcgtgctc ccaagttaac ttcatgtccg aagacttcgc tcaacaactt 1441 cgtttgcccc gaagcaaggg acacatcgac attcgaagca tcggcgattc tttcacccgc 1501 atcaagcatc gcacttcgac cgttgtcaag tcacgattcg gtgatgtaga gttacccgtc 1561 gaatgctgta taacctcgag aatcgcctat caacctgatg cagaaatcga catttcatca 1621 tggaacctcc cagacaacgc tccactggct gatgagagat tttatcagtc tcgccatatc 1681 gacctacttc ttggcacgga aacgttcttc accgctttgt ctattggcca aattaagttg 1741 ggccctcatc tccccatgtt gcaaaagacg atctttggat gggtagtatc agggcgcact 1801 ggcaatcaag gccatgccgc ccacgcccac agtttccttt catcagagga gtcaataaac 1861 ctgaacttgg agcgcttatg gcgcatagag gaaattttta ctacgccatt aacacttact 1921 ccagcgcaag ctaaatgcga atcgatcttc caaaaatctg tctgtcaagt agcggatgga 1981 agactgatga ttcgactgcc tttcaaggat gaccctaatc tgttggggag gtctcttgaa 2041 acagcacggc taagatttgc atcgttggaa cgtcgcttaa gccgaaacgc gaatttgcga 2101 gccgaataca caaaattcat ggatgaatat gagcgcttgg gtcacatgga attagtaacc 2161 agtccactcc tcaacgaacc ccactactac ataccacacc actgcgtgtt taagctcaac 2221 agcacgtcaa cgaaactacg ggtagtcttt gacgcctcgt gcccgacaag cacctacaag 2281 tccctaaacg acatccttct agttggaccc acgattcaac cagacttgtt tacactctta 2341 ctaaggttcc ggactcatcg ttatgtaata actgctgacg tcgtcaagat gtacaggcag 2401 gtattaatcg acccagcgca tcgtaaattt cagtacatcc tatggagaag ccacatcgac 2461 caggagattc gcacatacca gctgaacacc gttacctacg gcaccgcatc agcgccgtat 2521 ctagcaattc gcagcctgtc tcacctcgcc gatcaacatg caagttcaca tcaaattgga 2581 gctgaagcca taaaatcgtg tttttatgtg gatgactttc tcggtggagc agatacgtta 2641 gatcaattga accagattcg aaccgaagtc atcgctatcc tcacggccgg caagatggaa 2701 ctggcaaaat ggcactccaa ccatcaggat ttcgtcaatg atacaacggt caaggagcta 2761 aatcttgacg actacgccac gacaagcaca ttgggactga agtgggacca acgagaagac 2821 gtcttcaaat tctcattcaa ctgcagcaca gctccggaaa ggatcacaaa gcgaaccatc 2881 ctctcggtgg cctcatccct cttcgatccg ctgggacttg tctcgccagt tattatggtg 2941 gcaaaaatcc tgctacagga aatttggctt cttcgactac agtgggatga atcagtccca 3001 atgacactcc aacaagcgtg gctatcgttt gtgtcgtcgc tacagcattt agactctctg 3061 ctcatccctc gtttctgcct gcagccccat tcagatgaac tacaacttca cggcttcagc 3121 gatgcttcaa taagggggta tggatgttgc atttacgcaa gaacaaggct tgccgatggc 3181 cacgtggaag ttcatctaat cgcctcaaag tcgcgtgtgg caccaacgaa gaagaagacg 3241 ctgccgcagc tcgagctctg tggagcgcat cttctagcac gactgtacaa ccagataaag 3301 gcagcttttc tacaacacac gcccgatact tttctgtgga cagattcaca gctagtgttg 3361 cattggattc gacaacattc tgcgacactt tctacgtttg tcggcaatcg gatatccgac 3421 atccaagaat tcacgcgaga ctgtcactgg aggttcgttc cgacacggca taacccagct 3481 gacatgatct ccagaggatg caacctcgca gaactcacca aatcaacatg gttccacgga 3541 cctacatttc tatcaaatcc gccgacatta tggccgccta acgtacacgg caatctggac 3601 gtggacgtcg tatcgtcaga aaaacgcaaa tcagcattcc ttgcatcaac accgccttcc 3661 aggaataatg tcttggaagc ccttcacaac aaggaatcac accagtctgg cctaagactg 3721 gtagcatgga tgcttcgatt ttgcgacaga tgtcagaaga aaaaaccctt tactacgggg 3781 ccactttccc ctctggaact gcgaagggcc atgctttgcg tcgcatggaa cctacaacag 3841 cgatatttca tcgaggaaat aactcttttg caaaggggcg cgccggttcg tagccatctt 3901 aaattcttat cgcccttctt gcaggtcaca gatggattca acctactttt cgtcggagga 3961 cgattggaac tcgcatcgat cccagacact cacaagcatc ccatattgct tcccgctaag 4021 gatgttgctg tcgtcaacta cgtacgtcat ctacacctga ggaactacca tgctggtcca 4081 aaggtcctgg tgggactaat gcggctggaa ttttgggtcg tgaatgcccg agatgtcgct 4141 cgtcgtgttg tacgcaactg tgtccactgt gtgcgctaca ggcctaagtt gctgcaacaa 4201 cttatgggca acctcccagt cgagagactt accctgtcaa gaccatttgc tcggtgtggg 4261 atagacttct gcgggccggt tcatacctat ctacgcgttc gaggaaagat gcccgttaaa 4321 agctacatcg caatatttgt gtgcttcgcc acgaaggcgg cgcacatcga aatcgtcgga 4381 gaactatcaa ctgattcgtt cttgggtgct ctgaagcgga tgatcgccag acgtgggctg 4441 ccttcagaca ttttttgcga caacgcgacg aattttgtag gcgcgagcaa caagttgcaa 4501 gacctgaagg actttttctt cagcaccaag acccagcaag acattacatc gtactgtaca 4561 aacgaatttg taacgttcca ttttatacct cctagggctc cacacttcgg cggcctctgg 4621 gaggctgctg ttaaaagcgc caagaatctc ttgtgccgca cgctcaagga tgcccgacta 4681 acctttgagg agctctcgac tgtcgctgct gagactgagg caatcctaaa ttctcgcccg 4741 ctctctccac actcatcgga tcctagcgat ttggccgtct taacgcctgg tcatttctta 4801 accggggact cccttcgagc actcccagac tggcctgtca aggatgatca gctcaactgc 4861 tctcaacgat ggaagcgtgt cagcgccatc aagcaacaca tctggaggcg ctggtctacg 4921 gagtatcttc acgagcttca ggccaaatca aactggtcga ctggatcttc gaatattgca 4981 gtcaacgaaa tggtcctaat ccacgaagac catgtccctc ctcatcgatg gctgttaggt 5041 cgcatcgaag cagctatacc aggaaaggac aaccatgtcc gagtcgctga tgtccggacg 5101 tcgaaaggaa ttataaggcg accgattcac aaattggccc gtttaccaat taatttaaag 5161 taa