Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

PREDICTED: Drosophila takahashii uncharacterized protein


LOCUS       XM_070218510            5163 bp    mRNA    linear   INV 09-DEC-2024
            (LOC123003142), mRNA.
ACCESSION   XM_070218510
VERSION     XM_070218510.1
DBLINK      BioProject: PRJNA1194641
KEYWORDS    RefSeq; includes ab initio.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
            
            ##RefSeq-Attributes-START##
            ab initio :: 100% of CDS bases
            ##RefSeq-Attributes-END##
FEATURES             Location/Qualifiers
     source          1..5163
                     /organism="Drosophila takahashii"
                     /mol_type="mRNA"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     gene            1..5163
                     /gene="LOC123003142"
                     /note="uncharacterized LOC123003142; Derived by automated
                     computational analysis using gene prediction method:
                     Gnomon. Supporting evidence includes similarity to: 12
                     Proteins"
                     /db_xref="GeneID:123003142"
     CDS             1..5163
                     /gene="LOC123003142"
                     /codon_start=1
                     /product="uncharacterized protein"
                     /protein_id="XP_070074611.1"
                     /db_xref="GeneID:123003142"
                     /translation="MCPTPLDVLKQKRSNIRRSISRIGTAVAKTEKVEKPDAALTPAE
                     LTCRQTILEAYFKQIMAVQTEIEALDSTEEQLNYRAELEDMFIAIKVTIKDQLGDDVH
                     NSTPLAHDTLLRPGPPSSLKLPTLALPTFAGEYSEYKNFITTFQQVVDQQEALSSIEK
                     FNHLINCLRGAALETIKAFQVTPENYTKALLRLKSRYDNPTIVFLDNIASLFKLPTVT
                     TENSVQLRSLVDNASALFNSMRSLGSEAQIAQAMLITIVMEKVDKRTRQLWNESLTFA
                     TLPSWNACLTLIERHCQHLESSIHHFGNTNTAQNTSGRMRQPQAPRMGHSFTCSTQTC
                     PICSGRDHRIFRCPRLQTMTPAQRLEAVRTKSLCTNCLGKAHHPSNCPSLNRCRICSR
                     LHHTMLHFDPSGQESRSPRRPSPPNAPPQHHVAEAVTHAHVDTHHDQVILATAIILVQ
                     DAVGNYVPGRALLDSCSQVNFMSEDFAQQLRLPRSKGHIDIRSIGDSFTRIKHRTSTV
                     VKSRFGDVELPVECCITSRIAYQPDAEIDISSWNLPDNAPLADERFYQSRHIDLLLGT
                     ETFFTALSIGQIKLGPHLPMLQKTIFGWVVSGRTGNQGHAAHAHSFLSSEESINLNLE
                     RLWRIEEIFTTPLTLTPAQAKCESIFQKSVCQVADGRLMIRLPFKDDPNLLGRSLETA
                     RLRFASLERRLSRNANLRAEYTKFMDEYERLGHMELVTSPLLNEPHYYIPHHCVFKLN
                     STSTKLRVVFDASCPTSTYKSLNDILLVGPTIQPDLFTLLLRFRTHRYVITADVVKMY
                     RQVLIDPAHRKFQYILWRSHIDQEIRTYQLNTVTYGTASAPYLAIRSLSHLADQHASS
                     HQIGAEAIKSCFYVDDFLGGADTLDQLNQIRTEVIAILTAGKMELAKWHSNHQDFVND
                     TTVKELNLDDYATTSTLGLKWDQREDVFKFSFNCSTAPERITKRTILSVASSLFDPLG
                     LVSPVIMVAKILLQEIWLLRLQWDESVPMTLQQAWLSFVSSLQHLDSLLIPRFCLQPH
                     SDELQLHGFSDASIRGYGCCIYARTRLADGHVEVHLIASKSRVAPTKKKTLPQLELCG
                     AHLLARLYNQIKAAFLQHTPDTFLWTDSQLVLHWIRQHSATLSTFVGNRISDIQEFTR
                     DCHWRFVPTRHNPADMISRGCNLAELTKSTWFHGPTFLSNPPTLWPPNVHGNLDVDVV
                     SSEKRKSAFLASTPPSRNNVLEALHNKESHQSGLRLVAWMLRFCDRCQKKKPFTTGPL
                     SPLELRRAMLCVAWNLQQRYFIEEITLLQRGAPVRSHLKFLSPFLQVTDGFNLLFVGG
                     RLELASIPDTHKHPILLPAKDVAVVNYVRHLHLRNYHAGPKVLVGLMRLEFWVVNARD
                     VARRVVRNCVHCVRYRPKLLQQLMGNLPVERLTLSRPFARCGIDFCGPVHTYLRVRGK
                     MPVKSYIAIFVCFATKAAHIEIVGELSTDSFLGALKRMIARRGLPSDIFCDNATNFVG
                     ASNKLQDLKDFFFSTKTQQDITSYCTNEFVTFHFIPPRAPHFGGLWEAAVKSAKNLLC
                     RTLKDARLTFEELSTVAAETEAILNSRPLSPHSSDPSDLAVLTPGHFLTGDSLRALPD
                     WPVKDDQLNCSQRWKRVSAIKQHIWRRWSTEYLHELQAKSNWSTGSSNIAVNEMVLIH
                     EDHVPPHRWLLGRIEAAIPGKDNHVRVADVRTSKGIIRRPIHKLARLPINLK"
     misc_feature    391..807
                     /gene="LOC123003142"
                     /note="Protein of unknown function (DUF1759); Region:
                     DUF1759; pfam03564"
                     /db_xref="CDD:281552"
     misc_feature    886..>1185
                     /gene="LOC123003142"
                     /note="Arginine methyltransferase-interacting protein,
                     contains RING Zn-finger [Posttranslational modification,
                     protein turnover, chaperones / Intracellular trafficking
                     and secretion]; Region: AIR1; COG5082"
                     /db_xref="CDD:227414"
     misc_feature    1339..1821
                     /gene="LOC123003142"
                     /note="Cellular and retroviral pepsin-like aspartate
                     proteases; Region: pepsin_retropepsin_like; cl11403"
                     /db_xref="CDD:472175"
     misc_feature    order(1390..1392,1396..1398,1402..1404,1471..1479)
                     /gene="LOC123003142"
                     /note="inhibitor binding site [active]"
                     /db_xref="CDD:133136"
     misc_feature    1390..1398
                     /gene="LOC123003142"
                     /note="catalytic motif [active]"
                     /db_xref="CDD:133136"
     misc_feature    1390..1392
                     /gene="LOC123003142"
                     /note="Catalytic residue [active]"
                     /db_xref="CDD:133136"
     misc_feature    order(1471..1482,1492..1503)
                     /gene="LOC123003142"
                     /note="Active site flap [active]"
                     /db_xref="CDD:133136"
     misc_feature    2185..2820
                     /gene="LOC123003142"
                     /note="Reverse transcriptase (RTs) in retrotransposons.
                     This subfamily represents the RT domain of a
                     multifunctional enzyme. C-terminal to the RT domain is a
                     domain homologous to aspartic proteinases (corresponding
                     to Merops family A17) encoded by...; Region: RT_pepA17;
                     cd01644"
                     /db_xref="CDD:238822"
     misc_feature    order(2377..2394,2497..2502,2605..2607,2611..2616,
                     2791..2796)
                     /gene="LOC123003142"
                     /note="putative active site [active]"
                     /db_xref="CDD:238822"
     misc_feature    order(2377..2394,2497..2499,2611..2613)
                     /gene="LOC123003142"
                     /note="putative NTP binding site [chemical binding]; other
                     site"
                     /db_xref="CDD:238822"
     misc_feature    2500..2502
                     /gene="LOC123003142"
                     /note="putative nucleic acid binding site [nucleotide
                     binding]; other site"
                     /db_xref="CDD:238822"
     misc_feature    2866..3348
                     /gene="LOC123003142"
                     /note="Pao retrotransposon peptidase; Region:
                     Peptidase_A17; pfam05380"
                     /db_xref="CDD:461634"
     misc_feature    4042..4188
                     /gene="LOC123003142"
                     /note="Integrase zinc binding domain; Region:
                     Integrase_H2C2; pfam17921"
                     /db_xref="CDD:465569"
     misc_feature    4867..5145
                     /gene="LOC123003142"
                     /note="Family of unknown function (DUF5641); Region:
                     DUF5641; pfam18701"
                     /db_xref="CDD:465838"
ORIGIN      
        1 atgtgtccca caccgctgga cgtattaaaa cagaagcgat ccaacatacg caggagcatt
       61 tcccgcattg gaacagcagt cgccaagacg gaaaaggttg agaagccaga tgccgcgctg
      121 acgccagccg aactcacgtg tcggcaaact attttggagg catatttcaa acaaatcatg
      181 gccgtccaaa ccgaaatcga ggcactcgat tcaacagagg aacagctaaa ctaccgggcc
      241 gagctggaag atatgttcat cgctataaag gtcaccataa aggatcagct tggagacgac
      301 gtgcacaatt ctacgccgct tgcccacgac acgctcctca ggcctggacc accatcatct
      361 ctcaagctgc caactcttgc gctcccaacc tttgctggcg agtactcgga atacaaaaac
      421 tttattacta cgttccagca agttgtagac cagcaggaag cgctatcgtc gatcgagaag
      481 ttcaatcacc tcataaactg tctcagaggc gccgccttag aaaccatcaa ggcgtttcaa
      541 gtcactccag agaattacac caaggcctta cttcgcctaa agagcaggta cgataacccc
      601 accatcgtat tcctggacaa catagcgtca ctcttcaaat tgccaactgt gaccactgaa
      661 aatagtgtac agttgcgcag cctggttgac aacgcatcag cgttgttcaa ctcgatgcga
      721 tctttaggat ccgaagcgca gatagctcaa gcgatgttaa ttactattgt catggaaaag
      781 gtagacaaga gaactcgaca gctctggaat gaatctctca cgttcgccac attaccctcg
      841 tggaatgcct gcttaacctt gattgagcgg cactgtcaac acttggagtc ctcaatacat
      901 cacttcggca atacgaatac ggctcaaaat acatctggca gaatgcggca accgcaagct
      961 ccacgcatgg gccacagctt tacgtgctcc acgcaaacgt gtcctatatg ctcaggcagg
     1021 gatcatcgga ttttcagatg tccgcgactt caaaccatga cgcctgccca acgcctagaa
     1081 gctgtccgga caaaatcgct gtgcacaaac tgcctaggaa aggctcatca tccatcaaac
     1141 tgtccatctt taaataggtg cagaatatgc tcccggctgc atcacactat gcttcacttc
     1201 gatccgtcag gacaggaatc aaggagtcct cgacggccaa gtccgcctaa cgcgccacct
     1261 cagcaccatg tcgctgaggc cgtcactcac gcacacgtgg atacccacca cgaccaagtc
     1321 attttagcta cagccataat cctagtgcag gatgctgtgg gcaactacgt accaggacgt
     1381 gccctactgg actcgtgctc ccaagttaac ttcatgtccg aagacttcgc tcaacaactt
     1441 cgtttgcccc gaagcaaggg acacatcgac attcgaagca tcggcgattc tttcacccgc
     1501 atcaagcatc gcacttcgac cgttgtcaag tcacgattcg gtgatgtaga gttacccgtc
     1561 gaatgctgta taacctcgag aatcgcctat caacctgatg cagaaatcga catttcatca
     1621 tggaacctcc cagacaacgc tccactggct gatgagagat tttatcagtc tcgccatatc
     1681 gacctacttc ttggcacgga aacgttcttc accgctttgt ctattggcca aattaagttg
     1741 ggccctcatc tccccatgtt gcaaaagacg atctttggat gggtagtatc agggcgcact
     1801 ggcaatcaag gccatgccgc ccacgcccac agtttccttt catcagagga gtcaataaac
     1861 ctgaacttgg agcgcttatg gcgcatagag gaaattttta ctacgccatt aacacttact
     1921 ccagcgcaag ctaaatgcga atcgatcttc caaaaatctg tctgtcaagt agcggatgga
     1981 agactgatga ttcgactgcc tttcaaggat gaccctaatc tgttggggag gtctcttgaa
     2041 acagcacggc taagatttgc atcgttggaa cgtcgcttaa gccgaaacgc gaatttgcga
     2101 gccgaataca caaaattcat ggatgaatat gagcgcttgg gtcacatgga attagtaacc
     2161 agtccactcc tcaacgaacc ccactactac ataccacacc actgcgtgtt taagctcaac
     2221 agcacgtcaa cgaaactacg ggtagtcttt gacgcctcgt gcccgacaag cacctacaag
     2281 tccctaaacg acatccttct agttggaccc acgattcaac cagacttgtt tacactctta
     2341 ctaaggttcc ggactcatcg ttatgtaata actgctgacg tcgtcaagat gtacaggcag
     2401 gtattaatcg acccagcgca tcgtaaattt cagtacatcc tatggagaag ccacatcgac
     2461 caggagattc gcacatacca gctgaacacc gttacctacg gcaccgcatc agcgccgtat
     2521 ctagcaattc gcagcctgtc tcacctcgcc gatcaacatg caagttcaca tcaaattgga
     2581 gctgaagcca taaaatcgtg tttttatgtg gatgactttc tcggtggagc agatacgtta
     2641 gatcaattga accagattcg aaccgaagtc atcgctatcc tcacggccgg caagatggaa
     2701 ctggcaaaat ggcactccaa ccatcaggat ttcgtcaatg atacaacggt caaggagcta
     2761 aatcttgacg actacgccac gacaagcaca ttgggactga agtgggacca acgagaagac
     2821 gtcttcaaat tctcattcaa ctgcagcaca gctccggaaa ggatcacaaa gcgaaccatc
     2881 ctctcggtgg cctcatccct cttcgatccg ctgggacttg tctcgccagt tattatggtg
     2941 gcaaaaatcc tgctacagga aatttggctt cttcgactac agtgggatga atcagtccca
     3001 atgacactcc aacaagcgtg gctatcgttt gtgtcgtcgc tacagcattt agactctctg
     3061 ctcatccctc gtttctgcct gcagccccat tcagatgaac tacaacttca cggcttcagc
     3121 gatgcttcaa taagggggta tggatgttgc atttacgcaa gaacaaggct tgccgatggc
     3181 cacgtggaag ttcatctaat cgcctcaaag tcgcgtgtgg caccaacgaa gaagaagacg
     3241 ctgccgcagc tcgagctctg tggagcgcat cttctagcac gactgtacaa ccagataaag
     3301 gcagcttttc tacaacacac gcccgatact tttctgtgga cagattcaca gctagtgttg
     3361 cattggattc gacaacattc tgcgacactt tctacgtttg tcggcaatcg gatatccgac
     3421 atccaagaat tcacgcgaga ctgtcactgg aggttcgttc cgacacggca taacccagct
     3481 gacatgatct ccagaggatg caacctcgca gaactcacca aatcaacatg gttccacgga
     3541 cctacatttc tatcaaatcc gccgacatta tggccgccta acgtacacgg caatctggac
     3601 gtggacgtcg tatcgtcaga aaaacgcaaa tcagcattcc ttgcatcaac accgccttcc
     3661 aggaataatg tcttggaagc ccttcacaac aaggaatcac accagtctgg cctaagactg
     3721 gtagcatgga tgcttcgatt ttgcgacaga tgtcagaaga aaaaaccctt tactacgggg
     3781 ccactttccc ctctggaact gcgaagggcc atgctttgcg tcgcatggaa cctacaacag
     3841 cgatatttca tcgaggaaat aactcttttg caaaggggcg cgccggttcg tagccatctt
     3901 aaattcttat cgcccttctt gcaggtcaca gatggattca acctactttt cgtcggagga
     3961 cgattggaac tcgcatcgat cccagacact cacaagcatc ccatattgct tcccgctaag
     4021 gatgttgctg tcgtcaacta cgtacgtcat ctacacctga ggaactacca tgctggtcca
     4081 aaggtcctgg tgggactaat gcggctggaa ttttgggtcg tgaatgcccg agatgtcgct
     4141 cgtcgtgttg tacgcaactg tgtccactgt gtgcgctaca ggcctaagtt gctgcaacaa
     4201 cttatgggca acctcccagt cgagagactt accctgtcaa gaccatttgc tcggtgtggg
     4261 atagacttct gcgggccggt tcatacctat ctacgcgttc gaggaaagat gcccgttaaa
     4321 agctacatcg caatatttgt gtgcttcgcc acgaaggcgg cgcacatcga aatcgtcgga
     4381 gaactatcaa ctgattcgtt cttgggtgct ctgaagcgga tgatcgccag acgtgggctg
     4441 ccttcagaca ttttttgcga caacgcgacg aattttgtag gcgcgagcaa caagttgcaa
     4501 gacctgaagg actttttctt cagcaccaag acccagcaag acattacatc gtactgtaca
     4561 aacgaatttg taacgttcca ttttatacct cctagggctc cacacttcgg cggcctctgg
     4621 gaggctgctg ttaaaagcgc caagaatctc ttgtgccgca cgctcaagga tgcccgacta
     4681 acctttgagg agctctcgac tgtcgctgct gagactgagg caatcctaaa ttctcgcccg
     4741 ctctctccac actcatcgga tcctagcgat ttggccgtct taacgcctgg tcatttctta
     4801 accggggact cccttcgagc actcccagac tggcctgtca aggatgatca gctcaactgc
     4861 tctcaacgat ggaagcgtgt cagcgccatc aagcaacaca tctggaggcg ctggtctacg
     4921 gagtatcttc acgagcttca ggccaaatca aactggtcga ctggatcttc gaatattgca
     4981 gtcaacgaaa tggtcctaat ccacgaagac catgtccctc ctcatcgatg gctgttaggt
     5041 cgcatcgaag cagctatacc aggaaaggac aaccatgtcc gagtcgctga tgtccggacg
     5101 tcgaaaggaa ttataaggcg accgattcac aaattggccc gtttaccaat taatttaaag
     5161 taa