LOCUS       XM_044396094            6608 bp    mRNA    linear   INV 09-DEC-2024
DEFINITION  PREDICTED: Drosophila takahashii uncharacterized protein
            (LOC123003484), transcript variant X1, mRNA.
ACCESSION   XM_044396094
VERSION     XM_044396094.2
DBLINK      BioProject: PRJNA1194641
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            On Dec 9, 2024 this sequence version replaced XM_044396094.1.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..6608
                     /organism="Drosophila takahashii"
                     /mol_type="mRNA"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     gene            1..6608
                     /gene="LOC123003484"
                     /note="uncharacterized LOC123003484; Derived by automated
                     computational analysis using gene prediction method:
                     Gnomon."
                     /db_xref="GeneID:123003484"
     CDS             777..6170
                     /gene="LOC123003484"
                     /codon_start=1
                     /product="uncharacterized protein isoform X1"
                     /protein_id="XP_044252029.1"
                     /db_xref="GeneID:123003484"
                     /translation="MAADKDQEQGQASSDASLSNLHILQADIFERVKQLYQNFKKDSQ
                     SRKTKIYFQTRINKLENFSKEFDTNHQTLLLNGCSPEHEYFTSQLAARFEESYLTYYC
                     EIGDAFETRFPTVPSDATPSHSNSTMHNTTMLQAPGSTVQLPKLPVPSYSGKLTEWPA
                     FHDVFQQLIHDNGALSSIQKFHFLKQALPADRDQDVHQMDLTEANYLVAWSLIVTRYN
                     NPRLLFMHHMNTLYELPSISKESSAELKHMLNVANVCINEFKRLNIAIANCDHWIVHH
                     LTTKLPSQTIQAWEHSLGSTKEIPTFFTLESFLNNRLFSIDIIEGRKVPPPPRQPQGV
                     NGFQRNVIKHSGNAQTRISSCHTSGVTTNSVRCAHCNDQHILRRCPDFLAKDSFARKL
                     IVDRSKVCLNCLSPTHSLSKCNSSKNCLQCGQRHHTLLHFPTQVKASDVAQSAGNITS
                     LNSQGLTASSNTTQLISASASHSTPHTMILATALVRLHSNATGQSAVVRALIDHGSEG
                     TLVTETVVQALGLPRFPVSAEISGVGGNSTNRCKYRTECTLSSTTNSGFKMWVENAFV
                     LRTLTSPLPRTNLTLPTCPHLTGLELADPNFMHTNSIDVLLGVDTIPQFMMSGIRRGS
                     YDQPIAQCTQLGWIIFGRITPKQTHTISIQCHHSNLETLVQKFFELEAVSTTRLYTAE
                     EQWCEDHFKRTHIRQPNGKYMVRLPFKTLFDPNQTIGKSKQTALNRFHFLQRKFQKKP
                     DLYEQYSKVMEEYFELHQITEAITSEEQHRLADKGGSISYTACTLPHHAVLKADSSTT
                     KLRVVYDASCKTSNGKSLNDILCIGPALQNDLGGVILNWRFLQYVFAADIQKMYRCID
                     VHPEDTHFQRIIWQRENNAIKDYCLTTVTFGTASAPYTAIRIMHQIAQDERDQFPLAE
                     HVLRKEIYVDDLQSGHETIKGALQVRDDVIGALQSAGMELRKWAANHPSLLNSIPPEH
                     MSNSKILEIENQESIKTLGLYWHPKEDFYGFKLKFTIDEIFTKRSILATVARLYDPLG
                     FVAPVIIIAKVILKEVWSIRIQQADGTPAGLAWDATVPPVIQHKWKEYCTNLLKIESL
                     RIPRWLQYLPSTIASLQLHIFCDGSSMAYAACAYVRVQHTNNSVYTHLIAAKSRVTPT
                     KPLTIPRVELCGAVLAAQLGDWLCKQIDQPTHPISTYFWSDATIVLYWIAGDPLHWKT
                     FVANRVGRILESSSASQWRHVPTGDNPADCATRGLYPDQLAAYDLWWQGPSWLRLPES
                     QWPSKIFDIPDSTNLSCEQKSLSLQTHSCVERNPNSLLTSFSSYNKLLFIMAYVRRFI
                     HNSQTRVDSRQRGPVTAQEFQQALGHIVRLVQHETFKVEIQKIKTKTHLSRSNKLSQL
                     SPFLDNEGVLRVRGRLKNALHLSPHQRTPIILPKDHHFTELVIRNAHLNTLHGGISLT
                     LAVTRQTFWILNGKQAVKKILHKCVDCFKHRPKAVTQLMGDLPLHRVNPPKRAFEATG
                     VDYTGALEIKASKFRGHHKYKAYIAVFICLATKAVHLEAVTGLSSQDFLWALQRFIGR
                     RGYCQHIYSDCGTNFIGADKSLNLWHEEFRQSVIATVIPKLTAQQIQWHFNPPHSPNF
                     GGLWEANVKAVKTHLHRTCKGALMTYEQLSTILVQIEACLNSRPLCPLSSDMEDLAVL
                     TPAHFLIGDSMMALPNPSASDKSLNAQFLEGQRLLRTFWHRWSSDWLSHLQSRPKWQR
                     VEENLRLHDIVIIKDDRLPPNEWKLGRIVELHPGSDNLIRVASIKTASGIYKRSLSKI
                     CPLPLATYSEATE"
     misc_feature    1227..1670
                     /gene="LOC123003484"
                     /note="Protein of unknown function (DUF1759); Region:
                     DUF1759; pfam03564"
                     /db_xref="CDD:281552"
     misc_feature    2241..2693
                     /gene="LOC123003484"
                     /note="Cellular and retroviral pepsin-like aspartate
                     proteases; Region: pepsin_retropepsin_like; cl11403"
                     /db_xref="CDD:472175"
     misc_feature    order(2283..2285,2289..2291,2295..2297,2364..2372)
                     /gene="LOC123003484"
                     /note="inhibitor binding site [active]"
                     /db_xref="CDD:133137"
     misc_feature    2283..2291
                     /gene="LOC123003484"
                     /note="catalytic motif [active]"
                     /db_xref="CDD:133137"
     misc_feature    2283..2285
                     /gene="LOC123003484"
                     /note="Catalytic residue [active]"
                     /db_xref="CDD:133137"
     misc_feature    order(2364..2375,2385..2396)
                     /gene="LOC123003484"
                     /note="Active site flap [active]"
                     /db_xref="CDD:133137"
     misc_feature    3129..3773
                     /gene="LOC123003484"
                     /note="conserved catalytic core domain of RNA-dependent
                     RNA polymerase (RdRp) from the positive-sense
                     single-stranded RNA [(+)ssRNA] viruses and closely related
                     viruses; Region: ps-ssRNAv_RdRp-like; cl40470"
                     /db_xref="CDD:477363"
     misc_feature    order(3315..3332,3432..3437,3540..3542,3546..3551,
                     3759..3764)
                     /gene="LOC123003484"
                     /note="active site"
                     /db_xref="CDD:238822"
     misc_feature    order(3315..3332,3432..3434,3546..3548)
                     /gene="LOC123003484"
                     /note="NTP binding site [chemical binding]; other site"
                     /db_xref="CDD:238822"
     misc_feature    3435..3437
                     /gene="LOC123003484"
                     /note="nucleic acid binding site [nucleotide binding];
                     other site"
                     /db_xref="CDD:238822"
     misc_feature    3816..4337
                     /gene="LOC123003484"
                     /note="Pao retrotransposon peptidase; Region:
                     Peptidase_A17; pfam05380"
                     /db_xref="CDD:461634"
     misc_feature    5013..5183
                     /gene="LOC123003484"
                     /note="Integrase zinc binding domain; Region:
                     Integrase_H2C2; pfam17921"
                     /db_xref="CDD:465569"
     misc_feature    5874..6137
                     /gene="LOC123003484"
                     /note="Family of unknown function (DUF5641); Region:
                     DUF5641; pfam18701"
                     /db_xref="CDD:465838"
ORIGIN      
        1 cggacgtttg ctgtagttga gagaacataa acccaccact ctgttataac cgagttttat
       61 gtaaagggag tcacaagaga gagaacgatc gacgattctt ttatgggcat cgaacagaag
      121 cagtcatccc agccgctggt cgcgtttaat aatctaatat aataaattaa aattgatacc
      181 attgtcagca ttgaagattg taaagtgaaa aaactgaatt cgaagtaatt gtagataata
      241 aatagaccat ccatgaagtg caccttttag agttataatt ttcaaattgg tggaaaaacc
      301 ccagagcatc tttgctcatc tttgacaata taaattgtca caacattttg gtggcccatg
      361 aggggaccca tccaatcacg ccttctggaa taattatcta ctaatctgtt ttcgaattaa
      421 ttaatcagtt tggatctaaa ctttgtgaaa tctggcacat ggttataaat tttccattcg
      481 caactacata taaccagcaa acggcagcac atgtataagt gctgttacac tatcgtccca
      541 tacgctttac cctccaactg cgctgccctg ccgatccatc gctttgccga agtcaacgtg
      601 ggttaattta gatgctggag agaagctgtt tcgctgccga aatctatcac gtcgcatccc
      661 ccacaccttt tcgtcaaacc tggagcgact tcacttggac ttccagactt ttcgttcaat
      721 ttcgttacgt cacaacttga aagggtgtat aagagtaata caatcaaagc tgaaacatgg
      781 cagcagataa agaccaggag caggggcagg catccagcga tgccagccta tcaaacctac
      841 atatacttca agctgatatt tttgaacgtg ttaaacaact ttatcaaaat tttaaaaagg
      901 atagtcagtc aagaaaaact aaaatctatt ttcaaactcg tataaataaa cttgaaaatt
      961 tctcaaagga attcgacacc aatcatcaaa ctcttttatt gaacggttgc tcacctgaac
     1021 atgaatattt tacctctcaa ctagcagcaa ggtttgaaga gtcttactta acatattatt
     1081 gcgagattgg agacgccttc gagacccggt ttccaacagt tcccagcgac gctactccta
     1141 gccattctaa ttcaacgatg cacaacacca ccatgctaca agcgcctggc tctacagttc
     1201 aactaccaaa attaccagta cctagttatt ctggcaagct gactgaatgg ccagcctttc
     1261 atgatgtctt tcaacaactt attcacgata atggtgcact ttccagcata caaaaattcc
     1321 actttctgaa gcaagcactc ccagcggacc gagatcaaga cgtgcatcaa atggacttga
     1381 ctgaagcgaa ttatctcgtc gcctggagtc tcattgttac gcgttataat aatccccgat
     1441 tgttattcat gcatcatatg aataccctat atgagttgcc atcaatttcc aaagaaagtt
     1501 cagccgaact aaaacatatg ttaaacgttg ccaacgtttg catcaatgag ttcaaacgcc
     1561 tcaacattgc aattgccaat tgcgatcatt ggattgtgca tcatctcacc actaagctgc
     1621 catcgcaaac tatccaagcc tgggaacaca gtcttgggag tacaaaggaa atcccaacct
     1681 ttttcacttt ggaatcattt ctcaacaacc gactttttag catcgatata attgaaggtc
     1741 gcaaggttcc acctccacca cggcaaccac aaggagtcaa cggcttccaa cgaaacgtca
     1801 tcaaacattc aggcaacgca caaaccagga tcagtagctg ccatacttca ggggttacta
     1861 ccaactctgt tcgctgcgct cattgcaatg accaacacat cttacgtcgt tgtccagact
     1921 ttttagcaaa ggattctttc gctcgcaaac ttattgttga ccgcagtaag gtatgtctca
     1981 attgcctcag ccccacccat tctctgtcaa aatgtaacag cagcaaaaac tgtttacaat
     2041 gtggacaacg ccatcacact ctcttacatt tcccaaccca ggtaaaggct agtgatgtcg
     2101 cgcagtctgc aggaaacata actagtctca actcacaggg tctgacagca tctagcaaca
     2161 ccacacagct catcagcgcg tcagcctcac attcaacacc tcatacgatg attctcgcaa
     2221 ctgctctggt tcgtcttcat agtaatgcca caggtcagtc agcggtggtc cgagccctta
     2281 tcgatcatgg atcagaaggt acccttgtta cggaaactgt agtccaagcg ctagggcttc
     2341 cccgatttcc agtctcagca gaaatatcag gggtaggagg gaattccacc aacaggtgca
     2401 aatatcgcac cgaatgtact ttaagctcaa ctactaattc aggattcaag atgtgggtgg
     2461 aaaatgcctt cgttttgcgt accctgactt ctcctttacc aaggacgaat ctaactctac
     2521 caacgtgtcc acatttaact gggttggagc tcgccgaccc aaattttatg cacactaata
     2581 gcatagatgt gctcctcggc gtggacacca ttccacaatt tatgatgagt ggaatcagga
     2641 gaggatctta cgatcaacca attgcccaat gcacgcaact tggttggatc atttttggca
     2701 gaattacgcc aaaacagaca cacacgatat ccatacaatg tcatcattct aatctggaaa
     2761 ctctagttca aaaattcttc gaattggaag cagtaagtac aacccgtctc tacactgcgg
     2821 aagaacaatg gtgcgaagac cactttaaac gcactcatat acgccaacca aatggaaagt
     2881 acatggtgcg cctaccattt aaaaccttat ttgatcctaa ccaaaccata ggaaaatcaa
     2941 aacaaactgc cctaaatcgg tttcattttt tacagcgaaa attccaaaag aaaccagatc
     3001 tgtatgaaca atactcaaag gtcatggagg aatacttcga gctgcaccaa ataactgaag
     3061 ccatcacctc agaagaacag catcgcctgg cggacaaagg aggctcaatc agttatactg
     3121 catgcacatt accccatcat gcagttctga aggcggacag cagcaccaca aaacttcgag
     3181 ttgtgtacga cgcctcatgc aagacatcaa atggaaaatc gctcaacgac attctctgta
     3241 taggcccagc cttacaaaac gaccttggtg gggtcattct caattggcga tttttacaat
     3301 atgtctttgc ggccgatatt cagaaaatgt atcgttgcat cgatgtacac ccggaggata
     3361 cacacttcca acggattatt tggcaaaggg aaaacaatgc tataaaggat tattgtttaa
     3421 caactgtgac gtttggaaca gcatcagccc cgtacacagc cattcgcatc atgcatcaga
     3481 tcgctcagga tgaacgagat caatttccat tagcagaaca tgttttacga aaagaaatat
     3541 atgtggatga tttgcaaagt ggacatgaga caattaaggg agcccttcaa gttcgagatg
     3601 atgtcatcgg ggcacttcaa tcagctggta tggagcttcg aaaatgggca gcaaatcatc
     3661 ccagtctttt gaactctatt cctcctgagc acatgtccaa ttccaaaatt ttggaaatcg
     3721 aaaaccaaga gtccattaaa actttggggc tatattggca ccccaaggaa gacttttatg
     3781 gtttcaaatt aaaatttaca attgatgaaa ttttcactaa acgatcaatt cttgctacag
     3841 ttgcacgtct gtacgatcct ttaggtttcg ttgctccagt cataatcatc gccaaggtta
     3901 ttctcaaaga agtctggagc atacgtattc aacaggctga tggcactcca gcaggattag
     3961 cttgggacgc aacggtaccg ccagtaattc agcacaagtg gaaagaatac tgcaccaatt
     4021 tacttaaaat tgaatcactg agaatacccc gatggttaca gtatttaccc agcaccatcg
     4081 catccttgca gctgcatatc ttttgtgatg gctcttctat ggcatacgca gcgtgtgcct
     4141 atgtgcgggt ccaacataca aataactcgg tatacacgca tctcatagct gcaaagagtc
     4201 gagtcacacc cacaaagccg ttgacaattc ctagagtgga actatgtgga gctgtactcg
     4261 ccgcacaact gggtgactgg ctctgcaagc aaatagatca accaacgcat cccatttcga
     4321 cctatttctg gagcgacgca acgatcgttc tttattggat tgcgggggat cccttacatt
     4381 ggaaaacatt cgtagcgaat cgtgtaggac gaattttgga gtccagttct gcatctcaat
     4441 ggagacatgt gcccaccgga gataaccctg cagactgcgc gacgcgtggt ctctatccag
     4501 atcaattagc tgcctatgat ttatggtggc aggggccttc ttggctacga ctaccagaat
     4561 ctcaatggcc tagcaaaatc tttgacattc ctgatagcac caatctctcg tgtgagcaaa
     4621 aatcgctttc tctacaaacc cacagctgcg ttgagagaaa tccaaactct ttactcacgt
     4681 cgttttcatc ttacaacaaa cttctgttta taatggctta cgtacgacgc tttattcaca
     4741 attcacaaac gcgcgtggac tctcgacaga gaggtccagt taccgcccag gagtttcagc
     4801 aggcactagg gcacattgtc cgactagtgc aacatgaaac gtttaaggtg gagattcaaa
     4861 aaataaagac aaaaacacat ttatctagat cgaacaaatt aagtcaactt tccccatttc
     4921 tagataacga aggagtgctg cgtgtccgag ggagattgaa aaatgcattg cacctatcgc
     4981 cgcatcaacg cacaccgatc attttgccaa aggatcatca ttttaccgaa cttgtcattc
     5041 gtaatgctca tctcaacaca ttacatggcg ggatatcact tacactggca gtgacaaggc
     5101 aaacgttttg gattcttaat gggaaacagg cagttaaaaa gattctgcac aaatgtgttg
     5161 actgcttcaa gcacagaccg aaggcagtca cgcagctcat gggagacctt cccttacatc
     5221 gagtgaaccc accaaagcga gcctttgaag ccacaggcgt ggactatacc ggtgcgctag
     5281 aaatcaaagc gtcaaagttt cgtggacacc acaaatacaa ggcatatatt gcagttttta
     5341 tatgcttggc gacaaaggca gtccacttgg aagctgttac cggattgtct tctcaggatt
     5401 ttttgtgggc tctacaacgt tttattggca gacgtggata ctgccaacac atctacagcg
     5461 attgcggtac caatttcatt ggagcagata aatccctaaa cctttggcat gaagagtttc
     5521 gacaaagcgt tatcgcaacg gttattccaa aattaaccgc tcagcagatt caatggcatt
     5581 tcaatccgcc acacagccca aacttcggtg gattatggga ggcgaatgtt aaagcggtta
     5641 agacacactt acatcgcacg tgtaaagggg cgctcatgac ctacgaacaa ctctcaacta
     5701 ttttggttca aatcgaagcc tgtttgaact ctcgcccact ttgcccgttg agctcggata
     5761 tggaggattt agcagtactg acaccggccc atttcttaat tggcgattcc atgatggcgc
     5821 tacctaatcc atcagcttcg gataaatcgt taaacgcaca attcttggaa ggacaaagac
     5881 tgcttcgaac cttttggcat cgttggagct cggattggct ttcacactta caatctcgtc
     5941 caaaatggca gcgggttgag gaaaacttgc gtttacacga catagttatt attaaagatg
     6001 atcggcttcc gccaaatgaa tggaagctcg gtcgcatagt cgaattacat ccaggatctg
     6061 ataatctcat tcgagtcgca tctattaaga cggcatccgg aatttataaa cgctctttgt
     6121 cgaaaatctg tccattgcca ttagctactt attcagaggc tacggaataa acattaatta
     6181 cgtacatact ttgtccacct ctacatatcg taatgacact tatttcttta cagcaaagtg
     6241 cagaaatttg gtttaaatac tggactcagt atttgggggc ggcatgttcg gacgtttgct
     6301 gtagttgaga gaacataaac ccaccactct gttataaccg agttttatgt aaagggagtc
     6361 acaagagaga gaacgatcga cgattctttt atgggcatcg aacagaagca gtcatcccag
     6421 ccgctggtcg cgtttaataa tctaatataa taaattaaaa ttgataccat tgtcagcatt
     6481 gaagattgta aagtgaaaaa actgaattcg aagtaattgt agataataaa tagaccatcc
     6541 atgaagtgca ccttttagag ttataatttt caaattggtg gaaaaacccc agagcatctt
     6601 tgctcatc
//