Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

PREDICTED: Drosophila takahashii short gastrulation (sog),


LOCUS       XM_017158307            5879 bp    mRNA    linear   INV 09-DEC-2024
            transcript variant X1, mRNA.
ACCESSION   XM_017158307
VERSION     XM_017158307.3
DBLINK      BioProject: PRJNA1194641
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            On Dec 9, 2024 this sequence version replaced XM_017158307.2.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..5879
                     /organism="Drosophila takahashii"
                     /mol_type="mRNA"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     gene            1..5879
                     /gene="sog"
                     /note="short gastrulation; Derived by automated
                     computational analysis using gene prediction method:
                     Gnomon. Supporting evidence includes similarity to: 7
                     Proteins"
                     /db_xref="GeneID:108068627"
     CDS             1173..4259
                     /gene="sog"
                     /codon_start=1
                     /product="dorsal-ventral patterning protein Sog"
                     /protein_id="XP_017013796.2"
                     /db_xref="GeneID:108068627"
                     /translation="MANKQRKSSEWATATGTVPLLERYCNSNSSSEETEVETEAQIPR
                     SIHRAPLIRHLRHLLIAGLLIVCLVGGTEGRRHAPLMFEESDTGRRSNRPAVTECQFG
                     KVVRELGSTWYADLGPPFGVMYCIKCECVAIPKKRRIVARVQCRNIKNECPPAKCDDP
                     ISLPGKCCKTCPGDRNDTDVALDVPVPSEEEERNMKHYAALLTGRTSYFLKGEEMKSM
                     YTTYNPQNVVATARFLFHKKNLYYSFYTSARIGRPRAIQFVDDAGVILEEHQLETTLA
                     GTLSVYQNATGKICGVWRRVPRDYKRILRDDRLHVVLLWGNKQQAELALAGKVAKYTA
                     LQTELFSSLLEASPQSDPQLAGAGGTAIVSTSSGAASSMHLTLVFNGVFGAEEFADAA
                     LSVRIELPERKELIFDEIPRVRKPSAEINVLELSSPISIQNLRLMSRGKLLLTVESKK
                     HPHLRIQGHIVTRASCEIFQTLLAPHSAESSTKSSGLAWVYLNTDGSLAYNIETEHVN
                     TRDRPNISLIEEQGKRKAKLEDLTPSFNFNQAVGSVEKLGPKVLESLYAGELGVNVAT
                     EHEASLIRGRLVPRPVADARDSAEPILLKRVQEEDPHAVGMAWMSIDNECNLHYEVTL
                     NGVPAQELQLYLEEKPIEAIGAPVTRKLLEEFNGSYLEGFFLSMPSAELIKLEMSVCY
                     LEVHSKHSKQLLLRGKLKSTKVPGHCFPVYTDNNVPVPGDHNDNHLVNGETKCFHSGR
                     FYNESEQWRSAQDSCQMCACLRGQSSCEVIKCPALRCKAATEQLLQREGECCPSCVPT
                     QQSAVQSSPAVNASDLLQQRRGCRLGDQFHTAGASWHPFLPPNGFDTCTTCSCDPLTL
                     EIRCPRLVCPPLQCSEKLAYRPDKKACCKICPEGKQSSSNGHKAAPNNPNVLQDQAMQ
                     RSPSHSAEEVLANGGCKVVNKVYENGQEWHPILMSHGEQKCIKCRCKDSKVNCDRKRC
                     SRSTCQQQTRVSSKRRLFEKPDAAAPAIDECCSTQCRRSRRHHKRQPHHQQQRSSS"
     misc_feature    1467..1685
                     /gene="sog"
                     /note="von Willebrand factor type C domain; Region: VWC;
                     pfam00093"
                     /db_xref="CDD:278520"
     misc_feature    1761..2165
                     /gene="sog"
                     /note="A domain in the BMP inhibitor chordin and in
                     microbial proteins; Region: CHRD; smart00754"
                     /db_xref="CDD:214804"
     misc_feature    2184..2555
                     /gene="sog"
                     /note="A domain in the BMP inhibitor chordin and in
                     microbial proteins; Region: CHRD; smart00754"
                     /db_xref="CDD:214804"
     misc_feature    2571..2909
                     /gene="sog"
                     /note="A domain in the BMP inhibitor chordin and in
                     microbial proteins; Region: CHRD; smart00754"
                     /db_xref="CDD:214804"
     misc_feature    2934..3269
                     /gene="sog"
                     /note="A domain in the BMP inhibitor chordin and in
                     microbial proteins; Region: CHRD; smart00754"
                     /db_xref="CDD:214804"
     misc_feature    3375..3557
                     /gene="sog"
                     /note="von Willebrand factor type C domain; Region: VWC;
                     pfam00093"
                     /db_xref="CDD:278520"
     misc_feature    3633..3833
                     /gene="sog"
                     /note="von Willebrand factor type C domain; Region: VWC;
                     pfam00093"
                     /db_xref="CDD:278520"
     misc_feature    3960..4190
                     /gene="sog"
                     /note="von Willebrand factor type C domain; Region: VWC;
                     pfam00093"
                     /db_xref="CDD:278520"
     polyA_site      5879
                     /gene="sog"
                     /experiment="COORDINATES: polyA evidence [ECO:0006239]"
ORIGIN      
        1 tgcaagacta aaggtaaaac atgttgcgaa acttttgttt tgttccccaa caaagttcca
       61 atgttttatt ttggcaatat cttttttttt taatttttgt aaaatataaa aaagagaggc
      121 tgcagttgca gcgctgctcg cgctcattca actcgacttc aacttcaact tgcctcgttc
      181 gtcatcgcca aagcatctct tcgacacgcg actcgcacgc acacacctgc agttgcgaaa
      241 attgcagtga gaacgatcgt gtccgtgtgc gcgagagaga gggagtgaga gcggcagagg
      301 gagagcaaac gagagaggga gagagagagc gcgctgcctg cgccgcagtc ggcgtcgcag
      361 cgcgcgcagt tgttggcctg cttggtagtt tatttattca accgaccgat accgtgtgga
      421 cattatttta ccaagcgttc gcaaccgcac aacacggaga ggagagccaa gagagccaag
      481 aaacccagag agctacgatc cgatccgatc caaaatcaca ataccatcat atacacggcg
      541 gttcgcaccg ccagctatct agtagataaa aagtcgccgc gacgagaacg cagaacgcag
      601 aaccgatccg aaccgctaac ttgaacgcca gaaactcgcg ttgtcgtaat ccccatccga
      661 gatcgactct attttccaga gcatatatac cgcaatagtg ccgaaactaa actgaacttt
      721 acttagccac actataccat gcgatgatga attgaaatta cagtaaaata agagtaaaaa
      781 gtaacaagta aaaaccaagt taagaaaaac aagtctcgaa actgaaaacg aaaaacgcaa
      841 atttatgcag ccgctaaata aaaacaaaac aaaacacaag acagcgaatt aaaaacagca
      901 aatagaataa atccaataga tcggcgcgcg aaactcgcgt gtgttatcta atctgcaaga
      961 gaagtacaag aatcggtatc ggttctatac tatatctata tctatatatc tatatccatt
     1021 gtgtgtgtgc cagtgtgtgc gttgcgacct ttgtttttat atattttttg ttgttgttca
     1081 aactgtgaaa cgtgctttac aagccgggcg tttaaaatac atactaaata ctacaactac
     1141 acaccacaaa ataacaaaat aaccatataa atatggccaa caagcagagg aaatcgagtg
     1201 aatgggccac ggccaccggc acagtaccgc tcctggaaag gtattgcaac agcaacagca
     1261 gcagcgaaga aacggaagtg gaaacggaag cccagatccc cagatccatc cacagagccc
     1321 ccctgatccg ccacctgcgc cacctgctca tcgccggact gctgatcgtc tgtttggtgg
     1381 gcggtacgga gggccggcgg catgcgccgc tcatgttcga ggagtccgac acgggcaggc
     1441 ggtccaacag accagcggtc accgaatgtc agtttggcaa agtcgtgcgc gaattgggct
     1501 cgacctggta tgcggatttg ggtccaccct tcggagtgat gtactgcatc aagtgtgaat
     1561 gcgtggcgat acccaagaag cgccgcatcg ttgcacgcgt ccagtgtcgc aatattaaga
     1621 acgagtgtcc accggccaaa tgcgatgatc ccatctcgct gcccggcaaa tgctgcaaga
     1681 cctgtcccgg cgatcggaac gatacggatg tggccttgga tgtgccagtg cccagcgaag
     1741 aggaagagcg caacatgaaa cattacgctg cgttgctcac gggccgcacc tcctatttcc
     1801 tcaagggcga ggaaatgaag tccatgtaca ccacctacaa tccacagaat gtggtggcca
     1861 ccgcccggtt tctgttccac aagaagaacc tctactactc gttctacacc tcggcgcgga
     1921 tcggtcgtcc ccgggccatt cagttcgtcg acgatgccgg tgtcattctg gaggagcacc
     1981 aactggaaac cacactggcc gggactttga gtgtctacca gaatgccacc ggcaagattt
     2041 gcggcgtttg gcgtcgtgtg ccccgcgatt acaagcgcat cctgcgcgac gatcgcctcc
     2101 atgtggtgct cctctggggc aacaagcagc aggccgagtt ggccctcgcc ggcaaggtgg
     2161 ccaagtacac ggccctccag acggaactct ttagctcgct cctcgaggcg tcgccccagt
     2221 cggatcccca gctggccgga gcaggcggca cggcgattgt gtccaccagc agcggtgccg
     2281 cctcctcgat gcatctcacc ctggtcttta atggcgtctt tggcgcggag gagtttgcgg
     2341 atgccgcgct gagtgtccgc atcgagttgc ccgaacgcaa ggagctgatc ttcgatgaga
     2401 ttccccgcgt gcgcaaaccc tccgccgaga tcaatgtcct ggagctctcc tcgcccatct
     2461 ccattcagaa tttgcgcctg atgtcgcgtg gcaagttgct gttgaccgtc gagtccaaga
     2521 agcatcccca cctgcgcatc cagggacaca ttgtgacccg tgccagctgc gagatcttcc
     2581 agaccctcct ggcgccgcac agcgccgaat cctcgaccaa gagcagcggc ctcgcctggg
     2641 tctatctgaa caccgacggc tcgctggcct acaacatcga aacggagcac gtgaacaccc
     2701 gcgatcggcc caacatcagt ttgatcgagg agcagggcaa gcgcaaggcc aagctggagg
     2761 atctgacgcc cagcttcaac ttcaaccagg ccgtcggcag tgtggagaag ctggggccca
     2821 aggttctcga gtcgctctac gccggcgagc tgggcgtcaa tgtggccacc gaacatgagg
     2881 ctagtttaat ccgcggacgt ctagttcccc gtccagtggc cgatgcccgt gactccgctg
     2941 agcctattct gctcaagcga gtgcaggagg aggatcccca tgccgtgggc atggcctgga
     3001 tgtccatcga caacgagtgc aatttgcact acgaggtgac cctcaacggt gtgcccgccc
     3061 aggaattgca actgtatctc gaggagaagc ccatcgaggc cattggagcg cctgtcacta
     3121 ggaaactact cgaggagttc aacggctcgt atctggaggg cttcttcctc agcatgccgt
     3181 cggcggagct gatcaagctg gagatgagcg tctgctatct ggaggtccac tcgaagcact
     3241 ccaagcagct gcttctgcgc ggcaagctga agagcaccaa ggtgccgggt cactgcttcc
     3301 ccgtctacac ggacaacaat gtcccggtgc cgggcgatca taacgacaat catctggtta
     3361 atggcgagac caagtgcttc cactccggac gcttctacaa cgaatcggag cagtggcgca
     3421 gtgcccagga ttcctgccag atgtgcgcct gcctgcgcgg tcagtccagt tgcgaggtca
     3481 tcaagtgtcc tgccctccgt tgcaaggctg ccacggagca gctgctccag cgggagggtg
     3541 aatgctgtcc cagttgtgtg cccacacagc agtcagccgt acaatcctcg ccggccgtga
     3601 atgccagcga tttgctgcaa cagcgacgcg gctgccggct gggcgatcaa ttccacacgg
     3661 ccggagccag ctggcatcca ttcctgccgc ccaacggctt cgacacctgc accacctgca
     3721 gctgcgatcc gctgaccctc gagatccggt gcccccggct cgtctgcccg ccgctgcagt
     3781 gcagcgagaa gctcgcctac cggccggaca agaaggcctg ctgcaagatc tgcccggagg
     3841 gcaagcagag cagctccaac gggcacaagg cggcgccgaa caaccccaat gtgctgcagg
     3901 accaggccat gcagcgatcg cccagccaca gtgccgagga ggtcctcgcc aacggcggct
     3961 gcaaggtggt caacaaggtc tacgagaacg gccaggagtg gcaccccatc ctcatgtccc
     4021 acggcgagca gaagtgcatc aagtgccgct gcaaggactc caaggtgaac tgcgaccgca
     4081 agcgctgctc ccgctcgacg tgccagcaac agactcgtgt ctccagcaag cggcggctgt
     4141 tcgagaagcc cgacgcagcg gctccggcca tcgacgagtg ctgctccacc cagtgccgcc
     4201 ggtcgaggcg ccaccacaag cggcaaccgc accaccagca gcagcgctcc tccagttgag
     4261 cggttgcggt tgtaggttgt aggatggccg ggggattccg agttcaatcc cgatctagat
     4321 tcagttgcag aagcagaaac agagccacgg tagaaagttg agccactcac tcactcagtg
     4381 cgatgtgcac caccacccaa ccactcacac acacactcac tcacactaac acacatcacc
     4441 cacacaaacg agcctacata cacacacttg tgcaaggact tgcatagatc gttgttgtta
     4501 tgtggcagca atgaaaactt gtattatata tttgaaaaca gaaaaacaaa aggaggagga
     4561 ggaggagaga attctcaaaa aaaaaaaaat agagaaaagg agagggttag agagatgata
     4621 atgagatcct tggaaaagga cattaaacca gtgcagtttg ctttaaattc tccagcgcag
     4681 aattttcaac agaaagcatt ttctgaattt cttttcacaa cccgcccccc attgatttcc
     4741 cccaccatcc caggcaccca aaaaaaaaca ttaaaattaa attttaattt attaaaaaaa
     4801 gcaccataaa aatcacatta aaaaacacta ccaaaaacaa aaagaattcg aaattgaaac
     4861 tgtactgagt tttgaaaaca cacacatttg acttattgaa atacttaaac cttttcaatt
     4921 catttaaatt caaatttaaa ttgaaatttg aaagcaaatc gataacgcag cagctatcgt
     4981 gggaatatta acttaaatct aattattctg attatacata tatacaatat tttttatata
     5041 tacatatata tatttatata catatatata tctatataat ttatgagaaa caaaacacag
     5101 tgggaaacgc atttggcgca ttatgtggtt atcgacattt gtatcgatat atctatttta
     5161 gcaacccaaa agtttcaact gattgactga atggcaaaaa taaaattagc tttaaatatg
     5221 tttttcgcaa gcaagaaaat gaaggaaatt gaaaacaccg cttttcaatg tgaaattatt
     5281 ttttactaat tgttactcac accgacacac accacacgca cacatacatt taggaatcga
     5341 caagcacaca catacacatt gaaatgtttg tattttttag gaatatattt ttaaattaaa
     5401 atacatttta tggaactttt tcaataatta tgtacaaaat ttgttaaaaa aaaaagaaaa
     5461 ctatttcaaa aaatgccatg tacacatatt tatacacaaa cacacacata ctcccataaa
     5521 aagaacaaat cgaaaaaaga aacaaaaata aaatttagcg catacacaga cagaaattgt
     5581 attaattttt tattctatta aattattctt attattatta ttttatgttt attattattg
     5641 taatgatttt ttttgtagct tgtataaaca tttatatata tattctatat ttaataattg
     5701 attgattaat tttaaataat tgatcaatgg cgagcaaaaa tatttaaatc gacgaagagc
     5761 taaacaaact tgtactaata acttaaattt agtgtttata aattaaacaa aaattgaaaa
     5821 tcaaaaacaa caacaaaaca cacacagaga gaataaacaa aaaactacaa aatcacaaa