Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

PREDICTED: Drosophila takahashii short gastrulation (sog),


LOCUS       XM_070218624            5543 bp    mRNA    linear   INV 09-DEC-2024
            transcript variant X2, mRNA.
ACCESSION   XM_070218624
VERSION     XM_070218624.1
DBLINK      BioProject: PRJNA1194641
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..5543
                     /organism="Drosophila takahashii"
                     /mol_type="mRNA"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     gene            1..5543
                     /gene="sog"
                     /note="short gastrulation; Derived by automated
                     computational analysis using gene prediction method:
                     Gnomon. Supporting evidence includes similarity to: 7
                     Proteins"
                     /db_xref="GeneID:108068627"
     CDS             837..3923
                     /gene="sog"
                     /codon_start=1
                     /product="dorsal-ventral patterning protein Sog"
                     /protein_id="XP_070074725.1"
                     /db_xref="GeneID:108068627"
                     /translation="MANKQRKSSEWATATGTVPLLERYCNSNSSSEETEVETEAQIPR
                     SIHRAPLIRHLRHLLIAGLLIVCLVGGTEGRRHAPLMFEESDTGRRSNRPAVTECQFG
                     KVVRELGSTWYADLGPPFGVMYCIKCECVAIPKKRRIVARVQCRNIKNECPPAKCDDP
                     ISLPGKCCKTCPGDRNDTDVALDVPVPSEEEERNMKHYAALLTGRTSYFLKGEEMKSM
                     YTTYNPQNVVATARFLFHKKNLYYSFYTSARIGRPRAIQFVDDAGVILEEHQLETTLA
                     GTLSVYQNATGKICGVWRRVPRDYKRILRDDRLHVVLLWGNKQQAELALAGKVAKYTA
                     LQTELFSSLLEASPQSDPQLAGAGGTAIVSTSSGAASSMHLTLVFNGVFGAEEFADAA
                     LSVRIELPERKELIFDEIPRVRKPSAEINVLELSSPISIQNLRLMSRGKLLLTVESKK
                     HPHLRIQGHIVTRASCEIFQTLLAPHSAESSTKSSGLAWVYLNTDGSLAYNIETEHVN
                     TRDRPNISLIEEQGKRKAKLEDLTPSFNFNQAVGSVEKLGPKVLESLYAGELGVNVAT
                     EHEASLIRGRLVPRPVADARDSAEPILLKRVQEEDPHAVGMAWMSIDNECNLHYEVTL
                     NGVPAQELQLYLEEKPIEAIGAPVTRKLLEEFNGSYLEGFFLSMPSAELIKLEMSVCY
                     LEVHSKHSKQLLLRGKLKSTKVPGHCFPVYTDNNVPVPGDHNDNHLVNGETKCFHSGR
                     FYNESEQWRSAQDSCQMCACLRGQSSCEVIKCPALRCKAATEQLLQREGECCPSCVPT
                     QQSAVQSSPAVNASDLLQQRRGCRLGDQFHTAGASWHPFLPPNGFDTCTTCSCDPLTL
                     EIRCPRLVCPPLQCSEKLAYRPDKKACCKICPEGKQSSSNGHKAAPNNPNVLQDQAMQ
                     RSPSHSAEEVLANGGCKVVNKVYENGQEWHPILMSHGEQKCIKCRCKDSKVNCDRKRC
                     SRSTCQQQTRVSSKRRLFEKPDAAAPAIDECCSTQCRRSRRHHKRQPHHQQQRSSS"
     misc_feature    1131..1349
                     /gene="sog"
                     /note="von Willebrand factor type C domain; Region: VWC;
                     pfam00093"
                     /db_xref="CDD:278520"
     misc_feature    1425..1829
                     /gene="sog"
                     /note="A domain in the BMP inhibitor chordin and in
                     microbial proteins; Region: CHRD; smart00754"
                     /db_xref="CDD:214804"
     misc_feature    1848..2219
                     /gene="sog"
                     /note="A domain in the BMP inhibitor chordin and in
                     microbial proteins; Region: CHRD; smart00754"
                     /db_xref="CDD:214804"
     misc_feature    2235..2573
                     /gene="sog"
                     /note="A domain in the BMP inhibitor chordin and in
                     microbial proteins; Region: CHRD; smart00754"
                     /db_xref="CDD:214804"
     misc_feature    2598..2933
                     /gene="sog"
                     /note="A domain in the BMP inhibitor chordin and in
                     microbial proteins; Region: CHRD; smart00754"
                     /db_xref="CDD:214804"
     misc_feature    3039..3221
                     /gene="sog"
                     /note="von Willebrand factor type C domain; Region: VWC;
                     pfam00093"
                     /db_xref="CDD:278520"
     misc_feature    3297..3497
                     /gene="sog"
                     /note="von Willebrand factor type C domain; Region: VWC;
                     pfam00093"
                     /db_xref="CDD:278520"
     misc_feature    3624..3854
                     /gene="sog"
                     /note="von Willebrand factor type C domain; Region: VWC;
                     pfam00093"
                     /db_xref="CDD:278520"
     polyA_site      5543
                     /gene="sog"
                     /experiment="COORDINATES: polyA evidence [ECO:0006239]"
ORIGIN      
        1 ccgaacgaac gccgctagag aaacgcgcgc gcgctttggc agcaaaagcc cgaatacgaa
       61 accgaaacgc gtccgcaacc gctcccgaac cgcgcccgaa ccgcacccga accgcatccg
      121 aaccgcactc gaaccgctgc cgaaaacgtg accaatttgt gtcctgttct tcttcttccc
      181 acccaaagtt cgatgaccaa tctaagcgat atcgattggc agcaggagtc atatcgaaaa
      241 gtgcgcgcgc gtttagcgac gcattgcata taaatcaaat acgaaacaga ctttctgtga
      301 acttttccct tttctttttt tttttgctgt gaaacccaca tacagcatat ataccgcaat
      361 agtgccgaaa ctaaactgaa ctttacttag ccacactata ccatgcgatg atgaattgaa
      421 attacagtaa aataagagta aaaagtaaca agtaaaaacc aagttaagaa aaacaagtct
      481 cgaaactgaa aacgaaaaac gcaaatttat gcagccgcta aataaaaaca aaacaaaaca
      541 caagacagcg aattaaaaac agcaaataga ataaatccaa tagatcggcg cgcgaaactc
      601 gcgtgtgtta tctaatctgc aagagaagta caagaatcgg tatcggttct atactatatc
      661 tatatctata tatctatatc cattgtgtgt gtgccagtgt gtgcgttgcg acctttgttt
      721 ttatatattt tttgttgttg ttcaaactgt gaaacgtgct ttacaagccg ggcgtttaaa
      781 atacatacta aatactacaa ctacacacca caaaataaca aaataaccat ataaatatgg
      841 ccaacaagca gaggaaatcg agtgaatggg ccacggccac cggcacagta ccgctcctgg
      901 aaaggtattg caacagcaac agcagcagcg aagaaacgga agtggaaacg gaagcccaga
      961 tccccagatc catccacaga gcccccctga tccgccacct gcgccacctg ctcatcgccg
     1021 gactgctgat cgtctgtttg gtgggcggta cggagggccg gcggcatgcg ccgctcatgt
     1081 tcgaggagtc cgacacgggc aggcggtcca acagaccagc ggtcaccgaa tgtcagtttg
     1141 gcaaagtcgt gcgcgaattg ggctcgacct ggtatgcgga tttgggtcca cccttcggag
     1201 tgatgtactg catcaagtgt gaatgcgtgg cgatacccaa gaagcgccgc atcgttgcac
     1261 gcgtccagtg tcgcaatatt aagaacgagt gtccaccggc caaatgcgat gatcccatct
     1321 cgctgcccgg caaatgctgc aagacctgtc ccggcgatcg gaacgatacg gatgtggcct
     1381 tggatgtgcc agtgcccagc gaagaggaag agcgcaacat gaaacattac gctgcgttgc
     1441 tcacgggccg cacctcctat ttcctcaagg gcgaggaaat gaagtccatg tacaccacct
     1501 acaatccaca gaatgtggtg gccaccgccc ggtttctgtt ccacaagaag aacctctact
     1561 actcgttcta cacctcggcg cggatcggtc gtccccgggc cattcagttc gtcgacgatg
     1621 ccggtgtcat tctggaggag caccaactgg aaaccacact ggccgggact ttgagtgtct
     1681 accagaatgc caccggcaag atttgcggcg tttggcgtcg tgtgccccgc gattacaagc
     1741 gcatcctgcg cgacgatcgc ctccatgtgg tgctcctctg gggcaacaag cagcaggccg
     1801 agttggccct cgccggcaag gtggccaagt acacggccct ccagacggaa ctctttagct
     1861 cgctcctcga ggcgtcgccc cagtcggatc cccagctggc cggagcaggc ggcacggcga
     1921 ttgtgtccac cagcagcggt gccgcctcct cgatgcatct caccctggtc tttaatggcg
     1981 tctttggcgc ggaggagttt gcggatgccg cgctgagtgt ccgcatcgag ttgcccgaac
     2041 gcaaggagct gatcttcgat gagattcccc gcgtgcgcaa accctccgcc gagatcaatg
     2101 tcctggagct ctcctcgccc atctccattc agaatttgcg cctgatgtcg cgtggcaagt
     2161 tgctgttgac cgtcgagtcc aagaagcatc cccacctgcg catccaggga cacattgtga
     2221 cccgtgccag ctgcgagatc ttccagaccc tcctggcgcc gcacagcgcc gaatcctcga
     2281 ccaagagcag cggcctcgcc tgggtctatc tgaacaccga cggctcgctg gcctacaaca
     2341 tcgaaacgga gcacgtgaac acccgcgatc ggcccaacat cagtttgatc gaggagcagg
     2401 gcaagcgcaa ggccaagctg gaggatctga cgcccagctt caacttcaac caggccgtcg
     2461 gcagtgtgga gaagctgggg cccaaggttc tcgagtcgct ctacgccggc gagctgggcg
     2521 tcaatgtggc caccgaacat gaggctagtt taatccgcgg acgtctagtt ccccgtccag
     2581 tggccgatgc ccgtgactcc gctgagccta ttctgctcaa gcgagtgcag gaggaggatc
     2641 cccatgccgt gggcatggcc tggatgtcca tcgacaacga gtgcaatttg cactacgagg
     2701 tgaccctcaa cggtgtgccc gcccaggaat tgcaactgta tctcgaggag aagcccatcg
     2761 aggccattgg agcgcctgtc actaggaaac tactcgagga gttcaacggc tcgtatctgg
     2821 agggcttctt cctcagcatg ccgtcggcgg agctgatcaa gctggagatg agcgtctgct
     2881 atctggaggt ccactcgaag cactccaagc agctgcttct gcgcggcaag ctgaagagca
     2941 ccaaggtgcc gggtcactgc ttccccgtct acacggacaa caatgtcccg gtgccgggcg
     3001 atcataacga caatcatctg gttaatggcg agaccaagtg cttccactcc ggacgcttct
     3061 acaacgaatc ggagcagtgg cgcagtgccc aggattcctg ccagatgtgc gcctgcctgc
     3121 gcggtcagtc cagttgcgag gtcatcaagt gtcctgccct ccgttgcaag gctgccacgg
     3181 agcagctgct ccagcgggag ggtgaatgct gtcccagttg tgtgcccaca cagcagtcag
     3241 ccgtacaatc ctcgccggcc gtgaatgcca gcgatttgct gcaacagcga cgcggctgcc
     3301 ggctgggcga tcaattccac acggccggag ccagctggca tccattcctg ccgcccaacg
     3361 gcttcgacac ctgcaccacc tgcagctgcg atccgctgac cctcgagatc cggtgccccc
     3421 ggctcgtctg cccgccgctg cagtgcagcg agaagctcgc ctaccggccg gacaagaagg
     3481 cctgctgcaa gatctgcccg gagggcaagc agagcagctc caacgggcac aaggcggcgc
     3541 cgaacaaccc caatgtgctg caggaccagg ccatgcagcg atcgcccagc cacagtgccg
     3601 aggaggtcct cgccaacggc ggctgcaagg tggtcaacaa ggtctacgag aacggccagg
     3661 agtggcaccc catcctcatg tcccacggcg agcagaagtg catcaagtgc cgctgcaagg
     3721 actccaaggt gaactgcgac cgcaagcgct gctcccgctc gacgtgccag caacagactc
     3781 gtgtctccag caagcggcgg ctgttcgaga agcccgacgc agcggctccg gccatcgacg
     3841 agtgctgctc cacccagtgc cgccggtcga ggcgccacca caagcggcaa ccgcaccacc
     3901 agcagcagcg ctcctccagt tgagcggttg cggttgtagg ttgtaggatg gccgggggat
     3961 tccgagttca atcccgatct agattcagtt gcagaagcag aaacagagcc acggtagaaa
     4021 gttgagccac tcactcactc agtgcgatgt gcaccaccac ccaaccactc acacacacac
     4081 tcactcacac taacacacat cacccacaca aacgagccta catacacaca cttgtgcaag
     4141 gacttgcata gatcgttgtt gttatgtggc agcaatgaaa acttgtatta tatatttgaa
     4201 aacagaaaaa caaaaggagg aggaggagga gagaattctc aaaaaaaaaa aaatagagaa
     4261 aaggagaggg ttagagagat gataatgaga tccttggaaa aggacattaa accagtgcag
     4321 tttgctttaa attctccagc gcagaatttt caacagaaag cattttctga atttcttttc
     4381 acaacccgcc ccccattgat ttcccccacc atcccaggca cccaaaaaaa aacattaaaa
     4441 ttaaatttta atttattaaa aaaagcacca taaaaatcac attaaaaaac actaccaaaa
     4501 acaaaaagaa ttcgaaattg aaactgtact gagttttgaa aacacacaca tttgacttat
     4561 tgaaatactt aaaccttttc aattcattta aattcaaatt taaattgaaa tttgaaagca
     4621 aatcgataac gcagcagcta tcgtgggaat attaacttaa atctaattat tctgattata
     4681 catatataca atatttttta tatatacata tatatattta tatacatata tatatctata
     4741 taatttatga gaaacaaaac acagtgggaa acgcatttgg cgcattatgt ggttatcgac
     4801 atttgtatcg atatatctat tttagcaacc caaaagtttc aactgattga ctgaatggca
     4861 aaaataaaat tagctttaaa tatgtttttc gcaagcaaga aaatgaagga aattgaaaac
     4921 accgcttttc aatgtgaaat tattttttac taattgttac tcacaccgac acacaccaca
     4981 cgcacacata catttaggaa tcgacaagca cacacataca cattgaaatg tttgtatttt
     5041 ttaggaatat atttttaaat taaaatacat tttatggaac tttttcaata attatgtaca
     5101 aaatttgtta aaaaaaaaag aaaactattt caaaaaatgc catgtacaca tatttataca
     5161 caaacacaca catactccca taaaaagaac aaatcgaaaa aagaaacaaa aataaaattt
     5221 agcgcataca cagacagaaa ttgtattaat tttttattct attaaattat tcttattatt
     5281 attattttat gtttattatt attgtaatga ttttttttgt agcttgtata aacatttata
     5341 tatatattct atatttaata attgattgat taattttaaa taattgatca atggcgagca
     5401 aaaatattta aatcgacgaa gagctaaaca aacttgtact aataacttaa atttagtgtt
     5461 tataaattaa acaaaaattg aaaatcaaaa acaacaacaa aacacacaca gagagaataa
     5521 acaaaaaact acaaaatcac aaa