Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]
LOCUS XP_070074543 3376 aa linear INV 09-DEC-2024 protein isoform X30 [Drosophila takahashii]. ACCESSION XP_070074543 VERSION XP_070074543.1 DBLINK BioProject: PRJNA1194641 DBSOURCE REFSEQ: accession XM_070218442.1 KEYWORDS RefSeq. SOURCE Drosophila takahashii ORGANISM Drosophila takahashii Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora. COMMENT MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NC_091683) annotated using gene prediction method: Gnomon. Also see: Documentation of NCBI's Annotation Process ##Genome-Annotation-Data-START## Annotation Provider :: NCBI RefSeq Annotation Status :: Full annotation Annotation Name :: GCF_030179915.1-RS_2024_12 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 10.3 Annotation Method :: Gnomon; cmsearch; tRNAscan-SE Features Annotated :: Gene; mRNA; CDS; ncRNA Annotation Date :: 12/07/2024 ##Genome-Annotation-Data-END## COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..3376 /organism="Drosophila takahashii" /strain="IR98-3 E-12201" /db_xref="taxon:29030" /chromosome="X" /sex="female" /tissue_type="Whole fly" /dev_stage="Adult fly" /collected_by="Originally obtained from EHIME-Fly" Protein 1..3376 /product="basement membrane-specific heparan sulfate proteoglycan core protein isoform X30" /calculated_mol_wt=370678 Region 35..67 /region_name="LDLa" /note="Low-density lipoprotein receptor domain class A; smart00192" /db_xref="CDD:197566" Site order(41,49,60..61) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(53,56,60,66..67) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 63..67 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 73..107 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(78,86,97..98) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(90,93,97,103..104) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 100..104 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 113..147 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(118,126,137..138) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(130,133,137,143..144) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 140..144 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 162..230 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 262..296 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(267,275,286..287) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(279,282,286,292..293) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 289..293 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 302..336 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(307,315,326..327) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(319,322,326,332..333) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 329..333 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 345..379 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(350,358,369..370) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(362,365,369,375..376) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 372..376 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 397..472 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 400..404 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 413..417 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 436..440 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 450..455 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 464..467 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409353" Region 571..701 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region 758..811 /region_name="Laminin_EGF" /note="Laminin EGF domain; pfam00053" /db_xref="CDD:395007" Site order(758,760,772,782,784,793) /site_type="active" /note="EGF-like motif [active]" /db_xref="CDD:238012" Region 818..862 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domai; smart00180" /db_xref="CDD:214543" Region 936..1072 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region <1073..1099 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in...; cd00055" /db_xref="CDD:238012" Region 1107..1156 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in...; cd00055" /db_xref="CDD:238012" Site order(1108,1110,1117,1124,1127,1136) /site_type="active" /note="EGF-like motif [active]" /db_xref="CDD:238012" Region <1193..1223 /region_name="Laminin_EGF" /note="Laminin EGF domain; pfam00053" /db_xref="CDD:395007" Region 1289..1423 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region 1578..1660 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 1589..1593 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 1605..1609 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 1624..1628 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 1638..1643 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 1653..1656 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409353" Region <1687..1740 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 1771..1854 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 1782..1786 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 1795..1799 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 1818..1822 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 1832..1837 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 1867..1947 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 1884..1888 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409570" Region 1897..1901 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409570" Region 1916..1920 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409570" Region 1930..1935 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409570" Region 1943..1946 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409570" Region 1965..2041 /region_name="I-set" /note="Immunoglobulin I-set domain; pfam07679" /db_xref="CDD:400151" Region 1973..1977 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 1986..1990 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2021..2026 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2034..2037 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409353" Region 2054..2141 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2064..2068 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2077..2081 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2121..2126 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2266..2342 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2275..2279 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2288..2292 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2308..2312 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2322..2327 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2354..>2421 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2365..2369 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2381..2388 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2404..2408 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2504..2572 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 2528..2532 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2549..2552 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2562..2567 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2596..2662 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2604..2608 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2617..2621 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2636..2640 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2650..2655 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2684..2831 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" Region 2854..2886 /region_name="EGF" /note="EGF-like domain; pfam00008" /db_xref="CDD:394967" Region 2934..3087 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" Region 3141..3173 /region_name="EGF_CA" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; cd00054" /db_xref="CDD:238011" Region 3182..3335 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" CDS 1..3376 /gene="trol" /coded_by="XM_070218442.1:198..10328" /db_xref="GeneID:108055788" ORIGIN 1 mgrrlraafw llaaliviek pqksiargty vaygecrate frcnngdcid irkrcdhisd 61 csegedenee crcyadqfrc nngdciaesa hcdgnidcsd qsdeldcggd sqclpnqfrc 121 kngqcvssta rcnkrsdcld gsdeqncane pnnsgrgtnq lklktypdnq iikesrevif 181 rcrdegpnra kvkwsrpggr plppgftdrn grleipnirv edagayvcea vgyanyipgq 241 hvtvnlnver lnereirpds acteyqatcm ngecidksgi cdghpdcsdg sdehscslgl 301 kcqpnqfmcs nskcvdrtwr cdgendcgdn sdetscdpep sdapcrydef qcrsghcipk 361 sfqcdymndc tdgtdeigcs vpspmtlpap sivvmeyevl eltcvgtgvp tptivwrlnw 421 ghvpekcesk syggtgtlrc pnmrpqdsga yscefintrg tfypktnsiv tvtpvrsdvc 481 kagffnmlar kseecvqcfc fgvstncdsa nlftyaiqpp ilshrvvsve lspfrqivin 541 easpgqdllt lhhgvqfras nvhyngretp flalpaeymg nqlksyggnl ryevryngng 601 rpvsgpdvii tgnsftlthr vrthpgqnnr vtipflpggw tkpdgrkgtr edimmilanv 661 dnilirlgyl dstarevdli nialdsagsa dqglgsaslv ekctcppgyv gdscescasg 721 yvrqargpwl ghcvpftpep cpagtygnpr lgvpcqecpc phagannfas gcqqspdgdv 781 icrcnegyag krcehcaqgy qgnplapggv crkipdsscn vdgtyniysn gtcqckdsvi 841 geqcdtcapk sfhlnsftyt gciecfcsgv gldcdssswy rnqvtstfgr trvnhgfali 901 sdymrntpvt vpvsmstqan alsfvgsaeq agntlywslp aaflgnklts yggklsytls 961 ysplpsgims rnsapdvvik sgedlrlihy rksqvspsva ntyaveikes awqrgdelvp 1021 nrehvlmals nitaiyikat yttstkeasl rsvtldtata tnlgtarave veqcrcpegy 1081 lglsceqcap gytrdpeagi ylglcrpcec nghskycnse tgecescsdn tegfncdrca 1141 agyvgdatrg tsydcqyddg gyptsrppap gnqtaeclvn cqqegtagcr gyqceckrnv 1201 agdrcdqcrp gtyglsaqnp dgckecycsg ltnqcrsasl yrqlipvdfi stpplitdef 1261 gdimdrdnlv pdvprnvyty khtsytpkyw slrgsvlgnq llsyggrley slivesvgrd 1321 hrgkdvvlig nglkliwsrp dghdneqeyh vrlhedeqwt vedrgsarqa tradfmtvls 1381 dlqhililat pkvptvstsi snvilessit trapgathas dielcqcpsg ytgtscesca 1441 plhyrdasgr csqcpcdasn tescglvsgg nvecqcrprw rgdrcreidt spiieeppqi 1501 cdlsrgfccs gfqfdiapne tisfndtlqi ykgnriignm tklrygcpsr etneptpepd 1561 tstddpvrtq iivsiarpei tilpvggslt lsctgrmrwt nspvfvnwyk qgshlpegve 1621 vqggnlqlfn lqisdsgiyi cqavsnetgh sftdhvsitv sqedqrspah ivdlpndvtf 1681 eeyvsneivc evegnppptv twtrvdghad aqstrtdnnr lvfdsprksd egryrcqaen 1741 slsreekyvv vyvrsnppqp ppqqdrlyit pqevngvagd sfqlscqfts aaslrydwsh 1801 dgrslsassp rnvvvrgnvl evrdanvrds gtytcvafdl rtrrnftesa rvyieqpnep 1861 gilgdkphil tleqniiivq gedlsitcea sgtpypsikw tkvqenlaen vrisgnvlti 1921 yggrsenrgl ysciaenshg sdqsstsidi eprerpslti dtatqkvsvg sqaslycaaq 1981 gipeptvewv rtdgqplspr hkvqapgyvv iddivlddsg tyecrasnia gqvsglatin 2041 vqeptlvrie pdrqhhivtq gdelslscvg sgvptpsvfw sfegrdvdrm gvpegavfaq 2101 pfrtntadvk ifrvskeneg iyvchgsnda gedqqyirve vqprrgdvga ggddngdvdt 2161 rqppnrpqiq pnplsnerlt telgnnvtli cnvdnvntew ervdgtplph naytvrntlv 2221 ivfvepqnlg qyrcngigrd grveahvvre lvllplprit fypnipltve lgqnldvycq 2281 venvrpedvh wttdnnrplp ssvriegnvl rfasitqaaa geyrcsatnq ygsrsknarv 2341 vvkqpsgfqp vphsqvqqrq vgdsiqlrcr lttqygdevr gniqfnwyre dgsplprgvr 2401 pdsqvlqlvk lqpedegryi cnsydlgsgq qlppvsidlq vlrtttqypf nrfkggvslk 2461 dtpcmvlyic aavpaapqnp iylppvappr sperilepql slsvqssnlp agdgttvecf 2521 ssddsypdvv weradgapls envqqvgnnl visnvastda gnyvckcktd egdlyttsyk 2581 leveeqphel ksskivyakv ggnadlqcga dedrqpsyrw srqygqlqag rslqneklsl 2641 drvqandagt yvcsaqysdg etvdfpnilv vtgaipqfrq eprsymsfpt lsnssfkfnf 2701 eltfrpenad glllfngqtr gsgdyialsl kdryaefrfd fggkpllvra eeplaldewh 2761 tvrvsrfkrd gyiqvddqhp vafptsqhqq ipqleliedl yiggvpnwef lpaeavgqqs 2821 gfvgcisrlt lqgrtvelir eakfkegitd crpcaqgpcq nkgvclesqt eqaytcvcqp 2881 gwtgrdcaie gtqctagvcg sgrcentend meclcplnra gdrcqyneil neqslnfksn 2941 sfaaygtpkv tkvnitlsvr pasledsvil ytaestlpsg dylalvlrgg haellintaa 3001 rldpvvvrsa eplplnrwtr ieirrrlgeg ilkvgdgper kakapgsdri lslkthlfvg 3061 gvdrstvkin rdvnitkgfd gcisklynsq ksvnllgdir daanvqncge aneidddeye 3121 mpvalpspkv aenerqlmap casdpcengg scseqedmai cscpfgfsgk hcqnhlqlsf 3181 nasfrgdgyv elnrshfqpa leqtyshigi vfttnkpngl lfwwgqeage eytgqdfiaa 3241 avvdgyveys mrldgeeavi rnsdirvdng erhiviakrd entamleldq ildtgdtrpt 3301 inkamklpgn vfvggapdva aftgfrykdn fngcivvveg etvgqinlss aaingvnanv 3361 cpandeplgg teppvv