Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]
LOCUS XP_070074545 3356 aa linear INV 09-DEC-2024 protein isoform X31 [Drosophila takahashii]. ACCESSION XP_070074545 VERSION XP_070074545.1 DBLINK BioProject: PRJNA1194641 DBSOURCE REFSEQ: accession XM_070218444.1 KEYWORDS RefSeq. SOURCE Drosophila takahashii ORGANISM Drosophila takahashii Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora. COMMENT MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NC_091683) annotated using gene prediction method: Gnomon. Also see: Documentation of NCBI's Annotation Process ##Genome-Annotation-Data-START## Annotation Provider :: NCBI RefSeq Annotation Status :: Full annotation Annotation Name :: GCF_030179915.1-RS_2024_12 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 10.3 Annotation Method :: Gnomon; cmsearch; tRNAscan-SE Features Annotated :: Gene; mRNA; CDS; ncRNA Annotation Date :: 12/07/2024 ##Genome-Annotation-Data-END## COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..3356 /organism="Drosophila takahashii" /strain="IR98-3 E-12201" /db_xref="taxon:29030" /chromosome="X" /sex="female" /tissue_type="Whole fly" /dev_stage="Adult fly" /collected_by="Originally obtained from EHIME-Fly" Protein 1..3356 /product="basement membrane-specific heparan sulfate proteoglycan core protein isoform X31" /calculated_mol_wt=368131 Region 22..52 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(23,31,42..43) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(35,38,42,48..49) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 45..49 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 53..87 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(58,66,77..78) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(70,73,77,83..84) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 80..84 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 93..127 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(98,106,117..118) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(110,113,117,123..124) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 120..124 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 142..210 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 242..276 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(247,255,266..267) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(259,262,266,272..273) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 269..273 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 282..316 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(287,295,306..307) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(299,302,306,312..313) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 309..313 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 325..359 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(330,338,349..350) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(342,345,349,355..356) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 352..356 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 377..452 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 380..384 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 393..397 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 416..420 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 430..435 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 444..447 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409353" Region 551..681 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region 738..791 /region_name="Laminin_EGF" /note="Laminin EGF domain; pfam00053" /db_xref="CDD:395007" Site order(738,740,752,762,764,773) /site_type="active" /note="EGF-like motif [active]" /db_xref="CDD:238012" Region 798..835 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domai; smart00180" /db_xref="CDD:214543" Region 916..1052 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region <1053..1079 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in...; cd00055" /db_xref="CDD:238012" Region 1087..1136 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in...; cd00055" /db_xref="CDD:238012" Site order(1088,1090,1097,1104,1107,1116) /site_type="active" /note="EGF-like motif [active]" /db_xref="CDD:238012" Region <1173..1203 /region_name="Laminin_EGF" /note="Laminin EGF domain; pfam00053" /db_xref="CDD:395007" Region 1269..1403 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region 1558..1640 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 1569..1573 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 1585..1589 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 1604..1608 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 1618..1623 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 1633..1636 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409353" Region <1667..1720 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 1751..1834 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 1762..1766 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 1775..1779 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 1798..1802 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 1812..1817 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 1847..1927 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 1864..1868 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409570" Region 1877..1881 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409570" Region 1896..1900 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409570" Region 1910..1915 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409570" Region 1923..1926 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409570" Region 1945..2021 /region_name="I-set" /note="Immunoglobulin I-set domain; pfam07679" /db_xref="CDD:400151" Region 1953..1957 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 1966..1970 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2001..2006 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2014..2017 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409353" Region 2034..2121 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2044..2048 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2057..2061 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2101..2106 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2246..2322 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2255..2259 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2268..2272 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2288..2292 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2302..2307 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2334..>2401 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2345..2349 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2361..2368 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2384..2388 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2484..2552 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 2508..2512 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2529..2532 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2542..2547 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2576..2642 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2584..2588 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2597..2601 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2616..2620 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2630..2635 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2664..2811 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" Region 2834..2866 /region_name="EGF" /note="EGF-like domain; pfam00008" /db_xref="CDD:394967" Region 2914..3067 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" Region 3121..3153 /region_name="EGF_CA" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; cd00054" /db_xref="CDD:238011" Region 3162..3315 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" CDS 1..3356 /gene="trol" /coded_by="XM_070218444.1:346..10416" /db_xref="GeneID:108055788" ORIGIN 1 mdcsdgsdei acsslsvlpc pqhqcpsgrc ysesercdrh rhcedgsdea nccyadqfrc 61 nngdciaesa hcdgnidcsd qsdeldcggd sqclpnqfrc kngqcvssta rcnkrsdcld 121 gsdeqncane pnnsgrgtnq lklktypdnq iikesrevif rcrdegpnra kvkwsrpggr 181 plppgftdrn grleipnirv edagayvcea vgyanyipgq hvtvnlnver lnereirpds 241 acteyqatcm ngecidksgi cdghpdcsdg sdehscslgl kcqpnqfmcs nskcvdrtwr 301 cdgendcgdn sdetscdpep sdapcrydef qcrsghcipk sfqcdymndc tdgtdeigcs 361 vpspmtlpap sivvmeyevl eltcvgtgvp tptivwrlnw ghvpekcesk syggtgtlrc 421 pnmrpqdsga yscefintrg tfypktnsiv tvtpvrsdvc kagffnmlar kseecvqcfc 481 fgvstncdsa nlftyaiqpp ilshrvvsve lspfrqivin easpgqdllt lhhgvqfras 541 nvhyngretp flalpaeymg nqlksyggnl ryevryngng rpvsgpdvii tgnsftlthr 601 vrthpgqnnr vtipflpggw tkpdgrkgtr edimmilanv dnilirlgyl dstarevdli 661 nialdsagsa dqglgsaslv ekctcppgyv gdscescasg yvrqargpwl ghcvpftpep 721 cpagtygnpr lgvpcqecpc phagannfas gcqqspdgdv icrcnegyag krcehcaqgy 781 qgnplapggv crkipdsscn vdgtyniysn gtcqckdsvi geqcdtcapk sfhlnsftyt 841 gciecfcsgv gldcdssswy rnqvtstfgr trvnhgfali sdymrntpvt vpvsmstqan 901 alsfvgsaeq agntlywslp aaflgnklts yggklsytls ysplpsgims rnsapdvvik 961 sgedlrlihy rksqvspsva ntyaveikes awqrgdelvp nrehvlmals nitaiyikat 1021 yttstkeasl rsvtldtata tnlgtarave veqcrcpegy lglsceqcap gytrdpeagi 1081 ylglcrpcec nghskycnse tgecescsdn tegfncdrca agyvgdatrg tsydcqyddg 1141 gyptsrppap gnqtaeclvn cqqegtagcr gyqceckrnv agdrcdqcrp gtyglsaqnp 1201 dgckecycsg ltnqcrsasl yrqlipvdfi stpplitdef gdimdrdnlv pdvprnvyty 1261 khtsytpkyw slrgsvlgnq llsyggrley slivesvgrd hrgkdvvlig nglkliwsrp 1321 dghdneqeyh vrlhedeqwt vedrgsarqa tradfmtvls dlqhililat pkvptvstsi 1381 snvilessit trapgathas dielcqcpsg ytgtscesca plhyrdasgr csqcpcdasn 1441 tescglvsgg nvecqcrprw rgdrcreidt spiieeppqi cdlsrgfccs gfqfdiapne 1501 tisfndtlqi ykgnriignm tklrygcpsr etneptpepd tstddpvrtq iivsiarpei 1561 tilpvggslt lsctgrmrwt nspvfvnwyk qgshlpegve vqggnlqlfn lqisdsgiyi 1621 cqavsnetgh sftdhvsitv sqedqrspah ivdlpndvtf eeyvsneivc evegnppptv 1681 twtrvdghad aqstrtdnnr lvfdsprksd egryrcqaen slsreekyvv vyvrsnppqp 1741 ppqqdrlyit pqevngvagd sfqlscqfts aaslrydwsh dgrslsassp rnvvvrgnvl 1801 evrdanvrds gtytcvafdl rtrrnftesa rvyieqpnep gilgdkphil tleqniiivq 1861 gedlsitcea sgtpypsikw tkvqenlaen vrisgnvlti yggrsenrgl ysciaenshg 1921 sdqsstsidi eprerpslti dtatqkvsvg sqaslycaaq gipeptvewv rtdgqplspr 1981 hkvqapgyvv iddivlddsg tyecrasnia gqvsglatin vqeptlvrie pdrqhhivtq 2041 gdelslscvg sgvptpsvfw sfegrdvdrm gvpegavfaq pfrtntadvk ifrvskeneg 2101 iyvchgsnda gedqqyirve vqprrgdvga ggddngdvdt rqppnrpqiq pnplsnerlt 2161 telgnnvtli cnvdnvntew ervdgtplph naytvrntlv ivfvepqnlg qyrcngigrd 2221 grveahvvre lvllplprit fypnipltve lgqnldvycq venvrpedvh wttdnnrplp 2281 ssvriegnvl rfasitqaaa geyrcsatnq ygsrsknarv vvkqpsgfqp vphsqvqqrq 2341 vgdsiqlrcr lttqygdevr gniqfnwyre dgsplprgvr pdsqvlqlvk lqpedegryi 2401 cnsydlgsgq qlppvsidlq vlrtttqypf nrfkggvslk dtpcmvlyic aavpaapqnp 2461 iylppvappr sperilepql slsvqssnlp agdgttvecf ssddsypdvv weradgapls 2521 envqqvgnnl visnvastda gnyvckcktd egdlyttsyk leveeqphel ksskivyakv 2581 ggnadlqcga dedrqpsyrw srqygqlqag rslqneklsl drvqandagt yvcsaqysdg 2641 etvdfpnilv vtgaipqfrq eprsymsfpt lsnssfkfnf eltfrpenad glllfngqtr 2701 gsgdyialsl kdryaefrfd fggkpllvra eeplaldewh tvrvsrfkrd gyiqvddqhp 2761 vafptsqhqq ipqleliedl yiggvpnwef lpaeavgqqs gfvgcisrlt lqgrtvelir 2821 eakfkegitd crpcaqgpcq nkgvclesqt eqaytcvcqp gwtgrdcaie gtqctagvcg 2881 sgrcentend meclcplnra gdrcqyneil neqslnfksn sfaaygtpkv tkvnitlsvr 2941 pasledsvil ytaestlpsg dylalvlrgg haellintaa rldpvvvrsa eplplnrwtr 3001 ieirrrlgeg ilkvgdgper kakapgsdri lslkthlfvg gvdrstvkin rdvnitkgfd 3061 gcisklynsq ksvnllgdir daanvqncge aneidddeye mpvalpspkv aenerqlmap 3121 casdpcengg scseqedmai cscpfgfsgk hcqnhlqlsf nasfrgdgyv elnrshfqpa 3181 leqtyshigi vfttnkpngl lfwwgqeage eytgqdfiaa avvdgyveys mrldgeeavi 3241 rnsdirvdng erhiviakrd entamleldq ildtgdtrpt inkamklpgn vfvggapdva 3301 aftgfrykdn fngcivvveg etvgqinlss aaingvnanv cpandeplgg teppvv