Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]
LOCUS XP_070074542 3845 aa linear INV 09-DEC-2024 protein isoform X28 [Drosophila takahashii]. ACCESSION XP_070074542 VERSION XP_070074542.1 DBLINK BioProject: PRJNA1194641 DBSOURCE REFSEQ: accession XM_070218441.1 KEYWORDS RefSeq. SOURCE Drosophila takahashii ORGANISM Drosophila takahashii Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora. COMMENT MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NC_091683) annotated using gene prediction method: Gnomon. Also see: Documentation of NCBI's Annotation Process ##Genome-Annotation-Data-START## Annotation Provider :: NCBI RefSeq Annotation Status :: Full annotation Annotation Name :: GCF_030179915.1-RS_2024_12 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 10.3 Annotation Method :: Gnomon; cmsearch; tRNAscan-SE Features Annotated :: Gene; mRNA; CDS; ncRNA Annotation Date :: 12/07/2024 ##Genome-Annotation-Data-END## COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..3845 /organism="Drosophila takahashii" /strain="IR98-3 E-12201" /db_xref="taxon:29030" /chromosome="X" /sex="female" /tissue_type="Whole fly" /dev_stage="Adult fly" /collected_by="Originally obtained from EHIME-Fly" Protein 1..3845 /product="basement membrane-specific heparan sulfate proteoglycan core protein isoform X28" /calculated_mol_wt=424910 Region 84..>169 /region_name="CCDC66" /note="Coiled-coil domain-containing protein 66; pfam15236" /db_xref="CDD:434558" Region 379..415 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(385,394,405..406) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(398,401,405,411..412) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 408..412 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 418..451 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(423,430,441..442) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(434,437,441,447..448) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 444..448 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 504..536 /region_name="LDLa" /note="Low-density lipoprotein receptor domain class A; smart00192" /db_xref="CDD:197566" Site order(510,518,529..530) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(522,525,529,535..536) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 532..536 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 542..576 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(547,555,566..567) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(559,562,566,572..573) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 569..573 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 582..616 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(587,595,606..607) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(599,602,606,612..613) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 609..613 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 631..699 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 731..765 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(736,744,755..756) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(748,751,755,761..762) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 758..762 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 771..805 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(776,784,795..796) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(788,791,795,801..802) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 798..802 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 814..848 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(819,827,838..839) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(831,834,838,844..845) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 841..845 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 866..941 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 869..873 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 882..886 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 905..909 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 919..924 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 933..936 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409353" Region 1040..1170 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region 1227..1280 /region_name="Laminin_EGF" /note="Laminin EGF domain; pfam00053" /db_xref="CDD:395007" Site order(1227,1229,1241,1251,1253,1262) /site_type="active" /note="EGF-like motif [active]" /db_xref="CDD:238012" Region 1287..1331 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domai; smart00180" /db_xref="CDD:214543" Region 1405..1541 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region <1542..1568 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in...; cd00055" /db_xref="CDD:238012" Region 1576..1625 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in...; cd00055" /db_xref="CDD:238012" Site order(1577,1579,1586,1593,1596,1605) /site_type="active" /note="EGF-like motif [active]" /db_xref="CDD:238012" Region <1662..1692 /region_name="Laminin_EGF" /note="Laminin EGF domain; pfam00053" /db_xref="CDD:395007" Region 1758..1892 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region 2047..2129 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2058..2062 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2074..2078 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2093..2097 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2107..2112 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2122..2125 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409353" Region <2156..2209 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 2240..2323 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2251..2255 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2264..2268 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2287..2291 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2301..2306 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2336..2416 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 2353..2357 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409570" Region 2366..2370 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409570" Region 2385..2389 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409570" Region 2399..2404 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409570" Region 2412..2415 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409570" Region 2434..2510 /region_name="I-set" /note="Immunoglobulin I-set domain; pfam07679" /db_xref="CDD:400151" Region 2442..2446 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2455..2459 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2490..2495 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2503..2506 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409353" Region 2523..2610 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2533..2537 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2546..2550 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2590..2595 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2735..2811 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2744..2748 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2757..2761 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2777..2781 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2791..2796 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2823..>2890 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2834..2838 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2850..2857 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2873..2877 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2967..3035 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 3065..3131 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 3073..3077 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 3086..3090 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 3105..3109 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 3119..3124 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 3153..3300 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" Region 3323..3355 /region_name="EGF" /note="EGF-like domain; pfam00008" /db_xref="CDD:394967" Region 3403..3556 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" Region 3610..3642 /region_name="EGF_CA" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; cd00054" /db_xref="CDD:238011" Region 3651..3804 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" CDS 1..3845 /gene="trol" /coded_by="XM_070218441.1:209..11746" /db_xref="GeneID:108055788" ORIGIN 1 mgspgsqapa iaigisngrr ghtgslllrl llvafvlnac hvpptnakqi tkskvddqdf 61 iladtqslqg svdldddeaf lippdseekl dkkftpegnw wsqglhrvrr slnsffgsdd 121 ddqekererq rqrqnnrdaa nrqkelrrqq kesqnrekql rlerqerqrl akrnnhvvfn 181 rvtdprkras dlydeneasg lneeetttyr tyfvvnepys ddykerdslq fqnlqkllde 241 dlrnffnrnf ennddeeqei hstlervdqt kdhfkirvql rvdlpssind fgskleqqln 301 vykridrlga stdgvftfte ssvykyehqy divtprvivs gqpitekpve apqefdedpi 361 evalptdeve gsgqdsgscr gdatfqcrrs gkticdemrc dgsrdcpdae deegcevcne 421 lqfkcdnkcl plnkrcdnry dcedqtdeag cqryeveesq pqpqpqpqpe pepepepepe 481 pepepepepe epitdneqpe qnsecratef rcnngdcidi rkrcdhisdc segedeneec 541 rcyadqfrcn ngdciaesah cdgnidcsdq sdeldcggds qclpnqfrck ngqcvsstar 601 cnkrsdcldg sdeqncanep nnsgrgtnql klktypdnqi ikesrevifr crdegpnrak 661 vkwsrpggrp lppgftdrng rleipnirve dagayvceav gyanyipgqh vtvnlnverl 721 nereirpdsa cteyqatcmn gecidksgic dghpdcsdgs dehscslglk cqpnqfmcsn 781 skcvdrtwrc dgendcgdns detscdpeps dapcrydefq crsghcipks fqcdymndct 841 dgtdeigcsv pspmtlpaps ivvmeyevle ltcvgtgvpt ptivwrlnwg hvpekcesks 901 yggtgtlrcp nmrpqdsgay scefintrgt fypktnsivt vtpvrsdvck agffnmlark 961 seecvqcfcf gvstncdsan lftyaiqppi lshrvvsvel spfrqivine aspgqdlltl 1021 hhgvqfrasn vhyngretpf lalpaeymgn qlksyggnlr yevryngngr pvsgpdviit 1081 gnsftlthrv rthpgqnnrv tipflpggwt kpdgrkgtre dimmilanvd nilirlgyld 1141 starevdlin ialdsagsad qglgsaslve kctcppgyvg dscescasgy vrqargpwlg 1201 hcvpftpepc pagtygnprl gvpcqecpcp hagannfasg cqqspdgdvi crcnegyagk 1261 rcehcaqgyq gnplapggvc rkipdsscnv dgtyniysng tcqckdsvig eqcdtcapks 1321 fhlnsftytg ciecfcsgvg ldcdssswyr nqvtstfgrt rvnhgfalis dymrntpvtv 1381 pvsmstqana lsfvgsaeqa gntlywslpa aflgnkltsy ggklsytlsy splpsgimsr 1441 nsapdvviks gedlrlihyr ksqvspsvan tyaveikesa wqrgdelvpn rehvlmalsn 1501 itaiyikaty ttstkeaslr svtldtatat nlgtaravev eqcrcpegyl glsceqcapg 1561 ytrdpeagiy lglcrpcecn ghskycnset gecescsdnt egfncdrcaa gyvgdatrgt 1621 sydcqyddgg yptsrppapg nqtaeclvnc qqegtagcrg yqceckrnva gdrcdqcrpg 1681 tyglsaqnpd gckecycsgl tnqcrsasly rqlipvdfis tpplitdefg dimdrdnlvp 1741 dvprnvytyk htsytpkyws lrgsvlgnql lsyggrleys livesvgrdh rgkdvvlign 1801 glkliwsrpd ghdneqeyhv rlhedeqwtv edrgsarqat radfmtvlsd lqhililatp 1861 kvptvstsis nvilessitt rapgathasd ielcqcpsgy tgtscescap lhyrdasgrc 1921 sqcpcdasnt escglvsggn vecqcrprwr gdrcreidts piieeppqic dlsrgfccsg 1981 fqfdiapnet isfndtlqiy kgnriignmt klrygcpsre tneptpepdt stddpvrtqi 2041 ivsiarpeit ilpvggsltl sctgrmrwtn spvfvnwykq gshlpegvev qggnlqlfnl 2101 qisdsgiyic qavsnetghs ftdhvsitvs qedqrspahi vdlpndvtfe eyvsneivce 2161 vegnppptvt wtrvdghada qstrtdnnrl vfdsprksde gryrcqaens lsreekyvvv 2221 yvrsnppqpp pqqdrlyitp qevngvagds fqlscqftsa aslrydwshd grslsasspr 2281 nvvvrgnvle vrdanvrdsg tytcvafdlr trrnftesar vyieqpnepg ilgdkphilt 2341 leqniiivqg edlsitceas gtpypsikwt kvqenlaenv risgnvltiy ggrsenrgly 2401 sciaenshgs dqsstsidie prerpsltid tatqkvsvgs qaslycaaqg ipeptvewvr 2461 tdgqplsprh kvqapgyvvi ddivlddsgt yecrasniag qvsglatinv qeptlvriep 2521 drqhhivtqg delslscvgs gvptpsvfws fegrdvdrmg vpegavfaqp frtntadvki 2581 frvskenegi yvchgsndag edqqyirvev qprrgdvgag gddngdvdtr qppnrpqiqp 2641 nplsnerltt elgnnvtlic nvdnvntewe rvdgtplphn aytvrntlvi vfvepqnlgq 2701 yrcngigrdg rveahvvrel vllplpritf ypnipltvel gqnldvycqv envrpedvhw 2761 ttdnnrplps svriegnvlr fasitqaaag eyrcsatnqy gsrsknarvv vkqpsgfqpv 2821 phsqvqqrqv gdsiqlrcrl ttqygdevrg niqfnwyred gsplprgvrp dsqvlqlvkl 2881 qpedegryic nsydlgsgqq lppvsidlqv lrtttqypfn rfkggvslkd tpcmvlyica 2941 avpaapqnpi ylppvapprs perilepqls lsvqssnlpa gdgttvecfs sddsypdvvw 3001 eradgaplse nvqqvgnnlv isnvastdag nyvckcktde gdlyttsykl eveeqphelk 3061 sskivyakvg gnadlqcgad edrqpsyrws rqygqlqagr slqneklsld rvqandagty 3121 vcsaqysdge tvdfpnilvv tgaipqfrqe prsymsfptl snssfkfnfe ltfrpenadg 3181 lllfngqtrg sgdyialslk dryaefrfdf ggkpllvrae eplaldewht vrvsrfkrdg 3241 yiqvddqhpv afptsqhqqi pqleliedly iggvpnwefl paeavgqqsg fvgcisrltl 3301 qgrtvelire akfkegitdc rpcaqgpcqn kgvclesqte qaytcvcqpg wtgrdcaieg 3361 tqctagvcgs grcentendm eclcplnrag drcqyneiln eqslnfksns faaygtpkvt 3421 kvnitlsvrp asledsvily taestlpsgd ylalvlrggh aellintaar ldpvvvrsae 3481 plplnrwtri eirrrlgegi lkvgdgperk akapgsdril slkthlfvgg vdrstvkinr 3541 dvnitkgfdg cisklynsqk svnllgdird aanvqncgea neidddeyem pvalpspkva 3601 enerqlmapc asdpcenggs cseqedmaic scpfgfsgkh cqnhlqlsfn asfrgdgyve 3661 lnrshfqpal eqtyshigiv fttnkpngll fwwgqeagee ytgqdfiaaa vvdgyveysm 3721 rldgeeavir nsdirvdnge rhiviakrde ntamleldqi ldtgdtrpti nkamklpgnv 3781 fvggapdvaa ftgfrykdnf ngcivvvege tvgqinlssa aingvnanvc pandeplggt 3841 eppvv