LOCUS       XP_044252018            3783 aa            linear   INV 09-DEC-2024
DEFINITION  protein isoform X29 [Drosophila takahashii].
ACCESSION   XP_044252018
VERSION     XP_044252018.1
DBLINK      BioProject: PRJNA1194641
DBSOURCE    REFSEQ: accession XM_044396083.1
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ: This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see: Documentation of NCBI's Annotation Process

            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES Location/Qualifiers source 1..3783 /organism="Drosophila takahashii" /strain="IR98-3 E-12201" /db_xref="taxon:29030" /chromosome="X" /sex="female" /tissue_type="Whole fly" /dev_stage="Adult fly" /collected_by="Originally obtained from EHIME-Fly" Protein 1..3783 /product="basement membrane-specific heparan sulfate proteoglycan core protein isoform X29" /calculated_mol_wt=417955 Region 84..>169 /region_name="CCDC66" /note="Coiled-coil domain-containing protein 66; pfam15236" /db_xref="CDD:434558" Region 379..415 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(385,394,405..406) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(398,401,405,411..412) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 408..412 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 418..451 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(423,430,441..442) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(434,437,441,447..448) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 444..448 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 504..536 /region_name="LDLa" /note="Low-density lipoprotein receptor domain class A; smart00192" /db_xref="CDD:197566" Site order(510,518,529..530) /site_type="active" /note="putative binding surface 
[active]" /db_xref="CDD:238060" Site order(522,525,529,535..536) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 532..536 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 542..576 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(547,555,566..567) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(559,562,566,572..573) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 569..573 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 582..616 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(587,595,606..607) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(599,602,606,612..613) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 609..613 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 631..699 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 731..765 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(736,744,755..756) /site_type="active" 
/note="putative binding surface [active]" /db_xref="CDD:238060" Site order(748,751,755,761..762) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 758..762 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 771..805 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(776,784,795..796) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(788,791,795,801..802) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 798..802 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 814..848 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(819,827,838..839) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(831,834,838,844..845) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 841..845 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 866..941 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 869..873 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:143220" Region 882..886 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:143220" Region 905..909 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:143220" Region 919..924 /region_name="Ig strand F" 
/note="Ig strand F [structural motif]" /db_xref="CDD:143220" Region 933..936 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:143220" Region 1040..1170 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region 1227..1280 /region_name="Laminin_EGF" /note="Laminin EGF domain; pfam00053" /db_xref="CDD:395007" Site order(1227,1229,1241,1251,1253,1262) /site_type="active" /note="EGF-like motif [active]" /db_xref="CDD:238012" Region 1287..1331 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domai; smart00180" /db_xref="CDD:214543" Region 1405..1541 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region <1542..1568 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in...; cd00055" /db_xref="CDD:238012" Region 1576..1625 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in...; cd00055" /db_xref="CDD:238012" Site order(1577,1579,1586,1593,1596,1605) /site_type="active" /note="EGF-like motif [active]" /db_xref="CDD:238012" Region <1662..1692 /region_name="Laminin_EGF" /note="Laminin EGF domain; pfam00053" /db_xref="CDD:395007" Region 1758..1892 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region 1985..2067 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 1996..2000 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2012..2016 /region_name="Ig strand C" /note="Ig strand C 
[structural motif]" /db_xref="CDD:409353" Region 2031..2035 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2045..2050 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2060..2063 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409353" Region <2094..2147 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 2178..2261 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2189..2193 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409544" Region 2202..2206 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409544" Region 2225..2229 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409544" Region 2239..2244 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409544" Region 2274..2354 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 2291..2295 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409570" Region 2304..2308 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409570" Region 2323..2327 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409570" Region 2337..2342 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409570" Region 2350..2353 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409570" Region 2372..2448 /region_name="I-set" /note="Immunoglobulin I-set domain; pfam07679" /db_xref="CDD:400151" Region 2380..2384 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409543" Region 2393..2397 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409543" Region 2428..2433 /region_name="Ig strand F" /note="Ig strand F 
[structural motif]" /db_xref="CDD:409543" Region 2441..2444 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409543" Region 2461..2548 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2471..2475 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409548" Region 2484..2488 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409548" Region 2528..2533 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409548" Region 2673..2749 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2682..2686 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409541" Region 2695..2699 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409541" Region 2715..2719 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409541" Region 2729..2734 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409541" Region 2761..>2828 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2772..2776 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409390" Region 2788..2795 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409390" Region 2811..2815 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409390" Region 2905..2973 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 3003..3069 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 3011..3015 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409390" Region 3024..3028 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409390" Region 3043..3047 /region_name="Ig strand E" /note="Ig strand E 
[structural motif]" /db_xref="CDD:409390" Region 3057..3062 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409390" Region 3091..3238 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" Region 3261..3293 /region_name="EGF" /note="EGF-like domain; pfam00008" /db_xref="CDD:394967" Region 3341..3494 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" Region 3548..3580 /region_name="EGF_CA" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; cd00054" /db_xref="CDD:238011" Region 3589..3742 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. 
Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" CDS 1..3783 /gene="trol" /coded_by="XM_044396083.1:209..11560" /db_xref="GeneID:108055788" ORIGIN 1 mgspgsqapa iaigisngrr ghtgslllrl llvafvlnac hvpptnakqi tkskvddqdf 61 iladtqslqg svdldddeaf lippdseekl dkkftpegnw wsqglhrvrr slnsffgsdd 121 ddqekererq rqrqnnrdaa nrqkelrrqq kesqnrekql rlerqerqrl akrnnhvvfn 181 rvtdprkras dlydeneasg lneeetttyr tyfvvnepys ddykerdslq fqnlqkllde 241 dlrnffnrnf ennddeeqei hstlervdqt kdhfkirvql rvdlpssind fgskleqqln 301 vykridrlga stdgvftfte ssvykyehqy divtprvivs gqpitekpve apqefdedpi 361 evalptdeve gsgqdsgscr gdatfqcrrs gkticdemrc dgsrdcpdae deegcevcne 421 lqfkcdnkcl plnkrcdnry dcedqtdeag cqryeveesq pqpqpqpqpe pepepepepe 481 pepepepepe epitdneqpe qnsecratef rcnngdcidi rkrcdhisdc segedeneec 541 rcyadqfrcn ngdciaesah cdgnidcsdq sdeldcggds qclpnqfrck ngqcvsstar 601 cnkrsdcldg sdeqncanep nnsgrgtnql klktypdnqi ikesrevifr crdegpnrak 661 vkwsrpggrp lppgftdrng rleipnirve dagayvceav gyanyipgqh vtvnlnverl 721 nereirpdsa cteyqatcmn gecidksgic dghpdcsdgs dehscslglk cqpnqfmcsn 781 skcvdrtwrc dgendcgdns detscdpeps dapcrydefq crsghcipks fqcdymndct 841 dgtdeigcsv pspmtlpaps ivvmeyevle ltcvgtgvpt ptivwrlnwg hvpekcesks 901 yggtgtlrcp nmrpqdsgay scefintrgt fypktnsivt vtpvrsdvck agffnmlark 961 seecvqcfcf gvstncdsan lftyaiqppi lshrvvsvel spfrqivine aspgqdlltl 1021 hhgvqfrasn vhyngretpf lalpaeymgn qlksyggnlr yevryngngr pvsgpdviit 1081 gnsftlthrv rthpgqnnrv tipflpggwt kpdgrkgtre dimmilanvd nilirlgyld 1141 starevdlin ialdsagsad qglgsaslve kctcppgyvg dscescasgy vrqargpwlg 1201 hcvpftpepc pagtygnprl gvpcqecpcp hagannfasg cqqspdgdvi crcnegyagk 1261 rcehcaqgyq gnplapggvc rkipdsscnv dgtyniysng tcqckdsvig eqcdtcapks 1321 fhlnsftytg ciecfcsgvg ldcdssswyr nqvtstfgrt rvnhgfalis dymrntpvtv 1381 pvsmstqana lsfvgsaeqa gntlywslpa aflgnkltsy ggklsytlsy splpsgimsr 1441 nsapdvviks gedlrlihyr ksqvspsvan tyaveikesa wqrgdelvpn rehvlmalsn 1501 itaiyikaty ttstkeaslr svtldtatat nlgtaravev 
eqcrcpegyl glsceqcapg 1561 ytrdpeagiy lglcrpcecn ghskycnset gecescsdnt egfncdrcaa gyvgdatrgt 1621 sydcqyddgg yptsrppapg nqtaeclvnc qqegtagcrg yqceckrnva gdrcdqcrpg 1681 tyglsaqnpd gckecycsgl tnqcrsasly rqlipvdfis tpplitdefg dimdrdnlvp 1741 dvprnvytyk htsytpkyws lrgsvlgnql lsyggrleys livesvgrdh rgkdvvlign 1801 glkliwsrpd ghdneqeyhv rlhedeqwtv edrgsarqat radfmtvlsd lqhililatp 1861 kvptvstsis nvilessitt rapgathasd ielcqcpsgy tgtscescap lhyrdasgrc 1921 sqcpcdasnt escglvsggn vecqcrprwr gdrcreietn eptpepdtst ddpvrtqiiv 1981 siarpeitil pvggsltlsc tgrmrwtnsp vfvnwykqgs hlpegvevqg gnlqlfnlqi 2041 sdsgiyicqa vsnetghsft dhvsitvsqe dqrspahivd lpndvtfeey vsneivceve 2101 gnppptvtwt rvdghadaqs trtdnnrlvf dsprksdegr yrcqaensls reekyvvvyv 2161 rsnppqpppq qdrlyitpqe vngvagdsfq lscqftsaas lrydwshdgr slsassprnv 2221 vvrgnvlevr danvrdsgty tcvafdlrtr rnftesarvy ieqpnepgil gdkphiltle 2281 qniiivqged lsitceasgt pypsikwtkv qenlaenvri sgnvltiygg rsenrglysc 2341 iaenshgsdq sstsidiepr erpsltidta tqkvsvgsqa slycaaqgip eptvewvrtd 2401 gqplsprhkv qapgyvvidd ivlddsgtye crasniagqv sglatinvqe ptlvriepdr 2461 qhhivtqgde lslscvgsgv ptpsvfwsfe grdvdrmgvp egavfaqpfr tntadvkifr 2521 vskenegiyv chgsndaged qqyirvevqp rrgdvgaggd dngdvdtrqp pnrpqiqpnp 2581 lsnerlttel gnnvtlicnv dnvntewerv dgtplphnay tvrntlvivf vepqnlgqyr 2641 cngigrdgrv eahvvrelvl lplpritfyp nipltvelgq nldvycqven vrpedvhwtt 2701 dnnrplpssv riegnvlrfa sitqaaagey rcsatnqygs rsknarvvvk qpsgfqpvph 2761 sqvqqrqvgd siqlrcrltt qygdevrgni qfnwyredgs plprgvrpds qvlqlvklqp 2821 edegryicns ydlgsgqqlp pvsidlqvlr tttqypfnrf kggvslkdtp cmvlyicaav 2881 paapqnpiyl ppvapprspe rilepqlsls vqssnlpagd gttvecfssd dsypdvvwer 2941 adgaplsenv qqvgnnlvis nvastdagny vckcktdegd lyttsyklev eeqphelkss 3001 kivyakvggn adlqcgaded rqpsyrwsrq ygqlqagrsl qneklsldrv qandagtyvc 3061 saqysdgetv dfpnilvvtg aipqfrqepr symsfptlsn ssfkfnfelt frpenadgll 3121 lfngqtrgsg dyialslkdr yaefrfdfgg kpllvraeep laldewhtvr vsrfkrdgyi 3181 qvddqhpvaf ptsqhqqipq leliedlyig gvpnweflpa eavgqqsgfv 
gcisrltlqg 3241 rtvelireak fkegitdcrp caqgpcqnkg vclesqteqa ytcvcqpgwt grdcaiegtq 3301 ctagvcgsgr centendmec lcplnragdr cqyneilneq slnfksnsfa aygtpkvtkv 3361 nitlsvrpas ledsvilyta estlpsgdyl alvlrgghae llintaarld pvvvrsaepl 3421 plnrwtriei rrrlgegilk vgdgperkak apgsdrilsl kthlfvggvd rstvkinrdv 3481 nitkgfdgci sklynsqksv nllgdirdaa nvqncgeane idddeyempv alpspkvaen 3541 erqlmapcas dpcenggscs eqedmaicsc pfgfsgkhcq nhlqlsfnas frgdgyveln 3601 rshfqpaleq tyshigivft tnkpngllfw wgqeageeyt gqdfiaaavv dgyveysmrl 3661 dgeeavirns dirvdngerh iviakrdent amleldqild tgdtrptink amklpgnvfv 3721 ggapdvaaft gfrykdnfng civvvegetv gqinlssaai ngvnanvcpa ndeplggtep 3781 pvv