Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]
LOCUS XP_044252026 3723 aa linear INV 09-DEC-2024 protein isoform X39 [Drosophila takahashii]. ACCESSION XP_044252026 VERSION XP_044252026.1 DBLINK BioProject: PRJNA1194641 DBSOURCE REFSEQ: accession XM_044396091.1 KEYWORDS RefSeq. SOURCE Drosophila takahashii ORGANISM Drosophila takahashii Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora. COMMENT MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NC_091683) annotated using gene prediction method: Gnomon. Also see: Documentation of NCBI's Annotation Process ##Genome-Annotation-Data-START## Annotation Provider :: NCBI RefSeq Annotation Status :: Full annotation Annotation Name :: GCF_030179915.1-RS_2024_12 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 10.3 Annotation Method :: Gnomon; cmsearch; tRNAscan-SE Features Annotated :: Gene; mRNA; CDS; ncRNA Annotation Date :: 12/07/2024 ##Genome-Annotation-Data-END## COMPLETENESS: full length. FEATURES Location/Qualifiers source 1..3723 /organism="Drosophila takahashii" /strain="IR98-3 E-12201" /db_xref="taxon:29030" /chromosome="X" /sex="female" /tissue_type="Whole fly" /dev_stage="Adult fly" /collected_by="Originally obtained from EHIME-Fly" Protein 1..3723 /product="basement membrane-specific heparan sulfate proteoglycan core protein isoform X39" /calculated_mol_wt=411125 Region 84..>169 /region_name="CCDC66" /note="Coiled-coil domain-containing protein 66; pfam15236" /db_xref="CDD:434558" Region 348..384 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(354,363,374..375) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(367,370,374,380..381) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 377..381 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 387..420 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(392,399,410..411) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(403,406,410,416..417) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 413..417 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 473..505 /region_name="LDLa" /note="Low-density lipoprotein receptor domain class A; smart00192" /db_xref="CDD:197566" Site order(479,487,498..499) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(491,494,498,504..505) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 501..505 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 511..545 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(516,524,535..536) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(528,531,535,541..542) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 538..542 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 551..585 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(556,564,575..576) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(568,571,575,581..582) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 578..582 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 600..668 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 700..734 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(705,713,724..725) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(717,720,724,730..731) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 727..731 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 740..774 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(745,753,764..765) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(757,760,764,770..771) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 767..771 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 783..817 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(788,796,807..808) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(800,803,807,813..814) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 810..814 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 835..910 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 838..842 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:143220" Region 851..855 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:143220" Region 874..878 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:143220" Region 888..893 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:143220" Region 902..905 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:143220" Region 1009..1139 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region 1196..1249 /region_name="Laminin_EGF" /note="Laminin EGF domain; pfam00053" /db_xref="CDD:395007" Site order(1196,1198,1210,1220,1222,1231) /site_type="active" /note="EGF-like motif [active]" /db_xref="CDD:238012" Region 1256..1300 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domai; smart00180" /db_xref="CDD:214543" Region 1374..1510 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region <1511..1537 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in...; cd00055" /db_xref="CDD:238012" Region 1545..1594 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in...; cd00055" /db_xref="CDD:238012" Site order(1546,1548,1555,1562,1565,1574) /site_type="active" /note="EGF-like motif [active]" /db_xref="CDD:238012" Region <1631..1661 /region_name="Laminin_EGF" /note="Laminin EGF domain; pfam00053" /db_xref="CDD:395007" Region 1727..1861 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region 1954..2036 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 1965..1969 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 1981..1985 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2000..2004 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2014..2019 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2029..2032 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409353" Region <2063..2116 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 2147..2230 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2158..2162 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409544" Region 2171..2175 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409544" Region 2194..2198 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409544" Region 2208..2213 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409544" Region 2243..2323 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 2260..2264 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409570" Region 2273..2277 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409570" Region 2292..2296 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409570" Region 2306..2311 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409570" Region 2319..2322 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409570" Region 2341..2417 /region_name="I-set" /note="Immunoglobulin I-set domain; pfam07679" /db_xref="CDD:400151" Region 2349..2353 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409543" Region 2362..2366 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409543" Region 2397..2402 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409543" Region 2410..2413 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409543" Region 2430..2517 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2440..2444 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409548" Region 2453..2457 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409548" Region 2497..2502 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409548" Region 2642..2718 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2651..2655 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409541" Region 2664..2668 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409541" Region 2684..2688 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409541" Region 2698..2703 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409541" Region 2730..>2797 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2741..2745 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409390" Region 2757..2764 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409390" Region 2780..2784 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409390" Region 2845..2913 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 2943..3009 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2951..2955 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409390" Region 2964..2968 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409390" Region 2983..2987 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409390" Region 2997..3002 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409390" Region 3031..3178 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" Region 3201..3233 /region_name="EGF" /note="EGF-like domain; pfam00008" /db_xref="CDD:394967" Region 3281..3434 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" Region 3488..3520 /region_name="EGF_CA" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; cd00054" /db_xref="CDD:238011" Region 3529..3682 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" CDS 1..3723 /gene="trol" /coded_by="XM_044396091.1:209..11380" /db_xref="GeneID:108055788" ORIGIN 1 mgspgsqapa iaigisngrr ghtgslllrl llvafvlnac hvpptnakqi tkskvddqdf 61 iladtqslqg svdldddeaf lippdseekl dkkftpegnw wsqglhrvrr slnsffgsdd 121 ddqekererq rqrqnnrdaa nrqkelrrqq kesqnrekql rlerqerqrl akrnnhvvfn 181 rvtdprkras dlydeneasg lneeetttyr tyfvvnepys ddykerdslq fqnlqkllde 241 dlrnffnrnf ennddeeqei hstlervdqt kdhfkirvql rvdlpssind fgskleqqln 301 vykridrlga stdgvftfte ssefdedpie valptdeveg sgqdsgscrg datfqcrrsg 361 kticdemrcd gsrdcpdaed eegcevcnel qfkcdnkclp lnkrcdnryd cedqtdeagc 421 qryeveesqp qpqpqpqpep epepepepep epepepepee pitdneqpeq nsecratefr 481 cnngdcidir krcdhisdcs egedeneecr cyadqfrcnn gdciaesahc dgnidcsdqs 541 deldcggdsq clpnqfrckn gqcvsstarc nkrsdcldgs deqncanepn nsgrgtnqlk 601 lktypdnqii kesrevifrc rdegpnrakv kwsrpggrpl ppgftdrngr leipnirved 661 agayvceavg yanyipgqhv tvnlnverln ereirpdsac teyqatcmng ecidksgicd 721 ghpdcsdgsd ehscslglkc qpnqfmcsns kcvdrtwrcd gendcgdnsd etscdpepsd 781 apcrydefqc rsghcipksf qcdymndctd gtdeigcsvp spmtlpapsi vvmeyevlel 841 tcvgtgvptp tivwrlnwgh vpekcesksy ggtgtlrcpn mrpqdsgays cefintrgtf 901 ypktnsivtv tpvrsdvcka gffnmlarks eecvqcfcfg vstncdsanl ftyaiqppil 961 shrvvsvels pfrqivinea spgqdlltlh hgvqfrasnv hyngretpfl alpaeymgnq 1021 lksyggnlry evryngngrp vsgpdviitg nsftlthrvr thpgqnnrvt ipflpggwtk 1081 pdgrkgtred immilanvdn ilirlgylds tarevdlini aldsagsadq glgsaslvek 1141 ctcppgyvgd scescasgyv rqargpwlgh cvpftpepcp agtygnprlg vpcqecpcph 1201 agannfasgc qqspdgdvic rcnegyagkr cehcaqgyqg nplapggvcr kipdsscnvd 1261 gtyniysngt cqckdsvige qcdtcapksf hlnsftytgc iecfcsgvgl dcdssswyrn 1321 qvtstfgrtr vnhgfalisd ymrntpvtvp vsmstqanal sfvgsaeqag ntlywslpaa 1381 flgnkltsyg gklsytlsys plpsgimsrn sapdvviksg edlrlihyrk sqvspsvant 1441 yaveikesaw qrgdelvpnr ehvlmalsni taiyikatyt tstkeaslrs vtldtatatn 1501 lgtaraveve qcrcpegylg lsceqcapgy trdpeagiyl glcrpcecng hskycnsetg 1561 ecescsdnte gfncdrcaag yvgdatrgts ydcqyddggy ptsrppapgn qtaeclvncq 1621 qegtagcrgy qceckrnvag drcdqcrpgt yglsaqnpdg ckecycsglt nqcrsaslyr 1681 qlipvdfist pplitdefgd imdrdnlvpd vprnvytykh tsytpkywsl rgsvlgnqll 1741 syggrleysl ivesvgrdhr gkdvvligng lkliwsrpdg hdneqeyhvr lhedeqwtve 1801 drgsarqatr adfmtvlsdl qhililatpk vptvstsisn vilessittr apgathasdi 1861 elcqcpsgyt gtscescapl hyrdasgrcs qcpcdasnte scglvsggnv ecqcrprwrg 1921 drcreietne ptpepdtstd dpvrtqiivs iarpeitilp vggsltlsct grmrwtnspv 1981 fvnwykqgsh lpegvevqgg nlqlfnlqis dsgiyicqav snetghsftd hvsitvsqed 2041 qrspahivdl pndvtfeeyv sneivceveg nppptvtwtr vdghadaqst rtdnnrlvfd 2101 sprksdegry rcqaenslsr eekyvvvyvr snppqpppqq drlyitpqev ngvagdsfql 2161 scqftsaasl rydwshdgrs lsassprnvv vrgnvlevrd anvrdsgtyt cvafdlrtrr 2221 nftesarvyi eqpnepgilg dkphiltleq niiivqgedl sitceasgtp ypsikwtkvq 2281 enlaenvris gnvltiyggr senrglysci aenshgsdqs stsidiepre rpsltidtat 2341 qkvsvgsqas lycaaqgipe ptvewvrtdg qplsprhkvq apgyvviddi vlddsgtyec 2401 rasniagqvs glatinvqep tlvriepdrq hhivtqgdel slscvgsgvp tpsvfwsfeg 2461 rdvdrmgvpe gavfaqpfrt ntadvkifrv skenegiyvc hgsndagedq qyirvevqpr 2521 rgdvgaggdd ngdvdtrqpp nrpqiqpnpl snerlttelg nnvtlicnvd nvntewervd 2581 gtplphnayt vrntlvivfv epqnlgqyrc ngigrdgrve ahvvrelvll plpritfypn 2641 ipltvelgqn ldvycqvenv rpedvhwttd nnrplpssvr iegnvlrfas itqaaageyr 2701 csatnqygsr sknarvvvkq psgfqpvphs qvqqrqvgds iqlrcrlttq ygdevrgniq 2761 fnwyredgsp lprgvrpdsq vlqlvklqpe degryicnsy dlgsgqqlpp vsidlqvltv 2821 paapqnpiyl ppvapprspe rilepqlsls vqssnlpagd gttvecfssd dsypdvvwer 2881 adgaplsenv qqvgnnlvis nvastdagny vckcktdegd lyttsyklev eeqphelkss 2941 kivyakvggn adlqcgaded rqpsyrwsrq ygqlqagrsl qneklsldrv qandagtyvc 3001 saqysdgetv dfpnilvvtg aipqfrqepr symsfptlsn ssfkfnfelt frpenadgll 3061 lfngqtrgsg dyialslkdr yaefrfdfgg kpllvraeep laldewhtvr vsrfkrdgyi 3121 qvddqhpvaf ptsqhqqipq leliedlyig gvpnweflpa eavgqqsgfv gcisrltlqg 3181 rtvelireak fkegitdcrp caqgpcqnkg vclesqteqa ytcvcqpgwt grdcaiegtq 3241 ctagvcgsgr centendmec lcplnragdr cqyneilneq slnfksnsfa aygtpkvtkv 3301 nitlsvrpas ledsvilyta estlpsgdyl alvlrgghae llintaarld pvvvrsaepl 3361 plnrwtriei rrrlgegilk vgdgperkak apgsdrilsl kthlfvggvd rstvkinrdv 3421 nitkgfdgci sklynsqksv nllgdirdaa nvqncgeane idddeyempv alpspkvaen 3481 erqlmapcas dpcenggscs eqedmaicsc pfgfsgkhcq nhlqlsfnas frgdgyveln 3541 rshfqpaleq tyshigivft tnkpngllfw wgqeageeyt gqdfiaaavv dgyveysmrl 3601 dgeeavirns dirvdngerh iviakrdent amleldqild tgdtrptink amklpgnvfv 3661 ggapdvaaft gfrykdnfng civvvegetv gqinlssaai ngvnanvcpa ndeplggtep 3721 pvv