LOCUS NP_001027033 4007 aa linear INV 26-DEC-2023 ACCESSION NP_001027033 VERSION NP_001027033.3 DBLINK BioProject: PRJNA164 BioSample: SAMN02803731 DBSOURCE REFSEQ: accession NM_001031862.4 KEYWORDS RefSeq. SOURCE Drosophila melanogaster (fruit fly) ORGANISM Drosophila melanogaster Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora. REFERENCE 1 (residues 1 to 4007) AUTHORS Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K., Strelets,V., Russo,S.M. and Gelbart,W.M. CONSRTM FlyBase Consortium TITLE Gene Model Annotations for Drosophila melanogaster: Impact of High-Throughput Data JOURNAL G3 (Bethesda) 5 (8), 1721-1736 (2015) PUBMED 26109357 REMARK Publication Status: Online-Only REFERENCE 2 (residues 1 to 4007) AUTHORS Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B., Russo,S.M. and Gelbart,W.M. CONSRTM FlyBase Consortium TITLE Gene Model Annotations for Drosophila melanogaster: The Rule-Benders JOURNAL G3 (Bethesda) 5 (8), 1737-1749 (2015) PUBMED 26109356 REMARK Publication Status: Online-Only REFERENCE 3 (residues 1 to 4007) AUTHORS Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I., Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R., Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G., Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N., Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A., Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E. 
TITLE The Release 6 reference sequence of the Drosophila melanogaster genome JOURNAL Genome Res 25 (3), 445-458 (2015) PUBMED 25589440 REFERENCE 4 (residues 1 to 4007) AUTHORS Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M., Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F., Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E. TITLE Sequence finishing and mapping of Drosophila melanogaster heterochromatin JOURNAL Science 316 (5831), 1625-1628 (2007) PUBMED 17569867 REFERENCE 5 (residues 1 to 4007) AUTHORS Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H. TITLE The Release 5.1 annotation of Drosophila melanogaster heterochromatin JOURNAL Science 316 (5831), 1586-1591 (2007) PUBMED 17569856 REMARK Erratum:[Science. 2007 Sep 7;317(5843):1325] REFERENCE 6 (residues 1 to 4007) AUTHORS Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D., Ashburner,M. and Anxolabehere,D. TITLE Combined evidence annotation of transposable elements in genome sequences JOURNAL PLoS Comput Biol 1 (2), 166-175 (2005) PUBMED 16110336 REFERENCE 7 (residues 1 to 4007) AUTHORS Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A., Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A., Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W., Celniker,S.E., Rubin,G.M. and Karpen,G.H. TITLE Heterochromatic sequences in a Drosophila whole-genome shotgun assembly JOURNAL Genome Biol 3 (12), RESEARCH0085 (2002) PUBMED 12537574 REFERENCE 8 (residues 1 to 4007) AUTHORS Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J., Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E., Rubin,G.M., Ashburner,M. and Celniker,S.E. 
TITLE The transposable elements of the Drosophila melanogaster euchromatin: a genomics perspective JOURNAL Genome Biol 3 (12), RESEARCH0084 (2002) PUBMED 12537573 REFERENCE 9 (residues 1 to 4007) AUTHORS Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S., Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E., Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L., Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D., Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J., Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M., Rubin,G.M. and Lewis,S.E. TITLE Annotation of the Drosophila melanogaster euchromatic genome: a systematic review JOURNAL Genome Biol 3 (12), RESEARCH0083 (2002) PUBMED 12537572 REFERENCE 10 (residues 1 to 4007) AUTHORS Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W., Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E., Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M., Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S., Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M., Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W., Gibbs,R.A. and Rubin,G.M. 
TITLE Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster euchromatic genome sequence JOURNAL Genome Biol 3 (12), RESEARCH0079 (2002) PUBMED 12537568 REFERENCE 11 (residues 1 to 4007) AUTHORS Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D., Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F., George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N., Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X., Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D., Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L., Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D., Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M., Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S., Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P., Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A., Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B., Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I., Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S., Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C., Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S., Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z., Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J., Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J., Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z., Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C., Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A., Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C., McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C., Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L., Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K., Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S., Pollard,J., Puri,V., Reese,M.G., Reinert,K., 
Remington,K., Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I., Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C., Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R., Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A., Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT, Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F., Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H., Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O., Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C. TITLE The genome sequence of Drosophila melanogaster JOURNAL Science 287 (5461), 2185-2195 (2000) PUBMED 10731132 REFERENCE 12 (residues 1 to 4007) AUTHORS Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R., Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R., Smith,E., Yu,C. and Rubin,G. CONSRTM Berkeley Drosophila Genome Project TITLE Drosophila melanogaster release 4 sequence JOURNAL Unpublished REFERENCE 13 (residues 1 to 4007) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (20-DEC-2023) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 14 (residues 1 to 4007) CONSRTM FlyBase TITLE Direct Submission JOURNAL Submitted (13-DEC-2023) FlyBase, Harvard University, Biological Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA REFERENCE 15 (residues 1 to 4007) CONSRTM FlyBase TITLE Direct Submission JOURNAL Submitted (19-OCT-2022) FlyBase, Harvard University, Biological Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA REFERENCE 16 (residues 1 to 4007) CONSRTM FlyBase TITLE Direct Submission JOURNAL Submitted (20-APR-2020) FlyBase, Harvard University, Biological Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA REFERENCE 17 (residues 1 to 4007) CONSRTM FlyBase TITLE Direct Submission JOURNAL Submitted (22-APR-2019) FlyBase, Harvard University, Biological Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA REFERENCE 18 
(residues 1 to 4007) CONSRTM FlyBase TITLE Direct Submission JOURNAL Submitted (24-MAY-2018) FlyBase, Harvard University, Biological Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA REFERENCE 19 (residues 1 to 4007) CONSRTM FlyBase TITLE Direct Submission JOURNAL Submitted (07-DEC-2016) FlyBase, Harvard University, Biological Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA REFERENCE 20 (residues 1 to 4007) AUTHORS Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R., Park,S., Svirskas,R. and Karpen,G. TITLE Direct Submission JOURNAL Submitted (10-AUG-2006) Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA REMARK Direct Submission REFERENCE 21 (residues 1 to 4007) AUTHORS Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S., Svirskas,R. and Rubin,G. TITLE Direct Submission JOURNAL Submitted (10-AUG-2006) Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA REMARK Direct Submission REFERENCE 22 (residues 1 to 4007) AUTHORS Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H. CONSRTM Drosophila Heterochromatin Genome Project TITLE Direct Submission JOURNAL Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project, Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Mailstop 64-121, Berkeley, CA 94720, USA REFERENCE 23 (residues 1 to 4007) AUTHORS Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J. TITLE Direct Submission JOURNAL Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive, Rockville, MD 20850, USA COMMENT REVIEWED REFSEQ: This record has been curated by FlyBase. The reference sequence is identical to AAN09077. On Jul 15, 2014 this sequence version replaced NP_001027033.2. 
##Genome-Annotation-Data-START## Annotation Provider :: FlyBase Annotation Status :: Full annotation Annotation Version :: Release 6.54 URL :: http://flybase.org ##Genome-Annotation-Data-END## Method: conceptual translation. FEATURES Location/Qualifiers source 1..4007 /organism="Drosophila melanogaster" /db_xref="taxon:7227" /chromosome="X" /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2] bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]" Protein 1..4007 /product="terribly reduced optic lobes, isoform AW" /name="CG33950 gene product from transcript CG33950-RAW" /note="CG33950-PAW; trol-PAW; lethal (1) G0271; Trol/perlecan; mRNA-like ncRNA in embryogenesis 7; lethal (1) G0211; lethal (1) G0181; lethal (1) G0412; terribly reduced optic lobes; dPerlecan; lethal (1) G0023; lethal (1) G0374; lethal (1) G0019; lethal (1) G0021" /calculated_mol_wt=444080 Region 447..480 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(452,459,470..471) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(463,466,470,476..477) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 473..477 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 532..564 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(537,545,556..557) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(549,552,556,562..563) 
/site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 559..563 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 571..606 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(576,585,596..597) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(589,592,596,602..603) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 599..603 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 616..650 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(621,629,640..641) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(633,636,640,646..647) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 643..647 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 704..739 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(709,718,729..730) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(722,725,729,735..736) /site_type="other" /note="calcium-binding site [ion binding]" 
/db_xref="CDD:238060" Site 732..736 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 808..842 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(813,821,832..833) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(825,828,832,838..839) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 835..839 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 848..882 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(853,861,872..873) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(865,868,872,878..879) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 875..879 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 900..961 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 911..915 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 925..928 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 944..948 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 995..1029 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds 
LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(1000,1008,1019..1020) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(1012,1015,1019,1025..1026) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 1022..1026 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 1035..1069 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(1040,1048,1059..1060) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(1052,1055,1059,1065..1066) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 1062..1066 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 1078..1112 /region_name="LDLa" /note="Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about...; cd00112" /db_xref="CDD:238060" Site order(1083,1091,1102..1103) /site_type="active" /note="putative binding surface [active]" /db_xref="CDD:238060" Site order(1095,1098,1102,1108..1109) /site_type="other" /note="calcium-binding site [ion binding]" /db_xref="CDD:238060" Site 1105..1109 /site_type="other" /note="D-X-S-D-E motif" /db_xref="CDD:238060" Region 1130..1205 /region_name="Ig_Perlecan_like" /note="Immunoglobulin (Ig)-like domain of the human basement membrane heparan sulfate proteoglycan perlecan and similar proteins; cd05743" /db_xref="CDD:143220" Region 1133..1139 
/region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:143220" Region 1146..1151 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:143220" Region 1169..1173 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:143220" Region 1182..1188 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:143220" Region 1197..1203 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:143220" Region 1304..1434 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region 1491..1545 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in...; cd00055" /db_xref="CDD:238012" Site order(1491,1493,1505,1515,1517,1526) /site_type="active" /note="EGF-like motif [active]" /db_xref="CDD:238012" Region 1669..1805 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region <1806..1832 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in...; cd00055" /db_xref="CDD:238012" Region 1840..1889 /region_name="EGF_Lam" /note="Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in...; cd00055" /db_xref="CDD:238012" Site order(1841,1843,1850,1857,1860,1869) /site_type="active" /note="EGF-like motif [active]" /db_xref="CDD:238012" Region <1925..1955 
/region_name="Laminin_EGF" /note="Laminin EGF domain; pfam00053" /db_xref="CDD:395007" Region 2021..2155 /region_name="Laminin_B" /note="Laminin B (Domain IV); pfam00052" /db_xref="CDD:459652" Region 2244..2312 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 2337..2407 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 2436..2504 /region_name="Ig_3" /note="Immunoglobulin domain; pfam13927" /db_xref="CDD:464046" Region 2533..2617 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 2550..2554 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:143259" Region 2563..2567 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:143259" Region 2582..2586 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:143259" Region 2596..2601 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:143259" Region 2609..2612 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:143259" Region 2635..2708 /region_name="I-set" /note="Immunoglobulin I-set domain; pfam07679" /db_xref="CDD:400151" Region 2640..2644 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2653..2657 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2674..2678 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2688..2693 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2701..2704 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409353" Region 2711..2799 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 2731..2735 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2744..2748 /region_name="Ig 
strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2765..2769 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2779..2784 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 2792..2795 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409353" Region 2844..2901 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 2847..2851 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2859..2862 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2879..2882 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 2927..3003 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 2936..2940 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 2949..2953 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 2983..2988 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 3021..>3082 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 3026..3030 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 3037..3049 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 3065..3069 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 3131..3193 /region_name="Ig" /note="Immunoglobulin domain; cl11960" /db_xref="CDD:472250" Region 3143..3147 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 3155..3160 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 3176..3180 /region_name="Ig strand 
E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 3224..3290 /region_name="IG_like" /note="Immunoglobulin like; smart00410" /db_xref="CDD:214653" Region 3232..3236 /region_name="Ig strand B" /note="Ig strand B [structural motif]" /db_xref="CDD:409353" Region 3245..3249 /region_name="Ig strand C" /note="Ig strand C [structural motif]" /db_xref="CDD:409353" Region 3264..3268 /region_name="Ig strand E" /note="Ig strand E [structural motif]" /db_xref="CDD:409353" Region 3278..3283 /region_name="Ig strand F" /note="Ig strand F [structural motif]" /db_xref="CDD:409353" Region 3292..3295 /region_name="Ig strand G" /note="Ig strand G [structural motif]" /db_xref="CDD:409353" Region 3312..3457 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" Region 3480..3512 /region_name="EGF" /note="EGF-like domain; pfam00008" /db_xref="CDD:394967" Region 3560..3713 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" Region 3772..3804 /region_name="EGF_CA" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. 
Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; cd00054" /db_xref="CDD:238011" Region 3813..3966 /region_name="LamG" /note="Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of...; cd00110" /db_xref="CDD:238058" CDS 1..4007 /gene="trol" /locus_tag="Dmel_CG33950" /gene_synonym="anon-WO0153538.72; BcDNA:GM02481; CG12497; CG33675; CG33950; CG7981; CT23996; Dmel\CG33950; EG:BACR25B3.1; EG:BACR25B3.10; EG:BACR25B3.11; EG:BACR25B3.2; GC7891; l(1)3Ac; l(1)9-96; l(1)G0019; l(1)G0021; l(1)G0023; l(1)G0181; l(1)G0211; l(1)G0271; l(1)G0374; l(1)G0412; l(1)trol; l(1)VA51; l(1)zw1; l(1)zwl; MRE7; pcan; Pcan; Pcn; Perl; Perlecan; Trol; TROL; Trol-A; Trol-B; troll; Troll; zw-1; ZW-1; zw1" /coded_by="NM_001031862.4:185..12208" /db_xref="FLYBASE:FBpp0309658" /db_xref="GeneID:45320" /db_xref="FLYBASE:FBgn0284408" ORIGIN 1 mmgspgsqas aiatsvgirs grrgqaggsl llrllavtfv laachapllt nakqisnlgd 61 dqdfmladde slqgindsew qlmgddiddg llddvdetlk pmetkseeed lptgnwfsqs 121 vhrvrrsinr lfgsddnqer grrqqrersq rnrdainrqk elrrrqkedh nrwkqmrmer 181 qlekqrlvkr tnhvvfnrat dprkrasdly deneasgyhe edttlyrtyf vvnepydney 241 rdresvqfqn lqklldddlr nffhsnyegn ddeeqeirst lerveptndn fkirvqlrie 301 lptsvndfgs klqqqlnvyn rienlsaatd gvfsftessv teesvrhldv gfvephitlf 361 heqntgyddf drkpttetqd ieeeaidvtl pqeevegsgs ddsscrgdat ftcprsgkti 421 cdemrcdrei qcpdgedeey cnypnvcted qfkcddkcle lkkrcdgsid cldqtdeagc 481 inapepepep epepepepes epeaepepep epepesepeq epepqvpean ecqanefrcn 541 ngdcidarkr cnnvsdcseg edeneecpaa csgmeyqcrd gtrcisvsqq cdghsdcsdg 601 ddeehcdgsg ydseecrfde fhcgtgecip mrqvcdniyd cndysdevnc vegeeedrvg 661 ipighqpwrp askhddwlhe mdtseyqvyq psnvyekans qnpcasnqfr cttsnvcipl 721 hlrcdgfyhc ndmsdeksce qyqrhtttrr pltlatptsr ittqgpglle 
rrntttatea 781 srwpwatktt tiatttsnpi ttvgvascya nqfrcnngdc vsgsapcngy secsdhsdel 841 ncggtqeclp nqfrcnsgqc vsssvrcngr tdcqdssdeq ncaadsndrr pnqlnlktyp 901 dsqiikesre vifrcrdegp arakvkwsrp ggrplppgft drngrleipn irvedagtyv 961 ceavgyasyi pgqqvtvnln veryndvgsr pesacteyqa tcmngecidk ssicdgnpdc 1021 sdasdeqscs lglkcqpnqf mcsnskcvdr twrcdgendc gdnsdetscd pepsgapcry 1081 nefqcrsghc ipksfqcdnv pdctdgtdev gcmaplpirp ppqsvslley evleltcvat 1141 gtptptivwr lnwghvpdkc esksyggtgt lrcpdmrpqd sgaysceiin trgthfvnpd 1201 tivtvrpvrt dvceagffnm larkaeecvq cfcfgvakac dsanlftyai hppilshrvv 1261 svelsplrqi vineaapgqd lltllhgvqf ratnvhfsgr etpylalpad ymgnqlksyg 1321 gnlryevnyr gsgrpvngpd viitgnrftl tyrvrtqpgq nnrvsipfvp ggwqkpdgrk 1381 asreeimmil anvdnilirl gyldstarev dlinialdsa gtadkglgsa slvekcqcpp 1441 gyvgdscesc asgyvrqpgg pwlghcvpfi pdscpsgtyg dprrgvpcke cpcpltgsnn 1501 fasgcqqspd gdvvcrcneg ytgrrceqca agyqgnplaa ggicrripdt scnvdgtysv 1561 hsngtcqckd svigeqcdtc ksksfhlnsf tytgciecfc sgvgldcdss twyrdqvtst 1621 fgrsrvdhgf vlvtnymqpt pdtvpvsmaa epnalsfigs adqsgntlyw slpaaflgnk 1681 lssyggklty tlsysplpng imsrnsapdv viksgedlrl ihyrksqvvp svantysvei 1741 kesawqrgde vvanrehvlm alsditaiyi katyttstke aslrqvtldv atptnlgtpr 1801 aveveqcrcp egylglsceq capgyardpe ggiylglcrp cecnghskyc nsdtgdceec 1861 sdntegpsce rcaagyvgda trgtiydcqp degypipspp apgnqtlect aycqiegiyd 1921 crgneclckr nvigdqcdqc rpgtyglsaq nqdgckecyc sglasqcrsa alyrqlipvd 1981 filnaplitd esgavqdten lipdisrnmy tythtsylpk ywslrgsvlg nqlfsyggrl 2041 syslivesyg nyerghdivl ignglkliws rpdgnenqee ynvrlhedeq wtrqdresar 2101 pasrsdfmtv lsdlqhilir atprvptqst signvilesa vttrtpgath asdielcqcp 2161 sgyvgtsces caplhyrdas gscslcpcdv sntescdlvs ggyvecrcka rwkgdrcrei 2221 dtndptdigt edpvltqiiv siqkpeitiv pvggsmtlsc sgrmrwsnsp vivnwykens 2281 rlpenvevqg gnlylydlqv sdsgvyicqa vnnetasvfk dtvsititkk dqlspaeivn 2341 lpshvtfeey vnneiicevl gnpaprvtwa rvdghadaqs trtydnrlif dsprksdegr 2401 yrcqaendqn rdekyvivyv qsnppqpppq qdrlyitpee inglagesfq lncqftsvas 2461 
lrydwshngr slsssparnv eirgntlevr dasesdsgvy tcvaydvrtr rnftesarvn 2521 idrreeqpfg nkpiiesleq niliiqgedy sitceasgsp ypsikwakvh dfmpenvhis 2581 gnvltiygar fenrgvyscv aendhgsdls stsidiepre rpsvkivsap lqtfsvgapa 2641 slyctvegip dptvewvrvd gqplsprhki qspgymvidd iqledsgdye craknivgea 2701 tgvatitvqe ptlvqiipdn rdlrltegde lsltcvgsgv pnpevewvne malkrdlysp 2761 psntailkiy rvtkadagiy tchgkneags deahvrvevq errgdiggvd ddsdrdpiny 2821 nppqqqnpgi hqpgsnqlla tdigdnvtlt cdmfqplntr wervdgaplp rnaytiknrl 2881 eivrveqqnl gqyrcngigr dgnvktyfvk elvlmplpri rfypnipltv eagqnldvhc 2941 qvenvrpedv hwstdnnrpl pssvrivgsv lrfvsitqaa ageyrcsafn qygnrsqiar 3001 vavkkpadfh qvpqsqlqrh regeniqlqc tvtdqygvra qdnvefnwfr ddrrplpnna 3061 rtdsqilvlt nlrpedagry icnsydvdrg qqlpevsidl qvltatpppn spiylppqlp 3121 aksrdyslkl ddqssnlrag estdvecyss ddtytdvvwe rsdgaplsnn vrqvgnrlvi 3181 snvspsdagn yvckcktdeg dlyttsykle vedqphelks skivyakvga nadlqcgade 3241 srqptyrwsr qygqlqagrs lmneklslds vqandagtyi ctaqyadget adfpnilvvt 3301 gaipqfrqep rsymsfptlp nssfkfnfel tfrpengdgl llfngqtrgs gdyialslkd 3361 ryaefrfdfg gkpmlvraee plalnewhtv rvsrfkrdgy iqvdeqhpva fptlqqipql 3421 dliedlyigg vpnwellpad avsqqvgfvg cisrltlqgr tvelireaky kegitdcrpc 3481 aqgpcqnkgv clesqteqay tcicqpgwtg rdcaiegtqc tpgvcgagrc entendmecl 3541 cplnrsgdrc qyneilnehs lnfkgnsfaa ygtpkvtkvn itlsvrpasl edsvilytae 3601 stlpsgdyla lvlrgghael lintaarldp vvvrsaeplp lnrwtrieir rrlgegilrv 3661 gdgperkaka pgsdrilslk thlyvggydr stvkvnrdvn itkgfdgcis rlynfqkpvn 3721 lladikdaan iqscgetnmi ggdedsdnep pvppptpdvh enelqpyama pcasdpceng 3781 gscseqedva vcscpfgfsg khcqehlqlg fnasfrgdgy velnrshfqp aleqsytsmg 3841 ivfttnkpng llfwwgqeag eeytgqdfia aavvdgyvey smrldgeeav irnsdirvdn 3901 gerhiviakr dentailevd rmlhsgetrp tskksmklpg nvfvggapdl evftgfrykh 3961 nlngcivvve getvgqinls saavngvnan vcpanddplg gteppvv