Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

terribly reduced optic lobes, isoform BA [Drosophila melanogaster].


LOCUS       NP_001284825            4203 aa            linear   INV 26-DEC-2023
ACCESSION   NP_001284825
VERSION     NP_001284825.1
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
DBSOURCE    REFSEQ: accession NM_001297896.2
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (residues 1 to 4203)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            High-Throughput Data
  JOURNAL   G3 (Bethesda) 5 (8), 1721-1736 (2015)
   PUBMED   26109357
  REMARK    Publication Status: Online-Only
REFERENCE   2  (residues 1 to 4203)
  AUTHORS   Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
            Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
            Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: The
            Rule-Benders
  JOURNAL   G3 (Bethesda) 5 (8), 1737-1749 (2015)
   PUBMED   26109356
  REMARK    Publication Status: Online-Only
REFERENCE   3  (residues 1 to 4203)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
            Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
            Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
            Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
            Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
            Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
  TITLE     The Release 6 reference sequence of the Drosophila melanogaster
            genome
  JOURNAL   Genome Res 25 (3), 445-458 (2015)
   PUBMED   25589440
REFERENCE   4  (residues 1 to 4203)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
            Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
            Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
  TITLE     Sequence finishing and mapping of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1625-1628 (2007)
   PUBMED   17569867
REFERENCE   5  (residues 1 to 4203)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  TITLE     The Release 5.1 annotation of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1586-1591 (2007)
   PUBMED   17569856
  REMARK    Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE   6  (residues 1 to 4203)
  AUTHORS   Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
            Ashburner,M. and Anxolabehere,D.
  TITLE     Combined evidence annotation of transposable elements in genome
            sequences
  JOURNAL   PLoS Comput Biol 1 (2), 166-175 (2005)
   PUBMED   16110336
REFERENCE   7  (residues 1 to 4203)
  AUTHORS   Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
            Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
            Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
            Celniker,S.E., Rubin,G.M. and Karpen,G.H.
  TITLE     Heterochromatic sequences in a Drosophila whole-genome shotgun
            assembly
  JOURNAL   Genome Biol 3 (12), RESEARCH0085 (2002)
   PUBMED   12537574
REFERENCE   8  (residues 1 to 4203)
  AUTHORS   Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
            Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
            Rubin,G.M., Ashburner,M. and Celniker,S.E.
  TITLE     The transposable elements of the Drosophila melanogaster
            euchromatin: a genomics perspective
  JOURNAL   Genome Biol 3 (12), RESEARCH0084 (2002)
   PUBMED   12537573
REFERENCE   9  (residues 1 to 4203)
  AUTHORS   Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
            Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
            Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
            Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
            Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
            Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
            Rubin,G.M. and Lewis,S.E.
  TITLE     Annotation of the Drosophila melanogaster euchromatic genome: a
            systematic review
  JOURNAL   Genome Biol 3 (12), RESEARCH0083 (2002)
   PUBMED   12537572
REFERENCE   10 (residues 1 to 4203)
  AUTHORS   Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
            Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
            Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
            Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
            Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
            Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
            Gibbs,R.A. and Rubin,G.M.
  TITLE     Finishing a whole-genome shotgun: release 3 of the Drosophila
            melanogaster euchromatic genome sequence
  JOURNAL   Genome Biol 3 (12), RESEARCH0079 (2002)
   PUBMED   12537568
REFERENCE   11 (residues 1 to 4203)
  AUTHORS   Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
            Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
            George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
            Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
            Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
            Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
            Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
            Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
            Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
            Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
            Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
            Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
            Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
            Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
            Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
            Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
            Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
            Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
            Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
            Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
            Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
            Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
            Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
            McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
            Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
            Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
            Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
            Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
            Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
            Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
            Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
            Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
            Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
            Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
            Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
            Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
            Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
  TITLE     The genome sequence of Drosophila melanogaster
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
   PUBMED   10731132
REFERENCE   12 (residues 1 to 4203)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
            Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
            Smith,E., Yu,C. and Rubin,G.
  CONSRTM   Berkeley Drosophila Genome Project
  TITLE     Drosophila melanogaster release 4 sequence
  JOURNAL   Unpublished
REFERENCE   13 (residues 1 to 4203)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   14 (residues 1 to 4203)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   15 (residues 1 to 4203)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   16 (residues 1 to 4203)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (20-APR-2020) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   17 (residues 1 to 4203)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2019) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   18 (residues 1 to 4203)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   19 (residues 1 to 4203)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   20 (residues 1 to 4203)
  AUTHORS   Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
            Park,S., Svirskas,R. and Karpen,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
            Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   21 (residues 1 to 4203)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
            Svirskas,R. and Rubin,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   22 (residues 1 to 4203)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  CONSRTM   Drosophila Heterochromatin Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
            Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
            Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE   23 (residues 1 to 4203)
  AUTHORS   Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by FlyBase. The
            reference sequence is identical to AHN59296.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider :: FlyBase
            Annotation Status   :: Full annotation
            Annotation Version  :: Release 6.54
            URL                 :: http://flybase.org
            ##Genome-Annotation-Data-END##
            Method: conceptual translation.
FEATURES             Location/Qualifiers
     source          1..4203
                     /organism="Drosophila melanogaster"
                     /db_xref="taxon:7227"
                     /chromosome="X"
                     /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
                     bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
     Protein         1..4203
                     /product="terribly reduced optic lobes, isoform BA"
                     /name="CG33950 gene product from transcript CG33950-RBA"
                     /note="CG33950-PBA; trol-PBA; lethal (1) G0271;
                     Trol/perlecan; mRNA-like ncRNA in embryogenesis 7; lethal
                     (1) G0211; lethal (1) G0181; lethal (1) G0412; terribly
                     reduced optic lobes; dPerlecan; lethal (1) G0023; lethal
                     (1) G0374; lethal (1) G0019; lethal (1) G0021"
                     /calculated_mol_wt=465096
     Region          407..440
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(412,419,430..431)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(423,426,430,436..437)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            433..437
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          492..524
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(497,505,516..517)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(509,512,516,522..523)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            519..523
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          531..566
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(536,545,556..557)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(549,552,556,562..563)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            559..563
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          576..610
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(581,589,600..601)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(593,596,600,606..607)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            603..607
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          664..699
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(669,678,689..690)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(682,685,689,695..696)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            692..696
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          773..807
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(779,787,798..799)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(791,794,798,804..805)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            801..805
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          812..847
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(817,826,837..838)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(830,833,837,843..844)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            840..844
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          857..890
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(862,869,880..881)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(873,876,880,886..887)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            883..887
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          896..926
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(897,905,916..917)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(909,912,916,922..923)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            919..923
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          927..961
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(932,940,951..952)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(944,947,951,957..958)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            954..958
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          967..1001
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(972,980,991..992)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(984,987,991,997..998)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            994..998
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1035..1070
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1040,1048,1059..1060)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1052,1055,1059,1066..1067)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1063..1067
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1096..1157
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          1107..1111
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          1121..1124
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          1140..1144
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          1191..1225
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1196,1204,1215..1216)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1208,1211,1215,1221..1222)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1218..1222
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1231..1265
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1236,1244,1255..1256)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1248,1251,1255,1261..1262)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1258..1262
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1274..1308
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1279,1287,1298..1299)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1291,1294,1298,1304..1305)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1301..1305
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1326..1401
                     /region_name="Ig_Perlecan_like"
                     /note="Immunoglobulin (Ig)-like domain of the human
                     basement membrane heparan sulfate proteoglycan perlecan
                     and similar proteins; cd05743"
                     /db_xref="CDD:143220"
     Region          1329..1335
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:143220"
     Region          1342..1347
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:143220"
     Region          1365..1369
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:143220"
     Region          1378..1384
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:143220"
     Region          1393..1399
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:143220"
     Region          1500..1630
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          1687..1741
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Site            order(1687,1689,1701,1711,1713,1722)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          1865..2001
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          <2002..2028
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Region          2036..2085
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Site            order(2037,2039,2046,2053,2056,2065)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          <2121..2151
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Region          2217..2351
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          2440..2508
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          2533..2603
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          2632..2700
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          2729..2813
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          2746..2750
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:143259"
     Region          2759..2763
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:143259"
     Region          2778..2782
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:143259"
     Region          2792..2797
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:143259"
     Region          2805..2808
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:143259"
     Region          2831..2904
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          2836..2840
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2849..2853
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2870..2874
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2884..2889
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2897..2900
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          2907..2995
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          2927..2931
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2940..2944
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2961..2965
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2975..2980
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2988..2991
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          3040..3097
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          3043..3047
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          3055..3058
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          3075..3078
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          3123..3199
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3132..3136
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          3145..3149
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          3179..3184
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          3217..>3278
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3222..3226
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          3233..3245
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          3261..3265
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          3327..3389
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          3339..3343
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          3351..3356
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          3372..3376
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          3420..3486
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3428..3432
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          3441..3445
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          3460..3464
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          3474..3479
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          3488..3491
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          3508..3653
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3676..3708
                     /region_name="EGF"
                     /note="EGF-like domain; pfam00008"
                     /db_xref="CDD:394967"
     Region          3756..3909
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3968..4000
                     /region_name="EGF_CA"
                     /note="Calcium-binding EGF-like domain, present in a large
                     number of membrane-bound and extracellular (mostly animal)
                     proteins. Many of these proteins require calcium for their
                     biological function and calcium-binding sites have been
                     found to be located at the...; cd00054"
                     /db_xref="CDD:238011"
     Region          4009..4162
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     CDS             1..4203
                     /gene="trol"
                     /locus_tag="Dmel_CG33950"
                     /gene_synonym="anon-WO0153538.72; BcDNA:GM02481; CG12497;
                     CG33675; CG33950; CG7981; CT23996; Dmel\CG33950;
                     EG:BACR25B3.1; EG:BACR25B3.10; EG:BACR25B3.11;
                     EG:BACR25B3.2; GC7891; l(1)3Ac; l(1)9-96; l(1)G0019;
                     l(1)G0021; l(1)G0023; l(1)G0181; l(1)G0211; l(1)G0271;
                     l(1)G0374; l(1)G0412; l(1)trol; l(1)VA51; l(1)zw1;
                     l(1)zwl; MRE7; pcan; Pcan; Pcn; Perl; Perlecan; Trol;
                     TROL; Trol-A; Trol-B; troll; Troll; zw-1; ZW-1; zw1"
                     /coded_by="NM_001297896.2:185..12796"
                     /db_xref="FLYBASE:FBpp0309662"
                     /db_xref="GeneID:45320"
                     /db_xref="FLYBASE:FBgn0284408"
ORIGIN      
        1 mmgspgsqas aiatsvgirs grrgqaggsl llrllavtfv laachapllt nakqisnlgd
       61 dqdfmladde slqgindsew qlmgddiddg llddvdetlk pmetkseeed lptgnwfsqs
      121 vhrvrrsinr lfgsddnqer grrqqrersq rnrdainrqk elrrrqkedh nrwkqmrmer
      181 qlekqrlvkr tnhvvfnrat dprkrasdly deneasgyhe edttlyrtyf vvnepydney
      241 rdresvqfqn lqklldddlr nffhsnyegn ddeeqeirst lerveptndn fkirvqlrie
      301 lptsvndfgs klqqqlnvyn rienlsaatd gvfsftessd ieeeaidvtl pqeevegsgs
      361 ddsscrgdat ftcprsgkti cdemrcdrei qcpdgedeey cnypnvcted qfkcddkcle
      421 lkkrcdgsid cldqtdeagc inapepepep epepepepes epeaepepep epepesepeq
      481 epepqvpean ecqanefrcn ngdcidarkr cnnvsdcseg edeneecpaa csgmeyqcrd
      541 gtrcisvsqq cdghsdcsdg ddeehcdgsg ydseecrfde fhcgtgecip mrqvcdniyd
      601 cndysdevnc vegeeedrvg ipighqpwrp askhddwlhe mdtseyqvyq psnvyekans
      661 qnpcasnqfr cttsnvcipl hlrcdgfyhc ndmsdeksce qyqrhtttrr pltlatptsr
      721 ittqgpglle rrntttatea srwpwatktt tiatttsnpi ttvgvanspp qtcleniefa
      781 chnrdcisie svcdgipdcg rnededdalc kcsgdkykcq rgggcipksq vcdgkpqchd
      841 rsdesacptt rlkpsdcspe qfycdescyn rsvrcnghvd csdgsdevgc slpcpqhqcp
      901 sgrcyteser cdrhrhcedg sdeanccyan qfrcnngdcv sgsapcngys ecsdhsdeln
      961 cggtqeclpn qfrcnsgqcv sssvrcngrt dcqdssdeqn cghrhtevsq glettgvftt
     1021 sttsttamtp lriicpptsf kcengpcisl glkcngrvdc pydgsdeadc gqisndidpa
     1081 dsndrrpnql nlktypdsqi ikesrevifr crdegparak vkwsrpggrp lppgftdrng
     1141 rleipnirve dagtyvceav gyasyipgqq vtvnlnvery ndvgsrpesa cteyqatcmn
     1201 gecidkssic dgnpdcsdas deqscslglk cqpnqfmcsn skcvdrtwrc dgendcgdns
     1261 detscdpeps gapcrynefq crsghcipks fqcdnvpdct dgtdevgcma plpirpppqs
     1321 vslleyevle ltcvatgtpt ptivwrlnwg hvpdkcesks yggtgtlrcp dmrpqdsgay
     1381 sceiintrgt hfvnpdtivt vrpvrtdvce agffnmlark aeecvqcfcf gvakacdsan
     1441 lftyaihppi lshrvvsvel splrqivine aapgqdlltl lhgvqfratn vhfsgretpy
     1501 lalpadymgn qlksyggnlr yevnyrgsgr pvngpdviit gnrftltyrv rtqpgqnnrv
     1561 sipfvpggwq kpdgrkasre eimmilanvd nilirlgyld starevdlin ialdsagtad
     1621 kglgsaslve kcqcppgyvg dscescasgy vrqpggpwlg hcvpfipdsc psgtygdprr
     1681 gvpckecpcp ltgsnnfasg cqqspdgdvv crcnegytgr rceqcaagyq gnplaaggic
     1741 rripdtscnv dgtysvhsng tcqckdsvig eqcdtcksks fhlnsftytg ciecfcsgvg
     1801 ldcdsstwyr dqvtstfgrs rvdhgfvlvt nymqptpdtv pvsmaaepna lsfigsadqs
     1861 gntlywslpa aflgnklssy ggkltytlsy splpngimsr nsapdvviks gedlrlihyr
     1921 ksqvvpsvan tysveikesa wqrgdevvan rehvlmalsd itaiyikaty ttstkeaslr
     1981 qvtldvatpt nlgtpravev eqcrcpegyl glsceqcapg yardpeggiy lglcrpcecn
     2041 ghskycnsdt gdceecsdnt egpscercaa gyvgdatrgt iydcqpdegy pipsppapgn
     2101 qtlectaycq iegiydcrgn eclckrnvig dqcdqcrpgt yglsaqnqdg ckecycsgla
     2161 sqcrsaalyr qlipvdfiln aplitdesga vqdtenlipd isrnmytyth tsylpkywsl
     2221 rgsvlgnqlf syggrlsysl ivesygnyer ghdivligng lkliwsrpdg nenqeeynvr
     2281 lhedeqwtrq dresarpasr sdfmtvlsdl qhiliratpr vptqstsign vilesavttr
     2341 tpgathasdi elcqcpsgyv gtscescapl hyrdasgscs lcpcdvsnte scdlvsggyv
     2401 ecrckarwkg drcreidtnd ptdigtedpv ltqiivsiqk peitivpvgg smtlscsgrm
     2461 rwsnspvivn wykensrlpe nvevqggnly lydlqvsdsg vyicqavnne tasvfkdtvs
     2521 ititkkdqls paeivnlpsh vtfeeyvnne iicevlgnpa prvtwarvdg hadaqstrty
     2581 dnrlifdspr ksdegryrcq aendqnrdek yvivyvqsnp pqpppqqdrl yitpeeingl
     2641 agesfqlncq ftsvaslryd wshngrslss sparnveirg ntlevrdase sdsgvytcva
     2701 ydvrtrrnft esarvnidrr eeqpfgnkpi iesleqnili iqgedysitc easgspypsi
     2761 kwakvhdfmp envhisgnvl tiygarfenr gvyscvaend hgsdlsstsi dieprerpsv
     2821 kivsaplqtf svgapaslyc tvegipdptv ewvrvdgqpl sprhkiqspg ymviddiqle
     2881 dsgdyecrak nivgeatgva titvqeptlv qiipdnrdlr ltegdelslt cvgsgvpnpe
     2941 vewvnemalk rdlysppsnt ailkiyrvtk adagiytchg kneagsdeah vrvevqerrg
     3001 diggvdddsd rdpinynppq qqnpgihqpg snqllatdig dnvtltcdmf qplntrwerv
     3061 dgaplprnay tiknrleivr veqqnlgqyr cngigrdgnv ktyfvkelvl mplprirfyp
     3121 nipltveagq nldvhcqven vrpedvhwst dnnrplpssv rivgsvlrfv sitqaaagey
     3181 rcsafnqygn rsqiarvavk kpadfhqvpq sqlqrhrege niqlqctvtd qygvraqdnv
     3241 efnwfrddrr plpnnartds qilvltnlrp edagryicns ydvdrgqqlp evsidlqvlt
     3301 atpppnspiy lppqlpaksr dyslklddqs snlragestd vecyssddty tdvvwersdg
     3361 aplsnnvrqv gnrlvisnvs psdagnyvck cktdegdlyt tsyklevedq phelksskiv
     3421 yakvganadl qcgadesrqp tyrwsrqygq lqagrslmne klsldsvqan dagtyictaq
     3481 yadgetadfp nilvvtgaip qfrqeprsym sfptlpnssf kfnfeltfrp engdglllfn
     3541 gqtrgsgdyi alslkdryae frfdfggkpm lvraeeplal newhtvrvsr fkrdgyiqvd
     3601 eqhpvafptl qqipqldlie dlyiggvpnw ellpadavsq qvgfvgcisr ltlqgrtvel
     3661 ireakykegi tdcrpcaqgp cqnkgvcles qteqaytcic qpgwtgrdca iegtqctpgv
     3721 cgagrcente ndmeclcpln rsgdrcqyne ilnehslnfk gnsfaaygtp kvtkvnitls
     3781 vrpasledsv ilytaestlp sgdylalvlr gghaellint aarldpvvvr saeplplnrw
     3841 trieirrrlg egilrvgdgp erkakapgsd rilslkthly vggydrstvk vnrdvnitkg
     3901 fdgcisrlyn fqkpvnllad ikdaaniqsc getnmiggde dsdneppvpp ptpdvhenel
     3961 qpyamapcas dpcenggscs eqedvavcsc pfgfsgkhcq ehlqlgfnas frgdgyveln
     4021 rshfqpaleq sytsmgivft tnkpngllfw wgqeageeyt gqdfiaaavv dgyveysmrl
     4081 dgeeavirns dirvdngerh iviakrdent ailevdrmlh sgetrptskk smklpgnvfv
     4141 ggapdlevft gfrykhnlng civvvegetv gqinlssaav ngvnanvcpa nddplggtep
     4201 pvv