Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

basement membrane-specific heparan sulfate proteoglycan core


LOCUS       XP_070074538            4176 aa            linear   INV 09-DEC-2024
            protein isoform X20 [Drosophila takahashii].
ACCESSION   XP_070074538
VERSION     XP_070074538.1
DBLINK      BioProject: PRJNA1194641
DBSOURCE    REFSEQ: accession XM_070218437.1
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..4176
                     /organism="Drosophila takahashii"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     Protein         1..4176
                     /product="basement membrane-specific heparan sulfate
                     proteoglycan core protein isoform X20"
                     /calculated_mol_wt=461789
     Region          84..>169
                     /region_name="CCDC66"
                     /note="Coiled-coil domain-containing protein 66;
                     pfam15236"
                     /db_xref="CDD:434558"
     Region          385..415
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(385,394,405..406)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(398,401,405,411..412)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            408..412
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          418..451
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(423,430,441..442)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(434,437,441,447..448)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            444..448
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          504..536
                     /region_name="LDLa"
                     /note="Low-density lipoprotein receptor domain class A;
                     smart00192"
                     /db_xref="CDD:197566"
     Site            order(510,518,529..530)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(522,525,529,535..536)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            532..536
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          544..579
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(549,558,569..570)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(562,565,569,575..576)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            572..576
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          589..623
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(594,602,613..614)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(606,609,613,619..620)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            616..620
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          675..710
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(680,689,700..701)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(693,696,700,706..707)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            703..707
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          799..832
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(804,811,822..823)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(815,818,822,828..829)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            825..829
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          842..872
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(843,851,862..863)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(855,858,862,868..869)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            865..869
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          873..907
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(878,886,897..898)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(890,893,897,903..904)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            900..904
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          913..947
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(918,926,937..938)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(930,933,937,943..944)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            940..944
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          962..1030
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          1062..1096
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1067,1075,1086..1087)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1079,1082,1086,1092..1093)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1089..1093
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1102..1136
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1107,1115,1126..1127)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1119,1122,1126,1132..1133)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1129..1133
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1145..1179
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1150,1158,1169..1170)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1162,1165,1169,1175..1176)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1172..1176
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1197..1272
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          1200..1204
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          1213..1217
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          1236..1240
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          1250..1255
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          1264..1267
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          1371..1501
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          1558..1611
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Site            order(1558,1560,1572,1582,1584,1593)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          1736..1872
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          <1873..1899
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Region          1907..1956
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Site            order(1908,1910,1917,1924,1927,1936)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          <1993..2023
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Region          2089..2223
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          2378..2460
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2389..2393
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2405..2409
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2424..2428
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2438..2443
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2453..2456
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          <2487..2540
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          2571..2654
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2582..2586
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2595..2599
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2618..2622
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2632..2637
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2667..2747
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          2684..2688
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409570"
     Region          2697..2701
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409570"
     Region          2716..2720
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409570"
     Region          2730..2735
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409570"
     Region          2743..2746
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409570"
     Region          2765..2841
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          2773..2777
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2786..2790
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2807..2811
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2821..2826
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2834..2837
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          2854..2941
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2864..2868
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2877..2881
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2921..2926
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          3066..3142
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3075..3079
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          3088..3092
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          3108..3112
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          3122..3127
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          3154..>3221
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3165..3169
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          3181..3188
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          3204..3208
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          3298..3366
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          3396..3462
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3404..3408
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          3417..3421
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          3436..3440
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          3450..3455
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          3484..3631
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3654..3686
                     /region_name="EGF"
                     /note="EGF-like domain; pfam00008"
                     /db_xref="CDD:394967"
     Region          3734..3887
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3941..3973
                     /region_name="EGF_CA"
                     /note="Calcium-binding EGF-like domain, present in a large
                     number of membrane-bound and extracellular (mostly animal)
                     proteins. Many of these proteins require calcium for their
                     biological function and calcium-binding sites have been
                     found to be located at the...; cd00054"
                     /db_xref="CDD:238011"
     Region          3982..4135
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     CDS             1..4176
                     /gene="trol"
                     /coded_by="XM_070218437.1:209..12739"
                     /db_xref="GeneID:108055788"
ORIGIN      
        1 mgspgsqapa iaigisngrr ghtgslllrl llvafvlnac hvpptnakqi tkskvddqdf
       61 iladtqslqg svdldddeaf lippdseekl dkkftpegnw wsqglhrvrr slnsffgsdd
      121 ddqekererq rqrqnnrdaa nrqkelrrqq kesqnrekql rlerqerqrl akrnnhvvfn
      181 rvtdprkras dlydeneasg lneeetttyr tyfvvnepys ddykerdslq fqnlqkllde
      241 dlrnffnrnf ennddeeqei hstlervdqt kdhfkirvql rvdlpssind fgskleqqln
      301 vykridrlga stdgvftfte ssvykyehqy divtprvivs gqpitekpve apqefdedpi
      361 evalptdeve gsgqdsgscr gdatfqcrrs gkticdemrc dgsrdcpdae deegcevcne
      421 lqfkcdnkcl plnkrcdnry dcedqtdeag cqryeveesq pqpqpqpqpe pepepepepe
      481 pepepepepe epitdneqpe qnsecratef rcnngdcidi rkrcdhisdc segedeneec
      541 paacsgmeyq crdgtrcisl sqqcdghsdc sdaddeehcd gsgndgedcr fdefrcgtge
      601 cipmrqvcdn iydcndysde vscaeeeeds vgipigrppq rpapkhdwld eldaneyhvy
      661 hpsnvyelan sknpcasnqf rcattnvcip lhlrcdnfyh cndmsdekdc eqyqrrtttt
      721 trrpstsarp sftftfttqg pgllerrnst tsrttagstt rateapqwpw atrptetttt
      781 npittvgvat ttrlkpsdcg pdqffcddlc ynrsircngh mdcsdgsdei acsslsvlpc
      841 pqhqcpsgrc ysesercdrh rhcedgsdea nccyadqfrc nngdciaesa hcdgnidcsd
      901 qsdeldcggd sqclpnqfrc kngqcvssta rcnkrsdcld gsdeqncane pnnsgrgtnq
      961 lklktypdnq iikesrevif rcrdegpnra kvkwsrpggr plppgftdrn grleipnirv
     1021 edagayvcea vgyanyipgq hvtvnlnver lnereirpds acteyqatcm ngecidksgi
     1081 cdghpdcsdg sdehscslgl kcqpnqfmcs nskcvdrtwr cdgendcgdn sdetscdpep
     1141 sdapcrydef qcrsghcipk sfqcdymndc tdgtdeigcs vpspmtlpap sivvmeyevl
     1201 eltcvgtgvp tptivwrlnw ghvpekcesk syggtgtlrc pnmrpqdsga yscefintrg
     1261 tfypktnsiv tvtpvrsdvc kagffnmlar kseecvqcfc fgvstncdsa nlftyaiqpp
     1321 ilshrvvsve lspfrqivin easpgqdllt lhhgvqfras nvhyngretp flalpaeymg
     1381 nqlksyggnl ryevryngng rpvsgpdvii tgnsftlthr vrthpgqnnr vtipflpggw
     1441 tkpdgrkgtr edimmilanv dnilirlgyl dstarevdli nialdsagsa dqglgsaslv
     1501 ekctcppgyv gdscescasg yvrqargpwl ghcvpftpep cpagtygnpr lgvpcqecpc
     1561 phagannfas gcqqspdgdv icrcnegyag krcehcaqgy qgnplapggv crkipdsscn
     1621 vdgtyniysn gtcqckdsvi geqcdtcapk sfhlnsftyt gciecfcsgv gldcdssswy
     1681 rnqvtstfgr trvnhgfali sdymrntpvt vpvsmstqan alsfvgsaeq agntlywslp
     1741 aaflgnklts yggklsytls ysplpsgims rnsapdvvik sgedlrlihy rksqvspsva
     1801 ntyaveikes awqrgdelvp nrehvlmals nitaiyikat yttstkeasl rsvtldtata
     1861 tnlgtarave veqcrcpegy lglsceqcap gytrdpeagi ylglcrpcec nghskycnse
     1921 tgecescsdn tegfncdrca agyvgdatrg tsydcqyddg gyptsrppap gnqtaeclvn
     1981 cqqegtagcr gyqceckrnv agdrcdqcrp gtyglsaqnp dgckecycsg ltnqcrsasl
     2041 yrqlipvdfi stpplitdef gdimdrdnlv pdvprnvyty khtsytpkyw slrgsvlgnq
     2101 llsyggrley slivesvgrd hrgkdvvlig nglkliwsrp dghdneqeyh vrlhedeqwt
     2161 vedrgsarqa tradfmtvls dlqhililat pkvptvstsi snvilessit trapgathas
     2221 dielcqcpsg ytgtscesca plhyrdasgr csqcpcdasn tescglvsgg nvecqcrprw
     2281 rgdrcreidt spiieeppqi cdlsrgfccs gfqfdiapne tisfndtlqi ykgnriignm
     2341 tklrygcpsr etneptpepd tstddpvrtq iivsiarpei tilpvggslt lsctgrmrwt
     2401 nspvfvnwyk qgshlpegve vqggnlqlfn lqisdsgiyi cqavsnetgh sftdhvsitv
     2461 sqedqrspah ivdlpndvtf eeyvsneivc evegnppptv twtrvdghad aqstrtdnnr
     2521 lvfdsprksd egryrcqaen slsreekyvv vyvrsnppqp ppqqdrlyit pqevngvagd
     2581 sfqlscqfts aaslrydwsh dgrslsassp rnvvvrgnvl evrdanvrds gtytcvafdl
     2641 rtrrnftesa rvyieqpnep gilgdkphil tleqniiivq gedlsitcea sgtpypsikw
     2701 tkvqenlaen vrisgnvlti yggrsenrgl ysciaenshg sdqsstsidi eprerpslti
     2761 dtatqkvsvg sqaslycaaq gipeptvewv rtdgqplspr hkvqapgyvv iddivlddsg
     2821 tyecrasnia gqvsglatin vqeptlvrie pdrqhhivtq gdelslscvg sgvptpsvfw
     2881 sfegrdvdrm gvpegavfaq pfrtntadvk ifrvskeneg iyvchgsnda gedqqyirve
     2941 vqprrgdvga ggddngdvdt rqppnrpqiq pnplsnerlt telgnnvtli cnvdnvntew
     3001 ervdgtplph naytvrntlv ivfvepqnlg qyrcngigrd grveahvvre lvllplprit
     3061 fypnipltve lgqnldvycq venvrpedvh wttdnnrplp ssvriegnvl rfasitqaaa
     3121 geyrcsatnq ygsrsknarv vvkqpsgfqp vphsqvqqrq vgdsiqlrcr lttqygdevr
     3181 gniqfnwyre dgsplprgvr pdsqvlqlvk lqpedegryi cnsydlgsgq qlppvsidlq
     3241 vlrtttqypf nrfkggvslk dtpcmvlyic aavpaapqnp iylppvappr sperilepql
     3301 slsvqssnlp agdgttvecf ssddsypdvv weradgapls envqqvgnnl visnvastda
     3361 gnyvckcktd egdlyttsyk leveeqphel ksskivyakv ggnadlqcga dedrqpsyrw
     3421 srqygqlqag rslqneklsl drvqandagt yvcsaqysdg etvdfpnilv vtgaipqfrq
     3481 eprsymsfpt lsnssfkfnf eltfrpenad glllfngqtr gsgdyialsl kdryaefrfd
     3541 fggkpllvra eeplaldewh tvrvsrfkrd gyiqvddqhp vafptsqhqq ipqleliedl
     3601 yiggvpnwef lpaeavgqqs gfvgcisrlt lqgrtvelir eakfkegitd crpcaqgpcq
     3661 nkgvclesqt eqaytcvcqp gwtgrdcaie gtqctagvcg sgrcentend meclcplnra
     3721 gdrcqyneil neqslnfksn sfaaygtpkv tkvnitlsvr pasledsvil ytaestlpsg
     3781 dylalvlrgg haellintaa rldpvvvrsa eplplnrwtr ieirrrlgeg ilkvgdgper
     3841 kakapgsdri lslkthlfvg gvdrstvkin rdvnitkgfd gcisklynsq ksvnllgdir
     3901 daanvqncge aneidddeye mpvalpspkv aenerqlmap casdpcengg scseqedmai
     3961 cscpfgfsgk hcqnhlqlsf nasfrgdgyv elnrshfqpa leqtyshigi vfttnkpngl
     4021 lfwwgqeage eytgqdfiaa avvdgyveys mrldgeeavi rnsdirvdng erhiviakrd
     4081 entamleldq ildtgdtrpt inkamklpgn vfvggapdva aftgfrykdn fngcivvveg
     4141 etvgqinlss aaingvnanv cpandeplgg teppvv