Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

basement membrane-specific heparan sulfate proteoglycan core


LOCUS       XP_044252014            4114 aa            linear   INV 09-DEC-2024
            protein isoform X21 [Drosophila takahashii].
ACCESSION   XP_044252014
VERSION     XP_044252014.1
DBLINK      BioProject: PRJNA1194641
DBSOURCE    REFSEQ: accession XM_044396079.1
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..4114
                     /organism="Drosophila takahashii"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     Protein         1..4114
                     /product="basement membrane-specific heparan sulfate
                     proteoglycan core protein isoform X21"
                     /calculated_mol_wt=454834
     Region          84..>169
                     /region_name="CCDC66"
                     /note="Coiled-coil domain-containing protein 66;
                     pfam15236"
                     /db_xref="CDD:434558"
     Region          385..415
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(385,394,405..406)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(398,401,405,411..412)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            408..412
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          418..451
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(423,430,441..442)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(434,437,441,447..448)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            444..448
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          504..536
                     /region_name="LDLa"
                     /note="Low-density lipoprotein receptor domain class A;
                     smart00192"
                     /db_xref="CDD:197566"
     Site            order(510,518,529..530)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(522,525,529,535..536)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            532..536
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          544..579
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(549,558,569..570)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(562,565,569,575..576)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            572..576
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          589..623
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(594,602,613..614)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(606,609,613,619..620)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            616..620
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          675..710
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(680,689,700..701)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(693,696,700,706..707)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            703..707
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          799..832
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(804,811,822..823)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(815,818,822,828..829)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            825..829
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          842..872
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(843,851,862..863)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(855,858,862,868..869)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            865..869
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          873..907
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(878,886,897..898)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(890,893,897,903..904)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            900..904
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          913..947
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(918,926,937..938)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(930,933,937,943..944)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            940..944
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          962..1030
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          1062..1096
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1067,1075,1086..1087)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1079,1082,1086,1092..1093)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1089..1093
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1102..1136
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1107,1115,1126..1127)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1119,1122,1126,1132..1133)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1129..1133
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1145..1179
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1150,1158,1169..1170)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1162,1165,1169,1175..1176)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1172..1176
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1197..1272
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          1200..1204
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:143220"
     Region          1213..1217
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:143220"
     Region          1236..1240
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:143220"
     Region          1250..1255
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:143220"
     Region          1264..1267
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:143220"
     Region          1371..1501
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          1558..1611
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Site            order(1558,1560,1572,1582,1584,1593)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          1618..1655
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domai;
                     smart00180"
                     /db_xref="CDD:214543"
     Region          1736..1872
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          <1873..1899
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Region          1907..1956
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Site            order(1908,1910,1917,1924,1927,1936)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          <1993..2023
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Region          2089..2223
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          2316..2398
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2327..2331
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2343..2347
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2362..2366
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2376..2381
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2391..2394
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          <2425..2478
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          2509..2592
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2520..2524
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409544"
     Region          2533..2537
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409544"
     Region          2556..2560
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409544"
     Region          2570..2575
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409544"
     Region          2605..2685
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          2622..2626
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409570"
     Region          2635..2639
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409570"
     Region          2654..2658
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409570"
     Region          2668..2673
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409570"
     Region          2681..2684
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409570"
     Region          2703..2779
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          2711..2715
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409544"
     Region          2724..2728
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409544"
     Region          2745..2749
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409544"
     Region          2759..2764
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409544"
     Region          2772..2775
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409544"
     Region          2792..2879
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2802..2806
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409548"
     Region          2815..2819
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409548"
     Region          2859..2864
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409548"
     Region          3004..3080
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3013..3017
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409541"
     Region          3026..3030
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409541"
     Region          3046..3050
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409541"
     Region          3060..3065
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409541"
     Region          3092..>3159
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3103..3107
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409390"
     Region          3119..3126
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409390"
     Region          3142..3146
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409390"
     Region          3236..3304
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          3334..3400
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3342..3346
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409390"
     Region          3355..3359
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409390"
     Region          3374..3378
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409390"
     Region          3388..3393
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409390"
     Region          3422..3569
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3592..3624
                     /region_name="EGF"
                     /note="EGF-like domain; pfam00008"
                     /db_xref="CDD:394967"
     Region          3672..3825
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3879..3911
                     /region_name="EGF_CA"
                     /note="Calcium-binding EGF-like domain, present in a large
                     number of membrane-bound and extracellular (mostly animal)
                     proteins. Many of these proteins require calcium for their
                     biological function and calcium-binding sites have been
                     found to be located at the...; cd00054"
                     /db_xref="CDD:238011"
     Region          3920..4073
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     CDS             1..4114
                     /gene="trol"
                     /coded_by="XM_044396079.1:209..12553"
                     /db_xref="GeneID:108055788"
ORIGIN      
        1 mgspgsqapa iaigisngrr ghtgslllrl llvafvlnac hvpptnakqi tkskvddqdf
       61 iladtqslqg svdldddeaf lippdseekl dkkftpegnw wsqglhrvrr slnsffgsdd
      121 ddqekererq rqrqnnrdaa nrqkelrrqq kesqnrekql rlerqerqrl akrnnhvvfn
      181 rvtdprkras dlydeneasg lneeetttyr tyfvvnepys ddykerdslq fqnlqkllde
      241 dlrnffnrnf ennddeeqei hstlervdqt kdhfkirvql rvdlpssind fgskleqqln
      301 vykridrlga stdgvftfte ssvykyehqy divtprvivs gqpitekpve apqefdedpi
      361 evalptdeve gsgqdsgscr gdatfqcrrs gkticdemrc dgsrdcpdae deegcevcne
      421 lqfkcdnkcl plnkrcdnry dcedqtdeag cqryeveesq pqpqpqpqpe pepepepepe
      481 pepepepepe epitdneqpe qnsecratef rcnngdcidi rkrcdhisdc segedeneec
      541 paacsgmeyq crdgtrcisl sqqcdghsdc sdaddeehcd gsgndgedcr fdefrcgtge
      601 cipmrqvcdn iydcndysde vscaeeeeds vgipigrppq rpapkhdwld eldaneyhvy
      661 hpsnvyelan sknpcasnqf rcattnvcip lhlrcdnfyh cndmsdekdc eqyqrrtttt
      721 trrpstsarp sftftfttqg pgllerrnst tsrttagstt rateapqwpw atrptetttt
      781 npittvgvat ttrlkpsdcg pdqffcddlc ynrsircngh mdcsdgsdei acsslsvlpc
      841 pqhqcpsgrc ysesercdrh rhcedgsdea nccyadqfrc nngdciaesa hcdgnidcsd
      901 qsdeldcggd sqclpnqfrc kngqcvssta rcnkrsdcld gsdeqncane pnnsgrgtnq
      961 lklktypdnq iikesrevif rcrdegpnra kvkwsrpggr plppgftdrn grleipnirv
     1021 edagayvcea vgyanyipgq hvtvnlnver lnereirpds acteyqatcm ngecidksgi
     1081 cdghpdcsdg sdehscslgl kcqpnqfmcs nskcvdrtwr cdgendcgdn sdetscdpep
     1141 sdapcrydef qcrsghcipk sfqcdymndc tdgtdeigcs vpspmtlpap sivvmeyevl
     1201 eltcvgtgvp tptivwrlnw ghvpekcesk syggtgtlrc pnmrpqdsga yscefintrg
     1261 tfypktnsiv tvtpvrsdvc kagffnmlar kseecvqcfc fgvstncdsa nlftyaiqpp
     1321 ilshrvvsve lspfrqivin easpgqdllt lhhgvqfras nvhyngretp flalpaeymg
     1381 nqlksyggnl ryevryngng rpvsgpdvii tgnsftlthr vrthpgqnnr vtipflpggw
     1441 tkpdgrkgtr edimmilanv dnilirlgyl dstarevdli nialdsagsa dqglgsaslv
     1501 ekctcppgyv gdscescasg yvrqargpwl ghcvpftpep cpagtygnpr lgvpcqecpc
     1561 phagannfas gcqqspdgdv icrcnegyag krcehcaqgy qgnplapggv crkipdsscn
     1621 vdgtyniysn gtcqckdsvi geqcdtcapk sfhlnsftyt gciecfcsgv gldcdssswy
     1681 rnqvtstfgr trvnhgfali sdymrntpvt vpvsmstqan alsfvgsaeq agntlywslp
     1741 aaflgnklts yggklsytls ysplpsgims rnsapdvvik sgedlrlihy rksqvspsva
     1801 ntyaveikes awqrgdelvp nrehvlmals nitaiyikat yttstkeasl rsvtldtata
     1861 tnlgtarave veqcrcpegy lglsceqcap gytrdpeagi ylglcrpcec nghskycnse
     1921 tgecescsdn tegfncdrca agyvgdatrg tsydcqyddg gyptsrppap gnqtaeclvn
     1981 cqqegtagcr gyqceckrnv agdrcdqcrp gtyglsaqnp dgckecycsg ltnqcrsasl
     2041 yrqlipvdfi stpplitdef gdimdrdnlv pdvprnvyty khtsytpkyw slrgsvlgnq
     2101 llsyggrley slivesvgrd hrgkdvvlig nglkliwsrp dghdneqeyh vrlhedeqwt
     2161 vedrgsarqa tradfmtvls dlqhililat pkvptvstsi snvilessit trapgathas
     2221 dielcqcpsg ytgtscesca plhyrdasgr csqcpcdasn tescglvsgg nvecqcrprw
     2281 rgdrcreiet neptpepdts tddpvrtqii vsiarpeiti lpvggsltls ctgrmrwtns
     2341 pvfvnwykqg shlpegvevq ggnlqlfnlq isdsgiyicq avsnetghsf tdhvsitvsq
     2401 edqrspahiv dlpndvtfee yvsneivcev egnppptvtw trvdghadaq strtdnnrlv
     2461 fdsprksdeg ryrcqaensl sreekyvvvy vrsnppqppp qqdrlyitpq evngvagdsf
     2521 qlscqftsaa slrydwshdg rslsassprn vvvrgnvlev rdanvrdsgt ytcvafdlrt
     2581 rrnftesarv yieqpnepgi lgdkphiltl eqniiivqge dlsitceasg tpypsikwtk
     2641 vqenlaenvr isgnvltiyg grsenrglys ciaenshgsd qsstsidiep rerpsltidt
     2701 atqkvsvgsq aslycaaqgi peptvewvrt dgqplsprhk vqapgyvvid divlddsgty
     2761 ecrasniagq vsglatinvq eptlvriepd rqhhivtqgd elslscvgsg vptpsvfwsf
     2821 egrdvdrmgv pegavfaqpf rtntadvkif rvskenegiy vchgsndage dqqyirvevq
     2881 prrgdvgagg ddngdvdtrq ppnrpqiqpn plsnerltte lgnnvtlicn vdnvntewer
     2941 vdgtplphna ytvrntlviv fvepqnlgqy rcngigrdgr veahvvrelv llplpritfy
     3001 pnipltvelg qnldvycqve nvrpedvhwt tdnnrplpss vriegnvlrf asitqaaage
     3061 yrcsatnqyg srsknarvvv kqpsgfqpvp hsqvqqrqvg dsiqlrcrlt tqygdevrgn
     3121 iqfnwyredg splprgvrpd sqvlqlvklq pedegryicn sydlgsgqql ppvsidlqvl
     3181 rtttqypfnr fkggvslkdt pcmvlyicaa vpaapqnpiy lppvapprsp erilepqlsl
     3241 svqssnlpag dgttvecfss ddsypdvvwe radgaplsen vqqvgnnlvi snvastdagn
     3301 yvckcktdeg dlyttsykle veeqphelks skivyakvgg nadlqcgade drqpsyrwsr
     3361 qygqlqagrs lqneklsldr vqandagtyv csaqysdget vdfpnilvvt gaipqfrqep
     3421 rsymsfptls nssfkfnfel tfrpenadgl llfngqtrgs gdyialslkd ryaefrfdfg
     3481 gkpllvraee plaldewhtv rvsrfkrdgy iqvddqhpva fptsqhqqip qleliedlyi
     3541 ggvpnweflp aeavgqqsgf vgcisrltlq grtvelirea kfkegitdcr pcaqgpcqnk
     3601 gvclesqteq aytcvcqpgw tgrdcaiegt qctagvcgsg rcentendme clcplnragd
     3661 rcqyneilne qslnfksnsf aaygtpkvtk vnitlsvrpa sledsvilyt aestlpsgdy
     3721 lalvlrggha ellintaarl dpvvvrsaep lplnrwtrie irrrlgegil kvgdgperka
     3781 kapgsdrils lkthlfvggv drstvkinrd vnitkgfdgc isklynsqks vnllgdirda
     3841 anvqncgean eidddeyemp valpspkvae nerqlmapca sdpcenggsc seqedmaics
     3901 cpfgfsgkhc qnhlqlsfna sfrgdgyvel nrshfqpale qtyshigivf ttnkpngllf
     3961 wwgqeageey tgqdfiaaav vdgyveysmr ldgeeavirn sdirvdnger hiviakrden
     4021 tamleldqil dtgdtrptin kamklpgnvf vggapdvaaf tgfrykdnfn gcivvveget
     4081 vgqinlssaa ingvnanvcp andeplggte ppvv