Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

basement membrane-specific heparan sulfate proteoglycan core


LOCUS       XP_044252022            4002 aa            linear   INV 09-DEC-2024
            protein isoform X34 [Drosophila takahashii].
ACCESSION   XP_044252022
VERSION     XP_044252022.1
DBLINK      BioProject: PRJNA1194641
DBSOURCE    REFSEQ: accession XM_044396087.1
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..4002
                     /organism="Drosophila takahashii"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     Protein         1..4002
                     /product="basement membrane-specific heparan sulfate
                     proteoglycan core protein isoform X34"
                     /calculated_mol_wt=442474
     Region          84..>169
                     /region_name="CCDC66"
                     /note="Coiled-coil domain-containing protein 66;
                     pfam15236"
                     /db_xref="CDD:434558"
     Region          385..415
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(385,394,405..406)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(398,401,405,411..412)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            408..412
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          418..451
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(423,430,441..442)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(434,437,441,447..448)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            444..448
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          504..533
                     /region_name="LDLa"
                     /note="Low-density lipoprotein receptor domain class A;
                     smart00192"
                     /db_xref="CDD:197566"
     Site            order(510,518,529..530)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Region          592..627
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(597,606,617..618)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(610,613,617,623..624)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            620..624
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          716..749
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(721,728,739..740)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(732,735,739,745..746)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            742..746
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          759..789
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(760,768,779..780)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(772,775,779,785..786)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            782..786
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          790..824
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(795,803,814..815)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(807,810,814,820..821)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            817..821
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          830..864
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(835,843,854..855)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(847,850,854,860..861)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            857..861
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          879..947
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          979..1013
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(984,992,1003..1004)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(996,999,1003,1009..1010)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1006..1010
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1019..1053
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1024,1032,1043..1044)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1036,1039,1043,1049..1050)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1046..1050
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1062..1096
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1067,1075,1086..1087)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1079,1082,1086,1092..1093)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1089..1093
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1114..1189
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          1117..1121
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:143220"
     Region          1130..1134
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:143220"
     Region          1153..1157
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:143220"
     Region          1167..1172
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:143220"
     Region          1181..1184
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:143220"
     Region          1288..1418
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          1475..1528
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Site            order(1475,1477,1489,1499,1501,1510)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          1535..1572
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domai;
                     smart00180"
                     /db_xref="CDD:214543"
     Region          1653..1789
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          <1790..1816
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Region          1824..1873
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Site            order(1825,1827,1834,1841,1844,1853)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          <1910..1940
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Region          2006..2140
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          2233..2315
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2244..2248
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2260..2264
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2279..2283
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2293..2298
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2308..2311
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          <2342..2395
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          2426..2509
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2437..2441
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409544"
     Region          2450..2454
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409544"
     Region          2473..2477
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409544"
     Region          2487..2492
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409544"
     Region          2522..2602
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          2539..2543
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409570"
     Region          2552..2556
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409570"
     Region          2571..2575
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409570"
     Region          2585..2590
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409570"
     Region          2598..2601
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409570"
     Region          2620..2696
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          2628..2632
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409544"
     Region          2641..2645
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409544"
     Region          2662..2666
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409544"
     Region          2676..2681
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409544"
     Region          2689..2692
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409544"
     Region          2709..2796
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2719..2723
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409548"
     Region          2732..2736
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409548"
     Region          2776..2781
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409548"
     Region          2921..2997
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2930..2934
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409541"
     Region          2943..2947
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409541"
     Region          2963..2967
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409541"
     Region          2977..2982
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409541"
     Region          3009..>3076
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3020..3024
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409390"
     Region          3036..3043
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409390"
     Region          3059..3063
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409390"
     Region          3124..3192
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          3222..3288
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3230..3234
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409390"
     Region          3243..3247
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409390"
     Region          3262..3266
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409390"
     Region          3276..3281
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409390"
     Region          3310..3457
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3480..3512
                     /region_name="EGF"
                     /note="EGF-like domain; pfam00008"
                     /db_xref="CDD:394967"
     Region          3560..3713
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3767..3799
                     /region_name="EGF_CA"
                     /note="Calcium-binding EGF-like domain, present in a large
                     number of membrane-bound and extracellular (mostly animal)
                     proteins. Many of these proteins require calcium for their
                     biological function and calcium-binding sites have been
                     found to be located at the...; cd00054"
                     /db_xref="CDD:238011"
     Region          3808..3961
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     CDS             1..4002
                     /gene="trol"
                     /coded_by="XM_044396087.1:209..12217"
                     /db_xref="GeneID:108055788"
ORIGIN      
        1 mgspgsqapa iaigisngrr ghtgslllrl llvafvlnac hvpptnakqi tkskvddqdf
       61 iladtqslqg svdldddeaf lippdseekl dkkftpegnw wsqglhrvrr slnsffgsdd
      121 ddqekererq rqrqnnrdaa nrqkelrrqq kesqnrekql rlerqerqrl akrnnhvvfn
      181 rvtdprkras dlydeneasg lneeetttyr tyfvvnepys ddykerdslq fqnlqkllde
      241 dlrnffnrnf ennddeeqei hstlervdqt kdhfkirvql rvdlpssind fgskleqqln
      301 vykridrlga stdgvftfte ssvykyehqy divtprvivs gqpitekpve apqefdedpi
      361 evalptdeve gsgqdsgscr gdatfqcrrs gkticdemrc dgsrdcpdae deegcevcne
      421 lqfkcdnkcl plnkrcdnry dcedqtdeag cqryeveesq pqpqpqpqpe pepepepepe
      481 pepepepepe epitdneqpe qnsecratef rcnngdcidi rkrcdhisdc segedeneec
      541 peeeedsvgi pigrppqrpa pkhdwldeld aneyhvyhps nvyelanskn pcasnqfrca
      601 ttnvciplhl rcdnfyhcnd msdekdceqy qrrtttttrr pstsarpsft ftfttqgpgl
      661 lerrnsttsr ttagsttrat eapqwpwatr ptettttnpi ttvgvatttr lkpsdcgpdq
      721 ffcddlcynr sircnghmdc sdgsdeiacs slsvlpcpqh qcpsgrcyse sercdrhrhc
      781 edgsdeancc yadqfrcnng dciaesahcd gnidcsdqsd eldcggdsqc lpnqfrckng
      841 qcvsstarcn krsdcldgsd eqncanepnn sgrgtnqlkl ktypdnqiik esrevifrcr
      901 degpnrakvk wsrpggrplp pgftdrngrl eipnirveda gayvceavgy anyipgqhvt
      961 vnlnverlne reirpdsact eyqatcmnge cidksgicdg hpdcsdgsde hscslglkcq
     1021 pnqfmcsnsk cvdrtwrcdg endcgdnsde tscdpepsda pcrydefqcr sghcipksfq
     1081 cdymndctdg tdeigcsvps pmtlpapsiv vmeyevlelt cvgtgvptpt ivwrlnwghv
     1141 pekcesksyg gtgtlrcpnm rpqdsgaysc efintrgtfy pktnsivtvt pvrsdvckag
     1201 ffnmlarkse ecvqcfcfgv stncdsanlf tyaiqppils hrvvsvelsp frqivineas
     1261 pgqdlltlhh gvqfrasnvh yngretpfla lpaeymgnql ksyggnlrye vryngngrpv
     1321 sgpdviitgn sftlthrvrt hpgqnnrvti pflpggwtkp dgrkgtredi mmilanvdni
     1381 lirlgyldst arevdlinia ldsagsadqg lgsaslvekc tcppgyvgds cescasgyvr
     1441 qargpwlghc vpftpepcpa gtygnprlgv pcqecpcpha gannfasgcq qspdgdvicr
     1501 cnegyagkrc ehcaqgyqgn plapggvcrk ipdsscnvdg tyniysngtc qckdsvigeq
     1561 cdtcapksfh lnsftytgci ecfcsgvgld cdssswyrnq vtstfgrtrv nhgfalisdy
     1621 mrntpvtvpv smstqanals fvgsaeqagn tlywslpaaf lgnkltsygg klsytlsysp
     1681 lpsgimsrns apdvviksge dlrlihyrks qvspsvanty aveikesawq rgdelvpnre
     1741 hvlmalsnit aiyikatytt stkeaslrsv tldtatatnl gtaraveveq crcpegylgl
     1801 sceqcapgyt rdpeagiylg lcrpcecngh skycnsetge cescsdnteg fncdrcaagy
     1861 vgdatrgtsy dcqyddggyp tsrppapgnq taeclvncqq egtagcrgyq ceckrnvagd
     1921 rcdqcrpgty glsaqnpdgc kecycsgltn qcrsaslyrq lipvdfistp plitdefgdi
     1981 mdrdnlvpdv prnvytykht sytpkywslr gsvlgnqlls yggrleysli vesvgrdhrg
     2041 kdvvligngl kliwsrpdgh dneqeyhvrl hedeqwtved rgsarqatra dfmtvlsdlq
     2101 hililatpkv ptvstsisnv ilessittra pgathasdie lcqcpsgytg tscescaplh
     2161 yrdasgrcsq cpcdasntes cglvsggnve cqcrprwrgd rcreietnep tpepdtstdd
     2221 pvrtqiivsi arpeitilpv ggsltlsctg rmrwtnspvf vnwykqgshl pegvevqggn
     2281 lqlfnlqisd sgiyicqavs netghsftdh vsitvsqedq rspahivdlp ndvtfeeyvs
     2341 neivcevegn ppptvtwtrv dghadaqstr tdnnrlvfds prksdegryr cqaenslsre
     2401 ekyvvvyvrs nppqpppqqd rlyitpqevn gvagdsfqls cqftsaaslr ydwshdgrsl
     2461 sassprnvvv rgnvlevrda nvrdsgtytc vafdlrtrrn ftesarvyie qpnepgilgd
     2521 kphiltleqn iiivqgedls itceasgtpy psikwtkvqe nlaenvrisg nvltiyggrs
     2581 enrglyscia enshgsdqss tsidieprer psltidtatq kvsvgsqasl ycaaqgipep
     2641 tvewvrtdgq plsprhkvqa pgyvviddiv lddsgtyecr asniagqvsg latinvqept
     2701 lvriepdrqh hivtqgdels lscvgsgvpt psvfwsfegr dvdrmgvpeg avfaqpfrtn
     2761 tadvkifrvs kenegiyvch gsndagedqq yirvevqprr gdvgaggddn gdvdtrqppn
     2821 rpqiqpnpls nerlttelgn nvtlicnvdn vntewervdg tplphnaytv rntlvivfve
     2881 pqnlgqyrcn gigrdgrvea hvvrelvllp lpritfypni pltvelgqnl dvycqvenvr
     2941 pedvhwttdn nrplpssvri egnvlrfasi tqaaageyrc satnqygsrs knarvvvkqp
     3001 sgfqpvphsq vqqrqvgdsi qlrcrlttqy gdevrgniqf nwyredgspl prgvrpdsqv
     3061 lqlvklqped egryicnsyd lgsgqqlppv sidlqvltvp aapqnpiylp pvapprsper
     3121 ilepqlslsv qssnlpagdg ttvecfssdd sypdvvwera dgaplsenvq qvgnnlvisn
     3181 vastdagnyv ckcktdegdl yttsykleve eqphelkssk ivyakvggna dlqcgadedr
     3241 qpsyrwsrqy gqlqagrslq neklsldrvq andagtyvcs aqysdgetvd fpnilvvtga
     3301 ipqfrqeprs ymsfptlsns sfkfnfeltf rpenadglll fngqtrgsgd yialslkdry
     3361 aefrfdfggk pllvraeepl aldewhtvrv srfkrdgyiq vddqhpvafp tsqhqqipql
     3421 eliedlyigg vpnweflpae avgqqsgfvg cisrltlqgr tvelireakf kegitdcrpc
     3481 aqgpcqnkgv clesqteqay tcvcqpgwtg rdcaiegtqc tagvcgsgrc entendmecl
     3541 cplnragdrc qyneilneqs lnfksnsfaa ygtpkvtkvn itlsvrpasl edsvilytae
     3601 stlpsgdyla lvlrgghael lintaarldp vvvrsaeplp lnrwtrieir rrlgegilkv
     3661 gdgperkaka pgsdrilslk thlfvggvdr stvkinrdvn itkgfdgcis klynsqksvn
     3721 llgdirdaan vqncgeanei dddeyempva lpspkvaene rqlmapcasd pcenggscse
     3781 qedmaicscp fgfsgkhcqn hlqlsfnasf rgdgyvelnr shfqpaleqt yshigivftt
     3841 nkpngllfww gqeageeytg qdfiaaavvd gyveysmrld geeavirnsd irvdngerhi
     3901 viakrdenta mleldqildt gdtrptinka mklpgnvfvg gapdvaaftg frykdnfngc
     3961 ivvvegetvg qinlssaain gvnanvcpan deplggtepp vv