Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

basement membrane-specific heparan sulfate proteoglycan core


LOCUS       XP_070074541            3927 aa            linear   INV 09-DEC-2024
            protein isoform X26 [Drosophila takahashii].
ACCESSION   XP_070074541
VERSION     XP_070074541.1
DBLINK      BioProject: PRJNA1194641
DBSOURCE    REFSEQ: accession XM_070218440.1
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..3927
                     /organism="Drosophila takahashii"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     Protein         1..3927
                     /product="basement membrane-specific heparan sulfate
                     proteoglycan core protein isoform X26"
                     /calculated_mol_wt=433985
     Region          84..>169
                     /region_name="CCDC66"
                     /note="Coiled-coil domain-containing protein 66;
                     pfam15236"
                     /db_xref="CDD:434558"
     Region          385..415
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(385,394,405..406)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(398,401,405,411..412)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            408..412
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          418..451
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(423,430,441..442)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(434,437,441,447..448)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            444..448
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          504..536
                     /region_name="LDLa"
                     /note="Low-density lipoprotein receptor domain class A;
                     smart00192"
                     /db_xref="CDD:197566"
     Site            order(510,518,529..530)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(522,525,529,535..536)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            532..536
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          550..583
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(555,562,573..574)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(566,569,573,579..580)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            576..580
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          593..623
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(594,602,613..614)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(606,609,613,619..620)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            616..620
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          624..658
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(629,637,648..649)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(641,644,648,654..655)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            651..655
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          664..698
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(669,677,688..689)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(681,684,688,694..695)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            691..695
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          713..781
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          813..847
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(818,826,837..838)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(830,833,837,843..844)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            840..844
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          853..887
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(858,866,877..878)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(870,873,877,883..884)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            880..884
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          896..930
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(901,909,920..921)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(913,916,920,926..927)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            923..927
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          948..1023
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          951..955
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          964..968
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          987..991
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          1001..1006
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          1015..1018
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          1122..1252
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          1309..1362
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Site            order(1309,1311,1323,1333,1335,1344)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          1369..1406
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domai;
                     smart00180"
                     /db_xref="CDD:214543"
     Region          1487..1623
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          <1624..1650
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Region          1658..1707
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Site            order(1659,1661,1668,1675,1678,1687)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          <1744..1774
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Region          1840..1974
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          2129..2211
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2140..2144
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2156..2160
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2175..2179
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2189..2194
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2204..2207
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          <2238..2291
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          2322..2405
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2333..2337
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2346..2350
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2369..2373
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2383..2388
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2418..2498
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          2435..2439
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409570"
     Region          2448..2452
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409570"
     Region          2467..2471
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409570"
     Region          2481..2486
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409570"
     Region          2494..2497
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409570"
     Region          2516..2592
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          2524..2528
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2537..2541
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2558..2562
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2572..2577
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2585..2588
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          2605..2692
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2615..2619
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2628..2632
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2672..2677
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2817..2893
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2826..2830
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2839..2843
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2859..2863
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2873..2878
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2905..>2972
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2916..2920
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2932..2939
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2955..2959
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          3049..3117
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          3147..3213
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3155..3159
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          3168..3172
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          3187..3191
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          3201..3206
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          3235..3382
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3405..3437
                     /region_name="EGF"
                     /note="EGF-like domain; pfam00008"
                     /db_xref="CDD:394967"
     Region          3485..3638
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3692..3724
                     /region_name="EGF_CA"
                     /note="Calcium-binding EGF-like domain, present in a large
                     number of membrane-bound and extracellular (mostly animal)
                     proteins. Many of these proteins require calcium for their
                     biological function and calcium-binding sites have been
                     found to be located at the...; cd00054"
                     /db_xref="CDD:238011"
     Region          3733..3886
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     CDS             1..3927
                     /gene="trol"
                     /coded_by="XM_070218440.1:209..11992"
                     /db_xref="GeneID:108055788"
ORIGIN      
        1 mgspgsqapa iaigisngrr ghtgslllrl llvafvlnac hvpptnakqi tkskvddqdf
       61 iladtqslqg svdldddeaf lippdseekl dkkftpegnw wsqglhrvrr slnsffgsdd
      121 ddqekererq rqrqnnrdaa nrqkelrrqq kesqnrekql rlerqerqrl akrnnhvvfn
      181 rvtdprkras dlydeneasg lneeetttyr tyfvvnepys ddykerdslq fqnlqkllde
      241 dlrnffnrnf ennddeeqei hstlervdqt kdhfkirvql rvdlpssind fgskleqqln
      301 vykridrlga stdgvftfte ssvykyehqy divtprvivs gqpitekpve apqefdedpi
      361 evalptdeve gsgqdsgscr gdatfqcrrs gkticdemrc dgsrdcpdae deegcevcne
      421 lqfkcdnkcl plnkrcdnry dcedqtdeag cqryeveesq pqpqpqpqpe pepepepepe
      481 pepepepepe epitdneqpe qnsecratef rcnngdcidi rkrcdhisdc segedeneec
      541 pttrlkpsdc gpdqffcddl cynrsircng hmdcsdgsde iacsslsvlp cpqhqcpsgr
      601 cysesercdr hrhcedgsde anccyadqfr cnngdciaes ahcdgnidcs dqsdeldcgg
      661 dsqclpnqfr ckngqcvsst arcnkrsdcl dgsdeqncan epnnsgrgtn qlklktypdn
      721 qiikesrevi frcrdegpnr akvkwsrpgg rplppgftdr ngrleipnir vedagayvce
      781 avgyanyipg qhvtvnlnve rlnereirpd sacteyqatc mngecidksg icdghpdcsd
      841 gsdehscslg lkcqpnqfmc snskcvdrtw rcdgendcgd nsdetscdpe psdapcryde
      901 fqcrsghcip ksfqcdymnd ctdgtdeigc svpspmtlpa psivvmeyev leltcvgtgv
      961 ptptivwrln wghvpekces ksyggtgtlr cpnmrpqdsg ayscefintr gtfypktnsi
     1021 vtvtpvrsdv ckagffnmla rkseecvqcf cfgvstncds anlftyaiqp pilshrvvsv
     1081 elspfrqivi neaspgqdll tlhhgvqfra snvhyngret pflalpaeym gnqlksyggn
     1141 lryevryngn grpvsgpdvi itgnsftlth rvrthpgqnn rvtipflpgg wtkpdgrkgt
     1201 redimmilan vdnilirlgy ldstarevdl inialdsags adqglgsasl vekctcppgy
     1261 vgdscescas gyvrqargpw lghcvpftpe pcpagtygnp rlgvpcqecp cphagannfa
     1321 sgcqqspdgd vicrcnegya gkrcehcaqg yqgnplapgg vcrkipdssc nvdgtyniys
     1381 ngtcqckdsv igeqcdtcap ksfhlnsfty tgciecfcsg vgldcdsssw yrnqvtstfg
     1441 rtrvnhgfal isdymrntpv tvpvsmstqa nalsfvgsae qagntlywsl paaflgnklt
     1501 syggklsytl sysplpsgim srnsapdvvi ksgedlrlih yrksqvspsv antyaveike
     1561 sawqrgdelv pnrehvlmal snitaiyika tyttstkeas lrsvtldtat atnlgtarav
     1621 eveqcrcpeg ylglsceqca pgytrdpeag iylglcrpce cnghskycns etgecescsd
     1681 ntegfncdrc aagyvgdatr gtsydcqydd ggyptsrppa pgnqtaeclv ncqqegtagc
     1741 rgyqceckrn vagdrcdqcr pgtyglsaqn pdgckecycs gltnqcrsas lyrqlipvdf
     1801 istpplitde fgdimdrdnl vpdvprnvyt ykhtsytpky wslrgsvlgn qllsyggrle
     1861 yslivesvgr dhrgkdvvli gnglkliwsr pdghdneqey hvrlhedeqw tvedrgsarq
     1921 atradfmtvl sdlqhilila tpkvptvsts isnvilessi ttrapgatha sdielcqcps
     1981 gytgtscesc aplhyrdasg rcsqcpcdas ntescglvsg gnvecqcrpr wrgdrcreid
     2041 tspiieeppq icdlsrgfcc sgfqfdiapn etisfndtlq iykgnriign mtklrygcps
     2101 retneptpep dtstddpvrt qiivsiarpe itilpvggsl tlsctgrmrw tnspvfvnwy
     2161 kqgshlpegv evqggnlqlf nlqisdsgiy icqavsnetg hsftdhvsit vsqedqrspa
     2221 hivdlpndvt feeyvsneiv cevegnpppt vtwtrvdgha daqstrtdnn rlvfdsprks
     2281 degryrcqae nslsreekyv vvyvrsnppq pppqqdrlyi tpqevngvag dsfqlscqft
     2341 saaslrydws hdgrslsass prnvvvrgnv levrdanvrd sgtytcvafd lrtrrnftes
     2401 arvyieqpne pgilgdkphi ltleqniiiv qgedlsitce asgtpypsik wtkvqenlae
     2461 nvrisgnvlt iyggrsenrg lysciaensh gsdqsstsid ieprerpslt idtatqkvsv
     2521 gsqaslycaa qgipeptvew vrtdgqplsp rhkvqapgyv viddivldds gtyecrasni
     2581 agqvsglati nvqeptlvri epdrqhhivt qgdelslscv gsgvptpsvf wsfegrdvdr
     2641 mgvpegavfa qpfrtntadv kifrvskene giyvchgsnd agedqqyirv evqprrgdvg
     2701 aggddngdvd trqppnrpqi qpnplsnerl ttelgnnvtl icnvdnvnte wervdgtplp
     2761 hnaytvrntl vivfvepqnl gqyrcngigr dgrveahvvr elvllplpri tfypnipltv
     2821 elgqnldvyc qvenvrpedv hwttdnnrpl pssvriegnv lrfasitqaa ageyrcsatn
     2881 qygsrsknar vvvkqpsgfq pvphsqvqqr qvgdsiqlrc rlttqygdev rgniqfnwyr
     2941 edgsplprgv rpdsqvlqlv klqpedegry icnsydlgsg qqlppvsidl qvlrtttqyp
     3001 fnrfkggvsl kdtpcmvlyi caavpaapqn piylppvapp rsperilepq lslsvqssnl
     3061 pagdgttvec fssddsypdv vweradgapl senvqqvgnn lvisnvastd agnyvckckt
     3121 degdlyttsy kleveeqphe lksskivyak vggnadlqcg adedrqpsyr wsrqygqlqa
     3181 grslqnekls ldrvqandag tyvcsaqysd getvdfpnil vvtgaipqfr qeprsymsfp
     3241 tlsnssfkfn feltfrpena dglllfngqt rgsgdyials lkdryaefrf dfggkpllvr
     3301 aeeplaldew htvrvsrfkr dgyiqvddqh pvafptsqhq qipqlelied lyiggvpnwe
     3361 flpaeavgqq sgfvgcisrl tlqgrtveli reakfkegit dcrpcaqgpc qnkgvclesq
     3421 teqaytcvcq pgwtgrdcai egtqctagvc gsgrcenten dmeclcplnr agdrcqynei
     3481 lneqslnfks nsfaaygtpk vtkvnitlsv rpasledsvi lytaestlps gdylalvlrg
     3541 ghaellinta arldpvvvrs aeplplnrwt rieirrrlge gilkvgdgpe rkakapgsdr
     3601 ilslkthlfv ggvdrstvki nrdvnitkgf dgcisklyns qksvnllgdi rdaanvqncg
     3661 eaneidddey empvalpspk vaenerqlma pcasdpceng gscseqedma icscpfgfsg
     3721 khcqnhlqls fnasfrgdgy velnrshfqp aleqtyshig ivfttnkpng llfwwgqeag
     3781 eeytgqdfia aavvdgyvey smrldgeeav irnsdirvdn gerhiviakr dentamleld
     3841 qildtgdtrp tinkamklpg nvfvggapdv aaftgfrykd nfngcivvve getvgqinls
     3901 saaingvnan vcpandeplg gteppvv