LOCUS       XP_044252017            3865 aa            linear   INV 09-DEC-2024
DEFINITION  basement membrane-specific heparan sulfate proteoglycan core
            protein isoform X27 [Drosophila takahashii].
ACCESSION   XP_044252017
VERSION     XP_044252017.1
DBLINK      BioProject: PRJNA1194641
DBSOURCE    REFSEQ: accession XM_044396082.1
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..3865
                     /organism="Drosophila takahashii"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     Protein         1..3865
                     /product="basement membrane-specific heparan sulfate
                     proteoglycan core protein isoform X27"
                     /calculated_mol_wt=427030
     Region          84..>169
                     /region_name="CCDC66"
                     /note="Coiled-coil domain-containing protein 66;
                     pfam15236"
                     /db_xref="CDD:434558"
     Region          385..415
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(385,394,405..406)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(398,401,405,411..412)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            408..412
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          418..451
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(423,430,441..442)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(434,437,441,447..448)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            444..448
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          504..536
                     /region_name="LDLa"
                     /note="Low-density lipoprotein receptor domain class A;
                     smart00192"
                     /db_xref="CDD:197566"
     Site            order(510,518,529..530)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(522,525,529,535..536)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            532..536
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          550..583
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(555,562,573..574)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(566,569,573,579..580)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            576..580
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          593..623
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(594,602,613..614)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(606,609,613,619..620)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            616..620
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          624..658
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(629,637,648..649)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(641,644,648,654..655)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            651..655
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          664..698
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(669,677,688..689)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(681,684,688,694..695)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            691..695
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          713..781
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          813..847
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(818,826,837..838)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(830,833,837,843..844)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            840..844
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          853..887
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(858,866,877..878)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(870,873,877,883..884)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            880..884
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          896..930
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(901,909,920..921)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(913,916,920,926..927)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            923..927
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          948..1023
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          951..955
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:143220"
     Region          964..968
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:143220"
     Region          987..991
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:143220"
     Region          1001..1006
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:143220"
     Region          1015..1018
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:143220"
     Region          1122..1252
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          1309..1362
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Site            order(1309,1311,1323,1333,1335,1344)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          1369..1406
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     smart00180"
                     /db_xref="CDD:214543"
     Region          1487..1623
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          <1624..1650
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Region          1658..1707
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Site            order(1659,1661,1668,1675,1678,1687)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          <1744..1774
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Region          1840..1974
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          2067..2149
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2078..2082
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2094..2098
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2113..2117
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2127..2132
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2142..2145
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          <2176..2229
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          2260..2343
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2271..2275
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409544"
     Region          2284..2288
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409544"
     Region          2307..2311
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409544"
     Region          2321..2326
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409544"
     Region          2356..2436
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          2373..2377
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409570"
     Region          2386..2390
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409570"
     Region          2405..2409
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409570"
     Region          2419..2424
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409570"
     Region          2432..2435
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409570"
     Region          2454..2530
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          2462..2466
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409544"
     Region          2475..2479
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409544"
     Region          2496..2500
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409544"
     Region          2510..2515
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409544"
     Region          2523..2526
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409544"
     Region          2543..2630
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2553..2557
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409548"
     Region          2566..2570
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409548"
     Region          2610..2615
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409548"
     Region          2755..2831
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2764..2768
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409541"
     Region          2777..2781
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409541"
     Region          2797..2801
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409541"
     Region          2811..2816
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409541"
     Region          2843..>2910
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2854..2858
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409390"
     Region          2870..2877
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409390"
     Region          2893..2897
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409390"
     Region          2987..3055
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          3085..3151
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3093..3097
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409390"
     Region          3106..3110
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409390"
     Region          3125..3129
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409390"
     Region          3139..3144
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409390"
     Region          3173..3320
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3343..3375
                     /region_name="EGF"
                     /note="EGF-like domain; pfam00008"
                     /db_xref="CDD:394967"
     Region          3423..3576
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3630..3662
                     /region_name="EGF_CA"
                     /note="Calcium-binding EGF-like domain, present in a large
                     number of membrane-bound and extracellular (mostly animal)
                     proteins. Many of these proteins require calcium for their
                     biological function and calcium-binding sites have been
                     found to be located at the...; cd00054"
                     /db_xref="CDD:238011"
     Region          3671..3824
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     CDS             1..3865
                     /gene="trol"
                     /coded_by="XM_044396082.1:209..11806"
                     /db_xref="GeneID:108055788"
ORIGIN      
        1 mgspgsqapa iaigisngrr ghtgslllrl llvafvlnac hvpptnakqi tkskvddqdf
       61 iladtqslqg svdldddeaf lippdseekl dkkftpegnw wsqglhrvrr slnsffgsdd
      121 ddqekererq rqrqnnrdaa nrqkelrrqq kesqnrekql rlerqerqrl akrnnhvvfn
      181 rvtdprkras dlydeneasg lneeetttyr tyfvvnepys ddykerdslq fqnlqkllde
      241 dlrnffnrnf ennddeeqei hstlervdqt kdhfkirvql rvdlpssind fgskleqqln
      301 vykridrlga stdgvftfte ssvykyehqy divtprvivs gqpitekpve apqefdedpi
      361 evalptdeve gsgqdsgscr gdatfqcrrs gkticdemrc dgsrdcpdae deegcevcne
      421 lqfkcdnkcl plnkrcdnry dcedqtdeag cqryeveesq pqpqpqpqpe pepepepepe
      481 pepepepepe epitdneqpe qnsecratef rcnngdcidi rkrcdhisdc segedeneec
      541 pttrlkpsdc gpdqffcddl cynrsircng hmdcsdgsde iacsslsvlp cpqhqcpsgr
      601 cysesercdr hrhcedgsde anccyadqfr cnngdciaes ahcdgnidcs dqsdeldcgg
      661 dsqclpnqfr ckngqcvsst arcnkrsdcl dgsdeqncan epnnsgrgtn qlklktypdn
      721 qiikesrevi frcrdegpnr akvkwsrpgg rplppgftdr ngrleipnir vedagayvce
      781 avgyanyipg qhvtvnlnve rlnereirpd sacteyqatc mngecidksg icdghpdcsd
      841 gsdehscslg lkcqpnqfmc snskcvdrtw rcdgendcgd nsdetscdpe psdapcryde
      901 fqcrsghcip ksfqcdymnd ctdgtdeigc svpspmtlpa psivvmeyev leltcvgtgv
      961 ptptivwrln wghvpekces ksyggtgtlr cpnmrpqdsg ayscefintr gtfypktnsi
     1021 vtvtpvrsdv ckagffnmla rkseecvqcf cfgvstncds anlftyaiqp pilshrvvsv
     1081 elspfrqivi neaspgqdll tlhhgvqfra snvhyngret pflalpaeym gnqlksyggn
     1141 lryevryngn grpvsgpdvi itgnsftlth rvrthpgqnn rvtipflpgg wtkpdgrkgt
     1201 redimmilan vdnilirlgy ldstarevdl inialdsags adqglgsasl vekctcppgy
     1261 vgdscescas gyvrqargpw lghcvpftpe pcpagtygnp rlgvpcqecp cphagannfa
     1321 sgcqqspdgd vicrcnegya gkrcehcaqg yqgnplapgg vcrkipdssc nvdgtyniys
     1381 ngtcqckdsv igeqcdtcap ksfhlnsfty tgciecfcsg vgldcdsssw yrnqvtstfg
     1441 rtrvnhgfal isdymrntpv tvpvsmstqa nalsfvgsae qagntlywsl paaflgnklt
     1501 syggklsytl sysplpsgim srnsapdvvi ksgedlrlih yrksqvspsv antyaveike
     1561 sawqrgdelv pnrehvlmal snitaiyika tyttstkeas lrsvtldtat atnlgtarav
     1621 eveqcrcpeg ylglsceqca pgytrdpeag iylglcrpce cnghskycns etgecescsd
     1681 ntegfncdrc aagyvgdatr gtsydcqydd ggyptsrppa pgnqtaeclv ncqqegtagc
     1741 rgyqceckrn vagdrcdqcr pgtyglsaqn pdgckecycs gltnqcrsas lyrqlipvdf
     1801 istpplitde fgdimdrdnl vpdvprnvyt ykhtsytpky wslrgsvlgn qllsyggrle
     1861 yslivesvgr dhrgkdvvli gnglkliwsr pdghdneqey hvrlhedeqw tvedrgsarq
     1921 atradfmtvl sdlqhilila tpkvptvsts isnvilessi ttrapgatha sdielcqcps
     1981 gytgtscesc aplhyrdasg rcsqcpcdas ntescglvsg gnvecqcrpr wrgdrcreie
     2041 tneptpepdt stddpvrtqi ivsiarpeit ilpvggsltl sctgrmrwtn spvfvnwykq
     2101 gshlpegvev qggnlqlfnl qisdsgiyic qavsnetghs ftdhvsitvs qedqrspahi
     2161 vdlpndvtfe eyvsneivce vegnppptvt wtrvdghada qstrtdnnrl vfdsprksde
     2221 gryrcqaens lsreekyvvv yvrsnppqpp pqqdrlyitp qevngvagds fqlscqftsa
     2281 aslrydwshd grslsasspr nvvvrgnvle vrdanvrdsg tytcvafdlr trrnftesar
     2341 vyieqpnepg ilgdkphilt leqniiivqg edlsitceas gtpypsikwt kvqenlaenv
     2401 risgnvltiy ggrsenrgly sciaenshgs dqsstsidie prerpsltid tatqkvsvgs
     2461 qaslycaaqg ipeptvewvr tdgqplsprh kvqapgyvvi ddivlddsgt yecrasniag
     2521 qvsglatinv qeptlvriep drqhhivtqg delslscvgs gvptpsvfws fegrdvdrmg
     2581 vpegavfaqp frtntadvki frvskenegi yvchgsndag edqqyirvev qprrgdvgag
     2641 gddngdvdtr qppnrpqiqp nplsnerltt elgnnvtlic nvdnvntewe rvdgtplphn
     2701 aytvrntlvi vfvepqnlgq yrcngigrdg rveahvvrel vllplpritf ypnipltvel
     2761 gqnldvycqv envrpedvhw ttdnnrplps svriegnvlr fasitqaaag eyrcsatnqy
     2821 gsrsknarvv vkqpsgfqpv phsqvqqrqv gdsiqlrcrl ttqygdevrg niqfnwyred
     2881 gsplprgvrp dsqvlqlvkl qpedegryic nsydlgsgqq lppvsidlqv lrtttqypfn
     2941 rfkggvslkd tpcmvlyica avpaapqnpi ylppvapprs perilepqls lsvqssnlpa
     3001 gdgttvecfs sddsypdvvw eradgaplse nvqqvgnnlv isnvastdag nyvckcktde
     3061 gdlyttsykl eveeqphelk sskivyakvg gnadlqcgad edrqpsyrws rqygqlqagr
     3121 slqneklsld rvqandagty vcsaqysdge tvdfpnilvv tgaipqfrqe prsymsfptl
     3181 snssfkfnfe ltfrpenadg lllfngqtrg sgdyialslk dryaefrfdf ggkpllvrae
     3241 eplaldewht vrvsrfkrdg yiqvddqhpv afptsqhqqi pqleliedly iggvpnwefl
     3301 paeavgqqsg fvgcisrltl qgrtvelire akfkegitdc rpcaqgpcqn kgvclesqte
     3361 qaytcvcqpg wtgrdcaieg tqctagvcgs grcentendm eclcplnrag drcqyneiln
     3421 eqslnfksns faaygtpkvt kvnitlsvrp asledsvily taestlpsgd ylalvlrggh
     3481 aellintaar ldpvvvrsae plplnrwtri eirrrlgegi lkvgdgperk akapgsdril
     3541 slkthlfvgg vdrstvkinr dvnitkgfdg cisklynsqk svnllgdird aanvqncgea
     3601 neidddeyem pvalpspkva enerqlmapc asdpcenggs cseqedmaic scpfgfsgkh
     3661 cqnhlqlsfn asfrgdgyve lnrshfqpal eqtyshigiv fttnkpngll fwwgqeagee
     3721 ytgqdfiaaa vvdgyveysm rldgeeavir nsdirvdnge rhiviakrde ntamleldqi
     3781 ldtgdtrpti nkamklpgnv fvggapdvaa ftgfrykdnf ngcivvvege tvgqinlssa
     3841 aingvnanvc pandeplggt eppvv
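
The feature locations and the ORIGIN block above follow the usual flat-file conventions, so they can be read with a few lines of standard-library Python. The sketch below is illustrative only: the function names (parse_location, coding_length_aa, origin_to_sequence) are hypothetical helpers, not part of any NCBI tool, and it assumes that the /coded_by span includes the stop codon and that locations are limited to the simple forms seen in this record (a..b, single positions, order(...), and the partial markers < and >).

```python
import re


def parse_location(loc: str) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) pairs for a simple location.

    Handles 'a..b', single positions, and 'order(...)' / 'join(...)'.
    The partial markers '<' and '>' are dropped (an assumption made for
    simplicity; the partial status itself is not preserved).
    """
    inner = re.sub(r"^(?:order|join)\((.*)\)$", r"\1", loc.strip())
    spans = []
    for part in inner.split(","):
        part = part.strip().replace("<", "").replace(">", "")
        start, _, end = part.partition("..")
        spans.append((int(start), int(end or start)))
    return spans


def coding_length_aa(coded_by: str) -> int:
    """Protein length implied by a value such as 'ACC:start..end'.

    Assumes the nucleotide span is contiguous and ends with a stop codon.
    """
    _, _, loc = coded_by.rpartition(":")
    (start, end), = parse_location(loc)
    nucleotides = end - start + 1
    if nucleotides % 3 != 0:
        raise ValueError("coding span is not a multiple of three")
    return nucleotides // 3 - 1  # subtract the stop codon


def origin_to_sequence(lines: list[str]) -> str:
    """Strip position counters and spaces from ORIGIN lines."""
    return "".join(re.sub(r"[\d\s]", "", line) for line in lines).upper()


if __name__ == "__main__":
    print(parse_location("order(385,394,405..406)"))
    print(parse_location("84..>169"))
    print(coding_length_aa("XM_044396082.1:209..11806"))  # 3865
    print(origin_to_sequence(["     3841 aingvnanvc pandeplggt eppvv"]))
```

Under these assumptions, the /coded_by span 209..11806 covers 11598 nucleotides, or 3866 codons including the stop, which agrees with the 3865 aa given in the LOCUS line.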