Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

basement membrane-specific heparan sulfate proteoglycan core


LOCUS       XP_044252021            3972 aa            linear   INV 09-DEC-2024
            protein isoform X33 [Drosophila takahashii].
ACCESSION   XP_044252021
VERSION     XP_044252021.1
DBLINK      BioProject: PRJNA1194641
DBSOURCE    REFSEQ: accession XM_044396086.1
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..3972
                     /organism="Drosophila takahashii"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     Protein         1..3972
                     /product="basement membrane-specific heparan sulfate
                     proteoglycan core protein isoform X33"
                     /calculated_mol_wt=438856
     Region          84..>169
                     /region_name="CCDC66"
                     /note="Coiled-coil domain-containing protein 66;
                     pfam15236"
                     /db_xref="CDD:434558"
     Region          354..384
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(354,363,374..375)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(367,370,374,380..381)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            377..381
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          387..420
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(392,399,410..411)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(403,406,410,416..417)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            413..417
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          473..505
                     /region_name="LDLa"
                     /note="Low-density lipoprotein receptor domain class A;
                     smart00192"
                     /db_xref="CDD:197566"
     Site            order(479,487,498..499)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(491,494,498,504..505)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            501..505
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          513..548
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(518,527,538..539)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(531,534,538,544..545)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            541..545
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          558..592
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(563,571,582..583)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(575,578,582,588..589)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            585..589
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          644..679
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(649,658,669..670)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(662,665,669,675..676)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            672..676
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          760..794
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(765,773,784..785)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(777,780,784,790..791)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            787..791
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          800..834
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(805,813,824..825)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(817,820,824,830..831)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            827..831
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          849..917
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          949..983
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(954,962,973..974)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(966,969,973,979..980)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            976..980
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          989..1023
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(994,1002,1013..1014)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1006,1009,1013,1019..1020)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1016..1020
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1032..1066
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1037,1045,1056..1057)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1049,1052,1056,1062..1063)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1059..1063
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1084..1159
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          1087..1091
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:143220"
     Region          1100..1104
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:143220"
     Region          1123..1127
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:143220"
     Region          1137..1142
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:143220"
     Region          1151..1154
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:143220"
     Region          1258..1388
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          1445..1498
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Site            order(1445,1447,1459,1469,1471,1480)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          1505..1542
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domai;
                     smart00180"
                     /db_xref="CDD:214543"
     Region          1623..1759
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          <1760..1786
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Region          1794..1843
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Site            order(1795,1797,1804,1811,1814,1823)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          <1880..1910
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Region          1976..2110
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          2203..2285
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2214..2218
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2230..2234
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2249..2253
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2263..2268
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2278..2281
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          <2312..2365
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          2396..2479
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2407..2411
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409544"
     Region          2420..2424
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409544"
     Region          2443..2447
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409544"
     Region          2457..2462
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409544"
     Region          2492..2572
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          2509..2513
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409570"
     Region          2522..2526
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409570"
     Region          2541..2545
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409570"
     Region          2555..2560
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409570"
     Region          2568..2571
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409570"
     Region          2590..2666
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          2598..2602
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409543"
     Region          2611..2615
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409543"
     Region          2646..2651
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409543"
     Region          2659..2662
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409543"
     Region          2679..2766
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2689..2693
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409548"
     Region          2702..2706
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409548"
     Region          2746..2751
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409548"
     Region          2891..2967
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2900..2904
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409541"
     Region          2913..2917
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409541"
     Region          2933..2937
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409541"
     Region          2947..2952
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409541"
     Region          2979..>3046
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2990..2994
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409390"
     Region          3006..3013
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409390"
     Region          3029..3033
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409390"
     Region          3094..3162
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          3192..3258
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3200..3204
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409390"
     Region          3213..3217
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409390"
     Region          3232..3236
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409390"
     Region          3246..3251
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409390"
     Region          3280..3427
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3450..3482
                     /region_name="EGF"
                     /note="EGF-like domain; pfam00008"
                     /db_xref="CDD:394967"
     Region          3530..3683
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3737..3769
                     /region_name="EGF_CA"
                     /note="Calcium-binding EGF-like domain, present in a large
                     number of membrane-bound and extracellular (mostly animal)
                     proteins. Many of these proteins require calcium for their
                     biological function and calcium-binding sites have been
                     found to be located at the...; cd00054"
                     /db_xref="CDD:238011"
     Region          3778..3931
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     CDS             1..3972
                     /gene="trol"
                     /coded_by="XM_044396086.1:209..12127"
                     /db_xref="GeneID:108055788"
ORIGIN      
        1 mgspgsqapa iaigisngrr ghtgslllrl llvafvlnac hvpptnakqi tkskvddqdf
       61 iladtqslqg svdldddeaf lippdseekl dkkftpegnw wsqglhrvrr slnsffgsdd
      121 ddqekererq rqrqnnrdaa nrqkelrrqq kesqnrekql rlerqerqrl akrnnhvvfn
      181 rvtdprkras dlydeneasg lneeetttyr tyfvvnepys ddykerdslq fqnlqkllde
      241 dlrnffnrnf ennddeeqei hstlervdqt kdhfkirvql rvdlpssind fgskleqqln
      301 vykridrlga stdgvftfte ssefdedpie valptdeveg sgqdsgscrg datfqcrrsg
      361 kticdemrcd gsrdcpdaed eegcevcnel qfkcdnkclp lnkrcdnryd cedqtdeagc
      421 qryeveesqp qpqpqpqpep epepepepep epepepepee pitdneqpeq nsecratefr
      481 cnngdcidir krcdhisdcs egedeneecp aacsgmeyqc rdgtrcisls qqcdghsdcs
      541 daddeehcdg sgndgedcrf defrcgtgec ipmrqvcdni ydcndysdev scaeeeedsv
      601 gipigrppqr papkhdwlde ldaneyhvyh psnvyelans knpcasnqfr cattnvcipl
      661 hlrcdnfyhc ndmsdekdce qyqrrttttt rrpstsarps ftftfttqgp gllerrnstt
      721 srttagsttr ateapqwpwa trptettttn pittvgvasc yadqfrcnng dciaesahcd
      781 gnidcsdqsd eldcggdsqc lpnqfrckng qcvsstarcn krsdcldgsd eqncanepnn
      841 sgrgtnqlkl ktypdnqiik esrevifrcr degpnrakvk wsrpggrplp pgftdrngrl
      901 eipnirveda gayvceavgy anyipgqhvt vnlnverlne reirpdsact eyqatcmnge
      961 cidksgicdg hpdcsdgsde hscslglkcq pnqfmcsnsk cvdrtwrcdg endcgdnsde
     1021 tscdpepsda pcrydefqcr sghcipksfq cdymndctdg tdeigcsvps pmtlpapsiv
     1081 vmeyevlelt cvgtgvptpt ivwrlnwghv pekcesksyg gtgtlrcpnm rpqdsgaysc
     1141 efintrgtfy pktnsivtvt pvrsdvckag ffnmlarkse ecvqcfcfgv stncdsanlf
     1201 tyaiqppils hrvvsvelsp frqivineas pgqdlltlhh gvqfrasnvh yngretpfla
     1261 lpaeymgnql ksyggnlrye vryngngrpv sgpdviitgn sftlthrvrt hpgqnnrvti
     1321 pflpggwtkp dgrkgtredi mmilanvdni lirlgyldst arevdlinia ldsagsadqg
     1381 lgsaslvekc tcppgyvgds cescasgyvr qargpwlghc vpftpepcpa gtygnprlgv
     1441 pcqecpcpha gannfasgcq qspdgdvicr cnegyagkrc ehcaqgyqgn plapggvcrk
     1501 ipdsscnvdg tyniysngtc qckdsvigeq cdtcapksfh lnsftytgci ecfcsgvgld
     1561 cdssswyrnq vtstfgrtrv nhgfalisdy mrntpvtvpv smstqanals fvgsaeqagn
     1621 tlywslpaaf lgnkltsygg klsytlsysp lpsgimsrns apdvviksge dlrlihyrks
     1681 qvspsvanty aveikesawq rgdelvpnre hvlmalsnit aiyikatytt stkeaslrsv
     1741 tldtatatnl gtaraveveq crcpegylgl sceqcapgyt rdpeagiylg lcrpcecngh
     1801 skycnsetge cescsdnteg fncdrcaagy vgdatrgtsy dcqyddggyp tsrppapgnq
     1861 taeclvncqq egtagcrgyq ceckrnvagd rcdqcrpgty glsaqnpdgc kecycsgltn
     1921 qcrsaslyrq lipvdfistp plitdefgdi mdrdnlvpdv prnvytykht sytpkywslr
     1981 gsvlgnqlls yggrleysli vesvgrdhrg kdvvligngl kliwsrpdgh dneqeyhvrl
     2041 hedeqwtved rgsarqatra dfmtvlsdlq hililatpkv ptvstsisnv ilessittra
     2101 pgathasdie lcqcpsgytg tscescaplh yrdasgrcsq cpcdasntes cglvsggnve
     2161 cqcrprwrgd rcreietnep tpepdtstdd pvrtqiivsi arpeitilpv ggsltlsctg
     2221 rmrwtnspvf vnwykqgshl pegvevqggn lqlfnlqisd sgiyicqavs netghsftdh
     2281 vsitvsqedq rspahivdlp ndvtfeeyvs neivcevegn ppptvtwtrv dghadaqstr
     2341 tdnnrlvfds prksdegryr cqaenslsre ekyvvvyvrs nppqpppqqd rlyitpqevn
     2401 gvagdsfqls cqftsaaslr ydwshdgrsl sassprnvvv rgnvlevrda nvrdsgtytc
     2461 vafdlrtrrn ftesarvyie qpnepgilgd kphiltleqn iiivqgedls itceasgtpy
     2521 psikwtkvqe nlaenvrisg nvltiyggrs enrglyscia enshgsdqss tsidieprer
     2581 psltidtatq kvsvgsqasl ycaaqgipep tvewvrtdgq plsprhkvqa pgyvviddiv
     2641 lddsgtyecr asniagqvsg latinvqept lvriepdrqh hivtqgdels lscvgsgvpt
     2701 psvfwsfegr dvdrmgvpeg avfaqpfrtn tadvkifrvs kenegiyvch gsndagedqq
     2761 yirvevqprr gdvgaggddn gdvdtrqppn rpqiqpnpls nerlttelgn nvtlicnvdn
     2821 vntewervdg tplphnaytv rntlvivfve pqnlgqyrcn gigrdgrvea hvvrelvllp
     2881 lpritfypni pltvelgqnl dvycqvenvr pedvhwttdn nrplpssvri egnvlrfasi
     2941 tqaaageyrc satnqygsrs knarvvvkqp sgfqpvphsq vqqrqvgdsi qlrcrlttqy
     3001 gdevrgniqf nwyredgspl prgvrpdsqv lqlvklqped egryicnsyd lgsgqqlppv
     3061 sidlqvltvp aapqnpiylp pvapprsper ilepqlslsv qssnlpagdg ttvecfssdd
     3121 sypdvvwera dgaplsenvq qvgnnlvisn vastdagnyv ckcktdegdl yttsykleve
     3181 eqphelkssk ivyakvggna dlqcgadedr qpsyrwsrqy gqlqagrslq neklsldrvq
     3241 andagtyvcs aqysdgetvd fpnilvvtga ipqfrqeprs ymsfptlsns sfkfnfeltf
     3301 rpenadglll fngqtrgsgd yialslkdry aefrfdfggk pllvraeepl aldewhtvrv
     3361 srfkrdgyiq vddqhpvafp tsqhqqipql eliedlyigg vpnweflpae avgqqsgfvg
     3421 cisrltlqgr tvelireakf kegitdcrpc aqgpcqnkgv clesqteqay tcvcqpgwtg
     3481 rdcaiegtqc tagvcgsgrc entendmecl cplnragdrc qyneilneqs lnfksnsfaa
     3541 ygtpkvtkvn itlsvrpasl edsvilytae stlpsgdyla lvlrgghael lintaarldp
     3601 vvvrsaeplp lnrwtrieir rrlgegilkv gdgperkaka pgsdrilslk thlfvggvdr
     3661 stvkinrdvn itkgfdgcis klynsqksvn llgdirdaan vqncgeanei dddeyempva
     3721 lpspkvaene rqlmapcasd pcenggscse qedmaicscp fgfsgkhcqn hlqlsfnasf
     3781 rgdgyvelnr shfqpaleqt yshigivftt nkpngllfww gqeageeytg qdfiaaavvd
     3841 gyveysmrld geeavirnsd irvdngerhi viakrdenta mleldqildt gdtrptinka
     3901 mklpgnvfvg gapdvaaftg frykdnfngc ivvvegetvg qinlssaain gvnanvcpan
     3961 deplggtepp vv