Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

basement membrane-specific heparan sulfate proteoglycan core


LOCUS       XP_044252024            3805 aa            linear   INV 09-DEC-2024
            protein isoform X38 [Drosophila takahashii].
ACCESSION   XP_044252024
VERSION     XP_044252024.1
DBLINK      BioProject: PRJNA1194641
DBSOURCE    REFSEQ: accession XM_044396089.1
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..3805
                     /organism="Drosophila takahashii"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     Protein         1..3805
                     /product="basement membrane-specific heparan sulfate
                     proteoglycan core protein isoform X38"
                     /calculated_mol_wt=420200
     Region          84..>169
                     /region_name="CCDC66"
                     /note="Coiled-coil domain-containing protein 66;
                     pfam15236"
                     /db_xref="CDD:434558"
     Region          354..384
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(354,363,374..375)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(367,370,374,380..381)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            377..381
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          387..420
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(392,399,410..411)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(403,406,410,416..417)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            413..417
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          473..505
                     /region_name="LDLa"
                     /note="Low-density lipoprotein receptor domain class A;
                     smart00192"
                     /db_xref="CDD:197566"
     Site            order(479,487,498..499)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(491,494,498,504..505)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            501..505
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          519..552
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(524,531,542..543)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(535,538,542,548..549)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            545..549
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          562..592
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(563,571,582..583)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(575,578,582,588..589)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            585..589
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          593..627
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(598,606,617..618)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(610,613,617,623..624)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            620..624
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          633..667
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(638,646,657..658)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(650,653,657,663..664)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            660..664
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          682..750
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          782..816
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(787,795,806..807)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(799,802,806,812..813)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            809..813
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          822..856
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(827,835,846..847)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(839,842,846,852..853)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            849..853
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          865..899
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(870,878,889..890)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(882,885,889,895..896)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            892..896
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          917..992
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          920..924
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:143220"
     Region          933..937
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:143220"
     Region          956..960
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:143220"
     Region          970..975
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:143220"
     Region          984..987
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:143220"
     Region          1091..1221
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          1278..1331
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Site            order(1278,1280,1292,1302,1304,1313)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          1338..1375
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domai;
                     smart00180"
                     /db_xref="CDD:214543"
     Region          1456..1592
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          <1593..1619
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Region          1627..1676
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Site            order(1628,1630,1637,1644,1647,1656)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          <1713..1743
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Region          1809..1943
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          2036..2118
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2047..2051
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2063..2067
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2082..2086
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2096..2101
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2111..2114
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          <2145..2198
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          2229..2312
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2240..2244
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409544"
     Region          2253..2257
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409544"
     Region          2276..2280
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409544"
     Region          2290..2295
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409544"
     Region          2325..2405
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          2342..2346
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409570"
     Region          2355..2359
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409570"
     Region          2374..2378
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409570"
     Region          2388..2393
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409570"
     Region          2401..2404
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409570"
     Region          2423..2499
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          2431..2435
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409543"
     Region          2444..2448
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409543"
     Region          2479..2484
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409543"
     Region          2492..2495
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409543"
     Region          2512..2599
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2522..2526
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409548"
     Region          2535..2539
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409548"
     Region          2579..2584
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409548"
     Region          2724..2800
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2733..2737
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409541"
     Region          2746..2750
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409541"
     Region          2766..2770
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409541"
     Region          2780..2785
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409541"
     Region          2812..>2879
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2823..2827
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409390"
     Region          2839..2846
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409390"
     Region          2862..2866
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409390"
     Region          2927..2995
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          3025..3091
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3033..3037
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409390"
     Region          3046..3050
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409390"
     Region          3065..3069
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409390"
     Region          3079..3084
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409390"
     Region          3113..3260
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3283..3315
                     /region_name="EGF"
                     /note="EGF-like domain; pfam00008"
                     /db_xref="CDD:394967"
     Region          3363..3516
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3570..3602
                     /region_name="EGF_CA"
                     /note="Calcium-binding EGF-like domain, present in a large
                     number of membrane-bound and extracellular (mostly animal)
                     proteins. Many of these proteins require calcium for their
                     biological function and calcium-binding sites have been
                     found to be located at the...; cd00054"
                     /db_xref="CDD:238011"
     Region          3611..3764
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     CDS             1..3805
                     /gene="trol"
                     /coded_by="XM_044396089.1:209..11626"
                     /db_xref="GeneID:108055788"
ORIGIN      
        1 mgspgsqapa iaigisngrr ghtgslllrl llvafvlnac hvpptnakqi tkskvddqdf
       61 iladtqslqg svdldddeaf lippdseekl dkkftpegnw wsqglhrvrr slnsffgsdd
      121 ddqekererq rqrqnnrdaa nrqkelrrqq kesqnrekql rlerqerqrl akrnnhvvfn
      181 rvtdprkras dlydeneasg lneeetttyr tyfvvnepys ddykerdslq fqnlqkllde
      241 dlrnffnrnf ennddeeqei hstlervdqt kdhfkirvql rvdlpssind fgskleqqln
      301 vykridrlga stdgvftfte ssefdedpie valptdeveg sgqdsgscrg datfqcrrsg
      361 kticdemrcd gsrdcpdaed eegcevcnel qfkcdnkclp lnkrcdnryd cedqtdeagc
      421 qryeveesqp qpqpqpqpep epepepepep epepepepee pitdneqpeq nsecratefr
      481 cnngdcidir krcdhisdcs egedeneecp ttrlkpsdcg pdqffcddlc ynrsircngh
      541 mdcsdgsdei acsslsvlpc pqhqcpsgrc ysesercdrh rhcedgsdea nccyadqfrc
      601 nngdciaesa hcdgnidcsd qsdeldcggd sqclpnqfrc kngqcvssta rcnkrsdcld
      661 gsdeqncane pnnsgrgtnq lklktypdnq iikesrevif rcrdegpnra kvkwsrpggr
      721 plppgftdrn grleipnirv edagayvcea vgyanyipgq hvtvnlnver lnereirpds
      781 acteyqatcm ngecidksgi cdghpdcsdg sdehscslgl kcqpnqfmcs nskcvdrtwr
      841 cdgendcgdn sdetscdpep sdapcrydef qcrsghcipk sfqcdymndc tdgtdeigcs
      901 vpspmtlpap sivvmeyevl eltcvgtgvp tptivwrlnw ghvpekcesk syggtgtlrc
      961 pnmrpqdsga yscefintrg tfypktnsiv tvtpvrsdvc kagffnmlar kseecvqcfc
     1021 fgvstncdsa nlftyaiqpp ilshrvvsve lspfrqivin easpgqdllt lhhgvqfras
     1081 nvhyngretp flalpaeymg nqlksyggnl ryevryngng rpvsgpdvii tgnsftlthr
     1141 vrthpgqnnr vtipflpggw tkpdgrkgtr edimmilanv dnilirlgyl dstarevdli
     1201 nialdsagsa dqglgsaslv ekctcppgyv gdscescasg yvrqargpwl ghcvpftpep
     1261 cpagtygnpr lgvpcqecpc phagannfas gcqqspdgdv icrcnegyag krcehcaqgy
     1321 qgnplapggv crkipdsscn vdgtyniysn gtcqckdsvi geqcdtcapk sfhlnsftyt
     1381 gciecfcsgv gldcdssswy rnqvtstfgr trvnhgfali sdymrntpvt vpvsmstqan
     1441 alsfvgsaeq agntlywslp aaflgnklts yggklsytls ysplpsgims rnsapdvvik
     1501 sgedlrlihy rksqvspsva ntyaveikes awqrgdelvp nrehvlmals nitaiyikat
     1561 yttstkeasl rsvtldtata tnlgtarave veqcrcpegy lglsceqcap gytrdpeagi
     1621 ylglcrpcec nghskycnse tgecescsdn tegfncdrca agyvgdatrg tsydcqyddg
     1681 gyptsrppap gnqtaeclvn cqqegtagcr gyqceckrnv agdrcdqcrp gtyglsaqnp
     1741 dgckecycsg ltnqcrsasl yrqlipvdfi stpplitdef gdimdrdnlv pdvprnvyty
     1801 khtsytpkyw slrgsvlgnq llsyggrley slivesvgrd hrgkdvvlig nglkliwsrp
     1861 dghdneqeyh vrlhedeqwt vedrgsarqa tradfmtvls dlqhililat pkvptvstsi
     1921 snvilessit trapgathas dielcqcpsg ytgtscesca plhyrdasgr csqcpcdasn
     1981 tescglvsgg nvecqcrprw rgdrcreiet neptpepdts tddpvrtqii vsiarpeiti
     2041 lpvggsltls ctgrmrwtns pvfvnwykqg shlpegvevq ggnlqlfnlq isdsgiyicq
     2101 avsnetghsf tdhvsitvsq edqrspahiv dlpndvtfee yvsneivcev egnppptvtw
     2161 trvdghadaq strtdnnrlv fdsprksdeg ryrcqaensl sreekyvvvy vrsnppqppp
     2221 qqdrlyitpq evngvagdsf qlscqftsaa slrydwshdg rslsassprn vvvrgnvlev
     2281 rdanvrdsgt ytcvafdlrt rrnftesarv yieqpnepgi lgdkphiltl eqniiivqge
     2341 dlsitceasg tpypsikwtk vqenlaenvr isgnvltiyg grsenrglys ciaenshgsd
     2401 qsstsidiep rerpsltidt atqkvsvgsq aslycaaqgi peptvewvrt dgqplsprhk
     2461 vqapgyvvid divlddsgty ecrasniagq vsglatinvq eptlvriepd rqhhivtqgd
     2521 elslscvgsg vptpsvfwsf egrdvdrmgv pegavfaqpf rtntadvkif rvskenegiy
     2581 vchgsndage dqqyirvevq prrgdvgagg ddngdvdtrq ppnrpqiqpn plsnerltte
     2641 lgnnvtlicn vdnvntewer vdgtplphna ytvrntlviv fvepqnlgqy rcngigrdgr
     2701 veahvvrelv llplpritfy pnipltvelg qnldvycqve nvrpedvhwt tdnnrplpss
     2761 vriegnvlrf asitqaaage yrcsatnqyg srsknarvvv kqpsgfqpvp hsqvqqrqvg
     2821 dsiqlrcrlt tqygdevrgn iqfnwyredg splprgvrpd sqvlqlvklq pedegryicn
     2881 sydlgsgqql ppvsidlqvl tvpaapqnpi ylppvapprs perilepqls lsvqssnlpa
     2941 gdgttvecfs sddsypdvvw eradgaplse nvqqvgnnlv isnvastdag nyvckcktde
     3001 gdlyttsykl eveeqphelk sskivyakvg gnadlqcgad edrqpsyrws rqygqlqagr
     3061 slqneklsld rvqandagty vcsaqysdge tvdfpnilvv tgaipqfrqe prsymsfptl
     3121 snssfkfnfe ltfrpenadg lllfngqtrg sgdyialslk dryaefrfdf ggkpllvrae
     3181 eplaldewht vrvsrfkrdg yiqvddqhpv afptsqhqqi pqleliedly iggvpnwefl
     3241 paeavgqqsg fvgcisrltl qgrtvelire akfkegitdc rpcaqgpcqn kgvclesqte
     3301 qaytcvcqpg wtgrdcaieg tqctagvcgs grcentendm eclcplnrag drcqyneiln
     3361 eqslnfksns faaygtpkvt kvnitlsvrp asledsvily taestlpsgd ylalvlrggh
     3421 aellintaar ldpvvvrsae plplnrwtri eirrrlgegi lkvgdgperk akapgsdril
     3481 slkthlfvgg vdrstvkinr dvnitkgfdg cisklynsqk svnllgdird aanvqncgea
     3541 neidddeyem pvalpspkva enerqlmapc asdpcenggs cseqedmaic scpfgfsgkh
     3601 cqnhlqlsfn asfrgdgyve lnrshfqpal eqtyshigiv fttnkpngll fwwgqeagee
     3661 ytgqdfiaaa vvdgyveysm rldgeeavir nsdirvdnge rhiviakrde ntamleldqi
     3721 ldtgdtrpti nkamklpgnv fvggapdvaa ftgfrykdnf ngcivvvege tvgqinlssa
     3781 aingvnanvc pandeplggt eppvv