LOCUS       XP_070074546            4159 aa            linear   INV 09-DEC-2024
DEFINITION  basement membrane-specific heparan sulfate proteoglycan core
            protein isoform X32 [Drosophila takahashii].
ACCESSION   XP_070074546
VERSION     XP_070074546.1
DBLINK      BioProject: PRJNA1194641
DBSOURCE    REFSEQ: accession XM_070218445.1
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..4159
                     /organism="Drosophila takahashii"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     Protein         1..4159
                     /product="basement membrane-specific heparan sulfate
                     proteoglycan core protein isoform X32"
                     /calculated_mol_wt=459824
     Region          84..>169
                     /region_name="CCDC66"
                     /note="Coiled-coil domain-containing protein 66;
                     pfam15236"
                     /db_xref="CDD:434558"
     Region          387..420
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(392,399,410..411)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(403,406,410,416..417)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            413..417
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          473..502
                     /region_name="LDLa"
                     /note="Low-density lipoprotein receptor domain class A;
                     smart00192"
                     /db_xref="CDD:197566"
     Site            order(479,487,498..499)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Region          561..596
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(566,575,586..587)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(579,582,586,592..593)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            589..593
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          683..717
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(689,697,708..709)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(701,704,708,714..715)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            711..715
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          722..757
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(727,736,747..748)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(740,743,747,753..754)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            750..754
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          772..806
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(777,785,796..797)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(789,792,796,802..803)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            799..803
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          825..859
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(830,838,849..850)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(842,845,849,855..856)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            852..856
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          873..906
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(878,885,896..897)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(889,892,896,902..903)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            899..903
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          916..946
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(917,925,936..937)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(929,932,936,942..943)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            939..943
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          947..981
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(952,960,971..972)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(964,967,971,977..978)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            974..978
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          987..1021
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(992,1000,1011..1012)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1004,1007,1011,1017..1018)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1014..1018
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1036..1104
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          1136..1170
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1141,1149,1160..1161)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1153,1156,1160,1166..1167)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1163..1167
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1176..1210
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1181,1189,1200..1201)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1193,1196,1200,1206..1207)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1203..1207
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1219..1253
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(1224,1232,1243..1244)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(1236,1239,1243,1249..1250)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            1246..1250
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          1271..1346
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          1274..1278
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          1287..1291
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          1310..1314
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          1324..1329
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          1338..1341
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          1445..1575
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          1632..1685
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Site            order(1632,1634,1646,1656,1658,1667)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          1810..1946
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          <1947..1973
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Region          1981..2030
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Site            order(1982,1984,1991,1998,2001,2010)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          <2067..2097
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Region          2163..2297
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          2390..2472
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2401..2405
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2417..2421
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2436..2440
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2450..2455
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2465..2468
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          <2499..2552
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          2583..2666
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2594..2598
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2607..2611
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2630..2634
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2644..2649
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2679..2759
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          2696..2700
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409570"
     Region          2709..2713
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409570"
     Region          2728..2732
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409570"
     Region          2742..2747
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409570"
     Region          2755..2758
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409570"
     Region          2777..2853
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          2785..2789
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2798..2802
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2819..2823
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2833..2838
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2846..2849
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          2866..2953
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2876..2880
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2889..2893
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2933..2938
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          3078..3154
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3087..3091
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          3100..3104
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          3120..3124
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          3134..3139
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          3166..>3233
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3177..3181
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          3193..3200
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          3216..3220
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          3281..3349
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          3379..3445
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3387..3391
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          3400..3404
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          3419..3423
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          3433..3438
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          3467..3614
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3637..3669
                     /region_name="EGF"
                     /note="EGF-like domain; pfam00008"
                     /db_xref="CDD:394967"
     Region          3717..3870
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3924..3956
                     /region_name="EGF_CA"
                     /note="Calcium-binding EGF-like domain, present in a large
                     number of membrane-bound and extracellular (mostly animal)
                     proteins. Many of these proteins require calcium for their
                     biological function and calcium-binding sites have been
                     found to be located at the...; cd00054"
                     /db_xref="CDD:238011"
     Region          3965..4118
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     CDS             1..4159
                     /gene="trol"
                     /coded_by="XM_070218445.1:209..12688"
                     /db_xref="GeneID:108055788"
ORIGIN      
        1 mgspgsqapa iaigisngrr ghtgslllrl llvafvlnac hvpptnakqi tkskvddqdf
       61 iladtqslqg svdldddeaf lippdseekl dkkftpegnw wsqglhrvrr slnsffgsdd
      121 ddqekererq rqrqnnrdaa nrqkelrrqq kesqnrekql rlerqerqrl akrnnhvvfn
      181 rvtdprkras dlydeneasg lneeetttyr tyfvvnepys ddykerdslq fqnlqkllde
      241 dlrnffnrnf ennddeeqei hstlervdqt kdhfkirvql rvdlpssind fgskleqqln
      301 vykridrlga stdgvftfte ssefdedpie valptdeveg sgqdsgscrg datfqcrrsg
      361 kticdemrcd gsrdcpdaed eegcevcnel qfkcdnkclp lnkrcdnryd cedqtdeagc
      421 qryeveesqp qpqpqpqpep epepepepep epepepepee pitdneqpeq nsecratefr
      481 cnngdcidir krcdhisdcs egedeneecp eeeedsvgip igrppqrpap khdwldelda
      541 neyhvyhpsn vyelansknp casnqfrcat tnvciplhlr cdnfyhcndm sdekdceqyq
      601 rrtttttrrp stsarpsftf tfttqgpgll errnsttsrt tagsttrate apqwpwatrp
      661 tettttnpit tvgvassspq sscleniefa chnrdcipie svcdgtpdcg rsededdalc
      721 kctadkykcq hgggcipktq vcdgkpqcrd rsdesachln grlnktrlgv qcqenqyqcg
      781 dgscisgykr cngindcadd adeynciyty nedyvdpddn plnecdilef ecdyslclpl
      841 ekkcdgyadc edlsdefecq sytattrlkp sdcgpdqffc ddlcynrsir cnghmdcsdg
      901 sdeiacssls vlpcpqhqcp sgrcyseser cdrhrhcedg sdeanccyad qfrcnngdci
      961 aesahcdgni dcsdqsdeld cggdsqclpn qfrckngqcv sstarcnkrs dcldgsdeqn
     1021 canepnnsgr gtnqlklkty pdnqiikesr evifrcrdeg pnrakvkwsr pggrplppgf
     1081 tdrngrleip nirvedagay vceavgyany ipgqhvtvnl nverlnerei rpdsacteyq
     1141 atcmngecid ksgicdghpd csdgsdehsc slglkcqpnq fmcsnskcvd rtwrcdgend
     1201 cgdnsdetsc dpepsdapcr ydefqcrsgh cipksfqcdy mndctdgtde igcsvpspmt
     1261 lpapsivvme yevleltcvg tgvptptivw rlnwghvpek cesksyggtg tlrcpnmrpq
     1321 dsgayscefi ntrgtfypkt nsivtvtpvr sdvckagffn mlarkseecv qcfcfgvstn
     1381 cdsanlftya iqppilshrv vsvelspfrq ivineaspgq dlltlhhgvq frasnvhyng
     1441 retpflalpa eymgnqlksy ggnlryevry ngngrpvsgp dviitgnsft lthrvrthpg
     1501 qnnrvtipfl pggwtkpdgr kgtredimmi lanvdnilir lgyldstare vdlinialds
     1561 agsadqglgs aslvekctcp pgyvgdsces casgyvrqar gpwlghcvpf tpepcpagty
     1621 gnprlgvpcq ecpcphagan nfasgcqqsp dgdvicrcne gyagkrcehc aqgyqgnpla
     1681 pggvcrkipd sscnvdgtyn iysngtcqck dsvigeqcdt capksfhlns ftytgciecf
     1741 csgvgldcds sswyrnqvts tfgrtrvnhg falisdymrn tpvtvpvsms tqanalsfvg
     1801 saeqagntly wslpaaflgn kltsyggkls ytlsysplps gimsrnsapd vviksgedlr
     1861 lihyrksqvs psvantyave ikesawqrgd elvpnrehvl malsnitaiy ikatyttstk
     1921 easlrsvtld tatatnlgta raveveqcrc pegylglsce qcapgytrdp eagiylglcr
     1981 pcecnghsky cnsetgeces csdntegfnc drcaagyvgd atrgtsydcq yddggyptsr
     2041 ppapgnqtae clvncqqegt agcrgyqcec krnvagdrcd qcrpgtygls aqnpdgckec
     2101 ycsgltnqcr saslyrqlip vdfistppli tdefgdimdr dnlvpdvprn vytykhtsyt
     2161 pkywslrgsv lgnqllsygg rleyslives vgrdhrgkdv vlignglkli wsrpdghdne
     2221 qeyhvrlhed eqwtvedrgs arqatradfm tvlsdlqhil ilatpkvptv stsisnvile
     2281 ssittrapga thasdielcq cpsgytgtsc escaplhyrd asgrcsqcpc dasntescgl
     2341 vsggnvecqc rprwrgdrcr eietneptpe pdtstddpvr tqiivsiarp eitilpvggs
     2401 ltlsctgrmr wtnspvfvnw ykqgshlpeg vevqggnlql fnlqisdsgi yicqavsnet
     2461 ghsftdhvsi tvsqedqrsp ahivdlpndv tfeeyvsnei vcevegnppp tvtwtrvdgh
     2521 adaqstrtdn nrlvfdsprk sdegryrcqa enslsreeky vvvyvrsnpp qpppqqdrly
     2581 itpqevngva gdsfqlscqf tsaaslrydw shdgrslsas sprnvvvrgn vlevrdanvr
     2641 dsgtytcvaf dlrtrrnfte sarvyieqpn epgilgdkph iltleqniii vqgedlsitc
     2701 easgtpypsi kwtkvqenla envrisgnvl tiyggrsenr glysciaens hgsdqsstsi
     2761 dieprerpsl tidtatqkvs vgsqaslyca aqgipeptve wvrtdgqpls prhkvqapgy
     2821 vviddivldd sgtyecrasn iagqvsglat invqeptlvr iepdrqhhiv tqgdelslsc
     2881 vgsgvptpsv fwsfegrdvd rmgvpegavf aqpfrtntad vkifrvsken egiyvchgsn
     2941 dagedqqyir vevqprrgdv gaggddngdv dtrqppnrpq iqpnplsner lttelgnnvt
     3001 licnvdnvnt ewervdgtpl phnaytvrnt lvivfvepqn lgqyrcngig rdgrveahvv
     3061 relvllplpr itfypniplt velgqnldvy cqvenvrped vhwttdnnrp lpssvriegn
     3121 vlrfasitqa aageyrcsat nqygsrskna rvvvkqpsgf qpvphsqvqq rqvgdsiqlr
     3181 crlttqygde vrgniqfnwy redgsplprg vrpdsqvlql vklqpedegr yicnsydlgs
     3241 gqqlppvsid lqvltvpaap qnpiylppva pprsperile pqlslsvqss nlpagdgttv
     3301 ecfssddsyp dvvweradga plsenvqqvg nnlvisnvas tdagnyvckc ktdegdlytt
     3361 sykleveeqp helksskivy akvggnadlq cgadedrqps yrwsrqygql qagrslqnek
     3421 lsldrvqand agtyvcsaqy sdgetvdfpn ilvvtgaipq frqeprsyms fptlsnssfk
     3481 fnfeltfrpe nadglllfng qtrgsgdyia lslkdryaef rfdfggkpll vraeeplald
     3541 ewhtvrvsrf krdgyiqvdd qhpvafptsq hqqipqleli edlyiggvpn weflpaeavg
     3601 qqsgfvgcis rltlqgrtve lireakfkeg itdcrpcaqg pcqnkgvcle sqteqaytcv
     3661 cqpgwtgrdc aiegtqctag vcgsgrcent endmeclcpl nragdrcqyn eilneqslnf
     3721 ksnsfaaygt pkvtkvnitl svrpasleds vilytaestl psgdylalvl rgghaellin
     3781 taarldpvvv rsaeplplnr wtrieirrrl gegilkvgdg perkakapgs drilslkthl
     3841 fvggvdrstv kinrdvnitk gfdgciskly nsqksvnllg dirdaanvqn cgeaneiddd
     3901 eyempvalps pkvaenerql mapcasdpce nggscseqed maicscpfgf sgkhcqnhlq
     3961 lsfnasfrgd gyvelnrshf qpaleqtysh igivfttnkp ngllfwwgqe ageeytgqdf
     4021 iaaavvdgyv eysmrldgee avirnsdirv dngerhivia krdentamle ldqildtgdt
     4081 rptinkamkl pgnvfvggap dvaaftgfry kdnfngcivv vegetvgqin lssaaingvn
     4141 anvcpandep lggteppvv
//