Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

basement membrane-specific heparan sulfate proteoglycan core


LOCUS       XP_070074542            3845 aa            linear   INV 09-DEC-2024
            protein isoform X28 [Drosophila takahashii].
ACCESSION   XP_070074542
VERSION     XP_070074542.1
DBLINK      BioProject: PRJNA1194641
DBSOURCE    REFSEQ: accession XM_070218441.1
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..3845
                     /organism="Drosophila takahashii"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     Protein         1..3845
                     /product="basement membrane-specific heparan sulfate
                     proteoglycan core protein isoform X28"
                     /calculated_mol_wt=424910
     Region          84..>169
                     /region_name="CCDC66"
                     /note="Coiled-coil domain-containing protein 66;
                     pfam15236"
                     /db_xref="CDD:434558"
     Region          379..415
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(385,394,405..406)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(398,401,405,411..412)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            408..412
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          418..451
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(423,430,441..442)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(434,437,441,447..448)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            444..448
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          504..536
                     /region_name="LDLa"
                     /note="Low-density lipoprotein receptor domain class A;
                     smart00192"
                     /db_xref="CDD:197566"
     Site            order(510,518,529..530)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(522,525,529,535..536)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            532..536
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          542..576
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(547,555,566..567)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(559,562,566,572..573)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            569..573
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          582..616
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(587,595,606..607)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(599,602,606,612..613)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            609..613
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          631..699
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          731..765
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(736,744,755..756)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(748,751,755,761..762)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            758..762
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          771..805
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(776,784,795..796)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(788,791,795,801..802)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            798..802
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          814..848
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(819,827,838..839)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(831,834,838,844..845)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            841..845
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          866..941
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          869..873
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          882..886
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          905..909
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          919..924
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          933..936
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          1040..1170
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          1227..1280
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Site            order(1227,1229,1241,1251,1253,1262)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          1287..1331
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domai;
                     smart00180"
                     /db_xref="CDD:214543"
     Region          1405..1541
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          <1542..1568
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Region          1576..1625
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Site            order(1577,1579,1586,1593,1596,1605)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          <1662..1692
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Region          1758..1892
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          2047..2129
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2058..2062
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2074..2078
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2093..2097
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2107..2112
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2122..2125
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          <2156..2209
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          2240..2323
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2251..2255
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2264..2268
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2287..2291
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2301..2306
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2336..2416
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          2353..2357
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409570"
     Region          2366..2370
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409570"
     Region          2385..2389
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409570"
     Region          2399..2404
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409570"
     Region          2412..2415
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409570"
     Region          2434..2510
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          2442..2446
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2455..2459
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2490..2495
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2503..2506
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          2523..2610
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2533..2537
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2546..2550
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2590..2595
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2735..2811
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2744..2748
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2757..2761
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2777..2781
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2791..2796
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2823..>2890
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2834..2838
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2850..2857
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2873..2877
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2967..3035
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          3065..3131
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          3073..3077
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          3086..3090
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          3105..3109
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          3119..3124
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          3153..3300
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3323..3355
                     /region_name="EGF"
                     /note="EGF-like domain; pfam00008"
                     /db_xref="CDD:394967"
     Region          3403..3556
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3610..3642
                     /region_name="EGF_CA"
                     /note="Calcium-binding EGF-like domain, present in a large
                     number of membrane-bound and extracellular (mostly animal)
                     proteins. Many of these proteins require calcium for their
                     biological function and calcium-binding sites have been
                     found to be located at the...; cd00054"
                     /db_xref="CDD:238011"
     Region          3651..3804
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     CDS             1..3845
                     /gene="trol"
                     /coded_by="XM_070218441.1:209..11746"
                     /db_xref="GeneID:108055788"
ORIGIN      
        1 mgspgsqapa iaigisngrr ghtgslllrl llvafvlnac hvpptnakqi tkskvddqdf
       61 iladtqslqg svdldddeaf lippdseekl dkkftpegnw wsqglhrvrr slnsffgsdd
      121 ddqekererq rqrqnnrdaa nrqkelrrqq kesqnrekql rlerqerqrl akrnnhvvfn
      181 rvtdprkras dlydeneasg lneeetttyr tyfvvnepys ddykerdslq fqnlqkllde
      241 dlrnffnrnf ennddeeqei hstlervdqt kdhfkirvql rvdlpssind fgskleqqln
      301 vykridrlga stdgvftfte ssvykyehqy divtprvivs gqpitekpve apqefdedpi
      361 evalptdeve gsgqdsgscr gdatfqcrrs gkticdemrc dgsrdcpdae deegcevcne
      421 lqfkcdnkcl plnkrcdnry dcedqtdeag cqryeveesq pqpqpqpqpe pepepepepe
      481 pepepepepe epitdneqpe qnsecratef rcnngdcidi rkrcdhisdc segedeneec
      541 rcyadqfrcn ngdciaesah cdgnidcsdq sdeldcggds qclpnqfrck ngqcvsstar
      601 cnkrsdcldg sdeqncanep nnsgrgtnql klktypdnqi ikesrevifr crdegpnrak
      661 vkwsrpggrp lppgftdrng rleipnirve dagayvceav gyanyipgqh vtvnlnverl
      721 nereirpdsa cteyqatcmn gecidksgic dghpdcsdgs dehscslglk cqpnqfmcsn
      781 skcvdrtwrc dgendcgdns detscdpeps dapcrydefq crsghcipks fqcdymndct
      841 dgtdeigcsv pspmtlpaps ivvmeyevle ltcvgtgvpt ptivwrlnwg hvpekcesks
      901 yggtgtlrcp nmrpqdsgay scefintrgt fypktnsivt vtpvrsdvck agffnmlark
      961 seecvqcfcf gvstncdsan lftyaiqppi lshrvvsvel spfrqivine aspgqdlltl
     1021 hhgvqfrasn vhyngretpf lalpaeymgn qlksyggnlr yevryngngr pvsgpdviit
     1081 gnsftlthrv rthpgqnnrv tipflpggwt kpdgrkgtre dimmilanvd nilirlgyld
     1141 starevdlin ialdsagsad qglgsaslve kctcppgyvg dscescasgy vrqargpwlg
     1201 hcvpftpepc pagtygnprl gvpcqecpcp hagannfasg cqqspdgdvi crcnegyagk
     1261 rcehcaqgyq gnplapggvc rkipdsscnv dgtyniysng tcqckdsvig eqcdtcapks
     1321 fhlnsftytg ciecfcsgvg ldcdssswyr nqvtstfgrt rvnhgfalis dymrntpvtv
     1381 pvsmstqana lsfvgsaeqa gntlywslpa aflgnkltsy ggklsytlsy splpsgimsr
     1441 nsapdvviks gedlrlihyr ksqvspsvan tyaveikesa wqrgdelvpn rehvlmalsn
     1501 itaiyikaty ttstkeaslr svtldtatat nlgtaravev eqcrcpegyl glsceqcapg
     1561 ytrdpeagiy lglcrpcecn ghskycnset gecescsdnt egfncdrcaa gyvgdatrgt
     1621 sydcqyddgg yptsrppapg nqtaeclvnc qqegtagcrg yqceckrnva gdrcdqcrpg
     1681 tyglsaqnpd gckecycsgl tnqcrsasly rqlipvdfis tpplitdefg dimdrdnlvp
     1741 dvprnvytyk htsytpkyws lrgsvlgnql lsyggrleys livesvgrdh rgkdvvlign
     1801 glkliwsrpd ghdneqeyhv rlhedeqwtv edrgsarqat radfmtvlsd lqhililatp
     1861 kvptvstsis nvilessitt rapgathasd ielcqcpsgy tgtscescap lhyrdasgrc
     1921 sqcpcdasnt escglvsggn vecqcrprwr gdrcreidts piieeppqic dlsrgfccsg
     1981 fqfdiapnet isfndtlqiy kgnriignmt klrygcpsre tneptpepdt stddpvrtqi
     2041 ivsiarpeit ilpvggsltl sctgrmrwtn spvfvnwykq gshlpegvev qggnlqlfnl
     2101 qisdsgiyic qavsnetghs ftdhvsitvs qedqrspahi vdlpndvtfe eyvsneivce
     2161 vegnppptvt wtrvdghada qstrtdnnrl vfdsprksde gryrcqaens lsreekyvvv
     2221 yvrsnppqpp pqqdrlyitp qevngvagds fqlscqftsa aslrydwshd grslsasspr
     2281 nvvvrgnvle vrdanvrdsg tytcvafdlr trrnftesar vyieqpnepg ilgdkphilt
     2341 leqniiivqg edlsitceas gtpypsikwt kvqenlaenv risgnvltiy ggrsenrgly
     2401 sciaenshgs dqsstsidie prerpsltid tatqkvsvgs qaslycaaqg ipeptvewvr
     2461 tdgqplsprh kvqapgyvvi ddivlddsgt yecrasniag qvsglatinv qeptlvriep
     2521 drqhhivtqg delslscvgs gvptpsvfws fegrdvdrmg vpegavfaqp frtntadvki
     2581 frvskenegi yvchgsndag edqqyirvev qprrgdvgag gddngdvdtr qppnrpqiqp
     2641 nplsnerltt elgnnvtlic nvdnvntewe rvdgtplphn aytvrntlvi vfvepqnlgq
     2701 yrcngigrdg rveahvvrel vllplpritf ypnipltvel gqnldvycqv envrpedvhw
     2761 ttdnnrplps svriegnvlr fasitqaaag eyrcsatnqy gsrsknarvv vkqpsgfqpv
     2821 phsqvqqrqv gdsiqlrcrl ttqygdevrg niqfnwyred gsplprgvrp dsqvlqlvkl
     2881 qpedegryic nsydlgsgqq lppvsidlqv lrtttqypfn rfkggvslkd tpcmvlyica
     2941 avpaapqnpi ylppvapprs perilepqls lsvqssnlpa gdgttvecfs sddsypdvvw
     3001 eradgaplse nvqqvgnnlv isnvastdag nyvckcktde gdlyttsykl eveeqphelk
     3061 sskivyakvg gnadlqcgad edrqpsyrws rqygqlqagr slqneklsld rvqandagty
     3121 vcsaqysdge tvdfpnilvv tgaipqfrqe prsymsfptl snssfkfnfe ltfrpenadg
     3181 lllfngqtrg sgdyialslk dryaefrfdf ggkpllvrae eplaldewht vrvsrfkrdg
     3241 yiqvddqhpv afptsqhqqi pqleliedly iggvpnwefl paeavgqqsg fvgcisrltl
     3301 qgrtvelire akfkegitdc rpcaqgpcqn kgvclesqte qaytcvcqpg wtgrdcaieg
     3361 tqctagvcgs grcentendm eclcplnrag drcqyneiln eqslnfksns faaygtpkvt
     3421 kvnitlsvrp asledsvily taestlpsgd ylalvlrggh aellintaar ldpvvvrsae
     3481 plplnrwtri eirrrlgegi lkvgdgperk akapgsdril slkthlfvgg vdrstvkinr
     3541 dvnitkgfdg cisklynsqk svnllgdird aanvqncgea neidddeyem pvalpspkva
     3601 enerqlmapc asdpcenggs cseqedmaic scpfgfsgkh cqnhlqlsfn asfrgdgyve
     3661 lnrshfqpal eqtyshigiv fttnkpngll fwwgqeagee ytgqdfiaaa vvdgyveysm
     3721 rldgeeavir nsdirvdnge rhiviakrde ntamleldqi ldtgdtrpti nkamklpgnv
     3781 fvggapdvaa ftgfrykdnf ngcivvvege tvgqinlssa aingvnanvc pandeplggt
     3841 eppvv