LOCUS       XP_044252026            3723 aa            linear   INV 09-DEC-2024
DEFINITION  basement membrane-specific heparan sulfate proteoglycan core
            protein isoform X39 [Drosophila takahashii].
ACCESSION   XP_044252026
VERSION     XP_044252026.1
DBLINK      BioProject: PRJNA1194641
DBSOURCE    REFSEQ: accession XM_044396091.1
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..3723
                     /organism="Drosophila takahashii"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     Protein         1..3723
                     /product="basement membrane-specific heparan sulfate
                     proteoglycan core protein isoform X39"
                     /calculated_mol_wt=411125
     Region          84..>169
                     /region_name="CCDC66"
                     /note="Coiled-coil domain-containing protein 66;
                     pfam15236"
                     /db_xref="CDD:434558"
     Region          348..384
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(354,363,374..375)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(367,370,374,380..381)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            377..381
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          387..420
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(392,399,410..411)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(403,406,410,416..417)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            413..417
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          473..505
                     /region_name="LDLa"
                     /note="Low-density lipoprotein receptor domain class A;
                     smart00192"
                     /db_xref="CDD:197566"
     Site            order(479,487,498..499)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(491,494,498,504..505)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            501..505
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          511..545
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(516,524,535..536)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(528,531,535,541..542)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            538..542
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          551..585
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(556,564,575..576)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(568,571,575,581..582)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            578..582
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          600..668
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          700..734
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(705,713,724..725)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(717,720,724,730..731)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            727..731
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          740..774
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(745,753,764..765)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(757,760,764,770..771)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            767..771
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          783..817
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(788,796,807..808)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(800,803,807,813..814)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            810..814
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          835..910
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          838..842
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:143220"
     Region          851..855
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:143220"
     Region          874..878
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:143220"
     Region          888..893
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:143220"
     Region          902..905
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:143220"
     Region          1009..1139
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          1196..1249
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Site            order(1196,1198,1210,1220,1222,1231)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          1256..1300
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     smart00180"
                     /db_xref="CDD:214543"
     Region          1374..1510
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          <1511..1537
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Region          1545..1594
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Site            order(1546,1548,1555,1562,1565,1574)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          <1631..1661
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Region          1727..1861
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          1954..2036
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          1965..1969
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          1981..1985
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2000..2004
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2014..2019
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2029..2032
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          <2063..2116
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          2147..2230
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2158..2162
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409544"
     Region          2171..2175
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409544"
     Region          2194..2198
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409544"
     Region          2208..2213
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409544"
     Region          2243..2323
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          2260..2264
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409570"
     Region          2273..2277
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409570"
     Region          2292..2296
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409570"
     Region          2306..2311
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409570"
     Region          2319..2322
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409570"
     Region          2341..2417
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          2349..2353
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409543"
     Region          2362..2366
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409543"
     Region          2397..2402
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409543"
     Region          2410..2413
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409543"
     Region          2430..2517
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2440..2444
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409548"
     Region          2453..2457
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409548"
     Region          2497..2502
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409548"
     Region          2642..2718
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2651..2655
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409541"
     Region          2664..2668
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409541"
     Region          2684..2688
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409541"
     Region          2698..2703
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409541"
     Region          2730..>2797
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2741..2745
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409390"
     Region          2757..2764
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409390"
     Region          2780..2784
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409390"
     Region          2845..2913
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          2943..3009
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2951..2955
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409390"
     Region          2964..2968
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409390"
     Region          2983..2987
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409390"
     Region          2997..3002
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409390"
     Region          3031..3178
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3201..3233
                     /region_name="EGF"
                     /note="EGF-like domain; pfam00008"
                     /db_xref="CDD:394967"
     Region          3281..3434
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3488..3520
                     /region_name="EGF_CA"
                     /note="Calcium-binding EGF-like domain, present in a large
                     number of membrane-bound and extracellular (mostly animal)
                     proteins. Many of these proteins require calcium for their
                     biological function and calcium-binding sites have been
                     found to be located at the...; cd00054"
                     /db_xref="CDD:238011"
     Region          3529..3682
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     CDS             1..3723
                     /gene="trol"
                     /coded_by="XM_044396091.1:209..11380"
                     /db_xref="GeneID:108055788"
ORIGIN      
        1 mgspgsqapa iaigisngrr ghtgslllrl llvafvlnac hvpptnakqi tkskvddqdf
       61 iladtqslqg svdldddeaf lippdseekl dkkftpegnw wsqglhrvrr slnsffgsdd
      121 ddqekererq rqrqnnrdaa nrqkelrrqq kesqnrekql rlerqerqrl akrnnhvvfn
      181 rvtdprkras dlydeneasg lneeetttyr tyfvvnepys ddykerdslq fqnlqkllde
      241 dlrnffnrnf ennddeeqei hstlervdqt kdhfkirvql rvdlpssind fgskleqqln
      301 vykridrlga stdgvftfte ssefdedpie valptdeveg sgqdsgscrg datfqcrrsg
      361 kticdemrcd gsrdcpdaed eegcevcnel qfkcdnkclp lnkrcdnryd cedqtdeagc
      421 qryeveesqp qpqpqpqpep epepepepep epepepepee pitdneqpeq nsecratefr
      481 cnngdcidir krcdhisdcs egedeneecr cyadqfrcnn gdciaesahc dgnidcsdqs
      541 deldcggdsq clpnqfrckn gqcvsstarc nkrsdcldgs deqncanepn nsgrgtnqlk
      601 lktypdnqii kesrevifrc rdegpnrakv kwsrpggrpl ppgftdrngr leipnirved
      661 agayvceavg yanyipgqhv tvnlnverln ereirpdsac teyqatcmng ecidksgicd
      721 ghpdcsdgsd ehscslglkc qpnqfmcsns kcvdrtwrcd gendcgdnsd etscdpepsd
      781 apcrydefqc rsghcipksf qcdymndctd gtdeigcsvp spmtlpapsi vvmeyevlel
      841 tcvgtgvptp tivwrlnwgh vpekcesksy ggtgtlrcpn mrpqdsgays cefintrgtf
      901 ypktnsivtv tpvrsdvcka gffnmlarks eecvqcfcfg vstncdsanl ftyaiqppil
      961 shrvvsvels pfrqivinea spgqdlltlh hgvqfrasnv hyngretpfl alpaeymgnq
     1021 lksyggnlry evryngngrp vsgpdviitg nsftlthrvr thpgqnnrvt ipflpggwtk
     1081 pdgrkgtred immilanvdn ilirlgylds tarevdlini aldsagsadq glgsaslvek
     1141 ctcppgyvgd scescasgyv rqargpwlgh cvpftpepcp agtygnprlg vpcqecpcph
     1201 agannfasgc qqspdgdvic rcnegyagkr cehcaqgyqg nplapggvcr kipdsscnvd
     1261 gtyniysngt cqckdsvige qcdtcapksf hlnsftytgc iecfcsgvgl dcdssswyrn
     1321 qvtstfgrtr vnhgfalisd ymrntpvtvp vsmstqanal sfvgsaeqag ntlywslpaa
     1381 flgnkltsyg gklsytlsys plpsgimsrn sapdvviksg edlrlihyrk sqvspsvant
     1441 yaveikesaw qrgdelvpnr ehvlmalsni taiyikatyt tstkeaslrs vtldtatatn
     1501 lgtaraveve qcrcpegylg lsceqcapgy trdpeagiyl glcrpcecng hskycnsetg
     1561 ecescsdnte gfncdrcaag yvgdatrgts ydcqyddggy ptsrppapgn qtaeclvncq
     1621 qegtagcrgy qceckrnvag drcdqcrpgt yglsaqnpdg ckecycsglt nqcrsaslyr
     1681 qlipvdfist pplitdefgd imdrdnlvpd vprnvytykh tsytpkywsl rgsvlgnqll
     1741 syggrleysl ivesvgrdhr gkdvvligng lkliwsrpdg hdneqeyhvr lhedeqwtve
     1801 drgsarqatr adfmtvlsdl qhililatpk vptvstsisn vilessittr apgathasdi
     1861 elcqcpsgyt gtscescapl hyrdasgrcs qcpcdasnte scglvsggnv ecqcrprwrg
     1921 drcreietne ptpepdtstd dpvrtqiivs iarpeitilp vggsltlsct grmrwtnspv
     1981 fvnwykqgsh lpegvevqgg nlqlfnlqis dsgiyicqav snetghsftd hvsitvsqed
     2041 qrspahivdl pndvtfeeyv sneivceveg nppptvtwtr vdghadaqst rtdnnrlvfd
     2101 sprksdegry rcqaenslsr eekyvvvyvr snppqpppqq drlyitpqev ngvagdsfql
     2161 scqftsaasl rydwshdgrs lsassprnvv vrgnvlevrd anvrdsgtyt cvafdlrtrr
     2221 nftesarvyi eqpnepgilg dkphiltleq niiivqgedl sitceasgtp ypsikwtkvq
     2281 enlaenvris gnvltiyggr senrglysci aenshgsdqs stsidiepre rpsltidtat
     2341 qkvsvgsqas lycaaqgipe ptvewvrtdg qplsprhkvq apgyvviddi vlddsgtyec
     2401 rasniagqvs glatinvqep tlvriepdrq hhivtqgdel slscvgsgvp tpsvfwsfeg
     2461 rdvdrmgvpe gavfaqpfrt ntadvkifrv skenegiyvc hgsndagedq qyirvevqpr
     2521 rgdvgaggdd ngdvdtrqpp nrpqiqpnpl snerlttelg nnvtlicnvd nvntewervd
     2581 gtplphnayt vrntlvivfv epqnlgqyrc ngigrdgrve ahvvrelvll plpritfypn
     2641 ipltvelgqn ldvycqvenv rpedvhwttd nnrplpssvr iegnvlrfas itqaaageyr
     2701 csatnqygsr sknarvvvkq psgfqpvphs qvqqrqvgds iqlrcrlttq ygdevrgniq
     2761 fnwyredgsp lprgvrpdsq vlqlvklqpe degryicnsy dlgsgqqlpp vsidlqvltv
     2821 paapqnpiyl ppvapprspe rilepqlsls vqssnlpagd gttvecfssd dsypdvvwer
     2881 adgaplsenv qqvgnnlvis nvastdagny vckcktdegd lyttsyklev eeqphelkss
     2941 kivyakvggn adlqcgaded rqpsyrwsrq ygqlqagrsl qneklsldrv qandagtyvc
     3001 saqysdgetv dfpnilvvtg aipqfrqepr symsfptlsn ssfkfnfelt frpenadgll
     3061 lfngqtrgsg dyialslkdr yaefrfdfgg kpllvraeep laldewhtvr vsrfkrdgyi
     3121 qvddqhpvaf ptsqhqqipq leliedlyig gvpnweflpa eavgqqsgfv gcisrltlqg
     3181 rtvelireak fkegitdcrp caqgpcqnkg vclesqteqa ytcvcqpgwt grdcaiegtq
     3241 ctagvcgsgr centendmec lcplnragdr cqyneilneq slnfksnsfa aygtpkvtkv
     3301 nitlsvrpas ledsvilyta estlpsgdyl alvlrgghae llintaarld pvvvrsaepl
     3361 plnrwtriei rrrlgegilk vgdgperkak apgsdrilsl kthlfvggvd rstvkinrdv
     3421 nitkgfdgci sklynsqksv nllgdirdaa nvqncgeane idddeyempv alpspkvaen
     3481 erqlmapcas dpcenggscs eqedmaicsc pfgfsgkhcq nhlqlsfnas frgdgyveln
     3541 rshfqpaleq tyshigivft tnkpngllfw wgqeageeyt gqdfiaaavv dgyveysmrl
     3601 dgeeavirns dirvdngerh iviakrdent amleldqild tgdtrptink amklpgnvfv
     3661 ggapdvaaft gfrykdnfng civvvegetv gqinlssaai ngvnanvcpa ndeplggtep
     3721 pvv