LOCUS       XP_070074545            3356 aa            linear   INV 09-DEC-2024
DEFINITION  basement membrane-specific heparan sulfate proteoglycan core
            protein isoform X31 [Drosophila takahashii].
ACCESSION   XP_070074545
VERSION     XP_070074545.1
DBLINK      BioProject: PRJNA1194641
DBSOURCE    REFSEQ: accession XM_070218444.1
KEYWORDS    RefSeq.
SOURCE      Drosophila takahashii
  ORGANISM  Drosophila takahashii
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NC_091683) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI RefSeq
            Annotation Status           :: Full annotation
            Annotation Name             :: GCF_030179915.1-RS_2024_12
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 10.3
            Annotation Method           :: Gnomon; cmsearch; tRNAscan-SE
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            Annotation Date             :: 12/07/2024
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..3356
                     /organism="Drosophila takahashii"
                     /strain="IR98-3 E-12201"
                     /db_xref="taxon:29030"
                     /chromosome="X"
                     /sex="female"
                     /tissue_type="Whole fly"
                     /dev_stage="Adult fly"
                     /collected_by="Originally obtained from EHIME-Fly"
     Protein         1..3356
                     /product="basement membrane-specific heparan sulfate
                     proteoglycan core protein isoform X31"
                     /calculated_mol_wt=368131
     Region          22..52
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(23,31,42..43)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(35,38,42,48..49)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            45..49
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          53..87
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(58,66,77..78)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(70,73,77,83..84)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            80..84
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          93..127
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(98,106,117..118)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(110,113,117,123..124)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            120..124
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          142..210
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          242..276
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(247,255,266..267)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(259,262,266,272..273)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            269..273
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          282..316
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(287,295,306..307)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(299,302,306,312..313)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            309..313
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          325..359
                     /region_name="LDLa"
                     /note="Low Density Lipoprotein Receptor Class A domain, a
                     cysteine-rich repeat that plays a central role in
                     mammalian cholesterol metabolism; the receptor protein
                     binds LDL and transports it into cells by endocytosis; 7
                     successive cysteine-rich repeats of about...; cd00112"
                     /db_xref="CDD:238060"
     Site            order(330,338,349..350)
                     /site_type="active"
                     /note="putative binding surface [active]"
                     /db_xref="CDD:238060"
     Site            order(342,345,349,355..356)
                     /site_type="other"
                     /note="calcium-binding site [ion binding]"
                     /db_xref="CDD:238060"
     Site            352..356
                     /site_type="other"
                     /note="D-X-S-D-E motif"
                     /db_xref="CDD:238060"
     Region          377..452
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          380..384
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          393..397
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          416..420
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          430..435
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          444..447
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          551..681
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          738..791
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Site            order(738,740,752,762,764,773)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          798..835
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     smart00180"
                     /db_xref="CDD:214543"
     Region          916..1052
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          <1053..1079
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Region          1087..1136
                     /region_name="EGF_Lam"
                     /note="Laminin-type epidermal growth factor-like domain;
                     laminins are the major noncollagenous components of
                     basement membranes that mediate cell adhesion, growth
                     migration, and differentiation; the laminin-type epidermal
                     growth factor-like module occurs in...; cd00055"
                     /db_xref="CDD:238012"
     Site            order(1088,1090,1097,1104,1107,1116)
                     /site_type="active"
                     /note="EGF-like motif [active]"
                     /db_xref="CDD:238012"
     Region          <1173..1203
                     /region_name="Laminin_EGF"
                     /note="Laminin EGF domain; pfam00053"
                     /db_xref="CDD:395007"
     Region          1269..1403
                     /region_name="Laminin_B"
                     /note="Laminin B (Domain IV); pfam00052"
                     /db_xref="CDD:459652"
     Region          1558..1640
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          1569..1573
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          1585..1589
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          1604..1608
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          1618..1623
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          1633..1636
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          <1667..1720
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          1751..1834
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          1762..1766
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          1775..1779
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          1798..1802
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          1812..1817
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          1847..1927
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          1864..1868
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409570"
     Region          1877..1881
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409570"
     Region          1896..1900
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409570"
     Region          1910..1915
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409570"
     Region          1923..1926
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409570"
     Region          1945..2021
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          1953..1957
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          1966..1970
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2001..2006
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2014..2017
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          2034..2121
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2044..2048
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2057..2061
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2101..2106
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2246..2322
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2255..2259
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2268..2272
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2288..2292
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2302..2307
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2334..>2401
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2345..2349
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2361..2368
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2384..2388
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2484..2552
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          2508..2512
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2529..2532
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2542..2547
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2576..2642
                     /region_name="IG_like"
                     /note="Immunoglobulin like; smart00410"
                     /db_xref="CDD:214653"
     Region          2584..2588
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          2597..2601
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          2616..2620
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          2630..2635
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          2664..2811
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          2834..2866
                     /region_name="EGF"
                     /note="EGF-like domain; pfam00008"
                     /db_xref="CDD:394967"
     Region          2914..3067
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     Region          3121..3153
                     /region_name="EGF_CA"
                     /note="Calcium-binding EGF-like domain, present in a large
                     number of membrane-bound and extracellular (mostly animal)
                     proteins. Many of these proteins require calcium for their
                     biological function and calcium-binding sites have been
                     found to be located at the...; cd00054"
                     /db_xref="CDD:238011"
     Region          3162..3315
                     /region_name="LamG"
                     /note="Laminin G domain; Laminin G-like domains are
                     usually Ca++ mediated receptors that can have binding
                     sites for steroids, beta1 integrins, heparin, sulfatides,
                     fibulin-1, and alpha-dystroglycans. Proteins that contain
                     LamG domains serve a variety of...; cd00110"
                     /db_xref="CDD:238058"
     CDS             1..3356
                     /gene="trol"
                     /coded_by="XM_070218444.1:346..10416"
                     /db_xref="GeneID:108055788"
ORIGIN      
        1 mdcsdgsdei acsslsvlpc pqhqcpsgrc ysesercdrh rhcedgsdea nccyadqfrc
       61 nngdciaesa hcdgnidcsd qsdeldcggd sqclpnqfrc kngqcvssta rcnkrsdcld
      121 gsdeqncane pnnsgrgtnq lklktypdnq iikesrevif rcrdegpnra kvkwsrpggr
      181 plppgftdrn grleipnirv edagayvcea vgyanyipgq hvtvnlnver lnereirpds
      241 acteyqatcm ngecidksgi cdghpdcsdg sdehscslgl kcqpnqfmcs nskcvdrtwr
      301 cdgendcgdn sdetscdpep sdapcrydef qcrsghcipk sfqcdymndc tdgtdeigcs
      361 vpspmtlpap sivvmeyevl eltcvgtgvp tptivwrlnw ghvpekcesk syggtgtlrc
      421 pnmrpqdsga yscefintrg tfypktnsiv tvtpvrsdvc kagffnmlar kseecvqcfc
      481 fgvstncdsa nlftyaiqpp ilshrvvsve lspfrqivin easpgqdllt lhhgvqfras
      541 nvhyngretp flalpaeymg nqlksyggnl ryevryngng rpvsgpdvii tgnsftlthr
      601 vrthpgqnnr vtipflpggw tkpdgrkgtr edimmilanv dnilirlgyl dstarevdli
      661 nialdsagsa dqglgsaslv ekctcppgyv gdscescasg yvrqargpwl ghcvpftpep
      721 cpagtygnpr lgvpcqecpc phagannfas gcqqspdgdv icrcnegyag krcehcaqgy
      781 qgnplapggv crkipdsscn vdgtyniysn gtcqckdsvi geqcdtcapk sfhlnsftyt
      841 gciecfcsgv gldcdssswy rnqvtstfgr trvnhgfali sdymrntpvt vpvsmstqan
      901 alsfvgsaeq agntlywslp aaflgnklts yggklsytls ysplpsgims rnsapdvvik
      961 sgedlrlihy rksqvspsva ntyaveikes awqrgdelvp nrehvlmals nitaiyikat
     1021 yttstkeasl rsvtldtata tnlgtarave veqcrcpegy lglsceqcap gytrdpeagi
     1081 ylglcrpcec nghskycnse tgecescsdn tegfncdrca agyvgdatrg tsydcqyddg
     1141 gyptsrppap gnqtaeclvn cqqegtagcr gyqceckrnv agdrcdqcrp gtyglsaqnp
     1201 dgckecycsg ltnqcrsasl yrqlipvdfi stpplitdef gdimdrdnlv pdvprnvyty
     1261 khtsytpkyw slrgsvlgnq llsyggrley slivesvgrd hrgkdvvlig nglkliwsrp
     1321 dghdneqeyh vrlhedeqwt vedrgsarqa tradfmtvls dlqhililat pkvptvstsi
     1381 snvilessit trapgathas dielcqcpsg ytgtscesca plhyrdasgr csqcpcdasn
     1441 tescglvsgg nvecqcrprw rgdrcreidt spiieeppqi cdlsrgfccs gfqfdiapne
     1501 tisfndtlqi ykgnriignm tklrygcpsr etneptpepd tstddpvrtq iivsiarpei
     1561 tilpvggslt lsctgrmrwt nspvfvnwyk qgshlpegve vqggnlqlfn lqisdsgiyi
     1621 cqavsnetgh sftdhvsitv sqedqrspah ivdlpndvtf eeyvsneivc evegnppptv
     1681 twtrvdghad aqstrtdnnr lvfdsprksd egryrcqaen slsreekyvv vyvrsnppqp
     1741 ppqqdrlyit pqevngvagd sfqlscqfts aaslrydwsh dgrslsassp rnvvvrgnvl
     1801 evrdanvrds gtytcvafdl rtrrnftesa rvyieqpnep gilgdkphil tleqniiivq
     1861 gedlsitcea sgtpypsikw tkvqenlaen vrisgnvlti yggrsenrgl ysciaenshg
     1921 sdqsstsidi eprerpslti dtatqkvsvg sqaslycaaq gipeptvewv rtdgqplspr
     1981 hkvqapgyvv iddivlddsg tyecrasnia gqvsglatin vqeptlvrie pdrqhhivtq
     2041 gdelslscvg sgvptpsvfw sfegrdvdrm gvpegavfaq pfrtntadvk ifrvskeneg
     2101 iyvchgsnda gedqqyirve vqprrgdvga ggddngdvdt rqppnrpqiq pnplsnerlt
     2161 telgnnvtli cnvdnvntew ervdgtplph naytvrntlv ivfvepqnlg qyrcngigrd
     2221 grveahvvre lvllplprit fypnipltve lgqnldvycq venvrpedvh wttdnnrplp
     2281 ssvriegnvl rfasitqaaa geyrcsatnq ygsrsknarv vvkqpsgfqp vphsqvqqrq
     2341 vgdsiqlrcr lttqygdevr gniqfnwyre dgsplprgvr pdsqvlqlvk lqpedegryi
     2401 cnsydlgsgq qlppvsidlq vlrtttqypf nrfkggvslk dtpcmvlyic aavpaapqnp
     2461 iylppvappr sperilepql slsvqssnlp agdgttvecf ssddsypdvv weradgapls
     2521 envqqvgnnl visnvastda gnyvckcktd egdlyttsyk leveeqphel ksskivyakv
     2581 ggnadlqcga dedrqpsyrw srqygqlqag rslqneklsl drvqandagt yvcsaqysdg
     2641 etvdfpnilv vtgaipqfrq eprsymsfpt lsnssfkfnf eltfrpenad glllfngqtr
     2701 gsgdyialsl kdryaefrfd fggkpllvra eeplaldewh tvrvsrfkrd gyiqvddqhp
     2761 vafptsqhqq ipqleliedl yiggvpnwef lpaeavgqqs gfvgcisrlt lqgrtvelir
     2821 eakfkegitd crpcaqgpcq nkgvclesqt eqaytcvcqp gwtgrdcaie gtqctagvcg
     2881 sgrcentend meclcplnra gdrcqyneil neqslnfksn sfaaygtpkv tkvnitlsvr
     2941 pasledsvil ytaestlpsg dylalvlrgg haellintaa rldpvvvrsa eplplnrwtr
     3001 ieirrrlgeg ilkvgdgper kakapgsdri lslkthlfvg gvdrstvkin rdvnitkgfd
     3061 gcisklynsq ksvnllgdir daanvqncge aneidddeye mpvalpspkv aenerqlmap
     3121 casdpcengg scseqedmai cscpfgfsgk hcqnhlqlsf nasfrgdgyv elnrshfqpa
     3181 leqtyshigi vfttnkpngl lfwwgqeage eytgqdfiaa avvdgyveys mrldgeeavi
     3241 rnsdirvdng erhiviakrd entamleldq ildtgdtrpt inkamklpgn vfvggapdva
     3301 aftgfrykdn fngcivvveg etvgqinlss aaingvnanv cpandeplgg teppvv