LOCUS       XM_041591952            7574 bp    mRNA    linear   INV 14-MAY-2021
DEFINITION  PREDICTED: Drosophila obscura protein sidekick (LOC111066347),
            transcript variant X2, mRNA.
ACCESSION   XM_041591952
VERSION     XM_041591952.1
DBLINK      BioProject: PRJNA728747
KEYWORDS    RefSeq.
SOURCE      Drosophila obscura
  ORGANISM  Drosophila obscura
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_024542752.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Drosophila obscura Annotation
                                           Release 101
            Annotation Version          :: 101
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 8.6
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..7574
                     /organism="Drosophila obscura"
                     /mol_type="mRNA"
                     /isolate="BZ-5 IFL"
                     /db_xref="taxon:7282"
                     /chromosome="Unknown"
                     /sex="male"
                     /tissue_type="whole fly"
                     /dev_stage="Adult fly"
                     /geo_loc_name="Serbia: Babin Zub"
                     /collection_date="2017"
     gene            1..7574
                     /gene="LOC111066347"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Gnomon. Supporting evidence
                     includes similarity to: 6 Proteins, and 100% coverage of
                     the annotated genomic feature by RNAseq alignments,
                     including 3 samples with support for all annotated
                     introns"
                     /db_xref="GeneID:111066347"
     CDS             351..7295
                     /gene="LOC111066347"
                     /codon_start=1
                     /product="protein sidekick isoform X2"
                     /protein_id="XP_041447886.1"
                     /db_xref="GeneID:111066347"
                     /translation="MKRDQRRSSASSLRRRRRWCVDVNEKGTRMWLKISLSQPLEASL
                     FVLAALLLLNADSCSCYADANPQQQQQLVQQQQQQLQAPRFTTHPSSSGSIVSEGSTK
                     ILQCHALGYPQPTYRWLKDGKSVGEFSSSQFYRFHSTRREDAGSYQCIAKNDAGSIFS
                     EKSDVVVAYMGIFENVTEGRLTVVSGHPAIFDMPAIESVPTPSVLWQSADGSLNYDIK
                     YAFTQANQLIILSVDENDRRGYRARAINTQLGKEEISAFVHLNVSGDPYIEVAPEIIV
                     RPQDVKVKTGTGVLELQCIANARPLHELETIWLKDGLAVDTTGVRHTLNDPWNRTLAL
                     LQANSSHSGEYTCQVRLRSGGYPTVTASARVQILEPPVFFTPMRAETFGEFGGQVQLP
                     CDVVGEPTPQVEWFRNAESVEANVQSGRYSLGEDNTLIIKKLILDDSAMFQCLARNEA
                     GENSASTWLRVKTETETEAANSRIKRLAQPRILRVRASHGGSGTVAGSVTGSGSGSGS
                     TSNSHQQGRRKQFRFASAPVFEQPPQNVTALDGKDATISCRAIGSPNPNVTWIYNETQ
                     LVEISSRVQILESGDLLISNIRATDAGLYICVRANEAGSVKGEALLSVLVRTQIIQPP
                     VDTIVLLGLTATLQCKVSSDPSVPYNIDWYREGQMAPISNSQRIGVQADGQLEIQAVR
                     ASDVGSYSCVVTSPGGNETRSARLSVIELPFPPSNVRVERLPEPQQRSINVSWTPGFD
                     GNSPISKFIIQRREVSELGPVPDPLLNWITELSNVSANQRWMLLENLKAATVYQFRVS
                     AVNRVGEGSPSEPSNVVELPQEEFPLYRAPSGPPVGFVGSARSMSEIITQWQPPQEEH
                     RNGQILGYILRYRLFGYNNVPWSYQNITNEAQRNFLIQELITWKDYIVQIAAFNNMGV
                     GVYTEGAKIKTKEGVPEAPPTNVRVKALNSTAAQITWKPPNPQQINGINQGYKIQAWQ
                     RRQLDGEERDMERRMMTVPPSLIDPLAEQTTVLGGLDKFAKFNVTVLCFTDPGDGVAS
                     QLVPVETLDDVPDEITALHFDDVSDRSVKVLWAPPRFANGILTGYTVRYQVKDRPETM
                     KFFNLTADDNELTVNQLQATTHYWFEVCAWTRVGSGPPKTATIQSGVEPVLPHAPTTL
                     ALSNIEAFSVVLQFTPGFDGNSSITKWKVEAQTARNMTWFTLCEISDPDAETLTVTGL
                     MPFTQYRLRLSATNVVGSSRPSDPTKDFQTIQAKPMHAPFNVTVRAMSALQLRVRWIP
                     LQQMEWFGNPRGYNVTYRQMERTGKPSKHPPRSVMIEDHTANSHVLEGLEEWTLYEVI
                     MNACNDVGCSLDSGLAMERTREAVPSYGPLHVEANATSSTTVVVRWGEIPPHHRNGQI
                     DGYKVYYAATERGMQVLYKTIPNNSSFTTTLTELQKFVVYHVQVLAYTRLGNGALSTP
                     PIRVQTFEDTPGSPSNVSFPDVTFSMARIIWDVPMDPNGEILAYQVTYTLNGSANLNY
                     SREFPPSDRTFRATGLMPERYYSFSVTAQTRLGWGKTASVLVYTTNNRDRPQAPSGPQ
                     VSRSQIQAHQITFNWTPGRDGFAPLRYYTVEMRENEGRWQPLPERVDPTLSSYTALGL
                     RPYTTYQFRIQATNDLGPSAFSRESIVVRTLPAAPAVGVGGLKVVPITTTSVRVQWGA
                     LETGMWNGDAATGGYRILYQQLSDFAPALQSTPKTDVMGINENSVVLSDLQQDRNYEI
                     VVLPFNSQGPGPATPPTAVYVGEAVPTGEPRGVDATAISSTEVRLSWKPPKQSSQNGE
                     ILGYKIFYLVTWSPQALEPGRKFEEEIEVVSATATSHSLVFLDKFTEYRIQLLAFNPA
                     GDGPRSAPVTAKTMPGVPSAPLNLRFSDITMQSLEVTWDPPKLLNGEIVGYLVTYETT
                     EENEKFSKQVKQKVSNTTLRVQNLEEEVTYTFTVRAQTNDYGPAVSANVTTGPQDGSP
                     VAPRDLTLTKTLSSVEVHWVNGPSGRGPILGYLIEAKKRENGEPSFISNRPPYLRLDD
                     SRWTKIEQSRKGTMKEFTVSYHILMPSTAYLFRVIAYNKYGISFPVYSKDSILTPSKL
                     HLEYGYLQHKPFYRQTWFMVSLAATSIVIIVMVIAVLCVKSKSYKYKQEAQKTLEESM
                     AMSIDERQELALELYRSRHGVGTGTLNSVGTLRSGTLGTLGRKSTNRHQPVSVHLGKS
                     PPRPSPASVAYHSDEESLKCYDENPDDSSVTEKPSEVSSSEASQHSESENESVRSDPH
                     SFVNHYANVNDSLRQSWKKTKPVRNYSSYTDSEPEGSAVMSLNGGQIIVNNMARSRAP
                     LPGFSSFV"
     misc_feature    597..809
                     /gene="LOC111066347"
                     /note="Immunoglobulin domain; Region: Ig_3; pfam13927"
                     /db_xref="CDD:464046"
     misc_feature    1164..1451
                     /gene="LOC111066347"
                     /note="Immunoglobulin domain; Region: Ig; cl11960"
                     /db_xref="CDD:472250"
     misc_feature    1218..1232
                     /gene="LOC111066347"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:409353"
     misc_feature    1263..1277
                     /gene="LOC111066347"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:409353"
     misc_feature    1338..1352
                     /gene="LOC111066347"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:409353"
     misc_feature    1380..1397
                     /gene="LOC111066347"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:409353"
     misc_feature    1422..1433
                     /gene="LOC111066347"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:409353"
     misc_feature    1461..1736
                     /gene="LOC111066347"
                     /note="Immunoglobulin domain; Region: Ig; cl11960"
                     /db_xref="CDD:472250"
     misc_feature    1515..1529
                     /gene="LOC111066347"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:409543"
     misc_feature    1554..1568
                     /gene="LOC111066347"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:409543"
     misc_feature    1632..1643
                     /gene="LOC111066347"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:409543"
     misc_feature    1671..1688
                     /gene="LOC111066347"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:409543"
     misc_feature    1710..1721
                     /gene="LOC111066347"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:409543"
     misc_feature    1929..2192
                     /gene="LOC111066347"
                     /note="Immunoglobulin I-set domain; Region: I-set;
                     pfam07679"
                     /db_xref="CDD:400151"
     misc_feature    1980..1994
                     /gene="LOC111066347"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:409562"
     misc_feature    2019..2033
                     /gene="LOC111066347"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:409562"
     misc_feature    2091..2102
                     /gene="LOC111066347"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:409562"
     misc_feature    2130..2147
                     /gene="LOC111066347"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:409562"
     misc_feature    2169..2180
                     /gene="LOC111066347"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:409562"
     misc_feature    2205..2474
                     /gene="LOC111066347"
                     /note="Immunoglobulin I-set domain; Region: I-set;
                     pfam07679"
                     /db_xref="CDD:400151"
     misc_feature    2253..2267
                     /gene="LOC111066347"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:409544"
     misc_feature    2298..2312
                     /gene="LOC111066347"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:409544"
     misc_feature    2370..2384
                     /gene="LOC111066347"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:409544"
     misc_feature    <2412..>3203
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain [General function
                     prediction only]; Region: FN3; COG3401"
                     /db_xref="CDD:442628"
     misc_feature    2412..2429
                     /gene="LOC111066347"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:409544"
     misc_feature    2451..2462
                     /gene="LOC111066347"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:409544"
     misc_feature    2484..2795
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(2484..2486,2715..2717,2760..2762)
                     /gene="LOC111066347"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(2763..2768,2772..2777)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    3147..3461
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(3426..3431,3435..3440)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(3474..3476,3669..3671,3714..3716)
                     /gene="LOC111066347"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    3477..3728
                     /gene="LOC111066347"
                     /note="Fibronectin type III domain; Region: fn3;
                     pfam00041"
                     /db_xref="CDD:394996"
     misc_feature    3768..4037
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(3768..3770,3966..3968,4011..4013)
                     /gene="LOC111066347"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(4014..4019,4023..4028)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    4074..4340
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    4395..4652
                     /gene="LOC111066347"
                     /note="Fibronectin type III domain; Region: fn3;
                     pfam00041"
                     /db_xref="CDD:394996"
     misc_feature    4614..6269
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain [General function
                     prediction only]; Region: FN3; COG3401"
                     /db_xref="CDD:442628"
     misc_feature    order(4638..4643,4647..4652)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    6222..6542
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(6222..6224,6474..6476,6519..6521)
                     /gene="LOC111066347"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(6522..6527,6531..6536)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
ORIGIN      
        1 cctgcgcgca taacggttgt tcttgccgac tacgtcgcat cgtcgtcgtc attttcgtcg
       61 tcgctcgtag ttcgtggctc tcggtcgctc caacttgctg cggcgcgtgt ttcgaaacac
      121 agcagaccag acgaggaaga agtctagaga agcaaatggt tcaataaaga aacaacattg
      181 aaagcaggcg caagagaaag tcaacatgaa ttaagaaaaa cgcaagaaaa ttcaaaaaca
      241 attaaaattt aatcaaacaa aacaacaaaa aaaccaaaaa caaaattcag aagcaggcga
      301 aaagaatcaa cgatatcgaa gaaaaacagc cgcaagagag agagccgaaa atgaagagag
      361 accagcggcg atcttcagcg tcgtcgctgc gtcgccgtcg tcgttggtgc gtcgacgtca
      421 acgaaaaagg aacacgaatg tggctcaaaa tttcgctgtc gcagccgctg gaagcgtcgc
      481 tgtttgtgct ggcagcgctg ctgctgctca atgcggacag ctgctcatgt tacgcggatg
      541 ccaatccgca gcaacaacag cagctggtcc agcagcagca gcaacaactt caggcgccac
      601 gttttaccac acacccatcg tcatcgggct cgattgtgag cgagggcagc accaagatcc
      661 tacagtgcca tgctttgggt tatccacagc cgacatatcg ttggctgaag gacggcaagt
      721 ccgtgggcga gttctcatcg agtcagttct atcggttcca cagcacacgg cgcgaggatg
      781 cgggcagcta tcagtgcatt gccaagaacg atgccggatc catattcagc gagaagagcg
      841 acgttgtagt ggcctacatg ggcatctttg agaacgtcac cgagggacgc ctaactgttg
      901 tgagcggaca tccggccatc ttcgatatgc cggccattga gtcggtgcca acgccatcgg
      961 tgctgtggca gtcggcggac gggtcgctca actacgacat caagtacgcc ttcacccagg
     1021 ccaatcagct gattatactg agcgtggacg agaacgatcg gaggggctac cgggcgcggg
     1081 cgatcaacac gcagctgggc aaggaggaga tcagcgcgtt cgttcatctg aatgtcagtg
     1141 gcgatccgta catagaggtg gcacccgaga taattgtacg gccgcaggat gtcaaggtca
     1201 agaccggcac tggcgtcctc gagctgcagt gcatcgccaa tgcgcgaccc ctgcacgaac
     1261 tggagacgat ttggctgaag gacggcctcg ccgtggacac gaccggcgtg cggcacaccc
     1321 tcaacgatcc ctggaaccgc accctggccc tcctgcaagc caacagctcg cactccggcg
     1381 agtacacctg tcaggtgcgc ctgcgcagcg gtggctatcc aacggtcacc gcctcagccc
     1441 gcgtccaaat tctcgagccg cccgtcttct tcacgcccat gcgagcggaa acctttggtg
     1501 aatttggcgg ccaggtgcag ctgccctgcg atgtggtggg cgagcccacg ccccaagttg
     1561 aatggttccg gaatgcggag tctgtcgagg cgaatgtgca aagcggaaga tactcactgg
     1621 gagaggataa tacgctgata attaagaaac taatactgga tgattcggcc atgtttcagt
     1681 gcctggcccg aaatgaggcc ggcgagaact cagccagcac ctggctgcgc gtcaaaacag
     1741 aaacagaaac agaagcagcc aacagccgca tcaagcgctt ggcccagcca cgcatcttaa
     1801 gagtaagagc ttcgcatggt ggttcgggaa cggtcgcggg atcggtcacg ggatcgggat
     1861 caggatctgg ttccacctcc aactcccatc agcagggtag acgcaagcag tttcgctttg
     1921 cctcagcgcc ggtctttgag cagccgcccc agaatgtgac cgccctggat ggcaaggatg
     1981 cgacgatctc ctgtcgggcc attggctcgc ccaatcccaa tgttacctgg atctacaatg
     2041 aaacccaact ggttgagata tccagtcgcg ttcagatact cgaatcgggt gatttactca
     2101 tctcgaatat ccgtgccacg gacgcgggac tctacatctg tgtgcgggcc aacgaggcgg
     2161 gcagcgtcaa gggcgaggcc ttgctaagcg tgttagtgcg gacacaaatc atacagccgc
     2221 cagtggacac catcgtgctg ctgggcctga ccgcgacact gcagtgcaag gtgtccagcg
     2281 acccgagcgt gccctacaac atcgactggt accgggaggg ccaaatggcg cccatcagca
     2341 actcgcagcg gattggagtg caggcggacg ggcagctgga gatccaggcg gtgcgggcca
     2401 gcgatgtggg cagctattcg tgcgtggtta catcgccggg cggcaatgag acacggtcgg
     2461 cccgtctcag tgtcatcgag ctgcccttcc cgcccagcaa cgtgcgggtg gagcgtctgc
     2521 cagagccgca gcagcgcagc atcaatgtgt cctggacgcc cggattcgat ggcaacagtc
     2581 caatctccaa atttattatc cagcgacgtg aggtctctga attgggtcca gttccagatc
     2641 cccttctcaa ttggatcacc gaactgagca acgtatcggc caatcagcgg tggatgctgc
     2701 tggagaacct caaggcggcc accgtctatc agtttcgtgt cagtgccgtc aatcgggtcg
     2761 gcgagggctc cccctcggag cccagcaatg ttgttgagct gccccaagaa gagtttccgc
     2821 tttatcgagc tccttcggga ccgcctgtgg gctttgtggg ctcggcacgg tccatgtccg
     2881 agatcattac gcagtggcag ccgccgcagg aggagcatcg caacggacag atcctgggct
     2941 acattctgcg ctatcgcctg ttcgggtaca acaatgtgcc gtggtcctac cagaacatca
     3001 ccaacgaggc gcagcgcaac tttctgatcc aggagctgat cacgtggaag gactacatcg
     3061 tgcagattgc ggccttcaac aacatgggcg tgggcgtcta cacggagggg gccaagatca
     3121 agaccaagga gggtgtgccc gaggcaccgc ccaccaacgt cagggtgaag gccctcaact
     3181 cgacggcggc gcagatcacg tggaagccgc cgaatccgca gcagatcaac ggcatcaacc
     3241 agggctacaa gatccaggca tggcagcgac ggcagctcga tggggaggag cgggacatgg
     3301 agcggcgcat gatgacggtg ccgcccagcc tgatcgatcc actggccgag cagacgacgg
     3361 tgctcggtgg cctggacaag ttcgccaagt tcaatgtgac cgtactctgc ttcaccgatc
     3421 ccggtgacgg tgtggccagc cagctggtgc cggtggagac tttggacgac gtgcccgacg
     3481 agataacggc cctgcacttt gacgatgtct ccgatcggtc cgtcaaagtg ctgtgggcgc
     3541 cgccgcgctt cgccaacggc atcctcaccg gctacacggt gcgctaccag gtcaaggatc
     3601 gccccgagac gatgaagttc ttcaacctga ccgccgacga caacgagctg acggtgaacc
     3661 agctgcaggc gacgacccac tactggttcg aggtgtgcgc ctggacgcgg gtgggcagcg
     3721 ggccgcccaa gacggcgacg atccaatcgg gcgtggagcc ggtgctgccg catgcgccca
     3781 ccacactggc cctgtccaac atcgaagcgt tttcggtggt gctgcagttc acgcccggct
     3841 tcgacggcaa ctcgagcatc accaagtgga aggtggaggc gcagacggcc cgcaacatga
     3901 cctggttcac gctctgtgaa atcagcgatc ccgatgcgga gaccctcacc gtgaccggcc
     3961 tgatgccctt cacccagtac cggctgcggc tgagcgccac caatgtggtg ggcagctccc
     4021 ggccctcgga ccccaccaag gactttcaaa ccattcaggc caagccgatg cacgccccct
     4081 tcaatgtgac ggtacgcgca atgagcgccc tgcagctgcg cgtccgctgg ataccgctgc
     4141 agcagatgga gtggttcggc aatccgcgcg gctacaatgt cacctaccgg caaatggagc
     4201 gcaccggcaa gccctccaag cacccgcccc gctccgtgat gatcgaggat cacacggcca
     4261 actcgcatgt gctcgagggg ctcgaggagt ggaccctcta cgaagtgatc atgaacgcct
     4321 gcaacgatgt gggctgctcg ctggacagcg gcctggccat ggagcgcacc agggaagcgg
     4381 tgcccagcta cggcccgctg catgtggagg cgaacgccac ctcctcgacg acggtggtgg
     4441 tgcgctgggg cgagataccg ccccaccatc gcaacggcca gatcgatggc tacaaggtgt
     4501 actacgcggc caccgagcgc ggcatgcagg tgctctacaa gacgataccc aacaacagct
     4561 ccttcaccac caccctcacc gagctgcaga agtttgtggt gtaccacgtc caggtgctgg
     4621 cctacacgcg gctcggcaac ggcgccctca gcaccccgcc catccgggtg cagacgttcg
     4681 aggacacgcc cggatcaccg tccaatgtga gcttcccgga cgtcaccttc tcgatggcgc
     4741 gcatcatctg ggacgtgccg atggacccca atggcgagat actcgcctac caggtcacct
     4801 acacgctcaa cggaagcgcc aatctgaact acagccgcga gtttccgccc tcggatcgca
     4861 ccttccgggc caccggcctg atgcccgagc gctactacag cttcagcgtg acggcccaga
     4921 cacgcctcgg ctggggcaaa acggcctcgg tgctggtgta cacgaccaac aacagggacc
     4981 gtccgcaggc accgtccggg ccgcaggtgt cgcgcagcca gatccaggcc catcagatca
     5041 ccttcaactg gacgccgggc cgcgacgggt tcgccccgct gcgatactac acggtcgaga
     5101 tgcgggagaa cgagggccgc tggcagccgc tgcccgagcg cgtcgatccc acactcagct
     5161 cgtacacggc cctgggtctg cgtccgtaca ccacctacca gttccgcatt caggcgacca
     5221 acgatctggg cccgtcggcg ttcagccgag agagcattgt ggtgcgcacc ctgcccgccg
     5281 ccccagcggt gggtgtgggg ggactgaagg tggtgcccat aacgaccacc tcggtgcggg
     5341 tgcagtgggg ggcgctggag acgggcatgt ggaacggcga cgcggccacc gggggatacc
     5401 gcatactgta ccagcagctg tcggacttcg caccggccct gcagtcgacc ccgaagacgg
     5461 atgtgatggg catcaatgag aacagcgtgg tgctgtccga tctgcagcag gaccgcaact
     5521 acgagatcgt ggtgctgcca ttcaattcgc agggaccggg cccggccaca ccgccgaccg
     5581 ccgtctatgt gggcgaggcg gtgcccactg gagagccgcg gggcgtggat gccacggcca
     5641 tttccagcac ggaggtgcgc ctgagctgga agccaccgaa gcagagcagc cagaacggag
     5701 agatactcgg ctacaagata ttctatttgg tgacgtggtc gccgcaggcc ctcgagccgg
     5761 gccgcaaatt cgaggaggaa atcgaagtgg tctcggccac ggccacatcg cacagcctgg
     5821 tctttctcga taagttcacc gagtaccgca tccagttgct ggccttcaat ccggccggag
     5881 acgggccgag gtccgccccc gtcactgcga agacgatgcc gggcgtgccc agtgccccgc
     5941 tcaatctgcg cttttcggac atcacaatgc agagcctgga ggtgacctgg gacccgccca
     6001 agctgctcaa cggcgagatt gttggctatc tggtcaccta cgagaccacc gaggagaacg
     6061 aaaagttcag caagcaggtg aagcagaagg tgtccaacac cacgctgcgt gtgcagaatc
     6121 tggaggagga ggtcacctac accttcaccg tgcgcgccca gacgaacgac tatggaccgg
     6181 cggtgagcgc gaatgtgacc acaggccccc aggatggctc cccggtggca ccgcgcgatc
     6241 tcacactcac aaagacactg tccagcgttg aggtacattg ggtcaatgga ccctccggcc
     6301 ggggccccat actgggctac ctcatcgagg ccaagaagcg agaaaatgga gagccctcat
     6361 ttatttctaa tagacctccc tatcttcgct tagacgactc ccgctggact aagattgagc
     6421 agtccagaaa gggtaccatg aaggagttta ccgtcagcta ccacatcctg atgccatcga
     6481 cggcgtattt gttccgggta attgcttaca ataagtatgg catatcgttc cctgtttact
     6541 cgaaggactc gatactgacg ccctcgaagc tgcatctgga gtacggctat ctgcagcaca
     6601 agcccttcta caggcagacc tggttcatgg tctccctggc ggccacctcg atcgtcatca
     6661 ttgtcatggt cattgcggtg ctctgtgtga agagcaagag ctacaagtac aagcaggagg
     6721 cacaaaagac gctggaggag tccatggcca tgtcgattga tgagcgccag gagctggccc
     6781 tggagctgta tcgttcgcgt cacggcgtcg gcaccggcac cctgaacagc gttggaacat
     6841 tgcgcagcgg aactttggga accctcggcc gtaagtccac caaccgacac cagccggtga
     6901 gtgtgcattt gggtaagagt ccaccgcgac cctcgcccgc atcggtggcg taccacagcg
     6961 atgaggagag tctcaagtgc tacgacgaga atcccgacga cagcagtgtt acggaaaagc
     7021 catccgaggt gagcagctcg gaggcatccc agcactcgga gagcgagaac gagagcgtga
     7081 ggagcgatcc gcactcgttc gtcaatcact atgcgaatgt gaatgactcg ctgcggcagt
     7141 cctggaagaa gaccaagccc gtgcgcaact actcgagcta cacagactcc gagccggagg
     7201 gcagtgcagt gatgagtctc aatggtggcc agattattgt caataatatg gccagatcga
     7261 gggcaccact gcccggcttc tcgtcatttg tctgacaatc aaccgaattc taagatctat
     7321 gccgtggtag cagcagcacc gtcatccgcg agacatttgt ctgaattatt ttggaaacga
     7381 taacggaaaa cggaaaaacg gaggctgaag ctgaaaccgg agctggagtt gcagtgggga
     7441 gcgttctaac gagttcgaca cggatgtagc gagtgggcta aactgcctgc ctgcctgcaa
     7501 ctgttctgtc tggctctccc tggatcttcg tagctgtccg gcgaggcgct gctacatgga
     7561 tatttatcgt agtt
//