LOCUS       XM_041591957            7505 bp    mRNA    linear   INV 14-MAY-2021
DEFINITION  PREDICTED: Drosophila obscura protein sidekick (LOC111066347),
            transcript variant X7, mRNA.
ACCESSION   XM_041591957
VERSION     XM_041591957.1
DBLINK      BioProject: PRJNA728747
KEYWORDS    RefSeq.
SOURCE      Drosophila obscura
  ORGANISM  Drosophila obscura
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_024542752.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Drosophila obscura Annotation
                                           Release 101
            Annotation Version          :: 101
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 8.6
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..7505
                     /organism="Drosophila obscura"
                     /mol_type="mRNA"
                     /isolate="BZ-5 IFL"
                     /db_xref="taxon:7282"
                     /chromosome="Unknown"
                     /sex="male"
                     /tissue_type="whole fly"
                     /dev_stage="Adult fly"
                     /geo_loc_name="Serbia: Babin Zub"
                     /collection_date="2017"
     gene            1..7505
                     /gene="LOC111066347"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Gnomon. Supporting evidence
                     includes similarity to: 8 Proteins, and 100% coverage of
                     the annotated genomic feature by RNAseq alignments,
                     including 7 samples with support for all annotated
                     introns"
                     /db_xref="GeneID:111066347"
     CDS             351..7226
                     /gene="LOC111066347"
                     /codon_start=1
                     /product="protein sidekick isoform X7"
                     /protein_id="XP_041447891.1"
                     /db_xref="GeneID:111066347"
                     /translation="MKRDQRRSSASSLRRRRRWCVDVNEKGTRMWLKISLSQPLEASL
                     FVLAALLLLNADSCSCYADANPQQQQQLVQQQQQQLQAPRFTTHPSSSGSIVSEGSTK
                     ILQCHALGYPQPTYRWLKDGKSVGEFSSSQFYRFHSTRREDAGSYQCIAKNDAGSIFS
                     EKSDVVVAYMGIFENVTEGRLTVVSGHPAIFDMPAIESVPTPSVLWQSADGSLNYDIK
                     YAFTQANQLIILSVDENDRRGYRARAINTQLGKEEISAFVHLNVSGDPYIEVAPEIIV
                     RPQDVKVKTGTGVLELQCIANARPLHELETIWLKDGLAVDTTGVRHTLNDPWNRTLAL
                     LQANSSHSGEYTCQVRLRSGGYPTVTASARVQILEPPVFFTPMRAETFGEFGGQVQLP
                     CDVVGEPTPQVEWFRNAESVEANVQSGRYSLGEDNTLIIKKLILDDSAMFQCLARNEA
                     GENSASTWLRVKTETETEAANSRIKRLAQPRILRVRASHGGSGTVAGSVTGSGSGSGS
                     TSNSHQQGRRKQFRFASAPVFEQPPQNVTALDGKDATISCRAIGSPNPNVTWIYNETQ
                     LVEISSRVQILESGDLLISNIRATDAGLYICVRANEAGSVKGEALLSVLVRTQIIQPP
                     VDTIVLLGLTATLQCKVSSDPSVPYNIDWYREGQMAPISNSQRIGVQADGQLEIQAVR
                     ASDVGSYSCVVTSPGGNETRSARLSVIELPFPPSNVRVERLPEPQQRSINVSWTPGFD
                     GNSPISKFIIQRREVSELGPVPDPLLNWITELSNVSANQRWMLLENLKAATVYQFRVS
                     AVNRVGEGSPSEPSNVVELPQEAPSGPPVGFVGSARSMSEIITQWQPPQEEHRNGQIL
                     GYILRYRLFGYNNVPWSYQNITNEAQRNFLIQELITWKDYIVQIAAFNNMGVGVYTEG
                     AKIKTKEGVPEAPPTNVRVKALNSTAAQITWKPPNPQQINGINQGYKIQAWQRRQLDG
                     EERDMERRMMTVPPSLIDPLAEQTTVLGGLDKFAKFNVTVLCFTDPGDGVASQLVPVE
                     TLDDVPDEITALHFDDVSDRSVKVLWAPPRFANGILTGYTVRYQVKDRPETMKFFNLT
                     ADDNELTVNQLQATTHYWFEVCAWTRVGSGPPKTATIQSGVEPVLPHAPTTLALSNIE
                     AFSVVLQFTPGFDGNSSITKWKVEAQTARNMTWFTLCEISDPDAETLTVTGLMPFTQY
                     RLRLSATNVVGSSRPSDPTKDFQTIQAKPMHAPFNVTVRAMSALQLRVRWIPLQQMEW
                     FGNPRGYNVTYRQMERTGKPSKHPPRSVMIEDHTANSHVLEGLEEWTLYEVIMNACND
                     VGCSLDSGLAMERTREAVPSYGPLHVEANATSSTTVVVRWGEIPPHHRNGQIDGYKVY
                     YAATERGMQVLYKTIPNNSSFTTTLTELQKFVVYHVQVLAYTRLGNGALSTPPIRVQT
                     FEDTPGSPSNVSFPDVTFSMARIIWDVPMDPNGEILAYQVTYTLNGSANLNYSREFPP
                     SDRTFRATGLMPERYYSFSVTAQTRLGWGKTASVLVYTTNNRDRPQAPSGPQVSRSQI
                     QAHQITFNWTPGRDGFAPLRYYTVEMRENEGRWQPLPERVDPTLSSYTALGLRPYTTY
                     QFRIQATNDLGPSAFSRESIVVRTLPAAPAVGVGGLKVVPITTTSVRVQWGALETGMW
                     NGDAATGGYRILYQQLSDFAPALQSTPKTDVMGINENSVVLSDLQQDRNYEIVVLPFN
                     SQGPGPATPPTAVYVGEAVPTGEPRGVDATAISSTEVRLSWKPPKQSSQNGEILGYKI
                     FYLVTWSPQALEPGRKFEEEIEVVSATATSHSLVFLDKFTEYRIQLLAFNPAGDGPRS
                     APVTAKTMPGVPSAPLNLRFSDITMQSLEVTWDPPKLLNGEIVGYLVTYETTEENEKF
                     SKQVKQKVSNTTLRVQNLEEEVTYTFTVRAQTNDYGPAVSANVTTGPQDGSPVAPRDL
                     TLTKTLSSVEVHWVNGPSGRGPILGYLIEAKKRDDSRWTKIEQSRKGTMKEFTVSYHI
                     LMPSTAYLFRVIAYNKYGISFPVYSKDSILTPSKLHLEYGYLQHKPFYRQTWFMVSLA
                     ATSIVIIVMVIAVLCVKSKSYKYKQEAQKTLEESMAMSIDERQELALELYRSRHGVGT
                     GTLNSVGTLRSGTLGTLGRKSTNRHQPVSVHLGKSPPRPSPASVAYHSDEESLKCYDE
                     NPDDSSVTEKPSEVSSSEASQHSESENESVRSDPHSFVNHYANVNDSLRQSWKKTKPV
                     RNYSSYTDSEPEGSAVMSLNGGQIIVNNMARSRAPLPGFSSFV"
     misc_feature    597..809
                     /gene="LOC111066347"
                     /note="Immunoglobulin domain; Region: Ig_3; pfam13927"
                     /db_xref="CDD:464046"
     misc_feature    1164..1451
                     /gene="LOC111066347"
                     /note="Immunoglobulin domain; Region: Ig; cl11960"
                     /db_xref="CDD:472250"
     misc_feature    1218..1232
                     /gene="LOC111066347"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:409353"
     misc_feature    1263..1277
                     /gene="LOC111066347"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:409353"
     misc_feature    1338..1352
                     /gene="LOC111066347"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:409353"
     misc_feature    1380..1397
                     /gene="LOC111066347"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:409353"
     misc_feature    1422..1433
                     /gene="LOC111066347"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:409353"
     misc_feature    1461..1736
                     /gene="LOC111066347"
                     /note="Immunoglobulin domain; Region: Ig; cl11960"
                     /db_xref="CDD:472250"
     misc_feature    1515..1529
                     /gene="LOC111066347"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:409543"
     misc_feature    1554..1568
                     /gene="LOC111066347"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:409543"
     misc_feature    1632..1643
                     /gene="LOC111066347"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:409543"
     misc_feature    1671..1688
                     /gene="LOC111066347"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:409543"
     misc_feature    1710..1721
                     /gene="LOC111066347"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:409543"
     misc_feature    1929..2192
                     /gene="LOC111066347"
                     /note="Immunoglobulin I-set domain; Region: I-set;
                     pfam07679"
                     /db_xref="CDD:400151"
     misc_feature    1980..1994
                     /gene="LOC111066347"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:409562"
     misc_feature    2019..2033
                     /gene="LOC111066347"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:409562"
     misc_feature    2091..2102
                     /gene="LOC111066347"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:409562"
     misc_feature    2130..2147
                     /gene="LOC111066347"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:409562"
     misc_feature    2169..2180
                     /gene="LOC111066347"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:409562"
     misc_feature    2205..2474
                     /gene="LOC111066347"
                     /note="Immunoglobulin I-set domain; Region: I-set;
                     pfam07679"
                     /db_xref="CDD:400151"
     misc_feature    2253..2267
                     /gene="LOC111066347"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:409544"
     misc_feature    2298..2312
                     /gene="LOC111066347"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:409544"
     misc_feature    2370..2384
                     /gene="LOC111066347"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:409544"
     misc_feature    <2412..>3185
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain [General function
                     prediction only]; Region: FN3; COG3401"
                     /db_xref="CDD:442628"
     misc_feature    2412..2429
                     /gene="LOC111066347"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:409544"
     misc_feature    2451..2462
                     /gene="LOC111066347"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:409544"
     misc_feature    2484..2795
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(2484..2486,2715..2717,2760..2762)
                     /gene="LOC111066347"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(2763..2768,2772..2777)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    3129..3443
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(3408..3413,3417..3422)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(3456..3458,3651..3653,3696..3698)
                     /gene="LOC111066347"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    3459..3710
                     /gene="LOC111066347"
                     /note="Fibronectin type III domain; Region: fn3;
                     pfam00041"
                     /db_xref="CDD:394996"
     misc_feature    3750..4019
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(3750..3752,3948..3950,3993..3995)
                     /gene="LOC111066347"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(3996..4001,4005..4010)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    4056..4322
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    4377..4634
                     /gene="LOC111066347"
                     /note="Fibronectin type III domain; Region: fn3;
                     pfam00041"
                     /db_xref="CDD:394996"
     misc_feature    <4584..5618
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain [General function
                     prediction only]; Region: FN3; COG3401"
                     /db_xref="CDD:442628"
     misc_feature    order(4620..4625,4629..4634)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    4965..5252
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(4965..4967,5166..5168,5211..5213)
                     /gene="LOC111066347"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(5214..5219,5223..5228)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    5592..5897
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    <5820..>6458
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain [General function
                     prediction only]; Region: FN3; COG3401"
                     /db_xref="CDD:442628"
     misc_feature    order(5862..5867,5871..5876)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
ORIGIN      
        1 cctgcgcgca taacggttgt tcttgccgac tacgtcgcat cgtcgtcgtc attttcgtcg
       61 tcgctcgtag ttcgtggctc tcggtcgctc caacttgctg cggcgcgtgt ttcgaaacac
      121 agcagaccag acgaggaaga agtctagaga agcaaatggt tcaataaaga aacaacattg
      181 aaagcaggcg caagagaaag tcaacatgaa ttaagaaaaa cgcaagaaaa ttcaaaaaca
      241 attaaaattt aatcaaacaa aacaacaaaa aaaccaaaaa caaaattcag aagcaggcga
      301 aaagaatcaa cgatatcgaa gaaaaacagc cgcaagagag agagccgaaa atgaagagag
      361 accagcggcg atcttcagcg tcgtcgctgc gtcgccgtcg tcgttggtgc gtcgacgtca
      421 acgaaaaagg aacacgaatg tggctcaaaa tttcgctgtc gcagccgctg gaagcgtcgc
      481 tgtttgtgct ggcagcgctg ctgctgctca atgcggacag ctgctcatgt tacgcggatg
      541 ccaatccgca gcaacaacag cagctggtcc agcagcagca gcaacaactt caggcgccac
      601 gttttaccac acacccatcg tcatcgggct cgattgtgag cgagggcagc accaagatcc
      661 tacagtgcca tgctttgggt tatccacagc cgacatatcg ttggctgaag gacggcaagt
      721 ccgtgggcga gttctcatcg agtcagttct atcggttcca cagcacacgg cgcgaggatg
      781 cgggcagcta tcagtgcatt gccaagaacg atgccggatc catattcagc gagaagagcg
      841 acgttgtagt ggcctacatg ggcatctttg agaacgtcac cgagggacgc ctaactgttg
      901 tgagcggaca tccggccatc ttcgatatgc cggccattga gtcggtgcca acgccatcgg
      961 tgctgtggca gtcggcggac gggtcgctca actacgacat caagtacgcc ttcacccagg
     1021 ccaatcagct gattatactg agcgtggacg agaacgatcg gaggggctac cgggcgcggg
     1081 cgatcaacac gcagctgggc aaggaggaga tcagcgcgtt cgttcatctg aatgtcagtg
     1141 gcgatccgta catagaggtg gcacccgaga taattgtacg gccgcaggat gtcaaggtca
     1201 agaccggcac tggcgtcctc gagctgcagt gcatcgccaa tgcgcgaccc ctgcacgaac
     1261 tggagacgat ttggctgaag gacggcctcg ccgtggacac gaccggcgtg cggcacaccc
     1321 tcaacgatcc ctggaaccgc accctggccc tcctgcaagc caacagctcg cactccggcg
     1381 agtacacctg tcaggtgcgc ctgcgcagcg gtggctatcc aacggtcacc gcctcagccc
     1441 gcgtccaaat tctcgagccg cccgtcttct tcacgcccat gcgagcggaa acctttggtg
     1501 aatttggcgg ccaggtgcag ctgccctgcg atgtggtggg cgagcccacg ccccaagttg
     1561 aatggttccg gaatgcggag tctgtcgagg cgaatgtgca aagcggaaga tactcactgg
     1621 gagaggataa tacgctgata attaagaaac taatactgga tgattcggcc atgtttcagt
     1681 gcctggcccg aaatgaggcc ggcgagaact cagccagcac ctggctgcgc gtcaaaacag
     1741 aaacagaaac agaagcagcc aacagccgca tcaagcgctt ggcccagcca cgcatcttaa
     1801 gagtaagagc ttcgcatggt ggttcgggaa cggtcgcggg atcggtcacg ggatcgggat
     1861 caggatctgg ttccacctcc aactcccatc agcagggtag acgcaagcag tttcgctttg
     1921 cctcagcgcc ggtctttgag cagccgcccc agaatgtgac cgccctggat ggcaaggatg
     1981 cgacgatctc ctgtcgggcc attggctcgc ccaatcccaa tgttacctgg atctacaatg
     2041 aaacccaact ggttgagata tccagtcgcg ttcagatact cgaatcgggt gatttactca
     2101 tctcgaatat ccgtgccacg gacgcgggac tctacatctg tgtgcgggcc aacgaggcgg
     2161 gcagcgtcaa gggcgaggcc ttgctaagcg tgttagtgcg gacacaaatc atacagccgc
     2221 cagtggacac catcgtgctg ctgggcctga ccgcgacact gcagtgcaag gtgtccagcg
     2281 acccgagcgt gccctacaac atcgactggt accgggaggg ccaaatggcg cccatcagca
     2341 actcgcagcg gattggagtg caggcggacg ggcagctgga gatccaggcg gtgcgggcca
     2401 gcgatgtggg cagctattcg tgcgtggtta catcgccggg cggcaatgag acacggtcgg
     2461 cccgtctcag tgtcatcgag ctgcccttcc cgcccagcaa cgtgcgggtg gagcgtctgc
     2521 cagagccgca gcagcgcagc atcaatgtgt cctggacgcc cggattcgat ggcaacagtc
     2581 caatctccaa atttattatc cagcgacgtg aggtctctga attgggtcca gttccagatc
     2641 cccttctcaa ttggatcacc gaactgagca acgtatcggc caatcagcgg tggatgctgc
     2701 tggagaacct caaggcggcc accgtctatc agtttcgtgt cagtgccgtc aatcgggtcg
     2761 gcgagggctc cccctcggag cccagcaatg ttgttgagct gccccaagaa gctccttcgg
     2821 gaccgcctgt gggctttgtg ggctcggcac ggtccatgtc cgagatcatt acgcagtggc
     2881 agccgccgca ggaggagcat cgcaacggac agatcctggg ctacattctg cgctatcgcc
     2941 tgttcgggta caacaatgtg ccgtggtcct accagaacat caccaacgag gcgcagcgca
     3001 actttctgat ccaggagctg atcacgtgga aggactacat cgtgcagatt gcggccttca
     3061 acaacatggg cgtgggcgtc tacacggagg gggccaagat caagaccaag gagggtgtgc
     3121 ccgaggcacc gcccaccaac gtcagggtga aggccctcaa ctcgacggcg gcgcagatca
     3181 cgtggaagcc gccgaatccg cagcagatca acggcatcaa ccagggctac aagatccagg
     3241 catggcagcg acggcagctc gatggggagg agcgggacat ggagcggcgc atgatgacgg
     3301 tgccgcccag cctgatcgat ccactggccg agcagacgac ggtgctcggt ggcctggaca
     3361 agttcgccaa gttcaatgtg accgtactct gcttcaccga tcccggtgac ggtgtggcca
     3421 gccagctggt gccggtggag actttggacg acgtgcccga cgagataacg gccctgcact
     3481 ttgacgatgt ctccgatcgg tccgtcaaag tgctgtgggc gccgccgcgc ttcgccaacg
     3541 gcatcctcac cggctacacg gtgcgctacc aggtcaagga tcgccccgag acgatgaagt
     3601 tcttcaacct gaccgccgac gacaacgagc tgacggtgaa ccagctgcag gcgacgaccc
     3661 actactggtt cgaggtgtgc gcctggacgc gggtgggcag cgggccgccc aagacggcga
     3721 cgatccaatc gggcgtggag ccggtgctgc cgcatgcgcc caccacactg gccctgtcca
     3781 acatcgaagc gttttcggtg gtgctgcagt tcacgcccgg cttcgacggc aactcgagca
     3841 tcaccaagtg gaaggtggag gcgcagacgg cccgcaacat gacctggttc acgctctgtg
     3901 aaatcagcga tcccgatgcg gagaccctca ccgtgaccgg cctgatgccc ttcacccagt
     3961 accggctgcg gctgagcgcc accaatgtgg tgggcagctc ccggccctcg gaccccacca
     4021 aggactttca aaccattcag gccaagccga tgcacgcccc cttcaatgtg acggtacgcg
     4081 caatgagcgc cctgcagctg cgcgtccgct ggataccgct gcagcagatg gagtggttcg
     4141 gcaatccgcg cggctacaat gtcacctacc ggcaaatgga gcgcaccggc aagccctcca
     4201 agcacccgcc ccgctccgtg atgatcgagg atcacacggc caactcgcat gtgctcgagg
     4261 ggctcgagga gtggaccctc tacgaagtga tcatgaacgc ctgcaacgat gtgggctgct
     4321 cgctggacag cggcctggcc atggagcgca ccagggaagc ggtgcccagc tacggcccgc
     4381 tgcatgtgga ggcgaacgcc acctcctcga cgacggtggt ggtgcgctgg ggcgagatac
     4441 cgccccacca tcgcaacggc cagatcgatg gctacaaggt gtactacgcg gccaccgagc
     4501 gcggcatgca ggtgctctac aagacgatac ccaacaacag ctccttcacc accaccctca
     4561 ccgagctgca gaagtttgtg gtgtaccacg tccaggtgct ggcctacacg cggctcggca
     4621 acggcgccct cagcaccccg cccatccggg tgcagacgtt cgaggacacg cccggatcac
     4681 cgtccaatgt gagcttcccg gacgtcacct tctcgatggc gcgcatcatc tgggacgtgc
     4741 cgatggaccc caatggcgag atactcgcct accaggtcac ctacacgctc aacggaagcg
     4801 ccaatctgaa ctacagccgc gagtttccgc cctcggatcg caccttccgg gccaccggcc
     4861 tgatgcccga gcgctactac agcttcagcg tgacggccca gacacgcctc ggctggggca
     4921 aaacggcctc ggtgctggtg tacacgacca acaacaggga ccgtccgcag gcaccgtccg
     4981 ggccgcaggt gtcgcgcagc cagatccagg cccatcagat caccttcaac tggacgccgg
     5041 gccgcgacgg gttcgccccg ctgcgatact acacggtcga gatgcgggag aacgagggcc
     5101 gctggcagcc gctgcccgag cgcgtcgatc ccacactcag ctcgtacacg gccctgggtc
     5161 tgcgtccgta caccacctac cagttccgca ttcaggcgac caacgatctg ggcccgtcgg
     5221 cgttcagccg agagagcatt gtggtgcgca ccctgcccgc cgccccagcg gtgggtgtgg
     5281 ggggactgaa ggtggtgccc ataacgacca cctcggtgcg ggtgcagtgg ggggcgctgg
     5341 agacgggcat gtggaacggc gacgcggcca ccgggggata ccgcatactg taccagcagc
     5401 tgtcggactt cgcaccggcc ctgcagtcga ccccgaagac ggatgtgatg ggcatcaatg
     5461 agaacagcgt ggtgctgtcc gatctgcagc aggaccgcaa ctacgagatc gtggtgctgc
     5521 cattcaattc gcagggaccg ggcccggcca caccgccgac cgccgtctat gtgggcgagg
     5581 cggtgcccac tggagagccg cggggcgtgg atgccacggc catttccagc acggaggtgc
     5641 gcctgagctg gaagccaccg aagcagagca gccagaacgg agagatactc ggctacaaga
     5701 tattctattt ggtgacgtgg tcgccgcagg ccctcgagcc gggccgcaaa ttcgaggagg
     5761 aaatcgaagt ggtctcggcc acggccacat cgcacagcct ggtctttctc gataagttca
     5821 ccgagtaccg catccagttg ctggccttca atccggccgg agacgggccg aggtccgccc
     5881 ccgtcactgc gaagacgatg ccgggcgtgc ccagtgcccc gctcaatctg cgcttttcgg
     5941 acatcacaat gcagagcctg gaggtgacct gggacccgcc caagctgctc aacggcgaga
     6001 ttgttggcta tctggtcacc tacgagacca ccgaggagaa cgaaaagttc agcaagcagg
     6061 tgaagcagaa ggtgtccaac accacgctgc gtgtgcagaa tctggaggag gaggtcacct
     6121 acaccttcac cgtgcgcgcc cagacgaacg actatggacc ggcggtgagc gcgaatgtga
     6181 ccacaggccc ccaggatggc tccccggtgg caccgcgcga tctcacactc acaaagacac
     6241 tgtccagcgt tgaggtacat tgggtcaatg gaccctccgg ccggggcccc atactgggct
     6301 acctcatcga ggccaagaag cgagacgact cccgctggac taagattgag cagtccagaa
     6361 agggtaccat gaaggagttt accgtcagct accacatcct gatgccatcg acggcgtatt
     6421 tgttccgggt aattgcttac aataagtatg gcatatcgtt ccctgtttac tcgaaggact
     6481 cgatactgac gccctcgaag ctgcatctgg agtacggcta tctgcagcac aagcccttct
     6541 acaggcagac ctggttcatg gtctccctgg cggccacctc gatcgtcatc attgtcatgg
     6601 tcattgcggt gctctgtgtg aagagcaaga gctacaagta caagcaggag gcacaaaaga
     6661 cgctggagga gtccatggcc atgtcgattg atgagcgcca ggagctggcc ctggagctgt
     6721 atcgttcgcg tcacggcgtc ggcaccggca ccctgaacag cgttggaaca ttgcgcagcg
     6781 gaactttggg aaccctcggc cgtaagtcca ccaaccgaca ccagccggtg agtgtgcatt
     6841 tgggtaagag tccaccgcga ccctcgcccg catcggtggc gtaccacagc gatgaggaga
     6901 gtctcaagtg ctacgacgag aatcccgacg acagcagtgt tacggaaaag ccatccgagg
     6961 tgagcagctc ggaggcatcc cagcactcgg agagcgagaa cgagagcgtg aggagcgatc
     7021 cgcactcgtt cgtcaatcac tatgcgaatg tgaatgactc gctgcggcag tcctggaaga
     7081 agaccaagcc cgtgcgcaac tactcgagct acacagactc cgagccggag ggcagtgcag
     7141 tgatgagtct caatggtggc cagattattg tcaataatat ggccagatcg agggcaccac
     7201 tgcccggctt ctcgtcattt gtctgacaat caaccgaatt ctaagatcta tgccgtggta
     7261 gcagcagcac cgtcatccgc gagacatttg tctgaattat tttggaaacg ataacggaaa
     7321 acggaaaaac ggaggctgaa gctgaaaccg gagctggagt tgcagtgggg agcgttctaa
     7381 cgagttcgac acggatgtag cgagtgggct aaactgcctg cctgcctgca actgttctgt
     7441 ctggctctcc ctggatcttc gtagctgtcc ggcgaggcgc tgctacatgg atatttatcg
     7501 tagtt
//