PREDICTED: Drosophila obscura protein sidekick (LOC111066347),


LOCUS       XM_041591959            7403 bp    mRNA    linear   INV 14-MAY-2021
            transcript variant X8, mRNA.
ACCESSION   XM_041591959
VERSION     XM_041591959.1
DBLINK      BioProject: PRJNA728747
KEYWORDS    RefSeq.
SOURCE      Drosophila obscura
  ORGANISM  Drosophila obscura
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_024542752.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Drosophila obscura Annotation
                                           Release 101
            Annotation Version          :: 101
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 8.6
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..7403
                     /organism="Drosophila obscura"
                     /mol_type="mRNA"
                     /isolate="BZ-5 IFL"
                     /db_xref="taxon:7282"
                     /chromosome="Unknown"
                     /sex="male"
                     /tissue_type="whole fly"
                     /dev_stage="Adult fly"
                     /geo_loc_name="Serbia: Babin Zub"
                     /collection_date="2017"
     gene            1..7403
                     /gene="LOC111066347"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Gnomon. Supporting evidence
                     includes similarity to: 7 Proteins, and 100% coverage of
                     the annotated genomic feature by RNAseq alignments,
                     including 3 samples with support for all annotated
                     introns"
                     /db_xref="GeneID:111066347"
     CDS             351..7124
                     /gene="LOC111066347"
                     /codon_start=1
                     /product="protein sidekick isoform X8"
                     /protein_id="XP_041447893.1"
                     /db_xref="GeneID:111066347"
                     /translation="MKRDQRRSSASSLRRRRRWCVDVNEKGTRMWLKISLSQPLEASL
                     FVLAALLLLNADSCSCYADANPQQQQQLVQQQQQQLQAPRFTTHPSSSGSIVSEGSTK
                     ILQCHALGYPQPTYRWLKDGKSVGEFSSSQFYRFHSTRREDAGSYQCIAKNDAGSIFS
                     EKSDVVVAYMGIFENVTEGRLTVVSGHPAIFDMPAIESVPTPSVLWQSADGSLNYDIK
                     YAFTQANQLIILSVDENDRRGYRARAINTQLGKEEISAFVHLNVSGDPYIEVAPEIIV
                     RPQDVKVKTGTGVLELQCIANARPLHELETIWLKDGLAVDTTGVRHTLNDPWNRTLAL
                     LQANSSHSGEYTCQVRLRSGGYPTVTASARVQILEPPVFFTPMRAETFGEFGGQVQLP
                     CDVVGEPTPQVEWFRNAESVEANVQSGRYSLGEDNTLIIKKLILDDSAMFQCLARNEA
                     GENSASTWLRVKTSAPVFEQPPQNVTALDGKDATISCRAIGSPNPNVTWIYNETQLVE
                     ISSRVQILESGDLLISNIRATDAGLYICVRANEAGSVKGEALLSVLVRTQIIQPPVDT
                     IVLLGLTATLQCKVSSDPSVPYNIDWYREGQMAPISNSQRIGVQADGQLEIQAVRASD
                     VGSYSCVVTSPGGNETRSARLSVIELPFPPSNVRVERLPEPQQRSINVSWTPGFDGNS
                     PISKFIIQRREVSELEKFVGPVPDPLLNWITELSNVSANQRWMLLENLKAATVYQFRV
                     SAVNRVGEGSPSEPSNVVELPQEEFPLYRAPSGPPVGFVGSARSMSEIITQWQPPQEE
                     HRNGQILGYILRYRLFGYNNVPWSYQNITNEAQRNFLIQELITWKDYIVQIAAFNNMG
                     VGVYTEGAKIKTKEGVPEAPPTNVRVKALNSTAAQITWKPPNPQQINGINQGYKIQAW
                     QRRQLDGEERDMERRMMTVPPSLIDPLAEQTTVLGGLDKFAKFNVTVLCFTDPGDGVA
                     SQLVPVETLDDVPDEITALHFDDVSDRSVKVLWAPPRFANGILTGYTVRYQVKDRPET
                     MKFFNLTADDNELTVNQLQATTHYWFEVCAWTRVGSGPPKTATIQSGVEPVLPHAPTT
                     LALSNIEAFSVVLQFTPGFDGNSSITKWKVEAQTARNMTWFTLCEISDPDAETLTVTG
                     LMPFTQYRLRLSATNVVGSSRPSDPTKDFQTIQAKPMHAPFNVTVRAMSALQLRVRWI
                     PLQQMEWFGNPRGYNVTYRQMERTGKPSKHPPRSVMIEDHTANSHVLEGLEEWTLYEV
                     IMNACNDVGCSLDSGLAMERTREAVPSYGPLHVEANATSSTTVVVRWGEIPPHHRNGQ
                     IDGYKVYYAATERGMQVLYKTIPNNSSFTTTLTELQKFVVYHVQVLAYTRLGNGALST
                     PPIRVQTFEDTPGSPSNVSFPDVTFSMARIIWDVPMDPNGEILAYQVTYTLNGSANLN
                     YSREFPPSDRTFRATGLMPERYYSFSVTAQTRLGWGKTASVLVYTTNNRDRPQAPSGP
                     QVSRSQIQAHQITFNWTPGRDGFAPLRYYTVEMRENEGRWQPLPERVDPTLSSYTALG
                     LRPYTTYQFRIQATNDLGPSAFSRESIVVRTLPAAPAVGVGGLKVVPITTTSVRVQWG
                     ALETGMWNGDAATGGYRILYQQLSDFAPALQSTPKTDVMGINENSVVLSDLQQDRNYE
                     IVVLPFNSQGPGPATPPTAVYVGEAVPTGEPRGVDATAISSTEVRLSWKPPKQSSQNG
                     EILGYKIFYLVTWSPQALEPGRKFEEEIEVVSATATSHSLVFLDKFTEYRIQLLAFNP
                     AGDGPRSAPVTAKTMPGVPSAPLNLRFSDITMQSLEVTWDPPKLLNGEIVGYLVTYET
                     TEENEKFSKQVKQKVSNTTLRVQNLEEEVTYTFTVRAQTNDYGPAVSANVTTGPQDGS
                     PVAPRDLTLTKTLSSVEVHWVNGPSGRGPILGYLIEAKKRENGEPSFISNRPPYLRLD
                     DSRWTKIEQSRKGTMKEFTVSYHILMPSTAYLFRVIAYNKYGISFPVYSKDSILTPSK
                     LHLEYGYLQHKPFYRQTWFMVSLAATSIVIIVMVIAVLCVKSKSYKYKQEAQKTLEES
                     MAMSIDERQELALELYRSRHGVGTGTLNSVGTLRSGTLGTLGRKSTNRHQPVSVHLGK
                     SPPRPSPASVAYHSDEESLKCYDENPDDSSVTEKPSEVSSSEASQHSESENESVRSDP
                     HSFVNHYANVNDSLRQSWKKTKPVRNYSSYTDSEPEGSAVMSLNGGQIIVNNMARSRA
                     PLPGFSSFV"
     misc_feature    597..809
                     /gene="LOC111066347"
                     /note="Immunoglobulin domain; Region: Ig_3; pfam13927"
                     /db_xref="CDD:464046"
     misc_feature    1164..1451
                     /gene="LOC111066347"
                     /note="Immunoglobulin domain; Region: Ig; cl11960"
                     /db_xref="CDD:472250"
     misc_feature    1218..1232
                     /gene="LOC111066347"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:409353"
     misc_feature    1263..1277
                     /gene="LOC111066347"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:409353"
     misc_feature    1338..1352
                     /gene="LOC111066347"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:409353"
     misc_feature    1380..1397
                     /gene="LOC111066347"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:409353"
     misc_feature    1422..1433
                     /gene="LOC111066347"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:409353"
     misc_feature    1461..1736
                     /gene="LOC111066347"
                     /note="Immunoglobulin domain; Region: Ig; cl11960"
                     /db_xref="CDD:472250"
     misc_feature    1515..1529
                     /gene="LOC111066347"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:409543"
     misc_feature    1554..1568
                     /gene="LOC111066347"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:409543"
     misc_feature    1632..1643
                     /gene="LOC111066347"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:409543"
     misc_feature    1671..1688
                     /gene="LOC111066347"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:409543"
     misc_feature    1710..1721
                     /gene="LOC111066347"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:409543"
     misc_feature    1746..2009
                     /gene="LOC111066347"
                     /note="Immunoglobulin I-set domain; Region: I-set;
                     pfam07679"
                     /db_xref="CDD:400151"
     misc_feature    1797..1811
                     /gene="LOC111066347"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:409562"
     misc_feature    1836..1850
                     /gene="LOC111066347"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:409562"
     misc_feature    1908..1919
                     /gene="LOC111066347"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:409562"
     misc_feature    1947..1964
                     /gene="LOC111066347"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:409562"
     misc_feature    1986..1997
                     /gene="LOC111066347"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:409562"
     misc_feature    2022..2291
                     /gene="LOC111066347"
                     /note="Immunoglobulin I-set domain; Region: I-set;
                     pfam07679"
                     /db_xref="CDD:400151"
     misc_feature    2070..2084
                     /gene="LOC111066347"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:409544"
     misc_feature    2115..2129
                     /gene="LOC111066347"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:409544"
     misc_feature    2187..2201
                     /gene="LOC111066347"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:409544"
     misc_feature    2229..2246
                     /gene="LOC111066347"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:409544"
     misc_feature    2268..2279
                     /gene="LOC111066347"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:409544"
     misc_feature    2301..2624
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(2301..2303,2544..2546,2589..2591)
                     /gene="LOC111066347"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(2592..2597,2601..2606)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    2667..2954
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(2919..2924,2928..2933)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    2976..3290
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(3255..3260,3264..3269)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(3303..3305,3498..3500,3543..3545)
                     /gene="LOC111066347"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    3306..3557
                     /gene="LOC111066347"
                     /note="Fibronectin type III domain; Region: fn3;
                     pfam00041"
                     /db_xref="CDD:394996"
     misc_feature    3597..3866
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(3597..3599,3795..3797,3840..3842)
                     /gene="LOC111066347"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(3843..3848,3852..3857)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    3903..4169
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    4224..4481
                     /gene="LOC111066347"
                     /note="Fibronectin type III domain; Region: fn3;
                     pfam00041"
                     /db_xref="CDD:394996"
     misc_feature    4443..6098
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain [General function
                     prediction only]; Region: FN3; COG3401"
                     /db_xref="CDD:442628"
     misc_feature    order(4467..4472,4476..4481)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    6051..6371
                     /gene="LOC111066347"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(6051..6053,6303..6305,6348..6350)
                     /gene="LOC111066347"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(6351..6356,6360..6365)
                     /gene="LOC111066347"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
ORIGIN      
        1 cctgcgcgca taacggttgt tcttgccgac tacgtcgcat cgtcgtcgtc attttcgtcg
       61 tcgctcgtag ttcgtggctc tcggtcgctc caacttgctg cggcgcgtgt ttcgaaacac
      121 agcagaccag acgaggaaga agtctagaga agcaaatggt tcaataaaga aacaacattg
      181 aaagcaggcg caagagaaag tcaacatgaa ttaagaaaaa cgcaagaaaa ttcaaaaaca
      241 attaaaattt aatcaaacaa aacaacaaaa aaaccaaaaa caaaattcag aagcaggcga
      301 aaagaatcaa cgatatcgaa gaaaaacagc cgcaagagag agagccgaaa atgaagagag
      361 accagcggcg atcttcagcg tcgtcgctgc gtcgccgtcg tcgttggtgc gtcgacgtca
      421 acgaaaaagg aacacgaatg tggctcaaaa tttcgctgtc gcagccgctg gaagcgtcgc
      481 tgtttgtgct ggcagcgctg ctgctgctca atgcggacag ctgctcatgt tacgcggatg
      541 ccaatccgca gcaacaacag cagctggtcc agcagcagca gcaacaactt caggcgccac
      601 gttttaccac acacccatcg tcatcgggct cgattgtgag cgagggcagc accaagatcc
      661 tacagtgcca tgctttgggt tatccacagc cgacatatcg ttggctgaag gacggcaagt
      721 ccgtgggcga gttctcatcg agtcagttct atcggttcca cagcacacgg cgcgaggatg
      781 cgggcagcta tcagtgcatt gccaagaacg atgccggatc catattcagc gagaagagcg
      841 acgttgtagt ggcctacatg ggcatctttg agaacgtcac cgagggacgc ctaactgttg
      901 tgagcggaca tccggccatc ttcgatatgc cggccattga gtcggtgcca acgccatcgg
      961 tgctgtggca gtcggcggac gggtcgctca actacgacat caagtacgcc ttcacccagg
     1021 ccaatcagct gattatactg agcgtggacg agaacgatcg gaggggctac cgggcgcggg
     1081 cgatcaacac gcagctgggc aaggaggaga tcagcgcgtt cgttcatctg aatgtcagtg
     1141 gcgatccgta catagaggtg gcacccgaga taattgtacg gccgcaggat gtcaaggtca
     1201 agaccggcac tggcgtcctc gagctgcagt gcatcgccaa tgcgcgaccc ctgcacgaac
     1261 tggagacgat ttggctgaag gacggcctcg ccgtggacac gaccggcgtg cggcacaccc
     1321 tcaacgatcc ctggaaccgc accctggccc tcctgcaagc caacagctcg cactccggcg
     1381 agtacacctg tcaggtgcgc ctgcgcagcg gtggctatcc aacggtcacc gcctcagccc
     1441 gcgtccaaat tctcgagccg cccgtcttct tcacgcccat gcgagcggaa acctttggtg
     1501 aatttggcgg ccaggtgcag ctgccctgcg atgtggtggg cgagcccacg ccccaagttg
     1561 aatggttccg gaatgcggag tctgtcgagg cgaatgtgca aagcggaaga tactcactgg
     1621 gagaggataa tacgctgata attaagaaac taatactgga tgattcggcc atgtttcagt
     1681 gcctggcccg aaatgaggcc ggcgagaact cagccagcac ctggctgcgc gtcaaaacct
     1741 cagcgccggt ctttgagcag ccgccccaga atgtgaccgc cctggatggc aaggatgcga
     1801 cgatctcctg tcgggccatt ggctcgccca atcccaatgt tacctggatc tacaatgaaa
     1861 cccaactggt tgagatatcc agtcgcgttc agatactcga atcgggtgat ttactcatct
     1921 cgaatatccg tgccacggac gcgggactct acatctgtgt gcgggccaac gaggcgggca
     1981 gcgtcaaggg cgaggccttg ctaagcgtgt tagtgcggac acaaatcata cagccgccag
     2041 tggacaccat cgtgctgctg ggcctgaccg cgacactgca gtgcaaggtg tccagcgacc
     2101 cgagcgtgcc ctacaacatc gactggtacc gggagggcca aatggcgccc atcagcaact
     2161 cgcagcggat tggagtgcag gcggacgggc agctggagat ccaggcggtg cgggccagcg
     2221 atgtgggcag ctattcgtgc gtggttacat cgccgggcgg caatgagaca cggtcggccc
     2281 gtctcagtgt catcgagctg cccttcccgc ccagcaacgt gcgggtggag cgtctgccag
     2341 agccgcagca gcgcagcatc aatgtgtcct ggacgcccgg attcgatggc aacagtccaa
     2401 tctccaaatt tattatccag cgacgtgagg tctctgaatt ggaaaaattc gtaggtccag
     2461 ttccagatcc ccttctcaat tggatcaccg aactgagcaa cgtatcggcc aatcagcggt
     2521 ggatgctgct ggagaacctc aaggcggcca ccgtctatca gtttcgtgtc agtgccgtca
     2581 atcgggtcgg cgagggctcc ccctcggagc ccagcaatgt tgttgagctg ccccaagaag
     2641 agtttccgct ttatcgagct ccttcgggac cgcctgtggg ctttgtgggc tcggcacggt
     2701 ccatgtccga gatcattacg cagtggcagc cgccgcagga ggagcatcgc aacggacaga
     2761 tcctgggcta cattctgcgc tatcgcctgt tcgggtacaa caatgtgccg tggtcctacc
     2821 agaacatcac caacgaggcg cagcgcaact ttctgatcca ggagctgatc acgtggaagg
     2881 actacatcgt gcagattgcg gccttcaaca acatgggcgt gggcgtctac acggaggggg
     2941 ccaagatcaa gaccaaggag ggtgtgcccg aggcaccgcc caccaacgtc agggtgaagg
     3001 ccctcaactc gacggcggcg cagatcacgt ggaagccgcc gaatccgcag cagatcaacg
     3061 gcatcaacca gggctacaag atccaggcat ggcagcgacg gcagctcgat ggggaggagc
     3121 gggacatgga gcggcgcatg atgacggtgc cgcccagcct gatcgatcca ctggccgagc
     3181 agacgacggt gctcggtggc ctggacaagt tcgccaagtt caatgtgacc gtactctgct
     3241 tcaccgatcc cggtgacggt gtggccagcc agctggtgcc ggtggagact ttggacgacg
     3301 tgcccgacga gataacggcc ctgcactttg acgatgtctc cgatcggtcc gtcaaagtgc
     3361 tgtgggcgcc gccgcgcttc gccaacggca tcctcaccgg ctacacggtg cgctaccagg
     3421 tcaaggatcg ccccgagacg atgaagttct tcaacctgac cgccgacgac aacgagctga
     3481 cggtgaacca gctgcaggcg acgacccact actggttcga ggtgtgcgcc tggacgcggg
     3541 tgggcagcgg gccgcccaag acggcgacga tccaatcggg cgtggagccg gtgctgccgc
     3601 atgcgcccac cacactggcc ctgtccaaca tcgaagcgtt ttcggtggtg ctgcagttca
     3661 cgcccggctt cgacggcaac tcgagcatca ccaagtggaa ggtggaggcg cagacggccc
     3721 gcaacatgac ctggttcacg ctctgtgaaa tcagcgatcc cgatgcggag accctcaccg
     3781 tgaccggcct gatgcccttc acccagtacc ggctgcggct gagcgccacc aatgtggtgg
     3841 gcagctcccg gccctcggac cccaccaagg actttcaaac cattcaggcc aagccgatgc
     3901 acgccccctt caatgtgacg gtacgcgcaa tgagcgccct gcagctgcgc gtccgctgga
     3961 taccgctgca gcagatggag tggttcggca atccgcgcgg ctacaatgtc acctaccggc
     4021 aaatggagcg caccggcaag ccctccaagc acccgccccg ctccgtgatg atcgaggatc
     4081 acacggccaa ctcgcatgtg ctcgaggggc tcgaggagtg gaccctctac gaagtgatca
     4141 tgaacgcctg caacgatgtg ggctgctcgc tggacagcgg cctggccatg gagcgcacca
     4201 gggaagcggt gcccagctac ggcccgctgc atgtggaggc gaacgccacc tcctcgacga
     4261 cggtggtggt gcgctggggc gagataccgc cccaccatcg caacggccag atcgatggct
     4321 acaaggtgta ctacgcggcc accgagcgcg gcatgcaggt gctctacaag acgataccca
     4381 acaacagctc cttcaccacc accctcaccg agctgcagaa gtttgtggtg taccacgtcc
     4441 aggtgctggc ctacacgcgg ctcggcaacg gcgccctcag caccccgccc atccgggtgc
     4501 agacgttcga ggacacgccc ggatcaccgt ccaatgtgag cttcccggac gtcaccttct
     4561 cgatggcgcg catcatctgg gacgtgccga tggaccccaa tggcgagata ctcgcctacc
     4621 aggtcaccta cacgctcaac ggaagcgcca atctgaacta cagccgcgag tttccgccct
     4681 cggatcgcac cttccgggcc accggcctga tgcccgagcg ctactacagc ttcagcgtga
     4741 cggcccagac acgcctcggc tggggcaaaa cggcctcggt gctggtgtac acgaccaaca
     4801 acagggaccg tccgcaggca ccgtccgggc cgcaggtgtc gcgcagccag atccaggccc
     4861 atcagatcac cttcaactgg acgccgggcc gcgacgggtt cgccccgctg cgatactaca
     4921 cggtcgagat gcgggagaac gagggccgct ggcagccgct gcccgagcgc gtcgatccca
     4981 cactcagctc gtacacggcc ctgggtctgc gtccgtacac cacctaccag ttccgcattc
     5041 aggcgaccaa cgatctgggc ccgtcggcgt tcagccgaga gagcattgtg gtgcgcaccc
     5101 tgcccgccgc cccagcggtg ggtgtggggg gactgaaggt ggtgcccata acgaccacct
     5161 cggtgcgggt gcagtggggg gcgctggaga cgggcatgtg gaacggcgac gcggccaccg
     5221 ggggataccg catactgtac cagcagctgt cggacttcgc accggccctg cagtcgaccc
     5281 cgaagacgga tgtgatgggc atcaatgaga acagcgtggt gctgtccgat ctgcagcagg
     5341 accgcaacta cgagatcgtg gtgctgccat tcaattcgca gggaccgggc ccggccacac
     5401 cgccgaccgc cgtctatgtg ggcgaggcgg tgcccactgg agagccgcgg ggcgtggatg
     5461 ccacggccat ttccagcacg gaggtgcgcc tgagctggaa gccaccgaag cagagcagcc
     5521 agaacggaga gatactcggc tacaagatat tctatttggt gacgtggtcg ccgcaggccc
     5581 tcgagccggg ccgcaaattc gaggaggaaa tcgaagtggt ctcggccacg gccacatcgc
     5641 acagcctggt ctttctcgat aagttcaccg agtaccgcat ccagttgctg gccttcaatc
     5701 cggccggaga cgggccgagg tccgcccccg tcactgcgaa gacgatgccg ggcgtgccca
     5761 gtgccccgct caatctgcgc ttttcggaca tcacaatgca gagcctggag gtgacctggg
     5821 acccgcccaa gctgctcaac ggcgagattg ttggctatct ggtcacctac gagaccaccg
     5881 aggagaacga aaagttcagc aagcaggtga agcagaaggt gtccaacacc acgctgcgtg
     5941 tgcagaatct ggaggaggag gtcacctaca ccttcaccgt gcgcgcccag acgaacgact
     6001 atggaccggc ggtgagcgcg aatgtgacca caggccccca ggatggctcc ccggtggcac
     6061 cgcgcgatct cacactcaca aagacactgt ccagcgttga ggtacattgg gtcaatggac
     6121 cctccggccg gggccccata ctgggctacc tcatcgaggc caagaagcga gaaaatggag
     6181 agccctcatt tatttctaat agacctccct atcttcgctt agacgactcc cgctggacta
     6241 agattgagca gtccagaaag ggtaccatga aggagtttac cgtcagctac cacatcctga
     6301 tgccatcgac ggcgtatttg ttccgggtaa ttgcttacaa taagtatggc atatcgttcc
     6361 ctgtttactc gaaggactcg atactgacgc cctcgaagct gcatctggag tacggctatc
     6421 tgcagcacaa gcccttctac aggcagacct ggttcatggt ctccctggcg gccacctcga
     6481 tcgtcatcat tgtcatggtc attgcggtgc tctgtgtgaa gagcaagagc tacaagtaca
     6541 agcaggaggc acaaaagacg ctggaggagt ccatggccat gtcgattgat gagcgccagg
     6601 agctggccct ggagctgtat cgttcgcgtc acggcgtcgg caccggcacc ctgaacagcg
     6661 ttggaacatt gcgcagcgga actttgggaa ccctcggccg taagtccacc aaccgacacc
     6721 agccggtgag tgtgcatttg ggtaagagtc caccgcgacc ctcgcccgca tcggtggcgt
     6781 accacagcga tgaggagagt ctcaagtgct acgacgagaa tcccgacgac agcagtgtta
     6841 cggaaaagcc atccgaggtg agcagctcgg aggcatccca gcactcggag agcgagaacg
     6901 agagcgtgag gagcgatccg cactcgttcg tcaatcacta tgcgaatgtg aatgactcgc
     6961 tgcggcagtc ctggaagaag accaagcccg tgcgcaacta ctcgagctac acagactccg
     7021 agccggaggg cagtgcagtg atgagtctca atggtggcca gattattgtc aataatatgg
     7081 ccagatcgag ggcaccactg cccggcttct cgtcatttgt ctgacaatca accgaattct
     7141 aagatctatg ccgtggtagc agcagcaccg tcatccgcga gacatttgtc tgaattattt
     7201 tggaaacgat aacggaaaac ggaaaaacgg aggctgaagc tgaaaccgga gctggagttg
     7261 cagtggggag cgttctaacg agttcgacac ggatgtagcg agtgggctaa actgcctgcc
     7321 tgcctgcaac tgttctgtct ggctctccct ggatcttcgt agctgtccgg cgaggcgctg
     7381 ctacatggat atttatcgta gtt