LOCUS XM_041591957 7505 bp mRNA linear INV 14-MAY-2021
DEFINITION PREDICTED: Drosophila obscura protein sidekick (LOC111066347),
transcript variant X7, mRNA.
ACCESSION XM_041591957
VERSION XM_041591957.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..7505
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..7505
/gene="LOC111066347"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 8 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 7 samples with support for all annotated
introns"
/db_xref="GeneID:111066347"
CDS 351..7226
/gene="LOC111066347"
/codon_start=1
/product="protein sidekick isoform X7"
/protein_id="XP_041447891.1"
/db_xref="GeneID:111066347"
/translation="MKRDQRRSSASSLRRRRRWCVDVNEKGTRMWLKISLSQPLEASL
FVLAALLLLNADSCSCYADANPQQQQQLVQQQQQQLQAPRFTTHPSSSGSIVSEGSTK
ILQCHALGYPQPTYRWLKDGKSVGEFSSSQFYRFHSTRREDAGSYQCIAKNDAGSIFS
EKSDVVVAYMGIFENVTEGRLTVVSGHPAIFDMPAIESVPTPSVLWQSADGSLNYDIK
YAFTQANQLIILSVDENDRRGYRARAINTQLGKEEISAFVHLNVSGDPYIEVAPEIIV
RPQDVKVKTGTGVLELQCIANARPLHELETIWLKDGLAVDTTGVRHTLNDPWNRTLAL
LQANSSHSGEYTCQVRLRSGGYPTVTASARVQILEPPVFFTPMRAETFGEFGGQVQLP
CDVVGEPTPQVEWFRNAESVEANVQSGRYSLGEDNTLIIKKLILDDSAMFQCLARNEA
GENSASTWLRVKTETETEAANSRIKRLAQPRILRVRASHGGSGTVAGSVTGSGSGSGS
TSNSHQQGRRKQFRFASAPVFEQPPQNVTALDGKDATISCRAIGSPNPNVTWIYNETQ
LVEISSRVQILESGDLLISNIRATDAGLYICVRANEAGSVKGEALLSVLVRTQIIQPP
VDTIVLLGLTATLQCKVSSDPSVPYNIDWYREGQMAPISNSQRIGVQADGQLEIQAVR
ASDVGSYSCVVTSPGGNETRSARLSVIELPFPPSNVRVERLPEPQQRSINVSWTPGFD
GNSPISKFIIQRREVSELGPVPDPLLNWITELSNVSANQRWMLLENLKAATVYQFRVS
AVNRVGEGSPSEPSNVVELPQEAPSGPPVGFVGSARSMSEIITQWQPPQEEHRNGQIL
GYILRYRLFGYNNVPWSYQNITNEAQRNFLIQELITWKDYIVQIAAFNNMGVGVYTEG
AKIKTKEGVPEAPPTNVRVKALNSTAAQITWKPPNPQQINGINQGYKIQAWQRRQLDG
EERDMERRMMTVPPSLIDPLAEQTTVLGGLDKFAKFNVTVLCFTDPGDGVASQLVPVE
TLDDVPDEITALHFDDVSDRSVKVLWAPPRFANGILTGYTVRYQVKDRPETMKFFNLT
ADDNELTVNQLQATTHYWFEVCAWTRVGSGPPKTATIQSGVEPVLPHAPTTLALSNIE
AFSVVLQFTPGFDGNSSITKWKVEAQTARNMTWFTLCEISDPDAETLTVTGLMPFTQY
RLRLSATNVVGSSRPSDPTKDFQTIQAKPMHAPFNVTVRAMSALQLRVRWIPLQQMEW
FGNPRGYNVTYRQMERTGKPSKHPPRSVMIEDHTANSHVLEGLEEWTLYEVIMNACND
VGCSLDSGLAMERTREAVPSYGPLHVEANATSSTTVVVRWGEIPPHHRNGQIDGYKVY
YAATERGMQVLYKTIPNNSSFTTTLTELQKFVVYHVQVLAYTRLGNGALSTPPIRVQT
FEDTPGSPSNVSFPDVTFSMARIIWDVPMDPNGEILAYQVTYTLNGSANLNYSREFPP
SDRTFRATGLMPERYYSFSVTAQTRLGWGKTASVLVYTTNNRDRPQAPSGPQVSRSQI
QAHQITFNWTPGRDGFAPLRYYTVEMRENEGRWQPLPERVDPTLSSYTALGLRPYTTY
QFRIQATNDLGPSAFSRESIVVRTLPAAPAVGVGGLKVVPITTTSVRVQWGALETGMW
NGDAATGGYRILYQQLSDFAPALQSTPKTDVMGINENSVVLSDLQQDRNYEIVVLPFN
SQGPGPATPPTAVYVGEAVPTGEPRGVDATAISSTEVRLSWKPPKQSSQNGEILGYKI
FYLVTWSPQALEPGRKFEEEIEVVSATATSHSLVFLDKFTEYRIQLLAFNPAGDGPRS
APVTAKTMPGVPSAPLNLRFSDITMQSLEVTWDPPKLLNGEIVGYLVTYETTEENEKF
SKQVKQKVSNTTLRVQNLEEEVTYTFTVRAQTNDYGPAVSANVTTGPQDGSPVAPRDL
TLTKTLSSVEVHWVNGPSGRGPILGYLIEAKKRDDSRWTKIEQSRKGTMKEFTVSYHI
LMPSTAYLFRVIAYNKYGISFPVYSKDSILTPSKLHLEYGYLQHKPFYRQTWFMVSLA
ATSIVIIVMVIAVLCVKSKSYKYKQEAQKTLEESMAMSIDERQELALELYRSRHGVGT
GTLNSVGTLRSGTLGTLGRKSTNRHQPVSVHLGKSPPRPSPASVAYHSDEESLKCYDE
NPDDSSVTEKPSEVSSSEASQHSESENESVRSDPHSFVNHYANVNDSLRQSWKKTKPV
RNYSSYTDSEPEGSAVMSLNGGQIIVNNMARSRAPLPGFSSFV"
misc_feature 597..809
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig_3; pfam13927"
/db_xref="CDD:464046"
misc_feature 1164..1451
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig; cl11960"
/db_xref="CDD:472250"
misc_feature 1218..1232
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409353"
misc_feature 1263..1277
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409353"
misc_feature 1338..1352
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409353"
misc_feature 1380..1397
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409353"
misc_feature 1422..1433
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409353"
misc_feature 1461..1736
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig; cl11960"
/db_xref="CDD:472250"
misc_feature 1515..1529
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409543"
misc_feature 1554..1568
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409543"
misc_feature 1632..1643
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409543"
misc_feature 1671..1688
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409543"
misc_feature 1710..1721
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409543"
misc_feature 1929..2192
/gene="LOC111066347"
/note="Immunoglobulin I-set domain; Region: I-set;
pfam07679"
/db_xref="CDD:400151"
misc_feature 1980..1994
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409562"
misc_feature 2019..2033
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409562"
misc_feature 2091..2102
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409562"
misc_feature 2130..2147
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409562"
misc_feature 2169..2180
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409562"
misc_feature 2205..2474
/gene="LOC111066347"
/note="Immunoglobulin I-set domain; Region: I-set;
pfam07679"
/db_xref="CDD:400151"
misc_feature 2253..2267
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409544"
misc_feature 2298..2312
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409544"
misc_feature 2370..2384
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409544"
misc_feature <2412..>3185
/gene="LOC111066347"
/note="Fibronectin type 3 domain [General function
prediction only]; Region: FN3; COG3401"
/db_xref="CDD:442628"
misc_feature 2412..2429
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409544"
misc_feature 2451..2462
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409544"
misc_feature 2484..2795
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(2484..2486,2715..2717,2760..2762)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(2763..2768,2772..2777)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 3129..3443
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(3408..3413,3417..3422)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature order(3456..3458,3651..3653,3696..3698)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature 3459..3710
/gene="LOC111066347"
/note="Fibronectin type III domain; Region: fn3;
pfam00041"
/db_xref="CDD:394996"
misc_feature 3750..4019
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(3750..3752,3948..3950,3993..3995)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(3996..4001,4005..4010)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 4056..4322
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature 4377..4634
/gene="LOC111066347"
/note="Fibronectin type III domain; Region: fn3;
pfam00041"
/db_xref="CDD:394996"
misc_feature <4584..5618
/gene="LOC111066347"
/note="Fibronectin type 3 domain [General function
prediction only]; Region: FN3; COG3401"
/db_xref="CDD:442628"
misc_feature order(4620..4625,4629..4634)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 4965..5252
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(4965..4967,5166..5168,5211..5213)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(5214..5219,5223..5228)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 5592..5897
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature <5820..>6458
/gene="LOC111066347"
/note="Fibronectin type 3 domain [General function
prediction only]; Region: FN3; COG3401"
/db_xref="CDD:442628"
misc_feature order(5862..5867,5871..5876)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
ORIGIN
1 cctgcgcgca taacggttgt tcttgccgac tacgtcgcat cgtcgtcgtc attttcgtcg
61 tcgctcgtag ttcgtggctc tcggtcgctc caacttgctg cggcgcgtgt ttcgaaacac
121 agcagaccag acgaggaaga agtctagaga agcaaatggt tcaataaaga aacaacattg
181 aaagcaggcg caagagaaag tcaacatgaa ttaagaaaaa cgcaagaaaa ttcaaaaaca
241 attaaaattt aatcaaacaa aacaacaaaa aaaccaaaaa caaaattcag aagcaggcga
301 aaagaatcaa cgatatcgaa gaaaaacagc cgcaagagag agagccgaaa atgaagagag
361 accagcggcg atcttcagcg tcgtcgctgc gtcgccgtcg tcgttggtgc gtcgacgtca
421 acgaaaaagg aacacgaatg tggctcaaaa tttcgctgtc gcagccgctg gaagcgtcgc
481 tgtttgtgct ggcagcgctg ctgctgctca atgcggacag ctgctcatgt tacgcggatg
541 ccaatccgca gcaacaacag cagctggtcc agcagcagca gcaacaactt caggcgccac
601 gttttaccac acacccatcg tcatcgggct cgattgtgag cgagggcagc accaagatcc
661 tacagtgcca tgctttgggt tatccacagc cgacatatcg ttggctgaag gacggcaagt
721 ccgtgggcga gttctcatcg agtcagttct atcggttcca cagcacacgg cgcgaggatg
781 cgggcagcta tcagtgcatt gccaagaacg atgccggatc catattcagc gagaagagcg
841 acgttgtagt ggcctacatg ggcatctttg agaacgtcac cgagggacgc ctaactgttg
901 tgagcggaca tccggccatc ttcgatatgc cggccattga gtcggtgcca acgccatcgg
961 tgctgtggca gtcggcggac gggtcgctca actacgacat caagtacgcc ttcacccagg
1021 ccaatcagct gattatactg agcgtggacg agaacgatcg gaggggctac cgggcgcggg
1081 cgatcaacac gcagctgggc aaggaggaga tcagcgcgtt cgttcatctg aatgtcagtg
1141 gcgatccgta catagaggtg gcacccgaga taattgtacg gccgcaggat gtcaaggtca
1201 agaccggcac tggcgtcctc gagctgcagt gcatcgccaa tgcgcgaccc ctgcacgaac
1261 tggagacgat ttggctgaag gacggcctcg ccgtggacac gaccggcgtg cggcacaccc
1321 tcaacgatcc ctggaaccgc accctggccc tcctgcaagc caacagctcg cactccggcg
1381 agtacacctg tcaggtgcgc ctgcgcagcg gtggctatcc aacggtcacc gcctcagccc
1441 gcgtccaaat tctcgagccg cccgtcttct tcacgcccat gcgagcggaa acctttggtg
1501 aatttggcgg ccaggtgcag ctgccctgcg atgtggtggg cgagcccacg ccccaagttg
1561 aatggttccg gaatgcggag tctgtcgagg cgaatgtgca aagcggaaga tactcactgg
1621 gagaggataa tacgctgata attaagaaac taatactgga tgattcggcc atgtttcagt
1681 gcctggcccg aaatgaggcc ggcgagaact cagccagcac ctggctgcgc gtcaaaacag
1741 aaacagaaac agaagcagcc aacagccgca tcaagcgctt ggcccagcca cgcatcttaa
1801 gagtaagagc ttcgcatggt ggttcgggaa cggtcgcggg atcggtcacg ggatcgggat
1861 caggatctgg ttccacctcc aactcccatc agcagggtag acgcaagcag tttcgctttg
1921 cctcagcgcc ggtctttgag cagccgcccc agaatgtgac cgccctggat ggcaaggatg
1981 cgacgatctc ctgtcgggcc attggctcgc ccaatcccaa tgttacctgg atctacaatg
2041 aaacccaact ggttgagata tccagtcgcg ttcagatact cgaatcgggt gatttactca
2101 tctcgaatat ccgtgccacg gacgcgggac tctacatctg tgtgcgggcc aacgaggcgg
2161 gcagcgtcaa gggcgaggcc ttgctaagcg tgttagtgcg gacacaaatc atacagccgc
2221 cagtggacac catcgtgctg ctgggcctga ccgcgacact gcagtgcaag gtgtccagcg
2281 acccgagcgt gccctacaac atcgactggt accgggaggg ccaaatggcg cccatcagca
2341 actcgcagcg gattggagtg caggcggacg ggcagctgga gatccaggcg gtgcgggcca
2401 gcgatgtggg cagctattcg tgcgtggtta catcgccggg cggcaatgag acacggtcgg
2461 cccgtctcag tgtcatcgag ctgcccttcc cgcccagcaa cgtgcgggtg gagcgtctgc
2521 cagagccgca gcagcgcagc atcaatgtgt cctggacgcc cggattcgat ggcaacagtc
2581 caatctccaa atttattatc cagcgacgtg aggtctctga attgggtcca gttccagatc
2641 cccttctcaa ttggatcacc gaactgagca acgtatcggc caatcagcgg tggatgctgc
2701 tggagaacct caaggcggcc accgtctatc agtttcgtgt cagtgccgtc aatcgggtcg
2761 gcgagggctc cccctcggag cccagcaatg ttgttgagct gccccaagaa gctccttcgg
2821 gaccgcctgt gggctttgtg ggctcggcac ggtccatgtc cgagatcatt acgcagtggc
2881 agccgccgca ggaggagcat cgcaacggac agatcctggg ctacattctg cgctatcgcc
2941 tgttcgggta caacaatgtg ccgtggtcct accagaacat caccaacgag gcgcagcgca
3001 actttctgat ccaggagctg atcacgtgga aggactacat cgtgcagatt gcggccttca
3061 acaacatggg cgtgggcgtc tacacggagg gggccaagat caagaccaag gagggtgtgc
3121 ccgaggcacc gcccaccaac gtcagggtga aggccctcaa ctcgacggcg gcgcagatca
3181 cgtggaagcc gccgaatccg cagcagatca acggcatcaa ccagggctac aagatccagg
3241 catggcagcg acggcagctc gatggggagg agcgggacat ggagcggcgc atgatgacgg
3301 tgccgcccag cctgatcgat ccactggccg agcagacgac ggtgctcggt ggcctggaca
3361 agttcgccaa gttcaatgtg accgtactct gcttcaccga tcccggtgac ggtgtggcca
3421 gccagctggt gccggtggag actttggacg acgtgcccga cgagataacg gccctgcact
3481 ttgacgatgt ctccgatcgg tccgtcaaag tgctgtgggc gccgccgcgc ttcgccaacg
3541 gcatcctcac cggctacacg gtgcgctacc aggtcaagga tcgccccgag acgatgaagt
3601 tcttcaacct gaccgccgac gacaacgagc tgacggtgaa ccagctgcag gcgacgaccc
3661 actactggtt cgaggtgtgc gcctggacgc gggtgggcag cgggccgccc aagacggcga
3721 cgatccaatc gggcgtggag ccggtgctgc cgcatgcgcc caccacactg gccctgtcca
3781 acatcgaagc gttttcggtg gtgctgcagt tcacgcccgg cttcgacggc aactcgagca
3841 tcaccaagtg gaaggtggag gcgcagacgg cccgcaacat gacctggttc acgctctgtg
3901 aaatcagcga tcccgatgcg gagaccctca ccgtgaccgg cctgatgccc ttcacccagt
3961 accggctgcg gctgagcgcc accaatgtgg tgggcagctc ccggccctcg gaccccacca
4021 aggactttca aaccattcag gccaagccga tgcacgcccc cttcaatgtg acggtacgcg
4081 caatgagcgc cctgcagctg cgcgtccgct ggataccgct gcagcagatg gagtggttcg
4141 gcaatccgcg cggctacaat gtcacctacc ggcaaatgga gcgcaccggc aagccctcca
4201 agcacccgcc ccgctccgtg atgatcgagg atcacacggc caactcgcat gtgctcgagg
4261 ggctcgagga gtggaccctc tacgaagtga tcatgaacgc ctgcaacgat gtgggctgct
4321 cgctggacag cggcctggcc atggagcgca ccagggaagc ggtgcccagc tacggcccgc
4381 tgcatgtgga ggcgaacgcc acctcctcga cgacggtggt ggtgcgctgg ggcgagatac
4441 cgccccacca tcgcaacggc cagatcgatg gctacaaggt gtactacgcg gccaccgagc
4501 gcggcatgca ggtgctctac aagacgatac ccaacaacag ctccttcacc accaccctca
4561 ccgagctgca gaagtttgtg gtgtaccacg tccaggtgct ggcctacacg cggctcggca
4621 acggcgccct cagcaccccg cccatccggg tgcagacgtt cgaggacacg cccggatcac
4681 cgtccaatgt gagcttcccg gacgtcacct tctcgatggc gcgcatcatc tgggacgtgc
4741 cgatggaccc caatggcgag atactcgcct accaggtcac ctacacgctc aacggaagcg
4801 ccaatctgaa ctacagccgc gagtttccgc cctcggatcg caccttccgg gccaccggcc
4861 tgatgcccga gcgctactac agcttcagcg tgacggccca gacacgcctc ggctggggca
4921 aaacggcctc ggtgctggtg tacacgacca acaacaggga ccgtccgcag gcaccgtccg
4981 ggccgcaggt gtcgcgcagc cagatccagg cccatcagat caccttcaac tggacgccgg
5041 gccgcgacgg gttcgccccg ctgcgatact acacggtcga gatgcgggag aacgagggcc
5101 gctggcagcc gctgcccgag cgcgtcgatc ccacactcag ctcgtacacg gccctgggtc
5161 tgcgtccgta caccacctac cagttccgca ttcaggcgac caacgatctg ggcccgtcgg
5221 cgttcagccg agagagcatt gtggtgcgca ccctgcccgc cgccccagcg gtgggtgtgg
5281 ggggactgaa ggtggtgccc ataacgacca cctcggtgcg ggtgcagtgg ggggcgctgg
5341 agacgggcat gtggaacggc gacgcggcca ccgggggata ccgcatactg taccagcagc
5401 tgtcggactt cgcaccggcc ctgcagtcga ccccgaagac ggatgtgatg ggcatcaatg
5461 agaacagcgt ggtgctgtcc gatctgcagc aggaccgcaa ctacgagatc gtggtgctgc
5521 cattcaattc gcagggaccg ggcccggcca caccgccgac cgccgtctat gtgggcgagg
5581 cggtgcccac tggagagccg cggggcgtgg atgccacggc catttccagc acggaggtgc
5641 gcctgagctg gaagccaccg aagcagagca gccagaacgg agagatactc ggctacaaga
5701 tattctattt ggtgacgtgg tcgccgcagg ccctcgagcc gggccgcaaa ttcgaggagg
5761 aaatcgaagt ggtctcggcc acggccacat cgcacagcct ggtctttctc gataagttca
5821 ccgagtaccg catccagttg ctggccttca atccggccgg agacgggccg aggtccgccc
5881 ccgtcactgc gaagacgatg ccgggcgtgc ccagtgcccc gctcaatctg cgcttttcgg
5941 acatcacaat gcagagcctg gaggtgacct gggacccgcc caagctgctc aacggcgaga
6001 ttgttggcta tctggtcacc tacgagacca ccgaggagaa cgaaaagttc agcaagcagg
6061 tgaagcagaa ggtgtccaac accacgctgc gtgtgcagaa tctggaggag gaggtcacct
6121 acaccttcac cgtgcgcgcc cagacgaacg actatggacc ggcggtgagc gcgaatgtga
6181 ccacaggccc ccaggatggc tccccggtgg caccgcgcga tctcacactc acaaagacac
6241 tgtccagcgt tgaggtacat tgggtcaatg gaccctccgg ccggggcccc atactgggct
6301 acctcatcga ggccaagaag cgagacgact cccgctggac taagattgag cagtccagaa
6361 agggtaccat gaaggagttt accgtcagct accacatcct gatgccatcg acggcgtatt
6421 tgttccgggt aattgcttac aataagtatg gcatatcgtt ccctgtttac tcgaaggact
6481 cgatactgac gccctcgaag ctgcatctgg agtacggcta tctgcagcac aagcccttct
6541 acaggcagac ctggttcatg gtctccctgg cggccacctc gatcgtcatc attgtcatgg
6601 tcattgcggt gctctgtgtg aagagcaaga gctacaagta caagcaggag gcacaaaaga
6661 cgctggagga gtccatggcc atgtcgattg atgagcgcca ggagctggcc ctggagctgt
6721 atcgttcgcg tcacggcgtc ggcaccggca ccctgaacag cgttggaaca ttgcgcagcg
6781 gaactttggg aaccctcggc cgtaagtcca ccaaccgaca ccagccggtg agtgtgcatt
6841 tgggtaagag tccaccgcga ccctcgcccg catcggtggc gtaccacagc gatgaggaga
6901 gtctcaagtg ctacgacgag aatcccgacg acagcagtgt tacggaaaag ccatccgagg
6961 tgagcagctc ggaggcatcc cagcactcgg agagcgagaa cgagagcgtg aggagcgatc
7021 cgcactcgtt cgtcaatcac tatgcgaatg tgaatgactc gctgcggcag tcctggaaga
7081 agaccaagcc cgtgcgcaac tactcgagct acacagactc cgagccggag ggcagtgcag
7141 tgatgagtct caatggtggc cagattattg tcaataatat ggccagatcg agggcaccac
7201 tgcccggctt ctcgtcattt gtctgacaat caaccgaatt ctaagatcta tgccgtggta
7261 gcagcagcac cgtcatccgc gagacatttg tctgaattat tttggaaacg ataacggaaa
7321 acggaaaaac ggaggctgaa gctgaaaccg gagctggagt tgcagtgggg agcgttctaa
7381 cgagttcgac acggatgtag cgagtgggct aaactgcctg cctgcctgca actgttctgt
7441 ctggctctcc ctggatcttc gtagctgtcc ggcgaggcgc tgctacatgg atatttatcg
7501 tagtt