PREDICTED: Drosophila obscura protein sidekick (LOC111066347),
LOCUS XM_041591962 7322 bp mRNA linear INV 14-MAY-2021
transcript variant X11, mRNA.
ACCESSION XM_041591962
VERSION XM_041591962.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..7322
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..7322
/gene="LOC111066347"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 8 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 10 samples with support for all annotated
introns"
/db_xref="GeneID:111066347"
CDS 351..7043
/gene="LOC111066347"
/codon_start=1
/product="protein sidekick isoform X11"
/protein_id="XP_041447896.1"
/db_xref="GeneID:111066347"
/translation="MKRDQRRSSASSLRRRRRWCVDVNEKGTRMWLKISLSQPLEASL
FVLAALLLLNADSCSCYADANPQQQQQLVQQQQQQLQAPRFTTHPSSSGSIVSEGSTK
ILQCHALGYPQPTYRWLKDGKSVGEFSSSQFYRFHSTRREDAGSYQCIAKNDAGSIFS
EKSDVVVAYMGIFENVTEGRLTVVSGHPAIFDMPAIESVPTPSVLWQSADGSLNYDIK
YAFTQANQLIILSVDENDRRGYRARAINTQLGKEEISAFVHLNVSGDPYIEVAPEIIV
RPQDVKVKTGTGVLELQCIANARPLHELETIWLKDGLAVDTTGVRHTLNDPWNRTLAL
LQANSSHSGEYTCQVRLRSGGYPTVTASARVQILEPPVFFTPMRAETFGEFGGQVQLP
CDVVGEPTPQVEWFRNAESVEANVQSGRYSLGEDNTLIIKKLILDDSAMFQCLARNEA
GENSASTWLRVKTSAPVFEQPPQNVTALDGKDATISCRAIGSPNPNVTWIYNETQLVE
ISSRVQILESGDLLISNIRATDAGLYICVRANEAGSVKGEALLSVLVRTQIIQPPVDT
IVLLGLTATLQCKVSSDPSVPYNIDWYREGQMAPISNSQRIGVQADGQLEIQAVRASD
VGSYSCVVTSPGGNETRSARLSVIELPFPPSNVRVERLPEPQQRSINVSWTPGFDGNS
PISKFIIQRREVSELGPVPDPLLNWITELSNVSANQRWMLLENLKAATVYQFRVSAVN
RVGEGSPSEPSNVVELPQEAPSGPPVGFVGSARSMSEIITQWQPPQEEHRNGQILGYI
LRYRLFGYNNVPWSYQNITNEAQRNFLIQELITWKDYIVQIAAFNNMGVGVYTEGAKI
KTKEGVPEAPPTNVRVKALNSTAAQITWKPPNPQQINGINQGYKIQAWQRRQLDGEER
DMERRMMTVPPSLIDPLAEQTTVLGGLDKFAKFNVTVLCFTDPGDGVASQLVPVETLD
DVPDEITALHFDDVSDRSVKVLWAPPRFANGILTGYTVRYQVKDRPETMKFFNLTADD
NELTVNQLQATTHYWFEVCAWTRVGSGPPKTATIQSGVEPVLPHAPTTLALSNIEAFS
VVLQFTPGFDGNSSITKWKVEAQTARNMTWFTLCEISDPDAETLTVTGLMPFTQYRLR
LSATNVVGSSRPSDPTKDFQTIQAKPMHAPFNVTVRAMSALQLRVRWIPLQQMEWFGN
PRGYNVTYRQMERTGKPSKHPPRSVMIEDHTANSHVLEGLEEWTLYEVIMNACNDVGC
SLDSGLAMERTREAVPSYGPLHVEANATSSTTVVVRWGEIPPHHRNGQIDGYKVYYAA
TERGMQVLYKTIPNNSSFTTTLTELQKFVVYHVQVLAYTRLGNGALSTPPIRVQTFED
TPGSPSNVSFPDVTFSMARIIWDVPMDPNGEILAYQVTYTLNGSANLNYSREFPPSDR
TFRATGLMPERYYSFSVTAQTRLGWGKTASVLVYTTNNRDRPQAPSGPQVSRSQIQAH
QITFNWTPGRDGFAPLRYYTVEMRENEGRWQPLPERVDPTLSSYTALGLRPYTTYQFR
IQATNDLGPSAFSRESIVVRTLPAAPAVGVGGLKVVPITTTSVRVQWGALETGMWNGD
AATGGYRILYQQLSDFAPALQSTPKTDVMGINENSVVLSDLQQDRNYEIVVLPFNSQG
PGPATPPTAVYVGEAVPTGEPRGVDATAISSTEVRLSWKPPKQSSQNGEILGYKIFYL
VTWSPQALEPGRKFEEEIEVVSATATSHSLVFLDKFTEYRIQLLAFNPAGDGPRSAPV
TAKTMPGVPSAPLNLRFSDITMQSLEVTWDPPKLLNGEIVGYLVTYETTEENEKFSKQ
VKQKVSNTTLRVQNLEEEVTYTFTVRAQTNDYGPAVSANVTTGPQDGSPVAPRDLTLT
KTLSSVEVHWVNGPSGRGPILGYLIEAKKRDDSRWTKIEQSRKGTMKEFTVSYHILMP
STAYLFRVIAYNKYGISFPVYSKDSILTPSKLHLEYGYLQHKPFYRQTWFMVSLAATS
IVIIVMVIAVLCVKSKSYKYKQEAQKTLEESMAMSIDERQELALELYRSRHGVGTGTL
NSVGTLRSGTLGTLGRKSTNRHQPVSVHLGKSPPRPSPASVAYHSDEESLKCYDENPD
DSSVTEKPSEVSSSEASQHSESENESVRSDPHSFVNHYANVNDSLRQSWKKTKPVRNY
SSYTDSEPEGSAVMSLNGGQIIVNNMARSRAPLPGFSSFV"
misc_feature 597..809
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig_3; pfam13927"
/db_xref="CDD:464046"
misc_feature 1164..1451
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig; cl11960"
/db_xref="CDD:472250"
misc_feature 1218..1232
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409353"
misc_feature 1263..1277
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409353"
misc_feature 1338..1352
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409353"
misc_feature 1380..1397
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409353"
misc_feature 1422..1433
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409353"
misc_feature 1461..1736
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig; cl11960"
/db_xref="CDD:472250"
misc_feature 1515..1529
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409543"
misc_feature 1554..1568
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409543"
misc_feature 1632..1643
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409543"
misc_feature 1671..1688
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409543"
misc_feature 1710..1721
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409543"
misc_feature 1746..2009
/gene="LOC111066347"
/note="Immunoglobulin I-set domain; Region: I-set;
pfam07679"
/db_xref="CDD:400151"
misc_feature 1797..1811
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409562"
misc_feature 1836..1850
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409562"
misc_feature 1908..1919
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409562"
misc_feature 1947..1964
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409562"
misc_feature 1986..1997
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409562"
misc_feature 2022..2291
/gene="LOC111066347"
/note="Immunoglobulin I-set domain; Region: I-set;
pfam07679"
/db_xref="CDD:400151"
misc_feature 2070..2084
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409544"
misc_feature 2115..2129
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409544"
misc_feature 2187..2201
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409544"
misc_feature <2229..>3002
/gene="LOC111066347"
/note="Fibronectin type 3 domain [General function
prediction only]; Region: FN3; COG3401"
/db_xref="CDD:442628"
misc_feature 2229..2246
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409544"
misc_feature 2268..2279
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409544"
misc_feature 2301..2612
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(2301..2303,2532..2534,2577..2579)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(2580..2585,2589..2594)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 2946..3260
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(3225..3230,3234..3239)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature order(3273..3275,3468..3470,3513..3515)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature 3276..3527
/gene="LOC111066347"
/note="Fibronectin type III domain; Region: fn3;
pfam00041"
/db_xref="CDD:394996"
misc_feature 3567..3836
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(3567..3569,3765..3767,3810..3812)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(3813..3818,3822..3827)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 3873..4139
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature 4194..4451
/gene="LOC111066347"
/note="Fibronectin type III domain; Region: fn3;
pfam00041"
/db_xref="CDD:394996"
misc_feature <4401..5435
/gene="LOC111066347"
/note="Fibronectin type 3 domain [General function
prediction only]; Region: FN3; COG3401"
/db_xref="CDD:442628"
misc_feature order(4437..4442,4446..4451)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 4782..5069
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(4782..4784,4983..4985,5028..5030)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(5031..5036,5040..5045)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 5409..5714
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature <5637..>6275
/gene="LOC111066347"
/note="Fibronectin type 3 domain [General function
prediction only]; Region: FN3; COG3401"
/db_xref="CDD:442628"
misc_feature order(5679..5684,5688..5693)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
ORIGIN
1 cctgcgcgca taacggttgt tcttgccgac tacgtcgcat cgtcgtcgtc attttcgtcg
61 tcgctcgtag ttcgtggctc tcggtcgctc caacttgctg cggcgcgtgt ttcgaaacac
121 agcagaccag acgaggaaga agtctagaga agcaaatggt tcaataaaga aacaacattg
181 aaagcaggcg caagagaaag tcaacatgaa ttaagaaaaa cgcaagaaaa ttcaaaaaca
241 attaaaattt aatcaaacaa aacaacaaaa aaaccaaaaa caaaattcag aagcaggcga
301 aaagaatcaa cgatatcgaa gaaaaacagc cgcaagagag agagccgaaa atgaagagag
361 accagcggcg atcttcagcg tcgtcgctgc gtcgccgtcg tcgttggtgc gtcgacgtca
421 acgaaaaagg aacacgaatg tggctcaaaa tttcgctgtc gcagccgctg gaagcgtcgc
481 tgtttgtgct ggcagcgctg ctgctgctca atgcggacag ctgctcatgt tacgcggatg
541 ccaatccgca gcaacaacag cagctggtcc agcagcagca gcaacaactt caggcgccac
601 gttttaccac acacccatcg tcatcgggct cgattgtgag cgagggcagc accaagatcc
661 tacagtgcca tgctttgggt tatccacagc cgacatatcg ttggctgaag gacggcaagt
721 ccgtgggcga gttctcatcg agtcagttct atcggttcca cagcacacgg cgcgaggatg
781 cgggcagcta tcagtgcatt gccaagaacg atgccggatc catattcagc gagaagagcg
841 acgttgtagt ggcctacatg ggcatctttg agaacgtcac cgagggacgc ctaactgttg
901 tgagcggaca tccggccatc ttcgatatgc cggccattga gtcggtgcca acgccatcgg
961 tgctgtggca gtcggcggac gggtcgctca actacgacat caagtacgcc ttcacccagg
1021 ccaatcagct gattatactg agcgtggacg agaacgatcg gaggggctac cgggcgcggg
1081 cgatcaacac gcagctgggc aaggaggaga tcagcgcgtt cgttcatctg aatgtcagtg
1141 gcgatccgta catagaggtg gcacccgaga taattgtacg gccgcaggat gtcaaggtca
1201 agaccggcac tggcgtcctc gagctgcagt gcatcgccaa tgcgcgaccc ctgcacgaac
1261 tggagacgat ttggctgaag gacggcctcg ccgtggacac gaccggcgtg cggcacaccc
1321 tcaacgatcc ctggaaccgc accctggccc tcctgcaagc caacagctcg cactccggcg
1381 agtacacctg tcaggtgcgc ctgcgcagcg gtggctatcc aacggtcacc gcctcagccc
1441 gcgtccaaat tctcgagccg cccgtcttct tcacgcccat gcgagcggaa acctttggtg
1501 aatttggcgg ccaggtgcag ctgccctgcg atgtggtggg cgagcccacg ccccaagttg
1561 aatggttccg gaatgcggag tctgtcgagg cgaatgtgca aagcggaaga tactcactgg
1621 gagaggataa tacgctgata attaagaaac taatactgga tgattcggcc atgtttcagt
1681 gcctggcccg aaatgaggcc ggcgagaact cagccagcac ctggctgcgc gtcaaaacct
1741 cagcgccggt ctttgagcag ccgccccaga atgtgaccgc cctggatggc aaggatgcga
1801 cgatctcctg tcgggccatt ggctcgccca atcccaatgt tacctggatc tacaatgaaa
1861 cccaactggt tgagatatcc agtcgcgttc agatactcga atcgggtgat ttactcatct
1921 cgaatatccg tgccacggac gcgggactct acatctgtgt gcgggccaac gaggcgggca
1981 gcgtcaaggg cgaggccttg ctaagcgtgt tagtgcggac acaaatcata cagccgccag
2041 tggacaccat cgtgctgctg ggcctgaccg cgacactgca gtgcaaggtg tccagcgacc
2101 cgagcgtgcc ctacaacatc gactggtacc gggagggcca aatggcgccc atcagcaact
2161 cgcagcggat tggagtgcag gcggacgggc agctggagat ccaggcggtg cgggccagcg
2221 atgtgggcag ctattcgtgc gtggttacat cgccgggcgg caatgagaca cggtcggccc
2281 gtctcagtgt catcgagctg cccttcccgc ccagcaacgt gcgggtggag cgtctgccag
2341 agccgcagca gcgcagcatc aatgtgtcct ggacgcccgg attcgatggc aacagtccaa
2401 tctccaaatt tattatccag cgacgtgagg tctctgaatt gggtccagtt ccagatcccc
2461 ttctcaattg gatcaccgaa ctgagcaacg tatcggccaa tcagcggtgg atgctgctgg
2521 agaacctcaa ggcggccacc gtctatcagt ttcgtgtcag tgccgtcaat cgggtcggcg
2581 agggctcccc ctcggagccc agcaatgttg ttgagctgcc ccaagaagct ccttcgggac
2641 cgcctgtggg ctttgtgggc tcggcacggt ccatgtccga gatcattacg cagtggcagc
2701 cgccgcagga ggagcatcgc aacggacaga tcctgggcta cattctgcgc tatcgcctgt
2761 tcgggtacaa caatgtgccg tggtcctacc agaacatcac caacgaggcg cagcgcaact
2821 ttctgatcca ggagctgatc acgtggaagg actacatcgt gcagattgcg gccttcaaca
2881 acatgggcgt gggcgtctac acggaggggg ccaagatcaa gaccaaggag ggtgtgcccg
2941 aggcaccgcc caccaacgtc agggtgaagg ccctcaactc gacggcggcg cagatcacgt
3001 ggaagccgcc gaatccgcag cagatcaacg gcatcaacca gggctacaag atccaggcat
3061 ggcagcgacg gcagctcgat ggggaggagc gggacatgga gcggcgcatg atgacggtgc
3121 cgcccagcct gatcgatcca ctggccgagc agacgacggt gctcggtggc ctggacaagt
3181 tcgccaagtt caatgtgacc gtactctgct tcaccgatcc cggtgacggt gtggccagcc
3241 agctggtgcc ggtggagact ttggacgacg tgcccgacga gataacggcc ctgcactttg
3301 acgatgtctc cgatcggtcc gtcaaagtgc tgtgggcgcc gccgcgcttc gccaacggca
3361 tcctcaccgg ctacacggtg cgctaccagg tcaaggatcg ccccgagacg atgaagttct
3421 tcaacctgac cgccgacgac aacgagctga cggtgaacca gctgcaggcg acgacccact
3481 actggttcga ggtgtgcgcc tggacgcggg tgggcagcgg gccgcccaag acggcgacga
3541 tccaatcggg cgtggagccg gtgctgccgc atgcgcccac cacactggcc ctgtccaaca
3601 tcgaagcgtt ttcggtggtg ctgcagttca cgcccggctt cgacggcaac tcgagcatca
3661 ccaagtggaa ggtggaggcg cagacggccc gcaacatgac ctggttcacg ctctgtgaaa
3721 tcagcgatcc cgatgcggag accctcaccg tgaccggcct gatgcccttc acccagtacc
3781 ggctgcggct gagcgccacc aatgtggtgg gcagctcccg gccctcggac cccaccaagg
3841 actttcaaac cattcaggcc aagccgatgc acgccccctt caatgtgacg gtacgcgcaa
3901 tgagcgccct gcagctgcgc gtccgctgga taccgctgca gcagatggag tggttcggca
3961 atccgcgcgg ctacaatgtc acctaccggc aaatggagcg caccggcaag ccctccaagc
4021 acccgccccg ctccgtgatg atcgaggatc acacggccaa ctcgcatgtg ctcgaggggc
4081 tcgaggagtg gaccctctac gaagtgatca tgaacgcctg caacgatgtg ggctgctcgc
4141 tggacagcgg cctggccatg gagcgcacca gggaagcggt gcccagctac ggcccgctgc
4201 atgtggaggc gaacgccacc tcctcgacga cggtggtggt gcgctggggc gagataccgc
4261 cccaccatcg caacggccag atcgatggct acaaggtgta ctacgcggcc accgagcgcg
4321 gcatgcaggt gctctacaag acgataccca acaacagctc cttcaccacc accctcaccg
4381 agctgcagaa gtttgtggtg taccacgtcc aggtgctggc ctacacgcgg ctcggcaacg
4441 gcgccctcag caccccgccc atccgggtgc agacgttcga ggacacgccc ggatcaccgt
4501 ccaatgtgag cttcccggac gtcaccttct cgatggcgcg catcatctgg gacgtgccga
4561 tggaccccaa tggcgagata ctcgcctacc aggtcaccta cacgctcaac ggaagcgcca
4621 atctgaacta cagccgcgag tttccgccct cggatcgcac cttccgggcc accggcctga
4681 tgcccgagcg ctactacagc ttcagcgtga cggcccagac acgcctcggc tggggcaaaa
4741 cggcctcggt gctggtgtac acgaccaaca acagggaccg tccgcaggca ccgtccgggc
4801 cgcaggtgtc gcgcagccag atccaggccc atcagatcac cttcaactgg acgccgggcc
4861 gcgacgggtt cgccccgctg cgatactaca cggtcgagat gcgggagaac gagggccgct
4921 ggcagccgct gcccgagcgc gtcgatccca cactcagctc gtacacggcc ctgggtctgc
4981 gtccgtacac cacctaccag ttccgcattc aggcgaccaa cgatctgggc ccgtcggcgt
5041 tcagccgaga gagcattgtg gtgcgcaccc tgcccgccgc cccagcggtg ggtgtggggg
5101 gactgaaggt ggtgcccata acgaccacct cggtgcgggt gcagtggggg gcgctggaga
5161 cgggcatgtg gaacggcgac gcggccaccg ggggataccg catactgtac cagcagctgt
5221 cggacttcgc accggccctg cagtcgaccc cgaagacgga tgtgatgggc atcaatgaga
5281 acagcgtggt gctgtccgat ctgcagcagg accgcaacta cgagatcgtg gtgctgccat
5341 tcaattcgca gggaccgggc ccggccacac cgccgaccgc cgtctatgtg ggcgaggcgg
5401 tgcccactgg agagccgcgg ggcgtggatg ccacggccat ttccagcacg gaggtgcgcc
5461 tgagctggaa gccaccgaag cagagcagcc agaacggaga gatactcggc tacaagatat
5521 tctatttggt gacgtggtcg ccgcaggccc tcgagccggg ccgcaaattc gaggaggaaa
5581 tcgaagtggt ctcggccacg gccacatcgc acagcctggt ctttctcgat aagttcaccg
5641 agtaccgcat ccagttgctg gccttcaatc cggccggaga cgggccgagg tccgcccccg
5701 tcactgcgaa gacgatgccg ggcgtgccca gtgccccgct caatctgcgc ttttcggaca
5761 tcacaatgca gagcctggag gtgacctggg acccgcccaa gctgctcaac ggcgagattg
5821 ttggctatct ggtcacctac gagaccaccg aggagaacga aaagttcagc aagcaggtga
5881 agcagaaggt gtccaacacc acgctgcgtg tgcagaatct ggaggaggag gtcacctaca
5941 ccttcaccgt gcgcgcccag acgaacgact atggaccggc ggtgagcgcg aatgtgacca
6001 caggccccca ggatggctcc ccggtggcac cgcgcgatct cacactcaca aagacactgt
6061 ccagcgttga ggtacattgg gtcaatggac cctccggccg gggccccata ctgggctacc
6121 tcatcgaggc caagaagcga gacgactccc gctggactaa gattgagcag tccagaaagg
6181 gtaccatgaa ggagtttacc gtcagctacc acatcctgat gccatcgacg gcgtatttgt
6241 tccgggtaat tgcttacaat aagtatggca tatcgttccc tgtttactcg aaggactcga
6301 tactgacgcc ctcgaagctg catctggagt acggctatct gcagcacaag cccttctaca
6361 ggcagacctg gttcatggtc tccctggcgg ccacctcgat cgtcatcatt gtcatggtca
6421 ttgcggtgct ctgtgtgaag agcaagagct acaagtacaa gcaggaggca caaaagacgc
6481 tggaggagtc catggccatg tcgattgatg agcgccagga gctggccctg gagctgtatc
6541 gttcgcgtca cggcgtcggc accggcaccc tgaacagcgt tggaacattg cgcagcggaa
6601 ctttgggaac cctcggccgt aagtccacca accgacacca gccggtgagt gtgcatttgg
6661 gtaagagtcc accgcgaccc tcgcccgcat cggtggcgta ccacagcgat gaggagagtc
6721 tcaagtgcta cgacgagaat cccgacgaca gcagtgttac ggaaaagcca tccgaggtga
6781 gcagctcgga ggcatcccag cactcggaga gcgagaacga gagcgtgagg agcgatccgc
6841 actcgttcgt caatcactat gcgaatgtga atgactcgct gcggcagtcc tggaagaaga
6901 ccaagcccgt gcgcaactac tcgagctaca cagactccga gccggagggc agtgcagtga
6961 tgagtctcaa tggtggccag attattgtca ataatatggc cagatcgagg gcaccactgc
7021 ccggcttctc gtcatttgtc tgacaatcaa ccgaattcta agatctatgc cgtggtagca
7081 gcagcaccgt catccgcgag acatttgtct gaattatttt ggaaacgata acggaaaacg
7141 gaaaaacgga ggctgaagct gaaaccggag ctggagttgc agtggggagc gttctaacga
7201 gttcgacacg gatgtagcga gtgggctaaa ctgcctgcct gcctgcaact gttctgtctg
7261 gctctccctg gatcttcgta gctgtccggc gaggcgctgc tacatggata tttatcgtag
7321 tt