PREDICTED: Drosophila obscura protein sidekick (LOC111066347),
LOCUS XM_041591952 7574 bp mRNA linear INV 14-MAY-2021
transcript variant X2, mRNA.
ACCESSION XM_041591952
VERSION XM_041591952.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..7574
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..7574
/gene="LOC111066347"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 6 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 3 samples with support for all annotated
introns"
/db_xref="GeneID:111066347"
CDS 351..7295
/gene="LOC111066347"
/codon_start=1
/product="protein sidekick isoform X2"
/protein_id="XP_041447886.1"
/db_xref="GeneID:111066347"
/translation="MKRDQRRSSASSLRRRRRWCVDVNEKGTRMWLKISLSQPLEASL
FVLAALLLLNADSCSCYADANPQQQQQLVQQQQQQLQAPRFTTHPSSSGSIVSEGSTK
ILQCHALGYPQPTYRWLKDGKSVGEFSSSQFYRFHSTRREDAGSYQCIAKNDAGSIFS
EKSDVVVAYMGIFENVTEGRLTVVSGHPAIFDMPAIESVPTPSVLWQSADGSLNYDIK
YAFTQANQLIILSVDENDRRGYRARAINTQLGKEEISAFVHLNVSGDPYIEVAPEIIV
RPQDVKVKTGTGVLELQCIANARPLHELETIWLKDGLAVDTTGVRHTLNDPWNRTLAL
LQANSSHSGEYTCQVRLRSGGYPTVTASARVQILEPPVFFTPMRAETFGEFGGQVQLP
CDVVGEPTPQVEWFRNAESVEANVQSGRYSLGEDNTLIIKKLILDDSAMFQCLARNEA
GENSASTWLRVKTETETEAANSRIKRLAQPRILRVRASHGGSGTVAGSVTGSGSGSGS
TSNSHQQGRRKQFRFASAPVFEQPPQNVTALDGKDATISCRAIGSPNPNVTWIYNETQ
LVEISSRVQILESGDLLISNIRATDAGLYICVRANEAGSVKGEALLSVLVRTQIIQPP
VDTIVLLGLTATLQCKVSSDPSVPYNIDWYREGQMAPISNSQRIGVQADGQLEIQAVR
ASDVGSYSCVVTSPGGNETRSARLSVIELPFPPSNVRVERLPEPQQRSINVSWTPGFD
GNSPISKFIIQRREVSELGPVPDPLLNWITELSNVSANQRWMLLENLKAATVYQFRVS
AVNRVGEGSPSEPSNVVELPQEEFPLYRAPSGPPVGFVGSARSMSEIITQWQPPQEEH
RNGQILGYILRYRLFGYNNVPWSYQNITNEAQRNFLIQELITWKDYIVQIAAFNNMGV
GVYTEGAKIKTKEGVPEAPPTNVRVKALNSTAAQITWKPPNPQQINGINQGYKIQAWQ
RRQLDGEERDMERRMMTVPPSLIDPLAEQTTVLGGLDKFAKFNVTVLCFTDPGDGVAS
QLVPVETLDDVPDEITALHFDDVSDRSVKVLWAPPRFANGILTGYTVRYQVKDRPETM
KFFNLTADDNELTVNQLQATTHYWFEVCAWTRVGSGPPKTATIQSGVEPVLPHAPTTL
ALSNIEAFSVVLQFTPGFDGNSSITKWKVEAQTARNMTWFTLCEISDPDAETLTVTGL
MPFTQYRLRLSATNVVGSSRPSDPTKDFQTIQAKPMHAPFNVTVRAMSALQLRVRWIP
LQQMEWFGNPRGYNVTYRQMERTGKPSKHPPRSVMIEDHTANSHVLEGLEEWTLYEVI
MNACNDVGCSLDSGLAMERTREAVPSYGPLHVEANATSSTTVVVRWGEIPPHHRNGQI
DGYKVYYAATERGMQVLYKTIPNNSSFTTTLTELQKFVVYHVQVLAYTRLGNGALSTP
PIRVQTFEDTPGSPSNVSFPDVTFSMARIIWDVPMDPNGEILAYQVTYTLNGSANLNY
SREFPPSDRTFRATGLMPERYYSFSVTAQTRLGWGKTASVLVYTTNNRDRPQAPSGPQ
VSRSQIQAHQITFNWTPGRDGFAPLRYYTVEMRENEGRWQPLPERVDPTLSSYTALGL
RPYTTYQFRIQATNDLGPSAFSRESIVVRTLPAAPAVGVGGLKVVPITTTSVRVQWGA
LETGMWNGDAATGGYRILYQQLSDFAPALQSTPKTDVMGINENSVVLSDLQQDRNYEI
VVLPFNSQGPGPATPPTAVYVGEAVPTGEPRGVDATAISSTEVRLSWKPPKQSSQNGE
ILGYKIFYLVTWSPQALEPGRKFEEEIEVVSATATSHSLVFLDKFTEYRIQLLAFNPA
GDGPRSAPVTAKTMPGVPSAPLNLRFSDITMQSLEVTWDPPKLLNGEIVGYLVTYETT
EENEKFSKQVKQKVSNTTLRVQNLEEEVTYTFTVRAQTNDYGPAVSANVTTGPQDGSP
VAPRDLTLTKTLSSVEVHWVNGPSGRGPILGYLIEAKKRENGEPSFISNRPPYLRLDD
SRWTKIEQSRKGTMKEFTVSYHILMPSTAYLFRVIAYNKYGISFPVYSKDSILTPSKL
HLEYGYLQHKPFYRQTWFMVSLAATSIVIIVMVIAVLCVKSKSYKYKQEAQKTLEESM
AMSIDERQELALELYRSRHGVGTGTLNSVGTLRSGTLGTLGRKSTNRHQPVSVHLGKS
PPRPSPASVAYHSDEESLKCYDENPDDSSVTEKPSEVSSSEASQHSESENESVRSDPH
SFVNHYANVNDSLRQSWKKTKPVRNYSSYTDSEPEGSAVMSLNGGQIIVNNMARSRAP
LPGFSSFV"
misc_feature 597..809
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig_3; pfam13927"
/db_xref="CDD:464046"
misc_feature 1164..1451
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig; cl11960"
/db_xref="CDD:472250"
misc_feature 1218..1232
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409353"
misc_feature 1263..1277
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409353"
misc_feature 1338..1352
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409353"
misc_feature 1380..1397
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409353"
misc_feature 1422..1433
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409353"
misc_feature 1461..1736
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig; cl11960"
/db_xref="CDD:472250"
misc_feature 1515..1529
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409543"
misc_feature 1554..1568
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409543"
misc_feature 1632..1643
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409543"
misc_feature 1671..1688
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409543"
misc_feature 1710..1721
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409543"
misc_feature 1929..2192
/gene="LOC111066347"
/note="Immunoglobulin I-set domain; Region: I-set;
pfam07679"
/db_xref="CDD:400151"
misc_feature 1980..1994
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409562"
misc_feature 2019..2033
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409562"
misc_feature 2091..2102
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409562"
misc_feature 2130..2147
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409562"
misc_feature 2169..2180
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409562"
misc_feature 2205..2474
/gene="LOC111066347"
/note="Immunoglobulin I-set domain; Region: I-set;
pfam07679"
/db_xref="CDD:400151"
misc_feature 2253..2267
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409544"
misc_feature 2298..2312
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409544"
misc_feature 2370..2384
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409544"
misc_feature <2412..>3203
/gene="LOC111066347"
/note="Fibronectin type 3 domain [General function
prediction only]; Region: FN3; COG3401"
/db_xref="CDD:442628"
misc_feature 2412..2429
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409544"
misc_feature 2451..2462
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409544"
misc_feature 2484..2795
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(2484..2486,2715..2717,2760..2762)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(2763..2768,2772..2777)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 3147..3461
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(3426..3431,3435..3440)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature order(3474..3476,3669..3671,3714..3716)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature 3477..3728
/gene="LOC111066347"
/note="Fibronectin type III domain; Region: fn3;
pfam00041"
/db_xref="CDD:394996"
misc_feature 3768..4037
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(3768..3770,3966..3968,4011..4013)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(4014..4019,4023..4028)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 4074..4340
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature 4395..4652
/gene="LOC111066347"
/note="Fibronectin type III domain; Region: fn3;
pfam00041"
/db_xref="CDD:394996"
misc_feature 4614..6269
/gene="LOC111066347"
/note="Fibronectin type 3 domain [General function
prediction only]; Region: FN3; COG3401"
/db_xref="CDD:442628"
misc_feature order(4638..4643,4647..4652)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 6222..6542
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(6222..6224,6474..6476,6519..6521)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(6522..6527,6531..6536)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
ORIGIN
1 cctgcgcgca taacggttgt tcttgccgac tacgtcgcat cgtcgtcgtc attttcgtcg
61 tcgctcgtag ttcgtggctc tcggtcgctc caacttgctg cggcgcgtgt ttcgaaacac
121 agcagaccag acgaggaaga agtctagaga agcaaatggt tcaataaaga aacaacattg
181 aaagcaggcg caagagaaag tcaacatgaa ttaagaaaaa cgcaagaaaa ttcaaaaaca
241 attaaaattt aatcaaacaa aacaacaaaa aaaccaaaaa caaaattcag aagcaggcga
301 aaagaatcaa cgatatcgaa gaaaaacagc cgcaagagag agagccgaaa atgaagagag
361 accagcggcg atcttcagcg tcgtcgctgc gtcgccgtcg tcgttggtgc gtcgacgtca
421 acgaaaaagg aacacgaatg tggctcaaaa tttcgctgtc gcagccgctg gaagcgtcgc
481 tgtttgtgct ggcagcgctg ctgctgctca atgcggacag ctgctcatgt tacgcggatg
541 ccaatccgca gcaacaacag cagctggtcc agcagcagca gcaacaactt caggcgccac
601 gttttaccac acacccatcg tcatcgggct cgattgtgag cgagggcagc accaagatcc
661 tacagtgcca tgctttgggt tatccacagc cgacatatcg ttggctgaag gacggcaagt
721 ccgtgggcga gttctcatcg agtcagttct atcggttcca cagcacacgg cgcgaggatg
781 cgggcagcta tcagtgcatt gccaagaacg atgccggatc catattcagc gagaagagcg
841 acgttgtagt ggcctacatg ggcatctttg agaacgtcac cgagggacgc ctaactgttg
901 tgagcggaca tccggccatc ttcgatatgc cggccattga gtcggtgcca acgccatcgg
961 tgctgtggca gtcggcggac gggtcgctca actacgacat caagtacgcc ttcacccagg
1021 ccaatcagct gattatactg agcgtggacg agaacgatcg gaggggctac cgggcgcggg
1081 cgatcaacac gcagctgggc aaggaggaga tcagcgcgtt cgttcatctg aatgtcagtg
1141 gcgatccgta catagaggtg gcacccgaga taattgtacg gccgcaggat gtcaaggtca
1201 agaccggcac tggcgtcctc gagctgcagt gcatcgccaa tgcgcgaccc ctgcacgaac
1261 tggagacgat ttggctgaag gacggcctcg ccgtggacac gaccggcgtg cggcacaccc
1321 tcaacgatcc ctggaaccgc accctggccc tcctgcaagc caacagctcg cactccggcg
1381 agtacacctg tcaggtgcgc ctgcgcagcg gtggctatcc aacggtcacc gcctcagccc
1441 gcgtccaaat tctcgagccg cccgtcttct tcacgcccat gcgagcggaa acctttggtg
1501 aatttggcgg ccaggtgcag ctgccctgcg atgtggtggg cgagcccacg ccccaagttg
1561 aatggttccg gaatgcggag tctgtcgagg cgaatgtgca aagcggaaga tactcactgg
1621 gagaggataa tacgctgata attaagaaac taatactgga tgattcggcc atgtttcagt
1681 gcctggcccg aaatgaggcc ggcgagaact cagccagcac ctggctgcgc gtcaaaacag
1741 aaacagaaac agaagcagcc aacagccgca tcaagcgctt ggcccagcca cgcatcttaa
1801 gagtaagagc ttcgcatggt ggttcgggaa cggtcgcggg atcggtcacg ggatcgggat
1861 caggatctgg ttccacctcc aactcccatc agcagggtag acgcaagcag tttcgctttg
1921 cctcagcgcc ggtctttgag cagccgcccc agaatgtgac cgccctggat ggcaaggatg
1981 cgacgatctc ctgtcgggcc attggctcgc ccaatcccaa tgttacctgg atctacaatg
2041 aaacccaact ggttgagata tccagtcgcg ttcagatact cgaatcgggt gatttactca
2101 tctcgaatat ccgtgccacg gacgcgggac tctacatctg tgtgcgggcc aacgaggcgg
2161 gcagcgtcaa gggcgaggcc ttgctaagcg tgttagtgcg gacacaaatc atacagccgc
2221 cagtggacac catcgtgctg ctgggcctga ccgcgacact gcagtgcaag gtgtccagcg
2281 acccgagcgt gccctacaac atcgactggt accgggaggg ccaaatggcg cccatcagca
2341 actcgcagcg gattggagtg caggcggacg ggcagctgga gatccaggcg gtgcgggcca
2401 gcgatgtggg cagctattcg tgcgtggtta catcgccggg cggcaatgag acacggtcgg
2461 cccgtctcag tgtcatcgag ctgcccttcc cgcccagcaa cgtgcgggtg gagcgtctgc
2521 cagagccgca gcagcgcagc atcaatgtgt cctggacgcc cggattcgat ggcaacagtc
2581 caatctccaa atttattatc cagcgacgtg aggtctctga attgggtcca gttccagatc
2641 cccttctcaa ttggatcacc gaactgagca acgtatcggc caatcagcgg tggatgctgc
2701 tggagaacct caaggcggcc accgtctatc agtttcgtgt cagtgccgtc aatcgggtcg
2761 gcgagggctc cccctcggag cccagcaatg ttgttgagct gccccaagaa gagtttccgc
2821 tttatcgagc tccttcggga ccgcctgtgg gctttgtggg ctcggcacgg tccatgtccg
2881 agatcattac gcagtggcag ccgccgcagg aggagcatcg caacggacag atcctgggct
2941 acattctgcg ctatcgcctg ttcgggtaca acaatgtgcc gtggtcctac cagaacatca
3001 ccaacgaggc gcagcgcaac tttctgatcc aggagctgat cacgtggaag gactacatcg
3061 tgcagattgc ggccttcaac aacatgggcg tgggcgtcta cacggagggg gccaagatca
3121 agaccaagga gggtgtgccc gaggcaccgc ccaccaacgt cagggtgaag gccctcaact
3181 cgacggcggc gcagatcacg tggaagccgc cgaatccgca gcagatcaac ggcatcaacc
3241 agggctacaa gatccaggca tggcagcgac ggcagctcga tggggaggag cgggacatgg
3301 agcggcgcat gatgacggtg ccgcccagcc tgatcgatcc actggccgag cagacgacgg
3361 tgctcggtgg cctggacaag ttcgccaagt tcaatgtgac cgtactctgc ttcaccgatc
3421 ccggtgacgg tgtggccagc cagctggtgc cggtggagac tttggacgac gtgcccgacg
3481 agataacggc cctgcacttt gacgatgtct ccgatcggtc cgtcaaagtg ctgtgggcgc
3541 cgccgcgctt cgccaacggc atcctcaccg gctacacggt gcgctaccag gtcaaggatc
3601 gccccgagac gatgaagttc ttcaacctga ccgccgacga caacgagctg acggtgaacc
3661 agctgcaggc gacgacccac tactggttcg aggtgtgcgc ctggacgcgg gtgggcagcg
3721 ggccgcccaa gacggcgacg atccaatcgg gcgtggagcc ggtgctgccg catgcgccca
3781 ccacactggc cctgtccaac atcgaagcgt tttcggtggt gctgcagttc acgcccggct
3841 tcgacggcaa ctcgagcatc accaagtgga aggtggaggc gcagacggcc cgcaacatga
3901 cctggttcac gctctgtgaa atcagcgatc ccgatgcgga gaccctcacc gtgaccggcc
3961 tgatgccctt cacccagtac cggctgcggc tgagcgccac caatgtggtg ggcagctccc
4021 ggccctcgga ccccaccaag gactttcaaa ccattcaggc caagccgatg cacgccccct
4081 tcaatgtgac ggtacgcgca atgagcgccc tgcagctgcg cgtccgctgg ataccgctgc
4141 agcagatgga gtggttcggc aatccgcgcg gctacaatgt cacctaccgg caaatggagc
4201 gcaccggcaa gccctccaag cacccgcccc gctccgtgat gatcgaggat cacacggcca
4261 actcgcatgt gctcgagggg ctcgaggagt ggaccctcta cgaagtgatc atgaacgcct
4321 gcaacgatgt gggctgctcg ctggacagcg gcctggccat ggagcgcacc agggaagcgg
4381 tgcccagcta cggcccgctg catgtggagg cgaacgccac ctcctcgacg acggtggtgg
4441 tgcgctgggg cgagataccg ccccaccatc gcaacggcca gatcgatggc tacaaggtgt
4501 actacgcggc caccgagcgc ggcatgcagg tgctctacaa gacgataccc aacaacagct
4561 ccttcaccac caccctcacc gagctgcaga agtttgtggt gtaccacgtc caggtgctgg
4621 cctacacgcg gctcggcaac ggcgccctca gcaccccgcc catccgggtg cagacgttcg
4681 aggacacgcc cggatcaccg tccaatgtga gcttcccgga cgtcaccttc tcgatggcgc
4741 gcatcatctg ggacgtgccg atggacccca atggcgagat actcgcctac caggtcacct
4801 acacgctcaa cggaagcgcc aatctgaact acagccgcga gtttccgccc tcggatcgca
4861 ccttccgggc caccggcctg atgcccgagc gctactacag cttcagcgtg acggcccaga
4921 cacgcctcgg ctggggcaaa acggcctcgg tgctggtgta cacgaccaac aacagggacc
4981 gtccgcaggc accgtccggg ccgcaggtgt cgcgcagcca gatccaggcc catcagatca
5041 ccttcaactg gacgccgggc cgcgacgggt tcgccccgct gcgatactac acggtcgaga
5101 tgcgggagaa cgagggccgc tggcagccgc tgcccgagcg cgtcgatccc acactcagct
5161 cgtacacggc cctgggtctg cgtccgtaca ccacctacca gttccgcatt caggcgacca
5221 acgatctggg cccgtcggcg ttcagccgag agagcattgt ggtgcgcacc ctgcccgccg
5281 ccccagcggt gggtgtgggg ggactgaagg tggtgcccat aacgaccacc tcggtgcggg
5341 tgcagtgggg ggcgctggag acgggcatgt ggaacggcga cgcggccacc gggggatacc
5401 gcatactgta ccagcagctg tcggacttcg caccggccct gcagtcgacc ccgaagacgg
5461 atgtgatggg catcaatgag aacagcgtgg tgctgtccga tctgcagcag gaccgcaact
5521 acgagatcgt ggtgctgcca ttcaattcgc agggaccggg cccggccaca ccgccgaccg
5581 ccgtctatgt gggcgaggcg gtgcccactg gagagccgcg gggcgtggat gccacggcca
5641 tttccagcac ggaggtgcgc ctgagctgga agccaccgaa gcagagcagc cagaacggag
5701 agatactcgg ctacaagata ttctatttgg tgacgtggtc gccgcaggcc ctcgagccgg
5761 gccgcaaatt cgaggaggaa atcgaagtgg tctcggccac ggccacatcg cacagcctgg
5821 tctttctcga taagttcacc gagtaccgca tccagttgct ggccttcaat ccggccggag
5881 acgggccgag gtccgccccc gtcactgcga agacgatgcc gggcgtgccc agtgccccgc
5941 tcaatctgcg cttttcggac atcacaatgc agagcctgga ggtgacctgg gacccgccca
6001 agctgctcaa cggcgagatt gttggctatc tggtcaccta cgagaccacc gaggagaacg
6061 aaaagttcag caagcaggtg aagcagaagg tgtccaacac cacgctgcgt gtgcagaatc
6121 tggaggagga ggtcacctac accttcaccg tgcgcgccca gacgaacgac tatggaccgg
6181 cggtgagcgc gaatgtgacc acaggccccc aggatggctc cccggtggca ccgcgcgatc
6241 tcacactcac aaagacactg tccagcgttg aggtacattg ggtcaatgga ccctccggcc
6301 ggggccccat actgggctac ctcatcgagg ccaagaagcg agaaaatgga gagccctcat
6361 ttatttctaa tagacctccc tatcttcgct tagacgactc ccgctggact aagattgagc
6421 agtccagaaa gggtaccatg aaggagttta ccgtcagcta ccacatcctg atgccatcga
6481 cggcgtattt gttccgggta attgcttaca ataagtatgg catatcgttc cctgtttact
6541 cgaaggactc gatactgacg ccctcgaagc tgcatctgga gtacggctat ctgcagcaca
6601 agcccttcta caggcagacc tggttcatgg tctccctggc ggccacctcg atcgtcatca
6661 ttgtcatggt cattgcggtg ctctgtgtga agagcaagag ctacaagtac aagcaggagg
6721 cacaaaagac gctggaggag tccatggcca tgtcgattga tgagcgccag gagctggccc
6781 tggagctgta tcgttcgcgt cacggcgtcg gcaccggcac cctgaacagc gttggaacat
6841 tgcgcagcgg aactttggga accctcggcc gtaagtccac caaccgacac cagccggtga
6901 gtgtgcattt gggtaagagt ccaccgcgac cctcgcccgc atcggtggcg taccacagcg
6961 atgaggagag tctcaagtgc tacgacgaga atcccgacga cagcagtgtt acggaaaagc
7021 catccgaggt gagcagctcg gaggcatccc agcactcgga gagcgagaac gagagcgtga
7081 ggagcgatcc gcactcgttc gtcaatcact atgcgaatgt gaatgactcg ctgcggcagt
7141 cctggaagaa gaccaagccc gtgcgcaact actcgagcta cacagactcc gagccggagg
7201 gcagtgcagt gatgagtctc aatggtggcc agattattgt caataatatg gccagatcga
7261 gggcaccact gcccggcttc tcgtcatttg tctgacaatc aaccgaattc taagatctat
7321 gccgtggtag cagcagcacc gtcatccgcg agacatttgt ctgaattatt ttggaaacga
7381 taacggaaaa cggaaaaacg gaggctgaag ctgaaaccgg agctggagtt gcagtgggga
7441 gcgttctaac gagttcgaca cggatgtagc gagtgggcta aactgcctgc ctgcctgcaa
7501 ctgttctgtc tggctctccc tggatcttcg tagctgtccg gcgaggcgct gctacatgga
7561 tatttatcgt agtt