LOCUS XM_041591954 7562 bp mRNA linear INV 14-MAY-2021
DEFINITION PREDICTED: Drosophila obscura protein sidekick (LOC111066347),
transcript variant X4, mRNA.
ACCESSION XM_041591954
VERSION XM_041591954.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..7562
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..7562
/gene="LOC111066347"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 6 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 3 samples with support for all annotated
introns"
/db_xref="GeneID:111066347"
CDS 351..7283
/gene="LOC111066347"
/codon_start=1
/product="protein sidekick isoform X4"
/protein_id="XP_041447888.1"
/db_xref="GeneID:111066347"
/translation="MKRDQRRSSASSLRRRRRWCVDVNEKGTRMWLKISLSQPLEASL
FVLAALLLLNADSCSCYADANPQQQQQLVQQQQQQLQAPRFTTHPSSSGSIVSEGSTK
ILQCHALGYPQPTYRWLKDGKSVGEFSSSQFYRFHSTRREDAGSYQCIAKNDAGSIFS
EKSDVVVAYMGIFENVTEGRLTVVSGHPAIFDMPAIESVPTPSVLWQSADGSLNYDIK
YAFTQANQLIILSVDENDRRGYRARAINTQLGKEEISAFVHLNVSGDPYIEVAPEIIV
RPQDVKVKTGTGVLELQCIANARPLHELETIWLKDGLAVDTTGVRHTLNDPWNRTLAL
LQANSSHSGEYTCQVRLRSGGYPTVTASARVQILEPPVFFTPMRAETFGEFGGQVQLP
CDVVGEPTPQVEWFRNAESVEANVQSGRYSLGEDNTLIIKKLILDDSAMFQCLARNEA
GENSASTWLRVKTETETEAANSRIKRLAQPRILRVRASHGGSGTVAGSVTGSGSGSGS
TSNSHQQGRRKQFRFASAPVFEQPPQNVTALDGKDATISCRAIGSPNPNVTWIYNETQ
LVEISSRVQILESGDLLISNIRATDAGLYICVRANEAGSVKGEALLSVLVRTQIIQPP
VDTIVLLGLTATLQCKVSSDPSVPYNIDWYREGQMAPISNSQRIGVQADGQLEIQAVR
ASDVGSYSCVVTSPGGNETRSARLSVIELPFPPSNVRVERLPEPQQRSINVSWTPGFD
GNSPISKFIIQRREVSELEKFVGPVPDPLLNWITELSNVSANQRWMLLENLKAATVYQ
FRVSAVNRVGEGSPSEPSNVVELPQEEFPLYRAPSGPPVGFVGSARSMSEIITQWQPP
QEEHRNGQILGYILRYRLFGYNNVPWSYQNITNEAQRNFLIQELITWKDYIVQIAAFN
NMGVGVYTEGAKIKTKEGVPEAPPTNVRVKALNSTAAQITWKPPNPQQINGINQGYKI
QAWQRRQLDGEERDMERRMMTVPPSLIDPLAEQTTVLGGLDKFAKFNVTVLCFTDPGD
GVASQLVPVETLDDVPDEITALHFDDVSDRSVKVLWAPPRFANGILTGYTVRYQVKDR
PETMKFFNLTADDNELTVNQLQATTHYWFEVCAWTRVGSGPPKTATIQSGVEPVLPHA
PTTLALSNIEAFSVVLQFTPGFDGNSSITKWKVEAQTARNMTWFTLCEISDPDAETLT
VTGLMPFTQYRLRLSATNVVGSSRPSDPTKDFQTIQAKPMHAPFNVTVRAMSALQLRV
RWIPLQQMEWFGNPRGYNVTYRQMERTGKPSKHPPRSVMIEDHTANSHVLEGLEEWTL
YEVIMNACNDVGCSLDSGLAMERTREAVPSYGPLHVEANATSSTTVVVRWGEIPPHHR
NGQIDGYKVYYAATERGMQVLYKTIPNNSSFTTTLTELQKFVVYHVQVLAYTRLGNGA
LSTPPIRVQTFEDTPGSPSNVSFPDVTFSMARIIWDVPMDPNGEILAYQVTYTLNGSA
NLNYSREFPPSDRTFRATGLMPERYYSFSVTAQTRLGWGKTASVLVYTTNNRDRPQAP
SGPQVSRSQIQAHQITFNWTPGRDGFAPLRYYTVEMRENEGRWQPLPERVDPTLSSYT
ALGLRPYTTYQFRIQATNDLGPSAFSRESIVVRTLPAAPAVGVGGLKVVPITTTSVRV
QWGALETGMWNGDAATGGYRILYQQLSDFAPALQSTPKTDVMGINENSVVLSDLQQDR
NYEIVVLPFNSQGPGPATPPTAVYVGEAVPTGEPRGVDATAISSTEVRLSWKPPKQSS
QNGEILGYKIFYLVTWSPQALEPGRKFEEEIEVVSATATSHSLVFLDKFTEYRIQLLA
FNPAGDGPRSAPVTAKTMPGVPSAPLNLRFSDITMQSLEVTWDPPKLLNGEIVGYLVT
YETTEENEKFSKQVKQKVSNTTLRVQNLEEEVTYTFTVRAQTNDYGPAVSANVTTGPQ
DGSPVAPRDLTLTKTLSSVEVHWVNGPSGRGPILGYLIEAKKRENGEPSFISNRPPYL
RLDDSRWTKIEQSRKGTMKEFTVSYHILMPSTAYLFRVIAYNKYGISFPVYSKDSILT
PSKLHLEYGYLQHKPFYRQTWFMVSLAATSIVIIVMVIAVLCVKSKSYKYKQEAQKTL
EESMAMSIDERQELALELYRSRHGVGTGTLNSVGTLRSGTLGTLGRKSTNRHQPVSVH
LGKSPPRPSPASVAYHSDEESLKCYDENPDDSSVTEKPSEHSESENESVRSDPHSFVN
HYANVNDSLRQSWKKTKPVRNYSSYTDSEPEGSAVMSLNGGQIIVNNMARSRAPLPGF
SSFV"
misc_feature 597..809
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig_3; pfam13927"
/db_xref="CDD:464046"
misc_feature 1164..1451
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig; cl11960"
/db_xref="CDD:472250"
misc_feature 1218..1232
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409353"
misc_feature 1263..1277
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409353"
misc_feature 1338..1352
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409353"
misc_feature 1380..1397
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409353"
misc_feature 1422..1433
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409353"
misc_feature 1461..1736
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig; cl11960"
/db_xref="CDD:472250"
misc_feature 1515..1529
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409543"
misc_feature 1554..1568
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409543"
misc_feature 1632..1643
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409543"
misc_feature 1671..1688
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409543"
misc_feature 1710..1721
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409543"
misc_feature 1929..2192
/gene="LOC111066347"
/note="Immunoglobulin I-set domain; Region: I-set;
pfam07679"
/db_xref="CDD:400151"
misc_feature 1980..1994
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409562"
misc_feature 2019..2033
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409562"
misc_feature 2091..2102
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409562"
misc_feature 2130..2147
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409562"
misc_feature 2169..2180
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409562"
misc_feature 2205..2474
/gene="LOC111066347"
/note="Immunoglobulin I-set domain; Region: I-set;
pfam07679"
/db_xref="CDD:400151"
misc_feature 2253..2267
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409544"
misc_feature 2298..2312
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409544"
misc_feature 2370..2384
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409544"
misc_feature 2412..2429
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409544"
misc_feature 2451..2462
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409544"
misc_feature 2484..2807
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(2484..2486,2727..2729,2772..2774)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(2775..2780,2784..2789)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 2850..3137
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(3102..3107,3111..3116)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 3159..3473
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(3438..3443,3447..3452)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature order(3486..3488,3681..3683,3726..3728)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature 3489..3740
/gene="LOC111066347"
/note="Fibronectin type III domain; Region: fn3;
pfam00041"
/db_xref="CDD:394996"
misc_feature 3780..4049
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(3780..3782,3978..3980,4023..4025)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(4026..4031,4035..4040)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 4086..4352
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature 4407..4664
/gene="LOC111066347"
/note="Fibronectin type III domain; Region: fn3;
pfam00041"
/db_xref="CDD:394996"
misc_feature 4626..6281
/gene="LOC111066347"
/note="Fibronectin type 3 domain [General function
prediction only]; Region: FN3; COG3401"
/db_xref="CDD:442628"
misc_feature order(4650..4655,4659..4664)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 6234..6554
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(6234..6236,6486..6488,6531..6533)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(6534..6539,6543..6548)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
ORIGIN
1 cctgcgcgca taacggttgt tcttgccgac tacgtcgcat cgtcgtcgtc attttcgtcg
61 tcgctcgtag ttcgtggctc tcggtcgctc caacttgctg cggcgcgtgt ttcgaaacac
121 agcagaccag acgaggaaga agtctagaga agcaaatggt tcaataaaga aacaacattg
181 aaagcaggcg caagagaaag tcaacatgaa ttaagaaaaa cgcaagaaaa ttcaaaaaca
241 attaaaattt aatcaaacaa aacaacaaaa aaaccaaaaa caaaattcag aagcaggcga
301 aaagaatcaa cgatatcgaa gaaaaacagc cgcaagagag agagccgaaa atgaagagag
361 accagcggcg atcttcagcg tcgtcgctgc gtcgccgtcg tcgttggtgc gtcgacgtca
421 acgaaaaagg aacacgaatg tggctcaaaa tttcgctgtc gcagccgctg gaagcgtcgc
481 tgtttgtgct ggcagcgctg ctgctgctca atgcggacag ctgctcatgt tacgcggatg
541 ccaatccgca gcaacaacag cagctggtcc agcagcagca gcaacaactt caggcgccac
601 gttttaccac acacccatcg tcatcgggct cgattgtgag cgagggcagc accaagatcc
661 tacagtgcca tgctttgggt tatccacagc cgacatatcg ttggctgaag gacggcaagt
721 ccgtgggcga gttctcatcg agtcagttct atcggttcca cagcacacgg cgcgaggatg
781 cgggcagcta tcagtgcatt gccaagaacg atgccggatc catattcagc gagaagagcg
841 acgttgtagt ggcctacatg ggcatctttg agaacgtcac cgagggacgc ctaactgttg
901 tgagcggaca tccggccatc ttcgatatgc cggccattga gtcggtgcca acgccatcgg
961 tgctgtggca gtcggcggac gggtcgctca actacgacat caagtacgcc ttcacccagg
1021 ccaatcagct gattatactg agcgtggacg agaacgatcg gaggggctac cgggcgcggg
1081 cgatcaacac gcagctgggc aaggaggaga tcagcgcgtt cgttcatctg aatgtcagtg
1141 gcgatccgta catagaggtg gcacccgaga taattgtacg gccgcaggat gtcaaggtca
1201 agaccggcac tggcgtcctc gagctgcagt gcatcgccaa tgcgcgaccc ctgcacgaac
1261 tggagacgat ttggctgaag gacggcctcg ccgtggacac gaccggcgtg cggcacaccc
1321 tcaacgatcc ctggaaccgc accctggccc tcctgcaagc caacagctcg cactccggcg
1381 agtacacctg tcaggtgcgc ctgcgcagcg gtggctatcc aacggtcacc gcctcagccc
1441 gcgtccaaat tctcgagccg cccgtcttct tcacgcccat gcgagcggaa acctttggtg
1501 aatttggcgg ccaggtgcag ctgccctgcg atgtggtggg cgagcccacg ccccaagttg
1561 aatggttccg gaatgcggag tctgtcgagg cgaatgtgca aagcggaaga tactcactgg
1621 gagaggataa tacgctgata attaagaaac taatactgga tgattcggcc atgtttcagt
1681 gcctggcccg aaatgaggcc ggcgagaact cagccagcac ctggctgcgc gtcaaaacag
1741 aaacagaaac agaagcagcc aacagccgca tcaagcgctt ggcccagcca cgcatcttaa
1801 gagtaagagc ttcgcatggt ggttcgggaa cggtcgcggg atcggtcacg ggatcgggat
1861 caggatctgg ttccacctcc aactcccatc agcagggtag acgcaagcag tttcgctttg
1921 cctcagcgcc ggtctttgag cagccgcccc agaatgtgac cgccctggat ggcaaggatg
1981 cgacgatctc ctgtcgggcc attggctcgc ccaatcccaa tgttacctgg atctacaatg
2041 aaacccaact ggttgagata tccagtcgcg ttcagatact cgaatcgggt gatttactca
2101 tctcgaatat ccgtgccacg gacgcgggac tctacatctg tgtgcgggcc aacgaggcgg
2161 gcagcgtcaa gggcgaggcc ttgctaagcg tgttagtgcg gacacaaatc atacagccgc
2221 cagtggacac catcgtgctg ctgggcctga ccgcgacact gcagtgcaag gtgtccagcg
2281 acccgagcgt gccctacaac atcgactggt accgggaggg ccaaatggcg cccatcagca
2341 actcgcagcg gattggagtg caggcggacg ggcagctgga gatccaggcg gtgcgggcca
2401 gcgatgtggg cagctattcg tgcgtggtta catcgccggg cggcaatgag acacggtcgg
2461 cccgtctcag tgtcatcgag ctgcccttcc cgcccagcaa cgtgcgggtg gagcgtctgc
2521 cagagccgca gcagcgcagc atcaatgtgt cctggacgcc cggattcgat ggcaacagtc
2581 caatctccaa atttattatc cagcgacgtg aggtctctga attggaaaaa ttcgtaggtc
2641 cagttccaga tccccttctc aattggatca ccgaactgag caacgtatcg gccaatcagc
2701 ggtggatgct gctggagaac ctcaaggcgg ccaccgtcta tcagtttcgt gtcagtgccg
2761 tcaatcgggt cggcgagggc tccccctcgg agcccagcaa tgttgttgag ctgccccaag
2821 aagagtttcc gctttatcga gctccttcgg gaccgcctgt gggctttgtg ggctcggcac
2881 ggtccatgtc cgagatcatt acgcagtggc agccgccgca ggaggagcat cgcaacggac
2941 agatcctggg ctacattctg cgctatcgcc tgttcgggta caacaatgtg ccgtggtcct
3001 accagaacat caccaacgag gcgcagcgca actttctgat ccaggagctg atcacgtgga
3061 aggactacat cgtgcagatt gcggccttca acaacatggg cgtgggcgtc tacacggagg
3121 gggccaagat caagaccaag gagggtgtgc ccgaggcacc gcccaccaac gtcagggtga
3181 aggccctcaa ctcgacggcg gcgcagatca cgtggaagcc gccgaatccg cagcagatca
3241 acggcatcaa ccagggctac aagatccagg catggcagcg acggcagctc gatggggagg
3301 agcgggacat ggagcggcgc atgatgacgg tgccgcccag cctgatcgat ccactggccg
3361 agcagacgac ggtgctcggt ggcctggaca agttcgccaa gttcaatgtg accgtactct
3421 gcttcaccga tcccggtgac ggtgtggcca gccagctggt gccggtggag actttggacg
3481 acgtgcccga cgagataacg gccctgcact ttgacgatgt ctccgatcgg tccgtcaaag
3541 tgctgtgggc gccgccgcgc ttcgccaacg gcatcctcac cggctacacg gtgcgctacc
3601 aggtcaagga tcgccccgag acgatgaagt tcttcaacct gaccgccgac gacaacgagc
3661 tgacggtgaa ccagctgcag gcgacgaccc actactggtt cgaggtgtgc gcctggacgc
3721 gggtgggcag cgggccgccc aagacggcga cgatccaatc gggcgtggag ccggtgctgc
3781 cgcatgcgcc caccacactg gccctgtcca acatcgaagc gttttcggtg gtgctgcagt
3841 tcacgcccgg cttcgacggc aactcgagca tcaccaagtg gaaggtggag gcgcagacgg
3901 cccgcaacat gacctggttc acgctctgtg aaatcagcga tcccgatgcg gagaccctca
3961 ccgtgaccgg cctgatgccc ttcacccagt accggctgcg gctgagcgcc accaatgtgg
4021 tgggcagctc ccggccctcg gaccccacca aggactttca aaccattcag gccaagccga
4081 tgcacgcccc cttcaatgtg acggtacgcg caatgagcgc cctgcagctg cgcgtccgct
4141 ggataccgct gcagcagatg gagtggttcg gcaatccgcg cggctacaat gtcacctacc
4201 ggcaaatgga gcgcaccggc aagccctcca agcacccgcc ccgctccgtg atgatcgagg
4261 atcacacggc caactcgcat gtgctcgagg ggctcgagga gtggaccctc tacgaagtga
4321 tcatgaacgc ctgcaacgat gtgggctgct cgctggacag cggcctggcc atggagcgca
4381 ccagggaagc ggtgcccagc tacggcccgc tgcatgtgga ggcgaacgcc acctcctcga
4441 cgacggtggt ggtgcgctgg ggcgagatac cgccccacca tcgcaacggc cagatcgatg
4501 gctacaaggt gtactacgcg gccaccgagc gcggcatgca ggtgctctac aagacgatac
4561 ccaacaacag ctccttcacc accaccctca ccgagctgca gaagtttgtg gtgtaccacg
4621 tccaggtgct ggcctacacg cggctcggca acggcgccct cagcaccccg cccatccggg
4681 tgcagacgtt cgaggacacg cccggatcac cgtccaatgt gagcttcccg gacgtcacct
4741 tctcgatggc gcgcatcatc tgggacgtgc cgatggaccc caatggcgag atactcgcct
4801 accaggtcac ctacacgctc aacggaagcg ccaatctgaa ctacagccgc gagtttccgc
4861 cctcggatcg caccttccgg gccaccggcc tgatgcccga gcgctactac agcttcagcg
4921 tgacggccca gacacgcctc ggctggggca aaacggcctc ggtgctggtg tacacgacca
4981 acaacaggga ccgtccgcag gcaccgtccg ggccgcaggt gtcgcgcagc cagatccagg
5041 cccatcagat caccttcaac tggacgccgg gccgcgacgg gttcgccccg ctgcgatact
5101 acacggtcga gatgcgggag aacgagggcc gctggcagcc gctgcccgag cgcgtcgatc
5161 ccacactcag ctcgtacacg gccctgggtc tgcgtccgta caccacctac cagttccgca
5221 ttcaggcgac caacgatctg ggcccgtcgg cgttcagccg agagagcatt gtggtgcgca
5281 ccctgcccgc cgccccagcg gtgggtgtgg ggggactgaa ggtggtgccc ataacgacca
5341 cctcggtgcg ggtgcagtgg ggggcgctgg agacgggcat gtggaacggc gacgcggcca
5401 ccgggggata ccgcatactg taccagcagc tgtcggactt cgcaccggcc ctgcagtcga
5461 ccccgaagac ggatgtgatg ggcatcaatg agaacagcgt ggtgctgtcc gatctgcagc
5521 aggaccgcaa ctacgagatc gtggtgctgc cattcaattc gcagggaccg ggcccggcca
5581 caccgccgac cgccgtctat gtgggcgagg cggtgcccac tggagagccg cggggcgtgg
5641 atgccacggc catttccagc acggaggtgc gcctgagctg gaagccaccg aagcagagca
5701 gccagaacgg agagatactc ggctacaaga tattctattt ggtgacgtgg tcgccgcagg
5761 ccctcgagcc gggccgcaaa ttcgaggagg aaatcgaagt ggtctcggcc acggccacat
5821 cgcacagcct ggtctttctc gataagttca ccgagtaccg catccagttg ctggccttca
5881 atccggccgg agacgggccg aggtccgccc ccgtcactgc gaagacgatg ccgggcgtgc
5941 ccagtgcccc gctcaatctg cgcttttcgg acatcacaat gcagagcctg gaggtgacct
6001 gggacccgcc caagctgctc aacggcgaga ttgttggcta tctggtcacc tacgagacca
6061 ccgaggagaa cgaaaagttc agcaagcagg tgaagcagaa ggtgtccaac accacgctgc
6121 gtgtgcagaa tctggaggag gaggtcacct acaccttcac cgtgcgcgcc cagacgaacg
6181 actatggacc ggcggtgagc gcgaatgtga ccacaggccc ccaggatggc tccccggtgg
6241 caccgcgcga tctcacactc acaaagacac tgtccagcgt tgaggtacat tgggtcaatg
6301 gaccctccgg ccggggcccc atactgggct acctcatcga ggccaagaag cgagaaaatg
6361 gagagccctc atttatttct aatagacctc cctatcttcg cttagacgac tcccgctgga
6421 ctaagattga gcagtccaga aagggtacca tgaaggagtt taccgtcagc taccacatcc
6481 tgatgccatc gacggcgtat ttgttccggg taattgctta caataagtat ggcatatcgt
6541 tccctgttta ctcgaaggac tcgatactga cgccctcgaa gctgcatctg gagtacggct
6601 atctgcagca caagcccttc tacaggcaga cctggttcat ggtctccctg gcggccacct
6661 cgatcgtcat cattgtcatg gtcattgcgg tgctctgtgt gaagagcaag agctacaagt
6721 acaagcagga ggcacaaaag acgctggagg agtccatggc catgtcgatt gatgagcgcc
6781 aggagctggc cctggagctg tatcgttcgc gtcacggcgt cggcaccggc accctgaaca
6841 gcgttggaac attgcgcagc ggaactttgg gaaccctcgg ccgtaagtcc accaaccgac
6901 accagccggt gagtgtgcat ttgggtaaga gtccaccgcg accctcgccc gcatcggtgg
6961 cgtaccacag cgatgaggag agtctcaagt gctacgacga gaatcccgac gacagcagtg
7021 ttacggaaaa gccatccgag cactcggaga gcgagaacga gagcgtgagg agcgatccgc
7081 actcgttcgt caatcactat gcgaatgtga atgactcgct gcggcagtcc tggaagaaga
7141 ccaagcccgt gcgcaactac tcgagctaca cagactccga gccggagggc agtgcagtga
7201 tgagtctcaa tggtggccag attattgtca ataatatggc cagatcgagg gcaccactgc
7261 ccggcttctc gtcatttgtc tgacaatcaa ccgaattcta agatctatgc cgtggtagca
7321 gcagcaccgt catccgcgag acatttgtct gaattatttt ggaaacgata acggaaaacg
7381 gaaaaacgga ggctgaagct gaaaccggag ctggagttgc agtggggagc gttctaacga
7441 gttcgacacg gatgtagcga gtgggctaaa ctgcctgcct gcctgcaact gttctgtctg
7501 gctctccctg gatcttcgta gctgtccggc gaggcgctgc tacatggata tttatcgtag
7561 tt