PREDICTED: Drosophila obscura protein sidekick (LOC111066347),
LOCUS XM_041591953 7568 bp mRNA linear INV 14-MAY-2021
transcript variant X3, mRNA.
ACCESSION XM_041591953
VERSION XM_041591953.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..7568
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..7568
/gene="LOC111066347"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 6 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 3 samples with support for all annotated
introns"
/db_xref="GeneID:111066347"
CDS 351..7289
/gene="LOC111066347"
/codon_start=1
/product="protein sidekick isoform X3"
/protein_id="XP_041447887.1"
/db_xref="GeneID:111066347"
/translation="MKRDQRRSSASSLRRRRRWCVDVNEKGTRMWLKISLSQPLEASL
FVLAALLLLNADSCSCYADANPQQQQQLVQQQQQQLQAPRFTTHPSSSGSIVSEGSTK
ILQCHALGYPQPTYRWLKDGKSVGEFSSSQFYRFHSTRREDAGSYQCIAKNDAGSIFS
EKSDVVVAYMGIFENVTEGRLTVVSGHPAIFDMPAIESVPTPSVLWQSADGSLNYDIK
YAFTQANQLIILSVDENDRRGYRARAINTQLGKEEISAFVHLNVSGDPYIEVAPEIIV
RPQDVKVKTGTGVLELQCIANARPLHELETIWLKDGLAVDTTGVRHTLNDPWNRTLAL
LQANSSHSGEYTCQVRLRSGGYPTVTASARVQILEPPVFFTPMRAETFGEFGGQVQLP
CDVVGEPTPQVEWFRNAESVEANVQSGRYSLGEDNTLIIKKLILDDSAMFQCLARNEA
GENSASTWLRVKTETETEAANSRIKRLAQPRILRVRASHGGSGTVAGSVTGSGSGSGS
TSNSHQQGRRKQFRFASAPVFEQPPQNVTALDGKDATISCRAIGSPNPNVTWIYNETQ
LVEISSRVQILESGDLLISNIRATDAGLYICVRANEAGSVKGEALLSVLVRTQIIQPP
VDTIVLLGLTATLQCKVSSDPSVPYNIDWYREGQMAPISNSQRIGVQADGQLEIQAVR
ASDVGSYSCVVTSPGGNETRSARLSVIELPFPPSNVRVERLPEPQQRSINVSWTPGFD
GNSPISKFIIQRREVSELEKFVGPVPDPLLNWITELSNVSANQRWMLLENLKAATVYQ
FRVSAVNRVGEGSPSEPSNVVELPQEAPSGPPVGFVGSARSMSEIITQWQPPQEEHRN
GQILGYILRYRLFGYNNVPWSYQNITNEAQRNFLIQELITWKDYIVQIAAFNNMGVGV
YTEGAKIKTKEGVPEAPPTNVRVKALNSTAAQITWKPPNPQQINGINQGYKIQAWQRR
QLDGEERDMERRMMTVPPSLIDPLAEQTTVLGGLDKFAKFNVTVLCFTDPGDGVASQL
VPVETLDDVPDEITALHFDDVSDRSVKVLWAPPRFANGILTGYTVRYQVKDRPETMKF
FNLTADDNELTVNQLQATTHYWFEVCAWTRVGSGPPKTATIQSGVEPVLPHAPTTLAL
SNIEAFSVVLQFTPGFDGNSSITKWKVEAQTARNMTWFTLCEISDPDAETLTVTGLMP
FTQYRLRLSATNVVGSSRPSDPTKDFQTIQAKPMHAPFNVTVRAMSALQLRVRWIPLQ
QMEWFGNPRGYNVTYRQMERTGKPSKHPPRSVMIEDHTANSHVLEGLEEWTLYEVIMN
ACNDVGCSLDSGLAMERTREAVPSYGPLHVEANATSSTTVVVRWGEIPPHHRNGQIDG
YKVYYAATERGMQVLYKTIPNNSSFTTTLTELQKFVVYHVQVLAYTRLGNGALSTPPI
RVQTFEDTPGSPSNVSFPDVTFSMARIIWDVPMDPNGEILAYQVTYTLNGSANLNYSR
EFPPSDRTFRATGLMPERYYSFSVTAQTRLGWGKTASVLVYTTNNRDRPQAPSGPQVS
RSQIQAHQITFNWTPGRDGFAPLRYYTVEMRENEGRWQPLPERVDPTLSSYTALGLRP
YTTYQFRIQATNDLGPSAFSRESIVVRTLPAAPAVGVGGLKVVPITTTSVRVQWGALE
TGMWNGDAATGGYRILYQQLSDFAPALQSTPKTDVMGINENSVVLSDLQQDRNYEIVV
LPFNSQGPGPATPPTAVYVGEAVPTGEPRGVDATAISSTEVRLSWKPPKQSSQNGEIL
GYKIFYLVTWSPQALEPGRKFEEEIEVVSATATSHSLVFLDKFTEYRIQLLAFNPAGD
GPRSAPVTAKTMPGVPSAPLNLRFSDITMQSLEVTWDPPKLLNGEIVGYLVTYETTEE
NEKFSKQVKQKVSNTTLRVQNLEEEVTYTFTVRAQTNDYGPAVSANVTTGPQDGSPVA
PRDLTLTKTLSSVEVHWVNGPSGRGPILGYLIEAKKRENGEPSFISNRPPYLRLDDSR
WTKIEQSRKGTMKEFTVSYHILMPSTAYLFRVIAYNKYGISFPVYSKDSILTPSKLHL
EYGYLQHKPFYRQTWFMVSLAATSIVIIVMVIAVLCVKSKSYKYKQEAQKTLEESMAM
SIDERQELALELYRSRHGVGTGTLNSVGTLRSGTLGTLGRKSTNRHQPVSVHLGKSPP
RPSPASVAYHSDEESLKCYDENPDDSSVTEKPSEVSSSEASQHSESENESVRSDPHSF
VNHYANVNDSLRQSWKKTKPVRNYSSYTDSEPEGSAVMSLNGGQIIVNNMARSRAPLP
GFSSFV"
misc_feature 597..809
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig_3; pfam13927"
/db_xref="CDD:464046"
misc_feature 1164..1451
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig; cl11960"
/db_xref="CDD:472250"
misc_feature 1218..1232
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409353"
misc_feature 1263..1277
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409353"
misc_feature 1338..1352
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409353"
misc_feature 1380..1397
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409353"
misc_feature 1422..1433
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409353"
misc_feature 1461..1736
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig; cl11960"
/db_xref="CDD:472250"
misc_feature 1515..1529
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409543"
misc_feature 1554..1568
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409543"
misc_feature 1632..1643
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409543"
misc_feature 1671..1688
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409543"
misc_feature 1710..1721
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409543"
misc_feature 1929..2192
/gene="LOC111066347"
/note="Immunoglobulin I-set domain; Region: I-set;
pfam07679"
/db_xref="CDD:400151"
misc_feature 1980..1994
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409562"
misc_feature 2019..2033
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409562"
misc_feature 2091..2102
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409562"
misc_feature 2130..2147
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409562"
misc_feature 2169..2180
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409562"
misc_feature 2205..2474
/gene="LOC111066347"
/note="Immunoglobulin I-set domain; Region: I-set;
pfam07679"
/db_xref="CDD:400151"
misc_feature 2253..2267
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409544"
misc_feature 2298..2312
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409544"
misc_feature 2370..2384
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409544"
misc_feature <2412..>3197
/gene="LOC111066347"
/note="Fibronectin type 3 domain [General function
prediction only]; Region: FN3; COG3401"
/db_xref="CDD:442628"
misc_feature 2412..2429
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409544"
misc_feature 2451..2462
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409544"
misc_feature 2484..2807
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(2484..2486,2727..2729,2772..2774)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(2775..2780,2784..2789)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 3141..3455
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(3420..3425,3429..3434)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature order(3468..3470,3663..3665,3708..3710)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature 3471..3722
/gene="LOC111066347"
/note="Fibronectin type III domain; Region: fn3;
pfam00041"
/db_xref="CDD:394996"
misc_feature 3762..4031
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(3762..3764,3960..3962,4005..4007)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(4008..4013,4017..4022)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 4068..4334
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature 4389..4646
/gene="LOC111066347"
/note="Fibronectin type III domain; Region: fn3;
pfam00041"
/db_xref="CDD:394996"
misc_feature 4608..6263
/gene="LOC111066347"
/note="Fibronectin type 3 domain [General function
prediction only]; Region: FN3; COG3401"
/db_xref="CDD:442628"
misc_feature order(4632..4637,4641..4646)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 6216..6536
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(6216..6218,6468..6470,6513..6515)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(6516..6521,6525..6530)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
ORIGIN
1 cctgcgcgca taacggttgt tcttgccgac tacgtcgcat cgtcgtcgtc attttcgtcg
61 tcgctcgtag ttcgtggctc tcggtcgctc caacttgctg cggcgcgtgt ttcgaaacac
121 agcagaccag acgaggaaga agtctagaga agcaaatggt tcaataaaga aacaacattg
181 aaagcaggcg caagagaaag tcaacatgaa ttaagaaaaa cgcaagaaaa ttcaaaaaca
241 attaaaattt aatcaaacaa aacaacaaaa aaaccaaaaa caaaattcag aagcaggcga
301 aaagaatcaa cgatatcgaa gaaaaacagc cgcaagagag agagccgaaa atgaagagag
361 accagcggcg atcttcagcg tcgtcgctgc gtcgccgtcg tcgttggtgc gtcgacgtca
421 acgaaaaagg aacacgaatg tggctcaaaa tttcgctgtc gcagccgctg gaagcgtcgc
481 tgtttgtgct ggcagcgctg ctgctgctca atgcggacag ctgctcatgt tacgcggatg
541 ccaatccgca gcaacaacag cagctggtcc agcagcagca gcaacaactt caggcgccac
601 gttttaccac acacccatcg tcatcgggct cgattgtgag cgagggcagc accaagatcc
661 tacagtgcca tgctttgggt tatccacagc cgacatatcg ttggctgaag gacggcaagt
721 ccgtgggcga gttctcatcg agtcagttct atcggttcca cagcacacgg cgcgaggatg
781 cgggcagcta tcagtgcatt gccaagaacg atgccggatc catattcagc gagaagagcg
841 acgttgtagt ggcctacatg ggcatctttg agaacgtcac cgagggacgc ctaactgttg
901 tgagcggaca tccggccatc ttcgatatgc cggccattga gtcggtgcca acgccatcgg
961 tgctgtggca gtcggcggac gggtcgctca actacgacat caagtacgcc ttcacccagg
1021 ccaatcagct gattatactg agcgtggacg agaacgatcg gaggggctac cgggcgcggg
1081 cgatcaacac gcagctgggc aaggaggaga tcagcgcgtt cgttcatctg aatgtcagtg
1141 gcgatccgta catagaggtg gcacccgaga taattgtacg gccgcaggat gtcaaggtca
1201 agaccggcac tggcgtcctc gagctgcagt gcatcgccaa tgcgcgaccc ctgcacgaac
1261 tggagacgat ttggctgaag gacggcctcg ccgtggacac gaccggcgtg cggcacaccc
1321 tcaacgatcc ctggaaccgc accctggccc tcctgcaagc caacagctcg cactccggcg
1381 agtacacctg tcaggtgcgc ctgcgcagcg gtggctatcc aacggtcacc gcctcagccc
1441 gcgtccaaat tctcgagccg cccgtcttct tcacgcccat gcgagcggaa acctttggtg
1501 aatttggcgg ccaggtgcag ctgccctgcg atgtggtggg cgagcccacg ccccaagttg
1561 aatggttccg gaatgcggag tctgtcgagg cgaatgtgca aagcggaaga tactcactgg
1621 gagaggataa tacgctgata attaagaaac taatactgga tgattcggcc atgtttcagt
1681 gcctggcccg aaatgaggcc ggcgagaact cagccagcac ctggctgcgc gtcaaaacag
1741 aaacagaaac agaagcagcc aacagccgca tcaagcgctt ggcccagcca cgcatcttaa
1801 gagtaagagc ttcgcatggt ggttcgggaa cggtcgcggg atcggtcacg ggatcgggat
1861 caggatctgg ttccacctcc aactcccatc agcagggtag acgcaagcag tttcgctttg
1921 cctcagcgcc ggtctttgag cagccgcccc agaatgtgac cgccctggat ggcaaggatg
1981 cgacgatctc ctgtcgggcc attggctcgc ccaatcccaa tgttacctgg atctacaatg
2041 aaacccaact ggttgagata tccagtcgcg ttcagatact cgaatcgggt gatttactca
2101 tctcgaatat ccgtgccacg gacgcgggac tctacatctg tgtgcgggcc aacgaggcgg
2161 gcagcgtcaa gggcgaggcc ttgctaagcg tgttagtgcg gacacaaatc atacagccgc
2221 cagtggacac catcgtgctg ctgggcctga ccgcgacact gcagtgcaag gtgtccagcg
2281 acccgagcgt gccctacaac atcgactggt accgggaggg ccaaatggcg cccatcagca
2341 actcgcagcg gattggagtg caggcggacg ggcagctgga gatccaggcg gtgcgggcca
2401 gcgatgtggg cagctattcg tgcgtggtta catcgccggg cggcaatgag acacggtcgg
2461 cccgtctcag tgtcatcgag ctgcccttcc cgcccagcaa cgtgcgggtg gagcgtctgc
2521 cagagccgca gcagcgcagc atcaatgtgt cctggacgcc cggattcgat ggcaacagtc
2581 caatctccaa atttattatc cagcgacgtg aggtctctga attggaaaaa ttcgtaggtc
2641 cagttccaga tccccttctc aattggatca ccgaactgag caacgtatcg gccaatcagc
2701 ggtggatgct gctggagaac ctcaaggcgg ccaccgtcta tcagtttcgt gtcagtgccg
2761 tcaatcgggt cggcgagggc tccccctcgg agcccagcaa tgttgttgag ctgccccaag
2821 aagctccttc gggaccgcct gtgggctttg tgggctcggc acggtccatg tccgagatca
2881 ttacgcagtg gcagccgccg caggaggagc atcgcaacgg acagatcctg ggctacattc
2941 tgcgctatcg cctgttcggg tacaacaatg tgccgtggtc ctaccagaac atcaccaacg
3001 aggcgcagcg caactttctg atccaggagc tgatcacgtg gaaggactac atcgtgcaga
3061 ttgcggcctt caacaacatg ggcgtgggcg tctacacgga gggggccaag atcaagacca
3121 aggagggtgt gcccgaggca ccgcccacca acgtcagggt gaaggccctc aactcgacgg
3181 cggcgcagat cacgtggaag ccgccgaatc cgcagcagat caacggcatc aaccagggct
3241 acaagatcca ggcatggcag cgacggcagc tcgatgggga ggagcgggac atggagcggc
3301 gcatgatgac ggtgccgccc agcctgatcg atccactggc cgagcagacg acggtgctcg
3361 gtggcctgga caagttcgcc aagttcaatg tgaccgtact ctgcttcacc gatcccggtg
3421 acggtgtggc cagccagctg gtgccggtgg agactttgga cgacgtgccc gacgagataa
3481 cggccctgca ctttgacgat gtctccgatc ggtccgtcaa agtgctgtgg gcgccgccgc
3541 gcttcgccaa cggcatcctc accggctaca cggtgcgcta ccaggtcaag gatcgccccg
3601 agacgatgaa gttcttcaac ctgaccgccg acgacaacga gctgacggtg aaccagctgc
3661 aggcgacgac ccactactgg ttcgaggtgt gcgcctggac gcgggtgggc agcgggccgc
3721 ccaagacggc gacgatccaa tcgggcgtgg agccggtgct gccgcatgcg cccaccacac
3781 tggccctgtc caacatcgaa gcgttttcgg tggtgctgca gttcacgccc ggcttcgacg
3841 gcaactcgag catcaccaag tggaaggtgg aggcgcagac ggcccgcaac atgacctggt
3901 tcacgctctg tgaaatcagc gatcccgatg cggagaccct caccgtgacc ggcctgatgc
3961 ccttcaccca gtaccggctg cggctgagcg ccaccaatgt ggtgggcagc tcccggccct
4021 cggaccccac caaggacttt caaaccattc aggccaagcc gatgcacgcc cccttcaatg
4081 tgacggtacg cgcaatgagc gccctgcagc tgcgcgtccg ctggataccg ctgcagcaga
4141 tggagtggtt cggcaatccg cgcggctaca atgtcaccta ccggcaaatg gagcgcaccg
4201 gcaagccctc caagcacccg ccccgctccg tgatgatcga ggatcacacg gccaactcgc
4261 atgtgctcga ggggctcgag gagtggaccc tctacgaagt gatcatgaac gcctgcaacg
4321 atgtgggctg ctcgctggac agcggcctgg ccatggagcg caccagggaa gcggtgccca
4381 gctacggccc gctgcatgtg gaggcgaacg ccacctcctc gacgacggtg gtggtgcgct
4441 ggggcgagat accgccccac catcgcaacg gccagatcga tggctacaag gtgtactacg
4501 cggccaccga gcgcggcatg caggtgctct acaagacgat acccaacaac agctccttca
4561 ccaccaccct caccgagctg cagaagtttg tggtgtacca cgtccaggtg ctggcctaca
4621 cgcggctcgg caacggcgcc ctcagcaccc cgcccatccg ggtgcagacg ttcgaggaca
4681 cgcccggatc accgtccaat gtgagcttcc cggacgtcac cttctcgatg gcgcgcatca
4741 tctgggacgt gccgatggac cccaatggcg agatactcgc ctaccaggtc acctacacgc
4801 tcaacggaag cgccaatctg aactacagcc gcgagtttcc gccctcggat cgcaccttcc
4861 gggccaccgg cctgatgccc gagcgctact acagcttcag cgtgacggcc cagacacgcc
4921 tcggctgggg caaaacggcc tcggtgctgg tgtacacgac caacaacagg gaccgtccgc
4981 aggcaccgtc cgggccgcag gtgtcgcgca gccagatcca ggcccatcag atcaccttca
5041 actggacgcc gggccgcgac gggttcgccc cgctgcgata ctacacggtc gagatgcggg
5101 agaacgaggg ccgctggcag ccgctgcccg agcgcgtcga tcccacactc agctcgtaca
5161 cggccctggg tctgcgtccg tacaccacct accagttccg cattcaggcg accaacgatc
5221 tgggcccgtc ggcgttcagc cgagagagca ttgtggtgcg caccctgccc gccgccccag
5281 cggtgggtgt ggggggactg aaggtggtgc ccataacgac cacctcggtg cgggtgcagt
5341 ggggggcgct ggagacgggc atgtggaacg gcgacgcggc caccggggga taccgcatac
5401 tgtaccagca gctgtcggac ttcgcaccgg ccctgcagtc gaccccgaag acggatgtga
5461 tgggcatcaa tgagaacagc gtggtgctgt ccgatctgca gcaggaccgc aactacgaga
5521 tcgtggtgct gccattcaat tcgcagggac cgggcccggc cacaccgccg accgccgtct
5581 atgtgggcga ggcggtgccc actggagagc cgcggggcgt ggatgccacg gccatttcca
5641 gcacggaggt gcgcctgagc tggaagccac cgaagcagag cagccagaac ggagagatac
5701 tcggctacaa gatattctat ttggtgacgt ggtcgccgca ggccctcgag ccgggccgca
5761 aattcgagga ggaaatcgaa gtggtctcgg ccacggccac atcgcacagc ctggtctttc
5821 tcgataagtt caccgagtac cgcatccagt tgctggcctt caatccggcc ggagacgggc
5881 cgaggtccgc ccccgtcact gcgaagacga tgccgggcgt gcccagtgcc ccgctcaatc
5941 tgcgcttttc ggacatcaca atgcagagcc tggaggtgac ctgggacccg cccaagctgc
6001 tcaacggcga gattgttggc tatctggtca cctacgagac caccgaggag aacgaaaagt
6061 tcagcaagca ggtgaagcag aaggtgtcca acaccacgct gcgtgtgcag aatctggagg
6121 aggaggtcac ctacaccttc accgtgcgcg cccagacgaa cgactatgga ccggcggtga
6181 gcgcgaatgt gaccacaggc ccccaggatg gctccccggt ggcaccgcgc gatctcacac
6241 tcacaaagac actgtccagc gttgaggtac attgggtcaa tggaccctcc ggccggggcc
6301 ccatactggg ctacctcatc gaggccaaga agcgagaaaa tggagagccc tcatttattt
6361 ctaatagacc tccctatctt cgcttagacg actcccgctg gactaagatt gagcagtcca
6421 gaaagggtac catgaaggag tttaccgtca gctaccacat cctgatgcca tcgacggcgt
6481 atttgttccg ggtaattgct tacaataagt atggcatatc gttccctgtt tactcgaagg
6541 actcgatact gacgccctcg aagctgcatc tggagtacgg ctatctgcag cacaagccct
6601 tctacaggca gacctggttc atggtctccc tggcggccac ctcgatcgtc atcattgtca
6661 tggtcattgc ggtgctctgt gtgaagagca agagctacaa gtacaagcag gaggcacaaa
6721 agacgctgga ggagtccatg gccatgtcga ttgatgagcg ccaggagctg gccctggagc
6781 tgtatcgttc gcgtcacggc gtcggcaccg gcaccctgaa cagcgttgga acattgcgca
6841 gcggaacttt gggaaccctc ggccgtaagt ccaccaaccg acaccagccg gtgagtgtgc
6901 atttgggtaa gagtccaccg cgaccctcgc ccgcatcggt ggcgtaccac agcgatgagg
6961 agagtctcaa gtgctacgac gagaatcccg acgacagcag tgttacggaa aagccatccg
7021 aggtgagcag ctcggaggca tcccagcact cggagagcga gaacgagagc gtgaggagcg
7081 atccgcactc gttcgtcaat cactatgcga atgtgaatga ctcgctgcgg cagtcctgga
7141 agaagaccaa gcccgtgcgc aactactcga gctacacaga ctccgagccg gagggcagtg
7201 cagtgatgag tctcaatggt ggccagatta ttgtcaataa tatggccaga tcgagggcac
7261 cactgcccgg cttctcgtca tttgtctgac aatcaaccga attctaagat ctatgccgtg
7321 gtagcagcag caccgtcatc cgcgagacat ttgtctgaat tattttggaa acgataacgg
7381 aaaacggaaa aacggaggct gaagctgaaa ccggagctgg agttgcagtg gggagcgttc
7441 taacgagttc gacacggatg tagcgagtgg gctaaactgc ctgcctgcct gcaactgttc
7501 tgtctggctc tccctggatc ttcgtagctg tccggcgagg cgctgctaca tggatattta
7561 tcgtagtt