PREDICTED: Drosophila obscura protein sidekick (LOC111066347),
LOCUS XM_041591959 7403 bp mRNA linear INV 14-MAY-2021
transcript variant X8, mRNA.
ACCESSION XM_041591959
VERSION XM_041591959.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..7403
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..7403
/gene="LOC111066347"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 7 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 3 samples with support for all annotated
introns"
/db_xref="GeneID:111066347"
CDS 351..7124
/gene="LOC111066347"
/codon_start=1
/product="protein sidekick isoform X8"
/protein_id="XP_041447893.1"
/db_xref="GeneID:111066347"
/translation="MKRDQRRSSASSLRRRRRWCVDVNEKGTRMWLKISLSQPLEASL
FVLAALLLLNADSCSCYADANPQQQQQLVQQQQQQLQAPRFTTHPSSSGSIVSEGSTK
ILQCHALGYPQPTYRWLKDGKSVGEFSSSQFYRFHSTRREDAGSYQCIAKNDAGSIFS
EKSDVVVAYMGIFENVTEGRLTVVSGHPAIFDMPAIESVPTPSVLWQSADGSLNYDIK
YAFTQANQLIILSVDENDRRGYRARAINTQLGKEEISAFVHLNVSGDPYIEVAPEIIV
RPQDVKVKTGTGVLELQCIANARPLHELETIWLKDGLAVDTTGVRHTLNDPWNRTLAL
LQANSSHSGEYTCQVRLRSGGYPTVTASARVQILEPPVFFTPMRAETFGEFGGQVQLP
CDVVGEPTPQVEWFRNAESVEANVQSGRYSLGEDNTLIIKKLILDDSAMFQCLARNEA
GENSASTWLRVKTSAPVFEQPPQNVTALDGKDATISCRAIGSPNPNVTWIYNETQLVE
ISSRVQILESGDLLISNIRATDAGLYICVRANEAGSVKGEALLSVLVRTQIIQPPVDT
IVLLGLTATLQCKVSSDPSVPYNIDWYREGQMAPISNSQRIGVQADGQLEIQAVRASD
VGSYSCVVTSPGGNETRSARLSVIELPFPPSNVRVERLPEPQQRSINVSWTPGFDGNS
PISKFIIQRREVSELEKFVGPVPDPLLNWITELSNVSANQRWMLLENLKAATVYQFRV
SAVNRVGEGSPSEPSNVVELPQEEFPLYRAPSGPPVGFVGSARSMSEIITQWQPPQEE
HRNGQILGYILRYRLFGYNNVPWSYQNITNEAQRNFLIQELITWKDYIVQIAAFNNMG
VGVYTEGAKIKTKEGVPEAPPTNVRVKALNSTAAQITWKPPNPQQINGINQGYKIQAW
QRRQLDGEERDMERRMMTVPPSLIDPLAEQTTVLGGLDKFAKFNVTVLCFTDPGDGVA
SQLVPVETLDDVPDEITALHFDDVSDRSVKVLWAPPRFANGILTGYTVRYQVKDRPET
MKFFNLTADDNELTVNQLQATTHYWFEVCAWTRVGSGPPKTATIQSGVEPVLPHAPTT
LALSNIEAFSVVLQFTPGFDGNSSITKWKVEAQTARNMTWFTLCEISDPDAETLTVTG
LMPFTQYRLRLSATNVVGSSRPSDPTKDFQTIQAKPMHAPFNVTVRAMSALQLRVRWI
PLQQMEWFGNPRGYNVTYRQMERTGKPSKHPPRSVMIEDHTANSHVLEGLEEWTLYEV
IMNACNDVGCSLDSGLAMERTREAVPSYGPLHVEANATSSTTVVVRWGEIPPHHRNGQ
IDGYKVYYAATERGMQVLYKTIPNNSSFTTTLTELQKFVVYHVQVLAYTRLGNGALST
PPIRVQTFEDTPGSPSNVSFPDVTFSMARIIWDVPMDPNGEILAYQVTYTLNGSANLN
YSREFPPSDRTFRATGLMPERYYSFSVTAQTRLGWGKTASVLVYTTNNRDRPQAPSGP
QVSRSQIQAHQITFNWTPGRDGFAPLRYYTVEMRENEGRWQPLPERVDPTLSSYTALG
LRPYTTYQFRIQATNDLGPSAFSRESIVVRTLPAAPAVGVGGLKVVPITTTSVRVQWG
ALETGMWNGDAATGGYRILYQQLSDFAPALQSTPKTDVMGINENSVVLSDLQQDRNYE
IVVLPFNSQGPGPATPPTAVYVGEAVPTGEPRGVDATAISSTEVRLSWKPPKQSSQNG
EILGYKIFYLVTWSPQALEPGRKFEEEIEVVSATATSHSLVFLDKFTEYRIQLLAFNP
AGDGPRSAPVTAKTMPGVPSAPLNLRFSDITMQSLEVTWDPPKLLNGEIVGYLVTYET
TEENEKFSKQVKQKVSNTTLRVQNLEEEVTYTFTVRAQTNDYGPAVSANVTTGPQDGS
PVAPRDLTLTKTLSSVEVHWVNGPSGRGPILGYLIEAKKRENGEPSFISNRPPYLRLD
DSRWTKIEQSRKGTMKEFTVSYHILMPSTAYLFRVIAYNKYGISFPVYSKDSILTPSK
LHLEYGYLQHKPFYRQTWFMVSLAATSIVIIVMVIAVLCVKSKSYKYKQEAQKTLEES
MAMSIDERQELALELYRSRHGVGTGTLNSVGTLRSGTLGTLGRKSTNRHQPVSVHLGK
SPPRPSPASVAYHSDEESLKCYDENPDDSSVTEKPSEVSSSEASQHSESENESVRSDP
HSFVNHYANVNDSLRQSWKKTKPVRNYSSYTDSEPEGSAVMSLNGGQIIVNNMARSRA
PLPGFSSFV"
misc_feature 597..809
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig_3; pfam13927"
/db_xref="CDD:464046"
misc_feature 1164..1451
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig; cl11960"
/db_xref="CDD:472250"
misc_feature 1218..1232
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409353"
misc_feature 1263..1277
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409353"
misc_feature 1338..1352
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409353"
misc_feature 1380..1397
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409353"
misc_feature 1422..1433
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409353"
misc_feature 1461..1736
/gene="LOC111066347"
/note="Immunoglobulin domain; Region: Ig; cl11960"
/db_xref="CDD:472250"
misc_feature 1515..1529
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409543"
misc_feature 1554..1568
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409543"
misc_feature 1632..1643
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409543"
misc_feature 1671..1688
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409543"
misc_feature 1710..1721
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409543"
misc_feature 1746..2009
/gene="LOC111066347"
/note="Immunoglobulin I-set domain; Region: I-set;
pfam07679"
/db_xref="CDD:400151"
misc_feature 1797..1811
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409562"
misc_feature 1836..1850
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409562"
misc_feature 1908..1919
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409562"
misc_feature 1947..1964
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409562"
misc_feature 1986..1997
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409562"
misc_feature 2022..2291
/gene="LOC111066347"
/note="Immunoglobulin I-set domain; Region: I-set;
pfam07679"
/db_xref="CDD:400151"
misc_feature 2070..2084
/gene="LOC111066347"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:409544"
misc_feature 2115..2129
/gene="LOC111066347"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:409544"
misc_feature 2187..2201
/gene="LOC111066347"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:409544"
misc_feature 2229..2246
/gene="LOC111066347"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:409544"
misc_feature 2268..2279
/gene="LOC111066347"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:409544"
misc_feature 2301..2624
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(2301..2303,2544..2546,2589..2591)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(2592..2597,2601..2606)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 2667..2954
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(2919..2924,2928..2933)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 2976..3290
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(3255..3260,3264..3269)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature order(3303..3305,3498..3500,3543..3545)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature 3306..3557
/gene="LOC111066347"
/note="Fibronectin type III domain; Region: fn3;
pfam00041"
/db_xref="CDD:394996"
misc_feature 3597..3866
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(3597..3599,3795..3797,3840..3842)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(3843..3848,3852..3857)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 3903..4169
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature 4224..4481
/gene="LOC111066347"
/note="Fibronectin type III domain; Region: fn3;
pfam00041"
/db_xref="CDD:394996"
misc_feature 4443..6098
/gene="LOC111066347"
/note="Fibronectin type 3 domain [General function
prediction only]; Region: FN3; COG3401"
/db_xref="CDD:442628"
misc_feature order(4467..4472,4476..4481)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 6051..6371
/gene="LOC111066347"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(6051..6053,6303..6305,6348..6350)
/gene="LOC111066347"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(6351..6356,6360..6365)
/gene="LOC111066347"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
ORIGIN
1 cctgcgcgca taacggttgt tcttgccgac tacgtcgcat cgtcgtcgtc attttcgtcg
61 tcgctcgtag ttcgtggctc tcggtcgctc caacttgctg cggcgcgtgt ttcgaaacac
121 agcagaccag acgaggaaga agtctagaga agcaaatggt tcaataaaga aacaacattg
181 aaagcaggcg caagagaaag tcaacatgaa ttaagaaaaa cgcaagaaaa ttcaaaaaca
241 attaaaattt aatcaaacaa aacaacaaaa aaaccaaaaa caaaattcag aagcaggcga
301 aaagaatcaa cgatatcgaa gaaaaacagc cgcaagagag agagccgaaa atgaagagag
361 accagcggcg atcttcagcg tcgtcgctgc gtcgccgtcg tcgttggtgc gtcgacgtca
421 acgaaaaagg aacacgaatg tggctcaaaa tttcgctgtc gcagccgctg gaagcgtcgc
481 tgtttgtgct ggcagcgctg ctgctgctca atgcggacag ctgctcatgt tacgcggatg
541 ccaatccgca gcaacaacag cagctggtcc agcagcagca gcaacaactt caggcgccac
601 gttttaccac acacccatcg tcatcgggct cgattgtgag cgagggcagc accaagatcc
661 tacagtgcca tgctttgggt tatccacagc cgacatatcg ttggctgaag gacggcaagt
721 ccgtgggcga gttctcatcg agtcagttct atcggttcca cagcacacgg cgcgaggatg
781 cgggcagcta tcagtgcatt gccaagaacg atgccggatc catattcagc gagaagagcg
841 acgttgtagt ggcctacatg ggcatctttg agaacgtcac cgagggacgc ctaactgttg
901 tgagcggaca tccggccatc ttcgatatgc cggccattga gtcggtgcca acgccatcgg
961 tgctgtggca gtcggcggac gggtcgctca actacgacat caagtacgcc ttcacccagg
1021 ccaatcagct gattatactg agcgtggacg agaacgatcg gaggggctac cgggcgcggg
1081 cgatcaacac gcagctgggc aaggaggaga tcagcgcgtt cgttcatctg aatgtcagtg
1141 gcgatccgta catagaggtg gcacccgaga taattgtacg gccgcaggat gtcaaggtca
1201 agaccggcac tggcgtcctc gagctgcagt gcatcgccaa tgcgcgaccc ctgcacgaac
1261 tggagacgat ttggctgaag gacggcctcg ccgtggacac gaccggcgtg cggcacaccc
1321 tcaacgatcc ctggaaccgc accctggccc tcctgcaagc caacagctcg cactccggcg
1381 agtacacctg tcaggtgcgc ctgcgcagcg gtggctatcc aacggtcacc gcctcagccc
1441 gcgtccaaat tctcgagccg cccgtcttct tcacgcccat gcgagcggaa acctttggtg
1501 aatttggcgg ccaggtgcag ctgccctgcg atgtggtggg cgagcccacg ccccaagttg
1561 aatggttccg gaatgcggag tctgtcgagg cgaatgtgca aagcggaaga tactcactgg
1621 gagaggataa tacgctgata attaagaaac taatactgga tgattcggcc atgtttcagt
1681 gcctggcccg aaatgaggcc ggcgagaact cagccagcac ctggctgcgc gtcaaaacct
1741 cagcgccggt ctttgagcag ccgccccaga atgtgaccgc cctggatggc aaggatgcga
1801 cgatctcctg tcgggccatt ggctcgccca atcccaatgt tacctggatc tacaatgaaa
1861 cccaactggt tgagatatcc agtcgcgttc agatactcga atcgggtgat ttactcatct
1921 cgaatatccg tgccacggac gcgggactct acatctgtgt gcgggccaac gaggcgggca
1981 gcgtcaaggg cgaggccttg ctaagcgtgt tagtgcggac acaaatcata cagccgccag
2041 tggacaccat cgtgctgctg ggcctgaccg cgacactgca gtgcaaggtg tccagcgacc
2101 cgagcgtgcc ctacaacatc gactggtacc gggagggcca aatggcgccc atcagcaact
2161 cgcagcggat tggagtgcag gcggacgggc agctggagat ccaggcggtg cgggccagcg
2221 atgtgggcag ctattcgtgc gtggttacat cgccgggcgg caatgagaca cggtcggccc
2281 gtctcagtgt catcgagctg cccttcccgc ccagcaacgt gcgggtggag cgtctgccag
2341 agccgcagca gcgcagcatc aatgtgtcct ggacgcccgg attcgatggc aacagtccaa
2401 tctccaaatt tattatccag cgacgtgagg tctctgaatt ggaaaaattc gtaggtccag
2461 ttccagatcc ccttctcaat tggatcaccg aactgagcaa cgtatcggcc aatcagcggt
2521 ggatgctgct ggagaacctc aaggcggcca ccgtctatca gtttcgtgtc agtgccgtca
2581 atcgggtcgg cgagggctcc ccctcggagc ccagcaatgt tgttgagctg ccccaagaag
2641 agtttccgct ttatcgagct ccttcgggac cgcctgtggg ctttgtgggc tcggcacggt
2701 ccatgtccga gatcattacg cagtggcagc cgccgcagga ggagcatcgc aacggacaga
2761 tcctgggcta cattctgcgc tatcgcctgt tcgggtacaa caatgtgccg tggtcctacc
2821 agaacatcac caacgaggcg cagcgcaact ttctgatcca ggagctgatc acgtggaagg
2881 actacatcgt gcagattgcg gccttcaaca acatgggcgt gggcgtctac acggaggggg
2941 ccaagatcaa gaccaaggag ggtgtgcccg aggcaccgcc caccaacgtc agggtgaagg
3001 ccctcaactc gacggcggcg cagatcacgt ggaagccgcc gaatccgcag cagatcaacg
3061 gcatcaacca gggctacaag atccaggcat ggcagcgacg gcagctcgat ggggaggagc
3121 gggacatgga gcggcgcatg atgacggtgc cgcccagcct gatcgatcca ctggccgagc
3181 agacgacggt gctcggtggc ctggacaagt tcgccaagtt caatgtgacc gtactctgct
3241 tcaccgatcc cggtgacggt gtggccagcc agctggtgcc ggtggagact ttggacgacg
3301 tgcccgacga gataacggcc ctgcactttg acgatgtctc cgatcggtcc gtcaaagtgc
3361 tgtgggcgcc gccgcgcttc gccaacggca tcctcaccgg ctacacggtg cgctaccagg
3421 tcaaggatcg ccccgagacg atgaagttct tcaacctgac cgccgacgac aacgagctga
3481 cggtgaacca gctgcaggcg acgacccact actggttcga ggtgtgcgcc tggacgcggg
3541 tgggcagcgg gccgcccaag acggcgacga tccaatcggg cgtggagccg gtgctgccgc
3601 atgcgcccac cacactggcc ctgtccaaca tcgaagcgtt ttcggtggtg ctgcagttca
3661 cgcccggctt cgacggcaac tcgagcatca ccaagtggaa ggtggaggcg cagacggccc
3721 gcaacatgac ctggttcacg ctctgtgaaa tcagcgatcc cgatgcggag accctcaccg
3781 tgaccggcct gatgcccttc acccagtacc ggctgcggct gagcgccacc aatgtggtgg
3841 gcagctcccg gccctcggac cccaccaagg actttcaaac cattcaggcc aagccgatgc
3901 acgccccctt caatgtgacg gtacgcgcaa tgagcgccct gcagctgcgc gtccgctgga
3961 taccgctgca gcagatggag tggttcggca atccgcgcgg ctacaatgtc acctaccggc
4021 aaatggagcg caccggcaag ccctccaagc acccgccccg ctccgtgatg atcgaggatc
4081 acacggccaa ctcgcatgtg ctcgaggggc tcgaggagtg gaccctctac gaagtgatca
4141 tgaacgcctg caacgatgtg ggctgctcgc tggacagcgg cctggccatg gagcgcacca
4201 gggaagcggt gcccagctac ggcccgctgc atgtggaggc gaacgccacc tcctcgacga
4261 cggtggtggt gcgctggggc gagataccgc cccaccatcg caacggccag atcgatggct
4321 acaaggtgta ctacgcggcc accgagcgcg gcatgcaggt gctctacaag acgataccca
4381 acaacagctc cttcaccacc accctcaccg agctgcagaa gtttgtggtg taccacgtcc
4441 aggtgctggc ctacacgcgg ctcggcaacg gcgccctcag caccccgccc atccgggtgc
4501 agacgttcga ggacacgccc ggatcaccgt ccaatgtgag cttcccggac gtcaccttct
4561 cgatggcgcg catcatctgg gacgtgccga tggaccccaa tggcgagata ctcgcctacc
4621 aggtcaccta cacgctcaac ggaagcgcca atctgaacta cagccgcgag tttccgccct
4681 cggatcgcac cttccgggcc accggcctga tgcccgagcg ctactacagc ttcagcgtga
4741 cggcccagac acgcctcggc tggggcaaaa cggcctcggt gctggtgtac acgaccaaca
4801 acagggaccg tccgcaggca ccgtccgggc cgcaggtgtc gcgcagccag atccaggccc
4861 atcagatcac cttcaactgg acgccgggcc gcgacgggtt cgccccgctg cgatactaca
4921 cggtcgagat gcgggagaac gagggccgct ggcagccgct gcccgagcgc gtcgatccca
4981 cactcagctc gtacacggcc ctgggtctgc gtccgtacac cacctaccag ttccgcattc
5041 aggcgaccaa cgatctgggc ccgtcggcgt tcagccgaga gagcattgtg gtgcgcaccc
5101 tgcccgccgc cccagcggtg ggtgtggggg gactgaaggt ggtgcccata acgaccacct
5161 cggtgcgggt gcagtggggg gcgctggaga cgggcatgtg gaacggcgac gcggccaccg
5221 ggggataccg catactgtac cagcagctgt cggacttcgc accggccctg cagtcgaccc
5281 cgaagacgga tgtgatgggc atcaatgaga acagcgtggt gctgtccgat ctgcagcagg
5341 accgcaacta cgagatcgtg gtgctgccat tcaattcgca gggaccgggc ccggccacac
5401 cgccgaccgc cgtctatgtg ggcgaggcgg tgcccactgg agagccgcgg ggcgtggatg
5461 ccacggccat ttccagcacg gaggtgcgcc tgagctggaa gccaccgaag cagagcagcc
5521 agaacggaga gatactcggc tacaagatat tctatttggt gacgtggtcg ccgcaggccc
5581 tcgagccggg ccgcaaattc gaggaggaaa tcgaagtggt ctcggccacg gccacatcgc
5641 acagcctggt ctttctcgat aagttcaccg agtaccgcat ccagttgctg gccttcaatc
5701 cggccggaga cgggccgagg tccgcccccg tcactgcgaa gacgatgccg ggcgtgccca
5761 gtgccccgct caatctgcgc ttttcggaca tcacaatgca gagcctggag gtgacctggg
5821 acccgcccaa gctgctcaac ggcgagattg ttggctatct ggtcacctac gagaccaccg
5881 aggagaacga aaagttcagc aagcaggtga agcagaaggt gtccaacacc acgctgcgtg
5941 tgcagaatct ggaggaggag gtcacctaca ccttcaccgt gcgcgcccag acgaacgact
6001 atggaccggc ggtgagcgcg aatgtgacca caggccccca ggatggctcc ccggtggcac
6061 cgcgcgatct cacactcaca aagacactgt ccagcgttga ggtacattgg gtcaatggac
6121 cctccggccg gggccccata ctgggctacc tcatcgaggc caagaagcga gaaaatggag
6181 agccctcatt tatttctaat agacctccct atcttcgctt agacgactcc cgctggacta
6241 agattgagca gtccagaaag ggtaccatga aggagtttac cgtcagctac cacatcctga
6301 tgccatcgac ggcgtatttg ttccgggtaa ttgcttacaa taagtatggc atatcgttcc
6361 ctgtttactc gaaggactcg atactgacgc cctcgaagct gcatctggag tacggctatc
6421 tgcagcacaa gcccttctac aggcagacct ggttcatggt ctccctggcg gccacctcga
6481 tcgtcatcat tgtcatggtc attgcggtgc tctgtgtgaa gagcaagagc tacaagtaca
6541 agcaggaggc acaaaagacg ctggaggagt ccatggccat gtcgattgat gagcgccagg
6601 agctggccct ggagctgtat cgttcgcgtc acggcgtcgg caccggcacc ctgaacagcg
6661 ttggaacatt gcgcagcgga actttgggaa ccctcggccg taagtccacc aaccgacacc
6721 agccggtgag tgtgcatttg ggtaagagtc caccgcgacc ctcgcccgca tcggtggcgt
6781 accacagcga tgaggagagt ctcaagtgct acgacgagaa tcccgacgac agcagtgtta
6841 cggaaaagcc atccgaggtg agcagctcgg aggcatccca gcactcggag agcgagaacg
6901 agagcgtgag gagcgatccg cactcgttcg tcaatcacta tgcgaatgtg aatgactcgc
6961 tgcggcagtc ctggaagaag accaagcccg tgcgcaacta ctcgagctac acagactccg
7021 agccggaggg cagtgcagtg atgagtctca atggtggcca gattattgtc aataatatgg
7081 ccagatcgag ggcaccactg cccggcttct cgtcatttgt ctgacaatca accgaattct
7141 aagatctatg ccgtggtagc agcagcaccg tcatccgcga gacatttgtc tgaattattt
7201 tggaaacgat aacggaaaac ggaaaaacgg aggctgaagc tgaaaccgga gctggagttg
7261 cagtggggag cgttctaacg agttcgacac ggatgtagcg agtgggctaa actgcctgcc
7321 tgcctgcaac tgttctgtct ggctctccct ggatcttcgt agctgtccgg cgaggcgctg
7381 ctacatggat atttatcgta gtt