Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]
LOCUS XM_041591951 7586 bp mRNA linear INV 14-MAY-2021 transcript variant X1, mRNA. ACCESSION XM_041591951 VERSION XM_041591951.1 DBLINK BioProject: PRJNA728747 KEYWORDS RefSeq. SOURCE Drosophila obscura ORGANISM Drosophila obscura Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora. COMMENT MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NW_024542752.1) annotated using gene prediction method: Gnomon. Also see: Documentation of NCBI's Annotation Process ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Status :: Full annotation Annotation Name :: Drosophila obscura Annotation Release 101 Annotation Version :: 101 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 8.6 Annotation Method :: Best-placed RefSeq; Gnomon Features Annotated :: Gene; mRNA; CDS; ncRNA ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7586 /organism="Drosophila obscura" /mol_type="mRNA" /isolate="BZ-5 IFL" /db_xref="taxon:7282" /chromosome="Unknown" /sex="male" /tissue_type="whole fly" /dev_stage="Adult fly" /geo_loc_name="Serbia: Babin Zub" /collection_date="2017" gene 1..7586 /gene="LOC111066347" /note="Derived by automated computational analysis using gene prediction method: Gnomon. Supporting evidence includes similarity to: 6 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 3 samples with support for all annotated introns" /db_xref="GeneID:111066347" CDS 351..7307 /gene="LOC111066347" /codon_start=1 /product="protein sidekick isoform X1" /protein_id="XP_041447885.1" /db_xref="GeneID:111066347" /translation="MKRDQRRSSASSLRRRRRWCVDVNEKGTRMWLKISLSQPLEASL FVLAALLLLNADSCSCYADANPQQQQQLVQQQQQQLQAPRFTTHPSSSGSIVSEGSTK ILQCHALGYPQPTYRWLKDGKSVGEFSSSQFYRFHSTRREDAGSYQCIAKNDAGSIFS EKSDVVVAYMGIFENVTEGRLTVVSGHPAIFDMPAIESVPTPSVLWQSADGSLNYDIK YAFTQANQLIILSVDENDRRGYRARAINTQLGKEEISAFVHLNVSGDPYIEVAPEIIV RPQDVKVKTGTGVLELQCIANARPLHELETIWLKDGLAVDTTGVRHTLNDPWNRTLAL LQANSSHSGEYTCQVRLRSGGYPTVTASARVQILEPPVFFTPMRAETFGEFGGQVQLP CDVVGEPTPQVEWFRNAESVEANVQSGRYSLGEDNTLIIKKLILDDSAMFQCLARNEA GENSASTWLRVKTETETEAANSRIKRLAQPRILRVRASHGGSGTVAGSVTGSGSGSGS TSNSHQQGRRKQFRFASAPVFEQPPQNVTALDGKDATISCRAIGSPNPNVTWIYNETQ LVEISSRVQILESGDLLISNIRATDAGLYICVRANEAGSVKGEALLSVLVRTQIIQPP VDTIVLLGLTATLQCKVSSDPSVPYNIDWYREGQMAPISNSQRIGVQADGQLEIQAVR ASDVGSYSCVVTSPGGNETRSARLSVIELPFPPSNVRVERLPEPQQRSINVSWTPGFD GNSPISKFIIQRREVSELEKFVGPVPDPLLNWITELSNVSANQRWMLLENLKAATVYQ FRVSAVNRVGEGSPSEPSNVVELPQEEFPLYRAPSGPPVGFVGSARSMSEIITQWQPP QEEHRNGQILGYILRYRLFGYNNVPWSYQNITNEAQRNFLIQELITWKDYIVQIAAFN NMGVGVYTEGAKIKTKEGVPEAPPTNVRVKALNSTAAQITWKPPNPQQINGINQGYKI QAWQRRQLDGEERDMERRMMTVPPSLIDPLAEQTTVLGGLDKFAKFNVTVLCFTDPGD GVASQLVPVETLDDVPDEITALHFDDVSDRSVKVLWAPPRFANGILTGYTVRYQVKDR PETMKFFNLTADDNELTVNQLQATTHYWFEVCAWTRVGSGPPKTATIQSGVEPVLPHA PTTLALSNIEAFSVVLQFTPGFDGNSSITKWKVEAQTARNMTWFTLCEISDPDAETLT VTGLMPFTQYRLRLSATNVVGSSRPSDPTKDFQTIQAKPMHAPFNVTVRAMSALQLRV RWIPLQQMEWFGNPRGYNVTYRQMERTGKPSKHPPRSVMIEDHTANSHVLEGLEEWTL YEVIMNACNDVGCSLDSGLAMERTREAVPSYGPLHVEANATSSTTVVVRWGEIPPHHR NGQIDGYKVYYAATERGMQVLYKTIPNNSSFTTTLTELQKFVVYHVQVLAYTRLGNGA LSTPPIRVQTFEDTPGSPSNVSFPDVTFSMARIIWDVPMDPNGEILAYQVTYTLNGSA NLNYSREFPPSDRTFRATGLMPERYYSFSVTAQTRLGWGKTASVLVYTTNNRDRPQAP SGPQVSRSQIQAHQITFNWTPGRDGFAPLRYYTVEMRENEGRWQPLPERVDPTLSSYT ALGLRPYTTYQFRIQATNDLGPSAFSRESIVVRTLPAAPAVGVGGLKVVPITTTSVRV QWGALETGMWNGDAATGGYRILYQQLSDFAPALQSTPKTDVMGINENSVVLSDLQQDR NYEIVVLPFNSQGPGPATPPTAVYVGEAVPTGEPRGVDATAISSTEVRLSWKPPKQSS QNGEILGYKIFYLVTWSPQALEPGRKFEEEIEVVSATATSHSLVFLDKFTEYRIQLLA FNPAGDGPRSAPVTAKTMPGVPSAPLNLRFSDITMQSLEVTWDPPKLLNGEIVGYLVT YETTEENEKFSKQVKQKVSNTTLRVQNLEEEVTYTFTVRAQTNDYGPAVSANVTTGPQ DGSPVAPRDLTLTKTLSSVEVHWVNGPSGRGPILGYLIEAKKRENGEPSFISNRPPYL RLDDSRWTKIEQSRKGTMKEFTVSYHILMPSTAYLFRVIAYNKYGISFPVYSKDSILT PSKLHLEYGYLQHKPFYRQTWFMVSLAATSIVIIVMVIAVLCVKSKSYKYKQEAQKTL EESMAMSIDERQELALELYRSRHGVGTGTLNSVGTLRSGTLGTLGRKSTNRHQPVSVH LGKSPPRPSPASVAYHSDEESLKCYDENPDDSSVTEKPSEVSSSEASQHSESENESVR SDPHSFVNHYANVNDSLRQSWKKTKPVRNYSSYTDSEPEGSAVMSLNGGQIIVNNMAR SRAPLPGFSSFV" misc_feature 597..809 /gene="LOC111066347" /note="Immunoglobulin domain; Region: Ig_3; pfam13927" /db_xref="CDD:464046" misc_feature 1164..1451 /gene="LOC111066347" /note="Immunoglobulin domain; Region: Ig; cl11960" /db_xref="CDD:472250" misc_feature 1218..1232 /gene="LOC111066347" /note="Ig strand B [structural motif]; Region: Ig strand B" /db_xref="CDD:409353" misc_feature 1263..1277 /gene="LOC111066347" /note="Ig strand C [structural motif]; Region: Ig strand C" /db_xref="CDD:409353" misc_feature 1338..1352 /gene="LOC111066347" /note="Ig strand E [structural motif]; Region: Ig strand E" /db_xref="CDD:409353" misc_feature 1380..1397 /gene="LOC111066347" /note="Ig strand F [structural motif]; Region: Ig strand F" /db_xref="CDD:409353" misc_feature 1422..1433 /gene="LOC111066347" /note="Ig strand G [structural motif]; Region: Ig strand G" /db_xref="CDD:409353" misc_feature 1461..1736 /gene="LOC111066347" /note="Immunoglobulin domain; Region: Ig; cl11960" /db_xref="CDD:472250" misc_feature 1515..1529 /gene="LOC111066347" /note="Ig strand B [structural motif]; Region: Ig strand B" /db_xref="CDD:409543" misc_feature 1554..1568 /gene="LOC111066347" /note="Ig strand C [structural motif]; Region: Ig strand C" /db_xref="CDD:409543" misc_feature 1632..1643 /gene="LOC111066347" /note="Ig strand E [structural motif]; Region: Ig strand E" /db_xref="CDD:409543" misc_feature 1671..1688 /gene="LOC111066347" /note="Ig strand F [structural motif]; Region: Ig strand F" /db_xref="CDD:409543" misc_feature 1710..1721 /gene="LOC111066347" /note="Ig strand G [structural motif]; Region: Ig strand G" /db_xref="CDD:409543" misc_feature 1929..2192 /gene="LOC111066347" /note="Immunoglobulin I-set domain; Region: I-set; pfam07679" /db_xref="CDD:400151" misc_feature 1980..1994 /gene="LOC111066347" /note="Ig strand B [structural motif]; Region: Ig strand B" /db_xref="CDD:409562" misc_feature 2019..2033 /gene="LOC111066347" /note="Ig strand C [structural motif]; Region: Ig strand C" /db_xref="CDD:409562" misc_feature 2091..2102 /gene="LOC111066347" /note="Ig strand E [structural motif]; Region: Ig strand E" /db_xref="CDD:409562" misc_feature 2130..2147 /gene="LOC111066347" /note="Ig strand F [structural motif]; Region: Ig strand F" /db_xref="CDD:409562" misc_feature 2169..2180 /gene="LOC111066347" /note="Ig strand G [structural motif]; Region: Ig strand G" /db_xref="CDD:409562" misc_feature 2205..2474 /gene="LOC111066347" /note="Immunoglobulin I-set domain; Region: I-set; pfam07679" /db_xref="CDD:400151" misc_feature 2253..2267 /gene="LOC111066347" /note="Ig strand B [structural motif]; Region: Ig strand B" /db_xref="CDD:409544" misc_feature 2298..2312 /gene="LOC111066347" /note="Ig strand C [structural motif]; Region: Ig strand C" /db_xref="CDD:409544" misc_feature 2370..2384 /gene="LOC111066347" /note="Ig strand E [structural motif]; Region: Ig strand E" /db_xref="CDD:409544" misc_feature 2412..2429 /gene="LOC111066347" /note="Ig strand F [structural motif]; Region: Ig strand F" /db_xref="CDD:409544" misc_feature 2451..2462 /gene="LOC111066347" /note="Ig strand G [structural motif]; Region: Ig strand G" /db_xref="CDD:409544" misc_feature 2484..2807 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature order(2484..2486,2727..2729,2772..2774) /gene="LOC111066347" /note="Interdomain contacts [active]" /db_xref="CDD:238020" misc_feature order(2775..2780,2784..2789) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" misc_feature 2850..3137 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature order(3102..3107,3111..3116) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" misc_feature 3159..3473 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature order(3438..3443,3447..3452) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" misc_feature order(3486..3488,3681..3683,3726..3728) /gene="LOC111066347" /note="Interdomain contacts [active]" /db_xref="CDD:238020" misc_feature 3489..3740 /gene="LOC111066347" /note="Fibronectin type III domain; Region: fn3; pfam00041" /db_xref="CDD:394996" misc_feature 3780..4049 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature order(3780..3782,3978..3980,4023..4025) /gene="LOC111066347" /note="Interdomain contacts [active]" /db_xref="CDD:238020" misc_feature order(4026..4031,4035..4040) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" misc_feature 4086..4352 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature 4407..4664 /gene="LOC111066347" /note="Fibronectin type III domain; Region: fn3; pfam00041" /db_xref="CDD:394996" misc_feature 4626..6281 /gene="LOC111066347" /note="Fibronectin type 3 domain [General function prediction only]; Region: FN3; COG3401" /db_xref="CDD:442628" misc_feature order(4650..4655,4659..4664) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" misc_feature 6234..6554 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature order(6234..6236,6486..6488,6531..6533) /gene="LOC111066347" /note="Interdomain contacts [active]" /db_xref="CDD:238020" misc_feature order(6534..6539,6543..6548) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" ORIGIN 1 cctgcgcgca taacggttgt tcttgccgac tacgtcgcat cgtcgtcgtc attttcgtcg 61 tcgctcgtag ttcgtggctc tcggtcgctc caacttgctg cggcgcgtgt ttcgaaacac 121 agcagaccag acgaggaaga agtctagaga agcaaatggt tcaataaaga aacaacattg 181 aaagcaggcg caagagaaag tcaacatgaa ttaagaaaaa cgcaagaaaa ttcaaaaaca 241 attaaaattt aatcaaacaa aacaacaaaa aaaccaaaaa caaaattcag aagcaggcga 301 aaagaatcaa cgatatcgaa gaaaaacagc cgcaagagag agagccgaaa atgaagagag 361 accagcggcg atcttcagcg tcgtcgctgc gtcgccgtcg tcgttggtgc gtcgacgtca 421 acgaaaaagg aacacgaatg tggctcaaaa tttcgctgtc gcagccgctg gaagcgtcgc 481 tgtttgtgct ggcagcgctg ctgctgctca atgcggacag ctgctcatgt tacgcggatg 541 ccaatccgca gcaacaacag cagctggtcc agcagcagca gcaacaactt caggcgccac 601 gttttaccac acacccatcg tcatcgggct cgattgtgag cgagggcagc accaagatcc 661 tacagtgcca tgctttgggt tatccacagc cgacatatcg ttggctgaag gacggcaagt 721 ccgtgggcga gttctcatcg agtcagttct atcggttcca cagcacacgg cgcgaggatg 781 cgggcagcta tcagtgcatt gccaagaacg atgccggatc catattcagc gagaagagcg 841 acgttgtagt ggcctacatg ggcatctttg agaacgtcac cgagggacgc ctaactgttg 901 tgagcggaca tccggccatc ttcgatatgc cggccattga gtcggtgcca acgccatcgg 961 tgctgtggca gtcggcggac gggtcgctca actacgacat caagtacgcc ttcacccagg 1021 ccaatcagct gattatactg agcgtggacg agaacgatcg gaggggctac cgggcgcggg 1081 cgatcaacac gcagctgggc aaggaggaga tcagcgcgtt cgttcatctg aatgtcagtg 1141 gcgatccgta catagaggtg gcacccgaga taattgtacg gccgcaggat gtcaaggtca 1201 agaccggcac tggcgtcctc gagctgcagt gcatcgccaa tgcgcgaccc ctgcacgaac 1261 tggagacgat ttggctgaag gacggcctcg ccgtggacac gaccggcgtg cggcacaccc 1321 tcaacgatcc ctggaaccgc accctggccc tcctgcaagc caacagctcg cactccggcg 1381 agtacacctg tcaggtgcgc ctgcgcagcg gtggctatcc aacggtcacc gcctcagccc 1441 gcgtccaaat tctcgagccg cccgtcttct tcacgcccat gcgagcggaa acctttggtg 1501 aatttggcgg ccaggtgcag ctgccctgcg atgtggtggg cgagcccacg ccccaagttg 1561 aatggttccg gaatgcggag tctgtcgagg cgaatgtgca aagcggaaga tactcactgg 1621 gagaggataa tacgctgata attaagaaac taatactgga tgattcggcc atgtttcagt 1681 gcctggcccg aaatgaggcc ggcgagaact cagccagcac ctggctgcgc gtcaaaacag 1741 aaacagaaac agaagcagcc aacagccgca tcaagcgctt ggcccagcca cgcatcttaa 1801 gagtaagagc ttcgcatggt ggttcgggaa cggtcgcggg atcggtcacg ggatcgggat 1861 caggatctgg ttccacctcc aactcccatc agcagggtag acgcaagcag tttcgctttg 1921 cctcagcgcc ggtctttgag cagccgcccc agaatgtgac cgccctggat ggcaaggatg 1981 cgacgatctc ctgtcgggcc attggctcgc ccaatcccaa tgttacctgg atctacaatg 2041 aaacccaact ggttgagata tccagtcgcg ttcagatact cgaatcgggt gatttactca 2101 tctcgaatat ccgtgccacg gacgcgggac tctacatctg tgtgcgggcc aacgaggcgg 2161 gcagcgtcaa gggcgaggcc ttgctaagcg tgttagtgcg gacacaaatc atacagccgc 2221 cagtggacac catcgtgctg ctgggcctga ccgcgacact gcagtgcaag gtgtccagcg 2281 acccgagcgt gccctacaac atcgactggt accgggaggg ccaaatggcg cccatcagca 2341 actcgcagcg gattggagtg caggcggacg ggcagctgga gatccaggcg gtgcgggcca 2401 gcgatgtggg cagctattcg tgcgtggtta catcgccggg cggcaatgag acacggtcgg 2461 cccgtctcag tgtcatcgag ctgcccttcc cgcccagcaa cgtgcgggtg gagcgtctgc 2521 cagagccgca gcagcgcagc atcaatgtgt cctggacgcc cggattcgat ggcaacagtc 2581 caatctccaa atttattatc cagcgacgtg aggtctctga attggaaaaa ttcgtaggtc 2641 cagttccaga tccccttctc aattggatca ccgaactgag caacgtatcg gccaatcagc 2701 ggtggatgct gctggagaac ctcaaggcgg ccaccgtcta tcagtttcgt gtcagtgccg 2761 tcaatcgggt cggcgagggc tccccctcgg agcccagcaa tgttgttgag ctgccccaag 2821 aagagtttcc gctttatcga gctccttcgg gaccgcctgt gggctttgtg ggctcggcac 2881 ggtccatgtc cgagatcatt acgcagtggc agccgccgca ggaggagcat cgcaacggac 2941 agatcctggg ctacattctg cgctatcgcc tgttcgggta caacaatgtg ccgtggtcct 3001 accagaacat caccaacgag gcgcagcgca actttctgat ccaggagctg atcacgtgga 3061 aggactacat cgtgcagatt gcggccttca acaacatggg cgtgggcgtc tacacggagg 3121 gggccaagat caagaccaag gagggtgtgc ccgaggcacc gcccaccaac gtcagggtga 3181 aggccctcaa ctcgacggcg gcgcagatca cgtggaagcc gccgaatccg cagcagatca 3241 acggcatcaa ccagggctac aagatccagg catggcagcg acggcagctc gatggggagg 3301 agcgggacat ggagcggcgc atgatgacgg tgccgcccag cctgatcgat ccactggccg 3361 agcagacgac ggtgctcggt ggcctggaca agttcgccaa gttcaatgtg accgtactct 3421 gcttcaccga tcccggtgac ggtgtggcca gccagctggt gccggtggag actttggacg 3481 acgtgcccga cgagataacg gccctgcact ttgacgatgt ctccgatcgg tccgtcaaag 3541 tgctgtgggc gccgccgcgc ttcgccaacg gcatcctcac cggctacacg gtgcgctacc 3601 aggtcaagga tcgccccgag acgatgaagt tcttcaacct gaccgccgac gacaacgagc 3661 tgacggtgaa ccagctgcag gcgacgaccc actactggtt cgaggtgtgc gcctggacgc 3721 gggtgggcag cgggccgccc aagacggcga cgatccaatc gggcgtggag ccggtgctgc 3781 cgcatgcgcc caccacactg gccctgtcca acatcgaagc gttttcggtg gtgctgcagt 3841 tcacgcccgg cttcgacggc aactcgagca tcaccaagtg gaaggtggag gcgcagacgg 3901 cccgcaacat gacctggttc acgctctgtg aaatcagcga tcccgatgcg gagaccctca 3961 ccgtgaccgg cctgatgccc ttcacccagt accggctgcg gctgagcgcc accaatgtgg 4021 tgggcagctc ccggccctcg gaccccacca aggactttca aaccattcag gccaagccga 4081 tgcacgcccc cttcaatgtg acggtacgcg caatgagcgc cctgcagctg cgcgtccgct 4141 ggataccgct gcagcagatg gagtggttcg gcaatccgcg cggctacaat gtcacctacc 4201 ggcaaatgga gcgcaccggc aagccctcca agcacccgcc ccgctccgtg atgatcgagg 4261 atcacacggc caactcgcat gtgctcgagg ggctcgagga gtggaccctc tacgaagtga 4321 tcatgaacgc ctgcaacgat gtgggctgct cgctggacag cggcctggcc atggagcgca 4381 ccagggaagc ggtgcccagc tacggcccgc tgcatgtgga ggcgaacgcc acctcctcga 4441 cgacggtggt ggtgcgctgg ggcgagatac cgccccacca tcgcaacggc cagatcgatg 4501 gctacaaggt gtactacgcg gccaccgagc gcggcatgca ggtgctctac aagacgatac 4561 ccaacaacag ctccttcacc accaccctca ccgagctgca gaagtttgtg gtgtaccacg 4621 tccaggtgct ggcctacacg cggctcggca acggcgccct cagcaccccg cccatccggg 4681 tgcagacgtt cgaggacacg cccggatcac cgtccaatgt gagcttcccg gacgtcacct 4741 tctcgatggc gcgcatcatc tgggacgtgc cgatggaccc caatggcgag atactcgcct 4801 accaggtcac ctacacgctc aacggaagcg ccaatctgaa ctacagccgc gagtttccgc 4861 cctcggatcg caccttccgg gccaccggcc tgatgcccga gcgctactac agcttcagcg 4921 tgacggccca gacacgcctc ggctggggca aaacggcctc ggtgctggtg tacacgacca 4981 acaacaggga ccgtccgcag gcaccgtccg ggccgcaggt gtcgcgcagc cagatccagg 5041 cccatcagat caccttcaac tggacgccgg gccgcgacgg gttcgccccg ctgcgatact 5101 acacggtcga gatgcgggag aacgagggcc gctggcagcc gctgcccgag cgcgtcgatc 5161 ccacactcag ctcgtacacg gccctgggtc tgcgtccgta caccacctac cagttccgca 5221 ttcaggcgac caacgatctg ggcccgtcgg cgttcagccg agagagcatt gtggtgcgca 5281 ccctgcccgc cgccccagcg gtgggtgtgg ggggactgaa ggtggtgccc ataacgacca 5341 cctcggtgcg ggtgcagtgg ggggcgctgg agacgggcat gtggaacggc gacgcggcca 5401 ccgggggata ccgcatactg taccagcagc tgtcggactt cgcaccggcc ctgcagtcga 5461 ccccgaagac ggatgtgatg ggcatcaatg agaacagcgt ggtgctgtcc gatctgcagc 5521 aggaccgcaa ctacgagatc gtggtgctgc cattcaattc gcagggaccg ggcccggcca 5581 caccgccgac cgccgtctat gtgggcgagg cggtgcccac tggagagccg cggggcgtgg 5641 atgccacggc catttccagc acggaggtgc gcctgagctg gaagccaccg aagcagagca 5701 gccagaacgg agagatactc ggctacaaga tattctattt ggtgacgtgg tcgccgcagg 5761 ccctcgagcc gggccgcaaa ttcgaggagg aaatcgaagt ggtctcggcc acggccacat 5821 cgcacagcct ggtctttctc gataagttca ccgagtaccg catccagttg ctggccttca 5881 atccggccgg agacgggccg aggtccgccc ccgtcactgc gaagacgatg ccgggcgtgc 5941 ccagtgcccc gctcaatctg cgcttttcgg acatcacaat gcagagcctg gaggtgacct 6001 gggacccgcc caagctgctc aacggcgaga ttgttggcta tctggtcacc tacgagacca 6061 ccgaggagaa cgaaaagttc agcaagcagg tgaagcagaa ggtgtccaac accacgctgc 6121 gtgtgcagaa tctggaggag gaggtcacct acaccttcac cgtgcgcgcc cagacgaacg 6181 actatggacc ggcggtgagc gcgaatgtga ccacaggccc ccaggatggc tccccggtgg 6241 caccgcgcga tctcacactc acaaagacac tgtccagcgt tgaggtacat tgggtcaatg 6301 gaccctccgg ccggggcccc atactgggct acctcatcga ggccaagaag cgagaaaatg 6361 gagagccctc atttatttct aatagacctc cctatcttcg cttagacgac tcccgctgga 6421 ctaagattga gcagtccaga aagggtacca tgaaggagtt taccgtcagc taccacatcc 6481 tgatgccatc gacggcgtat ttgttccggg taattgctta caataagtat ggcatatcgt 6541 tccctgttta ctcgaaggac tcgatactga cgccctcgaa gctgcatctg gagtacggct 6601 atctgcagca caagcccttc tacaggcaga cctggttcat ggtctccctg gcggccacct 6661 cgatcgtcat cattgtcatg gtcattgcgg tgctctgtgt gaagagcaag agctacaagt 6721 acaagcagga ggcacaaaag acgctggagg agtccatggc catgtcgatt gatgagcgcc 6781 aggagctggc cctggagctg tatcgttcgc gtcacggcgt cggcaccggc accctgaaca 6841 gcgttggaac attgcgcagc ggaactttgg gaaccctcgg ccgtaagtcc accaaccgac 6901 accagccggt gagtgtgcat ttgggtaaga gtccaccgcg accctcgccc gcatcggtgg 6961 cgtaccacag cgatgaggag agtctcaagt gctacgacga gaatcccgac gacagcagtg 7021 ttacggaaaa gccatccgag gtgagcagct cggaggcatc ccagcactcg gagagcgaga 7081 acgagagcgt gaggagcgat ccgcactcgt tcgtcaatca ctatgcgaat gtgaatgact 7141 cgctgcggca gtcctggaag aagaccaagc ccgtgcgcaa ctactcgagc tacacagact 7201 ccgagccgga gggcagtgca gtgatgagtc tcaatggtgg ccagattatt gtcaataata 7261 tggccagatc gagggcacca ctgcccggct tctcgtcatt tgtctgacaa tcaaccgaat 7321 tctaagatct atgccgtggt agcagcagca ccgtcatccg cgagacattt gtctgaatta 7381 ttttggaaac gataacggaa aacggaaaaa cggaggctga agctgaaacc ggagctggag 7441 ttgcagtggg gagcgttcta acgagttcga cacggatgta gcgagtgggc taaactgcct 7501 gcctgcctgc aactgttctg tctggctctc cctggatctt cgtagctgtc cggcgaggcg 7561 ctgctacatg gatatttatc gtagtt