Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]
LOCUS XM_041591952 7574 bp mRNA linear INV 14-MAY-2021 transcript variant X2, mRNA. ACCESSION XM_041591952 VERSION XM_041591952.1 DBLINK BioProject: PRJNA728747 KEYWORDS RefSeq. SOURCE Drosophila obscura ORGANISM Drosophila obscura Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora. COMMENT MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NW_024542752.1) annotated using gene prediction method: Gnomon. Also see: Documentation of NCBI's Annotation Process ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Status :: Full annotation Annotation Name :: Drosophila obscura Annotation Release 101 Annotation Version :: 101 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 8.6 Annotation Method :: Best-placed RefSeq; Gnomon Features Annotated :: Gene; mRNA; CDS; ncRNA ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7574 /organism="Drosophila obscura" /mol_type="mRNA" /isolate="BZ-5 IFL" /db_xref="taxon:7282" /chromosome="Unknown" /sex="male" /tissue_type="whole fly" /dev_stage="Adult fly" /geo_loc_name="Serbia: Babin Zub" /collection_date="2017" gene 1..7574 /gene="LOC111066347" /note="Derived by automated computational analysis using gene prediction method: Gnomon. Supporting evidence includes similarity to: 6 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 3 samples with support for all annotated introns" /db_xref="GeneID:111066347" CDS 351..7295 /gene="LOC111066347" /codon_start=1 /product="protein sidekick isoform X2" /protein_id="XP_041447886.1" /db_xref="GeneID:111066347" /translation="MKRDQRRSSASSLRRRRRWCVDVNEKGTRMWLKISLSQPLEASL FVLAALLLLNADSCSCYADANPQQQQQLVQQQQQQLQAPRFTTHPSSSGSIVSEGSTK ILQCHALGYPQPTYRWLKDGKSVGEFSSSQFYRFHSTRREDAGSYQCIAKNDAGSIFS EKSDVVVAYMGIFENVTEGRLTVVSGHPAIFDMPAIESVPTPSVLWQSADGSLNYDIK YAFTQANQLIILSVDENDRRGYRARAINTQLGKEEISAFVHLNVSGDPYIEVAPEIIV RPQDVKVKTGTGVLELQCIANARPLHELETIWLKDGLAVDTTGVRHTLNDPWNRTLAL LQANSSHSGEYTCQVRLRSGGYPTVTASARVQILEPPVFFTPMRAETFGEFGGQVQLP CDVVGEPTPQVEWFRNAESVEANVQSGRYSLGEDNTLIIKKLILDDSAMFQCLARNEA GENSASTWLRVKTETETEAANSRIKRLAQPRILRVRASHGGSGTVAGSVTGSGSGSGS TSNSHQQGRRKQFRFASAPVFEQPPQNVTALDGKDATISCRAIGSPNPNVTWIYNETQ LVEISSRVQILESGDLLISNIRATDAGLYICVRANEAGSVKGEALLSVLVRTQIIQPP VDTIVLLGLTATLQCKVSSDPSVPYNIDWYREGQMAPISNSQRIGVQADGQLEIQAVR ASDVGSYSCVVTSPGGNETRSARLSVIELPFPPSNVRVERLPEPQQRSINVSWTPGFD GNSPISKFIIQRREVSELGPVPDPLLNWITELSNVSANQRWMLLENLKAATVYQFRVS AVNRVGEGSPSEPSNVVELPQEEFPLYRAPSGPPVGFVGSARSMSEIITQWQPPQEEH RNGQILGYILRYRLFGYNNVPWSYQNITNEAQRNFLIQELITWKDYIVQIAAFNNMGV GVYTEGAKIKTKEGVPEAPPTNVRVKALNSTAAQITWKPPNPQQINGINQGYKIQAWQ RRQLDGEERDMERRMMTVPPSLIDPLAEQTTVLGGLDKFAKFNVTVLCFTDPGDGVAS QLVPVETLDDVPDEITALHFDDVSDRSVKVLWAPPRFANGILTGYTVRYQVKDRPETM KFFNLTADDNELTVNQLQATTHYWFEVCAWTRVGSGPPKTATIQSGVEPVLPHAPTTL ALSNIEAFSVVLQFTPGFDGNSSITKWKVEAQTARNMTWFTLCEISDPDAETLTVTGL MPFTQYRLRLSATNVVGSSRPSDPTKDFQTIQAKPMHAPFNVTVRAMSALQLRVRWIP LQQMEWFGNPRGYNVTYRQMERTGKPSKHPPRSVMIEDHTANSHVLEGLEEWTLYEVI MNACNDVGCSLDSGLAMERTREAVPSYGPLHVEANATSSTTVVVRWGEIPPHHRNGQI DGYKVYYAATERGMQVLYKTIPNNSSFTTTLTELQKFVVYHVQVLAYTRLGNGALSTP PIRVQTFEDTPGSPSNVSFPDVTFSMARIIWDVPMDPNGEILAYQVTYTLNGSANLNY SREFPPSDRTFRATGLMPERYYSFSVTAQTRLGWGKTASVLVYTTNNRDRPQAPSGPQ VSRSQIQAHQITFNWTPGRDGFAPLRYYTVEMRENEGRWQPLPERVDPTLSSYTALGL RPYTTYQFRIQATNDLGPSAFSRESIVVRTLPAAPAVGVGGLKVVPITTTSVRVQWGA LETGMWNGDAATGGYRILYQQLSDFAPALQSTPKTDVMGINENSVVLSDLQQDRNYEI VVLPFNSQGPGPATPPTAVYVGEAVPTGEPRGVDATAISSTEVRLSWKPPKQSSQNGE ILGYKIFYLVTWSPQALEPGRKFEEEIEVVSATATSHSLVFLDKFTEYRIQLLAFNPA GDGPRSAPVTAKTMPGVPSAPLNLRFSDITMQSLEVTWDPPKLLNGEIVGYLVTYETT EENEKFSKQVKQKVSNTTLRVQNLEEEVTYTFTVRAQTNDYGPAVSANVTTGPQDGSP VAPRDLTLTKTLSSVEVHWVNGPSGRGPILGYLIEAKKRENGEPSFISNRPPYLRLDD SRWTKIEQSRKGTMKEFTVSYHILMPSTAYLFRVIAYNKYGISFPVYSKDSILTPSKL HLEYGYLQHKPFYRQTWFMVSLAATSIVIIVMVIAVLCVKSKSYKYKQEAQKTLEESM AMSIDERQELALELYRSRHGVGTGTLNSVGTLRSGTLGTLGRKSTNRHQPVSVHLGKS PPRPSPASVAYHSDEESLKCYDENPDDSSVTEKPSEVSSSEASQHSESENESVRSDPH SFVNHYANVNDSLRQSWKKTKPVRNYSSYTDSEPEGSAVMSLNGGQIIVNNMARSRAP LPGFSSFV" misc_feature 597..809 /gene="LOC111066347" /note="Immunoglobulin domain; Region: Ig_3; pfam13927" /db_xref="CDD:464046" misc_feature 1164..1451 /gene="LOC111066347" /note="Immunoglobulin domain; Region: Ig; cl11960" /db_xref="CDD:472250" misc_feature 1218..1232 /gene="LOC111066347" /note="Ig strand B [structural motif]; Region: Ig strand B" /db_xref="CDD:409353" misc_feature 1263..1277 /gene="LOC111066347" /note="Ig strand C [structural motif]; Region: Ig strand C" /db_xref="CDD:409353" misc_feature 1338..1352 /gene="LOC111066347" /note="Ig strand E [structural motif]; Region: Ig strand E" /db_xref="CDD:409353" misc_feature 1380..1397 /gene="LOC111066347" /note="Ig strand F [structural motif]; Region: Ig strand F" /db_xref="CDD:409353" misc_feature 1422..1433 /gene="LOC111066347" /note="Ig strand G [structural motif]; Region: Ig strand G" /db_xref="CDD:409353" misc_feature 1461..1736 /gene="LOC111066347" /note="Immunoglobulin domain; Region: Ig; cl11960" /db_xref="CDD:472250" misc_feature 1515..1529 /gene="LOC111066347" /note="Ig strand B [structural motif]; Region: Ig strand B" /db_xref="CDD:409543" misc_feature 1554..1568 /gene="LOC111066347" /note="Ig strand C [structural motif]; Region: Ig strand C" /db_xref="CDD:409543" misc_feature 1632..1643 /gene="LOC111066347" /note="Ig strand E [structural motif]; Region: Ig strand E" /db_xref="CDD:409543" misc_feature 1671..1688 /gene="LOC111066347" /note="Ig strand F [structural motif]; Region: Ig strand F" /db_xref="CDD:409543" misc_feature 1710..1721 /gene="LOC111066347" /note="Ig strand G [structural motif]; Region: Ig strand G" /db_xref="CDD:409543" misc_feature 1929..2192 /gene="LOC111066347" /note="Immunoglobulin I-set domain; Region: I-set; pfam07679" /db_xref="CDD:400151" misc_feature 1980..1994 /gene="LOC111066347" /note="Ig strand B [structural motif]; Region: Ig strand B" /db_xref="CDD:409562" misc_feature 2019..2033 /gene="LOC111066347" /note="Ig strand C [structural motif]; Region: Ig strand C" /db_xref="CDD:409562" misc_feature 2091..2102 /gene="LOC111066347" /note="Ig strand E [structural motif]; Region: Ig strand E" /db_xref="CDD:409562" misc_feature 2130..2147 /gene="LOC111066347" /note="Ig strand F [structural motif]; Region: Ig strand F" /db_xref="CDD:409562" misc_feature 2169..2180 /gene="LOC111066347" /note="Ig strand G [structural motif]; Region: Ig strand G" /db_xref="CDD:409562" misc_feature 2205..2474 /gene="LOC111066347" /note="Immunoglobulin I-set domain; Region: I-set; pfam07679" /db_xref="CDD:400151" misc_feature 2253..2267 /gene="LOC111066347" /note="Ig strand B [structural motif]; Region: Ig strand B" /db_xref="CDD:409544" misc_feature 2298..2312 /gene="LOC111066347" /note="Ig strand C [structural motif]; Region: Ig strand C" /db_xref="CDD:409544" misc_feature 2370..2384 /gene="LOC111066347" /note="Ig strand E [structural motif]; Region: Ig strand E" /db_xref="CDD:409544" misc_feature <2412..>3203 /gene="LOC111066347" /note="Fibronectin type 3 domain [General function prediction only]; Region: FN3; COG3401" /db_xref="CDD:442628" misc_feature 2412..2429 /gene="LOC111066347" /note="Ig strand F [structural motif]; Region: Ig strand F" /db_xref="CDD:409544" misc_feature 2451..2462 /gene="LOC111066347" /note="Ig strand G [structural motif]; Region: Ig strand G" /db_xref="CDD:409544" misc_feature 2484..2795 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature order(2484..2486,2715..2717,2760..2762) /gene="LOC111066347" /note="Interdomain contacts [active]" /db_xref="CDD:238020" misc_feature order(2763..2768,2772..2777) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" misc_feature 3147..3461 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature order(3426..3431,3435..3440) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" misc_feature order(3474..3476,3669..3671,3714..3716) /gene="LOC111066347" /note="Interdomain contacts [active]" /db_xref="CDD:238020" misc_feature 3477..3728 /gene="LOC111066347" /note="Fibronectin type III domain; Region: fn3; pfam00041" /db_xref="CDD:394996" misc_feature 3768..4037 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature order(3768..3770,3966..3968,4011..4013) /gene="LOC111066347" /note="Interdomain contacts [active]" /db_xref="CDD:238020" misc_feature order(4014..4019,4023..4028) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" misc_feature 4074..4340 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature 4395..4652 /gene="LOC111066347" /note="Fibronectin type III domain; Region: fn3; pfam00041" /db_xref="CDD:394996" misc_feature 4614..6269 /gene="LOC111066347" /note="Fibronectin type 3 domain [General function prediction only]; Region: FN3; COG3401" /db_xref="CDD:442628" misc_feature order(4638..4643,4647..4652) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" misc_feature 6222..6542 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature order(6222..6224,6474..6476,6519..6521) /gene="LOC111066347" /note="Interdomain contacts [active]" /db_xref="CDD:238020" misc_feature order(6522..6527,6531..6536) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" ORIGIN 1 cctgcgcgca taacggttgt tcttgccgac tacgtcgcat cgtcgtcgtc attttcgtcg 61 tcgctcgtag ttcgtggctc tcggtcgctc caacttgctg cggcgcgtgt ttcgaaacac 121 agcagaccag acgaggaaga agtctagaga agcaaatggt tcaataaaga aacaacattg 181 aaagcaggcg caagagaaag tcaacatgaa ttaagaaaaa cgcaagaaaa ttcaaaaaca 241 attaaaattt aatcaaacaa aacaacaaaa aaaccaaaaa caaaattcag aagcaggcga 301 aaagaatcaa cgatatcgaa gaaaaacagc cgcaagagag agagccgaaa atgaagagag 361 accagcggcg atcttcagcg tcgtcgctgc gtcgccgtcg tcgttggtgc gtcgacgtca 421 acgaaaaagg aacacgaatg tggctcaaaa tttcgctgtc gcagccgctg gaagcgtcgc 481 tgtttgtgct ggcagcgctg ctgctgctca atgcggacag ctgctcatgt tacgcggatg 541 ccaatccgca gcaacaacag cagctggtcc agcagcagca gcaacaactt caggcgccac 601 gttttaccac acacccatcg tcatcgggct cgattgtgag cgagggcagc accaagatcc 661 tacagtgcca tgctttgggt tatccacagc cgacatatcg ttggctgaag gacggcaagt 721 ccgtgggcga gttctcatcg agtcagttct atcggttcca cagcacacgg cgcgaggatg 781 cgggcagcta tcagtgcatt gccaagaacg atgccggatc catattcagc gagaagagcg 841 acgttgtagt ggcctacatg ggcatctttg agaacgtcac cgagggacgc ctaactgttg 901 tgagcggaca tccggccatc ttcgatatgc cggccattga gtcggtgcca acgccatcgg 961 tgctgtggca gtcggcggac gggtcgctca actacgacat caagtacgcc ttcacccagg 1021 ccaatcagct gattatactg agcgtggacg agaacgatcg gaggggctac cgggcgcggg 1081 cgatcaacac gcagctgggc aaggaggaga tcagcgcgtt cgttcatctg aatgtcagtg 1141 gcgatccgta catagaggtg gcacccgaga taattgtacg gccgcaggat gtcaaggtca 1201 agaccggcac tggcgtcctc gagctgcagt gcatcgccaa tgcgcgaccc ctgcacgaac 1261 tggagacgat ttggctgaag gacggcctcg ccgtggacac gaccggcgtg cggcacaccc 1321 tcaacgatcc ctggaaccgc accctggccc tcctgcaagc caacagctcg cactccggcg 1381 agtacacctg tcaggtgcgc ctgcgcagcg gtggctatcc aacggtcacc gcctcagccc 1441 gcgtccaaat tctcgagccg cccgtcttct tcacgcccat gcgagcggaa acctttggtg 1501 aatttggcgg ccaggtgcag ctgccctgcg atgtggtggg cgagcccacg ccccaagttg 1561 aatggttccg gaatgcggag tctgtcgagg cgaatgtgca aagcggaaga tactcactgg 1621 gagaggataa tacgctgata attaagaaac taatactgga tgattcggcc atgtttcagt 1681 gcctggcccg aaatgaggcc ggcgagaact cagccagcac ctggctgcgc gtcaaaacag 1741 aaacagaaac agaagcagcc aacagccgca tcaagcgctt ggcccagcca cgcatcttaa 1801 gagtaagagc ttcgcatggt ggttcgggaa cggtcgcggg atcggtcacg ggatcgggat 1861 caggatctgg ttccacctcc aactcccatc agcagggtag acgcaagcag tttcgctttg 1921 cctcagcgcc ggtctttgag cagccgcccc agaatgtgac cgccctggat ggcaaggatg 1981 cgacgatctc ctgtcgggcc attggctcgc ccaatcccaa tgttacctgg atctacaatg 2041 aaacccaact ggttgagata tccagtcgcg ttcagatact cgaatcgggt gatttactca 2101 tctcgaatat ccgtgccacg gacgcgggac tctacatctg tgtgcgggcc aacgaggcgg 2161 gcagcgtcaa gggcgaggcc ttgctaagcg tgttagtgcg gacacaaatc atacagccgc 2221 cagtggacac catcgtgctg ctgggcctga ccgcgacact gcagtgcaag gtgtccagcg 2281 acccgagcgt gccctacaac atcgactggt accgggaggg ccaaatggcg cccatcagca 2341 actcgcagcg gattggagtg caggcggacg ggcagctgga gatccaggcg gtgcgggcca 2401 gcgatgtggg cagctattcg tgcgtggtta catcgccggg cggcaatgag acacggtcgg 2461 cccgtctcag tgtcatcgag ctgcccttcc cgcccagcaa cgtgcgggtg gagcgtctgc 2521 cagagccgca gcagcgcagc atcaatgtgt cctggacgcc cggattcgat ggcaacagtc 2581 caatctccaa atttattatc cagcgacgtg aggtctctga attgggtcca gttccagatc 2641 cccttctcaa ttggatcacc gaactgagca acgtatcggc caatcagcgg tggatgctgc 2701 tggagaacct caaggcggcc accgtctatc agtttcgtgt cagtgccgtc aatcgggtcg 2761 gcgagggctc cccctcggag cccagcaatg ttgttgagct gccccaagaa gagtttccgc 2821 tttatcgagc tccttcggga ccgcctgtgg gctttgtggg ctcggcacgg tccatgtccg 2881 agatcattac gcagtggcag ccgccgcagg aggagcatcg caacggacag atcctgggct 2941 acattctgcg ctatcgcctg ttcgggtaca acaatgtgcc gtggtcctac cagaacatca 3001 ccaacgaggc gcagcgcaac tttctgatcc aggagctgat cacgtggaag gactacatcg 3061 tgcagattgc ggccttcaac aacatgggcg tgggcgtcta cacggagggg gccaagatca 3121 agaccaagga gggtgtgccc gaggcaccgc ccaccaacgt cagggtgaag gccctcaact 3181 cgacggcggc gcagatcacg tggaagccgc cgaatccgca gcagatcaac ggcatcaacc 3241 agggctacaa gatccaggca tggcagcgac ggcagctcga tggggaggag cgggacatgg 3301 agcggcgcat gatgacggtg ccgcccagcc tgatcgatcc actggccgag cagacgacgg 3361 tgctcggtgg cctggacaag ttcgccaagt tcaatgtgac cgtactctgc ttcaccgatc 3421 ccggtgacgg tgtggccagc cagctggtgc cggtggagac tttggacgac gtgcccgacg 3481 agataacggc cctgcacttt gacgatgtct ccgatcggtc cgtcaaagtg ctgtgggcgc 3541 cgccgcgctt cgccaacggc atcctcaccg gctacacggt gcgctaccag gtcaaggatc 3601 gccccgagac gatgaagttc ttcaacctga ccgccgacga caacgagctg acggtgaacc 3661 agctgcaggc gacgacccac tactggttcg aggtgtgcgc ctggacgcgg gtgggcagcg 3721 ggccgcccaa gacggcgacg atccaatcgg gcgtggagcc ggtgctgccg catgcgccca 3781 ccacactggc cctgtccaac atcgaagcgt tttcggtggt gctgcagttc acgcccggct 3841 tcgacggcaa ctcgagcatc accaagtgga aggtggaggc gcagacggcc cgcaacatga 3901 cctggttcac gctctgtgaa atcagcgatc ccgatgcgga gaccctcacc gtgaccggcc 3961 tgatgccctt cacccagtac cggctgcggc tgagcgccac caatgtggtg ggcagctccc 4021 ggccctcgga ccccaccaag gactttcaaa ccattcaggc caagccgatg cacgccccct 4081 tcaatgtgac ggtacgcgca atgagcgccc tgcagctgcg cgtccgctgg ataccgctgc 4141 agcagatgga gtggttcggc aatccgcgcg gctacaatgt cacctaccgg caaatggagc 4201 gcaccggcaa gccctccaag cacccgcccc gctccgtgat gatcgaggat cacacggcca 4261 actcgcatgt gctcgagggg ctcgaggagt ggaccctcta cgaagtgatc atgaacgcct 4321 gcaacgatgt gggctgctcg ctggacagcg gcctggccat ggagcgcacc agggaagcgg 4381 tgcccagcta cggcccgctg catgtggagg cgaacgccac ctcctcgacg acggtggtgg 4441 tgcgctgggg cgagataccg ccccaccatc gcaacggcca gatcgatggc tacaaggtgt 4501 actacgcggc caccgagcgc ggcatgcagg tgctctacaa gacgataccc aacaacagct 4561 ccttcaccac caccctcacc gagctgcaga agtttgtggt gtaccacgtc caggtgctgg 4621 cctacacgcg gctcggcaac ggcgccctca gcaccccgcc catccgggtg cagacgttcg 4681 aggacacgcc cggatcaccg tccaatgtga gcttcccgga cgtcaccttc tcgatggcgc 4741 gcatcatctg ggacgtgccg atggacccca atggcgagat actcgcctac caggtcacct 4801 acacgctcaa cggaagcgcc aatctgaact acagccgcga gtttccgccc tcggatcgca 4861 ccttccgggc caccggcctg atgcccgagc gctactacag cttcagcgtg acggcccaga 4921 cacgcctcgg ctggggcaaa acggcctcgg tgctggtgta cacgaccaac aacagggacc 4981 gtccgcaggc accgtccggg ccgcaggtgt cgcgcagcca gatccaggcc catcagatca 5041 ccttcaactg gacgccgggc cgcgacgggt tcgccccgct gcgatactac acggtcgaga 5101 tgcgggagaa cgagggccgc tggcagccgc tgcccgagcg cgtcgatccc acactcagct 5161 cgtacacggc cctgggtctg cgtccgtaca ccacctacca gttccgcatt caggcgacca 5221 acgatctggg cccgtcggcg ttcagccgag agagcattgt ggtgcgcacc ctgcccgccg 5281 ccccagcggt gggtgtgggg ggactgaagg tggtgcccat aacgaccacc tcggtgcggg 5341 tgcagtgggg ggcgctggag acgggcatgt ggaacggcga cgcggccacc gggggatacc 5401 gcatactgta ccagcagctg tcggacttcg caccggccct gcagtcgacc ccgaagacgg 5461 atgtgatggg catcaatgag aacagcgtgg tgctgtccga tctgcagcag gaccgcaact 5521 acgagatcgt ggtgctgcca ttcaattcgc agggaccggg cccggccaca ccgccgaccg 5581 ccgtctatgt gggcgaggcg gtgcccactg gagagccgcg gggcgtggat gccacggcca 5641 tttccagcac ggaggtgcgc ctgagctgga agccaccgaa gcagagcagc cagaacggag 5701 agatactcgg ctacaagata ttctatttgg tgacgtggtc gccgcaggcc ctcgagccgg 5761 gccgcaaatt cgaggaggaa atcgaagtgg tctcggccac ggccacatcg cacagcctgg 5821 tctttctcga taagttcacc gagtaccgca tccagttgct ggccttcaat ccggccggag 5881 acgggccgag gtccgccccc gtcactgcga agacgatgcc gggcgtgccc agtgccccgc 5941 tcaatctgcg cttttcggac atcacaatgc agagcctgga ggtgacctgg gacccgccca 6001 agctgctcaa cggcgagatt gttggctatc tggtcaccta cgagaccacc gaggagaacg 6061 aaaagttcag caagcaggtg aagcagaagg tgtccaacac cacgctgcgt gtgcagaatc 6121 tggaggagga ggtcacctac accttcaccg tgcgcgccca gacgaacgac tatggaccgg 6181 cggtgagcgc gaatgtgacc acaggccccc aggatggctc cccggtggca ccgcgcgatc 6241 tcacactcac aaagacactg tccagcgttg aggtacattg ggtcaatgga ccctccggcc 6301 ggggccccat actgggctac ctcatcgagg ccaagaagcg agaaaatgga gagccctcat 6361 ttatttctaa tagacctccc tatcttcgct tagacgactc ccgctggact aagattgagc 6421 agtccagaaa gggtaccatg aaggagttta ccgtcagcta ccacatcctg atgccatcga 6481 cggcgtattt gttccgggta attgcttaca ataagtatgg catatcgttc cctgtttact 6541 cgaaggactc gatactgacg ccctcgaagc tgcatctgga gtacggctat ctgcagcaca 6601 agcccttcta caggcagacc tggttcatgg tctccctggc ggccacctcg atcgtcatca 6661 ttgtcatggt cattgcggtg ctctgtgtga agagcaagag ctacaagtac aagcaggagg 6721 cacaaaagac gctggaggag tccatggcca tgtcgattga tgagcgccag gagctggccc 6781 tggagctgta tcgttcgcgt cacggcgtcg gcaccggcac cctgaacagc gttggaacat 6841 tgcgcagcgg aactttggga accctcggcc gtaagtccac caaccgacac cagccggtga 6901 gtgtgcattt gggtaagagt ccaccgcgac cctcgcccgc atcggtggcg taccacagcg 6961 atgaggagag tctcaagtgc tacgacgaga atcccgacga cagcagtgtt acggaaaagc 7021 catccgaggt gagcagctcg gaggcatccc agcactcgga gagcgagaac gagagcgtga 7081 ggagcgatcc gcactcgttc gtcaatcact atgcgaatgt gaatgactcg ctgcggcagt 7141 cctggaagaa gaccaagccc gtgcgcaact actcgagcta cacagactcc gagccggagg 7201 gcagtgcagt gatgagtctc aatggtggcc agattattgt caataatatg gccagatcga 7261 gggcaccact gcccggcttc tcgtcatttg tctgacaatc aaccgaattc taagatctat 7321 gccgtggtag cagcagcacc gtcatccgcg agacatttgt ctgaattatt ttggaaacga 7381 taacggaaaa cggaaaaacg gaggctgaag ctgaaaccgg agctggagtt gcagtgggga 7441 gcgttctaac gagttcgaca cggatgtagc gagtgggcta aactgcctgc ctgcctgcaa 7501 ctgttctgtc tggctctccc tggatcttcg tagctgtccg gcgaggcgct gctacatgga 7561 tatttatcgt agtt