Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]
LOCUS XM_041591953 7568 bp mRNA linear INV 14-MAY-2021 transcript variant X3, mRNA. ACCESSION XM_041591953 VERSION XM_041591953.1 DBLINK BioProject: PRJNA728747 KEYWORDS RefSeq. SOURCE Drosophila obscura ORGANISM Drosophila obscura Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora. COMMENT MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NW_024542752.1) annotated using gene prediction method: Gnomon. Also see: Documentation of NCBI's Annotation Process ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Status :: Full annotation Annotation Name :: Drosophila obscura Annotation Release 101 Annotation Version :: 101 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 8.6 Annotation Method :: Best-placed RefSeq; Gnomon Features Annotated :: Gene; mRNA; CDS; ncRNA ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..7568 /organism="Drosophila obscura" /mol_type="mRNA" /isolate="BZ-5 IFL" /db_xref="taxon:7282" /chromosome="Unknown" /sex="male" /tissue_type="whole fly" /dev_stage="Adult fly" /geo_loc_name="Serbia: Babin Zub" /collection_date="2017" gene 1..7568 /gene="LOC111066347" /note="Derived by automated computational analysis using gene prediction method: Gnomon. Supporting evidence includes similarity to: 6 Proteins, and 100% coverage of the annotated genomic feature by RNAseq alignments, including 3 samples with support for all annotated introns" /db_xref="GeneID:111066347" CDS 351..7289 /gene="LOC111066347" /codon_start=1 /product="protein sidekick isoform X3" /protein_id="XP_041447887.1" /db_xref="GeneID:111066347" /translation="MKRDQRRSSASSLRRRRRWCVDVNEKGTRMWLKISLSQPLEASL FVLAALLLLNADSCSCYADANPQQQQQLVQQQQQQLQAPRFTTHPSSSGSIVSEGSTK ILQCHALGYPQPTYRWLKDGKSVGEFSSSQFYRFHSTRREDAGSYQCIAKNDAGSIFS EKSDVVVAYMGIFENVTEGRLTVVSGHPAIFDMPAIESVPTPSVLWQSADGSLNYDIK YAFTQANQLIILSVDENDRRGYRARAINTQLGKEEISAFVHLNVSGDPYIEVAPEIIV RPQDVKVKTGTGVLELQCIANARPLHELETIWLKDGLAVDTTGVRHTLNDPWNRTLAL LQANSSHSGEYTCQVRLRSGGYPTVTASARVQILEPPVFFTPMRAETFGEFGGQVQLP CDVVGEPTPQVEWFRNAESVEANVQSGRYSLGEDNTLIIKKLILDDSAMFQCLARNEA GENSASTWLRVKTETETEAANSRIKRLAQPRILRVRASHGGSGTVAGSVTGSGSGSGS TSNSHQQGRRKQFRFASAPVFEQPPQNVTALDGKDATISCRAIGSPNPNVTWIYNETQ LVEISSRVQILESGDLLISNIRATDAGLYICVRANEAGSVKGEALLSVLVRTQIIQPP VDTIVLLGLTATLQCKVSSDPSVPYNIDWYREGQMAPISNSQRIGVQADGQLEIQAVR ASDVGSYSCVVTSPGGNETRSARLSVIELPFPPSNVRVERLPEPQQRSINVSWTPGFD GNSPISKFIIQRREVSELEKFVGPVPDPLLNWITELSNVSANQRWMLLENLKAATVYQ FRVSAVNRVGEGSPSEPSNVVELPQEAPSGPPVGFVGSARSMSEIITQWQPPQEEHRN GQILGYILRYRLFGYNNVPWSYQNITNEAQRNFLIQELITWKDYIVQIAAFNNMGVGV YTEGAKIKTKEGVPEAPPTNVRVKALNSTAAQITWKPPNPQQINGINQGYKIQAWQRR QLDGEERDMERRMMTVPPSLIDPLAEQTTVLGGLDKFAKFNVTVLCFTDPGDGVASQL VPVETLDDVPDEITALHFDDVSDRSVKVLWAPPRFANGILTGYTVRYQVKDRPETMKF FNLTADDNELTVNQLQATTHYWFEVCAWTRVGSGPPKTATIQSGVEPVLPHAPTTLAL SNIEAFSVVLQFTPGFDGNSSITKWKVEAQTARNMTWFTLCEISDPDAETLTVTGLMP FTQYRLRLSATNVVGSSRPSDPTKDFQTIQAKPMHAPFNVTVRAMSALQLRVRWIPLQ QMEWFGNPRGYNVTYRQMERTGKPSKHPPRSVMIEDHTANSHVLEGLEEWTLYEVIMN ACNDVGCSLDSGLAMERTREAVPSYGPLHVEANATSSTTVVVRWGEIPPHHRNGQIDG YKVYYAATERGMQVLYKTIPNNSSFTTTLTELQKFVVYHVQVLAYTRLGNGALSTPPI RVQTFEDTPGSPSNVSFPDVTFSMARIIWDVPMDPNGEILAYQVTYTLNGSANLNYSR EFPPSDRTFRATGLMPERYYSFSVTAQTRLGWGKTASVLVYTTNNRDRPQAPSGPQVS RSQIQAHQITFNWTPGRDGFAPLRYYTVEMRENEGRWQPLPERVDPTLSSYTALGLRP YTTYQFRIQATNDLGPSAFSRESIVVRTLPAAPAVGVGGLKVVPITTTSVRVQWGALE TGMWNGDAATGGYRILYQQLSDFAPALQSTPKTDVMGINENSVVLSDLQQDRNYEIVV LPFNSQGPGPATPPTAVYVGEAVPTGEPRGVDATAISSTEVRLSWKPPKQSSQNGEIL GYKIFYLVTWSPQALEPGRKFEEEIEVVSATATSHSLVFLDKFTEYRIQLLAFNPAGD GPRSAPVTAKTMPGVPSAPLNLRFSDITMQSLEVTWDPPKLLNGEIVGYLVTYETTEE NEKFSKQVKQKVSNTTLRVQNLEEEVTYTFTVRAQTNDYGPAVSANVTTGPQDGSPVA PRDLTLTKTLSSVEVHWVNGPSGRGPILGYLIEAKKRENGEPSFISNRPPYLRLDDSR WTKIEQSRKGTMKEFTVSYHILMPSTAYLFRVIAYNKYGISFPVYSKDSILTPSKLHL EYGYLQHKPFYRQTWFMVSLAATSIVIIVMVIAVLCVKSKSYKYKQEAQKTLEESMAM SIDERQELALELYRSRHGVGTGTLNSVGTLRSGTLGTLGRKSTNRHQPVSVHLGKSPP RPSPASVAYHSDEESLKCYDENPDDSSVTEKPSEVSSSEASQHSESENESVRSDPHSF VNHYANVNDSLRQSWKKTKPVRNYSSYTDSEPEGSAVMSLNGGQIIVNNMARSRAPLP GFSSFV" misc_feature 597..809 /gene="LOC111066347" /note="Immunoglobulin domain; Region: Ig_3; pfam13927" /db_xref="CDD:464046" misc_feature 1164..1451 /gene="LOC111066347" /note="Immunoglobulin domain; Region: Ig; cl11960" /db_xref="CDD:472250" misc_feature 1218..1232 /gene="LOC111066347" /note="Ig strand B [structural motif]; Region: Ig strand B" /db_xref="CDD:409353" misc_feature 1263..1277 /gene="LOC111066347" /note="Ig strand C [structural motif]; Region: Ig strand C" /db_xref="CDD:409353" misc_feature 1338..1352 /gene="LOC111066347" /note="Ig strand E [structural motif]; Region: Ig strand E" /db_xref="CDD:409353" misc_feature 1380..1397 /gene="LOC111066347" /note="Ig strand F [structural motif]; Region: Ig strand F" /db_xref="CDD:409353" misc_feature 1422..1433 /gene="LOC111066347" /note="Ig strand G [structural motif]; Region: Ig strand G" /db_xref="CDD:409353" misc_feature 1461..1736 /gene="LOC111066347" /note="Immunoglobulin domain; Region: Ig; cl11960" /db_xref="CDD:472250" misc_feature 1515..1529 /gene="LOC111066347" /note="Ig strand B [structural motif]; Region: Ig strand B" /db_xref="CDD:409543" misc_feature 1554..1568 /gene="LOC111066347" /note="Ig strand C [structural motif]; Region: Ig strand C" /db_xref="CDD:409543" misc_feature 1632..1643 /gene="LOC111066347" /note="Ig strand E [structural motif]; Region: Ig strand E" /db_xref="CDD:409543" misc_feature 1671..1688 /gene="LOC111066347" /note="Ig strand F [structural motif]; Region: Ig strand F" /db_xref="CDD:409543" misc_feature 1710..1721 /gene="LOC111066347" /note="Ig strand G [structural motif]; Region: Ig strand G" /db_xref="CDD:409543" misc_feature 1929..2192 /gene="LOC111066347" /note="Immunoglobulin I-set domain; Region: I-set; pfam07679" /db_xref="CDD:400151" misc_feature 1980..1994 /gene="LOC111066347" /note="Ig strand B [structural motif]; Region: Ig strand B" /db_xref="CDD:409562" misc_feature 2019..2033 /gene="LOC111066347" /note="Ig strand C [structural motif]; Region: Ig strand C" /db_xref="CDD:409562" misc_feature 2091..2102 /gene="LOC111066347" /note="Ig strand E [structural motif]; Region: Ig strand E" /db_xref="CDD:409562" misc_feature 2130..2147 /gene="LOC111066347" /note="Ig strand F [structural motif]; Region: Ig strand F" /db_xref="CDD:409562" misc_feature 2169..2180 /gene="LOC111066347" /note="Ig strand G [structural motif]; Region: Ig strand G" /db_xref="CDD:409562" misc_feature 2205..2474 /gene="LOC111066347" /note="Immunoglobulin I-set domain; Region: I-set; pfam07679" /db_xref="CDD:400151" misc_feature 2253..2267 /gene="LOC111066347" /note="Ig strand B [structural motif]; Region: Ig strand B" /db_xref="CDD:409544" misc_feature 2298..2312 /gene="LOC111066347" /note="Ig strand C [structural motif]; Region: Ig strand C" /db_xref="CDD:409544" misc_feature 2370..2384 /gene="LOC111066347" /note="Ig strand E [structural motif]; Region: Ig strand E" /db_xref="CDD:409544" misc_feature <2412..>3197 /gene="LOC111066347" /note="Fibronectin type 3 domain [General function prediction only]; Region: FN3; COG3401" /db_xref="CDD:442628" misc_feature 2412..2429 /gene="LOC111066347" /note="Ig strand F [structural motif]; Region: Ig strand F" /db_xref="CDD:409544" misc_feature 2451..2462 /gene="LOC111066347" /note="Ig strand G [structural motif]; Region: Ig strand G" /db_xref="CDD:409544" misc_feature 2484..2807 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature order(2484..2486,2727..2729,2772..2774) /gene="LOC111066347" /note="Interdomain contacts [active]" /db_xref="CDD:238020" misc_feature order(2775..2780,2784..2789) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" misc_feature 3141..3455 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature order(3420..3425,3429..3434) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" misc_feature order(3468..3470,3663..3665,3708..3710) /gene="LOC111066347" /note="Interdomain contacts [active]" /db_xref="CDD:238020" misc_feature 3471..3722 /gene="LOC111066347" /note="Fibronectin type III domain; Region: fn3; pfam00041" /db_xref="CDD:394996" misc_feature 3762..4031 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature order(3762..3764,3960..3962,4005..4007) /gene="LOC111066347" /note="Interdomain contacts [active]" /db_xref="CDD:238020" misc_feature order(4008..4013,4017..4022) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" misc_feature 4068..4334 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature 4389..4646 /gene="LOC111066347" /note="Fibronectin type III domain; Region: fn3; pfam00041" /db_xref="CDD:394996" misc_feature 4608..6263 /gene="LOC111066347" /note="Fibronectin type 3 domain [General function prediction only]; Region: FN3; COG3401" /db_xref="CDD:442628" misc_feature order(4632..4637,4641..4646) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" misc_feature 6216..6536 /gene="LOC111066347" /note="Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all...; Region: FN3; cd00063" /db_xref="CDD:238020" misc_feature order(6216..6218,6468..6470,6513..6515) /gene="LOC111066347" /note="Interdomain contacts [active]" /db_xref="CDD:238020" misc_feature order(6516..6521,6525..6530) /gene="LOC111066347" /note="Cytokine receptor motif [active]" /db_xref="CDD:238020" ORIGIN 1 cctgcgcgca taacggttgt tcttgccgac tacgtcgcat cgtcgtcgtc attttcgtcg 61 tcgctcgtag ttcgtggctc tcggtcgctc caacttgctg cggcgcgtgt ttcgaaacac 121 agcagaccag acgaggaaga agtctagaga agcaaatggt tcaataaaga aacaacattg 181 aaagcaggcg caagagaaag tcaacatgaa ttaagaaaaa cgcaagaaaa ttcaaaaaca 241 attaaaattt aatcaaacaa aacaacaaaa aaaccaaaaa caaaattcag aagcaggcga 301 aaagaatcaa cgatatcgaa gaaaaacagc cgcaagagag agagccgaaa atgaagagag 361 accagcggcg atcttcagcg tcgtcgctgc gtcgccgtcg tcgttggtgc gtcgacgtca 421 acgaaaaagg aacacgaatg tggctcaaaa tttcgctgtc gcagccgctg gaagcgtcgc 481 tgtttgtgct ggcagcgctg ctgctgctca atgcggacag ctgctcatgt tacgcggatg 541 ccaatccgca gcaacaacag cagctggtcc agcagcagca gcaacaactt caggcgccac 601 gttttaccac acacccatcg tcatcgggct cgattgtgag cgagggcagc accaagatcc 661 tacagtgcca tgctttgggt tatccacagc cgacatatcg ttggctgaag gacggcaagt 721 ccgtgggcga gttctcatcg agtcagttct atcggttcca cagcacacgg cgcgaggatg 781 cgggcagcta tcagtgcatt gccaagaacg atgccggatc catattcagc gagaagagcg 841 acgttgtagt ggcctacatg ggcatctttg agaacgtcac cgagggacgc ctaactgttg 901 tgagcggaca tccggccatc ttcgatatgc cggccattga gtcggtgcca acgccatcgg 961 tgctgtggca gtcggcggac gggtcgctca actacgacat caagtacgcc ttcacccagg 1021 ccaatcagct gattatactg agcgtggacg agaacgatcg gaggggctac cgggcgcggg 1081 cgatcaacac gcagctgggc aaggaggaga tcagcgcgtt cgttcatctg aatgtcagtg 1141 gcgatccgta catagaggtg gcacccgaga taattgtacg gccgcaggat gtcaaggtca 1201 agaccggcac tggcgtcctc gagctgcagt gcatcgccaa tgcgcgaccc ctgcacgaac 1261 tggagacgat ttggctgaag gacggcctcg ccgtggacac gaccggcgtg cggcacaccc 1321 tcaacgatcc ctggaaccgc accctggccc tcctgcaagc caacagctcg cactccggcg 1381 agtacacctg tcaggtgcgc ctgcgcagcg gtggctatcc aacggtcacc gcctcagccc 1441 gcgtccaaat tctcgagccg cccgtcttct tcacgcccat gcgagcggaa acctttggtg 1501 aatttggcgg ccaggtgcag ctgccctgcg atgtggtggg cgagcccacg ccccaagttg 1561 aatggttccg gaatgcggag tctgtcgagg cgaatgtgca aagcggaaga tactcactgg 1621 gagaggataa tacgctgata attaagaaac taatactgga tgattcggcc atgtttcagt 1681 gcctggcccg aaatgaggcc ggcgagaact cagccagcac ctggctgcgc gtcaaaacag 1741 aaacagaaac agaagcagcc aacagccgca tcaagcgctt ggcccagcca cgcatcttaa 1801 gagtaagagc ttcgcatggt ggttcgggaa cggtcgcggg atcggtcacg ggatcgggat 1861 caggatctgg ttccacctcc aactcccatc agcagggtag acgcaagcag tttcgctttg 1921 cctcagcgcc ggtctttgag cagccgcccc agaatgtgac cgccctggat ggcaaggatg 1981 cgacgatctc ctgtcgggcc attggctcgc ccaatcccaa tgttacctgg atctacaatg 2041 aaacccaact ggttgagata tccagtcgcg ttcagatact cgaatcgggt gatttactca 2101 tctcgaatat ccgtgccacg gacgcgggac tctacatctg tgtgcgggcc aacgaggcgg 2161 gcagcgtcaa gggcgaggcc ttgctaagcg tgttagtgcg gacacaaatc atacagccgc 2221 cagtggacac catcgtgctg ctgggcctga ccgcgacact gcagtgcaag gtgtccagcg 2281 acccgagcgt gccctacaac atcgactggt accgggaggg ccaaatggcg cccatcagca 2341 actcgcagcg gattggagtg caggcggacg ggcagctgga gatccaggcg gtgcgggcca 2401 gcgatgtggg cagctattcg tgcgtggtta catcgccggg cggcaatgag acacggtcgg 2461 cccgtctcag tgtcatcgag ctgcccttcc cgcccagcaa cgtgcgggtg gagcgtctgc 2521 cagagccgca gcagcgcagc atcaatgtgt cctggacgcc cggattcgat ggcaacagtc 2581 caatctccaa atttattatc cagcgacgtg aggtctctga attggaaaaa ttcgtaggtc 2641 cagttccaga tccccttctc aattggatca ccgaactgag caacgtatcg gccaatcagc 2701 ggtggatgct gctggagaac ctcaaggcgg ccaccgtcta tcagtttcgt gtcagtgccg 2761 tcaatcgggt cggcgagggc tccccctcgg agcccagcaa tgttgttgag ctgccccaag 2821 aagctccttc gggaccgcct gtgggctttg tgggctcggc acggtccatg tccgagatca 2881 ttacgcagtg gcagccgccg caggaggagc atcgcaacgg acagatcctg ggctacattc 2941 tgcgctatcg cctgttcggg tacaacaatg tgccgtggtc ctaccagaac atcaccaacg 3001 aggcgcagcg caactttctg atccaggagc tgatcacgtg gaaggactac atcgtgcaga 3061 ttgcggcctt caacaacatg ggcgtgggcg tctacacgga gggggccaag atcaagacca 3121 aggagggtgt gcccgaggca ccgcccacca acgtcagggt gaaggccctc aactcgacgg 3181 cggcgcagat cacgtggaag ccgccgaatc cgcagcagat caacggcatc aaccagggct 3241 acaagatcca ggcatggcag cgacggcagc tcgatgggga ggagcgggac atggagcggc 3301 gcatgatgac ggtgccgccc agcctgatcg atccactggc cgagcagacg acggtgctcg 3361 gtggcctgga caagttcgcc aagttcaatg tgaccgtact ctgcttcacc gatcccggtg 3421 acggtgtggc cagccagctg gtgccggtgg agactttgga cgacgtgccc gacgagataa 3481 cggccctgca ctttgacgat gtctccgatc ggtccgtcaa agtgctgtgg gcgccgccgc 3541 gcttcgccaa cggcatcctc accggctaca cggtgcgcta ccaggtcaag gatcgccccg 3601 agacgatgaa gttcttcaac ctgaccgccg acgacaacga gctgacggtg aaccagctgc 3661 aggcgacgac ccactactgg ttcgaggtgt gcgcctggac gcgggtgggc agcgggccgc 3721 ccaagacggc gacgatccaa tcgggcgtgg agccggtgct gccgcatgcg cccaccacac 3781 tggccctgtc caacatcgaa gcgttttcgg tggtgctgca gttcacgccc ggcttcgacg 3841 gcaactcgag catcaccaag tggaaggtgg aggcgcagac ggcccgcaac atgacctggt 3901 tcacgctctg tgaaatcagc gatcccgatg cggagaccct caccgtgacc ggcctgatgc 3961 ccttcaccca gtaccggctg cggctgagcg ccaccaatgt ggtgggcagc tcccggccct 4021 cggaccccac caaggacttt caaaccattc aggccaagcc gatgcacgcc cccttcaatg 4081 tgacggtacg cgcaatgagc gccctgcagc tgcgcgtccg ctggataccg ctgcagcaga 4141 tggagtggtt cggcaatccg cgcggctaca atgtcaccta ccggcaaatg gagcgcaccg 4201 gcaagccctc caagcacccg ccccgctccg tgatgatcga ggatcacacg gccaactcgc 4261 atgtgctcga ggggctcgag gagtggaccc tctacgaagt gatcatgaac gcctgcaacg 4321 atgtgggctg ctcgctggac agcggcctgg ccatggagcg caccagggaa gcggtgccca 4381 gctacggccc gctgcatgtg gaggcgaacg ccacctcctc gacgacggtg gtggtgcgct 4441 ggggcgagat accgccccac catcgcaacg gccagatcga tggctacaag gtgtactacg 4501 cggccaccga gcgcggcatg caggtgctct acaagacgat acccaacaac agctccttca 4561 ccaccaccct caccgagctg cagaagtttg tggtgtacca cgtccaggtg ctggcctaca 4621 cgcggctcgg caacggcgcc ctcagcaccc cgcccatccg ggtgcagacg ttcgaggaca 4681 cgcccggatc accgtccaat gtgagcttcc cggacgtcac cttctcgatg gcgcgcatca 4741 tctgggacgt gccgatggac cccaatggcg agatactcgc ctaccaggtc acctacacgc 4801 tcaacggaag cgccaatctg aactacagcc gcgagtttcc gccctcggat cgcaccttcc 4861 gggccaccgg cctgatgccc gagcgctact acagcttcag cgtgacggcc cagacacgcc 4921 tcggctgggg caaaacggcc tcggtgctgg tgtacacgac caacaacagg gaccgtccgc 4981 aggcaccgtc cgggccgcag gtgtcgcgca gccagatcca ggcccatcag atcaccttca 5041 actggacgcc gggccgcgac gggttcgccc cgctgcgata ctacacggtc gagatgcggg 5101 agaacgaggg ccgctggcag ccgctgcccg agcgcgtcga tcccacactc agctcgtaca 5161 cggccctggg tctgcgtccg tacaccacct accagttccg cattcaggcg accaacgatc 5221 tgggcccgtc ggcgttcagc cgagagagca ttgtggtgcg caccctgccc gccgccccag 5281 cggtgggtgt ggggggactg aaggtggtgc ccataacgac cacctcggtg cgggtgcagt 5341 ggggggcgct ggagacgggc atgtggaacg gcgacgcggc caccggggga taccgcatac 5401 tgtaccagca gctgtcggac ttcgcaccgg ccctgcagtc gaccccgaag acggatgtga 5461 tgggcatcaa tgagaacagc gtggtgctgt ccgatctgca gcaggaccgc aactacgaga 5521 tcgtggtgct gccattcaat tcgcagggac cgggcccggc cacaccgccg accgccgtct 5581 atgtgggcga ggcggtgccc actggagagc cgcggggcgt ggatgccacg gccatttcca 5641 gcacggaggt gcgcctgagc tggaagccac cgaagcagag cagccagaac ggagagatac 5701 tcggctacaa gatattctat ttggtgacgt ggtcgccgca ggccctcgag ccgggccgca 5761 aattcgagga ggaaatcgaa gtggtctcgg ccacggccac atcgcacagc ctggtctttc 5821 tcgataagtt caccgagtac cgcatccagt tgctggcctt caatccggcc ggagacgggc 5881 cgaggtccgc ccccgtcact gcgaagacga tgccgggcgt gcccagtgcc ccgctcaatc 5941 tgcgcttttc ggacatcaca atgcagagcc tggaggtgac ctgggacccg cccaagctgc 6001 tcaacggcga gattgttggc tatctggtca cctacgagac caccgaggag aacgaaaagt 6061 tcagcaagca ggtgaagcag aaggtgtcca acaccacgct gcgtgtgcag aatctggagg 6121 aggaggtcac ctacaccttc accgtgcgcg cccagacgaa cgactatgga ccggcggtga 6181 gcgcgaatgt gaccacaggc ccccaggatg gctccccggt ggcaccgcgc gatctcacac 6241 tcacaaagac actgtccagc gttgaggtac attgggtcaa tggaccctcc ggccggggcc 6301 ccatactggg ctacctcatc gaggccaaga agcgagaaaa tggagagccc tcatttattt 6361 ctaatagacc tccctatctt cgcttagacg actcccgctg gactaagatt gagcagtcca 6421 gaaagggtac catgaaggag tttaccgtca gctaccacat cctgatgcca tcgacggcgt 6481 atttgttccg ggtaattgct tacaataagt atggcatatc gttccctgtt tactcgaagg 6541 actcgatact gacgccctcg aagctgcatc tggagtacgg ctatctgcag cacaagccct 6601 tctacaggca gacctggttc atggtctccc tggcggccac ctcgatcgtc atcattgtca 6661 tggtcattgc ggtgctctgt gtgaagagca agagctacaa gtacaagcag gaggcacaaa 6721 agacgctgga ggagtccatg gccatgtcga ttgatgagcg ccaggagctg gccctggagc 6781 tgtatcgttc gcgtcacggc gtcggcaccg gcaccctgaa cagcgttgga acattgcgca 6841 gcggaacttt gggaaccctc ggccgtaagt ccaccaaccg acaccagccg gtgagtgtgc 6901 atttgggtaa gagtccaccg cgaccctcgc ccgcatcggt ggcgtaccac agcgatgagg 6961 agagtctcaa gtgctacgac gagaatcccg acgacagcag tgttacggaa aagccatccg 7021 aggtgagcag ctcggaggca tcccagcact cggagagcga gaacgagagc gtgaggagcg 7081 atccgcactc gttcgtcaat cactatgcga atgtgaatga ctcgctgcgg cagtcctgga 7141 agaagaccaa gcccgtgcgc aactactcga gctacacaga ctccgagccg gagggcagtg 7201 cagtgatgag tctcaatggt ggccagatta ttgtcaataa tatggccaga tcgagggcac 7261 cactgcccgg cttctcgtca tttgtctgac aatcaaccga attctaagat ctatgccgtg 7321 gtagcagcag caccgtcatc cgcgagacat ttgtctgaat tattttggaa acgataacgg 7381 aaaacggaaa aacggaggct gaagctgaaa ccggagctgg agttgcagtg gggagcgttc 7441 taacgagttc gacacggatg tagcgagtgg gctaaactgc ctgcctgcct gcaactgttc 7501 tgtctggctc tccctggatc ttcgtagctg tccggcgagg cgctgctaca tggatattta 7561 tcgtagtt