Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]
LOCUS XM_017153659 10050 bp mRNA linear INV 09-DEC-2024 (N), mRNA. ACCESSION XM_017153659 VERSION XM_017153659.3 DBLINK BioProject: PRJNA1194641 KEYWORDS RefSeq. SOURCE Drosophila takahashii ORGANISM Drosophila takahashii Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora. COMMENT MODEL REFSEQ: This record is predicted by automated computational analysis. This record is derived from a genomic sequence (NC_091683) annotated using gene prediction method: Gnomon. Also see: Documentation of NCBI's Annotation Process On Dec 9, 2024 this sequence version replaced XM_017153659.2. ##Genome-Annotation-Data-START## Annotation Provider :: NCBI RefSeq Annotation Status :: Full annotation Annotation Name :: GCF_030179915.1-RS_2024_12 Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline Annotation Software Version :: 10.3 Annotation Method :: Gnomon; cmsearch; tRNAscan-SE Features Annotated :: Gene; mRNA; CDS; ncRNA Annotation Date :: 12/07/2024 ##Genome-Annotation-Data-END## FEATURES Location/Qualifiers source 1..10050 /organism="Drosophila takahashii" /mol_type="mRNA" /strain="IR98-3 E-12201" /db_xref="taxon:29030" /chromosome="X" /sex="female" /tissue_type="Whole fly" /dev_stage="Adult fly" /collected_by="Originally obtained from EHIME-Fly" gene 1..10050 /gene="N" /note="neurogenic locus Notch protein; Derived by automated computational analysis using gene prediction method: Gnomon. Supporting evidence includes similarity to: 80 Proteins" /db_xref="GeneID:108065618" CDS 697..8820 /gene="N" /codon_start=1 /product="neurogenic locus Notch protein" /protein_id="XP_017009148.2" /db_xref="GeneID:108065618" /translation="MQSPRSRRRSRAPNTWICCWINKMHAVASLSASLPLLLLTLAFA NLPSTVRGTDTALVPLSCTSVVCQNGGTCVTQLNGKIYCACDSHYVGDYCQHRNPCNS MRCQNGGTCQVTFRNGRPGISCKCPLGFDESLCEIAVPNACDHVTCHNGGTCQLKTLQ EYTCACANGYTGERCETKNLCASSPCRNGGTCTALAGSSSFTCSCPPGFNGDTCSFDI EECLSNPCKYGGTCVNTHGSYQCMCPTGYTGKDCDTKYKPCSPSPCQNGGVCRSNGLS YDCKCPKGFEGKNCEQNFDDCLGHLCQNGGTCVDGISDYSCRCPPNFTGKFCQDDVDE CAQRDHPVCQNGATCTNTHGSYSCICVNGWAGLDCSNNTDDCKQAACFYGATCIDGVG SFYCQCTKGKTGLLCHLDDACTSNPCHADAICDTSPINGSYACSCATGYKGVDCSEDI DECDQGSPCEHNGICVNTPGSYRCNCSQGFTGPRCETNINECESHPCQNEGSCLDDPG TFRCVCMPGFTGTQCEIDIDECQSNPCLNDGTCHDKINGFKCSCALGFTGARCQINID DCQSQPCRNRGICHDSIAGYSCECPPGYTGTSCEININDCDSNPCHRGKCIDDVNSFK CLCDPGYTGYICQKQINECESNPCQFDGHCQDRVGSYYCQCQAGTSGKNCEVNVNECH SNPCNNGATCIDGINSYKCQCVPGFTGVHCEKNVDECISSPCANNGVCIDQVNGYKCE CPRGFYDAHCLSDVDECASNPCVNEGRCEDSINEFICHCPPGYTGKRCELDIDECSSN PCQHGGTCYDKLNAFSCQCMPGYTGQKCETNIDDCVQNPCGNGGTCIDKVNGYKCVCK VPFTGRDCESKMDPCASNRCKNEAKCTPSSNFLDFSCTCKLGYTGRYCDEDIDECSLS SPCRNGASCLNVPGSYRCLCTKGYEGRDCAINTDDCASFPCQNGGTCLDGIGDYSCLC VDGFDGKHCETDINECLSQPCQNGATCSQYVNSYTCTCPLGFSGINCQTNDEDCTESS CLNGGSCIDGINGYNCSCLAGFSGANCQYKLNKCDSNPCLNGGTCHEQRDEYTCHCPS GFTGKQCSEYVDWCGQSPCENGATCSQMKHQFSCKCSAGWTGKLCDVQTISCQDAADR KGLSLRQLCNNGTCKDYGNSHVCYCSQGYAGSYCQKEIDECQSQPCQNGGTCLDLIGA YECKCRQGFQGQNCELNIDDCAPNPCQNGGTCHDRVMNFSCSCPPGTVGIICEINKDD CKPGACHNNGSCIDRVGGFECVCQPGFVGARCEGDINECLSNPCSNAGTLDCVQLVNN YHCNCRPGHMGRHCEHKVDFCAQSPCQNGGNCNIRQSGHHCICNNGFYGKNCELSGQD CDSNPCRVGNCVVADEGFGYRCECPRGTLGEHCEIDTLDECSPNPCAQGAACEDLLGD FECFCPSKWKGKRCDIYDVSYPGWNGGNDRYAADLEQQRAMCEKRGCNEKQTNGICDS ECNTYACNFDGNDCSLGINPWANCTANECWNKFKNGKCNEECNNAACHYDGHDCERKL KSCDSLFDAYCQKHYGDGFCDYGCNNAECSWDGLDCENKTQSPVLAEGAISVVMLMNE EQFREIQAQFLRNMSHMLRTTVRLKKDAFGRDMITKWRDNQPVAEIDNTDFARKNKIL YTQQVYQTGIQIQIEIDNRKCTECFTHAVEAAEFLAATAAKHQLRNDFQIHSVGVVKN PGDDDSGEPPANVKYVITGIILVIIALAFAGMVLSTQRKRAHGVTWFPEGFRAPAAVM SRRRRDPHGQEMRNLNKQVGLQSQGVGQTGAHWSDDESDMPLPKRQRSAPISVINQQG LGNNGGYASDHTMVSEYEEADQRVWSQAHLDVADVRAIMTPPAHQDGGIGLSGISNKH DVDARGPHGQTPLMIAVRGGGGLDTGEDIENNEDSTAQVISDLLAQGAELNATMDKTG ETSLHLAARFARADAAKRLLDAGADANCQDNTGRTPLHAAVAADAMGVFQILLRNRAT NLNARMHDGTTPLILAARLAIEGMVEDLITADADINAADNSGKTALHWAAAVNNTEAV NILLMHHANRDAQDDKDETPLFLAAREGSYEACKALLDNFANREITDHMDRLPRDVAS ERLHHDIVRLLDEHVPRSPQMLSMTPQAMIGSPPPGQQQQQQQLITQPTVISAGNGNG NGNASGKQSSQSAKQKAAKKAKLIEGSPDNGLDATGGGSLRRKASSKKTSAASKKAAT LNGQLTGAAQAQAQAAQAAAAAAAAAAAAAAMTMSHELEGSPVGVGMGGNLPSPYDTS SMYSNAMAAPLANGNPNTGAKQPPSYEDCIKNAQSLQSLQGNGLDMIKLDNYGYSMGS PFQQELLNGQGLGMNGNGQRNGVVGGVLPGGLCGMGGLTGAGNGNSHEQGLSPPYSNQ SPPHSVQSSLALSPHAYLGSPSPAKSRPSLPTSPTHIQAMRHATQQKQFGGSNLNSLL GGANGGGVVGGGGGNGGQGPQNSPVSLGIISPTGSDMGIMLAPPQSSKGSAIMQTLSP QQQQQQQQQQQQQQQHQQQQQQQQQQQQQQQQQQLGGLEFGSAGLDLNGFCGSPDSFH SGQMNPPSIQSSMSGSSPSTNMLSPSSQHNQQAFYQYLTPPSQHSGGHTPQHLVQTLD SYPTPSPESPGHWSSSSPRSNSDWSEGVQSPAANNLYISGGHQANKGSEAIYI" misc_feature 1231..1338 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature 1345..1452 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(1345..1347,1354..1356,1396..1398) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 1474..1569 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature 1573..1683 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(1573..1575,1582..1584,1624..1626) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 1687..1803 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(1687..1689,1696..1698,1747..1749) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 2041..2154 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(2041..2043,2050..2052,2095..2097) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 2158..2268 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(2158..2160,2167..2169,2209..2211) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 2272..2382 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(2272..2274,2281..2283,2323..2325) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 2386..2496 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(2386..2388,2395..2397,2437..2439) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 2500..2607 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(2500..2502,2509..2511,2548..2550) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 2614..2721 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature 2725..2835 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(2725..2727,2734..2736,2776..2778) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 2839..2946 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(2839..2841,2848..2850,2890..2892) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 2953..3063 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(2953..2955,2962..2964,3004..3006) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 3067..3177 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(3067..3069,3076..3078,3118..3120) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 3181..3291 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(3181..3183,3190..3192,3232..3234) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 3415..3525 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(3415..3417,3424..3426,3469..3471) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 3532..3642 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(3532..3534,3541..3543,3583..3585) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 3646..3756 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(3646..3648,3655..3657,3697..3699) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 3775..3870 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature 3886..3978 /gene="N" /note="EGF-like domain; Region: EGF; pfam00008" /db_xref="CDD:394967" misc_feature 4246..4353 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature 4357..4467 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(4357..4359,4366..4368,4408..4410) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 4471..4581 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(4471..4473,4480..4482,4522..4524) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 4585..4701 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature order(4585..4587,4594..4596,4642..4644) /gene="N" /note="Ca2+ binding site [ion binding]; other site" /db_xref="CDD:238011" misc_feature 4708..4815 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature 4945..5046 /gene="N" /note="Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the...; Region: EGF_CA; cd00054" /db_xref="CDD:238011" misc_feature 5110..5220 /gene="N" /note="Domain found in Notch and Lin-12; Region: NL; smart00004" /db_xref="CDD:197463" misc_feature 5239..5343 /gene="N" /note="LNR domain; Region: Notch; pfam00066" /db_xref="CDD:459658" misc_feature 5377..5463 /gene="N" /note="LNR domain; Region: Notch; pfam00066" /db_xref="CDD:459658" misc_feature 5476..5640 /gene="N" /note="NOTCH protein; Region: NOD; pfam06816" /db_xref="CDD:462014" misc_feature 5839..6102 /gene="N" /note="juxtamembrane and transmembrane (JMTM) domain found in Drosophila melanogaster neurogenic locus Notch protein (dNotch) and similar proteins; Region: JMTM_dNotch; cd21706" /db_xref="CDD:411989" misc_feature order(5842..5850,5857..5865,5923..5928,5932..5934, 5938..5964,5971..5976,5983..5991) /gene="N" /note="putative polypeptide substrate binding site [polypeptide binding]; other site" /db_xref="CDD:411989" misc_feature 6382..6660 /gene="N" /note="Ankyrin repeats (3 copies); Region: Ank_2; pfam12796" /db_xref="CDD:463710" misc_feature 6418..6561 /gene="N" /note="ANK repeat [structural motif]; Region: ANK repeat" /db_xref="CDD:293786" misc_feature 6523..7134 /gene="N" /note="Ankyrin repeat [Signal transduction mechanisms]; Region: ANKYR; COG0666" /db_xref="CDD:440430" misc_feature 6568..6660 /gene="N" /note="ANK repeat [structural motif]; Region: ANK repeat" /db_xref="CDD:293786" misc_feature 6667..6762 /gene="N" /note="ANK repeat [structural motif]; Region: ANK repeat" /db_xref="CDD:293786" misc_feature order(6766..6768,6772..6774,6784..6789,6796..6804, 6808..6813,6823..6825,6832..6834,6859..6861,6865..6867, 6871..6873,6883..6888,6895..6903,6907..6912,6922..6924, 6931..6933,6958..6960,6964..6966,6970..6972,6982..6987, 6994..7002,7006..7011,7021..7023,7030..7032,7057..7059) /gene="N" /note="oligomer interface [polypeptide binding]; other site" /db_xref="CDD:293786" misc_feature 6766..6861 /gene="N" /note="ANK repeat [structural motif]; Region: ANK repeat" /db_xref="CDD:293786" misc_feature 6865..6960 /gene="N" /note="ANK repeat [structural motif]; Region: ANK repeat" /db_xref="CDD:293786" misc_feature 6964..7059 /gene="N" /note="ANK repeat [structural motif]; Region: ANK repeat" /db_xref="CDD:293786" polyA_site 10050 /gene="N" /experiment="COORDINATES: polyA evidence [ECO:0006239]" ORIGIN 1 gtttttttga gtggacgaaa cggttgtgag agcggacgat cgcaaagtca gaggaacctg 61 gaaaacacac agcacagtta tttttttttt tgaaagtgag tgcaaggaaa accgaaacaa 121 aatagaggat acaaaataaa aaacaacaag aaaaaatacg ggaatggcgc tacaaaaaac 181 gccgctaaaa aaggagttct tgtggaaggg ataacaacac aaaaagaaga agaaagaaac 241 aaggattttt cgaaaagtgt atctgacccg agtcgcgtgt gtgtgagtga gaaggcgagt 301 gagtcgaacg agggagagag caatacgcaa gcgtgcggcg tcggtttgaa tttgaatttt 361 tgctcgatcc tcacaatata aaaaaggcaa caaaaaagag cgctgttttt ttttgccatt 421 ttttttttaa tgtttcaaaa cggaaaatgt cgccgtcgtc gggagagtgc ctcctcttag 481 ttgatcaaat aaagctttaa gatctaaagt tttgaaacta tcaaagcgaa aaaccgcccg 541 caaaatagaa ataaaaaact ttaaaatttc catatcctcg aaataaaata catcctcaaa 601 gagagagacc aaagaggaga gcccgcaaag aaaccgagca gagaagagga agaggtgaaa 661 aactggaaaa gtggttaact ggaaaactgg attacaatgc aatcgccgcg cagccgacgt 721 cgcagtcgcg caccaaacac ttggatttgt tgttggatta acaaaatgca cgccgttgcg 781 tcgctgtcgg cgtcgctgcc tctgctgctg ctgacgctgg cgtttgcgaa tttgccaagc 841 accgtccgcg gaactgatac cgcgttggtg cccctttcct gcacaagtgt ggtctgccag 901 aatggcggaa cctgcgtcac acaactcaat ggcaaaatct actgcgcttg cgattcacac 961 tatgtcggcg actactgtca acaccgaaat ccgtgcaatt cgatgcgttg ccagaatggc 1021 ggcacctgtc aggtgacctt tcgcaacggt cgtcctggca tctcctgcaa gtgtcccctg 1081 ggcttcgacg agtccttgtg cgaaatagcg gtgccgaatg cctgcgatca tgtgacctgt 1141 cacaacggag gcacctgtca gctgaagacc ctgcaggagt acacgtgtgc gtgtgccaat 1201 ggctatacag gtgagcgatg cgagacgaag aatctgtgcg ccagttcgcc ctgccggaat 1261 ggaggcacct gcaccgcctt ggccggaagc agcagcttca cctgctcctg tccaccggga 1321 ttcaatggtg acacctgctc cttcgacatc gaggagtgcc tgtcgaatcc ctgcaagtac 1381 ggcggcacct gcgtcaacac ccacggatcc taccaatgca tgtgtcccac gggctacacg 1441 ggcaaggatt gcgacaccaa gtacaagccc tgctcaccat cgccctgcca gaatggcgga 1501 gtttgccgat cgaacggtct gtcctacgac tgcaagtgcc ccaaaggttt cgagggcaag 1561 aactgcgagc agaacttcga cgactgcctg ggtcacctgt gccaaaacgg aggcacctgc 1621 gtggacggga tctcggacta cagctgccgg tgtccgccca acttcacggg aaagttctgc 1681 caggacgacg tggacgagtg cgcccagcgg gatcatccgg tgtgccagaa tggagccacc 1741 tgtacgaata cccatggctc ctacagctgc atctgcgtga atggctgggc gggattggat 1801 tgcagcaaca atacggatga ctgcaagcag gcagcgtgtt tctatggagc cacctgcatc 1861 gatggcgtcg gtagcttcta ctgccagtgc accaagggaa agacggggtt gctctgccac 1921 ctggacgacg cctgcacctc gaatccctgc cacgcggatg ccatttgcga tacgagtccg 1981 atcaacggat cctatgcctg ttcctgtgcc accggctaca agggcgtcga ttgctccgag 2041 gacattgacg aatgcgatca gggatccccc tgcgagcaca acggcatttg tgtgaatacc 2101 ccgggtagct atcgctgcaa ttgctcccag ggattcacgg gtccgcgctg cgaaacgaac 2161 atcaacgaat gcgaatcgca tccttgccag aatgagggat cctgcctgga tgatcccggc 2221 acctttcgtt gcgtctgcat gccgggattc acgggcaccc agtgcgagat tgacattgat 2281 gaatgccaat cgaatccctg cctgaatgac ggcacctgcc acgacaagat caacggcttc 2341 aagtgcagtt gtgctttggg tttcaccgga gccaggtgtc aaatcaacat agacgactgc 2401 cagtcgcaac cgtgcaggaa tcgtggaatt tgccatgatt ccatagctgg ttatagctgc 2461 gaatgtccac cgggatacac aggaaccagt tgcgaaatca acatcaatga ctgcgactcg 2521 aatccctgcc acaggggcaa gtgcatcgac gatgtgaata gcttcaagtg tttgtgtgat 2581 ccaggatata cgggatacat ttgccagaag cagatcaacg aatgcgaatc gaatccctgc 2641 cagtttgatg gccactgcca ggatcgcgtg ggcagctact attgccaatg tcaggcgggc 2701 acaagtggca agaattgcga ggtgaacgtg aacgaatgcc acagcaatcc ctgcaacaat 2761 ggggcaacct gcattgatgg tattaattcc tataaatgcc aatgtgtccc aggattcacg 2821 ggcgtccatt gcgagaagaa tgtggacgag tgcatcagca gtccgtgtgc gaataatggt 2881 gtatgcatcg atcaggttaa tgggtataaa tgcgaatgcc cacggggatt ctacgatgcc 2941 cattgcctca gtgacgtcga cgaatgcgcc tcgaatcctt gcgtaaacga aggtcgctgc 3001 gaggacagca tcaacgagtt catctgccac tgcccaccag gatacacggg caagcgatgc 3061 gaactggaca tcgatgagtg cagcagcaat ccctgccagc atggaggaac ttgctacgac 3121 aaactgaacg cctttagttg ccaatgtatg cctggttata cgggtcagaa atgtgagacc 3181 aacatagatg actgcgtcca gaatccctgt ggaaatggag gcacttgcat agacaaggtg 3241 aatggttaca agtgcgtctg caaggtgccc ttcaccggac gcgattgcga gagcaaaatg 3301 gatccctgtg ccagtaatcg ctgcaagaat gaggcaaagt gtacgccgag ttcgaatttc 3361 ctggactttt cctgcacctg caaactgggc tacacgggac gctactgcga tgaggatatc 3421 gatgagtgct ccttgagttc tccctgccgc aatggagcca gttgtttgaa tgttcctggt 3481 tcgtatagat gcctctgcac caagggttac gagggtcgcg attgcgccat caacacggat 3541 gactgcgcct cgtttccctg ccaaaatgga ggaacctgtc tggatggaat cggtgactac 3601 agctgcctgt gcgtggatgg tttcgatggc aagcactgcg agacggatat caatgagtgt 3661 ttgagtcaac cctgccaaaa tggagccacc tgcagtcagt atgtgaatag ttatacctgc 3721 acttgtccct tgggattctc ggggattaat tgccagacga atgatgaaga ttgcacagag 3781 agttcgtgcc tcaatggagg aagctgcatt gatggaatca atgggtataa ctgtagctgc 3841 ttagcgggat tctccggagc caattgccag tacaagctga acaagtgcga ctcgaatccc 3901 tgcctgaatg gaggaacctg tcatgagcag agggatgagt acacgtgcca ctgccccagt 3961 ggattcaccg gaaagcagtg ctccgagtac gtggactggt gtggccaatc accgtgtgaa 4021 aatggagcca cctgcagcca gatgaagcat cagtttagct gcaaatgctc cgccggctgg 4081 acgggcaagc tgtgcgatgt gcagaccatt tcctgccagg atgcagcgga taggaaaggc 4141 ctgagtctcc gccagctgtg caataatgga acgtgcaagg attacgggaa tagccatgtg 4201 tgctactgtt cgcaaggata tgcgggtagc tattgccaaa aggagattga cgagtgccag 4261 tcgcaacctt gtcagaatgg aggaacctgc ttggatttga taggagccta tgaatgcaag 4321 tgtcgccagg gtttccaggg gcaaaattgc gaactgaaca tagatgattg tgccccgaat 4381 ccctgccaga atggaggaac ctgccacgat cgcgtgatga actttagttg cagctgtcca 4441 ccgggaacag tgggcattat ttgcgagatc aacaaggatg actgcaaacc gggagcctgt 4501 cacaacaatg gtagctgcat tgatcgcgtg ggaggctttg aatgcgtctg ccagccggga 4561 tttgtgggtg cccgttgcga gggagacatc aatgagtgcc tgagtaatcc ctgctcgaat 4621 gcggggactc tcgactgcgt tcagttggtg aataactatc actgcaactg cagacccggt 4681 cacatgggtc gtcactgcga gcacaaggtg gatttctgtg cccagagtcc ttgccagaat 4741 ggcggtaact gcaatatccg gcagagtggc caccactgca tctgcaataa tggtttctac 4801 gggaagaact gcgagttatc tggccaggat tgcgattcga atccctgccg cgtgggcaac 4861 tgtgtggtgg ccgacgaggg attcggctat agatgtgaat gtccgcgagg aaccctaggt 4921 gaacactgcg agatcgacac gctggacgag tgctcgccga atccctgcgc ccagggagcc 4981 gcctgcgagg atctcctggg ggatttcgag tgcttctgcc cgagcaaatg gaagggcaag 5041 cgttgcgata tctacgatgt gagctatccg ggttggaatg gtggcaacga tcgatatgca 5101 gccgatttgg agcaacagcg agcgatgtgc gagaagcgtg gatgcaacga gaagcagacg 5161 aatggcattt gcgattcgga gtgcaatacg tatgcctgta atttcgatgg caacgactgc 5221 tcgctgggca tcaatccgtg ggccaattgt acggctaacg agtgctggaa taagttcaag 5281 aatggcaaat gcaacgagga gtgcaacaat gcggcatgcc actacgatgg gcatgattgc 5341 gagaggaaac tgaagagttg cgatagcctt ttcgatgcct actgccaaaa gcactatggc 5401 gatggcttct gcgactatgg atgcaacaat gccgagtgca gttgggatgg tttggattgc 5461 gagaataaaa cacagtcgcc tgttttggca gagggagcaa tctcggtggt aatgctgatg 5521 aacgaggagc aattccgtga gattcaggcg cagtttttga ggaacatgag ccacatgctg 5581 agaactacag tgaggctaaa gaaggatgcc tttggcaggg atatgataac caaatggagg 5641 gataatcaac cggtagccga gattgataac actgattttg ccaggaaaaa caagatatta 5701 tatacgcagc aggtttatca aacaggcata cagatacaga ttgaaattga taacaggaaa 5761 tgcactgaat gctttaccca tgcggtggag gcagctgaat tcctggctgc cacggcggca 5821 aagcatcagt tgcgcaacga ttttcagatc cacagcgtgg gcgtggtcaa gaatccgggg 5881 gatgatgata gcggagaacc tcctgctaat gttaagtatg tcatcacagg gattatcctg 5941 gtcatcatcg ccttggcctt tgctggaatg gtcttgagca cgcagagaaa gcgcgcacat 6001 ggcgtcacct ggttccccga aggattccgc gccccagcgg ctgtgatgtc ccgcaggaga 6061 agagatcccc atggtcagga gatgaggaac ctcaacaagc aagtgggtct gcagtcgcag 6121 ggcgtcggcc aaacaggagc ccattggtcg gatgatgagt ccgatatgcc gctgcccaag 6181 cgacaaagga gtgctcccat ctctgtgata aaccagcagg gattgggcaa caatggtggc 6241 tatgccagcg atcacacgat ggtcagcgag tacgaggagg cggaccagag ggtctggtcg 6301 caggcccacc tggatgtggc cgatgtgagg gccataatga cgccgccggc gcatcaggat 6361 gggggaattg gattatctgg catctccaac aaacacgatg tggatgccag aggaccacat 6421 ggtcagacgc ctctgatgat tgctgtgcgc ggaggcggag gcctggatac cggcgaggat 6481 atcgagaaca acgaggacag cacggcgcag gtgatctccg atctgctggc ccagggagcc 6541 gaactgaatg ccaccatgga caaaacgggc gagacgtcgc tgcatttggc ggcgcgattc 6601 gctcgagcgg atgccgccaa gcggctgctg gacgccggag cggatgccaa ttgccaggac 6661 aacacgggaa ggacgccact ccacgcggcc gtggctgccg atgcaatggg tgtgttccag 6721 atactcctgc gcaatagggc caccaatttg aatgccagga tgcatgatgg caccactcct 6781 ttgatactcg ccgctcgcct ggccatcgag ggaatggtgg aggatcttat caccgccgat 6841 gcggatatta atgcagcgga taattcagga aagaccgctc tccattgggc agcggcagtg 6901 aataatacag aggcggtgaa tatactcctc atgcatcatg ccaatcggga tgcccaggat 6961 gacaaggatg agacgccgct gtttttggcc gcccgcgagg gaagctacga ggcctgcaag 7021 gcgctgctgg ataactttgc caatcgcgag atcaccgatc acatggaccg actgccacgc 7081 gatgtggcca gcgagcgact gcatcacgat atcgttaggt tgctggatga gcatgttcct 7141 cgatcaccgc aaatgctcag catgacgccg caggcgatga ttggatcacc gccgccggga 7201 caacagcagc agcagcagca gctgatcacc cagcccacgg tgatttccgc cggaaacgga 7261 aatggcaatg gcaacgccag cggcaagcag agcagccagt cggccaagca gaaggcggcc 7321 aagaaggcca agttgatcga gggcagcccc gacaacggac tcgatgccac cggcggtgga 7381 agtctgcgtc gcaaggccag ttcaaaaaag acaagtgcgg cctcaaaaaa ggccgccacc 7441 ttgaatggcc agctgactgg agcggcccag gcccaagccc aagcggccca agctgcggcg 7501 gcggcggcgg cagcagcggc ggcagcagct gccatgacca tgtcccacga actggaggga 7561 tcgccggtgg gcgtgggcat gggcggcaat ctgcccagcc cctacgacac cagctccatg 7621 tactcgaacg cgatggccgc tccgctggcg aacggcaatc cgaacacggg cgccaagcag 7681 ccgccgagct atgaggattg catcaagaat gcgcaatcgc tgcagtcgct gcagggcaac 7741 ggcctggaca tgataaagct ggacaactac ggctactcga tgggctcgcc atttcagcag 7801 gagctgctca acggccaagg actcgggatg aatgggaacg ggcagcggaa cggcgtcgtc 7861 ggcggcgtcc tgcccggcgg tctgtgcgga atgggcggcc tgaccggagc cggaaacgga 7921 aatagccacg agcagggact cagtccgccg tactcgaacc aatcgccgcc gcattcggtg 7981 cagagcagcc tggcgctttc gccccacgct tatttgggct ctccatcgcc ggccaagtcg 8041 cgacccagtt tgcccacctc gccaactcac atccaggcga tgaggcatgc cacacagcag 8101 aagcagttcg gtggcagcaa tctgaacagc ctgctgggcg gtgccaacgg agggggcgtg 8161 gtgggcggag gaggaggcaa cggtggacag gggccgcaga actcgccggt gagcttgggc 8221 atcatctcgc cgacgggcag cgatatgggc atcatgctcg ccccgccgca atcctcgaaa 8281 ggcagtgcaa taatgcagac gctatccccc cagcaacagc aacagcagca gcagcaacag 8341 cagcaacaac agcagcatca gcagcagcaa cagcaacagc agcaacagca gcagcagcaa 8401 cagcagcagc aactcggagg cctggagttc ggttcagcgg gcttggacct gaatggattt 8461 tgtggatctc cggactcatt tcactcgggt caaatgaatc cgccctcgat acaaagttca 8521 atgtccggct cgtcgccgtc gaccaacatg ctgtcgccgt cgtcgcagca caaccagcag 8581 gccttctacc agtacctaac gccaccgagc cagcattccg gcggccacac gccgcagcat 8641 ttggtccaga cgctggacag ctacccgacg ccctcgccgg agtcgcccgg ccactggtcc 8701 tcctcctcgc cgcgatcgaa ttccgattgg agcgagggcg tccagtcgcc ggcggccaat 8761 aatctctaca tttccggcgg ccatcaggcc aacaagggat ccgaggccat ctacatttga 8821 ccgtgaaccc cgaaaccgat gatatggaaa tggagatggt ttgccaagga taacagcaga 8881 ggggactctt aaagagtccg cggctacgtt ctatctaaaa tgctatataa taataataat 8941 aatattaata tgtaattctc taatccctac gattatgtat tttttcaacg cgacttttat 9001 tttttttttt atcgtgtgtg tctataagtt ttagcattta gatcgaaata cgcaattctg 9061 gctgaagaag aagcgtcata aattgtagta acttaaatta tttttatgct tgttgtacag 9121 tgcgcgccga tcgcatatat atatatataa agatatatat atatatatat ctactcatag 9181 atataccagt taagagctcg ttttgtggcg cattttattt tattgtcgta acatcgtcgg 9241 agcttccagt aaaaaagaaa agaaagaaat tataaccagc gagaggagaa actaaactta 9301 gcgcaaattc tgattgatcg atttaacgtt ggtgggacat ttgtagttgt aagcgaaccc 9361 attatatcac taggccataa gtctaggcta agtttttatc gaattctaca tttaattatg 9421 gcccacatgc tatataaata tatacatata tttcgtagtt ctgtaggttt cgaaactgaa 9481 gtgcttatat aaacaaaatt gttagacgta caccgcatca aacctttttt tttacaccaa 9541 gttttttttt ttattattat tattatgact attatgtatt attacatatt gactaagcta 9601 aactgtaaaa agattctcat aactgtttga tttaagttga caaaaaaaaa acaaaggaaa 9661 ttgcaaaaat tatacaaaaa tccccagcga gaaagtataa tgctgattaa aagagaaaaa 9721 aagcacaaaa aacaacaaaa aagacagtaa ccagccaagt tttctatata tataaggccc 9781 cccattatga tctaagttga taaatagata ttttcaaaaa atatacaacc agaaaaagac 9841 actaaaaagt ccacaaatac gattttgccg ttcgatttgc tctatttata atgcgaacgg 9901 tatgctttct ttcgtctata ttttgtgtaa gtttttaatg tgtaaattgt aaaattattt 9961 gttttcctaa cttaaacagc aagacagcaa aacaaaaatg caaacttttt gagataaata 10021 ataaatatta aataattatt ttaaaaacaa