Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

Drosophila melanogaster polyhomeotic distal (ph-d), transcript


LOCUS       NM_001103395            6020 bp    mRNA    linear   INV 26-DEC-2023
            variant B, mRNA.
ACCESSION   NM_001103395
VERSION     NM_001103395.2
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 6020)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            High-Throughput Data
  JOURNAL   G3 (Bethesda) 5 (8), 1721-1736 (2015)
   PUBMED   26109357
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 6020)
  AUTHORS   Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
            Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
            Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: The
            Rule-Benders
  JOURNAL   G3 (Bethesda) 5 (8), 1737-1749 (2015)
   PUBMED   26109356
  REMARK    Publication Status: Online-Only
REFERENCE   3  (bases 1 to 6020)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
            Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
            Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
            Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
            Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
            Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
  TITLE     The Release 6 reference sequence of the Drosophila melanogaster
            genome
  JOURNAL   Genome Res 25 (3), 445-458 (2015)
   PUBMED   25589440
REFERENCE   4  (bases 1 to 6020)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
            Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
            Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
  TITLE     Sequence finishing and mapping of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1625-1628 (2007)
   PUBMED   17569867
REFERENCE   5  (bases 1 to 6020)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  TITLE     The Release 5.1 annotation of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1586-1591 (2007)
   PUBMED   17569856
  REMARK    Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE   6  (bases 1 to 6020)
  AUTHORS   Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
            Ashburner,M. and Anxolabehere,D.
  TITLE     Combined evidence annotation of transposable elements in genome
            sequences
  JOURNAL   PLoS Comput Biol 1 (2), 166-175 (2005)
   PUBMED   16110336
REFERENCE   7  (bases 1 to 6020)
  AUTHORS   Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
            Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
            Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
            Celniker,S.E., Rubin,G.M. and Karpen,G.H.
  TITLE     Heterochromatic sequences in a Drosophila whole-genome shotgun
            assembly
  JOURNAL   Genome Biol 3 (12), RESEARCH0085 (2002)
   PUBMED   12537574
REFERENCE   8  (bases 1 to 6020)
  AUTHORS   Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
            Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
            Rubin,G.M., Ashburner,M. and Celniker,S.E.
  TITLE     The transposable elements of the Drosophila melanogaster
            euchromatin: a genomics perspective
  JOURNAL   Genome Biol 3 (12), RESEARCH0084 (2002)
   PUBMED   12537573
REFERENCE   9  (bases 1 to 6020)
  AUTHORS   Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
            Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
            Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
            Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
            Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
            Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
            Rubin,G.M. and Lewis,S.E.
  TITLE     Annotation of the Drosophila melanogaster euchromatic genome: a
            systematic review
  JOURNAL   Genome Biol 3 (12), RESEARCH0083 (2002)
   PUBMED   12537572
REFERENCE   10 (bases 1 to 6020)
  AUTHORS   Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
            Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
            Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
            Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
            Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
            Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
            Gibbs,R.A. and Rubin,G.M.
  TITLE     Finishing a whole-genome shotgun: release 3 of the Drosophila
            melanogaster euchromatic genome sequence
  JOURNAL   Genome Biol 3 (12), RESEARCH0079 (2002)
   PUBMED   12537568
REFERENCE   11 (bases 1 to 6020)
  AUTHORS   Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
            Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
            George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
            Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
            Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
            Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
            Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
            Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
            Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
            Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
            Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
            Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
            Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
            Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
            Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
            Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
            Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
            Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
            Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
            Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
            Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
            Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
            Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
            McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
            Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
            Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
            Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
            Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
            Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
            Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
            Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
            Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
            Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
            Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
            Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
            Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
            Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
  TITLE     The genome sequence of Drosophila melanogaster
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
   PUBMED   10731132
REFERENCE   12 (bases 1 to 6020)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
            Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
            Smith,E., Yu,C. and Rubin,G.
  CONSRTM   Berkeley Drosophila Genome Project
  TITLE     Drosophila melanogaster release 4 sequence
  JOURNAL   Unpublished
REFERENCE   13 (bases 1 to 6020)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   14 (bases 1 to 6020)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   15 (bases 1 to 6020)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   16 (bases 1 to 6020)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (20-APR-2020) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   17 (bases 1 to 6020)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2019) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   18 (bases 1 to 6020)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   19 (bases 1 to 6020)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   20 (bases 1 to 6020)
  AUTHORS   Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
            Park,S., Svirskas,R. and Karpen,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
            Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   21 (bases 1 to 6020)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
            Svirskas,R. and Rubin,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   22 (bases 1 to 6020)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  CONSRTM   Drosophila Heterochromatin Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
            Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
            Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE   23 (bases 1 to 6020)
  AUTHORS   Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by FlyBase. This
            record is derived from an annotated genomic sequence (NC_004354).
            
            On Jul 15, 2014 this sequence version replaced NM_001103395.1.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider :: FlyBase
            Annotation Status   :: Full annotation
            Annotation Version  :: Release 6.54
            URL                 :: http://flybase.org
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..6020
                     /organism="Drosophila melanogaster"
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="X"
                     /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
                     bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
     gene            1..6020
                     /gene="ph-d"
                     /locus_tag="Dmel_CG3895"
                     /gene_synonym="CG3895; Dmel\CG3895; DROZFP;
                     EG:BACN25G24.3; ph; Ph; PH; ph-D; Ph-d; PH-d; phd; PhD;
                     PHD; phm; ph[d]; ph[D]"
                     /note="polyhomeotic distal"
                     /map="2D2-2D2"
                     /db_xref="FLYBASE:FBgn0004860"
                     /db_xref="GeneID:44889"
     CDS             802..4881
                     /gene="ph-d"
                     /locus_tag="Dmel_CG3895"
                     /gene_synonym="CG3895; Dmel\CG3895; DROZFP;
                     EG:BACN25G24.3; ph; Ph; PH; ph-D; Ph-d; PH-d; phd; PhD;
                     PHD; phm; ph[d]; ph[D]"
                     /note="CG3895 gene product from transcript CG3895-RB;
                     CG3895-PB; ph-d-PB; polyhomeotic; polyhomeotic-distal;
                     polyhomeotic proximal"
                     /codon_start=1
                     /product="polyhomeotic distal, isoform B"
                     /protein_id="NP_001096865.1"
                     /db_xref="FLYBASE:FBpp0111756"
                     /db_xref="GeneID:44889"
                     /db_xref="FLYBASE:FBgn0004860"
                     /translation="MEVQQQLQLQQQLSEANGGGAASAGAGGAASPANSQQSQQQQHS
                     TAISTMSPMQLAAATGGVGGDWTQGRTVQLMQPSTSFLYPQMIVSGNLLHPGGLGQQP
                     IQVITAGKPFQGNGPQMLTTTTQNAKQMIGGQAGFAGGNYATCIPSNHNQSPQTVLIS
                     PVNVISHSPQQQQNLLQSMAAAAQQQQLTQQQQQQLNQQQQQLNQQQQQQQLTAALAK
                     VGVDAQGKLAQKVVQKVTTTSSTVQAATGPGSTGSTQTQQVQQVQQQQQQTTQTTQQC
                     VQVSQSTLPVGVGGQSVQTAQLLNAGQAQQMQIPWFWQNAAGLQPFGSNQIILRNQPD
                     GTQGMFIQQQPATQTLQTQQNQIIQCNVTQTPTKARTQLDALAPKQQQQQQQVGTTNQ
                     TQQQQLAVATAQLQQQQQQLTAAALQRPGAPVMPHNGTQVRPASSVSTQTAQNQSLLK
                     AKMRNKQQPVRPALATLKTEIGQVAGQNKVVGHLTTVQQQQQATNLQQVVNAAGNKMV
                     VMSTTGTPITLQNGQTLHAATAAGVDKQQQQLQLFQKQHILQQQMLQQQIAAIQMQQQ
                     QAAVQAQQQQQQQVSQQQQVNAQQQQAVAQQQQAVAQAQQQQREQQQQVAQAQAQHQQ
                     ALANATQQILQVAPNQFITSHQQQQQQQLHNQLIQQQLQQQAQAQVQAQVQAQAQQQQ
                     QQREQQQNIIQQIVVQQSTGATSQQQQQQPQQQSGQLQLSSVPFSVSPSMTAEDIAGI
                     TSSALQEALSVSGAIFQTTKPITCSSSTLPTSSVVTITSQSSTPLVTSSTVASMQQAQ
                     TQGTQIHQHQQLISATIAGGSQQQQQQQQLGLPSLTPTTPSPTTNPILAMTSMMNATV
                     GHLSTAPPVSVSSTAVTPSSGQLVTLSSASSGGGAGFPATPTKETPSKGPTATLVPID
                     SPKTPVSGKDTCTTPKSSTPATVSASVEASSSTGEALSNGDASDRSSTPSKGATTPTS
                     KQSNAAVQPPSSTIPNSVSGKEEPKLTTCGSLTSATSTSTTTTITNGIGVARTTASTA
                     VSTASTTTTSSGTFTTSCTSTTTTTTSSISNGSKDLPKAMIKPNVLTHVIDGFIIQEA
                     NEPFPVTRQRYADKDVSDEPPKKKATMQEDIKLSGIASAPGSDMVACEQCGKMEHKAK
                     LKRKRYCSPGCSRQAKNGIGGVGSGETNGLGTGGIVGVDAMALVDRLDEAMAEEKMQT
                     ESYQTVSDALPIQAATPEVPPISMPVLAAMSTSSPLSLPLTLPLPIAIAPTVSLPVVS
                     AGVVAPVLAIPSSNINGSDRPPISSWSVEEVSNFIRELPGCQDYVDDFIQQEIDGQAL
                     LLLKENHLVNAMGMKLGPALKIVAKVESIKEVPPGDVKD"
     misc_feature    <3361..3807
                     /gene="ph-d"
                     /locus_tag="Dmel_CG3895"
                     /gene_synonym="CG3895; Dmel\CG3895; DROZFP;
                     EG:BACN25G24.3; ph; Ph; PH; ph-D; Ph-d; PH-d; phd; PhD;
                     PHD; phm; ph[d]; ph[D]"
                     /note="large tegument protein UL36; Provisional; Region:
                     PHA03247"
                     /db_xref="CDD:223021"
     misc_feature    4648..4854
                     /gene="ph-d"
                     /locus_tag="Dmel_CG3895"
                     /gene_synonym="CG3895; Dmel\CG3895; DROZFP;
                     EG:BACN25G24.3; ph; Ph; PH; ph-D; Ph-d; PH-d; phd; PhD;
                     PHD; phm; ph[d]; ph[D]"
                     /note="SAM domain of Ph (polyhomeotic) proteins of
                     Polycomb group; Region: SAM_Ph1,2,3; cd09577"
                     /db_xref="CDD:188976"
     misc_feature    order(4702..4707,4807..4812,4819..4824,4831..4833)
                     /gene="ph-d"
                     /locus_tag="Dmel_CG3895"
                     /gene_synonym="CG3895; Dmel\CG3895; DROZFP;
                     EG:BACN25G24.3; ph; Ph; PH; ph-D; Ph-d; PH-d; phd; PhD;
                     PHD; phm; ph[d]; ph[D]"
                     /note="oligomer interface EH [polypeptide binding]; other
                     site"
                     /db_xref="CDD:188976"
     misc_feature    order(4735..4749,4753..4758,4765..4770,4780..4782,
                     4795..4797)
                     /gene="ph-d"
                     /locus_tag="Dmel_CG3895"
                     /gene_synonym="CG3895; Dmel\CG3895; DROZFP;
                     EG:BACN25G24.3; ph; Ph; PH; ph-D; Ph-d; PH-d; phd; PhD;
                     PHD; phm; ph[d]; ph[D]"
                     /note="oligomer interface ML [polypeptide binding]; other
                     site"
                     /db_xref="CDD:188976"
ORIGIN      
        1 cttgcgctct gctctctttg cgttcgtgtt ggtgtggtgt cgatgtgtcg cataccgcat
       61 gtgtattgaa cgggggaaaa aaaaagcgcc gacgcgacgc acacacaccc taccaccgtt
      121 tcgtagtatt tatttatata tttatttttg gcgatcaatg caaatcggtg ctgactataa
      181 gtgattagtg aaaacaatta atgctgtgcg gcgtctaaag ttgcgtcgtt ttgaatgtta
      241 ggcatgtaca ggtgcctcca aatatacaat aatacagtac agcaaggaaa gcaaaaatga
      301 aaacgcagtg gggacaccga aagtgaatca gcaacaacaa tacgtacacc accaccttcc
      361 ccggaagcca ccacatctgt aaaggtcaac tccaccactc gcgtggaccc ccagcggcca
      421 ctaaggtgcc tggaaacact cgcccagaag gcaggcatca gcttcgacga ggactttgcc
      481 aagagtccat cccaatcgcc cagctctaag gcagcacgtg ggtcagtcgg aacgccatca
      541 atcagacggc gccacccact actaccgctc agcagcagat cgccaagcgc acccgactca
      601 aagacaaccg gccgcaaact ggagaagtca cagagtccag ctcaacaggt ggcggccgcc
      661 accaatgtgc cgctgcagat ctcccccgag cagctgcagc agttatatgc aaacaatccc
      721 tacgccattc aggtgaagca agagtttccc acgcacacga ccagtggcag tggaactgaa
      781 ctaaagcatg caaccaacat tatggaagtt cagcagcagt tgcagctgca gcagcagctg
      841 tcggaagcca acggtggagg agcagcctcg gccggagccg gaggagcagc tagtccggcc
      901 aactcgcagc aaagccagca acagcagcac tccacagcca tcagcaccat gtcgccgatg
      961 caattggcag cggccactgg aggagttggc ggggattgga cacagggaag gacggtgcag
     1021 ctaatgcaac cctccaccag tttcctgtat ccccaaatga ttgtgtcggg aaatctgttg
     1081 catccaggag gcctcggtca gcagccaatc caggtgatca ccgccggcaa gccattccaa
     1141 ggcaacggcc cccagatgct taccaccacg actcaaaacg ccaagcaaat gatcggtggc
     1201 caagcgggat tcgctggcgg aaattacgcg acctgcattc cttcgaacca caatcagtcg
     1261 cctcagacgg tgctcatctc gccggtgaat gtcatctccc actcgccaca gcagcagcaa
     1321 aaccttctgc aatcaatggc cgccgcagct caacaacagc aacttaccca acagcagcag
     1381 caacagctta accagcagca acagcagctc aaccagcagc agcaacagca acagctgact
     1441 gccgctctgg ccaaggtggg agtggatgcg cagggcaagc tggcccagaa agtggttcag
     1501 aaggtgacca ccaccagcag cacggtgcag gcggcgacgg gtcctggatc tactgggtca
     1561 acacagaccc agcaggtgca gcaggttcag caacagcagc agcagaccac ccaaaccact
     1621 cagcagtgcg tgcaggtttc acagtcgact ttgccagtcg gtgtgggtgg acagtctgtt
     1681 cagactgccc aacttttgaa cgctggccaa gcgcaacaaa tgcagatccc ttggttctgg
     1741 cagaatgcgg cgggcctgca acccttcggc tccaatcaga tcatcctgcg aaaccagcca
     1801 gacggaaccc aaggcatgtt cattcaacag caaccggcga cgcagacttt gcagacccag
     1861 caaaaccaga ttattcaatg caacgtgacg cagacgccca ctaaggcacg cactcaactg
     1921 gatgcacttg ctcccaagca gcaacagcag cagcagcagg ttggcactac caaccagacg
     1981 cagcagcagc aactagcggt ggctactgcc cagttgcagc aacagcagca gcaactcact
     2041 gcagccgctc tgcagcgacc aggagcccct gtcatgcccc acaatggaac tcaagtgcgt
     2101 ccggccagtt ccgtatccac acagactgcc cagaaccaga gcctgctgaa ggccaaaatg
     2161 cgcaacaagc agcagccggt gcgccccgct ttagccacat tgaaaaccga aatcggtcaa
     2221 gtcgcaggac aaaataaggt agtaggccac ctgaccaccg tgcagcagca gcaacaggcg
     2281 acgaatctcc agcaggtggt taatgcggcg ggcaacaaaa tggttgtgat gagcacaacg
     2341 ggcactccga tcaccctgca gaatggacag acccttcatg cagccactgc ggcaggagtc
     2401 gacaagcagc aacagcagct acaactgttt cagaaacagc acatccttca acaacaaatg
     2461 ttgcaacagc aaattgctgc cattcaaatg cagcagcagc aagcggctgt tcaggcccag
     2521 caacaacagc agcaacaggt ctctcagcag cagcaggtta acgcccagca acagcaagcg
     2581 gtggcgcaac aacaacaggc agtcgcgcag gctcagcaac agcagaggga gcaacagcag
     2641 caagttgccc aagcccaggc gcagcatcaa caggctctcg cgaatgccac tcagcaaatc
     2701 cttcaggtgg cgccaaatca attcatcacg tcccaccagc aacagcagca gcagcaactt
     2761 cacaaccaac tgatacagca gcagctacag caacaggcgc aggcacaagt tcaagcccaa
     2821 gtgcaggctc aagcgcaaca gcaacaacag cagcgagagc agcagcagaa tattatccag
     2881 cagattgtgg tgcaacagtc aactggagcg acttctcaac agcagcagca gcaaccgcaa
     2941 cagcagtctg gacagttgca gcttagcagc gtgccgtttt cggtttcacc atcgatgacg
     3001 gcggaagata ttgccggaat aacatccagt gccctacaag aagctctctc ggtgtctggc
     3061 gccatctttc agacaaccaa accgattact tgcagttcct ctacgctccc cacaagcagt
     3121 gtggtcacaa ttaccagcca gagcagcact cctctggtca ccagcagtac ggtggccagt
     3181 atgcagcagg ctcagacgca aggtactcag atccatcaac atcagcagct aatcagcgcc
     3241 actattgccg gagggtctca acagcagcag cagcagcagc aactgggact accttcactt
     3301 acacccacca cgccctcacc tacaacaaat cccattctgg ccatgacctc gatgatgaat
     3361 gccaccgtgg gtcacctatc cactgcccca cccgttagtg tttctagcac cgctgtcact
     3421 ccatcgtctg gacagctggt cacactaagc agtgctagta gcggtggagg agcaggcttt
     3481 ccagccacgc ccaccaaaga gacaccttca aaagggccca ccgcaaccct ggtgcccatt
     3541 gattcgccca agactcctgt atcaggaaag gacacctgca ctacccccaa atcatctact
     3601 cctgccactg ttagcgcatc cgtagaggcc agtagttcca caggcgaagc cctgtccaat
     3661 ggagatgcct cagataggtc ttccacgccg tcaaagggcg ctaccactcc caccagcaag
     3721 caaagcaatg cagcagtgca gccaccgagt agcaccattc ccaacagtgt cagtgggaaa
     3781 gaagagccga agctgacaac ctgcggcagt ttaacgtccg caacatcaac atcaaccacg
     3841 acaacgatca ccaatgggat tggagtagcc agaacgacag ccagcacggc tgtctcaacc
     3901 gctagcacaa ccactaccag ttctggcacc tttaccacaa gttgcaccag cacaaccaca
     3961 accaccacgt cgagtatcag taatggatcg aaggatctcc ccaaggcgat gattaagccg
     4021 aacgtcttaa ctcacgtcat cgatggcttc atcatccagg aggccaacga gccatttccc
     4081 gtcaccagac agcgatatgc agacaaagac gtcagcgatg agccgccaaa gaaaaaggca
     4141 accatgcagg aggacatcaa gctaagtgga atagcatcag ctccaggctc ggatatggtt
     4201 gcttgcgagc agtgtggaaa gatggagcac aaagcaaagc tgaaacggaa gcgctactgt
     4261 tcgccaggat gctcgaggca ggcaaagaac ggcatcggtg gagttggatc aggagagacg
     4321 aacggcctgg ggacaggtgg tatagttggg gtggacgcaa tggcattggt ggacagactg
     4381 gatgaagcca tggctgagga gaagatgcag acagaatcat accagacagt atcggacgct
     4441 ttgccaattc aagcggctac gccggaggtc ccaccgattt cgatgccagt gctggcggct
     4501 atgtcgacat cttcaccact ttcgttgccc ctgacattgc ccttgccaat tgcaatagct
     4561 cccactgtgt cactgccagt ggtttcagct ggagtggttg cgccggtcct agcaatacca
     4621 tcctcgaata taaatggatc cgatcgccct cccatcagca gttggagtgt ggaagaagtt
     4681 agcaatttca tccgagaact gcctggttgc caggactacg tggacgactt tatacagcag
     4741 gagatcgacg gccaagcgct gctgctgctc aaagaaaacc atttggttaa cgccatgggc
     4801 atgaagctgg gtccagctct caaaattgtg gccaaggtgg agtccattaa ggaggtcccg
     4861 ccaggcgatg taaaggatta aaaacacgca acaaagtcaa ggtttcaaaa gaccgctttc
     4921 tttagtttcc cgcgtttcac ctaaatgtaa cgacatttac ttcgtgagcg aatgtgatca
     4981 gacagaacaa agtgaatcac gttccgactc accacttctc acagcacgta caccctaatc
     5041 atcagctaca tgcacctaat ctacaaaggg aactccccag agagcaaccg gtgcctggaa
     5101 tcactgactc tgttgcgagg cccatcccat ccagaatcta tgcgagaaat ccataattag
     5161 gtgatgtagt tgtttttccc gcacatgacg aaagcaagga atatgaccct ccttcggcgc
     5221 cgaagctgca gctagtttaa gcaccccgat cagaccccaa gattgtggca atagtagagt
     5281 ccatgactct gtgcgacgaa aaggacgggg aggttatagg accgctggcg ccaccgccgt
     5341 tggatcaaca gtcttcagca gtctaccaga gtctgaggat aggagcgggc agtatcctga
     5401 gcttctattt gaccatgctg atctcccacg acctttgcat ggacctttgc ctacctgtgg
     5461 accgggtcca accgggtctg ccacccaagc tgaatttgat tcacttgaag cggactttca
     5521 atccttttgt acgtaatcta aacatagcaa gaggggtaat atcgtagctc aaatgacagg
     5581 acgccacatg tatgatagag catagactgc catcccagtc atcaattcat cactttgtat
     5641 agaagttcac aattactcat aatcactaac gtatttattt gaacgttgtt aatcatttca
     5701 caacacccta tgcaaagaac atgaaaaaat catttgaact agaggtaatc tgggattata
     5761 tttacgtagt tagaattaaa acttaagcct gaagtaattc ctaagtgaaa ctgaattgaa
     5821 tccaactaaa cctttaattt attgataaac ttaggtcgta atgagtaacg tctgaagaat
     5881 ttttgccttt attgcaaggt tccgaaaacg ctgtccaact tttatagatc aaatcagtgg
     5941 tcggagttta attttttttt tagttttacg aaaccttttt ttcctactgc tgaagtaaat
     6001 aaaactgtaa agtgattaat