Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

Drosophila melanogaster small ovary (sov), transcript variant A,


LOCUS       NM_132120              10768 bp    mRNA    linear   INV 26-DEC-2023
            mRNA.
ACCESSION   NM_132120
VERSION     NM_132120.2
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 10768)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            High-Throughput Data
  JOURNAL   G3 (Bethesda) 5 (8), 1721-1736 (2015)
   PUBMED   26109357
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 10768)
  AUTHORS   Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
            Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
            Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: The
            Rule-Benders
  JOURNAL   G3 (Bethesda) 5 (8), 1737-1749 (2015)
   PUBMED   26109356
  REMARK    Publication Status: Online-Only
REFERENCE   3  (bases 1 to 10768)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
            Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
            Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
            Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
            Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
            Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
  TITLE     The Release 6 reference sequence of the Drosophila melanogaster
            genome
  JOURNAL   Genome Res 25 (3), 445-458 (2015)
   PUBMED   25589440
REFERENCE   4  (bases 1 to 10768)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
            Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
            Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
  TITLE     Sequence finishing and mapping of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1625-1628 (2007)
   PUBMED   17569867
REFERENCE   5  (bases 1 to 10768)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  TITLE     The Release 5.1 annotation of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1586-1591 (2007)
   PUBMED   17569856
  REMARK    Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE   6  (bases 1 to 10768)
  AUTHORS   Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
            Ashburner,M. and Anxolabehere,D.
  TITLE     Combined evidence annotation of transposable elements in genome
            sequences
  JOURNAL   PLoS Comput Biol 1 (2), 166-175 (2005)
   PUBMED   16110336
REFERENCE   7  (bases 1 to 10768)
  AUTHORS   Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
            Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
            Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
            Celniker,S.E., Rubin,G.M. and Karpen,G.H.
  TITLE     Heterochromatic sequences in a Drosophila whole-genome shotgun
            assembly
  JOURNAL   Genome Biol 3 (12), RESEARCH0085 (2002)
   PUBMED   12537574
REFERENCE   8  (bases 1 to 10768)
  AUTHORS   Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
            Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
            Rubin,G.M., Ashburner,M. and Celniker,S.E.
  TITLE     The transposable elements of the Drosophila melanogaster
            euchromatin: a genomics perspective
  JOURNAL   Genome Biol 3 (12), RESEARCH0084 (2002)
   PUBMED   12537573
REFERENCE   9  (bases 1 to 10768)
  AUTHORS   Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
            Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
            Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
            Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
            Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
            Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
            Rubin,G.M. and Lewis,S.E.
  TITLE     Annotation of the Drosophila melanogaster euchromatic genome: a
            systematic review
  JOURNAL   Genome Biol 3 (12), RESEARCH0083 (2002)
   PUBMED   12537572
REFERENCE   10 (bases 1 to 10768)
  AUTHORS   Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
            Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
            Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
            Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
            Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
            Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
            Gibbs,R.A. and Rubin,G.M.
  TITLE     Finishing a whole-genome shotgun: release 3 of the Drosophila
            melanogaster euchromatic genome sequence
  JOURNAL   Genome Biol 3 (12), RESEARCH0079 (2002)
   PUBMED   12537568
REFERENCE   11 (bases 1 to 10768)
  AUTHORS   Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
            Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
            George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
            Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
            Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
            Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
            Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
            Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
            Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
            Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
            Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
            Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
            Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
            Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
            Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
            Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
            Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
            Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
            Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
            Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
            Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
            Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
            Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
            McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
            Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
            Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
            Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
            Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
            Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
            Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
            Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
            Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
            Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
            Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
            Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
            Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
            Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
  TITLE     The genome sequence of Drosophila melanogaster
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
   PUBMED   10731132
REFERENCE   12 (bases 1 to 10768)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
            Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
            Smith,E., Yu,C. and Rubin,G.
  CONSRTM   Berkeley Drosophila Genome Project
  TITLE     Drosophila melanogaster release 4 sequence
  JOURNAL   Unpublished
REFERENCE   13 (bases 1 to 10768)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   14 (bases 1 to 10768)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   15 (bases 1 to 10768)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   16 (bases 1 to 10768)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (20-APR-2020) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   17 (bases 1 to 10768)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2019) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   18 (bases 1 to 10768)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   19 (bases 1 to 10768)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   20 (bases 1 to 10768)
  AUTHORS   Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
            Park,S., Svirskas,R. and Karpen,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
            Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   21 (bases 1 to 10768)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
            Svirskas,R. and Rubin,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   22 (bases 1 to 10768)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  CONSRTM   Drosophila Heterochromatin Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
            Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
            Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE   23 (bases 1 to 10768)
  AUTHORS   Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by FlyBase. This
            record is derived from an annotated genomic sequence (NC_004354).
            
            On May 8, 2012 this sequence version replaced NM_132120.1.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider :: FlyBase
            Annotation Status   :: Full annotation
            Annotation Version  :: Release 6.54
            URL                 :: http://flybase.org
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..10768
                     /organism="Drosophila melanogaster"
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="X"
                     /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
                     bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
     gene            1..10768
                     /gene="sov"
                     /locus_tag="Dmel_CG14438"
                     /gene_synonym="CG14438; Dmel\CG14438; EM25; fs(1)A1304;
                     fs(1)M105; l(1)6Dc; l(1)EA42; l(1)EM25; Sov"
                     /note="small ovary"
                     /map="6C12-6C13"
                     /db_xref="FLYBASE:FBgn0287725"
                     /db_xref="GeneID:31615"
     CDS             412..10353
                     /gene="sov"
                     /locus_tag="Dmel_CG14438"
                     /gene_synonym="CG14438; Dmel\CG14438; EM25; fs(1)A1304;
                     fs(1)M105; l(1)6Dc; l(1)EA42; l(1)EM25; Sov"
                     /note="CG14438 gene product from transcript CG14438-RA;
                     CG14438-PA; sov-PA; female sterile(1)M105; lethal (1) 6Dc;
                     female sterile (1) A1304"
                     /codon_start=1
                     /product="small ovary, isoform A"
                     /protein_id="NP_572348.1"
                     /db_xref="FLYBASE:FBpp0070922"
                     /db_xref="GeneID:31615"
                     /db_xref="FLYBASE:FBgn0287725"
                     /translation="MEDSEDDVVVVSCDTSMKEKVKAKLVEIRKFVPFIRRVRIDFQD
                     TLSKVQGHRLDALVNLLDREDVSMSSLNKIEVIIDKLRTRFNPRIEIDTGEIIDITEN
                     TEYDSTASSSSGSSAQPPAQRAKSSDKGGLLNGSSLAGRLNASLNPASDATKDHLSTP
                     KQLELERNSKVSCSTSKIKTSSISAKISVFPATSMDLNIPKTTSAKASDEGQRSPAEP
                     RAALQAIVQDTKTPTIPEPTSPAALKHSSLRGSRGFLAVMQKALIEEKKQRASEQKTD
                     KETNGVTQLETNFSRRSYTTSSQSTCRSSEISVRAENPDFKRRSTSLVQHAPLQEASP
                     GQSKKDLPISLSVQGLPALVSASTASPANTLEEARKKLAALKYGLGTTVPSMPPLASN
                     INDPRGRKGINLPETNNNKDNDLGIALQSPPPMRTPSPIPPPPRMKAGTWASFSNVPQ
                     ESAFTGQHAVQRNSVPPGDSRAFGDALAHEPRSFYGTDSREPRDPRIWKSKTSQQQQH
                     QQAPQAQIPPYSSDPRRSISTYSGFEESANRGQTNSKNNNNHNQNQNVTANGSAYIAN
                     WSGNPYGNGPNPRPPFPSNQNGNEYAGGFNGSHSFRGGFRGGHNKRGFGRHNDVPRTY
                     GEHRKAKARAEAEAKAKAEAEAKAKAAAEAKAKAAAEVRQLETEVSREMEAQEKNKQQ
                     KEKPEESEAEKSTIAVTQVPELDTSYRNVNLGVLNKKLDFRIPKKTLPPATTITSTSP
                     VNGNGENPSCPSNSPTSKSCDANQDKDTYKNKDRYLNKAKAKDKVDKGNEVSENNLDK
                     SEKLEKSQDKKANDKENKSDKKEKKRLNREPEKKSKVENPLEIVDSNSVVSEESSENT
                     DNVENEPPLSETNASPVPELATSTQDSQQDQSVSEELDILAKNRRMSGTRIKTPISST
                     GNPALKRRADDDVEDKLENPTKKNCAKWEAKPDKEKSEDDTIDKIKSMKVTKFADVEM
                     KVTEESQSAEEEEITEQKESTEEEEGTEHKKSTEEKDKPPKISKIKIVLTPIAHTTQV
                     VRPNDGFKNNQEKILDNMATDEHDDEEVPGPPAQFLRRIMQRRNSLAPTYMKPMVDKD
                     KIASSSFTYEDLPEQKRGNQNARNLAIIFEKTSDNCSVSTQNIINGKRRTRGCETSFN
                     ETQLSRNIFGMGQINRSRPKATEGKATRAKISVPAKDSTRADDDPIPTPNLKRKRQAI
                     HKETEDDVEMKPKKARLEAQEINGVSVTPDEQQVENNVEVTQKEVEAISSEPLLSSEV
                     EPTRKPRTKPRKNELDKLNDDIAQMYYGEEVMRATSRRACTRRSRTSSHTRTSSQHSR
                     TSSVSRTDSISTVSDISSIIVRNTARRGRGIRSSENGINRATFNASLNAKKPKLCRVR
                     IKRCAALMEMIKDQEKEEQEKKEQKNKEPKKKKVGVQKKPLKSKPKRENSVILNTNPE
                     WHSISKAVIKCVVCSKWVRRSPLSHYMMCHKEHYAARLPPDVLKELRAGRGNRPDYWV
                     SQRGGYTLHFTCPFCQKPLLLCQKGMIEHLIGHMGESRFYCSNCNMPQNRLSRLLDHT
                     ASCGPGAKPLSSKTVCLPMSVHVCHICQFMQYSKENMDRHLTVQHGLTKEELESVERE
                     ELMLCDTTDVPYADSNKDGSAVGPEIQKNKPKSKASAALSNTTKKNKQQKKTNNSRFM
                     KMVKKSVSLLKREREEQDDQNMTQANEGSEVPEIPPPPPEIEPLFVVNECLMTSEMDT
                     DMEEVLEQPVQHMSLMVDEKPVTLLSGATEQLEPSVPDPEPVVPSAQDDGKDVNEDED
                     VDVEAVVDSLQSHTDQTATSMLAEVSLAELAGDVLDGIGSDASDYEMDDNSEQVDTTN
                     KNGYGDDDDDALTDDWVDLETAKRNSKSAKSIFRVFNRFCSRLNKLPRSSRAVPSNGS
                     ENSDGSDNNDDDGDNPDPSELMPTMQPLEPEPEMGDSSTSTGAKSLSERVENVGFQKP
                     SSDEDQNRVAASYYCVQPGCTFLFSNELEGLENHFALEHPLVRWSGKCGMCRQKITAT
                     ETNLRISEELRHMRDVHMKDISTLPPPQSSAVESPAVIESCLNQREPVPESEPDPVPE
                     LPKLRVRRFTGDRLVVDSQAEKSQPVAIVVSDDDNPRNGMLRDLLAADPRPPNQQLDL
                     QAAGLGEFLCAKPDSPSTEPVKQTPVIVGYSSGLGLKIGQVLSRTQISANSRLSPVVN
                     DPLPEKSSAPAAVEENRNRFRCMATNCNFVAHKLMFMREHMKFHSYSFSSTGHLNCAY
                     CSHVAVDVDDYLRHGVIIHDLAPRSELESSTGPPSVTQKIRDMLSQRENGRVPPPTPQ
                     VTLSDVVLGLLECTGYSEDKLYACPQKGCIVRLTDEQLVNHLRYHIRSTHQGSELVKC
                     KFCTKAMHPPALRTHLQQYHARHSIFCGICLATSVNQRIMMYHMSTVHSKAYGRPNAR
                     LAFVSLPVKIDASKKNVESEFYVAVVEQPFGNLQMQDFQRKLFDEMDRRRSGTKTYFR
                     SSEVHILPTQPTFQRPLYCTECPFSTTSRVNMQMHLYEHKDETIREASKLADLIVPAT
                     SSVLTVSASTLVAPPRPGKDSEKPSTSGQSGDAATEQLNPDVPGTHKPIKPPLRYVPP
                     DQRYRCGFLRCSVLCFSESAVRKHMQANHKYSEVVRCPHCKNCQGQFGVDKYFDHLAM
                     HKRHIFQCGACSRHNSRRVIERHIQERHNIQDVDMIVHRHNDSNKTTEARWLKAPKLA
                     RHSLMEYTCNLCLKYFPTTVQIMAHAASVHKRNYQYHCPYCEFGGNLATALIEHILRE
                     HPEREVQPVQIYQRIVCKNKQTLGFYCTTCHEVASSFQKIAMHCDKEHKSRNPVQCPH
                     CIFGHLAERQVVLHIQEKHPHERGLAMVQFERVLNDIPNSISWEIGRPIEVEPEKEIP
                     NNGESAFLPLSQRQVVTEVVDLLDSDDEADEYGEQDDAKIVEFACTHCDGTNTNLPDL
                     RSQHWAREHPDQPFYFRVQPMLLCSECKRFRGNAKALREHLRATHSIRSIVAADIRRP
                     MECAYCDYRYKNRHDLAKHISEIGHLPNDLKHVTDDEIDALMLLSASGSGGAVNEYYQ
                     CGLCSVVMPTKETIVQHGQVEHCKPDERFCFRQLVSPVIYHCSFCMFNSTDELTTLRH
                     MVDHYSRFLVCHFCTRSQPGGFDEYIQHCYTYHRDDIKSFRDVHTFSDLKRYLSQVHY
                     QFQNGLIITKSSLRYTRYKSDKCMLELDAELMAKAQRPPIPRLHIRLKSTGVQMQSPE
                     GADVEKPVSLLRITKRRKTLNPGELLRSFREENEVQPQPPASSTSSGTAPSPAAGSVF
                     NLFKRRNSLVVRPATSNLDQH"
ORIGIN      
        1 caacatcaaa gtacgtggct taatttcttt catttctatg ccttatggcc agctccttcg
       61 agtaacatct acaaaaaaat aagaaaaatt ttctccccga aacgaaaaac cctagttttt
      121 cccaactttc acttacgacg aaaatcatcg cggcgaattg cagacacgta gccttggcag
      181 agtgagcatt ttgctctgct attcgtgtct atcgtattcg ccggcaattg cgcattgtga
      241 gacacgggca tagttccctc acttcggcgt caactttgga agtgttcgga tcggctcgta
      301 tttcgacacc tccgcagaca aggccaaatt gctgagcatc gcacaaaact ccagaaataa
      361 cggaggaaag taatctaaaa agagtcggat acgaacagtg cctgctgcaa aatggaggat
      421 agcgaggacg acgtggtggt ggtgagctgc gatacctcga tgaaggagaa ggtaaaggcc
      481 aagctggtgg agatccgtaa gtttgtgccc tttatccggc gtgtgcgaat agacttccag
      541 gatactttgt ccaaggttca gggtcatcgt ctggatgccc tggttaacct gctggatcgc
      601 gaggacgtat cgatgagctc tcttaacaag atcgaggtga tcattgataa gctaaggacg
      661 cgcttcaatc cgaggatcga aattgacact ggcgaaatca ttgatatcac tgaaaacact
      721 gagtatgatt ccactgcgtc gagttcctcc gggtcaagcg cacaaccacc agctcaaaga
      781 gcgaagtcct cagataaggg tggcctgttg aacggctcat ctctggctgg caggttaaat
      841 gcttcactta accccgcatc agatgctacc aaggaccatc tatcgacacc aaagcaacta
      901 gaattggagc gcaactctaa agtctcttgc tcaacttcaa aaataaagac ttcttccatt
      961 tctgccaaga tttcagtttt tccagccact tcaatggatt tgaacatccc aaaaactacc
     1021 agcgccaagg catcggatga ggggcagcgg tcacctgcag aaccacgtgc cgcccttcaa
     1081 gctatagttc aagatacgaa aacaccaacc attccagaac caacatcacc agcggcgctt
     1141 aagcattcct cccttcgtgg cagtcgtgga tttctggctg tcatgcagaa ggccttaatt
     1201 gaagagaaga agcagcgagc tagcgaacag aaaactgata aagaaactaa cggtgtaacg
     1261 cagctagaga caaacttctc gcggcgatct tatacaacat cgtcacagtc aacctgccgt
     1321 tcttcagaaa tatcggtaag agcagaaaac ccagatttta agcgacgaag cacatcgctt
     1381 gtgcagcatg ctcctctaca ggaggcctcc ccagggcaat ccaaaaaaga cttgcccata
     1441 tccttgtcgg tacagggtct accagctttg gtcagtgcca gcactgcaag tccagcaaat
     1501 acgcttgagg aggcccgcaa gaagctggcg gccttgaaat atggactagg aacaacggta
     1561 ccaagcatgc ctccactggc ctccaatata aatgatccac gcggtagaaa aggaataaac
     1621 ctgcctgaaa ctaacaacaa taaagacaac gacttgggta tagcgctgca atccccgccg
     1681 cctatgcgga ctccctcgcc tattccgccg ccaccaagga tgaaggccgg tacgtgggcc
     1741 tcattttcaa atgttcccca ggaaagcgca tttacaggcc agcatgctgt gcagcgcaac
     1801 tcggtacctc cgggagattc tcgagccttt ggggatgctt tggcacatga accaaggtcc
     1861 ttctatggca ctgattcccg agaaccccga gaccctcgta tctggaagag caagacttcc
     1921 caacagcagc aacatcagca ggcgccacag gcacaaattc ctccgtattc cagtgacccg
     1981 cgtcgttcta taagcactta cagcggtttc gaagagtcag ccaatagagg gcaaactaat
     2041 agtaaaaata ataataatca taatcagaat cagaatgtta ccgccaatgg aagtgcatac
     2101 attgcgaact ggagtggcaa tccatatgga aacggtccaa acccaagacc gccctttcct
     2161 agcaatcaaa acgggaatga atatgctggt ggttttaatg gttcgcatag tttcaggggc
     2221 ggatttcgcg gcggtcacaa caaacggggc tttggacgac acaatgacgt gccacgcaca
     2281 tatggggaac accgcaaagc caaggcccgt gctgaggcgg aggccaaggc taaggctgag
     2341 gcggaggcca aggctaaggc tgcggcggag gccaaggcta aggctgcggc ggaggtacgc
     2401 caattagaaa cggaagtttc gcgggagatg gaagcccaag aaaaaaataa acagcaaaag
     2461 gaaaagccgg aggagagcga ggcggagaag tcgacgatcg cagtgactca ggttccggaa
     2521 ttggacacct cctaccgcaa cgttaatctg ggggtgctaa acaagaagct agactttcga
     2581 ataccgaaga aaaccctccc accggcaaca acaataacct caacaagtcc agtcaatggt
     2641 aatggggaga atccaagctg cccctcaaat tcccccacaa gcaaaagctg tgatgccaac
     2701 caggacaaag atacttataa gaataaagat aggtatttaa ataaggctaa ggctaaagac
     2761 aaggtagata agggcaatga ggtgtcggag aacaatctgg ataagtctga gaagcttgaa
     2821 aaatcgcagg ataagaaggc aaatgacaag gagaacaagt ccgacaaaaa ggagaagaag
     2881 agactgaaca gggagcctga aaagaaatca aaggttgaga accccctcga gattgtggac
     2941 tcgaatagcg tggtcagtga ggaaagctcg gaaaatacag acaatgtgga aaatgaaccg
     3001 cctcttagcg agactaacgc gtctccagtt ccagagctag ctaccagcac tcaggacagt
     3061 caacaggacc agtcagtgag tgaagagttg gacatccttg ccaaaaaccg tagaatgtcc
     3121 ggaactagaa taaagactcc catttcgtct actggaaacc ctgcactaaa gcgacgggca
     3181 gatgatgatg ttgaggacaa gttggaaaat ccgactaaaa agaactgcgc aaagtgggag
     3241 gcaaagcctg acaaggaaaa atcggaggat gataccatcg acaagattaa atccatgaaa
     3301 gtaactaaat tcgctgatgt cgagatgaag gttacagaag aaagccagag tgctgaggag
     3361 gaggagatta ctgagcagaa agagagtact gaggaggagg aaggtactga gcataaaaag
     3421 agtactgaag agaaggataa gccgccaaaa atctccaaga taaaaattgt ccttactccc
     3481 attgcccata caacacaagt ggttcgtcct aatgatggct tcaagaacaa tcaagaaaag
     3541 atcttggaca acatggcaac tgatgagcac gatgatgagg aggtccccgg gcccccagct
     3601 caattcctac gccggattat gcagcgtcgg aactccttgg ctcctacgta tatgaagccg
     3661 atggtggaca aggataagat tgcttcttcc agttttacct acgaggatct gccggagcag
     3721 aagcgcggaa atcagaacgc ccgaaacctg gccatcattt tcgaaaaaac tagtgacaac
     3781 tgcagcgtgt ccactcaaaa cattattaat ggcaaacgtc gcactcgtgg atgtgagacc
     3841 tcttttaacg agacccaatt gagccgaaac atctttggca tgggccagat aaacaggtcg
     3901 cggccaaagg ccaccgaggg taaagctacc agggcaaaga tttctgtccc tgcaaaagat
     3961 tcaacccgtg ctgatgatga tcctattccc actccaaatc ttaagcggaa aaggcaagct
     4021 attcacaagg aaacagagga tgatgtggag atgaagccta agaaggctcg attggaagca
     4081 caggagataa atggggtcag tgttacgcct gatgaacagc aggtagagaa caacgtggaa
     4141 gtcacccaaa aggaggtgga agcaatatcc tcagagccac ttctctcttc tgaggttgaa
     4201 ccgacacgaa agcctcgcac gaaaccgcga aaaaacgagc tggacaagct aaacgacgac
     4261 attgcgcaaa tgtattacgg ggaggaagtg atgcgtgcca ccagtcgcag ggcttgtacc
     4321 cgtcgatcgc gcacgtcctc gcacacgcgc accagtagcc agcattccag gacgtcctct
     4381 gtatcgcgaa ccgatagcat atccaccgta tcggatatta gttccataat cgtcaggaac
     4441 acggcgcgaa ggggtagagg catcagatca tccgaaaatg gcatcaaccg tgccacgttt
     4501 aatgcatcct tgaatgcaaa aaaaccaaag ttgtgccgtg ttagaataaa gcgatgtgct
     4561 gcattgatgg agatgataaa ggaccaggaa aaggaggaac aggagaagaa ggagcagaag
     4621 aataaagaac cgaagaagaa gaaagtgggt gtgcaaaaaa agccattgaa aagtaagccg
     4681 aaaagagaga atagcgttat tcttaacaca aatcccgaat ggcactccat ttcgaaggct
     4741 gttatcaagt gtgtcgtctg ctcgaagtgg gttcgcagga gcccactctc tcattatatg
     4801 atgtgccata aggagcacta tgccgcccga ttgccacccg atgtgcttaa agagctgcgg
     4861 gccgggcgcg gaaatcgacc ggattactgg gtttcgcaac gcggcggcta cacattgcac
     4921 ttcacttgcc cgttctgcca gaagccactg ctactctgcc aaaaaggcat gatcgagcac
     4981 ttgatcggcc atatgggcga gtctcgtttt tactgctcca actgtaatat gccacagaac
     5041 cgcctcagta ggctgctgga ccacaccgca tcctgtgggc caggtgcgaa gcctttaagt
     5101 agcaaaaccg tctgcctacc gatgagtgtt cacgtgtgcc acatctgcca gtttatgcag
     5161 tacagcaagg aaaatatgga ccggcatctt actgttcagc atggcctaac gaaggaggaa
     5221 ctagaaagtg tggagcgcga ggagttgatg ctctgcgaca caacagacgt accatatgca
     5281 gattcgaata aggatggcag cgccgttggg cctgaaattc aaaaaaacaa accgaagtcg
     5341 aaagcaagtg cagcactttc aaatacaacc aaaaaaaata aacagcaaaa gaagacaaac
     5401 aacagtcgct tcatgaaaat ggtcaaaaaa tccgtaagtc tattgaagcg agaaagagaa
     5461 gagcaggatg accagaacat gacccaggct aacgaagggt cggaagtgcc ggaaataccg
     5521 ccgcctccgc cagaaattga gcccttgttt gtggtcaatg agtgtctaat gacctctgaa
     5581 atggacacgg acatggaaga agtcttggaa cagcccgttc aacatatgag cttaatggta
     5641 gacgaaaagc ctgtgacgct actcagtggg gccacagaac agctggagcc tagtgtccct
     5701 gatcccgagc ctgttgttcc atctgcacaa gatgatggca aagatgtaaa tgaagatgaa
     5761 gacgtagacg tggaggcagt agtggattcc cttcagtcac acactgacca gacggctact
     5821 tctatgttag cagaagtcag tctagccgaa ttggctgggg atgtacttga tggtattggc
     5881 agcgacgcgt ccgactatga gatggatgat aattcagagc aagtggatac aactaacaaa
     5941 aacggctacg gtgatgatga cgatgatgcg cttaccgacg attgggtgga tctggagact
     6001 gccaagcgca attccaagtc cgccaagagc atttttagag tgttcaatcg cttctgctcg
     6061 cgtttaaaca aattaccccg atccagcaga gcagtgccct cgaatgggag tgaaaacagc
     6121 gatggcagcg acaacaacga cgacgacggc gataatcctg atcccagcga gctaatgcca
     6181 acaatgcaac cattggagcc ggagccagag atgggggatt catccacatc tacaggtgct
     6241 aagtcgctat ccgaacgggt ggagaatgtg ggctttcaaa agccctcttc agacgaggat
     6301 caaaatcgcg tggcagcatc ctattactgc gtgcagccgg gttgcacttt cctcttttcc
     6361 aatgagctgg aaggcctcga gaatcatttt gcgttagagc accctcttgt tcgatggagc
     6421 ggcaaatgtg gcatgtgccg tcagaaaatc acggcaacgg aaacgaatct cagaatttct
     6481 gaagagttgc gccacatgag ggacgtgcac atgaaggaca tatccaccct gcctcctcct
     6541 cagtcatctg cggttgaaag cccagccgtt attgaatcct gcctgaatca gcgtgaacca
     6601 gtacctgaat cagaacctga tcccgttcct gagcttccca agctgcgtgt tcgacgcttc
     6661 actggggatc gccttgttgt ggattcacaa gcggaaaaga gccaaccggt agcaatagtt
     6721 gtcagtgatg atgataatcc gcgaaatggg atgctaaggg acttgctggc ggcggatcca
     6781 cggccaccca atcagcagtt ggacctccaa gccgctggac tgggcgagtt cctttgcgcc
     6841 aagcccgatt caccgtcaac agaaccggtc aagcagacgc ccgtaattgt tggctattcg
     6901 agtggcttgg gcttgaaaat cggccaggtc cttagcagaa ctcagatttc agctaactca
     6961 cggctatcgc cagtcgttaa cgatcccctg ccagagaagt cttctgctcc tgctgccgtt
     7021 gaagagaatc gtaatcgatt caggtgcatg gccaccaact gcaattttgt tgctcacaag
     7081 ctcatgttca tgcgggagca catgaagttt cacagctaca gtttcagcag caccggtcac
     7141 ctgaactgcg cgtactgctc ccatgtggca gtcgatgtgg atgattactt gcgccacgga
     7201 gtgatcattc acgacctggc accacgctcc gaactggaga gttcaactgg accaccatct
     7261 gttacccaga aaatccggga tatgctcagc cagcgggaaa atggtcgtgt tccaccacca
     7321 actcctcaag tcactctgtc tgatgtggtc ctgggtcttt tagaatgcac cggatacagc
     7381 gaggataaac tgtacgcctg tccccaaaag ggctgcatcg tgcggctgac agatgagcag
     7441 cttgtaaacc atttgcgcta ccacattcgt agcactcatc agggcagcga gttggtgaaa
     7501 tgcaagtttt gcaccaaggc gatgcatccg ccggcacttc gtacgcatct gcagcagtac
     7561 cacgcccggc acagcatctt ctgcggcatt tgcttagcca catcggtcaa ccagcgcata
     7621 atgatgtatc acatgagcac ggtgcactcc aaggcctacg gccggcctaa cgcgcggctg
     7681 gcgtttgtgt cactgcccgt gaagatcgac gcgagtaaga agaacgtaga aagcgagttc
     7741 tacgtggccg tcgtggaaca gccctttggc aacctccaga tgcaggattt ccagcgcaag
     7801 ctgttcgatg aaatggaccg tcggcgttcg ggaacaaaga cgtacttccg cagctccgag
     7861 gtgcatatcc tgccaacgca gccaacattc cagcgaccgc tatactgtac ggagtgcccc
     7921 ttctccacca cgtcaagggt taacatgcag atgcacctct atgagcacaa ggatgagacc
     7981 attcgggaag cctccaaatt ggcggacttg atagttccag caacctcttc ggtattaact
     8041 gtttcggcga gtacgttggt ggcaccgccg aggccaggca aagattcaga aaaaccatct
     8101 acttccggac aaagtggtga tgcagcgacg gagcagctga atccagatgt tcctggaacc
     8161 cacaagccca tcaagccacc gttacgctat gtgcccccgg accaacgcta ccgctgtggc
     8221 ttcctccgat gtagcgtcct ttgtttttcg gaatctgcgg tgcgcaaaca catgcaggct
     8281 aaccacaaat actcggaggt ggtaaggtgc ccgcactgca agaactgcca gggtcagttt
     8341 ggagtagata agtactttga ccatcttgca atgcataagc ggcacatctt ccaatgcggc
     8401 gcttgctcac gtcacaatag caggcgtgtc atcgagcggc acatacagga acgtcacaat
     8461 attcaagatg tggacatgat cgtacaccgc cataatgaca gcaacaaaac gaccgaagcc
     8521 cgctggctga aggcgcctaa attggcacgt cattcgctaa tggagtacac gtgtaacctg
     8581 tgcctcaagt actttccaac gaccgtgcag atcatggccc atgcggcgtc cgttcacaaa
     8641 cgcaactacc agtaccactg tccgtactgt gaatttggtg gaaacctcgc caccgcgctc
     8701 attgaacaca tccttcgcga gcacccggaa agggaagtgc agcctgtgca aatctaccag
     8761 cgcatcgtgt gtaagaacaa gcagacgcta ggcttctact gcaccacctg tcacgaggtg
     8821 gccagcagct tccagaagat cgctatgcac tgcgacaagg agcataagtc gcgcaatccg
     8881 gtgcaatgtc cccactgcat tttcgggcat ttggccgaac gccaggttgt cttacacata
     8941 caagagaagc atccccatga acgcggactg gcaatggtgc agttcgaacg cgtgcttaat
     9001 gacatcccga acagcataag ctgggagata ggtcggccca tcgaagtgga gcctgagaag
     9061 gagatcccga acaatgggga gagtgcattc ctgccgctaa gccagagaca ggttgtaacg
     9121 gaagtggtgg acctgctgga ttcagacgac gaggcggacg agtacggtga acaagatgac
     9181 gcgaaaatcg tggagttcgc ctgcacacac tgcgacggga caaacaccaa cttgccggac
     9241 ctacgctccc agcactgggc ccgcgaacat cccgaccagc ccttctattt ccgcgttcag
     9301 ccgatgctgc tctgctccga gtgcaagaga tttaggggca atgcaaaggc acttcgcgag
     9361 cacctgcgtg cgacacactc tatccggagc atagtggctg cggacattcg tcgaccgatg
     9421 gagtgcgctt actgcgacta ccgctataaa aacaggcacg atcttgcgaa acacatcagt
     9481 gagataggtc acctgcccaa tgacctgaag cacgtaacag atgatgaaat tgatgccctg
     9541 atgctgctca gtgccagtgg aagtggtggg gctgttaacg aatactacca gtgcggattg
     9601 tgcagtgtgg ttatgccaac gaaggagaca attgtccagc acggccaagt ggaacactgc
     9661 aagcccgacg agcgtttctg cttccggcag ctagtgtcgc cagtgatata ccattgttcc
     9721 ttctgcatgt tcaactcgac cgatgagctg actacgctgc gccatatggt ggaccactac
     9781 agccgcttcc tggtctgcca tttctgcaca cgctctcagc cgggtggttt cgatgagtac
     9841 atccagcact gctataccta ccaccgggac gatatcaaat ccttccggga cgtgcacacg
     9901 tttagcgatc tgaagaggta ccttagtcag gtgcattacc aattccagaa tgggttgatt
     9961 atcacaaaaa gcagtctccg ttatacacgt tacaaatccg acaaatgtat gcttgagcta
    10021 gacgctgagc taatggccaa ggcccagcgg ccacccattc cgcgtctgca tatcagactc
    10081 aagtcgaccg gcgttcagat gcagagcccc gagggggctg atgtggagaa acctgtgtcg
    10141 ttgttgcgga tcacaaagcg acgaaaaacg cttaatcctg gcgaattgct ccgctcattc
    10201 cgcgaggaga atgaggtaca gccacagcca ccggcctctt caacatcgtc ggggacggct
    10261 ccttctcctg cggcaggttc tgtgttcaac ctgttcaagc gccgcaacag tctcgttgtc
    10321 cgcccagcaa ccagcaactt ggatcaacac taacaccaca acattattat aatttttttt
    10381 tttttttatt tcttctggac cccatttatg catatttcgt actagtttca aactatttaa
    10441 gcattttttt tttaaacata attatgagat catgtttact acatttgtaa gatcaataat
    10501 gataagatag gactctaact gacaacggct tcaaataatt aaactattgt tttaactgaa
    10561 tcaattcatg tattctggta taacatggca aggaataaag atggacatga actaactctg
    10621 atgccaacgc cgtctgaaag ctcaaggaag ctgggttcgg aaaaatcgaa attgaaattt
    10681 gactgctgga atcgtttaga actttgatac gaaaagcatt ttggacaatt agaacacgca
    10741 tgataagtaa gaatcgattt tgtttaat