Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

Drosophila melanogaster small ovary (sov), transcript variant B,


LOCUS       NM_167090              10332 bp    mRNA    linear   INV 26-DEC-2023
            mRNA.
ACCESSION   NM_167090
VERSION     NM_167090.2
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 10332)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            High-Throughput Data
  JOURNAL   G3 (Bethesda) 5 (8), 1721-1736 (2015)
   PUBMED   26109357
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 10332)
  AUTHORS   Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
            Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
            Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: The
            Rule-Benders
  JOURNAL   G3 (Bethesda) 5 (8), 1737-1749 (2015)
   PUBMED   26109356
  REMARK    Publication Status: Online-Only
REFERENCE   3  (bases 1 to 10332)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
            Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
            Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
            Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
            Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
            Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
  TITLE     The Release 6 reference sequence of the Drosophila melanogaster
            genome
  JOURNAL   Genome Res 25 (3), 445-458 (2015)
   PUBMED   25589440
REFERENCE   4  (bases 1 to 10332)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
            Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
            Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
  TITLE     Sequence finishing and mapping of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1625-1628 (2007)
   PUBMED   17569867
REFERENCE   5  (bases 1 to 10332)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  TITLE     The Release 5.1 annotation of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1586-1591 (2007)
   PUBMED   17569856
  REMARK    Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE   6  (bases 1 to 10332)
  AUTHORS   Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
            Ashburner,M. and Anxolabehere,D.
  TITLE     Combined evidence annotation of transposable elements in genome
            sequences
  JOURNAL   PLoS Comput Biol 1 (2), 166-175 (2005)
   PUBMED   16110336
REFERENCE   7  (bases 1 to 10332)
  AUTHORS   Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
            Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
            Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
            Celniker,S.E., Rubin,G.M. and Karpen,G.H.
  TITLE     Heterochromatic sequences in a Drosophila whole-genome shotgun
            assembly
  JOURNAL   Genome Biol 3 (12), RESEARCH0085 (2002)
   PUBMED   12537574
REFERENCE   8  (bases 1 to 10332)
  AUTHORS   Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
            Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
            Rubin,G.M., Ashburner,M. and Celniker,S.E.
  TITLE     The transposable elements of the Drosophila melanogaster
            euchromatin: a genomics perspective
  JOURNAL   Genome Biol 3 (12), RESEARCH0084 (2002)
   PUBMED   12537573
REFERENCE   9  (bases 1 to 10332)
  AUTHORS   Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
            Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
            Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
            Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
            Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
            Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
            Rubin,G.M. and Lewis,S.E.
  TITLE     Annotation of the Drosophila melanogaster euchromatic genome: a
            systematic review
  JOURNAL   Genome Biol 3 (12), RESEARCH0083 (2002)
   PUBMED   12537572
REFERENCE   10 (bases 1 to 10332)
  AUTHORS   Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
            Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
            Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
            Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
            Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
            Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
            Gibbs,R.A. and Rubin,G.M.
  TITLE     Finishing a whole-genome shotgun: release 3 of the Drosophila
            melanogaster euchromatic genome sequence
  JOURNAL   Genome Biol 3 (12), RESEARCH0079 (2002)
   PUBMED   12537568
REFERENCE   11 (bases 1 to 10332)
  AUTHORS   Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
            Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
            George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
            Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
            Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
            Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
            Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
            Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
            Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
            Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
            Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
            Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
            Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
            Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
            Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
            Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
            Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
            Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
            Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
            Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
            Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
            Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
            Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
            McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
            Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
            Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
            Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
            Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
            Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
            Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
            Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
            Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
            Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
            Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
            Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
            Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
            Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
  TITLE     The genome sequence of Drosophila melanogaster
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
   PUBMED   10731132
REFERENCE   12 (bases 1 to 10332)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
            Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
            Smith,E., Yu,C. and Rubin,G.
  CONSRTM   Berkeley Drosophila Genome Project
  TITLE     Drosophila melanogaster release 4 sequence
  JOURNAL   Unpublished
REFERENCE   13 (bases 1 to 10332)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   14 (bases 1 to 10332)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   15 (bases 1 to 10332)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   16 (bases 1 to 10332)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (20-APR-2020) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   17 (bases 1 to 10332)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2019) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   18 (bases 1 to 10332)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   19 (bases 1 to 10332)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   20 (bases 1 to 10332)
  AUTHORS   Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
            Park,S., Svirskas,R. and Karpen,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
            Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   21 (bases 1 to 10332)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
            Svirskas,R. and Rubin,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   22 (bases 1 to 10332)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  CONSRTM   Drosophila Heterochromatin Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
            Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
            Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE   23 (bases 1 to 10332)
  AUTHORS   Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by FlyBase. This
            record is derived from an annotated genomic sequence (NC_004354).
            
            On May 8, 2012 this sequence version replaced NM_167090.1.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider :: FlyBase
            Annotation Status   :: Full annotation
            Annotation Version  :: Release 6.54
            URL                 :: http://flybase.org
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..10332
                     /organism="Drosophila melanogaster"
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="X"
                     /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
                     bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
     gene            1..10332
                     /gene="sov"
                     /locus_tag="Dmel_CG14438"
                     /gene_synonym="CG14438; Dmel\CG14438; EM25; fs(1)A1304;
                     fs(1)M105; l(1)6Dc; l(1)EA42; l(1)EM25; Sov"
                     /note="small ovary"
                     /map="6C12-6C13"
                     /db_xref="FLYBASE:FBgn0287725"
                     /db_xref="GeneID:31615"
     CDS             191..10132
                     /gene="sov"
                     /locus_tag="Dmel_CG14438"
                     /gene_synonym="CG14438; Dmel\CG14438; EM25; fs(1)A1304;
                     fs(1)M105; l(1)6Dc; l(1)EA42; l(1)EM25; Sov"
                     /note="CG14438 gene product from transcript CG14438-RB;
                     CG14438-PB; sov-PB; female sterile(1)M105; lethal (1) 6Dc;
                     female sterile (1) A1304"
                     /codon_start=1
                     /product="small ovary, isoform B"
                     /protein_id="NP_727123.1"
                     /db_xref="FLYBASE:FBpp0070923"
                     /db_xref="GeneID:31615"
                     /db_xref="FLYBASE:FBgn0287725"
                     /translation="MEDSEDDVVVVSCDTSMKEKVKAKLVEIRKFVPFIRRVRIDFQD
                     TLSKVQGHRLDALVNLLDREDVSMSSLNKIEVIIDKLRTRFNPRIEIDTGEIIDITEN
                     TEYDSTASSSSGSSAQPPAQRAKSSDKGGLLNGSSLAGRLNASLNPASDATKDHLSTP
                     KQLELERNSKVSCSTSKIKTSSISAKISVFPATSMDLNIPKTTSAKASDEGQRSPAEP
                     RAALQAIVQDTKTPTIPEPTSPAALKHSSLRGSRGFLAVMQKALIEEKKQRASEQKTD
                     KETNGVTQLETNFSRRSYTTSSQSTCRSSEISVRAENPDFKRRSTSLVQHAPLQEASP
                     GQSKKDLPISLSVQGLPALVSASTASPANTLEEARKKLAALKYGLGTTVPSMPPLASN
                     INDPRGRKGINLPETNNNKDNDLGIALQSPPPMRTPSPIPPPPRMKAGTWASFSNVPQ
                     ESAFTGQHAVQRNSVPPGDSRAFGDALAHEPRSFYGTDSREPRDPRIWKSKTSQQQQH
                     QQAPQAQIPPYSSDPRRSISTYSGFEESANRGQTNSKNNNNHNQNQNVTANGSAYIAN
                     WSGNPYGNGPNPRPPFPSNQNGNEYAGGFNGSHSFRGGFRGGHNKRGFGRHNDVPRTY
                     GEHRKAKARAEAEAKAKAEAEAKAKAAAEAKAKAAAEVRQLETEVSREMEAQEKNKQQ
                     KEKPEESEAEKSTIAVTQVPELDTSYRNVNLGVLNKKLDFRIPKKTLPPATTITSTSP
                     VNGNGENPSCPSNSPTSKSCDANQDKDTYKNKDRYLNKAKAKDKVDKGNEVSENNLDK
                     SEKLEKSQDKKANDKENKSDKKEKKRLNREPEKKSKVENPLEIVDSNSVVSEESSENT
                     DNVENEPPLSETNASPVPELATSTQDSQQDQSVSEELDILAKNRRMSGTRIKTPISST
                     GNPALKRRADDDVEDKLENPTKKNCAKWEAKPDKEKSEDDTIDKIKSMKVTKFADVEM
                     KVTEESQSAEEEEITEQKESTEEEEGTEHKKSTEEKDKPPKISKIKIVLTPIAHTTQV
                     VRPNDGFKNNQEKILDNMATDEHDDEEVPGPPAQFLRRIMQRRNSLAPTYMKPMVDKD
                     KIASSSFTYEDLPEQKRGNQNARNLAIIFEKTSDNCSVSTQNIINGKRRTRGCETSFN
                     ETQLSRNIFGMGQINRSRPKATEGKATRAKISVPAKDSTRADDDPIPTPNLKRKRQAI
                     HKETEDDVEMKPKKARLEAQEINGVSVTPDEQQVENNVEVTQKEVEAISSEPLLSSEV
                     EPTRKPRTKPRKNELDKLNDDIAQMYYGEEVMRATSRRACTRRSRTSSHTRTSSQHSR
                     TSSVSRTDSISTVSDISSIIVRNTARRGRGIRSSENGINRATFNASLNAKKPKLCRVR
                     IKRCAALMEMIKDQEKEEQEKKEQKNKEPKKKKVGVQKKPLKSKPKRENSVILNTNPE
                     WHSISKAVIKCVVCSKWVRRSPLSHYMMCHKEHYAARLPPDVLKELRAGRGNRPDYWV
                     SQRGGYTLHFTCPFCQKPLLLCQKGMIEHLIGHMGESRFYCSNCNMPQNRLSRLLDHT
                     ASCGPGAKPLSSKTVCLPMSVHVCHICQFMQYSKENMDRHLTVQHGLTKEELESVERE
                     ELMLCDTTDVPYADSNKDGSAVGPEIQKNKPKSKASAALSNTTKKNKQQKKTNNSRFM
                     KMVKKSVSLLKREREEQDDQNMTQANEGSEVPEIPPPPPEIEPLFVVNECLMTSEMDT
                     DMEEVLEQPVQHMSLMVDEKPVTLLSGATEQLEPSVPDPEPVVPSAQDDGKDVNEDED
                     VDVEAVVDSLQSHTDQTATSMLAEVSLAELAGDVLDGIGSDASDYEMDDNSEQVDTTN
                     KNGYGDDDDDALTDDWVDLETAKRNSKSAKSIFRVFNRFCSRLNKLPRSSRAVPSNGS
                     ENSDGSDNNDDDGDNPDPSELMPTMQPLEPEPEMGDSSTSTGAKSLSERVENVGFQKP
                     SSDEDQNRVAASYYCVQPGCTFLFSNELEGLENHFALEHPLVRWSGKCGMCRQKITAT
                     ETNLRISEELRHMRDVHMKDISTLPPPQSSAVESPAVIESCLNQREPVPESEPDPVPE
                     LPKLRVRRFTGDRLVVDSQAEKSQPVAIVVSDDDNPRNGMLRDLLAADPRPPNQQLDL
                     QAAGLGEFLCAKPDSPSTEPVKQTPVIVGYSSGLGLKIGQVLSRTQISANSRLSPVVN
                     DPLPEKSSAPAAVEENRNRFRCMATNCNFVAHKLMFMREHMKFHSYSFSSTGHLNCAY
                     CSHVAVDVDDYLRHGVIIHDLAPRSELESSTGPPSVTQKIRDMLSQRENGRVPPPTPQ
                     VTLSDVVLGLLECTGYSEDKLYACPQKGCIVRLTDEQLVNHLRYHIRSTHQGSELVKC
                     KFCTKAMHPPALRTHLQQYHARHSIFCGICLATSVNQRIMMYHMSTVHSKAYGRPNAR
                     LAFVSLPVKIDASKKNVESEFYVAVVEQPFGNLQMQDFQRKLFDEMDRRRSGTKTYFR
                     SSEVHILPTQPTFQRPLYCTECPFSTTSRVNMQMHLYEHKDETIREASKLADLIVPAT
                     SSVLTVSASTLVAPPRPGKDSEKPSTSGQSGDAATEQLNPDVPGTHKPIKPPLRYVPP
                     DQRYRCGFLRCSVLCFSESAVRKHMQANHKYSEVVRCPHCKNCQGQFGVDKYFDHLAM
                     HKRHIFQCGACSRHNSRRVIERHIQERHNIQDVDMIVHRHNDSNKTTEARWLKAPKLA
                     RHSLMEYTCNLCLKYFPTTVQIMAHAASVHKRNYQYHCPYCEFGGNLATALIEHILRE
                     HPEREVQPVQIYQRIVCKNKQTLGFYCTTCHEVASSFQKIAMHCDKEHKSRNPVQCPH
                     CIFGHLAERQVVLHIQEKHPHERGLAMVQFERVLNDIPNSISWEIGRPIEVEPEKEIP
                     NNGESAFLPLSQRQVVTEVVDLLDSDDEADEYGEQDDAKIVEFACTHCDGTNTNLPDL
                     RSQHWAREHPDQPFYFRVQPMLLCSECKRFRGNAKALREHLRATHSIRSIVAADIRRP
                     MECAYCDYRYKNRHDLAKHISEIGHLPNDLKHVTDDEIDALMLLSASGSGGAVNEYYQ
                     CGLCSVVMPTKETIVQHGQVEHCKPDERFCFRQLVSPVIYHCSFCMFNSTDELTTLRH
                     MVDHYSRFLVCHFCTRSQPGGFDEYIQHCYTYHRDDIKSFRDVHTFSDLKRYLSQVHY
                     QFQNGLIITKSSLRYTRYKSDKCMLELDAELMAKAQRPPIPRLHIRLKSTGVQMQSPE
                     GADVEKPVSLLRITKRRKTLNPGELLRSFREENEVQPQPPASSTSSGTAPSPAAGSVF
                     NLFKRRNSLVVRPATSNLDQH"
ORIGIN      
        1 attcctttga cgacgccgtt ttgtgctgcc tcctttagaa acgttaatta agttccaaga
       61 aatgaatgga aatgtgtttt attttataca ttgcgtattc ttatataaga agtcgcagtt
      121 ttgagggata ttcatggcgc actagttgtg ttcgtccgaa gagtcggata cgaacagtgc
      181 ctgctgcaaa atggaggata gcgaggacga cgtggtggtg gtgagctgcg atacctcgat
      241 gaaggagaag gtaaaggcca agctggtgga gatccgtaag tttgtgccct ttatccggcg
      301 tgtgcgaata gacttccagg atactttgtc caaggttcag ggtcatcgtc tggatgccct
      361 ggttaacctg ctggatcgcg aggacgtatc gatgagctct cttaacaaga tcgaggtgat
      421 cattgataag ctaaggacgc gcttcaatcc gaggatcgaa attgacactg gcgaaatcat
      481 tgatatcact gaaaacactg agtatgattc cactgcgtcg agttcctccg ggtcaagcgc
      541 acaaccacca gctcaaagag cgaagtcctc agataagggt ggcctgttga acggctcatc
      601 tctggctggc aggttaaatg cttcacttaa ccccgcatca gatgctacca aggaccatct
      661 atcgacacca aagcaactag aattggagcg caactctaaa gtctcttgct caacttcaaa
      721 aataaagact tcttccattt ctgccaagat ttcagttttt ccagccactt caatggattt
      781 gaacatccca aaaactacca gcgccaaggc atcggatgag gggcagcggt cacctgcaga
      841 accacgtgcc gcccttcaag ctatagttca agatacgaaa acaccaacca ttccagaacc
      901 aacatcacca gcggcgctta agcattcctc ccttcgtggc agtcgtggat ttctggctgt
      961 catgcagaag gccttaattg aagagaagaa gcagcgagct agcgaacaga aaactgataa
     1021 agaaactaac ggtgtaacgc agctagagac aaacttctcg cggcgatctt atacaacatc
     1081 gtcacagtca acctgccgtt cttcagaaat atcggtaaga gcagaaaacc cagattttaa
     1141 gcgacgaagc acatcgcttg tgcagcatgc tcctctacag gaggcctccc cagggcaatc
     1201 caaaaaagac ttgcccatat ccttgtcggt acagggtcta ccagctttgg tcagtgccag
     1261 cactgcaagt ccagcaaata cgcttgagga ggcccgcaag aagctggcgg ccttgaaata
     1321 tggactagga acaacggtac caagcatgcc tccactggcc tccaatataa atgatccacg
     1381 cggtagaaaa ggaataaacc tgcctgaaac taacaacaat aaagacaacg acttgggtat
     1441 agcgctgcaa tccccgccgc ctatgcggac tccctcgcct attccgccgc caccaaggat
     1501 gaaggccggt acgtgggcct cattttcaaa tgttccccag gaaagcgcat ttacaggcca
     1561 gcatgctgtg cagcgcaact cggtacctcc gggagattct cgagcctttg gggatgcttt
     1621 ggcacatgaa ccaaggtcct tctatggcac tgattcccga gaaccccgag accctcgtat
     1681 ctggaagagc aagacttccc aacagcagca acatcagcag gcgccacagg cacaaattcc
     1741 tccgtattcc agtgacccgc gtcgttctat aagcacttac agcggtttcg aagagtcagc
     1801 caatagaggg caaactaata gtaaaaataa taataatcat aatcagaatc agaatgttac
     1861 cgccaatgga agtgcataca ttgcgaactg gagtggcaat ccatatggaa acggtccaaa
     1921 cccaagaccg ccctttccta gcaatcaaaa cgggaatgaa tatgctggtg gttttaatgg
     1981 ttcgcatagt ttcaggggcg gatttcgcgg cggtcacaac aaacggggct ttggacgaca
     2041 caatgacgtg ccacgcacat atggggaaca ccgcaaagcc aaggcccgtg ctgaggcgga
     2101 ggccaaggct aaggctgagg cggaggccaa ggctaaggct gcggcggagg ccaaggctaa
     2161 ggctgcggcg gaggtacgcc aattagaaac ggaagtttcg cgggagatgg aagcccaaga
     2221 aaaaaataaa cagcaaaagg aaaagccgga ggagagcgag gcggagaagt cgacgatcgc
     2281 agtgactcag gttccggaat tggacacctc ctaccgcaac gttaatctgg gggtgctaaa
     2341 caagaagcta gactttcgaa taccgaagaa aaccctccca ccggcaacaa caataacctc
     2401 aacaagtcca gtcaatggta atggggagaa tccaagctgc ccctcaaatt cccccacaag
     2461 caaaagctgt gatgccaacc aggacaaaga tacttataag aataaagata ggtatttaaa
     2521 taaggctaag gctaaagaca aggtagataa gggcaatgag gtgtcggaga acaatctgga
     2581 taagtctgag aagcttgaaa aatcgcagga taagaaggca aatgacaagg agaacaagtc
     2641 cgacaaaaag gagaagaaga gactgaacag ggagcctgaa aagaaatcaa aggttgagaa
     2701 ccccctcgag attgtggact cgaatagcgt ggtcagtgag gaaagctcgg aaaatacaga
     2761 caatgtggaa aatgaaccgc ctcttagcga gactaacgcg tctccagttc cagagctagc
     2821 taccagcact caggacagtc aacaggacca gtcagtgagt gaagagttgg acatccttgc
     2881 caaaaaccgt agaatgtccg gaactagaat aaagactccc atttcgtcta ctggaaaccc
     2941 tgcactaaag cgacgggcag atgatgatgt tgaggacaag ttggaaaatc cgactaaaaa
     3001 gaactgcgca aagtgggagg caaagcctga caaggaaaaa tcggaggatg ataccatcga
     3061 caagattaaa tccatgaaag taactaaatt cgctgatgtc gagatgaagg ttacagaaga
     3121 aagccagagt gctgaggagg aggagattac tgagcagaaa gagagtactg aggaggagga
     3181 aggtactgag cataaaaaga gtactgaaga gaaggataag ccgccaaaaa tctccaagat
     3241 aaaaattgtc cttactccca ttgcccatac aacacaagtg gttcgtccta atgatggctt
     3301 caagaacaat caagaaaaga tcttggacaa catggcaact gatgagcacg atgatgagga
     3361 ggtccccggg cccccagctc aattcctacg ccggattatg cagcgtcgga actccttggc
     3421 tcctacgtat atgaagccga tggtggacaa ggataagatt gcttcttcca gttttaccta
     3481 cgaggatctg ccggagcaga agcgcggaaa tcagaacgcc cgaaacctgg ccatcatttt
     3541 cgaaaaaact agtgacaact gcagcgtgtc cactcaaaac attattaatg gcaaacgtcg
     3601 cactcgtgga tgtgagacct cttttaacga gacccaattg agccgaaaca tctttggcat
     3661 gggccagata aacaggtcgc ggccaaaggc caccgagggt aaagctacca gggcaaagat
     3721 ttctgtccct gcaaaagatt caacccgtgc tgatgatgat cctattccca ctccaaatct
     3781 taagcggaaa aggcaagcta ttcacaagga aacagaggat gatgtggaga tgaagcctaa
     3841 gaaggctcga ttggaagcac aggagataaa tggggtcagt gttacgcctg atgaacagca
     3901 ggtagagaac aacgtggaag tcacccaaaa ggaggtggaa gcaatatcct cagagccact
     3961 tctctcttct gaggttgaac cgacacgaaa gcctcgcacg aaaccgcgaa aaaacgagct
     4021 ggacaagcta aacgacgaca ttgcgcaaat gtattacggg gaggaagtga tgcgtgccac
     4081 cagtcgcagg gcttgtaccc gtcgatcgcg cacgtcctcg cacacgcgca ccagtagcca
     4141 gcattccagg acgtcctctg tatcgcgaac cgatagcata tccaccgtat cggatattag
     4201 ttccataatc gtcaggaaca cggcgcgaag gggtagaggc atcagatcat ccgaaaatgg
     4261 catcaaccgt gccacgttta atgcatcctt gaatgcaaaa aaaccaaagt tgtgccgtgt
     4321 tagaataaag cgatgtgctg cattgatgga gatgataaag gaccaggaaa aggaggaaca
     4381 ggagaagaag gagcagaaga ataaagaacc gaagaagaag aaagtgggtg tgcaaaaaaa
     4441 gccattgaaa agtaagccga aaagagagaa tagcgttatt cttaacacaa atcccgaatg
     4501 gcactccatt tcgaaggctg ttatcaagtg tgtcgtctgc tcgaagtggg ttcgcaggag
     4561 cccactctct cattatatga tgtgccataa ggagcactat gccgcccgat tgccacccga
     4621 tgtgcttaaa gagctgcggg ccgggcgcgg aaatcgaccg gattactggg tttcgcaacg
     4681 cggcggctac acattgcact tcacttgccc gttctgccag aagccactgc tactctgcca
     4741 aaaaggcatg atcgagcact tgatcggcca tatgggcgag tctcgttttt actgctccaa
     4801 ctgtaatatg ccacagaacc gcctcagtag gctgctggac cacaccgcat cctgtgggcc
     4861 aggtgcgaag cctttaagta gcaaaaccgt ctgcctaccg atgagtgttc acgtgtgcca
     4921 catctgccag tttatgcagt acagcaagga aaatatggac cggcatctta ctgttcagca
     4981 tggcctaacg aaggaggaac tagaaagtgt ggagcgcgag gagttgatgc tctgcgacac
     5041 aacagacgta ccatatgcag attcgaataa ggatggcagc gccgttgggc ctgaaattca
     5101 aaaaaacaaa ccgaagtcga aagcaagtgc agcactttca aatacaacca aaaaaaataa
     5161 acagcaaaag aagacaaaca acagtcgctt catgaaaatg gtcaaaaaat ccgtaagtct
     5221 attgaagcga gaaagagaag agcaggatga ccagaacatg acccaggcta acgaagggtc
     5281 ggaagtgccg gaaataccgc cgcctccgcc agaaattgag cccttgtttg tggtcaatga
     5341 gtgtctaatg acctctgaaa tggacacgga catggaagaa gtcttggaac agcccgttca
     5401 acatatgagc ttaatggtag acgaaaagcc tgtgacgcta ctcagtgggg ccacagaaca
     5461 gctggagcct agtgtccctg atcccgagcc tgttgttcca tctgcacaag atgatggcaa
     5521 agatgtaaat gaagatgaag acgtagacgt ggaggcagta gtggattccc ttcagtcaca
     5581 cactgaccag acggctactt ctatgttagc agaagtcagt ctagccgaat tggctgggga
     5641 tgtacttgat ggtattggca gcgacgcgtc cgactatgag atggatgata attcagagca
     5701 agtggataca actaacaaaa acggctacgg tgatgatgac gatgatgcgc ttaccgacga
     5761 ttgggtggat ctggagactg ccaagcgcaa ttccaagtcc gccaagagca tttttagagt
     5821 gttcaatcgc ttctgctcgc gtttaaacaa attaccccga tccagcagag cagtgccctc
     5881 gaatgggagt gaaaacagcg atggcagcga caacaacgac gacgacggcg ataatcctga
     5941 tcccagcgag ctaatgccaa caatgcaacc attggagccg gagccagaga tgggggattc
     6001 atccacatct acaggtgcta agtcgctatc cgaacgggtg gagaatgtgg gctttcaaaa
     6061 gccctcttca gacgaggatc aaaatcgcgt ggcagcatcc tattactgcg tgcagccggg
     6121 ttgcactttc ctcttttcca atgagctgga aggcctcgag aatcattttg cgttagagca
     6181 ccctcttgtt cgatggagcg gcaaatgtgg catgtgccgt cagaaaatca cggcaacgga
     6241 aacgaatctc agaatttctg aagagttgcg ccacatgagg gacgtgcaca tgaaggacat
     6301 atccaccctg cctcctcctc agtcatctgc ggttgaaagc ccagccgtta ttgaatcctg
     6361 cctgaatcag cgtgaaccag tacctgaatc agaacctgat cccgttcctg agcttcccaa
     6421 gctgcgtgtt cgacgcttca ctggggatcg ccttgttgtg gattcacaag cggaaaagag
     6481 ccaaccggta gcaatagttg tcagtgatga tgataatccg cgaaatggga tgctaaggga
     6541 cttgctggcg gcggatccac ggccacccaa tcagcagttg gacctccaag ccgctggact
     6601 gggcgagttc ctttgcgcca agcccgattc accgtcaaca gaaccggtca agcagacgcc
     6661 cgtaattgtt ggctattcga gtggcttggg cttgaaaatc ggccaggtcc ttagcagaac
     6721 tcagatttca gctaactcac ggctatcgcc agtcgttaac gatcccctgc cagagaagtc
     6781 ttctgctcct gctgccgttg aagagaatcg taatcgattc aggtgcatgg ccaccaactg
     6841 caattttgtt gctcacaagc tcatgttcat gcgggagcac atgaagtttc acagctacag
     6901 tttcagcagc accggtcacc tgaactgcgc gtactgctcc catgtggcag tcgatgtgga
     6961 tgattacttg cgccacggag tgatcattca cgacctggca ccacgctccg aactggagag
     7021 ttcaactgga ccaccatctg ttacccagaa aatccgggat atgctcagcc agcgggaaaa
     7081 tggtcgtgtt ccaccaccaa ctcctcaagt cactctgtct gatgtggtcc tgggtctttt
     7141 agaatgcacc ggatacagcg aggataaact gtacgcctgt ccccaaaagg gctgcatcgt
     7201 gcggctgaca gatgagcagc ttgtaaacca tttgcgctac cacattcgta gcactcatca
     7261 gggcagcgag ttggtgaaat gcaagttttg caccaaggcg atgcatccgc cggcacttcg
     7321 tacgcatctg cagcagtacc acgcccggca cagcatcttc tgcggcattt gcttagccac
     7381 atcggtcaac cagcgcataa tgatgtatca catgagcacg gtgcactcca aggcctacgg
     7441 ccggcctaac gcgcggctgg cgtttgtgtc actgcccgtg aagatcgacg cgagtaagaa
     7501 gaacgtagaa agcgagttct acgtggccgt cgtggaacag ccctttggca acctccagat
     7561 gcaggatttc cagcgcaagc tgttcgatga aatggaccgt cggcgttcgg gaacaaagac
     7621 gtacttccgc agctccgagg tgcatatcct gccaacgcag ccaacattcc agcgaccgct
     7681 atactgtacg gagtgcccct tctccaccac gtcaagggtt aacatgcaga tgcacctcta
     7741 tgagcacaag gatgagacca ttcgggaagc ctccaaattg gcggacttga tagttccagc
     7801 aacctcttcg gtattaactg tttcggcgag tacgttggtg gcaccgccga ggccaggcaa
     7861 agattcagaa aaaccatcta cttccggaca aagtggtgat gcagcgacgg agcagctgaa
     7921 tccagatgtt cctggaaccc acaagcccat caagccaccg ttacgctatg tgcccccgga
     7981 ccaacgctac cgctgtggct tcctccgatg tagcgtcctt tgtttttcgg aatctgcggt
     8041 gcgcaaacac atgcaggcta accacaaata ctcggaggtg gtaaggtgcc cgcactgcaa
     8101 gaactgccag ggtcagtttg gagtagataa gtactttgac catcttgcaa tgcataagcg
     8161 gcacatcttc caatgcggcg cttgctcacg tcacaatagc aggcgtgtca tcgagcggca
     8221 catacaggaa cgtcacaata ttcaagatgt ggacatgatc gtacaccgcc ataatgacag
     8281 caacaaaacg accgaagccc gctggctgaa ggcgcctaaa ttggcacgtc attcgctaat
     8341 ggagtacacg tgtaacctgt gcctcaagta ctttccaacg accgtgcaga tcatggccca
     8401 tgcggcgtcc gttcacaaac gcaactacca gtaccactgt ccgtactgtg aatttggtgg
     8461 aaacctcgcc accgcgctca ttgaacacat ccttcgcgag cacccggaaa gggaagtgca
     8521 gcctgtgcaa atctaccagc gcatcgtgtg taagaacaag cagacgctag gcttctactg
     8581 caccacctgt cacgaggtgg ccagcagctt ccagaagatc gctatgcact gcgacaagga
     8641 gcataagtcg cgcaatccgg tgcaatgtcc ccactgcatt ttcgggcatt tggccgaacg
     8701 ccaggttgtc ttacacatac aagagaagca tccccatgaa cgcggactgg caatggtgca
     8761 gttcgaacgc gtgcttaatg acatcccgaa cagcataagc tgggagatag gtcggcccat
     8821 cgaagtggag cctgagaagg agatcccgaa caatggggag agtgcattcc tgccgctaag
     8881 ccagagacag gttgtaacgg aagtggtgga cctgctggat tcagacgacg aggcggacga
     8941 gtacggtgaa caagatgacg cgaaaatcgt ggagttcgcc tgcacacact gcgacgggac
     9001 aaacaccaac ttgccggacc tacgctccca gcactgggcc cgcgaacatc ccgaccagcc
     9061 cttctatttc cgcgttcagc cgatgctgct ctgctccgag tgcaagagat ttaggggcaa
     9121 tgcaaaggca cttcgcgagc acctgcgtgc gacacactct atccggagca tagtggctgc
     9181 ggacattcgt cgaccgatgg agtgcgctta ctgcgactac cgctataaaa acaggcacga
     9241 tcttgcgaaa cacatcagtg agataggtca cctgcccaat gacctgaagc acgtaacaga
     9301 tgatgaaatt gatgccctga tgctgctcag tgccagtgga agtggtgggg ctgttaacga
     9361 atactaccag tgcggattgt gcagtgtggt tatgccaacg aaggagacaa ttgtccagca
     9421 cggccaagtg gaacactgca agcccgacga gcgtttctgc ttccggcagc tagtgtcgcc
     9481 agtgatatac cattgttcct tctgcatgtt caactcgacc gatgagctga ctacgctgcg
     9541 ccatatggtg gaccactaca gccgcttcct ggtctgccat ttctgcacac gctctcagcc
     9601 gggtggtttc gatgagtaca tccagcactg ctatacctac caccgggacg atatcaaatc
     9661 cttccgggac gtgcacacgt ttagcgatct gaagaggtac cttagtcagg tgcattacca
     9721 attccagaat gggttgatta tcacaaaaag cagtctccgt tatacacgtt acaaatccga
     9781 caaatgtatg cttgagctag acgctgagct aatggccaag gcccagcggc cacccattcc
     9841 gcgtctgcat atcagactca agtcgaccgg cgttcagatg cagagccccg agggggctga
     9901 tgtggagaaa cctgtgtcgt tgttgcggat cacaaagcga cgaaaaacgc ttaatcctgg
     9961 cgaattgctc cgctcattcc gcgaggagaa tgaggtacag ccacagccac cggcctcttc
    10021 aacatcgtcg gggacggctc cttctcctgc ggcaggttct gtgttcaacc tgttcaagcg
    10081 ccgcaacagt ctcgttgtcc gcccagcaac cagcaacttg gatcaacact aacaccacaa
    10141 cattattata attttttttt ttttttattt cttctggacc ccatttatgc atatttcgta
    10201 ctagtttcaa actatttaag catttttttt ttaaacataa ttatgagatc atgtttacta
    10261 catttgtaag atcaataatg ataagatagg actctaactg acaacggctt caaataatta
    10321 aactattgtt tt