Unfortunately due to lack of commercial feasibility, the SkyBLAST service has been suspended from December 1st, 2025.
All subscriptions for paid accounts have been paused. For further information or enquiries, please email [email protected]

Drosophila melanogaster small ovary (sov), transcript variant C,


LOCUS       NM_001298033           10329 bp    mRNA    linear   INV 26-DEC-2023
            mRNA.
ACCESSION   NM_001298033
VERSION     NM_001298033.1
DBLINK      BioProject: PRJNA164
            BioSample: SAMN02803731
KEYWORDS    RefSeq.
SOURCE      Drosophila melanogaster (fruit fly)
  ORGANISM  Drosophila melanogaster
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
REFERENCE   1  (bases 1 to 10329)
  AUTHORS   Matthews,B.B., Dos Santos,G., Crosby,M.A., Emmert,D.B., St
            Pierre,S.E., Gramates,L.S., Zhou,P., Schroeder,A.J., Falls,K.,
            Strelets,V., Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: Impact of
            High-Throughput Data
  JOURNAL   G3 (Bethesda) 5 (8), 1721-1736 (2015)
   PUBMED   26109357
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 10329)
  AUTHORS   Crosby,M.A., Gramates,L.S., Dos Santos,G., Matthews,B.B., St
            Pierre,S.E., Zhou,P., Schroeder,A.J., Falls,K., Emmert,D.B.,
            Russo,S.M. and Gelbart,W.M.
  CONSRTM   FlyBase Consortium
  TITLE     Gene Model Annotations for Drosophila melanogaster: The
            Rule-Benders
  JOURNAL   G3 (Bethesda) 5 (8), 1737-1749 (2015)
   PUBMED   26109356
  REMARK    Publication Status: Online-Only
REFERENCE   3  (bases 1 to 10329)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Wan,K.H., Park,S., Mendez,I.,
            Galle,S.E., Booth,B.W., Pfeiffer,B.D., George,R.A., Svirskas,R.,
            Krzywinski,M., Schein,J., Accardo,M.C., Damia,E., Messina,G.,
            Mendez-Lago,M., de Pablos,B., Demakova,O.V., Andreyeva,E.N.,
            Boldyreva,L.V., Marra,M., Carvalho,A.B., Dimitri,P., Villasante,A.,
            Zhimulev,I.F., Rubin,G.M., Karpen,G.H. and Celniker,S.E.
  TITLE     The Release 6 reference sequence of the Drosophila melanogaster
            genome
  JOURNAL   Genome Res 25 (3), 445-458 (2015)
   PUBMED   25589440
REFERENCE   4  (bases 1 to 10329)
  AUTHORS   Hoskins,R.A., Carlson,J.W., Kennedy,C., Acevedo,D., Evans-Holm,M.,
            Frise,E., Wan,K.H., Park,S., Mendez-Lago,M., Rossi,F.,
            Villasante,A., Dimitri,P., Karpen,G.H. and Celniker,S.E.
  TITLE     Sequence finishing and mapping of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1625-1628 (2007)
   PUBMED   17569867
REFERENCE   5  (bases 1 to 10329)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  TITLE     The Release 5.1 annotation of Drosophila melanogaster
            heterochromatin
  JOURNAL   Science 316 (5831), 1586-1591 (2007)
   PUBMED   17569856
  REMARK    Erratum:[Science. 2007 Sep 7;317(5843):1325]
REFERENCE   6  (bases 1 to 10329)
  AUTHORS   Quesneville,H., Bergman,C.M., Andrieu,O., Autard,D., Nouaud,D.,
            Ashburner,M. and Anxolabehere,D.
  TITLE     Combined evidence annotation of transposable elements in genome
            sequences
  JOURNAL   PLoS Comput Biol 1 (2), 166-175 (2005)
   PUBMED   16110336
REFERENCE   7  (bases 1 to 10329)
  AUTHORS   Hoskins,R.A., Smith,C.D., Carlson,J.W., Carvalho,A.B., Halpern,A.,
            Kaminker,J.S., Kennedy,C., Mungall,C.J., Sullivan,B.A.,
            Sutton,G.G., Yasuhara,J.C., Wakimoto,B.T., Myers,E.W.,
            Celniker,S.E., Rubin,G.M. and Karpen,G.H.
  TITLE     Heterochromatic sequences in a Drosophila whole-genome shotgun
            assembly
  JOURNAL   Genome Biol 3 (12), RESEARCH0085 (2002)
   PUBMED   12537574
REFERENCE   8  (bases 1 to 10329)
  AUTHORS   Kaminker,J.S., Bergman,C.M., Kronmiller,B., Carlson,J.,
            Svirskas,R., Patel,S., Frise,E., Wheeler,D.A., Lewis,S.E.,
            Rubin,G.M., Ashburner,M. and Celniker,S.E.
  TITLE     The transposable elements of the Drosophila melanogaster
            euchromatin: a genomics perspective
  JOURNAL   Genome Biol 3 (12), RESEARCH0084 (2002)
   PUBMED   12537573
REFERENCE   9  (bases 1 to 10329)
  AUTHORS   Misra,S., Crosby,M.A., Mungall,C.J., Matthews,B.B., Campbell,K.S.,
            Hradecky,P., Huang,Y., Kaminker,J.S., Millburn,G.H., Prochnik,S.E.,
            Smith,C.D., Tupy,J.L., Whitfied,E.J., Bayraktaroglu,L.,
            Berman,B.P., Bettencourt,B.R., Celniker,S.E., de Grey,A.D.,
            Drysdale,R.A., Harris,N.L., Richter,J., Russo,S., Schroeder,A.J.,
            Shu,S.Q., Stapleton,M., Yamada,C., Ashburner,M., Gelbart,W.M.,
            Rubin,G.M. and Lewis,S.E.
  TITLE     Annotation of the Drosophila melanogaster euchromatic genome: a
            systematic review
  JOURNAL   Genome Biol 3 (12), RESEARCH0083 (2002)
   PUBMED   12537572
REFERENCE   10 (bases 1 to 10329)
  AUTHORS   Celniker,S.E., Wheeler,D.A., Kronmiller,B., Carlson,J.W.,
            Halpern,A., Patel,S., Adams,M., Champe,M., Dugan,S.P., Frise,E.,
            Hodgson,A., George,R.A., Hoskins,R.A., Laverty,T., Muzny,D.M.,
            Nelson,C.R., Pacleb,J.M., Park,S., Pfeiffer,B.D., Richards,S.,
            Sodergren,E.J., Svirskas,R., Tabor,P.E., Wan,K., Stapleton,M.,
            Sutton,G.G., Venter,C., Weinstock,G., Scherer,S.E., Myers,E.W.,
            Gibbs,R.A. and Rubin,G.M.
  TITLE     Finishing a whole-genome shotgun: release 3 of the Drosophila
            melanogaster euchromatic genome sequence
  JOURNAL   Genome Biol 3 (12), RESEARCH0079 (2002)
   PUBMED   12537568
REFERENCE   11 (bases 1 to 10329)
  AUTHORS   Adams,M.D., Celniker,S.E., Holt,R.A., Evans,C.A., Gocayne,J.D.,
            Amanatides,P.G., Scherer,S.E., Li,P.W., Hoskins,R.A., Galle,R.F.,
            George,R.A., Lewis,S.E., Richards,S., Ashburner,M., Henderson,S.N.,
            Sutton,G.G., Wortman,J.R., Yandell,M.D., Zhang,Q., Chen,L.X.,
            Brandon,R.C., Rogers,Y.H., Blazej,R.G., Champe,M., Pfeiffer,B.D.,
            Wan,K.H., Doyle,C., Baxter,E.G., Helt,G., Nelson,C.R., Gabor,G.L.,
            Abril,J.F., Agbayani,A., An,H.J., Andrews-Pfannkoch,C., Baldwin,D.,
            Ballew,R.M., Basu,A., Baxendale,J., Bayraktaroglu,L., Beasley,E.M.,
            Beeson,K.Y., Benos,P.V., Berman,B.P., Bhandari,D., Bolshakov,S.,
            Borkova,D., Botchan,M.R., Bouck,J., Brokstein,P., Brottier,P.,
            Burtis,K.C., Busam,D.A., Butler,H., Cadieu,E., Center,A.,
            Chandra,I., Cherry,J.M., Cawley,S., Dahlke,C., Davenport,L.B.,
            Davies,P., de Pablos,B., Delcher,A., Deng,Z., Mays,A.D., Dew,I.,
            Dietz,S.M., Dodson,K., Doup,L.E., Downes,M., Dugan-Rocha,S.,
            Dunkov,B.C., Dunn,P., Durbin,K.J., Evangelista,C.C., Ferraz,C.,
            Ferriera,S., Fleischmann,W., Fosler,C., Gabrielian,A.E., Garg,N.S.,
            Gelbart,W.M., Glasser,K., Glodek,A., Gong,F., Gorrell,J.H., Gu,Z.,
            Guan,P., Harris,M., Harris,N.L., Harvey,D., Heiman,T.J.,
            Hernandez,J.R., Houck,J., Hostin,D., Houston,K.A., Howland,T.J.,
            Wei,M.H., Ibegwam,C., Jalali,M., Kalush,F., Karpen,G.H., Ke,Z.,
            Kennison,J.A., Ketchum,K.A., Kimmel,B.E., Kodira,C.D., Kraft,C.,
            Kravitz,S., Kulp,D., Lai,Z., Lasko,P., Lei,Y., Levitsky,A.A.,
            Li,J., Li,Z., Liang,Y., Lin,X., Liu,X., Mattei,B., McIntosh,T.C.,
            McLeod,M.P., McPherson,D., Merkulov,G., Milshina,N.V., Mobarry,C.,
            Morris,J., Moshrefi,A., Mount,S.M., Moy,M., Murphy,B., Murphy,L.,
            Muzny,D.M., Nelson,D.L., Nelson,D.R., Nelson,K.A., Nixon,K.,
            Nusskern,D.R., Pacleb,J.M., Palazzolo,M., Pittman,G.S., Pan,S.,
            Pollard,J., Puri,V., Reese,M.G., Reinert,K., Remington,K.,
            Saunders,R.D., Scheeler,F., Shen,H., Shue,B.C., Siden-Kiamos,I.,
            Simpson,M., Skupski,M.P., Smith,T., Spier,E., Spradling,A.C.,
            Stapleton,M., Strong,R., Sun,E., Svirskas,R., Tector,C., Turner,R.,
            Venter,E., Wang,A.H., Wang,X., Wang,Z.Y., Wassarman,D.A.,
            Weinstock,G.M., Weissenbach,J., Williams,S.M., WoodageT,
            Worley,K.C., Wu,D., Yang,S., Yao,Q.A., Ye,J., Yeh,R.F.,
            Zaveri,J.S., Zhan,M., Zhang,G., Zhao,Q., Zheng,L., Zheng,X.H.,
            Zhong,F.N., Zhong,W., Zhou,X., Zhu,S., Zhu,X., Smith,H.O.,
            Gibbs,R.A., Myers,E.W., Rubin,G.M. and Venter,J.C.
  TITLE     The genome sequence of Drosophila melanogaster
  JOURNAL   Science 287 (5461), 2185-2195 (2000)
   PUBMED   10731132
REFERENCE   12 (bases 1 to 10329)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Pfeiffer,B., Frise,E., George,R.,
            Hoskins,R., Stapleton,M., Pacleb,J., Park,S., Svirskas,R.,
            Smith,E., Yu,C. and Rubin,G.
  CONSRTM   Berkeley Drosophila Genome Project
  TITLE     Drosophila melanogaster release 4 sequence
  JOURNAL   Unpublished
REFERENCE   13 (bases 1 to 10329)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-DEC-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   14 (bases 1 to 10329)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2023) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   15 (bases 1 to 10329)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (19-OCT-2022) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   16 (bases 1 to 10329)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (20-APR-2020) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   17 (bases 1 to 10329)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (22-APR-2019) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   18 (bases 1 to 10329)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (24-MAY-2018) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   19 (bases 1 to 10329)
  CONSRTM   FlyBase
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2016) FlyBase, Harvard University, Biological
            Laboratories, 16 Divinity Ave, Cambridge, MA 02138, USA
REFERENCE   20 (bases 1 to 10329)
  AUTHORS   Celniker,S., Carlson,J., Kennedy,C., Wan,K., Frise,E., Hoskins,R.,
            Park,S., Svirskas,R. and Karpen,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One #Cyclotron RoadOne
            Cyclotron Road, MS 64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   21 (bases 1 to 10329)
  AUTHORS   Celniker,S., Carlson,J., Wan,K., Frise,E., Hoskins,R., Park,S.,
            Svirskas,R. and Rubin,G.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-AUG-2006) Berkeley Drosophila Genome Project,
            Lawrence Berkeley National Laboratory, One Cyclotron Road, MS
            64-121, Berkeley, CA 94720, USA
  REMARK    Direct Submission
REFERENCE   22 (bases 1 to 10329)
  AUTHORS   Smith,C.D., Shu,S., Mungall,C.J. and Karpen,G.H.
  CONSRTM   Drosophila Heterochromatin Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2006) Drosophila Heterochromatin Genome Project,
            Ernest Orlando Lawrence Berkeley National Laboratory, 1 Cyclotron
            Road, Mailstop 64-121, Berkeley, CA 94720, USA
REFERENCE   23 (bases 1 to 10329)
  AUTHORS   Adams,M.D., Celniker,S.E., Gibbs,R.A., Rubin,G.M. and Venter,C.J.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-MAR-2000) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by FlyBase. This
            record is derived from an annotated genomic sequence (NC_004354).
            
            ##Genome-Annotation-Data-START##
            Annotation Provider :: FlyBase
            Annotation Status   :: Full annotation
            Annotation Version  :: Release 6.54
            URL                 :: http://flybase.org
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..10329
                     /organism="Drosophila melanogaster"
                     /mol_type="mRNA"
                     /db_xref="taxon:7227"
                     /chromosome="X"
                     /genotype="y[1]; Gr22b[1] Gr22d[1] cn[1] CG33964[R4.2]
                     bw[1] sp[1]; LysC[1] MstProx[1] GstD5[1] Rh6[1]"
     gene            1..10329
                     /gene="sov"
                     /locus_tag="Dmel_CG14438"
                     /gene_synonym="CG14438; Dmel\CG14438; EM25; fs(1)A1304;
                     fs(1)M105; l(1)6Dc; l(1)EA42; l(1)EM25; Sov"
                     /note="small ovary"
                     /map="6C12-6C13"
                     /db_xref="FLYBASE:FBgn0287725"
                     /db_xref="GeneID:31615"
     CDS             188..10129
                     /gene="sov"
                     /locus_tag="Dmel_CG14438"
                     /gene_synonym="CG14438; Dmel\CG14438; EM25; fs(1)A1304;
                     fs(1)M105; l(1)6Dc; l(1)EA42; l(1)EM25; Sov"
                     /note="CG14438 gene product from transcript CG14438-RC;
                     CG14438-PC; sov-PC; female sterile(1)M105; lethal (1) 6Dc;
                     female sterile (1) A1304"
                     /codon_start=1
                     /product="small ovary, isoform C"
                     /protein_id="NP_001284962.1"
                     /db_xref="FLYBASE:FBpp0308550"
                     /db_xref="GeneID:31615"
                     /db_xref="FLYBASE:FBgn0287725"
                     /translation="MEDSEDDVVVVSCDTSMKEKVKAKLVEIRKFVPFIRRVRIDFQD
                     TLSKVQGHRLDALVNLLDREDVSMSSLNKIEVIIDKLRTRFNPRIEIDTGEIIDITEN
                     TEYDSTASSSSGSSAQPPAQRAKSSDKGGLLNGSSLAGRLNASLNPASDATKDHLSTP
                     KQLELERNSKVSCSTSKIKTSSISAKISVFPATSMDLNIPKTTSAKASDEGQRSPAEP
                     RAALQAIVQDTKTPTIPEPTSPAALKHSSLRGSRGFLAVMQKALIEEKKQRASEQKTD
                     KETNGVTQLETNFSRRSYTTSSQSTCRSSEISVRAENPDFKRRSTSLVQHAPLQEASP
                     GQSKKDLPISLSVQGLPALVSASTASPANTLEEARKKLAALKYGLGTTVPSMPPLASN
                     INDPRGRKGINLPETNNNKDNDLGIALQSPPPMRTPSPIPPPPRMKAGTWASFSNVPQ
                     ESAFTGQHAVQRNSVPPGDSRAFGDALAHEPRSFYGTDSREPRDPRIWKSKTSQQQQH
                     QQAPQAQIPPYSSDPRRSISTYSGFEESANRGQTNSKNNNNHNQNQNVTANGSAYIAN
                     WSGNPYGNGPNPRPPFPSNQNGNEYAGGFNGSHSFRGGFRGGHNKRGFGRHNDVPRTY
                     GEHRKAKARAEAEAKAKAEAEAKAKAAAEAKAKAAAEVRQLETEVSREMEAQEKNKQQ
                     KEKPEESEAEKSTIAVTQVPELDTSYRNVNLGVLNKKLDFRIPKKTLPPATTITSTSP
                     VNGNGENPSCPSNSPTSKSCDANQDKDTYKNKDRYLNKAKAKDKVDKGNEVSENNLDK
                     SEKLEKSQDKKANDKENKSDKKEKKRLNREPEKKSKVENPLEIVDSNSVVSEESSENT
                     DNVENEPPLSETNASPVPELATSTQDSQQDQSVSEELDILAKNRRMSGTRIKTPISST
                     GNPALKRRADDDVEDKLENPTKKNCAKWEAKPDKEKSEDDTIDKIKSMKVTKFADVEM
                     KVTEESQSAEEEEITEQKESTEEEEGTEHKKSTEEKDKPPKISKIKIVLTPIAHTTQV
                     VRPNDGFKNNQEKILDNMATDEHDDEEVPGPPAQFLRRIMQRRNSLAPTYMKPMVDKD
                     KIASSSFTYEDLPEQKRGNQNARNLAIIFEKTSDNCSVSTQNIINGKRRTRGCETSFN
                     ETQLSRNIFGMGQINRSRPKATEGKATRAKISVPAKDSTRADDDPIPTPNLKRKRQAI
                     HKETEDDVEMKPKKARLEAQEINGVSVTPDEQQVENNVEVTQKEVEAISSEPLLSSEV
                     EPTRKPRTKPRKNELDKLNDDIAQMYYGEEVMRATSRRACTRRSRTSSHTRTSSQHSR
                     TSSVSRTDSISTVSDISSIIVRNTARRGRGIRSSENGINRATFNASLNAKKPKLCRVR
                     IKRCAALMEMIKDQEKEEQEKKEQKNKEPKKKKVGVQKKPLKSKPKRENSVILNTNPE
                     WHSISKAVIKCVVCSKWVRRSPLSHYMMCHKEHYAARLPPDVLKELRAGRGNRPDYWV
                     SQRGGYTLHFTCPFCQKPLLLCQKGMIEHLIGHMGESRFYCSNCNMPQNRLSRLLDHT
                     ASCGPGAKPLSSKTVCLPMSVHVCHICQFMQYSKENMDRHLTVQHGLTKEELESVERE
                     ELMLCDTTDVPYADSNKDGSAVGPEIQKNKPKSKASAALSNTTKKNKQQKKTNNSRFM
                     KMVKKSVSLLKREREEQDDQNMTQANEGSEVPEIPPPPPEIEPLFVVNECLMTSEMDT
                     DMEEVLEQPVQHMSLMVDEKPVTLLSGATEQLEPSVPDPEPVVPSAQDDGKDVNEDED
                     VDVEAVVDSLQSHTDQTATSMLAEVSLAELAGDVLDGIGSDASDYEMDDNSEQVDTTN
                     KNGYGDDDDDALTDDWVDLETAKRNSKSAKSIFRVFNRFCSRLNKLPRSSRAVPSNGS
                     ENSDGSDNNDDDGDNPDPSELMPTMQPLEPEPEMGDSSTSTGAKSLSERVENVGFQKP
                     SSDEDQNRVAASYYCVQPGCTFLFSNELEGLENHFALEHPLVRWSGKCGMCRQKITAT
                     ETNLRISEELRHMRDVHMKDISTLPPPQSSAVESPAVIESCLNQREPVPESEPDPVPE
                     LPKLRVRRFTGDRLVVDSQAEKSQPVAIVVSDDDNPRNGMLRDLLAADPRPPNQQLDL
                     QAAGLGEFLCAKPDSPSTEPVKQTPVIVGYSSGLGLKIGQVLSRTQISANSRLSPVVN
                     DPLPEKSSAPAAVEENRNRFRCMATNCNFVAHKLMFMREHMKFHSYSFSSTGHLNCAY
                     CSHVAVDVDDYLRHGVIIHDLAPRSELESSTGPPSVTQKIRDMLSQRENGRVPPPTPQ
                     VTLSDVVLGLLECTGYSEDKLYACPQKGCIVRLTDEQLVNHLRYHIRSTHQGSELVKC
                     KFCTKAMHPPALRTHLQQYHARHSIFCGICLATSVNQRIMMYHMSTVHSKAYGRPNAR
                     LAFVSLPVKIDASKKNVESEFYVAVVEQPFGNLQMQDFQRKLFDEMDRRRSGTKTYFR
                     SSEVHILPTQPTFQRPLYCTECPFSTTSRVNMQMHLYEHKDETIREASKLADLIVPAT
                     SSVLTVSASTLVAPPRPGKDSEKPSTSGQSGDAATEQLNPDVPGTHKPIKPPLRYVPP
                     DQRYRCGFLRCSVLCFSESAVRKHMQANHKYSEVVRCPHCKNCQGQFGVDKYFDHLAM
                     HKRHIFQCGACSRHNSRRVIERHIQERHNIQDVDMIVHRHNDSNKTTEARWLKAPKLA
                     RHSLMEYTCNLCLKYFPTTVQIMAHAASVHKRNYQYHCPYCEFGGNLATALIEHILRE
                     HPEREVQPVQIYQRIVCKNKQTLGFYCTTCHEVASSFQKIAMHCDKEHKSRNPVQCPH
                     CIFGHLAERQVVLHIQEKHPHERGLAMVQFERVLNDIPNSISWEIGRPIEVEPEKEIP
                     NNGESAFLPLSQRQVVTEVVDLLDSDDEADEYGEQDDAKIVEFACTHCDGTNTNLPDL
                     RSQHWAREHPDQPFYFRVQPMLLCSECKRFRGNAKALREHLRATHSIRSIVAADIRRP
                     MECAYCDYRYKNRHDLAKHISEIGHLPNDLKHVTDDEIDALMLLSASGSGGAVNEYYQ
                     CGLCSVVMPTKETIVQHGQVEHCKPDERFCFRQLVSPVIYHCSFCMFNSTDELTTLRH
                     MVDHYSRFLVCHFCTRSQPGGFDEYIQHCYTYHRDDIKSFRDVHTFSDLKRYLSQVHY
                     QFQNGLIITKSSLRYTRYKSDKCMLELDAELMAKAQRPPIPRLHIRLKSTGVQMQSPE
                     GADVEKPVSLLRITKRRKTLNPGELLRSFREENEVQPQPPASSTSSGTAPSPAAGSVF
                     NLFKRRNSLVVRPATSNLDQH"
ORIGIN      
        1 attcctttga cgacgccgtt ttgtgctgcc tcctttagaa acgttaatta agttccaaga
       61 aatgaatgga aatgtgtttt attttataca ttgcgtattc ttatataaga agtcgcagtt
      121 ttgagggata ttcatggcgc actagttgtg ttcgtccgag tcggatacga acagtgcctg
      181 ctgcaaaatg gaggatagcg aggacgacgt ggtggtggtg agctgcgata cctcgatgaa
      241 ggagaaggta aaggccaagc tggtggagat ccgtaagttt gtgcccttta tccggcgtgt
      301 gcgaatagac ttccaggata ctttgtccaa ggttcagggt catcgtctgg atgccctggt
      361 taacctgctg gatcgcgagg acgtatcgat gagctctctt aacaagatcg aggtgatcat
      421 tgataagcta aggacgcgct tcaatccgag gatcgaaatt gacactggcg aaatcattga
      481 tatcactgaa aacactgagt atgattccac tgcgtcgagt tcctccgggt caagcgcaca
      541 accaccagct caaagagcga agtcctcaga taagggtggc ctgttgaacg gctcatctct
      601 ggctggcagg ttaaatgctt cacttaaccc cgcatcagat gctaccaagg accatctatc
      661 gacaccaaag caactagaat tggagcgcaa ctctaaagtc tcttgctcaa cttcaaaaat
      721 aaagacttct tccatttctg ccaagatttc agtttttcca gccacttcaa tggatttgaa
      781 catcccaaaa actaccagcg ccaaggcatc ggatgagggg cagcggtcac ctgcagaacc
      841 acgtgccgcc cttcaagcta tagttcaaga tacgaaaaca ccaaccattc cagaaccaac
      901 atcaccagcg gcgcttaagc attcctccct tcgtggcagt cgtggatttc tggctgtcat
      961 gcagaaggcc ttaattgaag agaagaagca gcgagctagc gaacagaaaa ctgataaaga
     1021 aactaacggt gtaacgcagc tagagacaaa cttctcgcgg cgatcttata caacatcgtc
     1081 acagtcaacc tgccgttctt cagaaatatc ggtaagagca gaaaacccag attttaagcg
     1141 acgaagcaca tcgcttgtgc agcatgctcc tctacaggag gcctccccag ggcaatccaa
     1201 aaaagacttg cccatatcct tgtcggtaca gggtctacca gctttggtca gtgccagcac
     1261 tgcaagtcca gcaaatacgc ttgaggaggc ccgcaagaag ctggcggcct tgaaatatgg
     1321 actaggaaca acggtaccaa gcatgcctcc actggcctcc aatataaatg atccacgcgg
     1381 tagaaaagga ataaacctgc ctgaaactaa caacaataaa gacaacgact tgggtatagc
     1441 gctgcaatcc ccgccgccta tgcggactcc ctcgcctatt ccgccgccac caaggatgaa
     1501 ggccggtacg tgggcctcat tttcaaatgt tccccaggaa agcgcattta caggccagca
     1561 tgctgtgcag cgcaactcgg tacctccggg agattctcga gcctttgggg atgctttggc
     1621 acatgaacca aggtccttct atggcactga ttcccgagaa ccccgagacc ctcgtatctg
     1681 gaagagcaag acttcccaac agcagcaaca tcagcaggcg ccacaggcac aaattcctcc
     1741 gtattccagt gacccgcgtc gttctataag cacttacagc ggtttcgaag agtcagccaa
     1801 tagagggcaa actaatagta aaaataataa taatcataat cagaatcaga atgttaccgc
     1861 caatggaagt gcatacattg cgaactggag tggcaatcca tatggaaacg gtccaaaccc
     1921 aagaccgccc tttcctagca atcaaaacgg gaatgaatat gctggtggtt ttaatggttc
     1981 gcatagtttc aggggcggat ttcgcggcgg tcacaacaaa cggggctttg gacgacacaa
     2041 tgacgtgcca cgcacatatg gggaacaccg caaagccaag gcccgtgctg aggcggaggc
     2101 caaggctaag gctgaggcgg aggccaaggc taaggctgcg gcggaggcca aggctaaggc
     2161 tgcggcggag gtacgccaat tagaaacgga agtttcgcgg gagatggaag cccaagaaaa
     2221 aaataaacag caaaaggaaa agccggagga gagcgaggcg gagaagtcga cgatcgcagt
     2281 gactcaggtt ccggaattgg acacctccta ccgcaacgtt aatctggggg tgctaaacaa
     2341 gaagctagac tttcgaatac cgaagaaaac cctcccaccg gcaacaacaa taacctcaac
     2401 aagtccagtc aatggtaatg gggagaatcc aagctgcccc tcaaattccc ccacaagcaa
     2461 aagctgtgat gccaaccagg acaaagatac ttataagaat aaagataggt atttaaataa
     2521 ggctaaggct aaagacaagg tagataaggg caatgaggtg tcggagaaca atctggataa
     2581 gtctgagaag cttgaaaaat cgcaggataa gaaggcaaat gacaaggaga acaagtccga
     2641 caaaaaggag aagaagagac tgaacaggga gcctgaaaag aaatcaaagg ttgagaaccc
     2701 cctcgagatt gtggactcga atagcgtggt cagtgaggaa agctcggaaa atacagacaa
     2761 tgtggaaaat gaaccgcctc ttagcgagac taacgcgtct ccagttccag agctagctac
     2821 cagcactcag gacagtcaac aggaccagtc agtgagtgaa gagttggaca tccttgccaa
     2881 aaaccgtaga atgtccggaa ctagaataaa gactcccatt tcgtctactg gaaaccctgc
     2941 actaaagcga cgggcagatg atgatgttga ggacaagttg gaaaatccga ctaaaaagaa
     3001 ctgcgcaaag tgggaggcaa agcctgacaa ggaaaaatcg gaggatgata ccatcgacaa
     3061 gattaaatcc atgaaagtaa ctaaattcgc tgatgtcgag atgaaggtta cagaagaaag
     3121 ccagagtgct gaggaggagg agattactga gcagaaagag agtactgagg aggaggaagg
     3181 tactgagcat aaaaagagta ctgaagagaa ggataagccg ccaaaaatct ccaagataaa
     3241 aattgtcctt actcccattg cccatacaac acaagtggtt cgtcctaatg atggcttcaa
     3301 gaacaatcaa gaaaagatct tggacaacat ggcaactgat gagcacgatg atgaggaggt
     3361 ccccgggccc ccagctcaat tcctacgccg gattatgcag cgtcggaact ccttggctcc
     3421 tacgtatatg aagccgatgg tggacaagga taagattgct tcttccagtt ttacctacga
     3481 ggatctgccg gagcagaagc gcggaaatca gaacgcccga aacctggcca tcattttcga
     3541 aaaaactagt gacaactgca gcgtgtccac tcaaaacatt attaatggca aacgtcgcac
     3601 tcgtggatgt gagacctctt ttaacgagac ccaattgagc cgaaacatct ttggcatggg
     3661 ccagataaac aggtcgcggc caaaggccac cgagggtaaa gctaccaggg caaagatttc
     3721 tgtccctgca aaagattcaa cccgtgctga tgatgatcct attcccactc caaatcttaa
     3781 gcggaaaagg caagctattc acaaggaaac agaggatgat gtggagatga agcctaagaa
     3841 ggctcgattg gaagcacagg agataaatgg ggtcagtgtt acgcctgatg aacagcaggt
     3901 agagaacaac gtggaagtca cccaaaagga ggtggaagca atatcctcag agccacttct
     3961 ctcttctgag gttgaaccga cacgaaagcc tcgcacgaaa ccgcgaaaaa acgagctgga
     4021 caagctaaac gacgacattg cgcaaatgta ttacggggag gaagtgatgc gtgccaccag
     4081 tcgcagggct tgtacccgtc gatcgcgcac gtcctcgcac acgcgcacca gtagccagca
     4141 ttccaggacg tcctctgtat cgcgaaccga tagcatatcc accgtatcgg atattagttc
     4201 cataatcgtc aggaacacgg cgcgaagggg tagaggcatc agatcatccg aaaatggcat
     4261 caaccgtgcc acgtttaatg catccttgaa tgcaaaaaaa ccaaagttgt gccgtgttag
     4321 aataaagcga tgtgctgcat tgatggagat gataaaggac caggaaaagg aggaacagga
     4381 gaagaaggag cagaagaata aagaaccgaa gaagaagaaa gtgggtgtgc aaaaaaagcc
     4441 attgaaaagt aagccgaaaa gagagaatag cgttattctt aacacaaatc ccgaatggca
     4501 ctccatttcg aaggctgtta tcaagtgtgt cgtctgctcg aagtgggttc gcaggagccc
     4561 actctctcat tatatgatgt gccataagga gcactatgcc gcccgattgc cacccgatgt
     4621 gcttaaagag ctgcgggccg ggcgcggaaa tcgaccggat tactgggttt cgcaacgcgg
     4681 cggctacaca ttgcacttca cttgcccgtt ctgccagaag ccactgctac tctgccaaaa
     4741 aggcatgatc gagcacttga tcggccatat gggcgagtct cgtttttact gctccaactg
     4801 taatatgcca cagaaccgcc tcagtaggct gctggaccac accgcatcct gtgggccagg
     4861 tgcgaagcct ttaagtagca aaaccgtctg cctaccgatg agtgttcacg tgtgccacat
     4921 ctgccagttt atgcagtaca gcaaggaaaa tatggaccgg catcttactg ttcagcatgg
     4981 cctaacgaag gaggaactag aaagtgtgga gcgcgaggag ttgatgctct gcgacacaac
     5041 agacgtacca tatgcagatt cgaataagga tggcagcgcc gttgggcctg aaattcaaaa
     5101 aaacaaaccg aagtcgaaag caagtgcagc actttcaaat acaaccaaaa aaaataaaca
     5161 gcaaaagaag acaaacaaca gtcgcttcat gaaaatggtc aaaaaatccg taagtctatt
     5221 gaagcgagaa agagaagagc aggatgacca gaacatgacc caggctaacg aagggtcgga
     5281 agtgccggaa ataccgccgc ctccgccaga aattgagccc ttgtttgtgg tcaatgagtg
     5341 tctaatgacc tctgaaatgg acacggacat ggaagaagtc ttggaacagc ccgttcaaca
     5401 tatgagctta atggtagacg aaaagcctgt gacgctactc agtggggcca cagaacagct
     5461 ggagcctagt gtccctgatc ccgagcctgt tgttccatct gcacaagatg atggcaaaga
     5521 tgtaaatgaa gatgaagacg tagacgtgga ggcagtagtg gattcccttc agtcacacac
     5581 tgaccagacg gctacttcta tgttagcaga agtcagtcta gccgaattgg ctggggatgt
     5641 acttgatggt attggcagcg acgcgtccga ctatgagatg gatgataatt cagagcaagt
     5701 ggatacaact aacaaaaacg gctacggtga tgatgacgat gatgcgctta ccgacgattg
     5761 ggtggatctg gagactgcca agcgcaattc caagtccgcc aagagcattt ttagagtgtt
     5821 caatcgcttc tgctcgcgtt taaacaaatt accccgatcc agcagagcag tgccctcgaa
     5881 tgggagtgaa aacagcgatg gcagcgacaa caacgacgac gacggcgata atcctgatcc
     5941 cagcgagcta atgccaacaa tgcaaccatt ggagccggag ccagagatgg gggattcatc
     6001 cacatctaca ggtgctaagt cgctatccga acgggtggag aatgtgggct ttcaaaagcc
     6061 ctcttcagac gaggatcaaa atcgcgtggc agcatcctat tactgcgtgc agccgggttg
     6121 cactttcctc ttttccaatg agctggaagg cctcgagaat cattttgcgt tagagcaccc
     6181 tcttgttcga tggagcggca aatgtggcat gtgccgtcag aaaatcacgg caacggaaac
     6241 gaatctcaga atttctgaag agttgcgcca catgagggac gtgcacatga aggacatatc
     6301 caccctgcct cctcctcagt catctgcggt tgaaagccca gccgttattg aatcctgcct
     6361 gaatcagcgt gaaccagtac ctgaatcaga acctgatccc gttcctgagc ttcccaagct
     6421 gcgtgttcga cgcttcactg gggatcgcct tgttgtggat tcacaagcgg aaaagagcca
     6481 accggtagca atagttgtca gtgatgatga taatccgcga aatgggatgc taagggactt
     6541 gctggcggcg gatccacggc cacccaatca gcagttggac ctccaagccg ctggactggg
     6601 cgagttcctt tgcgccaagc ccgattcacc gtcaacagaa ccggtcaagc agacgcccgt
     6661 aattgttggc tattcgagtg gcttgggctt gaaaatcggc caggtcctta gcagaactca
     6721 gatttcagct aactcacggc tatcgccagt cgttaacgat cccctgccag agaagtcttc
     6781 tgctcctgct gccgttgaag agaatcgtaa tcgattcagg tgcatggcca ccaactgcaa
     6841 ttttgttgct cacaagctca tgttcatgcg ggagcacatg aagtttcaca gctacagttt
     6901 cagcagcacc ggtcacctga actgcgcgta ctgctcccat gtggcagtcg atgtggatga
     6961 ttacttgcgc cacggagtga tcattcacga cctggcacca cgctccgaac tggagagttc
     7021 aactggacca ccatctgtta cccagaaaat ccgggatatg ctcagccagc gggaaaatgg
     7081 tcgtgttcca ccaccaactc ctcaagtcac tctgtctgat gtggtcctgg gtcttttaga
     7141 atgcaccgga tacagcgagg ataaactgta cgcctgtccc caaaagggct gcatcgtgcg
     7201 gctgacagat gagcagcttg taaaccattt gcgctaccac attcgtagca ctcatcaggg
     7261 cagcgagttg gtgaaatgca agttttgcac caaggcgatg catccgccgg cacttcgtac
     7321 gcatctgcag cagtaccacg cccggcacag catcttctgc ggcatttgct tagccacatc
     7381 ggtcaaccag cgcataatga tgtatcacat gagcacggtg cactccaagg cctacggccg
     7441 gcctaacgcg cggctggcgt ttgtgtcact gcccgtgaag atcgacgcga gtaagaagaa
     7501 cgtagaaagc gagttctacg tggccgtcgt ggaacagccc tttggcaacc tccagatgca
     7561 ggatttccag cgcaagctgt tcgatgaaat ggaccgtcgg cgttcgggaa caaagacgta
     7621 cttccgcagc tccgaggtgc atatcctgcc aacgcagcca acattccagc gaccgctata
     7681 ctgtacggag tgccccttct ccaccacgtc aagggttaac atgcagatgc acctctatga
     7741 gcacaaggat gagaccattc gggaagcctc caaattggcg gacttgatag ttccagcaac
     7801 ctcttcggta ttaactgttt cggcgagtac gttggtggca ccgccgaggc caggcaaaga
     7861 ttcagaaaaa ccatctactt ccggacaaag tggtgatgca gcgacggagc agctgaatcc
     7921 agatgttcct ggaacccaca agcccatcaa gccaccgtta cgctatgtgc ccccggacca
     7981 acgctaccgc tgtggcttcc tccgatgtag cgtcctttgt ttttcggaat ctgcggtgcg
     8041 caaacacatg caggctaacc acaaatactc ggaggtggta aggtgcccgc actgcaagaa
     8101 ctgccagggt cagtttggag tagataagta ctttgaccat cttgcaatgc ataagcggca
     8161 catcttccaa tgcggcgctt gctcacgtca caatagcagg cgtgtcatcg agcggcacat
     8221 acaggaacgt cacaatattc aagatgtgga catgatcgta caccgccata atgacagcaa
     8281 caaaacgacc gaagcccgct ggctgaaggc gcctaaattg gcacgtcatt cgctaatgga
     8341 gtacacgtgt aacctgtgcc tcaagtactt tccaacgacc gtgcagatca tggcccatgc
     8401 ggcgtccgtt cacaaacgca actaccagta ccactgtccg tactgtgaat ttggtggaaa
     8461 cctcgccacc gcgctcattg aacacatcct tcgcgagcac ccggaaaggg aagtgcagcc
     8521 tgtgcaaatc taccagcgca tcgtgtgtaa gaacaagcag acgctaggct tctactgcac
     8581 cacctgtcac gaggtggcca gcagcttcca gaagatcgct atgcactgcg acaaggagca
     8641 taagtcgcgc aatccggtgc aatgtcccca ctgcattttc gggcatttgg ccgaacgcca
     8701 ggttgtctta cacatacaag agaagcatcc ccatgaacgc ggactggcaa tggtgcagtt
     8761 cgaacgcgtg cttaatgaca tcccgaacag cataagctgg gagataggtc ggcccatcga
     8821 agtggagcct gagaaggaga tcccgaacaa tggggagagt gcattcctgc cgctaagcca
     8881 gagacaggtt gtaacggaag tggtggacct gctggattca gacgacgagg cggacgagta
     8941 cggtgaacaa gatgacgcga aaatcgtgga gttcgcctgc acacactgcg acgggacaaa
     9001 caccaacttg ccggacctac gctcccagca ctgggcccgc gaacatcccg accagccctt
     9061 ctatttccgc gttcagccga tgctgctctg ctccgagtgc aagagattta ggggcaatgc
     9121 aaaggcactt cgcgagcacc tgcgtgcgac acactctatc cggagcatag tggctgcgga
     9181 cattcgtcga ccgatggagt gcgcttactg cgactaccgc tataaaaaca ggcacgatct
     9241 tgcgaaacac atcagtgaga taggtcacct gcccaatgac ctgaagcacg taacagatga
     9301 tgaaattgat gccctgatgc tgctcagtgc cagtggaagt ggtggggctg ttaacgaata
     9361 ctaccagtgc ggattgtgca gtgtggttat gccaacgaag gagacaattg tccagcacgg
     9421 ccaagtggaa cactgcaagc ccgacgagcg tttctgcttc cggcagctag tgtcgccagt
     9481 gatataccat tgttccttct gcatgttcaa ctcgaccgat gagctgacta cgctgcgcca
     9541 tatggtggac cactacagcc gcttcctggt ctgccatttc tgcacacgct ctcagccggg
     9601 tggtttcgat gagtacatcc agcactgcta tacctaccac cgggacgata tcaaatcctt
     9661 ccgggacgtg cacacgttta gcgatctgaa gaggtacctt agtcaggtgc attaccaatt
     9721 ccagaatggg ttgattatca caaaaagcag tctccgttat acacgttaca aatccgacaa
     9781 atgtatgctt gagctagacg ctgagctaat ggccaaggcc cagcggccac ccattccgcg
     9841 tctgcatatc agactcaagt cgaccggcgt tcagatgcag agccccgagg gggctgatgt
     9901 ggagaaacct gtgtcgttgt tgcggatcac aaagcgacga aaaacgctta atcctggcga
     9961 attgctccgc tcattccgcg aggagaatga ggtacagcca cagccaccgg cctcttcaac
    10021 atcgtcgggg acggctcctt ctcctgcggc aggttctgtg ttcaacctgt tcaagcgccg
    10081 caacagtctc gttgtccgcc cagcaaccag caacttggat caacactaac accacaacat
    10141 tattataatt tttttttttt tttatttctt ctggacccca tttatgcata tttcgtacta
    10201 gtttcaaact atttaagcat ttttttttta aacataatta tgagatcatg tttactacat
    10261 ttgtaagatc aataatgata agataggact ctaactgaca acggcttcaa ataattaaac
    10321 tattgtttt