LOCUS       XM_022367787           10554 bp    mRNA    linear   INV 14-MAY-2021
DEFINITION  PREDICTED: Drosophila obscura serine/threonine-protein kinase Smg1
            (LOC111074829), mRNA.
ACCESSION   XM_022367787
VERSION     XM_022367787.2
DBLINK      BioProject: PRJNA728747
KEYWORDS    RefSeq; corrected model.
SOURCE      Drosophila obscura
  ORGANISM  Drosophila obscura
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_024542752.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            On May 14, 2021 this sequence version replaced XM_022367787.1.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Drosophila obscura Annotation
                                           Release 101
            Annotation Version          :: 101
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 8.6
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
            
            ##RefSeq-Attributes-START##
            frameshifts :: corrected 2 indels
            ##RefSeq-Attributes-END##
PRIMARY     REFSEQ_SPAN         PRIMARY_IDENTIFIER PRIMARY_SPAN        COMP
            1-865               JAECWW010000165.1  1289069-1289933
            866-1274            JAECWW010000165.1  1289998-1290406
            1275-3594           JAECWW010000165.1  1290475-1292794
            3595-4026           JAECWW010000165.1  1292868-1293299
            4027-4028           "NN"               1-2
            4029-4791           JAECWW010000165.1  1293300-1294062
            4792-7164           JAECWW010000165.1  1294064-1296436
            7165-7463           JAECWW010000165.1  1301765-1302063
            7464-8865           JAECWW010000165.1  1302143-1303544
            8866-9852           JAECWW010000165.1  1303623-1304609
            9853-10213          JAECWW010000165.1  1304692-1305052
            10214-10554         JAECWW010000165.1  1305124-1305464
FEATURES             Location/Qualifiers
     source          1..10554
                     /organism="Drosophila obscura"
                     /mol_type="mRNA"
                     /isolate="BZ-5 IFL"
                     /db_xref="taxon:7282"
                     /chromosome="Unknown"
                     /sex="male"
                     /tissue_type="whole fly"
                     /dev_stage="Adult fly"
                     /geo_loc_name="Serbia: Babin Zub"
                     /collection_date="2017"
     gene            1..10554
                     /gene="LOC111074829"
                     /note="The sequence of the model RefSeq transcript was
                     modified relative to its source genomic sequence to
                     represent the inferred CDS: inserted 2 bases in 1 codon;
                     deleted 1 base in 1 codon; Derived by automated
                     computational analysis using gene prediction method:
                     Gnomon. Supporting evidence includes similarity to: 4
                     Proteins, and 99% coverage of the annotated genomic
                     feature by RNAseq alignments, including 17 samples with
                     support for all annotated introns"
                     /db_xref="GeneID:111074829"
     CDS             430..10413
                     /gene="LOC111074829"
                     /note="The sequence of the model RefSeq protein was
                     modified relative to its source genomic sequence to
                     represent the inferred CDS: inserted 2 bases in 1 codon;
                     deleted 1 base in 1 codon"
                     /codon_start=1
                     /product="LOW QUALITY PROTEIN: serine/threonine-protein
                     kinase Smg1"
                     /protein_id="XP_022223479.2"
                     /db_xref="GeneID:111074829"
                     /translation="MSAVVPSVYSMRRALSVDSDYEAEEEASVSASTSASASASASVP
                     LRVQLDRVLRNSNGNASSNQIRYGGSDSYSCSSESMWNHEAAVAQAFASAASAATAAG
                     SSNASATGMSPMMSRNANMPLDMTLAATKCMQQNSHRRAYPCKGERDQIANKQAGNSN
                     NNNHHGGGNSDDLRLTKIIRRLNTETNSAVALELCRKLAVAVRTPLNMAYMARSFDLV
                     LEGMLMLYKHCAPPVLEECSKTLGLMGYLNRLAYPIYEEFIVKHYRSCKRMQHYLIVA
                     LRTTLSLDTSCELHLYSEKMMLMLKDFLESAESADSFMDVSSAIVQFSHSYRHPFECH
                     FTDVVDIIIGWQLEQGQPKRLKSHCAQVLEQLTPYFSKKIEFSYGLLAQFVEDITVLE
                     DEHDHEHEQDGQRKGICWSERLGSFLGAFNTLLKCSVRMQIFKGHPACESIVKVAVDH
                     ITRILPSVLLLHPDANADADALININELLCICLLHNCGGSLQPEELEKVLLGQFETMH
                     RLSEPQRQSALYLLLCAVRRLRVRLPASLVQLVFQSGTDGHPYVAEVRRQTGGAAYKM
                     LLRTCQEMLILKNVPLLQQAYGHLVRDINACIQVLREPPPTHAQWDSNEAEIVLIFHL
                     AALAALAKQTSAIIGMYACQPSILKLLISNCCAQDLCLWARHPTTHQALLGLLVVHCQ
                     ANHNFRTNSILFRDHALSKENTSPTAHSFECILRFLDNILTQAGRLAAENLKLLLQWT
                     QLLLAECRERAPLLLEQDNFLGICHSVGGTAAKWRPLESAACMQAVLDFGPEALHSRQ
                     DLLVLYRETALQQLQVLTTSGHAPYAQIYAQLPLHMTLFESGLQGRRVCVWQQKMSQC
                     SAVRDNVFQQFVDHLQRPEQEQLTQCLRELFVRSYQVAPPDERQEKLSQCTRRCQRLA
                     AAWLQFEAARYCVDQRLRTTLGKPQETFLGFEAVIMRHARLLSGCAKEAERSSLSGLS
                     LEQLSAIQVNLSMLLGFLDALEKLIYNAAEGSAFALRPPEKPVAAFFRLNNPTCQSWF
                     NRIRIGVVIIAMHIQQPELVIRYAQQILITQKTQDATYSQALVYLAWAWISCQEADSL
                     RGLYVWARAKCSSGTKSYQWLQHAAEQAAGRKEAALSGYQNVLAGEEGQELEPHTHQF
                     VLAQMMQCLQDTGQWSQLVDIKQHKLLRPEDKELNPFLQCRNVDVTALERLLAXASEN
                     CSNSNHSEELDLALQQLSLWPSNWESGASHSHASFSNFHVRRCLEDIMLQKTLMEPQS
                     QPHSQACHLMDLHWRDSLLNPSSDQRPCRELILLKHIVQGLASGQELTLLPLVAERRR
                     PSNVSSDILIRCLAWTQLLRQHCAPGSWETLCLDAAAAARDEGNLQTCQSLLTQYFGQ
                     PLPEISLSADSLDTENAEVLRSYSELAKCLYLQQTQVPNSSSSLDLGPAINVCASLCL
                     SLHRRSAHSHTQQTDIGAGMLLTLSEWISARSSCNGLPMALAPNLQSPAVDQLLEELP
                     ECPLTHGSNSSQLVAIPQGERLVARLIHASVDQRPNCEEALIAYGNWSYRWGKKIVDS
                     GSVLSQSDVAALSRALDRQTPLESEHLNELLQALSMEQPPAGCVEVCPEAAKMRDDEA
                     SKSCLRRLNLLEDQPDSVLEEILRIWRQAIANTYDFYKDAARSYFQYLSIKAGKGTGS
                     GSGYGLAACQKGDRYQVDDSNLVTTTLRLLRLIVKHASGLQDVLEQGLRTTPIAPWKV
                     IIPQLFSRLYHHEPYVRKSVCALLCRLAESRPQLVIFPAVVGANREKQLEKDKDKEKD
                     EPKDKKNCCYGYLLGALSQQAPEAVKHVQLMVEELRRVCLLWDEYWIHSLAHIYNTYV
                     GRVNAFATEFKPDDHEGKNNRFNSWRPQLLLDLEALVAFTARPPETSYERSFRKRFDQ
                     HIKSTLEVLRTRRYPEAWEKLKQLYHTLQANMIRGTAMTLKMQTVSPVLCGIGRMRIS
                     MPGLDEQEGYDQNDQESDQVHIESVEGTVCILPTKTKPKKVAFYGSNGQRYTFLFKGL
                     EDLHLDERIMQFLSISNAIMACRSDQPRPGGATTNYRAHHYSVIPLGPQSGLISWVDG
                     VTPLFALYKKWQQRQPVLSPGGNAGAPSHRFTDLFYNKLTPLLAKSGLSVNDPRRQWP
                     VAALRQVLGELSEETPSDLLARELWCQAGNAAEWRQSVRCYSACMSVMSMIGYVIGLG
                     DRHLDNVLINLGSGEIVHIDYNVCFEKGRTLRIPEKVPFRLTQNLINALGITGIEGAF
                     RLGCEYVLKVMRKERETLLTLLEAFVYDPLVDWTLNDDAQTQRRSINPKQQQTESGGA
                     KCHKNKDKAQRHRAPDWEAKRPIFTEKLAQLHKFWSTNEDEILLQLQEVGQELGHLEA
                     AQVQRLAAERELEELNERSALVAEIKSLGPAMKSHSFSTAPLRLLQTRKHLAALMELT
                     VDRRADFAFVQCLLASYSQCLLPYHLSQLYAQLQSLHLEAGGVVGRWREEHLALADVL
                     QIYVPAVLLAGIESTRTQVDSLLGGINEGARHCADLMQQYAAVMAFYPNQCHRQNLLV
                     RFHDSYAAYLQLEGAVDPAATSASICTQSVKNLAETLEVVWLHLGCQLHDAAQRLSVK
                     QVQILSLVSPTPTATILQTLQQTDCSLLLLNSRLMRTLDMGSSSFRAYEAQALAEQDK
                     ELLQHQLHYIHVVRSMCHGVMGARSTGSGSGSGDYDQHEQDLHLGQLETLLTALMQLK
                     RLFEYDLPLSIFRQLLLQPNLDQLQELSKLGPKRLQQLYSDTLALAEQRPKVATEQQR
                     FMLSLQPIHQQFQLVVTSLEDMRRCLKQWVEDINASHAQQYLELILFKEHHVELNDNC
                     FYRVIHQTLAASSSCDVREMSRPMGTLVHCIEKKCLAGLLAQVAYDYYRGYGSTCQPP
                     ENTFTPSDVEKAEHLCGGLFMALQTDVELLQQEQEVALIRQQIELHSLVASAQFWGYS
                     EALGSQLRGGPHIVCRHKLASDIGQSWGDLVRSVDGLEQLLLGLDAQLGQLEALRSNW
                     NRNHIDSLLRNEREQRRQVSDQVALVRQLMEGASAVCHLEQIIVDSVGSGMEIQTLHN
                     HLEQWLAAFGNWKASCASVSAVEQAVVELLDPEDGIDDYWMENVQGLIEEQTCKVQRE
                     FAILSGEQQAKLRFICTLLKETQRLLEHVPRFFVGNFCSNAKAMGHGGGKGVPADVQQ
                     LCDHLRDSQLLLQSFLEHLTALRKDICAERHVLQPELVEGWQHQLQQIQAMAGQEASE
                     YFKRIKELLLQVADSDSLTYDTFTHTKGAGNLHEQKRNAYGVSVWKKIRMKLEGRDPD
                     GSQRSSIAEQVDHVISDATNPDNLAVLYEGWTPWV"
     misc_feature    2347..4047
                     /gene="LOC111074829"
                     /note="Serine/threonine-protein kinase smg-1; Region:
                     SMG1; pfam15785"
                     /db_xref="CDD:464869"
     misc_feature    6367..7275
                     /gene="LOC111074829"
                     /note="Catalytic domain of Suppressor of Morphogenetic
                     effect on Genitalia-1; Region: PIKKc_SMG1; cd05170"
                     /db_xref="CDD:270714"
     misc_feature    order(6397..6399,6403..6411,6415..6417,6463..6465,
                     6469..6471,6478..6480,6598..6600,6634..6645,6658..6660,
                     7015..7017,7021..7023,7054..7059)
                     /gene="LOC111074829"
                     /note="ATP binding site [chemical binding]; other site"
                     /db_xref="CDD:270714"
     misc_feature    6991..7017
                     /gene="LOC111074829"
                     /note="catalytic loop [active]"
                     /db_xref="CDD:270714"
     misc_feature    7057..7125
                     /gene="LOC111074829"
                     /note="activation loop (A-loop); other site"
                     /db_xref="CDD:270714"
     misc_feature    10321..10407
                     /gene="LOC111074829"
                     /note="FATC domain; Region: FATC; pfam02260"
                     /db_xref="CDD:460514"
ORIGIN      
        1 aacccagtag gcaaacaatt tgaacaaatt actttccagc aatgcgacag ttcaaggaat
       61 gttttgacag gctttcagtc tgtgctgcga tctgctacgt tgcaagtaag gcaagcagtg
      121 gtttacataa gccccaagga tattattcta agaaagtaaa gcccctcgtg cttcaaggag
      181 gccaaggact tgcatttgcg gatagtacca aaaacgatag tggtccattg cgtgccatct
      241 cgatcaaggg cattatcgaa ctatcgcatc gatcgcgata acgtgccacc cacctataaa
      301 gcagagcaga gcagagcatt ccattcaaga aacacagaac gaaactcgaa cagacccacg
      361 aaaatccaaa gaaaaaaagt aagtgtgtgt tgaggaaagt gcgtgaatga agaccgtgag
      421 ggatacttga tgtctgcggt ggtgccgtca gtttattcaa tgcgaagagc actatccgtg
      481 gacagcgact atgaagccga agaagaggcg tccgtgtccg catcgacatc cgcatccgca
      541 tcagcatcag cgtccgtccc cttgcgtgtg cagctagacc gcgttttgcg gaacagcaat
      601 ggcaatgcca gcagcaacca aattcgatat ggcggcagcg atagctacag ctgttccagc
      661 gaatcgatgt ggaatcacga ggcggctgtg gcgcaggcct ttgcctccgc cgcatctgcg
      721 gcaacagctg ctggctctag caatgccagt gccactggaa tgtcgcccat gatgtcgcgg
      781 aatgccaata tgccgcttga catgaccctg gctgccacaa agtgtatgca acagaacagc
      841 catcgacggg cgtatccctg caaaggcgag cgcgatcaaa ttgccaacaa gcaggcgggc
      901 aacagcaaca acaataacca tcatggtggc ggaaatagcg atgacctgcg tctcactaag
      961 atcatacgtc gcctgaacac ggagacgaat agcgcggtag cgctcgagct gtgcaggaag
     1021 ctggctgtgg ccgtacgcac gcccctgaac atggcctaca tggcgcgctc cttcgatctt
     1081 gtgctcgagg gcatgctgat gctgtacaag cactgcgccc cgcccgtcct cgaggagtgc
     1141 agcaagaccc tcggcctgat gggctacctc aatcgcctgg cctatcccat ctacgaggag
     1201 ttcattgtga agcactatcg ctcctgcaag cgcatgcagc actacctgat agtcgcccta
     1261 agaaccacac taagccttga cacgagctgc gagctgcatt tgtactcgga gaagatgatg
     1321 ctgatgctga aggacttttt ggagagcgcg gagagtgccg acagcttcat ggatgtcagc
     1381 agcgccattg tgcagttctc gcacagctac cgccatccct tcgagtgcca cttcaccgac
     1441 gtggtggaca tcatcatcgg ctggcagctg gagcagggac agccgaagcg tctcaagtcc
     1501 cactgcgccc aggtgctcga acagctgacg ccctacttca gcaaaaagat cgagttcagc
     1561 tacggcctgc tggcccagtt tgtcgaggac attacggtgc tcgaggacga gcacgaccac
     1621 gagcacgaac aggatgggca gcgaaagggc atctgctggt cggagcggct gggctccttt
     1681 ctgggcgcct tcaatacgct gcttaaatgt tccgtccgca tgcagatatt caagggccat
     1741 cccgcctgcg agtcgattgt gaaggtggcc gtcgatcaca tcaccagaat cctgccaagt
     1801 gtcctcctcc tacatccgga tgcgaatgcg gatgcggacg ccctgatcaa catcaacgag
     1861 ctgctgtgca tctgtctgct gcacaactgc ggcggcagcc tgcagcccga ggaactggag
     1921 aaggtgctgc tcggccagtt cgagacgatg caccgcctca gcgagcccca gcgccagagt
     1981 gcgctctatc tgctgctttg tgcggtgcgt cggctgcgcg tccgcctgcc cgccagcctg
     2041 gtgcagctgg tcttccagtc gggcaccgac ggtcatccct atgtggcaga ggtgcgccgc
     2101 cagacgggcg gggcagcata caagatgctg ctgcgcacct gccaggagat gctgatcctc
     2161 aagaatgtgc cgctgctgca gcaggcgtac gggcatctgg tgagggacat caatgcctgc
     2221 atccaggtgc tgcgagagcc gccccccacc cacgcacagt gggactccaa cgaggccgaa
     2281 atagtgctga tcttccattt ggccgcactg gcggccctag ccaagcagac gtccgccatc
     2341 attgggatgt acgcctgcca gccgtcgata ctgaagctgc tgatcagcaa ttgctgcgcc
     2401 caggatctgt gcctctgggc caggcacccg accacgcatc aagcgctcct tggcctgctc
     2461 gtcgtccact gccaggcgaa ccacaacttc cgcaccaact ccattctctt ccgcgaccac
     2521 gccctctcca aggagaacac ctcgccgacg gcgcacagct tcgagtgcat cctccgcttc
     2581 ctggacaaca tcctcacgca agccggccgt ctggcggccg aaaatctcaa gctgctgctg
     2641 caatggacgc agctgctgct ggccgagtgc cgggagcggg cgccgctgct gctggagcag
     2701 gataacttcc tggggatctg ccacagcgtt ggaggcaccg ccgccaagtg gcggcccctc
     2761 gagagtgccg cctgcatgca ggcagtgctc gactttggcc ccgaggccct acactcccga
     2821 caggacctgc tcgtcctgta ccgcgagacg gcgctgcagc agctgcaagt actcacgacc
     2881 agcggccatg ccccctacgc ccagatctac gcccagctgc cgctgcacat gacgctgttc
     2941 gagtcggggc tgcagggccg tcgggtgtgc gtgtggcagc agaagatgag ccagtgcagc
     3001 gcggtgcggg acaacgtgtt ccagcagttc gtcgatcatc tgcagcggcc ggaacaggag
     3061 cagctgacgc agtgcctgcg cgagctgttc gtccgcagct accaggtggc gccgcccgat
     3121 gagcggcagg agaagctgtc acagtgcacg cgtcgctgcc agcgtctggc cgccgcctgg
     3181 ctgcagttcg aggcggcccg ctactgtgtc gatcagcggc tgcggacgac gctcggcaag
     3241 ccgcaggaga cgttcctggg cttcgaggcg gtgatcatgc ggcacgcccg ccttctcagc
     3301 ggctgtgcca aggaggcgga gcgctcctcc ctcagcggcc tgtcgctgga gcagctctcg
     3361 gccatccagg tgaacctgtc gatgctgctg ggcttcctcg atgccctcga gaagctcatc
     3421 tacaatgcgg ccgagggcag cgccttcgcc ctacgaccgc ccgagaagcc ggtggccgcc
     3481 ttctttcgcc tgaacaaccc cacctgccag tcgtggttca atcgcatccg catcggcgtc
     3541 gtcatcatag ccatgcacat ccagcagcca gagctggtca tccgctacgc ccaacaaatt
     3601 ctcatcacac agaagacgca ggatgccaca tacagccagg cgttggtgta cctggcctgg
     3661 gcctggatca gctgccagga ggcggactcg ctgcggggcc tatacgtgtg ggcgcgagcc
     3721 aagtgcagca gcggtaccaa gtcctaccag tggctgcagc atgcggcaga gcaggcggcc
     3781 gggcgcaaag aggctgcact gagcggctac cagaacgtgt tggccggcga ggaggggcag
     3841 gagctggagc cgcacacgca tcagtttgtg ctcgcccaga tgatgcagtg tctgcaggac
     3901 acgggccagt ggtcgcagct ggtggacatc aagcagcaca agctgctgcg ccccgaggac
     3961 aaggaactga atcccttcct gcagtgccgc aacgtggatg tgacagcgct ggagcggctg
     4021 ctcgccnnag caagcgaaaa ctgctccaac tcgaaccaca gcgaggagct cgacctggcc
     4081 ttgcagcagc tcagcctgtg gccgagcaat tgggagagcg gtgccagcca cagccacgcc
     4141 agcttctcca atttccatgt gcgccgctgc ctggaggaca tcatgctgca gaagacgctg
     4201 atggagccgc agtcgcagcc gcatagccag gcctgccacc tgatggacct acactggcgc
     4261 gacagcctcc tgaatcccag cagcgaccag cggccatgcc gcgagctcat cctcctcaag
     4321 cacatcgttc aggggctggc cagtggccag gagctcaccc tcctgccgct ggtcgcggag
     4381 cggcggcgtc cctcgaacgt ctccagcgac atactgatcc gctgcctggc ctggacacag
     4441 cttctgcggc agcactgcgc cccaggcagc tgggagacgc tctgcctcga tgcagcggcc
     4501 gccgcccgcg acgagggcaa cctacagaca tgccagtcgc tgctcaccca gtactttggc
     4561 cagccgctgc cggagatcag cctcagtgcg gacagcctgg acacggagaa cgcggaggtg
     4621 ctgcgcagct acagcgaact ggccaagtgc ctgtacctcc agcagacgca ggtgccgaac
     4681 agcagtagca gcttggatct ggggccagcg atcaatgtgt gcgcctccct gtgcctgagc
     4741 ctgcaccgga ggagcgctca cagccacacc cagcaaacgg acataggcgc gggcatgctg
     4801 ctgactctct ccgagtggat aagcgcgcga agcagctgca atgggctgcc catggccctg
     4861 gcacccaatc tccagtcgcc cgccgtcgac cagctgctgg aggagctgcc agaatgcccg
     4921 ctgacgcatg gttccaactc cagccaactg gtggccattc cccagggtga gcggctggtg
     4981 gctcgcctca tccatgccag cgtcgaccag cggcccaact gcgaggaggc gctaatcgcg
     5041 tacggcaact ggtcctatcg ctggggcaag aagatcgtcg acagcggcag cgtgctctcc
     5101 cagtcggacg tggccgccct cagtcgtgcc ctggaccgcc agacgccgct ggagagcgaa
     5161 cacctcaacg agctgctgca ggcgctcagc atggaacagc cgcccgccgg atgcgtggag
     5221 gtatgccccg aggcggcgaa gatgcgcgac gacgaggcct ccaagagctg cttgcgacgg
     5281 ctgaacctgc tggaggacca gccggacagt gtgctggagg agatactccg gatctggcga
     5341 caggccattg ccaatacgta cgatttctac aaggatgcgg ctcgctccta cttccagtat
     5401 ctcagcatca aggcgggcaa gggaacagga tccggatccg gatacggctt ggctgcctgc
     5461 cagaaggggg acagatacca ggtggatgac agcaatctgg tgaccacaac gctgcgtcta
     5521 ctgcggctga tcgtgaagca tgccagcggg ctgcaggacg tcctcgagca gggactgcgc
     5581 accacgccca ttgccccctg gaaggtgatc attccgcaac tgttcagccg cctctaccac
     5641 cacgagccct acgtccggaa gagcgtgtgc gctctgctgt gtcgcctcgc ggagagtcgc
     5701 ccccagctgg tgatctttcc cgctgtcgtg ggtgccaatc gggagaagca gctggaaaaa
     5761 gacaaggaca aggagaagga tgagccaaag gacaagaaga actgctgcta tggctacctg
     5821 ttgggcgccc tgtcccagca ggcgcccgag gccgtgaagc atgtgcagct gatggtcgag
     5881 gagctgcgcc gggtgtgcct gctgtgggac gagtactgga tccactcgct ggcgcacatc
     5941 tacaacacat atgtgggccg tgtgaacgcc tttgcgacgg aattcaagcc ggacgatcac
     6001 gagggcaaga acaatcgctt caacagctgg cggccgcagc tgctgctcga cttggaggcc
     6061 cttgtggcgt tcaccgcccg tccgccggag accagctacg agcggagctt ccgcaagcga
     6121 ttcgatcagc atatcaagtc gacgctggag gtgctgcgca cgcgccgcta tccggaggcc
     6181 tgggagaagc tcaagcagct ctatcacacg ctgcaggcga acatgatccg gggcacggcc
     6241 atgaccctca agatgcagac ggtcagtccc gtgctgtgcg gcatcggccg catgcgcatc
     6301 tccatgcccg gcctggacga gcaggagggg tacgaccaga acgaccagga atccgatcag
     6361 gtgcacatcg agagtgtcga gggcactgtg tgcatcctgc ccacaaagac aaagcccaag
     6421 aaggtggcct tctatggcag caacgggcag cggtacacgt tcctcttcaa ggggctggag
     6481 gatctgcatc tggacgagcg catcatgcag tttctgtcca tctcgaacgc cattatggcc
     6541 tgccgcagcg accagccccg accgggcggt gcgaccacca actatcgcgc ccaccactac
     6601 tcggtgatac cgctgggacc ccagtcgggc ctcatcagct gggtggatgg cgtcacgccg
     6661 ctcttcgccc tctacaagaa gtggcagcag cgccagccgg ttctttcgcc gggtggcaat
     6721 gccggcgccc cctcgcatcg cttcacggac ctcttctaca acaagctgac gccgctgctg
     6781 gccaagagcg ggctgagcgt aaacgatccg cgccgccagt ggcccgtcgc cgccctgcgg
     6841 caagtgctcg gtgagctgtc agaggagacg cccagcgatc tgctcgcccg cgaactgtgg
     6901 tgccaggcgg gcaatgccgc cgagtggcgc cagtccgtgc gctgctactc cgcctgcatg
     6961 tccgtcatgt cgatgatcgg ctatgtgatc ggcttgggcg atcggcatct ggacaatgtg
     7021 ctcatcaacc tcggtagcgg cgaaatcgtg cacatcgact acaacgtctg cttcgagaag
     7081 ggacgcacgc tgcgcatacc cgagaaggtg cccttccgcc tcacccagaa cctgatcaat
     7141 gccctcggca tcactggcat cgagggcgcc tttcgtctgg gctgcgagta cgtgctgaag
     7201 gtgatgcgca aggagcgcga gaccctgctg accctactcg aggccttcgt ctacgatccg
     7261 ctggtcgact ggacgctcaa tgacgatgcc cagacacagc gtcgctccat caatcccaag
     7321 cagcagcaga cggagagcgg tggcgccaag tgccacaaga acaaggacaa ggcccagagg
     7381 catcgcgccc ccgactggga ggccaagcgt ccgattttca ccgagaagct ggctcaattg
     7441 cacaagttct ggtccaccaa tgaagacgaa atcctactgc agctgcagga agtgggccag
     7501 gagttaggcc acttggaggc ggcgcaagtg cagcgcctgg ctgctgaaag ggaactggag
     7561 gagctgaacg agcgcagcgc cttggtggcg gagatcaagt cgttggggcc ggccatgaag
     7621 agccacagct tcagtacggc accgctgcgt ctgctgcaga cgcgcaaaca cttggccgcc
     7681 ctcatggagc tgacggtgga ccggcgggca gactttgcgt tcgtgcagtg cctcctcgcc
     7741 agctacagcc agtgcctgct gccgtaccac ctctcgcagc tgtacgccca gctgcagtcg
     7801 ctgcacctcg aggcgggggg cgtcgtcggc aggtggcgcg aggagcacct ggccctggcg
     7861 gatgtgctgc agatctatgt gcccgcagtt ctactcgccg gcatcgagag cacacgcacg
     7921 caggtggact cgctgctggg gggcatcaac gagggggcgc gccactgcgc ggatctgatg
     7981 caacagtatg cggccgtgat ggccttctat ccgaaccagt gccaccggca gaacctcttg
     8041 gtgcgcttcc acgacagcta tgcggcttac ctacagctgg agggggccgt cgatccggcg
     8101 gccaccagcg cctccatctg cacgcagagc gtgaagaatc tggccgagac gctggaggtg
     8161 gtgtggctgc atctcggctg ccagctgcac gacgccgccc agcggctgtc cgtcaagcag
     8221 gtgcagatac tcagcctcgt gtcgcccacg ccaacggcta ccatactcca gacgctgcag
     8281 caaaccgact gcagcctgct gctgctcaac tcccgcctga tgcgcaccct ggacatgggc
     8341 agctcctcgt tccgggcgta cgaggcgcag gccctcgccg agcaggacaa agagctgctg
     8401 cagcatcagc tgcactacat ccacgtggtg cggtccatgt gccatggtgt aatgggtgcc
     8461 agaagcaccg gcagcggcag tggcagcggt gactacgacc agcacgagca ggacctgcat
     8521 ctcggccagc tggagaccct gctgacggcc ctcatgcagc ttaagcgact gttcgagtac
     8581 gacctgccat tgagcatctt ccggcagctg ctgctccagc ccaatctcga ccagctgcag
     8641 gagctgtcca agctggggcc caagcggctg cagcagctct acagcgacac tctggcgctg
     8701 gccgagcagc ggcccaaggt ggccaccgag cagcagcgat tcatgctctc cctgcagccg
     8761 atccaccagc agttccagct ggtcgtcacg tcgctcgagg acatgcggcg gtgcctcaag
     8821 cagtgggtcg aagacatcaa tgcctcccat gcccagcagt atctggagct gatcctcttt
     8881 aaggaacacc acgtggagct gaacgacaat tgcttctacc gggtgatcca tcagacattg
     8941 gcagccagca gcagctgcga tgtgcgcgag atgtcgcgac ccatgggcac tctggtgcac
     9001 tgcatcgaga agaaatgcct ggccggactg ctggcccagg tggcctatga ctactacaga
     9061 ggctacggct ccacctgcca gccgcccgag aacaccttca cgccgtcgga tgtggagaag
     9121 gccgagcatt tgtgcggcgg cctctttatg gcgctgcaaa cggacgtgga gctgctgcag
     9181 caggagcagg aggtggccct gattagacag caaatcgaac tccactcgct ggtggcctcc
     9241 gcccagttct ggggctactc ggaggccctc ggatcgcagc tgcgcggcgg cccgcatatc
     9301 gtgtgccgcc acaagctggc gtcggacatt ggccagagct ggggcgacct ggtgcggagt
     9361 gtggacgggc tggagcagct gctgctcggt ctggacgcac agctggggca gctggaggcg
     9421 ctgcgcagca actggaaccg caaccacatc gacagcctgc tgcgcaacga gcgggagcag
     9481 cgccgccagg tgtccgacca agtggcgctc gtccgtcagc taatggaggg tgccagtgcc
     9541 gtctgccacc tggagcagat catagtcgat tcggtgggtt ccggaatgga gatccaaacg
     9601 ctgcacaacc atctggagca gtggctggcg gcgttcggca actggaaggc gagctgtgcc
     9661 agcgtcagtg cggtggagca ggccgtcgtc gagctgctgg atcccgagga tggcatcgac
     9721 gactactgga tggagaatgt ccagggcctg atcgaggagc agacgtgcaa ggtgcagcgc
     9781 gagtttgcca tactctcggg ggagcagcag gccaagctgc ggttcatctg caccctgctc
     9841 aaggagacgc agcgtctgct ggagcatgtg ccgcggttct ttgtgggcaa cttctgttcc
     9901 aatgccaagg ccatggggca cggtgggggc aagggtgtgc ccgcggatgt gcagcagctg
     9961 tgcgaccatc tgcgcgacag ccagctgctg ttgcagagct tcctcgagca cctgaccgcc
    10021 ctgcgcaagg acatttgcgc ggaacggcac gtcctgcagc cggagctcgt cgagggctgg
    10081 cagcatcagc tgcagcagat ccaggccatg gccggccagg aggccagcga atactttaag
    10141 cggatcaagg agctgctgct gcaggtcgcc gactcggact ccctcaccta cgacaccttc
    10201 acgcacacca agggtgccgg taatctgcac gaacagaagc gcaatgcgta tggcgtgtcc
    10261 gtctggaaga agatacgcat gaagctggag ggccgcgatc cggacggcag ccagcgcagc
    10321 agcattgccg agcaggtgga ccatgtgata agcgacgcca ccaatccgga caatctagcc
    10381 gtcctctacg agggttggac gccatgggtc tagggaccac caggataaac ggccgcattt
    10441 tggccttact gacatataac aaaacgcact ccccaaaacg gggagcggac gactacattc
    10501 aacattttgg gcggtaactc ttttggcctt ctggcctggc ctggtctggc cgcg
//