PREDICTED: Drosophila obscura polyhomeotic-proximal chromatin


LOCUS       XM_041591921            5648 bp    mRNA    linear   INV 14-MAY-2021
            protein (LOC111070665), mRNA.
ACCESSION   XM_041591921
VERSION     XM_041591921.1
DBLINK      BioProject: PRJNA728747
KEYWORDS    RefSeq; corrected model.
SOURCE      Drosophila obscura
  ORGANISM  Drosophila obscura
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_024542752.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Drosophila obscura Annotation
                                           Release 101
            Annotation Version          :: 101
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 8.6
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
            
            ##RefSeq-Attributes-START##
            frameshifts :: corrected 1 indel
            ##RefSeq-Attributes-END##
PRIMARY     REFSEQ_SPAN         PRIMARY_IDENTIFIER PRIMARY_SPAN        COMP
            1-224               JAECWW010000165.1  356804-357027       c
            225-741             JAECWW010000165.1  353231-353747       c
            742-2047            JAECWW010000165.1  351924-353229       c
            2048-2573           JAECWW010000165.1  351326-351851       c
            2574-4558           JAECWW010000165.1  349263-351247       c
            4559-5648           JAECWW010000165.1  348105-349194       c
FEATURES             Location/Qualifiers
     source          1..5648
                     /organism="Drosophila obscura"
                     /mol_type="mRNA"
                     /isolate="BZ-5 IFL"
                     /db_xref="taxon:7282"
                     /chromosome="Unknown"
                     /sex="male"
                     /tissue_type="whole fly"
                     /dev_stage="Adult fly"
                     /geo_loc_name="Serbia: Babin Zub"
                     /collection_date="2017"
     gene            1..5648
                     /gene="LOC111070665"
                     /note="The sequence of the model RefSeq transcript was
                     modified relative to its source genomic sequence to
                     represent the inferred CDS: deleted 1 base in 1 codon;
                     Derived by automated computational analysis using gene
                     prediction method: Gnomon. Supporting evidence includes
                     similarity to: 5 Proteins, and 99% coverage of the
                     annotated genomic feature by RNAseq alignments, including
                     17 samples with support for all annotated introns"
                     /db_xref="GeneID:111070665"
     CDS             190..5481
                     /gene="LOC111070665"
                     /note="The sequence of the model RefSeq protein was
                     modified relative to its source genomic sequence to
                     represent the inferred CDS: deleted 1 base in 1 codon"
                     /codon_start=1
                     /product="LOW QUALITY PROTEIN: polyhomeotic-proximal
                     chromatin protein"
                     /protein_id="XP_041447855.1"
                     /db_xref="GeneID:111070665"
                     /translation="MDRRALKFMQKRADTESDTTTTATTVTAPGSSLATAPAGGGTLP
                     LKDNSNIREKPLHNSNNNNNNNHNNNNNNNSSQQQPSKQHERPLKCLETLAQKAGITF
                     DEKYDVASPPHPGIAQQPSSTPSAATAQAQAHRNGGANTGAGTPPTARRHAHTPVTPN
                     TPSTPNTTSSTTPQHARHNSNPNSASHTMEKSQSPAQQVASATTVPLQISPEQLQQFY
                     ASNPYAIQVKQEFPTHTAGTTTTELKHATGLLDASQASQLQQMQLQQLTAAAADAAGG
                     NGSAGGGGGAQGGGAPSPANQQGQQQQQQQHSTAISTMSPMQLAAATGGVTGDWSQGR
                     TVQLMQPSTGFIYPPMMISGNLLHSAGLGQQPIQVITAGKPFQGNGPQMITTTTQNAK
                     QMIGGQGGFAGGTYAIPSSQSPQTLLFSPVNVISHSPQQQQSLLQSMVAQQQQQQQQL
                     NAQQQQQLNAQQQQQLTAQQAVAMAKAGVGVGVGVGADAQGKMQAQKVVQKVTTTTNT
                     VQAASAGAGGAQSQQQQQQQTTTQQCVQVSQSTLPGVGVGVGVGVGGQLLNPLGGAGA
                     GQAQQMQLGPWFWQNGLQPFGSNSIILRGQPDGTQGMFIQQQPTTQTLQTQQNQIIQC
                     NVTQTPTKPRTQLDALASKQQQQQQQQQQQAAANSQAQQQQQQQQQQQQQLAVATAQL
                     QQQQQQLTALQRPGAPIMPHNGTQVRPASSVSTQTAQNQNLLKAKMRNKQQPVRPALP
                     ALKTENGQVVAVGAVQSKAVGQHMAAVQQQQQQHQQQQQQQQANLHQVVTTAGNKMVV
                     MSTGTPITLQNGQTLHAATAGGVDKQQQQQQQLQLMQKQQFLQQQMFQQQIAAIQIQQ
                     QAAAQQQQQQVAQQQQQQQQQHQQQQQQQQAVAQAQQDQRQQVAQAQAQAQAQVQAQQ
                     HQQQQALAQQILQVAPNTFITSHQQQQQQLHNQLLQQQLQQQAQAQVQAQVQAQAQAQ
                     AQQQQQQQQQQQKQREQQQQQNIIQQIVVQQAAGAGQQQQQQQQQQTQAGQLQLSSVP
                     FSVSSTTTPAGIATSSALQAALSASGAIFQTAKSTSSSSSLPTSSVVTISNHTTGPLV
                     TSSTMAASIHQAQVQQQQHQQQQQQQQQQQQQQQQHQLISASIAAATQQQQQQQQQHQ
                     QQQQGPPALAAASPSPATNPIMAMTSMMNATAGPVTSSGVMSSPATLVAFSAASGGSH
                     PATPTKETPLKMPTPTATLVPIGSPLNSSATSQDHQPSSVNTTPRSAANASASASATA
                     EASSSTSDSSRVNGEAPEASHSSSSTTTTPTKATTSTPTTRQSNVVLPTSSCSTTSST
                     TSSSTTTHSGKDDDKDGAATATSFTSSSAPSTPTTTTVSNGIGIATLARAGSTTVTTT
                     TTTNSSSTATTTPTTTTTTTTSISNGSSNAGGKDLPKAMIKPNVLTHVIDGFIIQEAN
                     EPFPVTRQRYADKDTSDEPPKKKAAMQEEAKPCGIATATPATATTKTPSPTAGSGSAT
                     DMVACEQCGKLEHKAKLKRKRFCSPGCARQAKTGVAGIAAAAGGGVGVGVGESNGMGM
                     EMEIGGIVGVDAMALVDKLDEAMAEEKMQMQTDALQALQPEPMSLVPLSSNTEVPLVS
                     LPVLPVMAGTPVPVPPLVAVALAVPASVALPATPSPGATPPAAAVAPQPPVPAAAPSS
                     SAAGERSPICNWSVDEVADFIRNLPGCQDYVDDFVQQEIDGQALLLLKENHLVNAMGM
                     KLGPALKIVAKVESMKEVVPAPGSGEAKEATAAGGAQ"
     misc_feature    5212..5418
                     /gene="LOC111070665"
                     /note="SAM domain of Ph (polyhomeotic) proteins of
                     Polycomb group; Region: SAM_Ph1,2,3; cd09577"
                     /db_xref="CDD:188976"
     misc_feature    order(5266..5271,5371..5376,5383..5388,5395..5397)
                     /gene="LOC111070665"
                     /note="oligomer interface EH [polypeptide binding]; other
                     site"
                     /db_xref="CDD:188976"
     misc_feature    order(5299..5313,5317..5322,5329..5334,5344..5346,
                     5359..5361)
                     /gene="LOC111070665"
                     /note="oligomer interface ML [polypeptide binding]; other
                     site"
                     /db_xref="CDD:188976"
ORIGIN      
        1 tgttccaaat caaattaaaa tttgaagaaa ctaaacgaaa aggccacggc aatatgctat
       61 tattaacaaa agcaagcaga accgccgccg tcgctgccgc aaacatcaga aaactgctgc
      121 ctctgccgtc gccgccgccg tcgcatgcat aactttaatt ttttaaatat attgttttta
      181 ttttgaataa tggatcgtcg tgcattgaag tttatgcaga aaagagcgga cacagaaagc
      241 gatacaacaa caacagcaac aacagtaaca gcaccaggat cctctttggc aacagctcca
      301 gcgggaggcg gcacactacc tctgaaggat aactcgaata ttcgcgaaaa gccactgcac
      361 aacagtaaca acaacaacaa caacaaccac aacaataaca acaacaacaa cagctcccag
      421 cagcagccca gcaaacagca cgagcggccc ctgaagtgcc tggagacgct cgcacagaag
      481 gcgggaatca cttttgacga gaaatacgat gtggccagtc ccccgcatcc gggtattgcc
      541 cagcagccgt catcgacacc atccgcagcc acagcccaag cccaagccca tcgcaacgga
      601 ggagccaaca caggcgccgg aacaccaccc actgcacgtc gccacgcaca cacaccggtc
      661 acgccgaaca ccccgagcac tcccaacact accagtagca ccaccccaca gcacgcccga
      721 cacaacagca accccaacag cgccagccac acgatggaga agtcacaaag ccccgcacaa
      781 caggtggcgt ccgccacgac ggtgcccctg cagatctcac cggaacagct gcagcagttc
      841 tacgcgagca acccgtacgc catccaggtg aagcaggagt tccccacgca cacggccggc
      901 accactacca cggaactgaa gcatgcgacg ggtctcctgg acgccagcca ggcgagccag
      961 ttgcagcaga tgcagctcca gcagctgacg gcggcggcag cggatgcagc cgggggaaac
     1021 ggttctgcag gcggtggagg aggagcccag ggcggaggcg cacccagtcc ggcgaaccag
     1081 cagggacagc aacagcagca gcagcagcac tcgacggcca ttagtacgat gtcgccgatg
     1141 cagctggcgg cagccaccgg cggagtgacc ggcgactggt cacagggtcg gaccgtgcag
     1201 ctgatgcagc cttcgacggg gttcatctac ccacccatga tgatctccgg caacctgctg
     1261 cactccgcgg gcctcggcca gcagcccata caagtgatca ccgccggcaa gccgttccag
     1321 ggtaacggcc cacagatgat caccaccacc acgcagaacg ccaagcagat gatcgggggg
     1381 caaggcgggt tcgccggagg tacttacgcc atcccctcca gccagtcacc gcagacgctg
     1441 ctcttctcac ccgtcaacgt catctcccac tcgccgcagc agcagcaaag cctcctccag
     1501 tcgatggtcg cccagcagca acagcagcag caacaactca acgcccagca gcagcaacaa
     1561 ctcaacgccc agcagcagca gcagctgacg gctcagcagg cggtggccat ggccaaggca
     1621 ggagtgggag tgggtgtggg tgtgggagcc gacgcccagg gcaagatgca ggcgcagaag
     1681 gtggtccaga aggtgaccac caccaccaac acggtgcagg ctgcgtcggc aggcgctggg
     1741 ggggcacagt cgcagcagca acagcagcag caaaccacca cccagcagtg cgtccaggtc
     1801 tcgcagtcga cactgcccgg cgtgggagtg ggtgtgggtg tgggcgtggg cgggcagctg
     1861 ctgaatccgc tgggcggtgc cggcgcgggc caggcgcagc agatgcagct cggtccctgg
     1921 ttctggcaga acggcctgca gcccttcggc tcgaactcca tcatcctgcg gggccagccg
     1981 gacggcactc agggcatgtt catccagcag cagcccacca cgcagaccct ccagacgcag
     2041 cagaaccaaa tcatccagtg caatgtaacc cagacaccta ccaagcctcg cacccagctg
     2101 gatgccctgg cttccaagca acagcagcag cagcaacaac aacagcagca ggcggcggcc
     2161 aacagtcaag cgcagcagca gcaacaacaa caacaacagc agcagcagca gctggcggtg
     2221 gccacggccc aactgcaaca acagcagcag cagctgacgg ccctgcagcg tcctggcgca
     2281 ccgattatgc cccacaatgg aacgcaggtg cgcccggcca gctccgtgtc cacgcagacg
     2341 gcgcagaacc agaacctgct gaaggccaag atgcggaaca agcagcagcc tgtccgtccg
     2401 gcgttgccgg ccctcaagac ggagaatggt caggtggtgg cggttggtgc ggtgcagagc
     2461 aaggcagtgg gccagcacat ggctgccgtt cagcagcagc aacagcagca ccagcagcaa
     2521 caacagcaac aacaggcgaa ccttcaccag gtggtcacca cagcgggaaa caagatggtc
     2581 gtgatgagca cgggcacgcc cataaccctg cagaatggcc agaccctgca tgcagccact
     2641 gcgggcggag tggacaagca gcagcaacag cagcagcagc tgcagctcat gcagaagcag
     2701 cagtttctgc agcagcaaat gttccaacag cagatagccg ccatccagat ccagcagcag
     2761 gcagcagcac aacagcagca gcaacaagtc gcccagcagc aacagcagca gcaacaacaa
     2821 catcagcaac aacagcagca gcagcaggcg gtggcccaag cgcagcagga tcagcggcaa
     2881 caggtggcac aggctcaggc tcaggcccaa gctcaggttc aggcgcagca acatcagcag
     2941 caacaggccc tggctcagca aatactgcag gtagcgccca acaccttcat cacctcccac
     3001 caacagcagc agcagcagct ccacaaccaa ctgcttcagc agcagctcca gcagcaggca
     3061 caggctcaag tgcaagctca ggttcaggct caggctcagg cacaggcaca gcagcagcaa
     3121 caacaacagc aacaacaaca aaaacaacgg gagcagcagc agcagcagaa catcatccaa
     3181 caaattgtgg tgcagcaggc ggctggggca ggccaacagc aacaacaaca gcaacagcag
     3241 cagacgcagg cgggacaatt gcagctgagc agcgtcccct tctcggtatc ctcgaccacg
     3301 acgcccgcag gaatagccac ctcgagtgcc ctccaggccg ccctctcggc ctctggcgcc
     3361 atcttccaga cggccaagtc gaccagcagc agctcctctc tgcccaccag cagcgtagtg
     3421 acaataagta accacacaac gggtcccctg gtcaccagca gcacgatggc agccagcatc
     3481 caccaagccc aggtccagca gcagcaacac caacagcagc agcagcaaca acagcagcag
     3541 caacagcaac agcagcagca tcagttaatc tccgccagca ttgcagcggc cacacagcag
     3601 cagcagcaac agcaacagca gcatcaacaa caacagcagg gaccacccgc tctggcggct
     3661 gcatcgccct cacccgccac gaaccccatc atggccatga catccatgat gaacgccacg
     3721 gctggacctg tcaccagcag cggagtgatg tcctctcctg caacgctggt cgcgttcagc
     3781 gctgccagtg gaggtagtca tccggcgaca cccaccaagg agacgccgct gaagatgccc
     3841 acccccaccg ccaccctggt gcccattggg tcccctctaa acagcagcgc cactagccag
     3901 gatcaccagc catcgtccgt caacaccacc cccagatccg ctgcaaacgc cagtgccagt
     3961 gccagtgcca ccgcggaggc aagtagctcc acgagtgact cctccagggt gaatggagag
     4021 gccccggagg cgtctcatag cagcagcagc accaccacca cgcccacgaa ggccaccacc
     4081 agcacgccca ccacaaggca gagcaatgtg gtgctgccca cgagtagctg cagcaccacc
     4141 agcagcacca ctagctcctc cacaaccacg cacagcggaa aggatgatga caaggacgga
     4201 gcggctacag ccaccagctt caccagcagt agcgcacctt caacgccgac cacgacgaca
     4261 gtcagcaacg ggattgggat agcaaccctg gccagggcag ggagcaccac tgtgaccacc
     4321 accacgacga ccaacagcag cagcactgcg acgactacac ccacaactac aactacaacg
     4381 acaacgagca tcagcaatgg gagcagcaac gcgggaggga aggatctgcc gaaggccatg
     4441 atcaagccca atgtgctgac ccatgtcatt gacggattca tcatccagga ggccaacgag
     4501 cccttccctg tgacgaggca gcgctatgct gacaaggaca ccagcgacga gccgccaaag
     4561 aaaaaggctg ccatgcagga ggaggcgaag ccatgcggaa tagccaccgc aacgccagcc
     4621 acagccacaa ccaaaacccc atccccaaca gcaggcagtg gcagtgcgac ggacatggtg
     4681 gcctgcgagc agtgcggaaa gctggagcac aaggcgaagc tcaagcggaa gcgcttctgc
     4741 tccccaggct gcgccaggca ggcgaagact ggtgtcgcag gcattgcggc agcagctgga
     4801 ggcggagtgg gagtgggagt aggagagagc aatggaatgg gaatggaaat ggaaattgga
     4861 ggaattgtgg gagtggatgc catggcgctg gtggacaaac tggacgaggc catggccgag
     4921 gagaagatgc agatgcagac ggacgcactg caggcgctgc agcccgaacc gatgtccctt
     4981 gtgccattgt caagcaacac ggaggtgcca ctggtgtccc ttcctgtcct gccagtcatg
     5041 gcaggcaccc ccgttccagt ccctccccta gttgcagtcg cactcgcagt tcccgcttcc
     5101 gtggcgctgc ctgcgactcc gtctccgggt gccacaccac cagctgcagc ggtggcgccc
     5161 cagccaccag taccagcagc agcaccctcc tcgagcgcag cgggcgagcg ttcgcccatc
     5221 tgcaactgga gcgtggacga ggtggctgac ttcatacgga acctgccagg ctgccaggac
     5281 tatgtggacg actttgtcca gcaggagatc gacggccagg cgctgctgct gctcaaggag
     5341 aatcacctgg tgaatgccat ggggatgaag ctgggccccg ccctcaagat tgtggccaag
     5401 gtggagtcca tgaaggaggt ggtcccggcg ccgggctctg gcgaggccaa ggaggcaacg
     5461 gccgcgggag gagctcaata ataccagcct gatgctccag ccgatgccat tgccgaggca
     5521 gatgacgagg acattcccat gccctcctac tcgacatctc cgccaccatt ctcgcttctc
     5581 cgtctccggc ttacgtacgg atcgaggcaa cagagggaat tgccagaggg aactgggctg
     5641 gtggagca