PREDICTED: Drosophila obscura polyhomeotic-proximal chromatin
LOCUS XM_041591921 5648 bp mRNA linear INV 14-MAY-2021
protein (LOC111070665), mRNA.
ACCESSION XM_041591921
VERSION XM_041591921.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq; corrected model.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
##RefSeq-Attributes-START##
frameshifts :: corrected 1 indel
##RefSeq-Attributes-END##
PRIMARY REFSEQ_SPAN PRIMARY_IDENTIFIER PRIMARY_SPAN COMP
1-224 JAECWW010000165.1 356804-357027 c
225-741 JAECWW010000165.1 353231-353747 c
742-2047 JAECWW010000165.1 351924-353229 c
2048-2573 JAECWW010000165.1 351326-351851 c
2574-4558 JAECWW010000165.1 349263-351247 c
4559-5648 JAECWW010000165.1 348105-349194 c
FEATURES Location/Qualifiers
source 1..5648
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..5648
/gene="LOC111070665"
/note="The sequence of the model RefSeq transcript was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon;
Derived by automated computational analysis using gene
prediction method: Gnomon. Supporting evidence includes
similarity to: 5 Proteins, and 99% coverage of the
annotated genomic feature by RNAseq alignments, including
17 samples with support for all annotated introns"
/db_xref="GeneID:111070665"
CDS 190..5481
/gene="LOC111070665"
/note="The sequence of the model RefSeq protein was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon"
/codon_start=1
/product="LOW QUALITY PROTEIN: polyhomeotic-proximal
chromatin protein"
/protein_id="XP_041447855.1"
/db_xref="GeneID:111070665"
/translation="MDRRALKFMQKRADTESDTTTTATTVTAPGSSLATAPAGGGTLP
LKDNSNIREKPLHNSNNNNNNNHNNNNNNNSSQQQPSKQHERPLKCLETLAQKAGITF
DEKYDVASPPHPGIAQQPSSTPSAATAQAQAHRNGGANTGAGTPPTARRHAHTPVTPN
TPSTPNTTSSTTPQHARHNSNPNSASHTMEKSQSPAQQVASATTVPLQISPEQLQQFY
ASNPYAIQVKQEFPTHTAGTTTTELKHATGLLDASQASQLQQMQLQQLTAAAADAAGG
NGSAGGGGGAQGGGAPSPANQQGQQQQQQQHSTAISTMSPMQLAAATGGVTGDWSQGR
TVQLMQPSTGFIYPPMMISGNLLHSAGLGQQPIQVITAGKPFQGNGPQMITTTTQNAK
QMIGGQGGFAGGTYAIPSSQSPQTLLFSPVNVISHSPQQQQSLLQSMVAQQQQQQQQL
NAQQQQQLNAQQQQQLTAQQAVAMAKAGVGVGVGVGADAQGKMQAQKVVQKVTTTTNT
VQAASAGAGGAQSQQQQQQQTTTQQCVQVSQSTLPGVGVGVGVGVGGQLLNPLGGAGA
GQAQQMQLGPWFWQNGLQPFGSNSIILRGQPDGTQGMFIQQQPTTQTLQTQQNQIIQC
NVTQTPTKPRTQLDALASKQQQQQQQQQQQAAANSQAQQQQQQQQQQQQQLAVATAQL
QQQQQQLTALQRPGAPIMPHNGTQVRPASSVSTQTAQNQNLLKAKMRNKQQPVRPALP
ALKTENGQVVAVGAVQSKAVGQHMAAVQQQQQQHQQQQQQQQANLHQVVTTAGNKMVV
MSTGTPITLQNGQTLHAATAGGVDKQQQQQQQLQLMQKQQFLQQQMFQQQIAAIQIQQ
QAAAQQQQQQVAQQQQQQQQQHQQQQQQQQAVAQAQQDQRQQVAQAQAQAQAQVQAQQ
HQQQQALAQQILQVAPNTFITSHQQQQQQLHNQLLQQQLQQQAQAQVQAQVQAQAQAQ
AQQQQQQQQQQQKQREQQQQQNIIQQIVVQQAAGAGQQQQQQQQQQTQAGQLQLSSVP
FSVSSTTTPAGIATSSALQAALSASGAIFQTAKSTSSSSSLPTSSVVTISNHTTGPLV
TSSTMAASIHQAQVQQQQHQQQQQQQQQQQQQQQQHQLISASIAAATQQQQQQQQQHQ
QQQQGPPALAAASPSPATNPIMAMTSMMNATAGPVTSSGVMSSPATLVAFSAASGGSH
PATPTKETPLKMPTPTATLVPIGSPLNSSATSQDHQPSSVNTTPRSAANASASASATA
EASSSTSDSSRVNGEAPEASHSSSSTTTTPTKATTSTPTTRQSNVVLPTSSCSTTSST
TSSSTTTHSGKDDDKDGAATATSFTSSSAPSTPTTTTVSNGIGIATLARAGSTTVTTT
TTTNSSSTATTTPTTTTTTTTSISNGSSNAGGKDLPKAMIKPNVLTHVIDGFIIQEAN
EPFPVTRQRYADKDTSDEPPKKKAAMQEEAKPCGIATATPATATTKTPSPTAGSGSAT
DMVACEQCGKLEHKAKLKRKRFCSPGCARQAKTGVAGIAAAAGGGVGVGVGESNGMGM
EMEIGGIVGVDAMALVDKLDEAMAEEKMQMQTDALQALQPEPMSLVPLSSNTEVPLVS
LPVLPVMAGTPVPVPPLVAVALAVPASVALPATPSPGATPPAAAVAPQPPVPAAAPSS
SAAGERSPICNWSVDEVADFIRNLPGCQDYVDDFVQQEIDGQALLLLKENHLVNAMGM
KLGPALKIVAKVESMKEVVPAPGSGEAKEATAAGGAQ"
misc_feature 5212..5418
/gene="LOC111070665"
/note="SAM domain of Ph (polyhomeotic) proteins of
Polycomb group; Region: SAM_Ph1,2,3; cd09577"
/db_xref="CDD:188976"
misc_feature order(5266..5271,5371..5376,5383..5388,5395..5397)
/gene="LOC111070665"
/note="oligomer interface EH [polypeptide binding]; other
site"
/db_xref="CDD:188976"
misc_feature order(5299..5313,5317..5322,5329..5334,5344..5346,
5359..5361)
/gene="LOC111070665"
/note="oligomer interface ML [polypeptide binding]; other
site"
/db_xref="CDD:188976"
ORIGIN
1 tgttccaaat caaattaaaa tttgaagaaa ctaaacgaaa aggccacggc aatatgctat
61 tattaacaaa agcaagcaga accgccgccg tcgctgccgc aaacatcaga aaactgctgc
121 ctctgccgtc gccgccgccg tcgcatgcat aactttaatt ttttaaatat attgttttta
181 ttttgaataa tggatcgtcg tgcattgaag tttatgcaga aaagagcgga cacagaaagc
241 gatacaacaa caacagcaac aacagtaaca gcaccaggat cctctttggc aacagctcca
301 gcgggaggcg gcacactacc tctgaaggat aactcgaata ttcgcgaaaa gccactgcac
361 aacagtaaca acaacaacaa caacaaccac aacaataaca acaacaacaa cagctcccag
421 cagcagccca gcaaacagca cgagcggccc ctgaagtgcc tggagacgct cgcacagaag
481 gcgggaatca cttttgacga gaaatacgat gtggccagtc ccccgcatcc gggtattgcc
541 cagcagccgt catcgacacc atccgcagcc acagcccaag cccaagccca tcgcaacgga
601 ggagccaaca caggcgccgg aacaccaccc actgcacgtc gccacgcaca cacaccggtc
661 acgccgaaca ccccgagcac tcccaacact accagtagca ccaccccaca gcacgcccga
721 cacaacagca accccaacag cgccagccac acgatggaga agtcacaaag ccccgcacaa
781 caggtggcgt ccgccacgac ggtgcccctg cagatctcac cggaacagct gcagcagttc
841 tacgcgagca acccgtacgc catccaggtg aagcaggagt tccccacgca cacggccggc
901 accactacca cggaactgaa gcatgcgacg ggtctcctgg acgccagcca ggcgagccag
961 ttgcagcaga tgcagctcca gcagctgacg gcggcggcag cggatgcagc cgggggaaac
1021 ggttctgcag gcggtggagg aggagcccag ggcggaggcg cacccagtcc ggcgaaccag
1081 cagggacagc aacagcagca gcagcagcac tcgacggcca ttagtacgat gtcgccgatg
1141 cagctggcgg cagccaccgg cggagtgacc ggcgactggt cacagggtcg gaccgtgcag
1201 ctgatgcagc cttcgacggg gttcatctac ccacccatga tgatctccgg caacctgctg
1261 cactccgcgg gcctcggcca gcagcccata caagtgatca ccgccggcaa gccgttccag
1321 ggtaacggcc cacagatgat caccaccacc acgcagaacg ccaagcagat gatcgggggg
1381 caaggcgggt tcgccggagg tacttacgcc atcccctcca gccagtcacc gcagacgctg
1441 ctcttctcac ccgtcaacgt catctcccac tcgccgcagc agcagcaaag cctcctccag
1501 tcgatggtcg cccagcagca acagcagcag caacaactca acgcccagca gcagcaacaa
1561 ctcaacgccc agcagcagca gcagctgacg gctcagcagg cggtggccat ggccaaggca
1621 ggagtgggag tgggtgtggg tgtgggagcc gacgcccagg gcaagatgca ggcgcagaag
1681 gtggtccaga aggtgaccac caccaccaac acggtgcagg ctgcgtcggc aggcgctggg
1741 ggggcacagt cgcagcagca acagcagcag caaaccacca cccagcagtg cgtccaggtc
1801 tcgcagtcga cactgcccgg cgtgggagtg ggtgtgggtg tgggcgtggg cgggcagctg
1861 ctgaatccgc tgggcggtgc cggcgcgggc caggcgcagc agatgcagct cggtccctgg
1921 ttctggcaga acggcctgca gcccttcggc tcgaactcca tcatcctgcg gggccagccg
1981 gacggcactc agggcatgtt catccagcag cagcccacca cgcagaccct ccagacgcag
2041 cagaaccaaa tcatccagtg caatgtaacc cagacaccta ccaagcctcg cacccagctg
2101 gatgccctgg cttccaagca acagcagcag cagcaacaac aacagcagca ggcggcggcc
2161 aacagtcaag cgcagcagca gcaacaacaa caacaacagc agcagcagca gctggcggtg
2221 gccacggccc aactgcaaca acagcagcag cagctgacgg ccctgcagcg tcctggcgca
2281 ccgattatgc cccacaatgg aacgcaggtg cgcccggcca gctccgtgtc cacgcagacg
2341 gcgcagaacc agaacctgct gaaggccaag atgcggaaca agcagcagcc tgtccgtccg
2401 gcgttgccgg ccctcaagac ggagaatggt caggtggtgg cggttggtgc ggtgcagagc
2461 aaggcagtgg gccagcacat ggctgccgtt cagcagcagc aacagcagca ccagcagcaa
2521 caacagcaac aacaggcgaa ccttcaccag gtggtcacca cagcgggaaa caagatggtc
2581 gtgatgagca cgggcacgcc cataaccctg cagaatggcc agaccctgca tgcagccact
2641 gcgggcggag tggacaagca gcagcaacag cagcagcagc tgcagctcat gcagaagcag
2701 cagtttctgc agcagcaaat gttccaacag cagatagccg ccatccagat ccagcagcag
2761 gcagcagcac aacagcagca gcaacaagtc gcccagcagc aacagcagca gcaacaacaa
2821 catcagcaac aacagcagca gcagcaggcg gtggcccaag cgcagcagga tcagcggcaa
2881 caggtggcac aggctcaggc tcaggcccaa gctcaggttc aggcgcagca acatcagcag
2941 caacaggccc tggctcagca aatactgcag gtagcgccca acaccttcat cacctcccac
3001 caacagcagc agcagcagct ccacaaccaa ctgcttcagc agcagctcca gcagcaggca
3061 caggctcaag tgcaagctca ggttcaggct caggctcagg cacaggcaca gcagcagcaa
3121 caacaacagc aacaacaaca aaaacaacgg gagcagcagc agcagcagaa catcatccaa
3181 caaattgtgg tgcagcaggc ggctggggca ggccaacagc aacaacaaca gcaacagcag
3241 cagacgcagg cgggacaatt gcagctgagc agcgtcccct tctcggtatc ctcgaccacg
3301 acgcccgcag gaatagccac ctcgagtgcc ctccaggccg ccctctcggc ctctggcgcc
3361 atcttccaga cggccaagtc gaccagcagc agctcctctc tgcccaccag cagcgtagtg
3421 acaataagta accacacaac gggtcccctg gtcaccagca gcacgatggc agccagcatc
3481 caccaagccc aggtccagca gcagcaacac caacagcagc agcagcaaca acagcagcag
3541 caacagcaac agcagcagca tcagttaatc tccgccagca ttgcagcggc cacacagcag
3601 cagcagcaac agcaacagca gcatcaacaa caacagcagg gaccacccgc tctggcggct
3661 gcatcgccct cacccgccac gaaccccatc atggccatga catccatgat gaacgccacg
3721 gctggacctg tcaccagcag cggagtgatg tcctctcctg caacgctggt cgcgttcagc
3781 gctgccagtg gaggtagtca tccggcgaca cccaccaagg agacgccgct gaagatgccc
3841 acccccaccg ccaccctggt gcccattggg tcccctctaa acagcagcgc cactagccag
3901 gatcaccagc catcgtccgt caacaccacc cccagatccg ctgcaaacgc cagtgccagt
3961 gccagtgcca ccgcggaggc aagtagctcc acgagtgact cctccagggt gaatggagag
4021 gccccggagg cgtctcatag cagcagcagc accaccacca cgcccacgaa ggccaccacc
4081 agcacgccca ccacaaggca gagcaatgtg gtgctgccca cgagtagctg cagcaccacc
4141 agcagcacca ctagctcctc cacaaccacg cacagcggaa aggatgatga caaggacgga
4201 gcggctacag ccaccagctt caccagcagt agcgcacctt caacgccgac cacgacgaca
4261 gtcagcaacg ggattgggat agcaaccctg gccagggcag ggagcaccac tgtgaccacc
4321 accacgacga ccaacagcag cagcactgcg acgactacac ccacaactac aactacaacg
4381 acaacgagca tcagcaatgg gagcagcaac gcgggaggga aggatctgcc gaaggccatg
4441 atcaagccca atgtgctgac ccatgtcatt gacggattca tcatccagga ggccaacgag
4501 cccttccctg tgacgaggca gcgctatgct gacaaggaca ccagcgacga gccgccaaag
4561 aaaaaggctg ccatgcagga ggaggcgaag ccatgcggaa tagccaccgc aacgccagcc
4621 acagccacaa ccaaaacccc atccccaaca gcaggcagtg gcagtgcgac ggacatggtg
4681 gcctgcgagc agtgcggaaa gctggagcac aaggcgaagc tcaagcggaa gcgcttctgc
4741 tccccaggct gcgccaggca ggcgaagact ggtgtcgcag gcattgcggc agcagctgga
4801 ggcggagtgg gagtgggagt aggagagagc aatggaatgg gaatggaaat ggaaattgga
4861 ggaattgtgg gagtggatgc catggcgctg gtggacaaac tggacgaggc catggccgag
4921 gagaagatgc agatgcagac ggacgcactg caggcgctgc agcccgaacc gatgtccctt
4981 gtgccattgt caagcaacac ggaggtgcca ctggtgtccc ttcctgtcct gccagtcatg
5041 gcaggcaccc ccgttccagt ccctccccta gttgcagtcg cactcgcagt tcccgcttcc
5101 gtggcgctgc ctgcgactcc gtctccgggt gccacaccac cagctgcagc ggtggcgccc
5161 cagccaccag taccagcagc agcaccctcc tcgagcgcag cgggcgagcg ttcgcccatc
5221 tgcaactgga gcgtggacga ggtggctgac ttcatacgga acctgccagg ctgccaggac
5281 tatgtggacg actttgtcca gcaggagatc gacggccagg cgctgctgct gctcaaggag
5341 aatcacctgg tgaatgccat ggggatgaag ctgggccccg ccctcaagat tgtggccaag
5401 gtggagtcca tgaaggaggt ggtcccggcg ccgggctctg gcgaggccaa ggaggcaacg
5461 gccgcgggag gagctcaata ataccagcct gatgctccag ccgatgccat tgccgaggca
5521 gatgacgagg acattcccat gccctcctac tcgacatctc cgccaccatt ctcgcttctc
5581 cgtctccggc ttacgtacgg atcgaggcaa cagagggaat tgccagaggg aactgggctg
5641 gtggagca