LOW QUALITY PROTEIN: polyhomeotic-proximal chromatin protein
LOCUS XP_041447855 1763 aa linear INV 14-MAY-2021
[Drosophila obscura].
ACCESSION XP_041447855
VERSION XP_041447855.1
DBLINK BioProject: PRJNA728747
DBSOURCE REFSEQ: accession XM_041591921.1
KEYWORDS RefSeq; corrected model.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
##RefSeq-Attributes-START##
frameshifts :: corrected 1 indel
##RefSeq-Attributes-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..1763
/organism="Drosophila obscura"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
Protein 1..1763
/product="LOW QUALITY PROTEIN: polyhomeotic-proximal
chromatin protein"
/calculated_mol_wt=184144
Region 1675..1743
/region_name="SAM_Ph1,2,3"
/note="SAM domain of Ph (polyhomeotic) proteins of
Polycomb group; cd09577"
/db_xref="CDD:188976"
Site order(1693..1694,1728..1729,1732..1733,1736)
/site_type="other"
/note="oligomer interface EH [polypeptide binding]"
/db_xref="CDD:188976"
Site order(1704..1708,1710..1711,1714..1715,1719,1724)
/site_type="other"
/note="oligomer interface ML [polypeptide binding]"
/db_xref="CDD:188976"
CDS 1..1763
/gene="LOC111070665"
/coded_by="XM_041591921.1:190..5481"
/note="The sequence of the model RefSeq protein was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon"
/db_xref="GeneID:111070665"
ORIGIN
1 mdrralkfmq kradtesdtt ttattvtapg sslatapagg gtlplkdnsn irekplhnsn
61 nnnnnnhnnn nnnnssqqqp skqherplkc letlaqkagi tfdekydvas pphpgiaqqp
121 sstpsaataq aqahrnggan tgagtpptar rhahtpvtpn tpstpnttss ttpqharhns
181 npnsashtme ksqspaqqva sattvplqis peqlqqfyas npyaiqvkqe fpthtagttt
241 telkhatgll dasqasqlqq mqlqqltaaa adaaggngsa gggggaqggg apspanqqgq
301 qqqqqqhsta istmspmqla aatggvtgdw sqgrtvqlmq pstgfiyppm misgnllhsa
361 glgqqpiqvi tagkpfqgng pqmittttqn akqmiggqgg faggtyaips sqspqtllfs
421 pvnvishspq qqqsllqsmv aqqqqqqqql naqqqqqlna qqqqqltaqq avamakagvg
481 vgvgvgadaq gkmqaqkvvq kvttttntvq aasagaggaq sqqqqqqqtt tqqcvqvsqs
541 tlpgvgvgvg vgvggqllnp lggagagqaq qmqlgpwfwq nglqpfgsns iilrgqpdgt
601 qgmfiqqqpt tqtlqtqqnq iiqcnvtqtp tkprtqldal askqqqqqqq qqqqaaansq
661 aqqqqqqqqq qqqqlavata qlqqqqqqlt alqrpgapim phngtqvrpa ssvstqtaqn
721 qnllkakmrn kqqpvrpalp alktengqvv avgavqskav gqhmaavqqq qqqhqqqqqq
781 qqanlhqvvt tagnkmvvms tgtpitlqng qtlhaatagg vdkqqqqqqq lqlmqkqqfl
841 qqqmfqqqia aiqiqqqaaa qqqqqqvaqq qqqqqqqhqq qqqqqqavaq aqqdqrqqva
901 qaqaqaqaqv qaqqhqqqqa laqqilqvap ntfitshqqq qqqlhnqllq qqlqqqaqaq
961 vqaqvqaqaq aqaqqqqqqq qqqqkqreqq qqqniiqqiv vqqaagagqq qqqqqqqqtq
1021 agqlqlssvp fsvsstttpa giatssalqa alsasgaifq takstsssss lptssvvtis
1081 nhttgplvts stmaasihqa qvqqqqhqqq qqqqqqqqqq qqqhqlisas iaaatqqqqq
1141 qqqqhqqqqq gppalaaasp spatnpimam tsmmnatagp vtssgvmssp atlvafsaas
1201 ggshpatptk etplkmptpt atlvpigspl nssatsqdhq pssvnttprs aanasasasa
1261 taeassstsd ssrvngeape ashssssttt tptkattstp ttrqsnvvlp tsscsttsst
1321 tsssttthsg kdddkdgaat atsftsssap stpttttvsn gigiatlara gsttvttttt
1381 tnssstattt ptttttttts isngssnagg kdlpkamikp nvlthvidgf iiqeanepfp
1441 vtrqryadkd tsdeppkkka amqeeakpcg iatatpatat tktpsptags gsatdmvace
1501 qcgklehkak lkrkrfcspg carqaktgva giaaaagggv gvgvgesngm gmemeiggiv
1561 gvdamalvdk ldeamaeekm qmqtdalqal qpepmslvpl ssntevplvs lpvlpvmagt
1621 pvpvpplvav alavpasval patpspgatp paaavapqpp vpaaapsssa agerspicnw
1681 svdevadfir nlpgcqdyvd dfvqqeidgq allllkenhl vnamgmklgp alkivakves
1741 mkevvpapgs geakeataag gaq