LOCUS XP_022220001 1199 aa linear INV 14-MAY-2021
DEFINITION mucin-5AC isoform X1 [Drosophila obscura].
ACCESSION XP_022220001
VERSION XP_022220001.2
DBLINK BioProject: PRJNA728747
DBSOURCE REFSEQ: accession XM_022364309.2
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XP_022220001.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..1199
/organism="Drosophila obscura"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
Protein 1..1199
/product="mucin-5AC isoform X1"
/calculated_mol_wt=129952
Region 412..475
/region_name="MBD"
/note="MeCP2, MBD1, MBD2, MBD3, MBD4, CLLD8-like, and
BAZ2A-like proteins constitute a family of proteins that
share the methyl-CpG-binding domain (MBD). The MBD
consists of about 70 residues and is defined as the
minimal region required for binding to...; cl00110"
/db_xref="CDD:469618"
Site order(423,425,427,434,436,445,448,452)
/site_type="DNA binding"
/note="DNA binding site [nucleotide binding]"
/db_xref="CDD:238069"
Region <810..839
/region_name="F-box-like"
/note="pfam12937"
/db_xref="CDD:463757"
Region <904..>991
/region_name="LRR"
/note="Leucine-rich repeat (LRR) protein [Transcription];
COG4886"
/db_xref="CDD:443914"
CDS 1..1199
/gene="LOC111072436"
/coded_by="XM_022364309.2:101..3700"
/db_xref="GeneID:111072436"
ORIGIN
1 msspdepgps ngrksrskrr dksvsrssns ssgsnfgrhs snvvnggslr drllvanama
61 tdetnsmvng ndknqrkkkk kkssrrksqt glkgysgrnp slsylqneis ddsdedtrsp
121 aeiralctsg lrvfpreqqt estadsavqa aaaaattpda padpaspdtd tptssqmsss
181 pspfsrteqt kttiqlptat epshpsiqlt tttttpntgi ststipatss ctnlmtpvss
241 aspvttlqsp ttppsptfsp tasannnnsn hssenisnhs svasitsepe pesrttttat
301 asqqtatdld kkdevmipvt dssqlsrkrp ksrmgiktnd nnvenkakvs aasippskkk
361 kknqtpcvfi prklicgddp lpdplpevap rrksiysdaa kkpkpnlsde lyrlpfkfgw
421 krevvtpspl sntsnhtiyt spcgkrcrqi seivplltne ltiehfifgt hllgagsefe
481 tsrqalsrem hyaalkerrk slaadkstas atikverkrr qtmgaesypk easasapfgk
541 rrksinpaes vtepskpmav estkplagev aalvtgkrvp kpkvpkgasp ptegwtstma
601 vkgnarllaa asngnartag ssnasgssna apvshakrat cgsclklikg svcqscvrss
661 areegpnagr imdeyeelee deeeseesyq agtvsnglkv tkmieppgem ppaelfstdk
721 spvttptdly mpqevvvigg rkaisivgep tttsqpkviv pnppdlssye efygkriaap
781 kqlasptslw gaalgegfnc hfllslmktl nqqdrvncsn vcklwnlvsr dsdvwksvsl
841 rdtkvnnwpa lvremarnct reldmmgaii pntgtliagd mrvltdmrtv rtnhtkadfl
901 hqifsglpql ekligtcvss glimtdidkm enlnelrirm tdtkasitgl phvgklthlr
961 vlslrgvknl gnllflkelp nletlnlgyc qsmhrlqlgn evlptltklq rfrletdprk
1021 ktsfpideim kglahaggvr rlelvnvdvd ckfsqllsnc ntveelllip kcqtntavmi
1081 rsvmgisrng sqlkqfklvl itqlltatga mlrnpdvpmv pvirpipgil lgdrlnscsk
1141 ecqeqqhdrc vaglpferlk iimselmpns spsvvtmamm dtptiqlgrl ppdatppnf