protein sidekick isoform X10 [Drosophila obscura].
LOCUS XP_041447895 2234 aa linear INV 14-MAY-2021
ACCESSION XP_041447895
VERSION XP_041447895.1
DBLINK BioProject: PRJNA728747
DBSOURCE REFSEQ: accession XM_041591961.1
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..2234
/organism="Drosophila obscura"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
Protein 1..2234
/product="protein sidekick isoform X10"
/calculated_mol_wt=247802
Region 83..153
/region_name="Ig_3"
/note="Immunoglobulin domain; pfam13927"
/db_xref="CDD:464046"
Region 272..367
/region_name="Ig"
/note="Immunoglobulin domain; cl11960"
/db_xref="CDD:472250"
Region 290..294
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409353"
Region 305..309
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409353"
Region 330..334
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409353"
Region 344..349
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409353"
Region 358..361
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409353"
Region 371..462
/region_name="Ig"
/note="Immunoglobulin domain; cl11960"
/db_xref="CDD:472250"
Region 389..393
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409543"
Region 402..406
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409543"
Region 428..431
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409543"
Region 441..446
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409543"
Region 454..457
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409543"
Region 466..553
/region_name="I-set"
/note="Immunoglobulin I-set domain; pfam07679"
/db_xref="CDD:400151"
Region 483..487
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409562"
Region 496..500
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409562"
Region 520..523
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409562"
Region 533..538
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409562"
Region 546..549
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409562"
Region 558..647
/region_name="I-set"
/note="Immunoglobulin I-set domain; pfam07679"
/db_xref="CDD:400151"
Region 574..578
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409544"
Region 589..593
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409544"
Region 613..617
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409544"
Region <627..>888
/region_name="FN3"
/note="Fibronectin type 3 domain [General function
prediction only]; COG3401"
/db_xref="CDD:442628"
Region 627..632
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409544"
Region 640..643
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409544"
Region 651..758
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(651,732,747)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(748..749,751..752)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 870..974
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(963..964,966..967)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Site order(979,1044,1059)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Region 980..1063
/region_name="fn3"
/note="Fibronectin type III domain; pfam00041"
/db_xref="CDD:394996"
Region 1077..1166
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1077,1143,1158)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(1159..1160,1162..1163)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1179..1267
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Region 1286..1371
/region_name="fn3"
/note="Fibronectin type III domain; pfam00041"
/db_xref="CDD:394996"
Region <1355..1699
/region_name="FN3"
/note="Fibronectin type 3 domain [General function
prediction only]; COG3401"
/db_xref="CDD:442628"
Site order(1367..1368,1370..1371)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1482..1577
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1482,1549,1564)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(1565..1566,1568..1569)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1691..1792
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Region <1767..>1979
/region_name="FN3"
/note="Fibronectin type 3 domain [General function
prediction only]; COG3401"
/db_xref="CDD:442628"
Site order(1781..1782,1784..1785)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
CDS 1..2234
/gene="LOC111066347"
/coded_by="XM_041591961.1:351..7055"
/db_xref="GeneID:111066347"
ORIGIN
1 mkrdqrrssa sslrrrrrwc vdvnekgtrm wlkislsqpl easlfvlaal lllnadscsc
61 yadanpqqqq qlvqqqqqql qaprftthps ssgsivsegs tkilqchalg ypqptyrwlk
121 dgksvgefss sqfyrfhstr redagsyqci akndagsifs eksdvvvaym gifenvtegr
181 ltvvsghpai fdmpaiesvp tpsvlwqsad gslnydikya ftqanqliil svdendrrgy
241 raraintqlg keeisafvhl nvsgdpyiev apeiivrpqd vkvktgtgvl elqcianarp
301 lheletiwlk dglavdttgv rhtlndpwnr tlallqanss hsgeytcqvr lrsggyptvt
361 asarvqilep pvfftpmrae tfgefggqvq lpcdvvgept pqvewfrnae sveanvqsgr
421 yslgedntli ikklilddsa mfqclarnea gensastwlr vktsapvfeq ppqnvtaldg
481 kdatiscrai gspnpnvtwi ynetqlveis srvqilesgd llisniratd aglyicvran
541 eagsvkgeal lsvlvrtqii qppvdtivll gltatlqckv ssdpsvpyni dwyregqmap
601 isnsqrigvq adgqleiqav rasdvgsysc vvtspggnet rsarlsviel pfppsnvrve
661 rlpepqqrsi nvswtpgfdg nspiskfiiq rrevselekf vgpvpdplln witelsnvsa
721 nqrwmllenl kaatvyqfrv savnrvgegs psepsnvvel pqeapsgppv gfvgsarsms
781 eiitqwqppq eehrngqilg yilryrlfgy nnvpwsyqni tneaqrnfli qelitwkdyi
841 vqiaafnnmg vgvytegaki ktkegvpeap ptnvrvkaln staaqitwkp pnpqqingin
901 qgykiqawqr rqldgeerdm errmmtvpps lidplaeqtt vlggldkfak fnvtvlcftd
961 pgdgvasqlv pvetlddvpd eitalhfddv sdrsvkvlwa pprfangilt gytvryqvkd
1021 rpetmkffnl taddneltvn qlqatthywf evcawtrvgs gppktatiqs gvepvlphap
1081 ttlalsniea fsvvlqftpg fdgnssitkw kveaqtarnm twftlceisd pdaetltvtg
1141 lmpftqyrlr lsatnvvgss rpsdptkdfq tiqakpmhap fnvtvramsa lqlrvrwipl
1201 qqmewfgnpr gynvtyrqme rtgkpskhpp rsvmiedhta nshvleglee wtlyevimna
1261 cndvgcslds glamertrea vpsygplhve anatssttvv vrwgeipphh rngqidgykv
1321 yyaatergmq vlyktipnns sftttltelq kfvvyhvqvl aytrlgngal stppirvqtf
1381 edtpgspsnv sfpdvtfsma riiwdvpmdp ngeilayqvt ytlngsanln ysrefppsdr
1441 tfratglmpe ryysfsvtaq trlgwgktas vlvyttnnrd rpqapsgpqv srsqiqahqi
1501 tfnwtpgrdg faplryytve mrenegrwqp lpervdptls sytalglrpy ttyqfriqat
1561 ndlgpsafsr esivvrtlpa apavgvgglk vvpitttsvr vqwgaletgm wngdaatggy
1621 rilyqqlsdf apalqstpkt dvmginensv vlsdlqqdrn yeivvlpfns qgpgpatppt
1681 avyvgeavpt geprgvdata isstevrlsw kppkqssqng eilgykifyl vtwspqalep
1741 grkfeeeiev vsatatshsl vfldkfteyr iqllafnpag dgprsapvta ktmpgvpsap
1801 lnlrfsditm qslevtwdpp kllngeivgy lvtyetteen ekfskqvkqk vsnttlrvqn
1861 leeevtytft vraqtndygp avsanvttgp qdgspvaprd ltltktlssv evhwvngpsg
1921 rgpilgylie akkrddsrwt kieqsrkgtm keftvsyhil mpstaylfrv iaynkygisf
1981 pvyskdsilt psklhleygy lqhkpfyrqt wfmvslaats iviivmviav lcvksksyky
2041 kqeaqktlee smamsiderq elalelyrsr hgvgtgtlns vgtlrsgtlg tlgrkstnrh
2101 qpvsvhlgks pprpspasva yhsdeeslkc ydenpddssv tekpsevsss easqhsesen
2161 esvrsdphsf vnhyanvnds lrqswkktkp vrnyssytds epegsavmsl nggqiivnnm
2221 arsraplpgf ssfv