protein sidekick isoform X11 [Drosophila obscura].
LOCUS XP_041447896 2230 aa linear INV 14-MAY-2021
ACCESSION XP_041447896
VERSION XP_041447896.1
DBLINK BioProject: PRJNA728747
DBSOURCE REFSEQ: accession XM_041591962.1
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..2230
/organism="Drosophila obscura"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
Protein 1..2230
/product="protein sidekick isoform X11"
/calculated_mol_wt=247298
Region 83..153
/region_name="Ig_3"
/note="Immunoglobulin domain; pfam13927"
/db_xref="CDD:464046"
Region 272..367
/region_name="Ig"
/note="Immunoglobulin domain; cl11960"
/db_xref="CDD:472250"
Region 290..294
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409353"
Region 305..309
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409353"
Region 330..334
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409353"
Region 344..349
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409353"
Region 358..361
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409353"
Region 371..462
/region_name="Ig"
/note="Immunoglobulin domain; cl11960"
/db_xref="CDD:472250"
Region 389..393
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409543"
Region 402..406
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409543"
Region 428..431
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409543"
Region 441..446
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409543"
Region 454..457
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409543"
Region 466..553
/region_name="I-set"
/note="Immunoglobulin I-set domain; pfam07679"
/db_xref="CDD:400151"
Region 483..487
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409562"
Region 496..500
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409562"
Region 520..523
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409562"
Region 533..538
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409562"
Region 546..549
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409562"
Region 558..647
/region_name="I-set"
/note="Immunoglobulin I-set domain; pfam07679"
/db_xref="CDD:400151"
Region 574..578
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409544"
Region 589..593
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409544"
Region 613..617
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409544"
Region <627..>884
/region_name="FN3"
/note="Fibronectin type 3 domain [General function
prediction only]; COG3401"
/db_xref="CDD:442628"
Region 627..632
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409544"
Region 640..643
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409544"
Region 651..754
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(651,728,743)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(744..745,747..748)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 866..970
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(959..960,962..963)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Site order(975,1040,1055)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Region 976..1059
/region_name="fn3"
/note="Fibronectin type III domain; pfam00041"
/db_xref="CDD:394996"
Region 1073..1162
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1073,1139,1154)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(1155..1156,1158..1159)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1175..1263
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Region 1282..1367
/region_name="fn3"
/note="Fibronectin type III domain; pfam00041"
/db_xref="CDD:394996"
Region <1351..1695
/region_name="FN3"
/note="Fibronectin type 3 domain [General function
prediction only]; COG3401"
/db_xref="CDD:442628"
Site order(1363..1364,1366..1367)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1478..1573
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1478,1545,1560)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(1561..1562,1564..1565)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1687..1788
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Region <1763..>1975
/region_name="FN3"
/note="Fibronectin type 3 domain [General function
prediction only]; COG3401"
/db_xref="CDD:442628"
Site order(1777..1778,1780..1781)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
CDS 1..2230
/gene="LOC111066347"
/coded_by="XM_041591962.1:351..7043"
/db_xref="GeneID:111066347"
ORIGIN
1 mkrdqrrssa sslrrrrrwc vdvnekgtrm wlkislsqpl easlfvlaal lllnadscsc
61 yadanpqqqq qlvqqqqqql qaprftthps ssgsivsegs tkilqchalg ypqptyrwlk
121 dgksvgefss sqfyrfhstr redagsyqci akndagsifs eksdvvvaym gifenvtegr
181 ltvvsghpai fdmpaiesvp tpsvlwqsad gslnydikya ftqanqliil svdendrrgy
241 raraintqlg keeisafvhl nvsgdpyiev apeiivrpqd vkvktgtgvl elqcianarp
301 lheletiwlk dglavdttgv rhtlndpwnr tlallqanss hsgeytcqvr lrsggyptvt
361 asarvqilep pvfftpmrae tfgefggqvq lpcdvvgept pqvewfrnae sveanvqsgr
421 yslgedntli ikklilddsa mfqclarnea gensastwlr vktsapvfeq ppqnvtaldg
481 kdatiscrai gspnpnvtwi ynetqlveis srvqilesgd llisniratd aglyicvran
541 eagsvkgeal lsvlvrtqii qppvdtivll gltatlqckv ssdpsvpyni dwyregqmap
601 isnsqrigvq adgqleiqav rasdvgsysc vvtspggnet rsarlsviel pfppsnvrve
661 rlpepqqrsi nvswtpgfdg nspiskfiiq rrevselgpv pdpllnwite lsnvsanqrw
721 mllenlkaat vyqfrvsavn rvgegspsep snvvelpqea psgppvgfvg sarsmseiit
781 qwqppqeehr ngqilgyilr yrlfgynnvp wsyqnitnea qrnfliqeli twkdyivqia
841 afnnmgvgvy tegakiktke gvpeapptnv rvkalnstaa qitwkppnpq qinginqgyk
901 iqawqrrqld geerdmerrm mtvppslidp laeqttvlgg ldkfakfnvt vlcftdpgdg
961 vasqlvpvet lddvpdeita lhfddvsdrs vkvlwapprf angiltgytv ryqvkdrpet
1021 mkffnltadd neltvnqlqa tthywfevca wtrvgsgppk tatiqsgvep vlphapttla
1081 lsnieafsvv lqftpgfdgn ssitkwkvea qtarnmtwft lceisdpdae tltvtglmpf
1141 tqyrlrlsat nvvgssrpsd ptkdfqtiqa kpmhapfnvt vramsalqlr vrwiplqqme
1201 wfgnprgynv tyrqmertgk pskhpprsvm iedhtanshv legleewtly evimnacndv
1261 gcsldsglam ertreavpsy gplhveanat ssttvvvrwg eipphhrngq idgykvyyaa
1321 tergmqvlyk tipnnssftt tltelqkfvv yhvqvlaytr lgngalstpp irvqtfedtp
1381 gspsnvsfpd vtfsmariiw dvpmdpngei layqvtytln gsanlnysre fppsdrtfra
1441 tglmperyys fsvtaqtrlg wgktasvlvy ttnnrdrpqa psgpqvsrsq iqahqitfnw
1501 tpgrdgfapl ryytvemren egrwqplper vdptlssyta lglrpyttyq friqatndlg
1561 psafsresiv vrtlpaapav gvgglkvvpi tttsvrvqwg aletgmwngd aatggyrily
1621 qqlsdfapal qstpktdvmg inensvvlsd lqqdrnyeiv vlpfnsqgpg patpptavyv
1681 geavptgepr gvdataisst evrlswkppk qssqngeilg ykifylvtws pqalepgrkf
1741 eeeievvsat atshslvfld kfteyriqll afnpagdgpr sapvtaktmp gvpsaplnlr
1801 fsditmqsle vtwdppklln geivgylvty etteenekfs kqvkqkvsnt tlrvqnleee
1861 vtytftvraq tndygpavsa nvttgpqdgs pvaprdltlt ktlssvevhw vngpsgrgpi
1921 lgylieakkr ddsrwtkieq srkgtmkeft vsyhilmpst aylfrviayn kygisfpvys
1981 kdsiltpskl hleygylqhk pfyrqtwfmv slaatsivii vmviavlcvk sksykykqea
2041 qktleesmam siderqelal elyrsrhgvg tgtlnsvgtl rsgtlgtlgr kstnrhqpvs
2101 vhlgkspprp spasvayhsd eeslkcyden pddssvtekp sevssseasq hsesenesvr
2161 sdphsfvnhy anvndslrqs wkktkpvrny ssytdsepeg savmslnggq iivnnmarsr
2221 aplpgfssfv