protein sidekick isoform X5 [Drosophila obscura].
LOCUS XP_041447889 2309 aa linear INV 14-MAY-2021
ACCESSION XP_041447889
VERSION XP_041447889.1
DBLINK BioProject: PRJNA728747
DBSOURCE REFSEQ: accession XM_041591955.1
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..2309
/organism="Drosophila obscura"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
Protein 1..2309
/product="protein sidekick isoform X5"
/calculated_mol_wt=255896
Region 83..153
/region_name="Ig_3"
/note="Immunoglobulin domain; pfam13927"
/db_xref="CDD:464046"
Region 272..367
/region_name="Ig"
/note="Immunoglobulin domain; cl11960"
/db_xref="CDD:472250"
Region 290..294
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409353"
Region 305..309
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409353"
Region 330..334
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409353"
Region 344..349
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409353"
Region 358..361
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409353"
Region 371..462
/region_name="Ig"
/note="Immunoglobulin domain; cl11960"
/db_xref="CDD:472250"
Region 389..393
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409543"
Region 402..406
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409543"
Region 428..431
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409543"
Region 441..446
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409543"
Region 454..457
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409543"
Region 527..614
/region_name="I-set"
/note="Immunoglobulin I-set domain; pfam07679"
/db_xref="CDD:400151"
Region 544..548
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409562"
Region 557..561
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409562"
Region 581..584
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409562"
Region 594..599
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409562"
Region 607..610
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409562"
Region 619..708
/region_name="I-set"
/note="Immunoglobulin I-set domain; pfam07679"
/db_xref="CDD:400151"
Region 635..639
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409544"
Region 650..654
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409544"
Region 674..678
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409544"
Region 688..693
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409544"
Region 701..704
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409544"
Region 712..819
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(712,793,808)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(809..810,812..813)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 834..929
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(918..919,921..922)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 937..1041
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1030..1031,1033..1034)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Site order(1046,1111,1126)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Region 1047..1130
/region_name="fn3"
/note="Fibronectin type III domain; pfam00041"
/db_xref="CDD:394996"
Region 1144..1233
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1144,1210,1225)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(1226..1227,1229..1230)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1246..1334
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Region 1353..1438
/region_name="fn3"
/note="Fibronectin type III domain; pfam00041"
/db_xref="CDD:394996"
Region 1426..1977
/region_name="FN3"
/note="Fibronectin type 3 domain [General function
prediction only]; COG3401"
/db_xref="CDD:442628"
Site order(1434..1435,1437..1438)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1962..2059
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1962,2037,2052)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(2053..2054,2056..2057)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
CDS 1..2309
/gene="LOC111066347"
/coded_by="XM_041591955.1:351..7280"
/db_xref="GeneID:111066347"
ORIGIN
1 mkrdqrrssa sslrrrrrwc vdvnekgtrm wlkislsqpl easlfvlaal lllnadscsc
61 yadanpqqqq qlvqqqqqql qaprftthps ssgsivsegs tkilqchalg ypqptyrwlk
121 dgksvgefss sqfyrfhstr redagsyqci akndagsifs eksdvvvaym gifenvtegr
181 ltvvsghpai fdmpaiesvp tpsvlwqsad gslnydikya ftqanqliil svdendrrgy
241 raraintqlg keeisafvhl nvsgdpyiev apeiivrpqd vkvktgtgvl elqcianarp
301 lheletiwlk dglavdttgv rhtlndpwnr tlallqanss hsgeytcqvr lrsggyptvt
361 asarvqilep pvfftpmrae tfgefggqvq lpcdvvgept pqvewfrnae sveanvqsgr
421 yslgedntli ikklilddsa mfqclarnea gensastwlr vkteteteaa nsrikrlaqp
481 rilrvrashg gsgtvagsvt gsgsgsgsts nshqqgrrkq frfasapvfe qppqnvtald
541 gkdatiscra igspnpnvtw iynetqlvei ssrvqilesg dllisnirat daglyicvra
601 neagsvkgea llsvlvrtqi iqppvdtivl lgltatlqck vssdpsvpyn idwyregqma
661 pisnsqrigv qadgqleiqa vrasdvgsys cvvtspggne trsarlsvie lpfppsnvrv
721 erlpepqqrs invswtpgfd gnspiskfii qrrevselek fvgpvpdpll nwitelsnvs
781 anqrwmllen lkaatvyqfr vsavnrvgeg spsepsnvve lpqeefplyr apsgppvgfv
841 gsarsmseii tqwqppqeeh rngqilgyil ryrlfgynnv pwsyqnitne aqrnfliqel
901 itwkdyivqi aafnnmgvgv ytegakiktk egvpeapptn vrvkalnsta aqitwkppnp
961 qqinginqgy kiqawqrrql dgeerdmerr mmtvppslid plaeqttvlg gldkfakfnv
1021 tvlcftdpgd gvasqlvpve tlddvpdeit alhfddvsdr svkvlwappr fangiltgyt
1081 vryqvkdrpe tmkffnltad dneltvnqlq atthywfevc awtrvgsgpp ktatiqsgve
1141 pvlphapttl alsnieafsv vlqftpgfdg nssitkwkve aqtarnmtwf tlceisdpda
1201 etltvtglmp ftqyrlrlsa tnvvgssrps dptkdfqtiq akpmhapfnv tvramsalql
1261 rvrwiplqqm ewfgnprgyn vtyrqmertg kpskhpprsv miedhtansh vlegleewtl
1321 yevimnacnd vgcsldsgla mertreavps ygplhveana tssttvvvrw geipphhrng
1381 qidgykvyya atergmqvly ktipnnssft ttltelqkfv vyhvqvlayt rlgngalstp
1441 pirvqtfedt pgspsnvsfp dvtfsmarii wdvpmdpnge ilayqvtytl ngsanlnysr
1501 efppsdrtfr atglmperyy sfsvtaqtrl gwgktasvlv yttnnrdrpq apsgpqvsrs
1561 qiqahqitfn wtpgrdgfap lryytvemre negrwqplpe rvdptlssyt alglrpytty
1621 qfriqatndl gpsafsresi vvrtlpaapa vgvgglkvvp itttsvrvqw galetgmwng
1681 daatggyril yqqlsdfapa lqstpktdvm ginensvvls dlqqdrnyei vvlpfnsqgp
1741 gpatpptavy vgeavptgep rgvdataiss tevrlswkpp kqssqngeil gykifylvtw
1801 spqalepgrk feeeievvsa tatshslvfl dkfteyriql lafnpagdgp rsapvtaktm
1861 pgvpsaplnl rfsditmqsl evtwdppkll ngeivgylvt yetteenekf skqvkqkvsn
1921 ttlrvqnlee evtytftvra qtndygpavs anvttgpqdg spvaprdltl tktlssvevh
1981 wvngpsgrgp ilgylieakk rengepsfiy dsrwtkieqs rkgtmkeftv syhilmpsta
2041 ylfrviaynk ygisfpvysk dsiltpsklh leygylqhkp fyrqtwfmvs laatsiviiv
2101 mviavlcvks ksykykqeaq ktleesmams iderqelale lyrsrhgvgt gtlnsvgtlr
2161 sgtlgtlgrk stnrhqpvsv hlgkspprps pasvayhsde eslkcydenp ddssvtekps
2221 evssseasqh sesenesvrs dphsfvnhya nvndslrqsw kktkpvrnys sytdsepegs
2281 avmslnggqi ivnnmarsra plpgfssfv