protein sidekick isoform X7 [Drosophila obscura].
LOCUS XP_041447891 2291 aa linear INV 14-MAY-2021
ACCESSION XP_041447891
VERSION XP_041447891.1
DBLINK BioProject: PRJNA728747
DBSOURCE REFSEQ: accession XM_041591957.1
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..2291
/organism="Drosophila obscura"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
Protein 1..2291
/product="protein sidekick isoform X7"
/calculated_mol_wt=253664
Region 83..153
/region_name="Ig_3"
/note="Immunoglobulin domain; pfam13927"
/db_xref="CDD:464046"
Region 272..367
/region_name="Ig"
/note="Immunoglobulin domain; cl11960"
/db_xref="CDD:472250"
Region 290..294
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409353"
Region 305..309
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409353"
Region 330..334
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409353"
Region 344..349
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409353"
Region 358..361
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409353"
Region 371..462
/region_name="Ig"
/note="Immunoglobulin domain; cl11960"
/db_xref="CDD:472250"
Region 389..393
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409543"
Region 402..406
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409543"
Region 428..431
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409543"
Region 441..446
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409543"
Region 454..457
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409543"
Region 527..614
/region_name="I-set"
/note="Immunoglobulin I-set domain; pfam07679"
/db_xref="CDD:400151"
Region 544..548
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409562"
Region 557..561
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409562"
Region 581..584
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409562"
Region 594..599
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409562"
Region 607..610
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409562"
Region 619..708
/region_name="I-set"
/note="Immunoglobulin I-set domain; pfam07679"
/db_xref="CDD:400151"
Region 635..639
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409544"
Region 650..654
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409544"
Region 674..678
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409544"
Region <688..>945
/region_name="FN3"
/note="Fibronectin type 3 domain [General function
prediction only]; COG3401"
/db_xref="CDD:442628"
Region 688..693
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409544"
Region 701..704
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409544"
Region 712..815
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(712,789,804)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(805..806,808..809)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 927..1031
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1020..1021,1023..1024)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Site order(1036,1101,1116)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Region 1037..1120
/region_name="fn3"
/note="Fibronectin type III domain; pfam00041"
/db_xref="CDD:394996"
Region 1134..1223
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1134,1200,1215)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(1216..1217,1219..1220)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1236..1324
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Region 1343..1428
/region_name="fn3"
/note="Fibronectin type III domain; pfam00041"
/db_xref="CDD:394996"
Region <1412..1756
/region_name="FN3"
/note="Fibronectin type 3 domain [General function
prediction only]; COG3401"
/db_xref="CDD:442628"
Site order(1424..1425,1427..1428)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1539..1634
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1539,1606,1621)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(1622..1623,1625..1626)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1748..1849
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Region <1824..>2036
/region_name="FN3"
/note="Fibronectin type 3 domain [General function
prediction only]; COG3401"
/db_xref="CDD:442628"
Site order(1838..1839,1841..1842)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
CDS 1..2291
/gene="LOC111066347"
/coded_by="XM_041591957.1:351..7226"
/db_xref="GeneID:111066347"
ORIGIN
1 mkrdqrrssa sslrrrrrwc vdvnekgtrm wlkislsqpl easlfvlaal lllnadscsc
61 yadanpqqqq qlvqqqqqql qaprftthps ssgsivsegs tkilqchalg ypqptyrwlk
121 dgksvgefss sqfyrfhstr redagsyqci akndagsifs eksdvvvaym gifenvtegr
181 ltvvsghpai fdmpaiesvp tpsvlwqsad gslnydikya ftqanqliil svdendrrgy
241 raraintqlg keeisafvhl nvsgdpyiev apeiivrpqd vkvktgtgvl elqcianarp
301 lheletiwlk dglavdttgv rhtlndpwnr tlallqanss hsgeytcqvr lrsggyptvt
361 asarvqilep pvfftpmrae tfgefggqvq lpcdvvgept pqvewfrnae sveanvqsgr
421 yslgedntli ikklilddsa mfqclarnea gensastwlr vkteteteaa nsrikrlaqp
481 rilrvrashg gsgtvagsvt gsgsgsgsts nshqqgrrkq frfasapvfe qppqnvtald
541 gkdatiscra igspnpnvtw iynetqlvei ssrvqilesg dllisnirat daglyicvra
601 neagsvkgea llsvlvrtqi iqppvdtivl lgltatlqck vssdpsvpyn idwyregqma
661 pisnsqrigv qadgqleiqa vrasdvgsys cvvtspggne trsarlsvie lpfppsnvrv
721 erlpepqqrs invswtpgfd gnspiskfii qrrevselgp vpdpllnwit elsnvsanqr
781 wmllenlkaa tvyqfrvsav nrvgegspse psnvvelpqe apsgppvgfv gsarsmseii
841 tqwqppqeeh rngqilgyil ryrlfgynnv pwsyqnitne aqrnfliqel itwkdyivqi
901 aafnnmgvgv ytegakiktk egvpeapptn vrvkalnsta aqitwkppnp qqinginqgy
961 kiqawqrrql dgeerdmerr mmtvppslid plaeqttvlg gldkfakfnv tvlcftdpgd
1021 gvasqlvpve tlddvpdeit alhfddvsdr svkvlwappr fangiltgyt vryqvkdrpe
1081 tmkffnltad dneltvnqlq atthywfevc awtrvgsgpp ktatiqsgve pvlphapttl
1141 alsnieafsv vlqftpgfdg nssitkwkve aqtarnmtwf tlceisdpda etltvtglmp
1201 ftqyrlrlsa tnvvgssrps dptkdfqtiq akpmhapfnv tvramsalql rvrwiplqqm
1261 ewfgnprgyn vtyrqmertg kpskhpprsv miedhtansh vlegleewtl yevimnacnd
1321 vgcsldsgla mertreavps ygplhveana tssttvvvrw geipphhrng qidgykvyya
1381 atergmqvly ktipnnssft ttltelqkfv vyhvqvlayt rlgngalstp pirvqtfedt
1441 pgspsnvsfp dvtfsmarii wdvpmdpnge ilayqvtytl ngsanlnysr efppsdrtfr
1501 atglmperyy sfsvtaqtrl gwgktasvlv yttnnrdrpq apsgpqvsrs qiqahqitfn
1561 wtpgrdgfap lryytvemre negrwqplpe rvdptlssyt alglrpytty qfriqatndl
1621 gpsafsresi vvrtlpaapa vgvgglkvvp itttsvrvqw galetgmwng daatggyril
1681 yqqlsdfapa lqstpktdvm ginensvvls dlqqdrnyei vvlpfnsqgp gpatpptavy
1741 vgeavptgep rgvdataiss tevrlswkpp kqssqngeil gykifylvtw spqalepgrk
1801 feeeievvsa tatshslvfl dkfteyriql lafnpagdgp rsapvtaktm pgvpsaplnl
1861 rfsditmqsl evtwdppkll ngeivgylvt yetteenekf skqvkqkvsn ttlrvqnlee
1921 evtytftvra qtndygpavs anvttgpqdg spvaprdltl tktlssvevh wvngpsgrgp
1981 ilgylieakk rddsrwtkie qsrkgtmkef tvsyhilmps taylfrviay nkygisfpvy
2041 skdsiltpsk lhleygylqh kpfyrqtwfm vslaatsivi ivmviavlcv ksksykykqe
2101 aqktleesma msiderqela lelyrsrhgv gtgtlnsvgt lrsgtlgtlg rkstnrhqpv
2161 svhlgksppr pspasvayhs deeslkcyde npddssvtek psevssseas qhsesenesv
2221 rsdphsfvnh yanvndslrq swkktkpvrn yssytdsepe gsavmslngg qiivnnmars
2281 raplpgfssf v