protein sidekick isoform X2 [Drosophila obscura].
LOCUS XP_041447886 2314 aa linear INV 14-MAY-2021
ACCESSION XP_041447886
VERSION XP_041447886.1
DBLINK BioProject: PRJNA728747
DBSOURCE REFSEQ: accession XM_041591952.1
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..2314
/organism="Drosophila obscura"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
Protein 1..2314
/product="protein sidekick isoform X2"
/calculated_mol_wt=256441
Region 83..153
/region_name="Ig_3"
/note="Immunoglobulin domain; pfam13927"
/db_xref="CDD:464046"
Region 272..367
/region_name="Ig"
/note="Immunoglobulin domain; cl11960"
/db_xref="CDD:472250"
Region 290..294
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409353"
Region 305..309
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409353"
Region 330..334
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409353"
Region 344..349
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409353"
Region 358..361
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409353"
Region 371..462
/region_name="Ig"
/note="Immunoglobulin domain; cl11960"
/db_xref="CDD:472250"
Region 389..393
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409543"
Region 402..406
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409543"
Region 428..431
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409543"
Region 441..446
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409543"
Region 454..457
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409543"
Region 527..614
/region_name="I-set"
/note="Immunoglobulin I-set domain; pfam07679"
/db_xref="CDD:400151"
Region 544..548
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409562"
Region 557..561
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409562"
Region 581..584
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409562"
Region 594..599
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409562"
Region 607..610
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409562"
Region 619..708
/region_name="I-set"
/note="Immunoglobulin I-set domain; pfam07679"
/db_xref="CDD:400151"
Region 635..639
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409544"
Region 650..654
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409544"
Region 674..678
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409544"
Region <688..>951
/region_name="FN3"
/note="Fibronectin type 3 domain [General function
prediction only]; COG3401"
/db_xref="CDD:442628"
Region 688..693
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409544"
Region 701..704
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409544"
Region 712..815
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(712,789,804)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(805..806,808..809)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 933..1037
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1026..1027,1029..1030)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Site order(1042,1107,1122)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Region 1043..1126
/region_name="fn3"
/note="Fibronectin type III domain; pfam00041"
/db_xref="CDD:394996"
Region 1140..1229
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1140,1206,1221)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(1222..1223,1225..1226)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1242..1330
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Region 1349..1434
/region_name="fn3"
/note="Fibronectin type III domain; pfam00041"
/db_xref="CDD:394996"
Region 1422..1973
/region_name="FN3"
/note="Fibronectin type 3 domain [General function
prediction only]; COG3401"
/db_xref="CDD:442628"
Site order(1430..1431,1433..1434)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1958..2064
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1958,2042,2057)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(2058..2059,2061..2062)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
CDS 1..2314
/gene="LOC111066347"
/coded_by="XM_041591952.1:351..7295"
/db_xref="GeneID:111066347"
ORIGIN
1 mkrdqrrssa sslrrrrrwc vdvnekgtrm wlkislsqpl easlfvlaal lllnadscsc
61 yadanpqqqq qlvqqqqqql qaprftthps ssgsivsegs tkilqchalg ypqptyrwlk
121 dgksvgefss sqfyrfhstr redagsyqci akndagsifs eksdvvvaym gifenvtegr
181 ltvvsghpai fdmpaiesvp tpsvlwqsad gslnydikya ftqanqliil svdendrrgy
241 raraintqlg keeisafvhl nvsgdpyiev apeiivrpqd vkvktgtgvl elqcianarp
301 lheletiwlk dglavdttgv rhtlndpwnr tlallqanss hsgeytcqvr lrsggyptvt
361 asarvqilep pvfftpmrae tfgefggqvq lpcdvvgept pqvewfrnae sveanvqsgr
421 yslgedntli ikklilddsa mfqclarnea gensastwlr vkteteteaa nsrikrlaqp
481 rilrvrashg gsgtvagsvt gsgsgsgsts nshqqgrrkq frfasapvfe qppqnvtald
541 gkdatiscra igspnpnvtw iynetqlvei ssrvqilesg dllisnirat daglyicvra
601 neagsvkgea llsvlvrtqi iqppvdtivl lgltatlqck vssdpsvpyn idwyregqma
661 pisnsqrigv qadgqleiqa vrasdvgsys cvvtspggne trsarlsvie lpfppsnvrv
721 erlpepqqrs invswtpgfd gnspiskfii qrrevselgp vpdpllnwit elsnvsanqr
781 wmllenlkaa tvyqfrvsav nrvgegspse psnvvelpqe efplyrapsg ppvgfvgsar
841 smseiitqwq ppqeehrngq ilgyilryrl fgynnvpwsy qnitneaqrn fliqelitwk
901 dyivqiaafn nmgvgvyteg akiktkegvp eapptnvrvk alnstaaqit wkppnpqqin
961 ginqgykiqa wqrrqldgee rdmerrmmtv ppslidplae qttvlggldk fakfnvtvlc
1021 ftdpgdgvas qlvpvetldd vpdeitalhf ddvsdrsvkv lwapprfang iltgytvryq
1081 vkdrpetmkf fnltaddnel tvnqlqatth ywfevcawtr vgsgppktat iqsgvepvlp
1141 hapttlalsn ieafsvvlqf tpgfdgnssi tkwkveaqta rnmtwftlce isdpdaetlt
1201 vtglmpftqy rlrlsatnvv gssrpsdptk dfqtiqakpm hapfnvtvra msalqlrvrw
1261 iplqqmewfg nprgynvtyr qmertgkpsk hpprsvmied htanshvleg leewtlyevi
1321 mnacndvgcs ldsglamert reavpsygpl hveanatsst tvvvrwgeip phhrngqidg
1381 ykvyyaater gmqvlyktip nnssftttlt elqkfvvyhv qvlaytrlgn galstppirv
1441 qtfedtpgsp snvsfpdvtf smariiwdvp mdpngeilay qvtytlngsa nlnysrefpp
1501 sdrtfratgl mperyysfsv taqtrlgwgk tasvlvyttn nrdrpqapsg pqvsrsqiqa
1561 hqitfnwtpg rdgfaplryy tvemrenegr wqplpervdp tlssytalgl rpyttyqfri
1621 qatndlgpsa fsresivvrt lpaapavgvg glkvvpittt svrvqwgale tgmwngdaat
1681 ggyrilyqql sdfapalqst pktdvmgine nsvvlsdlqq drnyeivvlp fnsqgpgpat
1741 pptavyvgea vptgeprgvd ataisstevr lswkppkqss qngeilgyki fylvtwspqa
1801 lepgrkfeee ievvsatats hslvfldkft eyriqllafn pagdgprsap vtaktmpgvp
1861 saplnlrfsd itmqslevtw dppkllngei vgylvtyett eenekfskqv kqkvsnttlr
1921 vqnleeevty tftvraqtnd ygpavsanvt tgpqdgspva prdltltktl ssvevhwvng
1981 psgrgpilgy lieakkreng epsfisnrpp ylrlddsrwt kieqsrkgtm keftvsyhil
2041 mpstaylfrv iaynkygisf pvyskdsilt psklhleygy lqhkpfyrqt wfmvslaats
2101 iviivmviav lcvksksyky kqeaqktlee smamsiderq elalelyrsr hgvgtgtlns
2161 vgtlrsgtlg tlgrkstnrh qpvsvhlgks pprpspasva yhsdeeslkc ydenpddssv
2221 tekpsevsss easqhsesen esvrsdphsf vnhyanvnds lrqswkktkp vrnyssytds
2281 epegsavmsl nggqiivnnm arsraplpgf ssfv