protein sidekick isoform X3 [Drosophila obscura].
LOCUS XP_041447887 2312 aa linear INV 14-MAY-2021
ACCESSION XP_041447887
VERSION XP_041447887.1
DBLINK BioProject: PRJNA728747
DBSOURCE REFSEQ: accession XM_041591953.1
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..2312
/organism="Drosophila obscura"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
Protein 1..2312
/product="protein sidekick isoform X3"
/calculated_mol_wt=256139
Region 83..153
/region_name="Ig_3"
/note="Immunoglobulin domain; pfam13927"
/db_xref="CDD:464046"
Region 272..367
/region_name="Ig"
/note="Immunoglobulin domain; cl11960"
/db_xref="CDD:472250"
Region 290..294
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409353"
Region 305..309
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409353"
Region 330..334
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409353"
Region 344..349
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409353"
Region 358..361
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409353"
Region 371..462
/region_name="Ig"
/note="Immunoglobulin domain; cl11960"
/db_xref="CDD:472250"
Region 389..393
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409543"
Region 402..406
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409543"
Region 428..431
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409543"
Region 441..446
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409543"
Region 454..457
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409543"
Region 527..614
/region_name="I-set"
/note="Immunoglobulin I-set domain; pfam07679"
/db_xref="CDD:400151"
Region 544..548
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409562"
Region 557..561
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409562"
Region 581..584
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409562"
Region 594..599
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409562"
Region 607..610
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409562"
Region 619..708
/region_name="I-set"
/note="Immunoglobulin I-set domain; pfam07679"
/db_xref="CDD:400151"
Region 635..639
/region_name="Ig strand B"
/note="Ig strand B [structural motif]"
/db_xref="CDD:409544"
Region 650..654
/region_name="Ig strand C"
/note="Ig strand C [structural motif]"
/db_xref="CDD:409544"
Region 674..678
/region_name="Ig strand E"
/note="Ig strand E [structural motif]"
/db_xref="CDD:409544"
Region <688..>949
/region_name="FN3"
/note="Fibronectin type 3 domain [General function
prediction only]; COG3401"
/db_xref="CDD:442628"
Region 688..693
/region_name="Ig strand F"
/note="Ig strand F [structural motif]"
/db_xref="CDD:409544"
Region 701..704
/region_name="Ig strand G"
/note="Ig strand G [structural motif]"
/db_xref="CDD:409544"
Region 712..819
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(712,793,808)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(809..810,812..813)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 931..1035
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1024..1025,1027..1028)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Site order(1040,1105,1120)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Region 1041..1124
/region_name="fn3"
/note="Fibronectin type III domain; pfam00041"
/db_xref="CDD:394996"
Region 1138..1227
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1138,1204,1219)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(1220..1221,1223..1224)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1240..1328
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Region 1347..1432
/region_name="fn3"
/note="Fibronectin type III domain; pfam00041"
/db_xref="CDD:394996"
Region 1420..1971
/region_name="FN3"
/note="Fibronectin type 3 domain [General function
prediction only]; COG3401"
/db_xref="CDD:442628"
Site order(1428..1429,1431..1432)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
Region 1956..2062
/region_name="FN3"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; cd00063"
/db_xref="CDD:238020"
Site order(1956,2040,2055)
/site_type="active"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
Site order(2056..2057,2059..2060)
/site_type="active"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
CDS 1..2312
/gene="LOC111066347"
/coded_by="XM_041591953.1:351..7289"
/db_xref="GeneID:111066347"
ORIGIN
1 mkrdqrrssa sslrrrrrwc vdvnekgtrm wlkislsqpl easlfvlaal lllnadscsc
61 yadanpqqqq qlvqqqqqql qaprftthps ssgsivsegs tkilqchalg ypqptyrwlk
121 dgksvgefss sqfyrfhstr redagsyqci akndagsifs eksdvvvaym gifenvtegr
181 ltvvsghpai fdmpaiesvp tpsvlwqsad gslnydikya ftqanqliil svdendrrgy
241 raraintqlg keeisafvhl nvsgdpyiev apeiivrpqd vkvktgtgvl elqcianarp
301 lheletiwlk dglavdttgv rhtlndpwnr tlallqanss hsgeytcqvr lrsggyptvt
361 asarvqilep pvfftpmrae tfgefggqvq lpcdvvgept pqvewfrnae sveanvqsgr
421 yslgedntli ikklilddsa mfqclarnea gensastwlr vkteteteaa nsrikrlaqp
481 rilrvrashg gsgtvagsvt gsgsgsgsts nshqqgrrkq frfasapvfe qppqnvtald
541 gkdatiscra igspnpnvtw iynetqlvei ssrvqilesg dllisnirat daglyicvra
601 neagsvkgea llsvlvrtqi iqppvdtivl lgltatlqck vssdpsvpyn idwyregqma
661 pisnsqrigv qadgqleiqa vrasdvgsys cvvtspggne trsarlsvie lpfppsnvrv
721 erlpepqqrs invswtpgfd gnspiskfii qrrevselek fvgpvpdpll nwitelsnvs
781 anqrwmllen lkaatvyqfr vsavnrvgeg spsepsnvve lpqeapsgpp vgfvgsarsm
841 seiitqwqpp qeehrngqil gyilryrlfg ynnvpwsyqn itneaqrnfl iqelitwkdy
901 ivqiaafnnm gvgvytegak iktkegvpea pptnvrvkal nstaaqitwk ppnpqqingi
961 nqgykiqawq rrqldgeerd merrmmtvpp slidplaeqt tvlggldkfa kfnvtvlcft
1021 dpgdgvasql vpvetlddvp deitalhfdd vsdrsvkvlw apprfangil tgytvryqvk
1081 drpetmkffn ltaddneltv nqlqatthyw fevcawtrvg sgppktatiq sgvepvlpha
1141 pttlalsnie afsvvlqftp gfdgnssitk wkveaqtarn mtwftlceis dpdaetltvt
1201 glmpftqyrl rlsatnvvgs srpsdptkdf qtiqakpmha pfnvtvrams alqlrvrwip
1261 lqqmewfgnp rgynvtyrqm ertgkpskhp prsvmiedht anshvlegle ewtlyevimn
1321 acndvgcsld sglamertre avpsygplhv eanatssttv vvrwgeipph hrngqidgyk
1381 vyyaatergm qvlyktipnn ssftttltel qkfvvyhvqv laytrlgnga lstppirvqt
1441 fedtpgspsn vsfpdvtfsm ariiwdvpmd pngeilayqv tytlngsanl nysrefppsd
1501 rtfratglmp eryysfsvta qtrlgwgkta svlvyttnnr drpqapsgpq vsrsqiqahq
1561 itfnwtpgrd gfaplryytv emrenegrwq plpervdptl ssytalglrp yttyqfriqa
1621 tndlgpsafs resivvrtlp aapavgvggl kvvpitttsv rvqwgaletg mwngdaatgg
1681 yrilyqqlsd fapalqstpk tdvmginens vvlsdlqqdr nyeivvlpfn sqgpgpatpp
1741 tavyvgeavp tgeprgvdat aisstevrls wkppkqssqn geilgykify lvtwspqale
1801 pgrkfeeeie vvsatatshs lvfldkftey riqllafnpa gdgprsapvt aktmpgvpsa
1861 plnlrfsdit mqslevtwdp pkllngeivg ylvtyettee nekfskqvkq kvsnttlrvq
1921 nleeevtytf tvraqtndyg pavsanvttg pqdgspvapr dltltktlss vevhwvngps
1981 grgpilgyli eakkrengep sfisnrppyl rlddsrwtki eqsrkgtmke ftvsyhilmp
2041 staylfrvia ynkygisfpv yskdsiltps klhleygylq hkpfyrqtwf mvslaatsiv
2101 iivmviavlc vksksykykq eaqktleesm amsiderqel alelyrsrhg vgtgtlnsvg
2161 tlrsgtlgtl grkstnrhqp vsvhlgkspp rpspasvayh sdeeslkcyd enpddssvte
2221 kpsevsssea sqhsesenes vrsdphsfvn hyanvndslr qswkktkpvr nyssytdsep
2281 egsavmslng gqiivnnmar sraplpgfss fv