protein sidekick isoform X3 [Drosophila obscura].


LOCUS       XP_041447887            2312 aa            linear   INV 14-MAY-2021
ACCESSION   XP_041447887
VERSION     XP_041447887.1
DBLINK      BioProject: PRJNA728747
DBSOURCE    REFSEQ: accession XM_041591953.1
KEYWORDS    RefSeq.
SOURCE      Drosophila obscura
  ORGANISM  Drosophila obscura
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_024542752.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Drosophila obscura Annotation
                                           Release 101
            Annotation Version          :: 101
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 8.6
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..2312
                     /organism="Drosophila obscura"
                     /isolate="BZ-5 IFL"
                     /db_xref="taxon:7282"
                     /chromosome="Unknown"
                     /sex="male"
                     /tissue_type="whole fly"
                     /dev_stage="Adult fly"
                     /geo_loc_name="Serbia: Babin Zub"
                     /collection_date="2017"
     Protein         1..2312
                     /product="protein sidekick isoform X3"
                     /calculated_mol_wt=256139
     Region          83..153
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          272..367
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          290..294
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          305..309
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          330..334
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          344..349
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          358..361
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          371..462
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          389..393
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409543"
     Region          402..406
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409543"
     Region          428..431
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409543"
     Region          441..446
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409543"
     Region          454..457
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409543"
     Region          527..614
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          544..548
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409562"
     Region          557..561
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409562"
     Region          581..584
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409562"
     Region          594..599
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409562"
     Region          607..610
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409562"
     Region          619..708
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          635..639
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409544"
     Region          650..654
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409544"
     Region          674..678
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409544"
     Region          <688..>949
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain [General function
                     prediction only]; COG3401"
                     /db_xref="CDD:442628"
     Region          688..693
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409544"
     Region          701..704
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409544"
     Region          712..819
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Site            order(712,793,808)
                     /site_type="active"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     Site            order(809..810,812..813)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     Region          931..1035
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Site            order(1024..1025,1027..1028)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     Site            order(1040,1105,1120)
                     /site_type="active"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     Region          1041..1124
                     /region_name="fn3"
                     /note="Fibronectin type III domain; pfam00041"
                     /db_xref="CDD:394996"
     Region          1138..1227
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Site            order(1138,1204,1219)
                     /site_type="active"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     Site            order(1220..1221,1223..1224)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     Region          1240..1328
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Region          1347..1432
                     /region_name="fn3"
                     /note="Fibronectin type III domain; pfam00041"
                     /db_xref="CDD:394996"
     Region          1420..1971
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain [General function
                     prediction only]; COG3401"
                     /db_xref="CDD:442628"
     Site            order(1428..1429,1431..1432)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     Region          1956..2062
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Site            order(1956,2040,2055)
                     /site_type="active"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     Site            order(2056..2057,2059..2060)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     CDS             1..2312
                     /gene="LOC111066347"
                     /coded_by="XM_041591953.1:351..7289"
                     /db_xref="GeneID:111066347"
ORIGIN      
        1 mkrdqrrssa sslrrrrrwc vdvnekgtrm wlkislsqpl easlfvlaal lllnadscsc
       61 yadanpqqqq qlvqqqqqql qaprftthps ssgsivsegs tkilqchalg ypqptyrwlk
      121 dgksvgefss sqfyrfhstr redagsyqci akndagsifs eksdvvvaym gifenvtegr
      181 ltvvsghpai fdmpaiesvp tpsvlwqsad gslnydikya ftqanqliil svdendrrgy
      241 raraintqlg keeisafvhl nvsgdpyiev apeiivrpqd vkvktgtgvl elqcianarp
      301 lheletiwlk dglavdttgv rhtlndpwnr tlallqanss hsgeytcqvr lrsggyptvt
      361 asarvqilep pvfftpmrae tfgefggqvq lpcdvvgept pqvewfrnae sveanvqsgr
      421 yslgedntli ikklilddsa mfqclarnea gensastwlr vkteteteaa nsrikrlaqp
      481 rilrvrashg gsgtvagsvt gsgsgsgsts nshqqgrrkq frfasapvfe qppqnvtald
      541 gkdatiscra igspnpnvtw iynetqlvei ssrvqilesg dllisnirat daglyicvra
      601 neagsvkgea llsvlvrtqi iqppvdtivl lgltatlqck vssdpsvpyn idwyregqma
      661 pisnsqrigv qadgqleiqa vrasdvgsys cvvtspggne trsarlsvie lpfppsnvrv
      721 erlpepqqrs invswtpgfd gnspiskfii qrrevselek fvgpvpdpll nwitelsnvs
      781 anqrwmllen lkaatvyqfr vsavnrvgeg spsepsnvve lpqeapsgpp vgfvgsarsm
      841 seiitqwqpp qeehrngqil gyilryrlfg ynnvpwsyqn itneaqrnfl iqelitwkdy
      901 ivqiaafnnm gvgvytegak iktkegvpea pptnvrvkal nstaaqitwk ppnpqqingi
      961 nqgykiqawq rrqldgeerd merrmmtvpp slidplaeqt tvlggldkfa kfnvtvlcft
     1021 dpgdgvasql vpvetlddvp deitalhfdd vsdrsvkvlw apprfangil tgytvryqvk
     1081 drpetmkffn ltaddneltv nqlqatthyw fevcawtrvg sgppktatiq sgvepvlpha
     1141 pttlalsnie afsvvlqftp gfdgnssitk wkveaqtarn mtwftlceis dpdaetltvt
     1201 glmpftqyrl rlsatnvvgs srpsdptkdf qtiqakpmha pfnvtvrams alqlrvrwip
     1261 lqqmewfgnp rgynvtyrqm ertgkpskhp prsvmiedht anshvlegle ewtlyevimn
     1321 acndvgcsld sglamertre avpsygplhv eanatssttv vvrwgeipph hrngqidgyk
     1381 vyyaatergm qvlyktipnn ssftttltel qkfvvyhvqv laytrlgnga lstppirvqt
     1441 fedtpgspsn vsfpdvtfsm ariiwdvpmd pngeilayqv tytlngsanl nysrefppsd
     1501 rtfratglmp eryysfsvta qtrlgwgkta svlvyttnnr drpqapsgpq vsrsqiqahq
     1561 itfnwtpgrd gfaplryytv emrenegrwq plpervdptl ssytalglrp yttyqfriqa
     1621 tndlgpsafs resivvrtlp aapavgvggl kvvpitttsv rvqwgaletg mwngdaatgg
     1681 yrilyqqlsd fapalqstpk tdvmginens vvlsdlqqdr nyeivvlpfn sqgpgpatpp
     1741 tavyvgeavp tgeprgvdat aisstevrls wkppkqssqn geilgykify lvtwspqale
     1801 pgrkfeeeie vvsatatshs lvfldkftey riqllafnpa gdgprsapvt aktmpgvpsa
     1861 plnlrfsdit mqslevtwdp pkllngeivg ylvtyettee nekfskqvkq kvsnttlrvq
     1921 nleeevtytf tvraqtndyg pavsanvttg pqdgspvapr dltltktlss vevhwvngps
     1981 grgpilgyli eakkrengep sfisnrppyl rlddsrwtki eqsrkgtmke ftvsyhilmp
     2041 staylfrvia ynkygisfpv yskdsiltps klhleygylq hkpfyrqtwf mvslaatsiv
     2101 iivmviavlc vksksykykq eaqktleesm amsiderqel alelyrsrhg vgtgtlnsvg
     2161 tlrsgtlgtl grkstnrhqp vsvhlgkspp rpspasvayh sdeeslkcyd enpddssvte
     2221 kpsevsssea sqhsesenes vrsdphsfvn hyanvndslr qswkktkpvr nyssytdsep
     2281 egsavmslng gqiivnnmar sraplpgfss fv