protein sidekick isoform X5 [Drosophila obscura].


LOCUS       XP_041447889            2309 aa            linear   INV 14-MAY-2021
ACCESSION   XP_041447889
VERSION     XP_041447889.1
DBLINK      BioProject: PRJNA728747
DBSOURCE    REFSEQ: accession XM_041591955.1
KEYWORDS    RefSeq.
SOURCE      Drosophila obscura
  ORGANISM  Drosophila obscura
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_024542752.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Drosophila obscura Annotation
                                           Release 101
            Annotation Version          :: 101
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 8.6
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..2309
                     /organism="Drosophila obscura"
                     /isolate="BZ-5 IFL"
                     /db_xref="taxon:7282"
                     /chromosome="Unknown"
                     /sex="male"
                     /tissue_type="whole fly"
                     /dev_stage="Adult fly"
                     /geo_loc_name="Serbia: Babin Zub"
                     /collection_date="2017"
     Protein         1..2309
                     /product="protein sidekick isoform X5"
                     /calculated_mol_wt=255896
     Region          83..153
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          272..367
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          290..294
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          305..309
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          330..334
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          344..349
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          358..361
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          371..462
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          389..393
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409543"
     Region          402..406
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409543"
     Region          428..431
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409543"
     Region          441..446
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409543"
     Region          454..457
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409543"
     Region          527..614
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          544..548
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409562"
     Region          557..561
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409562"
     Region          581..584
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409562"
     Region          594..599
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409562"
     Region          607..610
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409562"
     Region          619..708
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          635..639
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409544"
     Region          650..654
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409544"
     Region          674..678
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409544"
     Region          688..693
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409544"
     Region          701..704
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409544"
     Region          712..819
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Site            order(712,793,808)
                     /site_type="active"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     Site            order(809..810,812..813)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     Region          834..929
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Site            order(918..919,921..922)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     Region          937..1041
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Site            order(1030..1031,1033..1034)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     Site            order(1046,1111,1126)
                     /site_type="active"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     Region          1047..1130
                     /region_name="fn3"
                     /note="Fibronectin type III domain; pfam00041"
                     /db_xref="CDD:394996"
     Region          1144..1233
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Site            order(1144,1210,1225)
                     /site_type="active"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     Site            order(1226..1227,1229..1230)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     Region          1246..1334
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Region          1353..1438
                     /region_name="fn3"
                     /note="Fibronectin type III domain; pfam00041"
                     /db_xref="CDD:394996"
     Region          1426..1977
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain [General function
                     prediction only]; COG3401"
                     /db_xref="CDD:442628"
     Site            order(1434..1435,1437..1438)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     Region          1962..2059
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Site            order(1962,2037,2052)
                     /site_type="active"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     Site            order(2053..2054,2056..2057)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     CDS             1..2309
                     /gene="LOC111066347"
                     /coded_by="XM_041591955.1:351..7280"
                     /db_xref="GeneID:111066347"
ORIGIN      
        1 mkrdqrrssa sslrrrrrwc vdvnekgtrm wlkislsqpl easlfvlaal lllnadscsc
       61 yadanpqqqq qlvqqqqqql qaprftthps ssgsivsegs tkilqchalg ypqptyrwlk
      121 dgksvgefss sqfyrfhstr redagsyqci akndagsifs eksdvvvaym gifenvtegr
      181 ltvvsghpai fdmpaiesvp tpsvlwqsad gslnydikya ftqanqliil svdendrrgy
      241 raraintqlg keeisafvhl nvsgdpyiev apeiivrpqd vkvktgtgvl elqcianarp
      301 lheletiwlk dglavdttgv rhtlndpwnr tlallqanss hsgeytcqvr lrsggyptvt
      361 asarvqilep pvfftpmrae tfgefggqvq lpcdvvgept pqvewfrnae sveanvqsgr
      421 yslgedntli ikklilddsa mfqclarnea gensastwlr vkteteteaa nsrikrlaqp
      481 rilrvrashg gsgtvagsvt gsgsgsgsts nshqqgrrkq frfasapvfe qppqnvtald
      541 gkdatiscra igspnpnvtw iynetqlvei ssrvqilesg dllisnirat daglyicvra
      601 neagsvkgea llsvlvrtqi iqppvdtivl lgltatlqck vssdpsvpyn idwyregqma
      661 pisnsqrigv qadgqleiqa vrasdvgsys cvvtspggne trsarlsvie lpfppsnvrv
      721 erlpepqqrs invswtpgfd gnspiskfii qrrevselek fvgpvpdpll nwitelsnvs
      781 anqrwmllen lkaatvyqfr vsavnrvgeg spsepsnvve lpqeefplyr apsgppvgfv
      841 gsarsmseii tqwqppqeeh rngqilgyil ryrlfgynnv pwsyqnitne aqrnfliqel
      901 itwkdyivqi aafnnmgvgv ytegakiktk egvpeapptn vrvkalnsta aqitwkppnp
      961 qqinginqgy kiqawqrrql dgeerdmerr mmtvppslid plaeqttvlg gldkfakfnv
     1021 tvlcftdpgd gvasqlvpve tlddvpdeit alhfddvsdr svkvlwappr fangiltgyt
     1081 vryqvkdrpe tmkffnltad dneltvnqlq atthywfevc awtrvgsgpp ktatiqsgve
     1141 pvlphapttl alsnieafsv vlqftpgfdg nssitkwkve aqtarnmtwf tlceisdpda
     1201 etltvtglmp ftqyrlrlsa tnvvgssrps dptkdfqtiq akpmhapfnv tvramsalql
     1261 rvrwiplqqm ewfgnprgyn vtyrqmertg kpskhpprsv miedhtansh vlegleewtl
     1321 yevimnacnd vgcsldsgla mertreavps ygplhveana tssttvvvrw geipphhrng
     1381 qidgykvyya atergmqvly ktipnnssft ttltelqkfv vyhvqvlayt rlgngalstp
     1441 pirvqtfedt pgspsnvsfp dvtfsmarii wdvpmdpnge ilayqvtytl ngsanlnysr
     1501 efppsdrtfr atglmperyy sfsvtaqtrl gwgktasvlv yttnnrdrpq apsgpqvsrs
     1561 qiqahqitfn wtpgrdgfap lryytvemre negrwqplpe rvdptlssyt alglrpytty
     1621 qfriqatndl gpsafsresi vvrtlpaapa vgvgglkvvp itttsvrvqw galetgmwng
     1681 daatggyril yqqlsdfapa lqstpktdvm ginensvvls dlqqdrnyei vvlpfnsqgp
     1741 gpatpptavy vgeavptgep rgvdataiss tevrlswkpp kqssqngeil gykifylvtw
     1801 spqalepgrk feeeievvsa tatshslvfl dkfteyriql lafnpagdgp rsapvtaktm
     1861 pgvpsaplnl rfsditmqsl evtwdppkll ngeivgylvt yetteenekf skqvkqkvsn
     1921 ttlrvqnlee evtytftvra qtndygpavs anvttgpqdg spvaprdltl tktlssvevh
     1981 wvngpsgrgp ilgylieakk rengepsfiy dsrwtkieqs rkgtmkeftv syhilmpsta
     2041 ylfrviaynk ygisfpvysk dsiltpsklh leygylqhkp fyrqtwfmvs laatsiviiv
     2101 mviavlcvks ksykykqeaq ktleesmams iderqelale lyrsrhgvgt gtlnsvgtlr
     2161 sgtlgtlgrk stnrhqpvsv hlgkspprps pasvayhsde eslkcydenp ddssvtekps
     2221 evssseasqh sesenesvrs dphsfvnhya nvndslrqsw kktkpvrnys sytdsepegs
     2281 avmslnggqi ivnnmarsra plpgfssfv