protein sidekick isoform X2 [Drosophila obscura].


LOCUS       XP_041447886            2314 aa            linear   INV 14-MAY-2021
ACCESSION   XP_041447886
VERSION     XP_041447886.1
DBLINK      BioProject: PRJNA728747
DBSOURCE    REFSEQ: accession XM_041591952.1
KEYWORDS    RefSeq.
SOURCE      Drosophila obscura
  ORGANISM  Drosophila obscura
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_024542752.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Drosophila obscura Annotation
                                           Release 101
            Annotation Version          :: 101
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 8.6
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..2314
                     /organism="Drosophila obscura"
                     /isolate="BZ-5 IFL"
                     /db_xref="taxon:7282"
                     /chromosome="Unknown"
                     /sex="male"
                     /tissue_type="whole fly"
                     /dev_stage="Adult fly"
                     /geo_loc_name="Serbia: Babin Zub"
                     /collection_date="2017"
     Protein         1..2314
                     /product="protein sidekick isoform X2"
                     /calculated_mol_wt=256441
     Region          83..153
                     /region_name="Ig_3"
                     /note="Immunoglobulin domain; pfam13927"
                     /db_xref="CDD:464046"
     Region          272..367
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          290..294
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409353"
     Region          305..309
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409353"
     Region          330..334
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409353"
     Region          344..349
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409353"
     Region          358..361
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409353"
     Region          371..462
                     /region_name="Ig"
                     /note="Immunoglobulin domain; cl11960"
                     /db_xref="CDD:472250"
     Region          389..393
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409543"
     Region          402..406
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409543"
     Region          428..431
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409543"
     Region          441..446
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409543"
     Region          454..457
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409543"
     Region          527..614
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          544..548
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409562"
     Region          557..561
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409562"
     Region          581..584
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409562"
     Region          594..599
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409562"
     Region          607..610
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409562"
     Region          619..708
                     /region_name="I-set"
                     /note="Immunoglobulin I-set domain; pfam07679"
                     /db_xref="CDD:400151"
     Region          635..639
                     /region_name="Ig strand B"
                     /note="Ig strand B [structural motif]"
                     /db_xref="CDD:409544"
     Region          650..654
                     /region_name="Ig strand C"
                     /note="Ig strand C [structural motif]"
                     /db_xref="CDD:409544"
     Region          674..678
                     /region_name="Ig strand E"
                     /note="Ig strand E [structural motif]"
                     /db_xref="CDD:409544"
     Region          <688..>951
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain [General function
                     prediction only]; COG3401"
                     /db_xref="CDD:442628"
     Region          688..693
                     /region_name="Ig strand F"
                     /note="Ig strand F [structural motif]"
                     /db_xref="CDD:409544"
     Region          701..704
                     /region_name="Ig strand G"
                     /note="Ig strand G [structural motif]"
                     /db_xref="CDD:409544"
     Region          712..815
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Site            order(712,789,804)
                     /site_type="active"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     Site            order(805..806,808..809)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     Region          933..1037
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Site            order(1026..1027,1029..1030)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     Site            order(1042,1107,1122)
                     /site_type="active"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     Region          1043..1126
                     /region_name="fn3"
                     /note="Fibronectin type III domain; pfam00041"
                     /db_xref="CDD:394996"
     Region          1140..1229
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Site            order(1140,1206,1221)
                     /site_type="active"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     Site            order(1222..1223,1225..1226)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     Region          1242..1330
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Region          1349..1434
                     /region_name="fn3"
                     /note="Fibronectin type III domain; pfam00041"
                     /db_xref="CDD:394996"
     Region          1422..1973
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain [General function
                     prediction only]; COG3401"
                     /db_xref="CDD:442628"
     Site            order(1430..1431,1433..1434)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     Region          1958..2064
                     /region_name="FN3"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; cd00063"
                     /db_xref="CDD:238020"
     Site            order(1958,2042,2057)
                     /site_type="active"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     Site            order(2058..2059,2061..2062)
                     /site_type="active"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     CDS             1..2314
                     /gene="LOC111066347"
                     /coded_by="XM_041591952.1:351..7295"
                     /db_xref="GeneID:111066347"
ORIGIN      
        1 mkrdqrrssa sslrrrrrwc vdvnekgtrm wlkislsqpl easlfvlaal lllnadscsc
       61 yadanpqqqq qlvqqqqqql qaprftthps ssgsivsegs tkilqchalg ypqptyrwlk
      121 dgksvgefss sqfyrfhstr redagsyqci akndagsifs eksdvvvaym gifenvtegr
      181 ltvvsghpai fdmpaiesvp tpsvlwqsad gslnydikya ftqanqliil svdendrrgy
      241 raraintqlg keeisafvhl nvsgdpyiev apeiivrpqd vkvktgtgvl elqcianarp
      301 lheletiwlk dglavdttgv rhtlndpwnr tlallqanss hsgeytcqvr lrsggyptvt
      361 asarvqilep pvfftpmrae tfgefggqvq lpcdvvgept pqvewfrnae sveanvqsgr
      421 yslgedntli ikklilddsa mfqclarnea gensastwlr vkteteteaa nsrikrlaqp
      481 rilrvrashg gsgtvagsvt gsgsgsgsts nshqqgrrkq frfasapvfe qppqnvtald
      541 gkdatiscra igspnpnvtw iynetqlvei ssrvqilesg dllisnirat daglyicvra
      601 neagsvkgea llsvlvrtqi iqppvdtivl lgltatlqck vssdpsvpyn idwyregqma
      661 pisnsqrigv qadgqleiqa vrasdvgsys cvvtspggne trsarlsvie lpfppsnvrv
      721 erlpepqqrs invswtpgfd gnspiskfii qrrevselgp vpdpllnwit elsnvsanqr
      781 wmllenlkaa tvyqfrvsav nrvgegspse psnvvelpqe efplyrapsg ppvgfvgsar
      841 smseiitqwq ppqeehrngq ilgyilryrl fgynnvpwsy qnitneaqrn fliqelitwk
      901 dyivqiaafn nmgvgvyteg akiktkegvp eapptnvrvk alnstaaqit wkppnpqqin
      961 ginqgykiqa wqrrqldgee rdmerrmmtv ppslidplae qttvlggldk fakfnvtvlc
     1021 ftdpgdgvas qlvpvetldd vpdeitalhf ddvsdrsvkv lwapprfang iltgytvryq
     1081 vkdrpetmkf fnltaddnel tvnqlqatth ywfevcawtr vgsgppktat iqsgvepvlp
     1141 hapttlalsn ieafsvvlqf tpgfdgnssi tkwkveaqta rnmtwftlce isdpdaetlt
     1201 vtglmpftqy rlrlsatnvv gssrpsdptk dfqtiqakpm hapfnvtvra msalqlrvrw
     1261 iplqqmewfg nprgynvtyr qmertgkpsk hpprsvmied htanshvleg leewtlyevi
     1321 mnacndvgcs ldsglamert reavpsygpl hveanatsst tvvvrwgeip phhrngqidg
     1381 ykvyyaater gmqvlyktip nnssftttlt elqkfvvyhv qvlaytrlgn galstppirv
     1441 qtfedtpgsp snvsfpdvtf smariiwdvp mdpngeilay qvtytlngsa nlnysrefpp
     1501 sdrtfratgl mperyysfsv taqtrlgwgk tasvlvyttn nrdrpqapsg pqvsrsqiqa
     1561 hqitfnwtpg rdgfaplryy tvemrenegr wqplpervdp tlssytalgl rpyttyqfri
     1621 qatndlgpsa fsresivvrt lpaapavgvg glkvvpittt svrvqwgale tgmwngdaat
     1681 ggyrilyqql sdfapalqst pktdvmgine nsvvlsdlqq drnyeivvlp fnsqgpgpat
     1741 pptavyvgea vptgeprgvd ataisstevr lswkppkqss qngeilgyki fylvtwspqa
     1801 lepgrkfeee ievvsatats hslvfldkft eyriqllafn pagdgprsap vtaktmpgvp
     1861 saplnlrfsd itmqslevtw dppkllngei vgylvtyett eenekfskqv kqkvsnttlr
     1921 vqnleeevty tftvraqtnd ygpavsanvt tgpqdgspva prdltltktl ssvevhwvng
     1981 psgrgpilgy lieakkreng epsfisnrpp ylrlddsrwt kieqsrkgtm keftvsyhil
     2041 mpstaylfrv iaynkygisf pvyskdsilt psklhleygy lqhkpfyrqt wfmvslaats
     2101 iviivmviav lcvksksyky kqeaqktlee smamsiderq elalelyrsr hgvgtgtlns
     2161 vgtlrsgtlg tlgrkstnrh qpvsvhlgks pprpspasva yhsdeeslkc ydenpddssv
     2221 tekpsevsss easqhsesen esvrsdphsf vnhyanvnds lrqswkktkp vrnyssytds
     2281 epegsavmsl nggqiivnnm arsraplpgf ssfv