LOCUS       XM_022354812            6707 bp    mRNA    linear   INV 14-MAY-2021
DEFINITION  PREDICTED: Drosophila obscura tyrosine-protein phosphatase 10D
            (LOC111066310), mRNA.
ACCESSION   XM_022354812
VERSION     XM_022354812.2
DBLINK      BioProject: PRJNA728747
KEYWORDS    RefSeq; corrected model.
SOURCE      Drosophila obscura
  ORGANISM  Drosophila obscura
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_024542752.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            On May 14, 2021 this sequence version replaced XM_022354812.1.
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Drosophila obscura Annotation
                                           Release 101
            Annotation Version          :: 101
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 8.6
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
            
            ##RefSeq-Attributes-START##
            frameshifts :: corrected 1 indel
            ##RefSeq-Attributes-END##
PRIMARY     REFSEQ_SPAN         PRIMARY_IDENTIFIER PRIMARY_SPAN        COMP
            1-319               JAECWW010000165.1  2450418-2450736
            320-711             JAECWW010000165.1  2452121-2452512
            712-958             JAECWW010000165.1  2459212-2459458
            959-1237            JAECWW010000165.1  2459522-2459800
            1238-1913           JAECWW010000165.1  2461079-2461754
            1914-2356           JAECWW010000165.1  2461815-2462257
            2357-2785           JAECWW010000165.1  2462344-2462772
            2786-5025           JAECWW010000165.1  2462853-2465092
            5026-5284           JAECWW010000165.1  2465171-2465429
            5285-5413           JAECWW010000165.1  2465517-2465645
            5414-5880           JAECWW010000165.1  2466270-2466736
            5881-5977           JAECWW010000165.1  2466739-2466835
            5978-6707           JAECWW010000165.1  2467384-2468113
FEATURES             Location/Qualifiers
     source          1..6707
                     /organism="Drosophila obscura"
                     /mol_type="mRNA"
                     /isolate="BZ-5 IFL"
                     /db_xref="taxon:7282"
                     /chromosome="Unknown"
                     /sex="male"
                     /tissue_type="whole fly"
                     /dev_stage="Adult fly"
                     /geo_loc_name="Serbia: Babin Zub"
                     /collection_date="2017"
     gene            1..6707
                     /gene="LOC111066310"
                     /note="The sequence of the model RefSeq transcript was
                     modified relative to its source genomic sequence to
                     represent the inferred CDS: deleted 2 bases in 1 codon;
                     Derived by automated computational analysis using gene
                     prediction method: Gnomon. Supporting evidence includes
                     similarity to: 5 Proteins, and 98% coverage of the
                     annotated genomic feature by RNAseq alignments, including
                     9 samples with support for all annotated introns"
                     /db_xref="GeneID:111066310"
     CDS             523..5952
                     /gene="LOC111066310"
                     /note="The sequence of the model RefSeq protein was
                     modified relative to its source genomic sequence to
                     represent the inferred CDS: deleted 2 bases in 1 codon"
                     /codon_start=1
                     /product="LOW QUALITY PROTEIN: tyrosine-protein
                     phosphatase 10D"
                     /protein_id="XP_022210504.2"
                     /db_xref="GeneID:111066310"
                     /translation="MDCAAQQQQRRSQPQQQQQEEAPAEAEQQTHGRRRRRQQHTQIQ
                     PSQPHMWIVLSILTIFWPQYSSGADLVINVPNASSNDNAFYRIDYSPPFGYPEPNTTI
                     PASEIGKDIKFSRALPGTEYNFWLFYTNSTHQELLTWTVNITTAPDPPASLSVQLRSS
                     KSAFITWKPPRTGNYSGFRIKVLGLTDLPFNRSYALDDRETLQLSAKELTPGGSYQVQ
                     AYTIYQGKESVAYTSRNFTTKPNTPGKFIVWFRNETTLLVLWQPPFPAGIYTHYKVSI
                     TPDDAIESVLYVEREGEPPGPAQAAFKGLVPGREYNISVQTVSEDETSSVPTTARYLT
                     VPERVLNVTFDEAHTTTSSFRVRWEPPRTYSEFDGYQVMLSTSRRIFNVPRTDATSVT
                     FDYPDILEPGRTYDVVVKTIADNVNSWPATAVVTLRPRPVRTLGGFLDDRSNALHISW
                     EPAETGKQDGYRIRYHEQTNDSEVAVASALDTVIDTALTEYTLDSLQAGRRYLVGVQA
                     LSNGVASNVTYMTRYTRPAAPIIQELRSIPLGLNISWKSDVNSQQDRYEVHYQRNGTR
                     EERTIATNETALTINYLHAGSGYEIKVYAISYGIRSEPHSYFQAVFPKPPLNLTLQTV
                     HTNLVVLHWQPPEGSDFSEYVVRYRTAASPWQRLANVHESHARIEDMHYGERYLIQVN
                     TISFGVESPSPLELNVTMPPHPVSNVVPLVDSRNLTLEWPRPDGHIDYYTLKWWPTDE
                     VDRVEYKNVTQLEDLTSPSVRIPIEDLSPGRQYRFEVQASSNGIRSGITHLSTRTMPL
                     IQSDVFISNARSSDGGGNRAKAITKGNGNGNGEEETITLSYTPTPASSTRFDIYRFSM
                     GDPQIKDKEKMANDTERKLTFTGLSPGRLYNVTVWTVSGGVASLPIQRLYRLHPLPIS
                     GLEATQVAAREISLRWTAPAGEFTDFELQYLSADEEAPQLLQNVTRLTQMTLHGLRPY
                     HNYTFTVVVRSGSSSGSSTGREDGLAPPPNLMRSSAPISASYQTLAAVPGRVDYFQPS
                     DVQPAEVSFVWLLEAREQHGPIDYFRISCQNIDDAADVVASHEFPVNATHGRIDNLVP
                     GNKYIFRLQAKSALGFGAERDHVQVMPILAPPVPGPNVTPLEVGRTSSTIEISYRQSY
                     FSNAHGLVRFYTIIVAEDVGKNASGLEMPSWHDVQSYAVWLPYQAIEPYNPFGGGAAS
                     NASVVRRSMSMSSGSVMERFTIGSANCDKQRTGYCNGPLRAGTTYRIKVRAFTDDDKF
                     TDTAYSVPITTDRNDTLIVAVTMVSLVLIAAILLVAYCQRRCHLIRRATKLSRMQDEL
                     AALPEGYVTPNRPIHVKDFGEHYRLMSADSDFRFSEEFEELKHVGRDQACSFANLPCN
                     RPKNRFTNILPYDHSRFKLQPVDDDDGSDYINANYMPGHNSPREFIVTQGPLHSTREE
                     FWRMCWESNSRAIVMLTRCFEKGREKCDQYWPVDRVAMFYGDIKVQLIIDTHFRDWSI
                     SEFMVSRNCESRIMRHFHFTTWPDFGVPEPPQSLVRFVRAFRDAIGTDMRPIIVHCSA
                     GVGRSGTFIALDRILQHIHKSDYVDIFGIVFAMRKERVFMVQTEQQYVCIHQCLLAVL
                     EGKEHLLADSLELHANDGYEERPNQQLQQQHQQQQQMKPKMGTLGAVMGAKTLRASLA
                     LAEELDQELMGKPEEEHAMAEVSLSDSINKPNEKADQDAEDDEEDDDDDDDDDDDDDD
                     DDQQPLNNETTATLSTASSSSISEDQVDNVEQQQQQQHNKQRDQGRICTKSDADTDED
                     DTDEDEDKEDGDGAKRREADEDGWW"
     misc_feature    961..1236
                     /gene="LOC111066310"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(961..963,1153..1155,1198..1200)
                     /gene="LOC111066310"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(1201..1206,1210..1215)
                     /gene="LOC111066310"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(1240..1242,1441..1443,1486..1488)
                     /gene="LOC111066310"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    1249..1494
                     /gene="LOC111066310"
                     /note="Fibronectin type III domain; Region: fn3;
                     pfam00041"
                     /db_xref="CDD:394996"
     misc_feature    1468..>2778
                     /gene="LOC111066310"
                     /note="Fibronectin type 3 domain [General function
                     prediction only]; Region: FN3; COG3401"
                     /db_xref="CDD:442628"
     misc_feature    order(1489..1494,1498..1503)
                     /gene="LOC111066310"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    3022..3210
                     /gene="LOC111066310"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cl21522"
                     /db_xref="CDD:473895"
     misc_feature    3256..3453
                     /gene="LOC111066310"
                     /note="Fibronectin type III domain; Region: fn3;
                     pfam00041"
                     /db_xref="CDD:394996"
     misc_feature    3583..3861
                     /gene="LOC111066310"
                     /note="Fibronectin type 3 domain; One of three types of
                     internal repeats found in the plasma protein fibronectin.
                     Its tenth fibronectin type III repeat contains an RGD cell
                     recognition sequence in a flexible loop between 2 strands.
                     Approximately 2% of all...; Region: FN3; cd00063"
                     /db_xref="CDD:238020"
     misc_feature    order(3583..3585,3784..3786,3829..3831)
                     /gene="LOC111066310"
                     /note="Interdomain contacts [active]"
                     /db_xref="CDD:238020"
     misc_feature    order(3832..3837,3841..3846)
                     /gene="LOC111066310"
                     /note="Cytokine receptor motif [active]"
                     /db_xref="CDD:238020"
     misc_feature    3952..4338
                     /gene="LOC111066310"
                     /note="TM proximal of protein tyrosine phosphatase,
                     receptor type J; Region: PTP_tm; pfam18861"
                     /db_xref="CDD:465889"
     misc_feature    4669..5334
                     /gene="LOC111066310"
                     /note="catalytic domain of R3 subfamily receptor-type
                     tyrosine-protein phosphatases and similar proteins;
                     Region: R3-PTPc; cd14548"
                     /db_xref="CDD:350396"
ORIGIN      
        1 gccgtacgac aacaaatact tcgatcgcgg cggcggtata tttggatctc gtgctgttgt
       61 gcgaaatgtc agtaactagt ttctggtaaa tctcgctgta aattcgaccg ttttctgtaa
      121 acagtttggc aagcgagatt tttttttgtg tgtgtgtccc ttttttttaa cctagtgttt
      181 acgacgcgct tgaattagta cgcgatcgtt gttatttatt tggatttact gtttgtgctg
      241 ccgccgccac cgctgccagc gtcatcgtgt acgccgccgt cgctgccact gcgcagacgt
      301 tgggcgggat ttgtggcagc tccaccccca ttgaaaacaa gtgcaatatt aattaagtaa
      361 tgaccggttg agcacatgta catatcgtat gtacatgcat acataaatgc atatatagag
      421 tacatacaga tgcccggata aatgacaaat gatgttataa cagcaaatag caaaagcaac
      481 agcaaattct cgcacatctc cgggttttac tgtacctgaa atatggactg cgcagctcaa
      541 caacaacagc ggcggtcaca accgcaacaa caacaacaag aagaagcacc agcagaggca
      601 gaacagcaga cacatggacg aagacggcgg cgacaacaac atacacaaat acagccgtcg
      661 cagccacaca tgtggattgt attgagcatt ttgacaatat tttggccgca gtattcgagt
      721 ggtgctgatc tggtgatcaa tgttcccaat gcgagcagca atgacaatgc cttctacagg
      781 atcgactaca gtccgccgtt tggctatccg gaaccgaaca ccacgatacc ggccagcgag
      841 attggcaagg acattaagtt ctcccgtgcc ctgcccggca ccgagtacaa cttttggctg
      901 ttttacacaa actccacgca ccaggagctg ctcacctgga cggtgaacat aacaactgct
      961 cccgatccgc cagcgagtct gagcgtacag ttgcgctcca gcaagagtgc attcatcaca
     1021 tggaagcccc caaggacggg caactattcg ggctttcgca tcaaggtgct gggcctcacc
     1081 gatctgccct tcaatcggag ctatgcgctg gacgacagag agacgctcca gttgtcggcc
     1141 aaggagctga caccgggcgg cagctaccaa gtgcaggcct ataccatcta ccagggcaaa
     1201 gagtcggtgg cctatacgag tcgcaacttt accacaaagc cgaatacgcc ggggaagttc
     1261 attgtgtggt tccgaaacga gacgacgctc ctggtgctgt ggcagccgcc ctttccggct
     1321 ggcatctaca cccactacaa ggtctcgatc acaccggacg atgccatcga gagcgtgttg
     1381 tatgtggagc gcgagggcga gccgccgggt ccggcgcagg cagcattcaa gggcctcgtc
     1441 ccagggcggg aatacaatat atcggtgcag acggtgtccg aggatgagac ctcgtctgtg
     1501 cccaccacag cccgctacct cacggtgccg gagcgtgtgt tgaatgtgac cttcgacgag
     1561 gcccacacca ccaccagctc gttccgggtg cgctgggagc cgccgcgcac ttacagcgaa
     1621 ttcgatggct accaggtgat gttgtcgaca tcgcggcgaa tcttcaatgt gccgcgcacg
     1681 gacgccacca gcgtcacctt cgactacccc gatattctgg agccgggacg cacctacgat
     1741 gtggtggtga agaccattgc cgacaatgtc aactcgtggc cggccactgc ggtggtgacg
     1801 ctgcgtccac gacccgttcg cacattgggc ggcttcctgg acgatcgcag caatgcgctg
     1861 cacatctcct gggagccggc ggagactgga aaacaggatg gttatcgtat tagataccat
     1921 gagcaaacga acgacagcga agtggctgtg gcatcagcgc tggacacagt gatagacacc
     1981 gctctcaccg agtacacgct ggactctctg caggcgggta gacgctatct ggtcggggtg
     2041 caggccctct ccaatggcgt cgcctcgaat gtcacataca tgacacgcta cacacggccg
     2101 gcagcgccca tcatccagga gctgcgcagc atacccttgg gcctgaatat cagctggaag
     2161 agcgatgtga attcacagca ggatcgctac gaggtgcact accagcggaa tggtacgcgc
     2221 gaggagcgca ccatagccac caacgagacg gcactgacga tcaactatct gcacgcgggc
     2281 tccggctatg agatcaaggt gtacgccatt agctatggca tacgcagtga gccgcactcc
     2341 tacttccagg ccgtctttcc caagccgccg ctgaatctca cactgcaaac ggtgcacaca
     2401 aatctggtgg tgctgcattg gcagccgccg gagggcagcg atttcagcga gtatgtagtg
     2461 cgatatcgca cggctgcatc gccgtggcag cggctggcca atgtccatga gagtcatgcc
     2521 cgcatcgagg atatgcacta cggggagcga tatctcatcc aggtgaacac catcagcttt
     2581 ggggtggaga gtcccagtcc actggagctg aatgtcacca tgccgccgca tccagtgtcg
     2641 aatgtggtgc cgctggtgga ctcgcgcaac ctcaccctgg aatggccccg tcccgatggc
     2701 catatcgatt actataccct caagtggtgg cccaccgatg aggtggatcg tgtcgaatac
     2761 aagaatgtca cacagctgga ggatttgaca tcgcccagcg tgcggatacc cattgaggat
     2821 ttgtcgcccg gcagacagta cagatttgag gtgcaggcca gctccaatgg catacgatcg
     2881 ggcatcacgc atctctccac gcgcaccatg cccctcatcc agtcggatgt gttcatttcg
     2941 aatgcccgat catcagacgg aggggggaac agggcgaaag cgataacgaa aggcaacggc
     3001 aatggcaatg gcgaggaaga gaccatcaca ttgagctaca caccgacacc ggcatcgagc
     3061 actcgcttcg acatctatcg cttctcgatg ggcgatcccc agatcaagga caaggagaag
     3121 atggccaacg atacggaacg aaagctgacg tttacgggtc tgtcgccggg acgtctgtac
     3181 aatgtgaccg tgtggacagt gagcggtggc gtggccagcc tgcccatcca gcgcctatac
     3241 aggctccatc cgttgccgat cagcggcctg gaggccacac aggtggcggc acgcgagatc
     3301 agccttcgct ggacggcccc cgcgggcgag tttaccgact ttgagctgca gtatctcagc
     3361 gccgacgagg aggcgccgca gctgctgcag aatgtgacga ggctcacaca gatgacgctc
     3421 catgggctac ggccgtacca caattatacc ttcacggtgg ttgtgcgctc cggcagcagc
     3481 agcggcagca gcacgggtcg agaggatgga ctggcaccgc cgcccaacct gatgcgcagc
     3541 agtgccccga tctcggccag ctatcagaca ctggccgcgg tgcccggccg cgtcgactac
     3601 ttccagccga gcgatgtgca gcccgcggag gtgagctttg tgtggctgct ggaggcgcgt
     3661 gagcagcacg gacccatcga ttatttccgc atcagctgcc agaacatcga cgatgcggcg
     3721 gatgttgtgg ccagccacga gtttccggtc aacgccaccc acgggcggat cgataatctg
     3781 gtgcccggca acaagtacat ctttcgcctt caagcgaaat ccgctttggg tttcggcgcc
     3841 gagcgcgacc atgtccaggt aatgcccatt ctggcgccgc ctgtcccggg gccgaatgtg
     3901 acgcccctgg aggtggggcg gacgagcagc accattgaga taagctaccg gcagagctac
     3961 ttctccaatg cccatggcct ggtgcgcttc tacaccatca ttgtggcgga ggatgtgggc
     4021 aagaatgcgt ccggcctgga gatgcccagc tggcacgatg tgcagtcgta tgcggtgtgg
     4081 ctgccctatc aggccatcga gccctacaat ccctttggag ggggggccgc gtcgaatgcg
     4141 agtgtcgtcc gcaggagcat gagcatgagc agtggctcag tcatggagcg cttcacgatc
     4201 ggctcggcca actgcgataa acagcgcact ggatattgca atgggccgct gcgggcgggc
     4261 accacatacc ggatcaaggt gcgcgccttc accgacgacg acaagttcac ggacaccgcg
     4321 tacagtgtgc ccatcacaac ggatcgcaac gacacgctca ttgtggctgt gacgatggtg
     4381 tcgctggtgc tgatcgccgc cattctactg gtcgcctact gtcagcggcg ctgtcatctg
     4441 attcgtcgcg ccaccaagct gtcgcgaatg caggacgagc tggccgctct gcccgagggc
     4501 tatgtgacgc ccaatcggcc cattcatgtg aaggacttcg gggagcacta tcgcctgatg
     4561 tccgccgact cggactttcg cttcagcgag gagttcgagg agctgaagca tgtgggacgc
     4621 gaccaggcct gcagtttcgc caatctgccg tgcaatcggc ccaagaatcg cttcaccaac
     4681 atcctgccct acgaccattc acgcttcaag ctgcagccgg tggacgatga cgatggctcg
     4741 gactatatca atgccaacta catgcccggc cataattcgc cgcgtgagtt tatcgtcacc
     4801 caggggccgt tacactcgac gcgcgaggag ttctggcgca tgtgctggga gagcaactcg
     4861 cgggccattg tcatgctgac gcgctgcttc gagaaggggc gtgaaaagtg cgaccagtat
     4921 tggcccgtcg atcgtgtggc catgttctat ggcgacatca aggtccagct gatcatcgat
     4981 acgcatttcc gggactggag catctccgag ttcatggtct ccaggaactg cgaatcgcgg
     5041 atcatgaggc acttccattt caccacttgg ccggactttg gggtgccgga gccgccgcag
     5101 tcgctggtgc gtttcgtgcg tgcctttcgc gatgccatcg gcaccgatat gcgtcccatt
     5161 attgtccact gcagcgctgg cgtcggcaga tcgggcacat ttattgccct cgatcggata
     5221 ctgcagcaca tccacaagtc cgactatgtg gatatatttg gcattgtgtt tgccatgcgc
     5281 aaggagcgcg tatttatggt gcagacggaa cagcagtatg tgtgcatcca tcagtgcctg
     5341 ctggcggtgc tcgagggcaa ggagcatctg ctggccgatt cgttggagct gcatgccaat
     5401 gatggctacg aggagcgccc gaatcagcaa cttcagcagc agcaccagca gcagcagcaa
     5461 atgaaaccca aaatgggaac ccttggagcg gtcatgggag ccaagacctt gagggcatca
     5521 ctggcactgg ccgaggagtt ggaccaggag ttgatgggaa agccggagga ggagcacgcg
     5581 atggcagaag tgtcactgtc ggacagcatc aataaaccga atgaaaaagc tgatcaggat
     5641 gccgaggatg acgaggaaga tgatgacgat gacgatgatg atgatgatga tgatgatgat
     5701 gatgatcagc agccattgaa caatgagacg acagccacac tgtctacagc cagcagtagc
     5761 agtattagtg aggatcaggt ggataatgtg gaacagcagc agcagcagca gcacaacaag
     5821 cagcgtgacc agggaagaat ttgcaccaag tccgatgcag ataccgacga agatgatacg
     5881 gacgaggatg aggacaagga agatggagat ggggccaagc gtcgcgaggc tgatgaggat
     5941 ggatggtggt agtgataaga gaaaggtccg cttttagatg acgaaggaat cgccgagacg
     6001 ggcatctaac gaatggaaat caatggatgt atcacacaca caaacacaca tcgatatata
     6061 tatatattat atacagtagt agtagctatt agatattagc tattagtgtt agttacatat
     6121 ttttttaaca tatctacttg tacgtaaaat ctgcctgtga tgtgctagat gtgatttgtc
     6181 tacagcggca tactacggag ctgctgggga gaggagaaga gtcagagaag agtctgtctg
     6241 ccattatgct cgccgctcca tgccgccaaa actagataaa aaaccacaag agcgtaacct
     6301 taagcttcag gtgtaaccat tatcccttat agtcccccac cccccctagt ttgttctcct
     6361 attatccaga aattgtgtat agtttttgta aatttattct gtgccccctt tttgatcagc
     6421 aacgatcgat gttgtgcgac tactactcta attatttgta ttaccctccc cacgcccacg
     6481 aactacgtat tgtacgatca attgtgtttt atgttattat tattattatt atgtagctta
     6541 aatacgtttt atagttagac tttagtcagc atttagtcag agaaatgttt agctctagca
     6601 acagtctccg gcctgtgtgt cctggccata taggagatgg ctcatggctc aggggcagca
     6661 ggcagcagga acagcaatag ctagcttttt agcatttatt ttaactt