PREDICTED: Drosophila obscura tyrosine-protein phosphatase 10D
LOCUS XM_022354812 6707 bp mRNA linear INV 14-MAY-2021
(LOC111066310), mRNA.
ACCESSION XM_022354812
VERSION XM_022354812.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq; corrected model.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022354812.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
##RefSeq-Attributes-START##
frameshifts :: corrected 1 indel
##RefSeq-Attributes-END##
PRIMARY REFSEQ_SPAN PRIMARY_IDENTIFIER PRIMARY_SPAN COMP
1-319 JAECWW010000165.1 2450418-2450736
320-711 JAECWW010000165.1 2452121-2452512
712-958 JAECWW010000165.1 2459212-2459458
959-1237 JAECWW010000165.1 2459522-2459800
1238-1913 JAECWW010000165.1 2461079-2461754
1914-2356 JAECWW010000165.1 2461815-2462257
2357-2785 JAECWW010000165.1 2462344-2462772
2786-5025 JAECWW010000165.1 2462853-2465092
5026-5284 JAECWW010000165.1 2465171-2465429
5285-5413 JAECWW010000165.1 2465517-2465645
5414-5880 JAECWW010000165.1 2466270-2466736
5881-5977 JAECWW010000165.1 2466739-2466835
5978-6707 JAECWW010000165.1 2467384-2468113
FEATURES Location/Qualifiers
source 1..6707
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..6707
/gene="LOC111066310"
/note="The sequence of the model RefSeq transcript was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 2 bases in 1 codon;
Derived by automated computational analysis using gene
prediction method: Gnomon. Supporting evidence includes
similarity to: 5 Proteins, and 98% coverage of the
annotated genomic feature by RNAseq alignments, including
9 samples with support for all annotated introns"
/db_xref="GeneID:111066310"
CDS 523..5952
/gene="LOC111066310"
/note="The sequence of the model RefSeq protein was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 2 bases in 1 codon"
/codon_start=1
/product="LOW QUALITY PROTEIN: tyrosine-protein
phosphatase 10D"
/protein_id="XP_022210504.2"
/db_xref="GeneID:111066310"
/translation="MDCAAQQQQRRSQPQQQQQEEAPAEAEQQTHGRRRRRQQHTQIQ
PSQPHMWIVLSILTIFWPQYSSGADLVINVPNASSNDNAFYRIDYSPPFGYPEPNTTI
PASEIGKDIKFSRALPGTEYNFWLFYTNSTHQELLTWTVNITTAPDPPASLSVQLRSS
KSAFITWKPPRTGNYSGFRIKVLGLTDLPFNRSYALDDRETLQLSAKELTPGGSYQVQ
AYTIYQGKESVAYTSRNFTTKPNTPGKFIVWFRNETTLLVLWQPPFPAGIYTHYKVSI
TPDDAIESVLYVEREGEPPGPAQAAFKGLVPGREYNISVQTVSEDETSSVPTTARYLT
VPERVLNVTFDEAHTTTSSFRVRWEPPRTYSEFDGYQVMLSTSRRIFNVPRTDATSVT
FDYPDILEPGRTYDVVVKTIADNVNSWPATAVVTLRPRPVRTLGGFLDDRSNALHISW
EPAETGKQDGYRIRYHEQTNDSEVAVASALDTVIDTALTEYTLDSLQAGRRYLVGVQA
LSNGVASNVTYMTRYTRPAAPIIQELRSIPLGLNISWKSDVNSQQDRYEVHYQRNGTR
EERTIATNETALTINYLHAGSGYEIKVYAISYGIRSEPHSYFQAVFPKPPLNLTLQTV
HTNLVVLHWQPPEGSDFSEYVVRYRTAASPWQRLANVHESHARIEDMHYGERYLIQVN
TISFGVESPSPLELNVTMPPHPVSNVVPLVDSRNLTLEWPRPDGHIDYYTLKWWPTDE
VDRVEYKNVTQLEDLTSPSVRIPIEDLSPGRQYRFEVQASSNGIRSGITHLSTRTMPL
IQSDVFISNARSSDGGGNRAKAITKGNGNGNGEEETITLSYTPTPASSTRFDIYRFSM
GDPQIKDKEKMANDTERKLTFTGLSPGRLYNVTVWTVSGGVASLPIQRLYRLHPLPIS
GLEATQVAAREISLRWTAPAGEFTDFELQYLSADEEAPQLLQNVTRLTQMTLHGLRPY
HNYTFTVVVRSGSSSGSSTGREDGLAPPPNLMRSSAPISASYQTLAAVPGRVDYFQPS
DVQPAEVSFVWLLEAREQHGPIDYFRISCQNIDDAADVVASHEFPVNATHGRIDNLVP
GNKYIFRLQAKSALGFGAERDHVQVMPILAPPVPGPNVTPLEVGRTSSTIEISYRQSY
FSNAHGLVRFYTIIVAEDVGKNASGLEMPSWHDVQSYAVWLPYQAIEPYNPFGGGAAS
NASVVRRSMSMSSGSVMERFTIGSANCDKQRTGYCNGPLRAGTTYRIKVRAFTDDDKF
TDTAYSVPITTDRNDTLIVAVTMVSLVLIAAILLVAYCQRRCHLIRRATKLSRMQDEL
AALPEGYVTPNRPIHVKDFGEHYRLMSADSDFRFSEEFEELKHVGRDQACSFANLPCN
RPKNRFTNILPYDHSRFKLQPVDDDDGSDYINANYMPGHNSPREFIVTQGPLHSTREE
FWRMCWESNSRAIVMLTRCFEKGREKCDQYWPVDRVAMFYGDIKVQLIIDTHFRDWSI
SEFMVSRNCESRIMRHFHFTTWPDFGVPEPPQSLVRFVRAFRDAIGTDMRPIIVHCSA
GVGRSGTFIALDRILQHIHKSDYVDIFGIVFAMRKERVFMVQTEQQYVCIHQCLLAVL
EGKEHLLADSLELHANDGYEERPNQQLQQQHQQQQQMKPKMGTLGAVMGAKTLRASLA
LAEELDQELMGKPEEEHAMAEVSLSDSINKPNEKADQDAEDDEEDDDDDDDDDDDDDD
DDQQPLNNETTATLSTASSSSISEDQVDNVEQQQQQQHNKQRDQGRICTKSDADTDED
DTDEDEDKEDGDGAKRREADEDGWW"
misc_feature 961..1236
/gene="LOC111066310"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(961..963,1153..1155,1198..1200)
/gene="LOC111066310"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(1201..1206,1210..1215)
/gene="LOC111066310"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature order(1240..1242,1441..1443,1486..1488)
/gene="LOC111066310"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature 1249..1494
/gene="LOC111066310"
/note="Fibronectin type III domain; Region: fn3;
pfam00041"
/db_xref="CDD:394996"
misc_feature 1468..>2778
/gene="LOC111066310"
/note="Fibronectin type 3 domain [General function
prediction only]; Region: FN3; COG3401"
/db_xref="CDD:442628"
misc_feature order(1489..1494,1498..1503)
/gene="LOC111066310"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 3022..3210
/gene="LOC111066310"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cl21522"
/db_xref="CDD:473895"
misc_feature 3256..3453
/gene="LOC111066310"
/note="Fibronectin type III domain; Region: fn3;
pfam00041"
/db_xref="CDD:394996"
misc_feature 3583..3861
/gene="LOC111066310"
/note="Fibronectin type 3 domain; One of three types of
internal repeats found in the plasma protein fibronectin.
Its tenth fibronectin type III repeat contains an RGD cell
recognition sequence in a flexible loop between 2 strands.
Approximately 2% of all...; Region: FN3; cd00063"
/db_xref="CDD:238020"
misc_feature order(3583..3585,3784..3786,3829..3831)
/gene="LOC111066310"
/note="Interdomain contacts [active]"
/db_xref="CDD:238020"
misc_feature order(3832..3837,3841..3846)
/gene="LOC111066310"
/note="Cytokine receptor motif [active]"
/db_xref="CDD:238020"
misc_feature 3952..4338
/gene="LOC111066310"
/note="TM proximal of protein tyrosine phosphatase,
receptor type J; Region: PTP_tm; pfam18861"
/db_xref="CDD:465889"
misc_feature 4669..5334
/gene="LOC111066310"
/note="catalytic domain of R3 subfamily receptor-type
tyrosine-protein phosphatases and similar proteins;
Region: R3-PTPc; cd14548"
/db_xref="CDD:350396"
ORIGIN
1 gccgtacgac aacaaatact tcgatcgcgg cggcggtata tttggatctc gtgctgttgt
61 gcgaaatgtc agtaactagt ttctggtaaa tctcgctgta aattcgaccg ttttctgtaa
121 acagtttggc aagcgagatt tttttttgtg tgtgtgtccc ttttttttaa cctagtgttt
181 acgacgcgct tgaattagta cgcgatcgtt gttatttatt tggatttact gtttgtgctg
241 ccgccgccac cgctgccagc gtcatcgtgt acgccgccgt cgctgccact gcgcagacgt
301 tgggcgggat ttgtggcagc tccaccccca ttgaaaacaa gtgcaatatt aattaagtaa
361 tgaccggttg agcacatgta catatcgtat gtacatgcat acataaatgc atatatagag
421 tacatacaga tgcccggata aatgacaaat gatgttataa cagcaaatag caaaagcaac
481 agcaaattct cgcacatctc cgggttttac tgtacctgaa atatggactg cgcagctcaa
541 caacaacagc ggcggtcaca accgcaacaa caacaacaag aagaagcacc agcagaggca
601 gaacagcaga cacatggacg aagacggcgg cgacaacaac atacacaaat acagccgtcg
661 cagccacaca tgtggattgt attgagcatt ttgacaatat tttggccgca gtattcgagt
721 ggtgctgatc tggtgatcaa tgttcccaat gcgagcagca atgacaatgc cttctacagg
781 atcgactaca gtccgccgtt tggctatccg gaaccgaaca ccacgatacc ggccagcgag
841 attggcaagg acattaagtt ctcccgtgcc ctgcccggca ccgagtacaa cttttggctg
901 ttttacacaa actccacgca ccaggagctg ctcacctgga cggtgaacat aacaactgct
961 cccgatccgc cagcgagtct gagcgtacag ttgcgctcca gcaagagtgc attcatcaca
1021 tggaagcccc caaggacggg caactattcg ggctttcgca tcaaggtgct gggcctcacc
1081 gatctgccct tcaatcggag ctatgcgctg gacgacagag agacgctcca gttgtcggcc
1141 aaggagctga caccgggcgg cagctaccaa gtgcaggcct ataccatcta ccagggcaaa
1201 gagtcggtgg cctatacgag tcgcaacttt accacaaagc cgaatacgcc ggggaagttc
1261 attgtgtggt tccgaaacga gacgacgctc ctggtgctgt ggcagccgcc ctttccggct
1321 ggcatctaca cccactacaa ggtctcgatc acaccggacg atgccatcga gagcgtgttg
1381 tatgtggagc gcgagggcga gccgccgggt ccggcgcagg cagcattcaa gggcctcgtc
1441 ccagggcggg aatacaatat atcggtgcag acggtgtccg aggatgagac ctcgtctgtg
1501 cccaccacag cccgctacct cacggtgccg gagcgtgtgt tgaatgtgac cttcgacgag
1561 gcccacacca ccaccagctc gttccgggtg cgctgggagc cgccgcgcac ttacagcgaa
1621 ttcgatggct accaggtgat gttgtcgaca tcgcggcgaa tcttcaatgt gccgcgcacg
1681 gacgccacca gcgtcacctt cgactacccc gatattctgg agccgggacg cacctacgat
1741 gtggtggtga agaccattgc cgacaatgtc aactcgtggc cggccactgc ggtggtgacg
1801 ctgcgtccac gacccgttcg cacattgggc ggcttcctgg acgatcgcag caatgcgctg
1861 cacatctcct gggagccggc ggagactgga aaacaggatg gttatcgtat tagataccat
1921 gagcaaacga acgacagcga agtggctgtg gcatcagcgc tggacacagt gatagacacc
1981 gctctcaccg agtacacgct ggactctctg caggcgggta gacgctatct ggtcggggtg
2041 caggccctct ccaatggcgt cgcctcgaat gtcacataca tgacacgcta cacacggccg
2101 gcagcgccca tcatccagga gctgcgcagc atacccttgg gcctgaatat cagctggaag
2161 agcgatgtga attcacagca ggatcgctac gaggtgcact accagcggaa tggtacgcgc
2221 gaggagcgca ccatagccac caacgagacg gcactgacga tcaactatct gcacgcgggc
2281 tccggctatg agatcaaggt gtacgccatt agctatggca tacgcagtga gccgcactcc
2341 tacttccagg ccgtctttcc caagccgccg ctgaatctca cactgcaaac ggtgcacaca
2401 aatctggtgg tgctgcattg gcagccgccg gagggcagcg atttcagcga gtatgtagtg
2461 cgatatcgca cggctgcatc gccgtggcag cggctggcca atgtccatga gagtcatgcc
2521 cgcatcgagg atatgcacta cggggagcga tatctcatcc aggtgaacac catcagcttt
2581 ggggtggaga gtcccagtcc actggagctg aatgtcacca tgccgccgca tccagtgtcg
2641 aatgtggtgc cgctggtgga ctcgcgcaac ctcaccctgg aatggccccg tcccgatggc
2701 catatcgatt actataccct caagtggtgg cccaccgatg aggtggatcg tgtcgaatac
2761 aagaatgtca cacagctgga ggatttgaca tcgcccagcg tgcggatacc cattgaggat
2821 ttgtcgcccg gcagacagta cagatttgag gtgcaggcca gctccaatgg catacgatcg
2881 ggcatcacgc atctctccac gcgcaccatg cccctcatcc agtcggatgt gttcatttcg
2941 aatgcccgat catcagacgg aggggggaac agggcgaaag cgataacgaa aggcaacggc
3001 aatggcaatg gcgaggaaga gaccatcaca ttgagctaca caccgacacc ggcatcgagc
3061 actcgcttcg acatctatcg cttctcgatg ggcgatcccc agatcaagga caaggagaag
3121 atggccaacg atacggaacg aaagctgacg tttacgggtc tgtcgccggg acgtctgtac
3181 aatgtgaccg tgtggacagt gagcggtggc gtggccagcc tgcccatcca gcgcctatac
3241 aggctccatc cgttgccgat cagcggcctg gaggccacac aggtggcggc acgcgagatc
3301 agccttcgct ggacggcccc cgcgggcgag tttaccgact ttgagctgca gtatctcagc
3361 gccgacgagg aggcgccgca gctgctgcag aatgtgacga ggctcacaca gatgacgctc
3421 catgggctac ggccgtacca caattatacc ttcacggtgg ttgtgcgctc cggcagcagc
3481 agcggcagca gcacgggtcg agaggatgga ctggcaccgc cgcccaacct gatgcgcagc
3541 agtgccccga tctcggccag ctatcagaca ctggccgcgg tgcccggccg cgtcgactac
3601 ttccagccga gcgatgtgca gcccgcggag gtgagctttg tgtggctgct ggaggcgcgt
3661 gagcagcacg gacccatcga ttatttccgc atcagctgcc agaacatcga cgatgcggcg
3721 gatgttgtgg ccagccacga gtttccggtc aacgccaccc acgggcggat cgataatctg
3781 gtgcccggca acaagtacat ctttcgcctt caagcgaaat ccgctttggg tttcggcgcc
3841 gagcgcgacc atgtccaggt aatgcccatt ctggcgccgc ctgtcccggg gccgaatgtg
3901 acgcccctgg aggtggggcg gacgagcagc accattgaga taagctaccg gcagagctac
3961 ttctccaatg cccatggcct ggtgcgcttc tacaccatca ttgtggcgga ggatgtgggc
4021 aagaatgcgt ccggcctgga gatgcccagc tggcacgatg tgcagtcgta tgcggtgtgg
4081 ctgccctatc aggccatcga gccctacaat ccctttggag ggggggccgc gtcgaatgcg
4141 agtgtcgtcc gcaggagcat gagcatgagc agtggctcag tcatggagcg cttcacgatc
4201 ggctcggcca actgcgataa acagcgcact ggatattgca atgggccgct gcgggcgggc
4261 accacatacc ggatcaaggt gcgcgccttc accgacgacg acaagttcac ggacaccgcg
4321 tacagtgtgc ccatcacaac ggatcgcaac gacacgctca ttgtggctgt gacgatggtg
4381 tcgctggtgc tgatcgccgc cattctactg gtcgcctact gtcagcggcg ctgtcatctg
4441 attcgtcgcg ccaccaagct gtcgcgaatg caggacgagc tggccgctct gcccgagggc
4501 tatgtgacgc ccaatcggcc cattcatgtg aaggacttcg gggagcacta tcgcctgatg
4561 tccgccgact cggactttcg cttcagcgag gagttcgagg agctgaagca tgtgggacgc
4621 gaccaggcct gcagtttcgc caatctgccg tgcaatcggc ccaagaatcg cttcaccaac
4681 atcctgccct acgaccattc acgcttcaag ctgcagccgg tggacgatga cgatggctcg
4741 gactatatca atgccaacta catgcccggc cataattcgc cgcgtgagtt tatcgtcacc
4801 caggggccgt tacactcgac gcgcgaggag ttctggcgca tgtgctggga gagcaactcg
4861 cgggccattg tcatgctgac gcgctgcttc gagaaggggc gtgaaaagtg cgaccagtat
4921 tggcccgtcg atcgtgtggc catgttctat ggcgacatca aggtccagct gatcatcgat
4981 acgcatttcc gggactggag catctccgag ttcatggtct ccaggaactg cgaatcgcgg
5041 atcatgaggc acttccattt caccacttgg ccggactttg gggtgccgga gccgccgcag
5101 tcgctggtgc gtttcgtgcg tgcctttcgc gatgccatcg gcaccgatat gcgtcccatt
5161 attgtccact gcagcgctgg cgtcggcaga tcgggcacat ttattgccct cgatcggata
5221 ctgcagcaca tccacaagtc cgactatgtg gatatatttg gcattgtgtt tgccatgcgc
5281 aaggagcgcg tatttatggt gcagacggaa cagcagtatg tgtgcatcca tcagtgcctg
5341 ctggcggtgc tcgagggcaa ggagcatctg ctggccgatt cgttggagct gcatgccaat
5401 gatggctacg aggagcgccc gaatcagcaa cttcagcagc agcaccagca gcagcagcaa
5461 atgaaaccca aaatgggaac ccttggagcg gtcatgggag ccaagacctt gagggcatca
5521 ctggcactgg ccgaggagtt ggaccaggag ttgatgggaa agccggagga ggagcacgcg
5581 atggcagaag tgtcactgtc ggacagcatc aataaaccga atgaaaaagc tgatcaggat
5641 gccgaggatg acgaggaaga tgatgacgat gacgatgatg atgatgatga tgatgatgat
5701 gatgatcagc agccattgaa caatgagacg acagccacac tgtctacagc cagcagtagc
5761 agtattagtg aggatcaggt ggataatgtg gaacagcagc agcagcagca gcacaacaag
5821 cagcgtgacc agggaagaat ttgcaccaag tccgatgcag ataccgacga agatgatacg
5881 gacgaggatg aggacaagga agatggagat ggggccaagc gtcgcgaggc tgatgaggat
5941 ggatggtggt agtgataaga gaaaggtccg cttttagatg acgaaggaat cgccgagacg
6001 ggcatctaac gaatggaaat caatggatgt atcacacaca caaacacaca tcgatatata
6061 tatatattat atacagtagt agtagctatt agatattagc tattagtgtt agttacatat
6121 ttttttaaca tatctacttg tacgtaaaat ctgcctgtga tgtgctagat gtgatttgtc
6181 tacagcggca tactacggag ctgctgggga gaggagaaga gtcagagaag agtctgtctg
6241 ccattatgct cgccgctcca tgccgccaaa actagataaa aaaccacaag agcgtaacct
6301 taagcttcag gtgtaaccat tatcccttat agtcccccac cccccctagt ttgttctcct
6361 attatccaga aattgtgtat agtttttgta aatttattct gtgccccctt tttgatcagc
6421 aacgatcgat gttgtgcgac tactactcta attatttgta ttaccctccc cacgcccacg
6481 aactacgtat tgtacgatca attgtgtttt atgttattat tattattatt atgtagctta
6541 aatacgtttt atagttagac tttagtcagc atttagtcag agaaatgttt agctctagca
6601 acagtctccg gcctgtgtgt cctggccata taggagatgg ctcatggctc aggggcagca
6661 ggcagcagga acagcaatag ctagcttttt agcatttatt ttaactt