PREDICTED: Drosophila obscura trypsin-2 (LOC111081606), mRNA.
LOCUS XM_022377714 1213 bp mRNA linear INV 14-MAY-2021
ACCESSION XM_022377714
VERSION XM_022377714.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022377714.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..1213
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..1213
/gene="LOC111081606"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 26 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 13 samples with support for all annotated
introns"
/db_xref="GeneID:111081606"
CDS 75..1127
/gene="LOC111081606"
/codon_start=1
/product="trypsin-2"
/protein_id="XP_022233406.2"
/db_xref="GeneID:111081606"
/translation="MVPISIQLWATTLAIIFLLPTALNGGSVPTAGEARAIRPRFDVD
PGRIINGTEATMEATRHQVGLRRALNDGYYFGTGHICGGALIRHNWVLTAAHCFVDQE
IYDGTFVAKEKFIVVMGNVDRFNRTNSLTFEIDLLVLQLDKFDLSTYDRDIALLRLND
SVPSNHPTIRPIDITALAVPADTVCQVTGWGYTEAGYTSDYLMTVDVPMISEAVCIND
SDLGHLIKPGMVCAGYLEQGERDACSGDSGGPLVCRSQLAGIVSWGIGCAEPNLPGVY
TEVSYYHDWILEQIGEEGEASGEGSGEGSGEGSGDGEGEGSGDGDGGGAKAALAGVLS
LLLPGLLSMRLFSSHL"
misc_feature 213..932
/gene="LOC111081606"
/note="Trypsin-like serine protease; Region: Tryp_SPc;
smart00020"
/db_xref="CDD:214473"
misc_feature 216..218
/gene="LOC111081606"
/note="cleavage site [active]"
/db_xref="CDD:238113"
misc_feature order(360..362,528..530,813..815)
/gene="LOC111081606"
/note="active site"
/db_xref="CDD:238113"
misc_feature order(795..797,858..860,864..866)
/gene="LOC111081606"
/note="substrate binding sites [chemical binding]; other
site"
/db_xref="CDD:238113"
ORIGIN
1 cgcggttggg gtccggttcc gccctggccc agccattagt ttggattgcc acgtcgggca
61 gctagcaacg caaaatggtt ccgatatcga tacaactctg ggcgacgaca ttggccatca
121 tcttcctgct gcccacggcc ctaaatggtg gctctgtgcc aacggctggc gaggctagag
181 cgattcgtcc tcgcttcgat gtggatccgg gcaggattat caatggcact gaggccacca
241 tggaggccac caggcatcag gtgggactga ggagggccct caacgatggc tactactttg
301 gcacgggtca catctgtggt ggagctctga tccgtcacaa ctgggtgctc actgccgccc
361 attgctttgt ggatcaggag atctacgacg gcaccttcgt ggccaaggag aagttcattg
421 tggtgatggg caacgtggat cgcttcaatc gcaccaactc gctgaccttc gagattgatc
481 tgctcgtcct gcagctggac aagtttgatc tgagcaccta cgacagggac attgccctgc
541 tgcggctgaa cgactcagtg ccgagcaatc atcccaccat acggcccatc gatatcaccg
601 cgctggcagt gccagcggac accgtttgcc aggtgacggg ctggggctac acggaggctg
661 gctacacctc ggactacctc atgaccgtcg atgtgcccat gatcagtgag gcggtgtgca
721 tcaacgacag cgatctgggg cacctgatca agcccggcat ggtgtgtgcc ggctatctgg
781 agcagggcga acgggatgcc tgcagtggcg attcgggggg tccactcgtc tgccgcagcc
841 agctggccgg catcgtatcc tggggcattg gctgtgccga gcccaatttg ccgggcgtct
901 acacggaggt ctcctactac cacgactgga tactggagca gatcggcgag gagggggagg
961 cgagtgggga gggtagtggc gagggcagtg gcgagggaag tggtgacggt gaaggtgagg
1021 gaagtggaga tggagatggc ggtggtgcca aggcagccct cgctggcgtc cttagcctcc
1081 tcctgcctgg cctcctctcc atgaggttat tttcaagcca tctctagtgt tctctcgccc
1141 gctttctctc tcttttagta gcaaataaaa tttgtatcta ttacttaagg aaatgtctct
1201 gataaaaaaa ctc