PREDICTED: Drosophila obscura trypsin (LOC121403885), transcript
LOCUS XM_041591966 1320 bp mRNA linear INV 14-MAY-2021
variant X4, mRNA.
ACCESSION XM_041591966
VERSION XM_041591966.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..1320
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..1320
/gene="LOC121403885"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 9 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 5 samples with support for all annotated
introns"
/db_xref="GeneID:121403885"
CDS 71..799
/gene="LOC121403885"
/codon_start=1
/product="trypsin I-P1 isoform X4"
/protein_id="XP_041447900.1"
/db_xref="GeneID:121403885"
/translation="MPCGNNTHNPVRYHDPTMFVVVAGSTQIDQADRHTKEYLVQQII
AHSAYNASSLENDIALLFLNGYVSWQSRAVRAIPLATKAQTEGTTCLINGWAANLFPK
VGKSASLQQAPVPILNKSLCRAIYMLPVSQLCAGFMQGGIDACQGDSGGPLICDGQLA
GIISWGVGCADPGFPGVYTNVSHFVDWIQMVNATLDYSKYTVVGSADLASGAQRTWWL
ALAFLLIALGGSWSGGWNLNWNWG"
misc_feature 101..643
/gene="LOC121403885"
/note="Trypsin-like serine protease; Many of these are
synthesized as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. Alignment contains also inactive enzymes
that have substitutions of the catalytic triad...; Region:
Tryp_SPc; cd00190"
/db_xref="CDD:238113"
misc_feature order(497..499,560..562,566..568)
/gene="LOC121403885"
/note="substrate binding sites [chemical binding]; other
site"
/db_xref="CDD:238113"
ORIGIN
1 tacggattgg gcctcatctg tggcggggca ttgatctcgc agcgtgtggc ctgctcggcg
61 gcccactgct atgccgtgtg gaaacaacac acacaacccg gtgaggtatc acgatccgac
121 catgttcgtg gtggtggcgg gcagcacaca aatcgaccag gccgacaggc acaccaagga
181 gtatctggtg cagcaaatca tcgcccacag cgcctacaat gcctcctcgc tggagaacga
241 catcgcgctg ctcttcctca acggctacgt gtcgtggcag tcgagggccg tgcgagccat
301 tccgctggcc accaaggccc agactgaggg caccacgtgc ctgatcaatg gctgggctgc
361 gaatctcttc ccgaaggtgg gcaaatcggc atcgctgcag caggcgccgg tgcccatcct
421 gaacaagagc ctctgccgtg ccatctacat gctgccagtg tcccagctgt gcgctggctt
481 catgcagggc ggcatcgatg cctgtcaggg tgactctggc ggtcccctaa tctgcgacgg
541 ccagctggcg ggcatcatct cgtggggcgt gggctgtgcg gatcccggct ttcccggcgt
601 ctataccaat gtctcgcact ttgttgattg gatccaaatg gtaaacgcca cgctcgacta
661 ctccaaatac acggtggtgg gctcggcgga tctggccagc ggtgcccaac gtacctggtg
721 gctggccttg gcatttctgc ttatcgctct gggcgggagc tggagcgggg gctggaactt
781 gaactggaac tggggctgat gatgtgcggg ccaagatatg gaatgttgga cagcaataat
841 aaaatcatca ttcagcttgt ctcttctgtc atttaataat ctaatcatct ccgcatacgc
901 acttgtgtat cgattgccca accatagata cagaaatcga attgccgtat gactaaaagg
961 gtgacacaat aaccaatttt aatcagccac aaggactggg acagggagtg ggactgggac
1021 tgggagtgtt gcgcctgccc ctcaagatca aagaccgaga aaacccaacg acctacacct
1081 acctccaaga cccaacaaat atcgttttaa ctgctcgatt ctgtgggttc caatagataa
1141 tatttattaa cggataagag cgcattcaat atatctgatt gttatctgat gatatgccaa
1201 atcaaatcat aattggatca taaatatgta cagacaatac gagaatgaac cgaaaagtac
1261 aaacctcctt ttggattttt gattgttgca attgccgagc ctcgaacctc ttcaaaatat