PREDICTED: Drosophila obscura uncharacterized LOC111066225


LOCUS       XM_041591923            5697 bp    mRNA    linear   INV 14-MAY-2021
DEFINITION  PREDICTED: Drosophila obscura uncharacterized LOC111066225
            (LOC111066225), transcript variant X2, mRNA.
ACCESSION   XM_041591923
VERSION     XM_041591923.1
DBLINK      BioProject: PRJNA728747
KEYWORDS    RefSeq.
SOURCE      Drosophila obscura
  ORGANISM  Drosophila obscura
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_024542752.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Drosophila obscura Annotation
                                           Release 101
            Annotation Version          :: 101
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 8.6
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..5697
                     /organism="Drosophila obscura"
                     /mol_type="mRNA"
                     /isolate="BZ-5 IFL"
                     /db_xref="taxon:7282"
                     /chromosome="Unknown"
                     /sex="male"
                     /tissue_type="whole fly"
                     /dev_stage="Adult fly"
                     /geo_loc_name="Serbia: Babin Zub"
                     /collection_date="2017"
     gene            1..5697
                     /gene="LOC111066225"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Gnomon. Supporting evidence
                     includes similarity to: 1 Protein, and 99% coverage of the
                     annotated genomic feature by RNAseq alignments"
                     /db_xref="GeneID:111066225"
     CDS             372..5681
                     /gene="LOC111066225"
                     /codon_start=1
                     /product="uncharacterized protein LOC111066225 isoform X2"
                     /protein_id="XP_041447857.1"
                     /db_xref="GeneID:111066225"
                     /translation="MDTVAIKTALPHAPLAAQQQQQQQQQQQQQPQNAQTATINAKNN
                     IGAFFQTCKVLWHLDAFRRSFRGLNQHVCGGQDCIFCALKELFQQLQTSSEPALCPEP
                     LRRALASGPLAGRRFPLGCLGDAAECFELLLHRVHSHISSDDGDSCESSACIAHRRFA
                     MRVIEQSVCKCGANSEQLPFTQMVHYVSASALTSQKSLALQSHQQLSFGQLLRAAGNM
                     GDIRDCPNTCGAKIGIRRALLNRPDVVSIGIVWDSERPAADQVHAVLKAVGTSLRLAD
                     VFHQVSEQRWAQQAQHELVGIVSYYGKHYTTFFFHTKLKVWVYFDDANVKEVGPSWDG
                     VVDKCSRGRYQPLLLLYAVPQPPSSAAGMDQLPVAGSLPSPAASMTAAGLAATVVRRA
                     VTPSPEKPSLGSTRRAITPTPLRTPTNDYQNLSVIQKSIFPSNGAGTDETDAYISRKT
                     VEHVLSAQYQNLSVIQDKINAVPGSVATSNADKDGGFLNRKPIEAMLSAQLARRQHLQ
                     LQRSHSAESSSGHASNGSSPPSDGLTMPEHLNQPRRRDSGNWSGDRNSASSASSTTLD
                     NPYLYMVAKRGVAAVPQSPTRHGLPYDPGYDSFSLSSTDSYPPKHILNPQLAKIPEAA
                     MGMGMGMGMGMGAGVGVGAGNVQGAGVGVGAGVPVGHGLALSGDCEKLCHEADQLLEK
                     SRIVEESHDLETALVLCNAAAGRARAAMDAPYSNPHTMTFARMKHNTCVMRARSLHRR
                     ILVEKGAESEAMPELKHMREGSTSSVKHVRQNSKDRTLEKEQLQMQLQLQQQLQQQLV
                     PSAVSKNIEIYATLPKRKSPLKALAAAASGSGSVGMDDNAIEYEQQVVASSSSPASSQ
                     KPERERESRSLFGRKDKDKDKEKEKRSRSEDRNKLTREFSLTETLLVNAKDTLKKHKE
                     DKDEKKEKDKSGKKQHKIRRKLLMGGLIRRKNRSMPDLTEAVDDAAAPISGPAHHHHH
                     LQLHHPCAPTAGHLMGSSVDDSAVGLGKHALAMSGYISEGHFDYPLGTGPCTSNVGVG
                     SGSGSNPNPNLERSKLMRKSFHGSGRQLTMPVPKVAPPPPMRTSSALTPHQQQQQQQQ
                     QFHHEANLSNLSAMSSNTSISEDSCQTIITTCAQVHSEQSPLKDMQMLGMAQGQEELP
                     LPLPPMELPPYPSPPHSVCHSRQASEDFPPPPPPELDLEPLNQQLSQLQQLEAANKQQ
                     RQQQLDSLEGTTSILAQLQQRQHLLKLRKEQTGAGSDATWLKELQAKQAADLRTMQRK
                     LEASSPSSVRDLAHRFEQTATIRSYASQELLAQPRTQMNGLPSQAKLDADEVDCAGPV
                     RASLPLPLPLPHQLMLPKPKYEMSQSQIAEEIREVELLNSMVQQTLNQSAAAGAPPKR
                     VKKKSVSFCDQVILVATAGNEEDDDFIPNPILERVLRTAQHPNEKVTAQMIQQQQQNM
                     LRLQTEPQPRQQHQQQQQQQQQQQQQQQQQQQQQQQQLQLQHQQQQQQPSPMLGRPAA
                     TDMQRYVQRMQQQQQQQQQQQQQQQEMYRQQQQQQQHRTSMSGIVSQQHLQLQQQQQQ
                     QQQQHQIDAQYTSMPRPLHHPHLLPQTYSVQLQKHQQHQQQLQQQHQQQPSPQLSLGY
                     FSNGQAAASEQVPQPQPLPSPYQRVPLPHGYQPTGGPYYPLPNQQQQQLIIANGKPAQ
                     KKVSFEPGTKGEADCLPPPPPPPQTKLALQNGLPESASPGAGPGSAPAAITAIPTRVY
                     NNAIVKASAKAVECNLCRKRHVIAPAVYCTNCEYYLQMLNQRR"
     misc_feature    498..1364
                     /gene="LOC111066225"
                     /note="Peptidase C19 contains ubiquitinyl hydrolases. They
                     are intracellular peptidases that remove ubiquitin
                     molecules from polyubiquinated peptides by cleavage of
                     isopeptide bonds. They hydrolyse bonds involving the
                     carboxyl group of the C-terminal Gly...; Region:
                     Peptidase_C19; cl02553"
                     /db_xref="CDD:470612"
     misc_feature    <5589..>5657
                     /gene="LOC111066225"
                     /note="protein kinase C conserved region 1 (C1 domain)
                     superfamily; Region: C1; cl00040"
                     /db_xref="CDD:412127"
ORIGIN      
        1 acaacagcaa caaaaacagc aacaacagca acaaaaagtt tagttaagtg aacgcaaaca
       61 ggtgcaaaac gaagcacgag aagaagaacc aaaaaaagtg taagagctgg aagagagaag
      121 aaccagaagt ggaggaaaac aacaacagca aaaaaaaaat agaagcagca gcagcagttg
      181 cccaagaagt ggaagcaaaa gggttgtcga tactcgcaac acttcccttt cgggcttcct
      241 ctggcggaag ttgcaagtgg caacaacagc aacaacagca acaacagcaa caacaacaac
      301 aacaacaacc atgtgctgat ctcaaagcgc ggggaatagc cttactgttc ggttgccaaa
      361 actcattcaa tatggatacg gtggccatta agacagcact gccgcatgcc ccattggctg
      421 cccaacagca gcagcagcaa cagcaacagc agcagcagca accacagaat gcccaaacag
      481 caacaatcaa tgccaaaaac aacattgggg cattttttca gacctgcaag gtcttgtggc
      541 acttggacgc ctttcgacgc tcgttccggg gccttaacca gcatgtatgc ggcggccaag
      601 actgtatatt ctgcgctcta aaggaactct tccagcagct gcagaccagc tcggagcccg
      661 cactctgtcc ggaacccttg cggcgagcct tggccagtgg acccctggcc ggacgacgat
      721 ttccgctggg ctgcctgggc gatgctgccg aatgctttga gctgctgctg catcgcgtcc
      781 actcgcacat ctcgtccgat gatggggact cgtgcgagtc gagtgcctgc attgcccatc
      841 gccgctttgc catgcgcgtc atcgagcaga gtgtctgcaa gtgtggggcc aactccgagc
      901 agcttccctt cacgcagatg gtgcactatg tgtccgcctc ggcccttaca tcgcagaaga
      961 gcctggccct gcagagccac cagcagctga gctttggcca gctgctgaga gctgccggca
     1021 acatgggcga catacgcgac tgtccgaaca cctgtggagc caagatcggg atacgccgag
     1081 cgctgctcaa tcgtcccgat gtcgtctcca ttggcatcgt ttgggactcg gaacgacccg
     1141 ccgccgacca ggtgcatgcc gtgctcaagg cggtcggaac gagtctgcgc ctggcggacg
     1201 tcttccatca ggtcagcgag cagcgctggg cccagcaggc gcagcacgag ctggtgggga
     1261 tcgtctccta ctacggcaag cactacacca ccttcttctt ccacaccaag ctgaaggtgt
     1321 gggtgtactt cgacgacgcc aacgtcaagg aggtgggtcc cagctgggac ggagtggtgg
     1381 acaagtgcag tcgcggccgc taccagccgc tgctgctgct gtacgcggtg ccccagccgc
     1441 ccagctccgc ggcgggcatg gatcagctgc ccgtcgccgg atcgctgccc tcgccggctg
     1501 cctcaatgac ggcggcggga ctggccgcga cggtggtgcg ccgggcggtg acacccagcc
     1561 cggagaagcc ctcgctgggc agcactcgac gggccattac gcccacgccc ctgcgcacgc
     1621 ccaccaatga ctaccagaat ctgagtgtca tccagaagag cattttcccg tcgaatgggg
     1681 cgggcaccga tgagacggat gcctacatca gccgcaagac ggtggagcac gtgctgagcg
     1741 cacagtacca gaacctgagc gtcatccagg acaagatcaa cgccgtgccc ggctcggtgg
     1801 ccacgtccaa tgccgacaag gatgggggct tcctcaatcg gaagcccatc gaggcgatgc
     1861 tgagcgccca gctggcgcgc cggcagcacc tccagctgca gcgcagccac agcgccgagt
     1921 cgagctccgg ccacgcctcc aatggcagct cgccgcccag cgacgggctg accatgcccg
     1981 agcacctgaa ccagccccgg agacgggact cgggcaactg gtcgggggat cgcaacagcg
     2041 ccagctcggc cagctccacc accctggaca atccctacct gtacatggtg gccaagcggg
     2101 gcgtggcggc ggtgccgcag agccccacgc ggcacggcct gccctacgat cccggctacg
     2161 actcgttctc gctgagctcc acggactcct atccgcccaa gcacatcctc aatccgcagc
     2221 tggccaagat acccgaggca gccatgggaa tgggaatggg catgggcatg ggcatgggcg
     2281 ccggagtcgg agttggggcg ggaaacgttc aaggagcagg agtcggagtc ggagcaggag
     2341 tcccagtcgg acatggcctg gccctgtccg gggactgtga gaagctctgc cacgaggcgg
     2401 accagctgct ggagaagtcg cgcatcgtgg aggagtcgca cgacctggag acggccctgg
     2461 tgctgtgcaa tgcggccgcc ggccgggccc gggccgccat ggatgcgccc tacagcaacc
     2521 cgcacacgat gacctttgcc cggatgaagc acaacacgtg cgtgatgcgg gcccgcagcc
     2581 tgcaccggcg catcctcgtg gagaagggcg ccgaatcgga ggccatgccc gagctgaagc
     2641 acatgcggga gggcagcacc agcagcgtga agcacgtccg ccagaacagc aaggaccgca
     2701 ccctggagaa ggagcagctc cagatgcagc tccagctgca gcagcagctg cagcagcagc
     2761 tcgtgccaag cgccgtctcc aagaacatcg agatctacgc gacgctgccc aagcgaaaga
     2821 gcccgctcaa ggcgctggcc gctgccgcct ctggctctgg ttcagtgggc atggatgaca
     2881 atgccatcga gtacgagcag caggtggtgg ccagcagcag cagtccggcc agcagccaga
     2941 agccggagcg ggagcgggag agccgctcgc tgttcgggcg caaggacaag gacaaagaca
     3001 aggagaagga gaagcgcagc cggagcgagg accggaacaa gctcaccagg gagttctcgc
     3061 tgacggagac gctgctggtc aatgccaagg acacgctgaa gaagcacaag gaggacaagg
     3121 acgagaagaa ggagaaggac aagagcggca agaagcagca caagatacgc cgcaagctgc
     3181 tcatgggcgg cctcatccgc cgcaagaacc gctccatgcc cgatctgacc gaggccgtgg
     3241 acgacgccgc agcgccgatc agcggcccag cccatcatca ccatcacctg cagctgcacc
     3301 atccctgcgc cccgaccgct ggccacctca tgggcagctc cgtggacgac agcgccgtgg
     3361 gcctgggcaa gcacgccctg gccatgagcg gctacatctc cgagggtcac tttgactatc
     3421 cgctgggcac tgggccgtgc accagtaatg tgggagtggg cagtgggagc ggcagcaatc
     3481 ccaatcccaa tctggagcga agcaaactga tgcggaagag cttccacggc agtggccgcc
     3541 agctgaccat gcccgtgccc aaggtggcac cgccgccgcc catgcgcacc agctccgccc
     3601 tgacgcccca ccagcagcag cagcaacagc agcagcaatt ccatcacgag gccaatctgt
     3661 cgaacctctc cgccatgagc tcaaacacct cgatcagtga ggactcctgc cagacgatca
     3721 tcacgacctg tgcccaggtg cactccgagc agagcccgct caaggacatg caaatgctgg
     3781 gcatggccca gggccaggag gagctgccgc tgccgctgcc gccgatggag ctgccgccgt
     3841 acccgagtcc gccgcactcc gtttgccact cgcggcaggc cagcgaggac ttcccgccac
     3901 cgccgccacc cgagctcgat ctggagccgc tgaaccagca gctcagccag ctgcagcagc
     3961 tggaggcggc caacaagcag cagcgccagc agcaactcga ctcactggag ggcaccacca
     4021 gcatcctggc ccagctgcag cagcgccagc atctgctcaa gctgcgcaag gagcagacgg
     4081 gagccggcag cgacgccacc tggctgaagg agctgcaggc caagcaggcc gccgatctgc
     4141 gcacgatgca gcgcaagctg gaggcctcct cacccagcag cgtgcgcgat ctggcgcacc
     4201 gcttcgagca gacggccacc attcggtcgt acgcgtcgca ggagctcctg gcccagccac
     4261 gcacccagat gaacggcctg cccagccagg cgaagctcga cgctgatgag gttgactgtg
     4321 cgggcccagt gcgtgcctct ctcccgctgc cactgccgct gcctcatcag ctgatgctgc
     4381 ccaagcccaa gtacgagatg tcgcagtcgc agatcgccga ggagatccgc gaggtggagc
     4441 tgctcaactc gatggtgcag cagacgctca accagagtgc cgccgctgga gccccaccca
     4501 agcgggtgaa gaagaagagc gtgtcgttct gcgaccaggt gatcctggtg gcaacggccg
     4561 gcaatgaaga ggacgacgac ttcataccca acccgatact ggagcgcgta ctgcgcaccg
     4621 cacagcaccc gaacgagaag gtaacggccc agatgatcca gcagcagcag cagaacatgc
     4681 tccgactgca gacggagccg cagccccgac aacagcacca gcagcagcag cagcagcaac
     4741 agcagcagca gcagcaacag cagcagcagc agcaacagca gcagcaacaa ttgcagcttc
     4801 aacatcaaca gcagcagcag cagccttctc ccatgctggg aagaccagcg gcaacggata
     4861 tgcagcgata tgtgcagaga atgcaacaac aacagcagca acaacagcag cagcaacaac
     4921 agcagcagga gatgtaccgc cagcagcagc aacagcagca gcatcgcacc agcatgagtg
     4981 gcatcgtcag ccagcaacac ctgcaactgc aacagcagca gcagcagcag cagcagcaac
     5041 accagatcga tgcacagtac accagcatgc cccggccatt gcatcatccc catctactgc
     5101 cacagaccta ctcagtgcag ctgcagaaac accagcagca ccagcaacag ctgcaacagc
     5161 agcaccaaca gcagccatcg ccgcaactga gcctgggata cttcagcaat ggccaggcag
     5221 cggccagcga gcaagtgcca cagccacagc cactgccttc gccttatcag cgagtgccac
     5281 tgccacatgg ctaccagccc acaggcggac cctactaccc tttgccaaac caacagcagc
     5341 aacagctgat tattgccaat ggaaaacctg cccagaagaa ggtcagcttc gagccaggga
     5401 ccaagggcga agccgactgc ctgccaccgc caccaccgcc gccacaaacg aagctggccc
     5461 tgcagaatgg tctgcccgag tcggccagcc caggagctgg ccctggctcg gcaccggcgg
     5521 ccatcacagc catacccacc cgcgtctaca acaatgccat cgtgaaggcc tcggccaagg
     5581 ccgtggagtg caatctgtgc cgcaagcggc acgtcattgc gccagccgtc tactgcacca
     5641 actgcgagta ctatctgcag atgctgaacc agcgcagatg atgcactccc agtccca
//