PREDICTED: Drosophila obscura axonemal 84 kDa protein


LOCUS       XM_041591980            3591 bp    mRNA    linear   INV 14-MAY-2021
            (LOC111072409), mRNA.
ACCESSION   XM_041591980
VERSION     XM_041591980.1
DBLINK      BioProject: PRJNA728747
KEYWORDS    RefSeq; corrected model; includes ab initio.
SOURCE      Drosophila obscura
  ORGANISM  Drosophila obscura
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_024542752.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Drosophila obscura Annotation
                                           Release 101
            Annotation Version          :: 101
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 8.6
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
            
            ##RefSeq-Attributes-START##
            ab initio   :: 3% of CDS bases
            frameshifts :: corrected 1 indel
            ##RefSeq-Attributes-END##
PRIMARY     REFSEQ_SPAN         PRIMARY_IDENTIFIER PRIMARY_SPAN        COMP
            1-132               JAECWW010000165.1  1583654-1583785
            133-221             JAECWW010000165.1  1585650-1585738
            222-660             JAECWW010000165.1  1585816-1586254
            661-818             JAECWW010000165.1  1586316-1586473
            819-905             JAECWW010000165.1  1586550-1586636
            906-1105            JAECWW010000165.1  1586714-1586913
            1106-1465           JAECWW010000165.1  1586976-1587335
            1466-1622           JAECWW010000165.1  1587410-1587566
            1623-2430           JAECWW010000165.1  1587626-1588433
            2431-2777           JAECWW010000165.1  1588435-1588781
            2778-2899           JAECWW010000165.1  1588845-1588966
            2900-2975           JAECWW010000165.1  1589035-1589110
            2976-3591           JAECWW010000165.1  1589193-1589808
FEATURES             Location/Qualifiers
     source          1..3591
                     /organism="Drosophila obscura"
                     /mol_type="mRNA"
                     /isolate="BZ-5 IFL"
                     /db_xref="taxon:7282"
                     /chromosome="Unknown"
                     /sex="male"
                     /tissue_type="whole fly"
                     /dev_stage="Adult fly"
                     /geo_loc_name="Serbia: Babin Zub"
                     /collection_date="2017"
     gene            1..3591
                     /gene="LOC111072409"
                     /note="The sequence of the model RefSeq transcript was
                     modified relative to its source genomic sequence to
                     represent the inferred CDS: deleted 1 base in 1 codon;
                     Derived by automated computational analysis using gene
                     prediction method: Gnomon. Supporting evidence includes
                     similarity to: 4 Proteins, and 29% coverage of the
                     annotated genomic feature by RNAseq alignments"
                     /db_xref="GeneID:111072409"
     CDS             1..3591
                     /gene="LOC111072409"
                     /note="The sequence of the model RefSeq protein was
                     modified relative to its source genomic sequence to
                     represent the inferred CDS: deleted 1 base in 1 codon"
                     /codon_start=1
                     /product="LOW QUALITY PROTEIN: axonemal 84 kDa protein"
                     /protein_id="XP_041447914.1"
                     /db_xref="GeneID:111072409"
                     /translation="MDTKDGGLAWCGVADGWCGAVRCAGDWCKILKTITTADKHHHCV
                     PPKKKLSKKEKARLEAEQAELLRIEMEKEKLKKLEEARQRKKLEMEQAKHRQQQEVAE
                     NRLRRTQLKDSMLFFEAVRTAIEAIKDGERHERDWEKFMRCNGLPNAASPSDLRKYIH
                     QWHADIAQRHREARNWLLRTDERTLLTQDAQVADLTRISLRQQQGRLGDVYAHRIKEV
                     LGILSELDEALPFRQRSVHVAEDLAKLKTEMRVFLRQHLDEFTYKTMSHIERDMEIDR
                     PGVSKHIYSSDVFQSFVWTFSKDAQIALNPKARIGEQSGHNEIDFPTIEMQISLPPTV
                     QLQSSALRGLWLNYDHFSDYCSSYQLRHARSENANILRQTKREWRKRKEILQAMLDEC
                     GRDIPLSELEQLTQEQHSASSQPQRERVYDVDKLYAEYEDELSRAHRRAIGPEAYGML
                     ETDVNLRRYRIIGGVYCIDFLETPQQDKQLNARSFIRTISCANSLVLKEYYQTYKPPP
                     PVLPGVRRLPEEIEAEMRMIESALDKLALVTLQLPDSVIWFEPPVACRWETHIETLEM
                     ELADGMKPVPPAPADAAAATTATPTEATPLSTNAQPSPLKSSPRCPAARLRPHKSTSH
                     PQQTLREITDFDLRAIPSDVDLYGLVKEFVVPRLPQGFCVRLEATTPAESATGMGSGL
                     RRRRVMFLARQTHVRLGQGLGHGTAGGVEAEPTDHLQLGLGLDLGLGIGADFLDTAEP
                     AEMFPPLCFDKLLKFRQAAPVLPQGYLFSQLIEDMDRLWRLQLPREQQAELIQQISEQ
                     LSPTGCAGSEAGQLQQQQQQELEVDPIQPLSQELVDVLQPAAAAAAFAYYTGISFVRD
                     SQLPHDSEMEQSAAATKQPSDGLDSSEDDEEDVDSLLELLEGASTAGNTLHELLAPGP
                     GQSHSENSDSEMRRQKLSSTAAITQGKWSTRDVHDTKFNEDKLSIQFRTGRLGIFGFA
                     LNKYSNMPYQTWDLRPDMKNPGTILFSFTASLISLDMTITVAGYTVNNFQGGSTQGLT
                     EMIGKTLSLAELKATLILSAVDIFPDEDAFCYTEGSCEKNYVMEMHCYACLSTLAQSH
                     NFSWSRWNLLAGSRTAVLLVRELIEGKKVPYYSTLLVTPLKTSIIDCTEVSASFNAVG
                     IAGMEYYADLYQLSQAHAQPVSLEKQRTMSPVLRDNVARILMAIRPLSLC"
     misc_feature    166..>498
                     /gene="LOC111072409"
                     /note="Cancer susceptibility candidate 1 N-terminus;
                     Region: Casc1_N; pfam15927"
                     /db_xref="CDD:464947"
     misc_feature    <1000..>1308
                     /gene="LOC111072409"
                     /note="exonuclease SbcC; Region: sbcc; TIGR00618"
                     /db_xref="CDD:129705"
     misc_feature    2824..3414
                     /gene="LOC111072409"
                     /note="Cancer susceptibility candidate 1 C-terminal;
                     Region: Casc1_C; pfam12366"
                     /db_xref="CDD:432508"
ORIGIN      
        1 atggatacca aagatggtgg cctggcatgg tgtggcgtgg cggatgggtg gtgcggtgcg
       61 gtgcggtgtg cgggtgactg gtgcaagatt ctaaagacca tcaccaccgc tgacaagcac
      121 caccattgtg tgccaccaaa gaaaaagctt tccaagaagg agaaggcccg tctggaggcc
      181 gagcaggcgg agctgctgcg catcgaaatg gaaaaggaaa aacttaagaa actggaggag
      241 gcgcgccagc gcaaaaaact ggaaatggaa caggcaaagc atcgccagca gcaggaggtg
      301 gccgagaatc gtctgcgtcg gacacagctg aaggacagca tgctcttctt tgaggcagtg
      361 cgtacggcga ttgaggccat caaggatggc gagcggcacg agcgcgactg ggagaagttc
      421 atgcgctgca atgggctgcc gaatgcagcc agtccgagcg atctgcgcaa gtacatccat
      481 cagtggcacg cggacattgc ccagcggcac agggaggcgc gcaactggct gctgcgcaca
      541 gacgagcgaa cgctactcac acaggatgcc caggtggcgg atctgacgcg catctcgctg
      601 cgccagcagc agggacggct gggcgatgtc tatgcccatc gcatcaagga ggtgcttggg
      661 attctcagcg agctggatga ggcgttgccc ttcaggcagc gttcggtgca tgtggcagag
      721 gatctggcca agctcaagac ggagatgcgg gtgtttctgc ggcagcatct ggacgagttt
      781 acctacaaga caatgtccca cattgagcgg gacatggaaa tcgatagacc gggcgtctca
      841 aagcacatct acagctcgga tgtctttcag agctttgtgt ggaccttctc caaggatgcc
      901 caaattgccc tcaatcccaa ggcacgcatt ggggagcagt cgggccacaa tgagatcgat
      961 ttcccaacca ttgaaatgca aatttcactg cccccgacag tgcagctgca gagctcggcg
     1021 ctgcgcggcc tgtggctcaa ttatgatcac ttcagtgact actgcagcag ctatcaactg
     1081 cggcatgccc gctccgagaa tgccaatatt ctgcggcaga cgaagcgcga gtggcgcaaa
     1141 cgcaaggaga tcctgcaggc gatgctggac gagtgcggca gggacattcc gctgtcggag
     1201 ctggagcagc tcacccagga acagcactcg gccagctcac agccgcagcg ggagcgcgtc
     1261 tacgatgtgg acaagctgta tgcggaatac gaggatgagc tgagtcgtgc ccatcgccgg
     1321 gccatcggcc ccgaggccta tggcatgctc gagacggatg tgaatctgcg ccgataccgc
     1381 atcattggcg gcgtctactg catcgatttc ctcgagacgc cgcagcagga caagcagctc
     1441 aatgcgcgct ccttcatacg aacaattagt tgtgccaaca gcttggtgct caaggaatac
     1501 tatcagacct acaagccgcc gccgcctgtg ctgcccggcg tgcgacgact gcccgaggaa
     1561 attgaggctg agatgcggat gatcgagtcg gcgctggaca agctggcgct ggtcacgctg
     1621 cagttgcccg attctgtgat ttggttcgag cctccggtcg cttgccgctg ggagacgcac
     1681 atcgagacac tcgaaatgga gctggcggat ggcatgaaac cggtgccgcc tgccccagcg
     1741 gatgcggcag ctgccaccac ggcgacgccc acagaggcga cgcccctctc gaccaacgcc
     1801 cagccgagtc cgctgaagag ttcgccccgt tgtccggccg cccgcctgcg gccccacaag
     1861 tccacgtcgc acccacagca gacgctgcgc gagatcacgg actttgacct gagggccata
     1921 cccagcgacg tggacctgta cgggctggtc aaggagtttg tggtgccgcg gctgccgcag
     1981 ggcttctgcg tgcgtctcga ggcgaccacc ccggcggagt cggccacggg catgggctcg
     2041 ggtctgcgcc gccgcagggt gatgttcctc gcccgccaga cgcacgtgcg actgggtcag
     2101 ggtctgggtc atggcacagc aggcggagtc gaggctgagc ccaccgatca tctgcagctg
     2161 ggcttgggcc tggacttggg cctgggcatt ggagccgatt tcctggacac tgcggagccg
     2221 gcggagatgt ttccgccgct gtgctttgac aaactgctga agttccgcca ggcagcccct
     2281 gtgctgccgc agggctacct tttctcgcag ctcatcgagg acatggaccg cctgtggcgt
     2341 ctgcagctgc cgcgcgagca gcaggcggag ctcatccagc agatctcgga gcaactctcg
     2401 cccactggct gtgcgggcag cgaggcgggg cagctgcagc agcagcagca gcaggagctg
     2461 gaggtggatc ccatccagcc actcagccag gagctggtgg acgtcctgca gccagcagcg
     2521 gcggccgccg cctttgccta ctacacgggc atctcatttg tgcgcgactc gcagctgccg
     2581 cacgactcgg aaatggagca atcggcggcg gccaccaagc agccatcaga tggcctggac
     2641 agcagcgagg acgacgaaga ggatgtggac agtctgctgg agctgctcga gggcgcctcc
     2701 acagcgggca acacactgca cgagctgctg gcccctggcc ccggccagag tcacagcgag
     2761 aacagcgaca gcgagatgcg caggcagaag cttagcagca ctgcggccat cactcagggc
     2821 aagtggagca ctcgcgatgt ccacgacaca aagttcaatg aggacaagct ctcgattcag
     2881 ttccgcacag gcagattggg gatctttggc ttcgccctga acaaatacag caacatgccc
     2941 taccagacct gggacctgcg accggacatg aaaaatcctg gcaccatcct cttcagcttc
     3001 acggcatcgc tgatcagcct ggacatgacc atcaccgttg cgggctacac ggtgaacaat
     3061 tttcagggcg gcagcaccca gggcctcacc gaaatgatcg gcaaaacgct gtcactggcc
     3121 gagctgaagg ccacactgat cctctcggcg gtggacattt tccccgacga ggatgccttc
     3181 tgctatacgg agggctcctg cgagaagaac tatgtgatgg agatgcactg ctatgcctgc
     3241 ctctcgaccc tcgcccaatc gcacaacttc agctggtcgc gctggaacct gctggccggc
     3301 tcccgcaccg ccgttctgct cgtccgcgag ctcatcgagg gcaaaaaggt gccgtactac
     3361 tcgacgttgc tggtgacgcc gctcaagaca tcgatcattg actgcaccga agtatcggcc
     3421 agcttcaatg cggtgggcat tgccggcatg gagtactatg cggatctcta tcagctcagc
     3481 caggcccacg cccagccggt gagtctcgag aagcagcgca cgatgagtcc ggtgctcagg
     3541 gacaatgtgg caaggattct gatggccata cggccattga gtctctgttg a