LOCUS XM_041591980 3591 bp mRNA linear INV 14-MAY-2021
DEFINITION PREDICTED: Drosophila obscura axonemal 84 kDa protein
(LOC111072409), mRNA.
ACCESSION XM_041591980
VERSION XM_041591980.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq; corrected model; includes ab initio.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
##RefSeq-Attributes-START##
ab initio :: 3% of CDS bases
frameshifts :: corrected 1 indel
##RefSeq-Attributes-END##
PRIMARY REFSEQ_SPAN PRIMARY_IDENTIFIER PRIMARY_SPAN COMP
1-132 JAECWW010000165.1 1583654-1583785
133-221 JAECWW010000165.1 1585650-1585738
222-660 JAECWW010000165.1 1585816-1586254
661-818 JAECWW010000165.1 1586316-1586473
819-905 JAECWW010000165.1 1586550-1586636
906-1105 JAECWW010000165.1 1586714-1586913
1106-1465 JAECWW010000165.1 1586976-1587335
1466-1622 JAECWW010000165.1 1587410-1587566
1623-2430 JAECWW010000165.1 1587626-1588433
2431-2777 JAECWW010000165.1 1588435-1588781
2778-2899 JAECWW010000165.1 1588845-1588966
2900-2975 JAECWW010000165.1 1589035-1589110
2976-3591 JAECWW010000165.1 1589193-1589808
FEATURES Location/Qualifiers
source 1..3591
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..3591
/gene="LOC111072409"
/note="The sequence of the model RefSeq transcript was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon;
Derived by automated computational analysis using gene
prediction method: Gnomon. Supporting evidence includes
similarity to: 4 Proteins, and 29% coverage of the
annotated genomic feature by RNAseq alignments"
/db_xref="GeneID:111072409"
CDS 1..3591
/gene="LOC111072409"
/note="The sequence of the model RefSeq protein was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon"
/codon_start=1
/product="LOW QUALITY PROTEIN: axonemal 84 kDa protein"
/protein_id="XP_041447914.1"
/db_xref="GeneID:111072409"
/translation="MDTKDGGLAWCGVADGWCGAVRCAGDWCKILKTITTADKHHHCV
PPKKKLSKKEKARLEAEQAELLRIEMEKEKLKKLEEARQRKKLEMEQAKHRQQQEVAE
NRLRRTQLKDSMLFFEAVRTAIEAIKDGERHERDWEKFMRCNGLPNAASPSDLRKYIH
QWHADIAQRHREARNWLLRTDERTLLTQDAQVADLTRISLRQQQGRLGDVYAHRIKEV
LGILSELDEALPFRQRSVHVAEDLAKLKTEMRVFLRQHLDEFTYKTMSHIERDMEIDR
PGVSKHIYSSDVFQSFVWTFSKDAQIALNPKARIGEQSGHNEIDFPTIEMQISLPPTV
QLQSSALRGLWLNYDHFSDYCSSYQLRHARSENANILRQTKREWRKRKEILQAMLDEC
GRDIPLSELEQLTQEQHSASSQPQRERVYDVDKLYAEYEDELSRAHRRAIGPEAYGML
ETDVNLRRYRIIGGVYCIDFLETPQQDKQLNARSFIRTISCANSLVLKEYYQTYKPPP
PVLPGVRRLPEEIEAEMRMIESALDKLALVTLQLPDSVIWFEPPVACRWETHIETLEM
ELADGMKPVPPAPADAAAATTATPTEATPLSTNAQPSPLKSSPRCPAARLRPHKSTSH
PQQTLREITDFDLRAIPSDVDLYGLVKEFVVPRLPQGFCVRLEATTPAESATGMGSGL
RRRRVMFLARQTHVRLGQGLGHGTAGGVEAEPTDHLQLGLGLDLGLGIGADFLDTAEP
AEMFPPLCFDKLLKFRQAAPVLPQGYLFSQLIEDMDRLWRLQLPREQQAELIQQISEQ
LSPTGCAGSEAGQLQQQQQQELEVDPIQPLSQELVDVLQPAAAAAAFAYYTGISFVRD
SQLPHDSEMEQSAAATKQPSDGLDSSEDDEEDVDSLLELLEGASTAGNTLHELLAPGP
GQSHSENSDSEMRRQKLSSTAAITQGKWSTRDVHDTKFNEDKLSIQFRTGRLGIFGFA
LNKYSNMPYQTWDLRPDMKNPGTILFSFTASLISLDMTITVAGYTVNNFQGGSTQGLT
EMIGKTLSLAELKATLILSAVDIFPDEDAFCYTEGSCEKNYVMEMHCYACLSTLAQSH
NFSWSRWNLLAGSRTAVLLVRELIEGKKVPYYSTLLVTPLKTSIIDCTEVSASFNAVG
IAGMEYYADLYQLSQAHAQPVSLEKQRTMSPVLRDNVARILMAIRPLSLC"
misc_feature 166..>498
/gene="LOC111072409"
/note="Cancer susceptibility candidate 1 N-terminus;
Region: Casc1_N; pfam15927"
/db_xref="CDD:464947"
misc_feature <1000..>1308
/gene="LOC111072409"
/note="exonuclease SbcC; Region: sbcc; TIGR00618"
/db_xref="CDD:129705"
misc_feature 2824..3414
/gene="LOC111072409"
/note="Cancer susceptibility candidate 1 C-terminal;
Region: Casc1_C; pfam12366"
/db_xref="CDD:432508"
ORIGIN
1 atggatacca aagatggtgg cctggcatgg tgtggcgtgg cggatgggtg gtgcggtgcg
61 gtgcggtgtg cgggtgactg gtgcaagatt ctaaagacca tcaccaccgc tgacaagcac
121 caccattgtg tgccaccaaa gaaaaagctt tccaagaagg agaaggcccg tctggaggcc
181 gagcaggcgg agctgctgcg catcgaaatg gaaaaggaaa aacttaagaa actggaggag
241 gcgcgccagc gcaaaaaact ggaaatggaa caggcaaagc atcgccagca gcaggaggtg
301 gccgagaatc gtctgcgtcg gacacagctg aaggacagca tgctcttctt tgaggcagtg
361 cgtacggcga ttgaggccat caaggatggc gagcggcacg agcgcgactg ggagaagttc
421 atgcgctgca atgggctgcc gaatgcagcc agtccgagcg atctgcgcaa gtacatccat
481 cagtggcacg cggacattgc ccagcggcac agggaggcgc gcaactggct gctgcgcaca
541 gacgagcgaa cgctactcac acaggatgcc caggtggcgg atctgacgcg catctcgctg
601 cgccagcagc agggacggct gggcgatgtc tatgcccatc gcatcaagga ggtgcttggg
661 attctcagcg agctggatga ggcgttgccc ttcaggcagc gttcggtgca tgtggcagag
721 gatctggcca agctcaagac ggagatgcgg gtgtttctgc ggcagcatct ggacgagttt
781 acctacaaga caatgtccca cattgagcgg gacatggaaa tcgatagacc gggcgtctca
841 aagcacatct acagctcgga tgtctttcag agctttgtgt ggaccttctc caaggatgcc
901 caaattgccc tcaatcccaa ggcacgcatt ggggagcagt cgggccacaa tgagatcgat
961 ttcccaacca ttgaaatgca aatttcactg cccccgacag tgcagctgca gagctcggcg
1021 ctgcgcggcc tgtggctcaa ttatgatcac ttcagtgact actgcagcag ctatcaactg
1081 cggcatgccc gctccgagaa tgccaatatt ctgcggcaga cgaagcgcga gtggcgcaaa
1141 cgcaaggaga tcctgcaggc gatgctggac gagtgcggca gggacattcc gctgtcggag
1201 ctggagcagc tcacccagga acagcactcg gccagctcac agccgcagcg ggagcgcgtc
1261 tacgatgtgg acaagctgta tgcggaatac gaggatgagc tgagtcgtgc ccatcgccgg
1321 gccatcggcc ccgaggccta tggcatgctc gagacggatg tgaatctgcg ccgataccgc
1381 atcattggcg gcgtctactg catcgatttc ctcgagacgc cgcagcagga caagcagctc
1441 aatgcgcgct ccttcatacg aacaattagt tgtgccaaca gcttggtgct caaggaatac
1501 tatcagacct acaagccgcc gccgcctgtg ctgcccggcg tgcgacgact gcccgaggaa
1561 attgaggctg agatgcggat gatcgagtcg gcgctggaca agctggcgct ggtcacgctg
1621 cagttgcccg attctgtgat ttggttcgag cctccggtcg cttgccgctg ggagacgcac
1681 atcgagacac tcgaaatgga gctggcggat ggcatgaaac cggtgccgcc tgccccagcg
1741 gatgcggcag ctgccaccac ggcgacgccc acagaggcga cgcccctctc gaccaacgcc
1801 cagccgagtc cgctgaagag ttcgccccgt tgtccggccg cccgcctgcg gccccacaag
1861 tccacgtcgc acccacagca gacgctgcgc gagatcacgg actttgacct gagggccata
1921 cccagcgacg tggacctgta cgggctggtc aaggagtttg tggtgccgcg gctgccgcag
1981 ggcttctgcg tgcgtctcga ggcgaccacc ccggcggagt cggccacggg catgggctcg
2041 ggtctgcgcc gccgcagggt gatgttcctc gcccgccaga cgcacgtgcg actgggtcag
2101 ggtctgggtc atggcacagc aggcggagtc gaggctgagc ccaccgatca tctgcagctg
2161 ggcttgggcc tggacttggg cctgggcatt ggagccgatt tcctggacac tgcggagccg
2221 gcggagatgt ttccgccgct gtgctttgac aaactgctga agttccgcca ggcagcccct
2281 gtgctgccgc agggctacct tttctcgcag ctcatcgagg acatggaccg cctgtggcgt
2341 ctgcagctgc cgcgcgagca gcaggcggag ctcatccagc agatctcgga gcaactctcg
2401 cccactggct gtgcgggcag cgaggcgggg cagctgcagc agcagcagca gcaggagctg
2461 gaggtggatc ccatccagcc actcagccag gagctggtgg acgtcctgca gccagcagcg
2521 gcggccgccg cctttgccta ctacacgggc atctcatttg tgcgcgactc gcagctgccg
2581 cacgactcgg aaatggagca atcggcggcg gccaccaagc agccatcaga tggcctggac
2641 agcagcgagg acgacgaaga ggatgtggac agtctgctgg agctgctcga gggcgcctcc
2701 acagcgggca acacactgca cgagctgctg gcccctggcc ccggccagag tcacagcgag
2761 aacagcgaca gcgagatgcg caggcagaag cttagcagca ctgcggccat cactcagggc
2821 aagtggagca ctcgcgatgt ccacgacaca aagttcaatg aggacaagct ctcgattcag
2881 ttccgcacag gcagattggg gatctttggc ttcgccctga acaaatacag caacatgccc
2941 taccagacct gggacctgcg accggacatg aaaaatcctg gcaccatcct cttcagcttc
3001 acggcatcgc tgatcagcct ggacatgacc atcaccgttg cgggctacac ggtgaacaat
3061 tttcagggcg gcagcaccca gggcctcacc gaaatgatcg gcaaaacgct gtcactggcc
3121 gagctgaagg ccacactgat cctctcggcg gtggacattt tccccgacga ggatgccttc
3181 tgctatacgg agggctcctg cgagaagaac tatgtgatgg agatgcactg ctatgcctgc
3241 ctctcgaccc tcgcccaatc gcacaacttc agctggtcgc gctggaacct gctggccggc
3301 tcccgcaccg ccgttctgct cgtccgcgag ctcatcgagg gcaaaaaggt gccgtactac
3361 tcgacgttgc tggtgacgcc gctcaagaca tcgatcattg actgcaccga agtatcggcc
3421 agcttcaatg cggtgggcat tgccggcatg gagtactatg cggatctcta tcagctcagc
3481 caggcccacg cccagccggt gagtctcgag aagcagcgca cgatgagtcc ggtgctcagg
3541 gacaatgtgg caaggattct gatggccata cggccattga gtctctgttg a