LOCUS XM_041591970 7380 bp mRNA linear INV 14-MAY-2021
DEFINITION PREDICTED: Drosophila obscura uncharacterized LOC111066361
(LOC111066361), mRNA.
ACCESSION XM_041591970
VERSION XM_041591970.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq; corrected model; includes ab initio.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
##RefSeq-Attributes-START##
ab initio :: 5% of CDS bases
frameshifts :: corrected 1 indel
##RefSeq-Attributes-END##
PRIMARY REFSEQ_SPAN PRIMARY_IDENTIFIER PRIMARY_SPAN COMP
1-6459 JAECWW010000165.1 2026990-2033448 c
6460-7380 JAECWW010000165.1 2026068-2026988 c
FEATURES Location/Qualifiers
source 1..7380
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..7380
/gene="LOC111066361"
/note="The sequence of the model RefSeq transcript was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon;
Derived by automated computational analysis using gene
prediction method: Gnomon. Supporting evidence includes
similarity to: 4 Proteins, and 94% coverage of the
annotated genomic feature by RNAseq alignments"
/db_xref="GeneID:111066361"
CDS 152..7273
/gene="LOC111066361"
/note="The sequence of the model RefSeq protein was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon"
/codon_start=1
/product="LOW QUALITY PROTEIN: uncharacterized protein
LOC111066361"
/protein_id="XP_041447904.1"
/db_xref="GeneID:111066361"
/translation="MEARKCLPEADPCPVRQYLARQAEAQANQRQSLNQYLEKTMSDF
PPPHLPGEATKLAASASAAATCQCVARTRPNAILNQFVGGGRKPGKRLLVPDLATDWF
RKQKVFLDKISPEELAQGQDRQDRQDRCFGHLDASEYRLPETRCACPATKRSQGQSRV
SLARQRHQQLQGKATCGCRSGANGAEDAPAGDEQQQDQQRLAKQMSSSQTITRCRPHV
AGCGTEGGMKNSRSRSSRSRSRSSRTSRGVSRSRSRSSSRSTNRSRAKRRNAGGHSPH
RRHPIVRRSSCTCLRRSPSCRCPGQMQFQSDGQPMSRECQLATRRTEPLMPCLRRKYE
LQRAPQRRLEPQHFRRISPEFFGILKKYEAAMTESCPLYGISIRHWGESAPCCICEED
DASPEHMQCLGDESVHAAGAVPSIQQLMASEADTEEYTAAGEHPGVEEQGSEAPAASS
GPSPCNDDTCPVRRLEPLPAPVPRAKPKEPEPRPQEPRRKSEAKQEQAEPEEDPQPPS
KEPEKKKSGFRCCPCWSRDPLKDDDAAGTQREKPKKEKKEKKEKKEKKEKEKKPKKEK
KSKKGDKSAGEVPNAIIPPQAFLSGFAGLCRRCMRSGRHLLYSHNSSYQRIRRQNQGY
RNEQAPPAAAAAAAPPPMPYHMRGVCQLQPDVWQFEQQPRTRQQTATDTETEMEAETV
SWAAVGAAGGKTQPEWEPSEDEGGSLDSCNCSSTTLRPSSSCEAQPSRGPLCTTRRSQ
AGACNLDGCPSSAIWTLQKRSSRPLKRRFVNENFGRRRLPERAERAERAERAEQERTE
EQIKGGEDATLRDNNTSSSYVEGFCLKVNKHGTRACYLANDYLQLMKMVARKRARYDC
LKTKKTTCRPKPEPSPRPTARSRRASPAAAARGNAAAAAHSNRSRSTGMRTQSAPAAA
TTARHPRRRLPPPKMLASPSTDADSDGASSSQDGGSRKAERKKGTSSARNSQTRSSLR
KRPARASSSPSPSASPARLKSRSARSSSSSRNVERKPASPTRSARSSAARRHSAQRQR
QSTESHTKAVDRVASSSDLDEDCRPNENGTCAQYRAAISHRYPDYAMDETESVPQFKP
SDSYASECLCKTYSYSRAGTNAMATAMAPAVAPQDVTEPSPWSAETAAPYQGNAAADW
TQEKQANGRKSPEAPKSPDWMGMEEEQQQQPQQPQQQQLQQQQPPSVATQDDTQFDEN
PVEWVEEQREPAAAAAADNYAEQPAFAGQRELCHQDCPYWRTVHEEERLQQQQQQQQQ
QQHLQLQQQQPQFYSQEGGYDQDDEERPTTSKHLKRLSRSYRLRPPCGGPLRWIANGF
TSSKRSSLSDSAPFERGRERAPPAAATAAAAAAVVKKQRPRPPIERGRRYYTETQPKM
WYDQCGGLDGMSSSIRSSHSRCSIRALSSNAETVECPCPPLPSIKSENVKPDRWGMQQ
PQFQQQQRQPQFQQQQQQPQFQQQQTPRVTPLMQPRQTQMQPQMQPQLQPQMQPKLQP
QMQPQMQPQMQPQMPPHHMQMQPRPIQPQPQPQMQMQMQHAMEYQNPNQNQNQNHIEQ
NRYRHYEQQQQQQQQQAAAYNPMLSMATAASPLNFPIGSGFGPSKKIPKLLSNQPLEL
KPSLAQKLEAKKPKWMHHNHAPLTLRTARPRAEAYYEGKMRRRCLKFMLSIEQGRQPA
IPPAQAPAEQEDWSITEQQQQQQHQHQQQHESEPSEELQEEEEQPQERVGQQEEQEPL
NDSSEEQVWFGEPGYDVDEGPQWSQTRRSLSGSVVMRPANNAATRIEHPTHQWPYSGQ
SQRHDERPSQPSPTYDPYYRVDADHWPPEQQAAHWETPPYHPKQQSPQQQQQQQPPLN
QNYQEPQSQRQTSRRPQAQPQPQPQPQPQRQAQPQPQKLPQRQPQTERQPQRQEQPQR
QEQPQRQSQRQPRHEQQQPQPERQQDPHLHAHPPHPHAHAHRQPQPQPHVQRQTPLPP
PPQPDINARWHWRIPRSVSETRPFYDEEEEQDYAPQEERQPTNWQKHAPPPSYMQPMH
HHGVARYSADSGRAAASRSPPRRASRHRAPQRQGGGAAGAITAAALGLRGLVQALGGP
RYARNLSTVSGESAEGGGDHGNALDCGQPQPMDFSREQLQFTVEPATAVHRGAPLGAT
EQTAPGAVRSYGCALGREQRARTYAVQKRPSAFLARSSRVNGYSKPPSTDNERSSPLS
LFAVPTSSSNLAWRAETRAARTITSTRIPGGQEEQPRMPPPVSILSDLDFHTHSTSHN
SICSTSGGYRAPLPSLALSEDGTSSLAVFLDNLRARNVLLLPGKPSPLAELPPSGSAA
KCSGKQHQQQQEEEEEEEEEKEKVEPTQGRRWFPATPLVIPPTYNALLEEQVLQEYLP
TPKCRRFKT"
misc_feature 1436..>1687
/gene="LOC111066361"
/note="Procyclic acidic repetitive protein (PARP); Region:
Trypan_PARP; pfam05887"
/db_xref="CDD:368653"
ORIGIN
1 cagagaaatg tcgagaagtg tacgaattaa acaaaacctt cgactttcga cggaaatcag
61 ttatcggaaa tagctcaaaa ttttaagtac gacatcagaa gcagagagca gagagcacag
121 agcacagaga ctccatccat agacacggca gatggaagca agaaaatgcc tgccggaggc
181 cgatccctgc cccgtgcgac aatatctcgc acggcaggcc gaggcccagg ccaatcagcg
241 ccagagcctc aatcagtatc tggagaagac catgtccgat ttcccgccgc cccacctgcc
301 cggcgaggcc acaaaactcg ccgcgtccgc gtccgccgcc gccacttgcc agtgcgtggc
361 caggaccagg ccgaatgcga ttctgaatca gtttgtcggc ggtggacgca agccggggaa
421 gcgcctgctg gtgccggatc tggccaccga ttggtttcgc aaacagaagg ttttccttga
481 caagatcagc cccgaggagc tggcccaggg ccaggatcgc caggatcgcc aggaccgttg
541 cttcggccac ctcgatgcct cagagtaccg gctgccagag acccgttgcg cctgcccggc
601 cacaaaacgg agccagggcc aaagtcgcgt gtcgctcgcc cgacagcggc accagcagct
661 gcagggcaaa gccacgtgcg gctgccggag cggggccaac ggggcagagg acgctccagc
721 gggagacgag cagcagcagg atcagcagcg gctggccaag caaatgtcca gcagtcagac
781 catcacacgc tgccggcccc atgtggcagg atgcggcacc gagggtggca tgaagaacag
841 caggagcagg agcagcagga gcagaagtag gagcagcaga actagcagag gtgtaagcag
901 aagtaggagc aggagcagca gcaggagcac aaacagaagc cgagcgaaac gcaggaatgc
961 tggcggccac agtccccatc gccgacaccc gatcgtgcgg cgcagcagct gcacttgcct
1021 acggcgcagt ccctcatgcc gctgccccgg ccagatgcag ttccagagcg acggccagcc
1081 catgtccagg gagtgccagc tggcgacgcg tcgcacagag ccgctgatgc cgtgcctgcg
1141 gcgtaaatat gaactgcagc gggcaccgca gcgtcgactc gagccgcagc actttcgccg
1201 catctcgccg gagttctttg ggatactgaa gaagtacgag gcggcgatga ccgagagctg
1261 tccgctctac ggcatctcga tacggcactg gggcgagtcg gcgccctgct gcatctgcga
1321 ggaggacgat gccagtcccg agcacatgca gtgtctgggc gatgagtcgg tgcatgcggc
1381 gggcgcagtg ccctccattc agcagttgat ggcctccgag gcggacacgg aagagtatac
1441 ggctgcgggc gagcatccgg gcgtcgagga gcaaggttcc gaggctccgg ctgcaagcag
1501 cggtccatcg ccctgcaacg atgacacctg tcccgtccga aggctcgagc ccctgccggc
1561 accagtgccg agggctaagc ccaaggaacc ggagccgaga ccacaagagc cgcgcaggaa
1621 gtccgaagcc aagcaagaac aagccgaacc agaagaagat ccgcagccac ccagcaagga
1681 gccagagaaa aagaaaagcg gcttcaggtg ctgcccctgc tggtccaggg atccccttaa
1741 ggatgatgat gctgccggaa cgcagcggga aaagccaaag aaagagaaga aagagaagaa
1801 ggagaagaag gagaagaagg agaaagagaa gaagccaaag aaggaaaaga aatccaaaaa
1861 gggcgacaaa tccgccgggg aggtgccgaa tgccatcatt cctccgcagg cattcttgag
1921 tggatttgcg ggcctgtgca ggcgctgcat gcgctcgggc cgccatctgc tgtacagcca
1981 caattcgagc taccagcgga tccgtcgcca gaaccagggc taccgcaacg agcaggctcc
2041 accagcagca gcagcagcag cagcaccacc cccaatgcca tatcacatgc gtggcgtgtg
2101 tcagcttcag cccgatgtct ggcaattcga gcagcagcca aggacaagac aacagacagc
2161 cacagacacg gagacggaga tggaagcgga aactgtgtcc tgggcagcag taggagcagc
2221 aggcggaaaa acgcaaccgg aatgggagcc atcagaggat gaaggcggct cccttgacag
2281 ctgcaattgc agcagcacca ccctgaggcc cagctccagc tgcgaggccc agcccagccg
2341 tggccctctc tgcacgacga gacgcagcca ggcgggtgcc tgcaacctcg atggctgccc
2401 atcatcggcc atttggacgc tgcagaagcg tagctccaga ccgctgaagc ggcgatttgt
2461 taacgagaac tttggcaggc gaaggctgcc agagagagcc gaaagagccg aaagagcaga
2521 aagagccgag caggagagaa cagaggaaca gatcaagggc ggagaggatg ccacgctgag
2581 ggataataat acgtcgtcgt cgtatgtgga gggtttctgc ctgaaggtga acaagcatgg
2641 cacacgggcc tgctatctgg ccaacgatta tctgcagctg atgaaaatgg tggccagaaa
2701 gcgggcccgt tacgactgcc tcaagaccaa gaagaccacc tgccggccca agccagagcc
2761 cagtccccgg cccacggcgc gcagtcgacg agcttcgccc gcggcagcag cgcgcggcaa
2821 tgcagcagct gccgcccact cgaatcggtc acggagtacc ggaatgcgta cgcagtccgc
2881 ccccgctgcg gcaacaacgg ccaggcatcc aaggcgtagg ctaccaccac ccaagatgct
2941 ggcgagccca tccacagatg ccgactcgga tggggccagc agctcccagg acgggggcag
3001 ccgcaaggcg gagaggaaga aaggcacaag ctccgctaga aattcacaaa caagaagctc
3061 cctcaggaag cgaccggcca gagccagttc cagtcccagt cccagtgcga gtcccgcccg
3121 cttaaagtcg cgctcggccc gcagctcgtc gtccagcagg aacgtggaga gaaagccagc
3181 atcgcccaca cgtagtgcca gatcgtcggc agccaggaga cactcggcac aaaggcagcg
3241 ccagtcgacg gagtcgcata cgaaggctgt cgatcgtgtg gcatcgagca gcgacctcga
3301 tgaggactgt cggccgaatg aaaacggaac ttgtgcccag tatcgtgccg caatcagcca
3361 cagatatccg gactatgcca tggacgaaac ggaatccgtg ccgcaattca agccatccga
3421 ttcgtatgcc agtgaatgcc tgtgcaaaac gtattcctat tcgagggcag gaaccaatgc
3481 catggccacg gccatggccc cggccgtggc cccccaagat gtaacagagc ccagtccgtg
3541 gtcagcggag accgctgcac cctatcaggg taatgccgct gcagattgga cccaggagaa
3601 acaggcaaat ggccgcaaaa gccccgaagc cccgaagagc cccgattgga tgggcatgga
3661 ggaggagcag cagcagcagc cgcagcagcc gcagcagcaa cagctgcaac agcagcagcc
3721 gccttcagtt gcaacgcagg atgatactca gtttgatgag aatcccgtgg aatgggtgga
3781 ggagcagcgt gagccagcgg ctgcagctgc agctgacaat tatgctgagc agccagcctt
3841 tgctgggcag cgagaactgt gccaccagga ttgtccttat tggaggactg tgcatgagga
3901 ggagagactg caacagcagc agcaacagca gcagcaacag cagcaccttc agctgcaaca
3961 gcagcagcca cagttctatt cacaggaggg cggctatgat caggacgatg aggagcgacc
4021 cacaacgagc aagcatctga aaaggctgtc gcgctcatac agactacgac cgccatgcgg
4081 tgggccactg cgttggattg ccaatggatt caccagctcc aaacggagct ccctctcgga
4141 cagcgctccc ttcgagcggg gaagagaacg ggcaccacca gcagcagcaa cagcagcagc
4201 agcggcagca gttgtcaaaa agcagaggcc acgaccaccc atagagaggg gacgtcgcta
4261 ctacacggaa acgcagccca aaatgtggta cgatcagtgt ggcgggctgg atgggatgag
4321 cagcagcatt cggtcgagcc actcgcgctg ctcgatacgt gccctcagct ccaacgcgga
4381 aaccgttgag tgtccctgcc cgccgttacc gtcaatcaaa agtgagaatg tgaagccaga
4441 tagatggggc atgcagcagc cacaatttca gcagcagcag cggcagccac agttccaaca
4501 gcaacagcag cagccacagt tccagcagca gcagacaccg agggtgactc ctctgatgca
4561 gccacggcag acacagatgc agcctcagat gcagcctcag ctgcagcctc agatgcagcc
4621 taagctgcag cctcagatgc agcctcaaat gcagcctcag atgcagccac agatgccgcc
4681 acatcatatg cagatgcagc cacggccaat acagccacag ccacagccac agatgcaaat
4741 gcaaatgcag cacgcgatgg agtaccaaaa tcccaaccag aaccagaacc agaaccacat
4801 cgaacagaat cgctataggc actacgagca gcagcagcag cagcagcagc agcaagcggc
4861 tgcttacaat ccgatgctgt cgatggctac tgctgctagt ccactcaact ttcccattgg
4921 ctcgggcttt gggcccagca agaagatacc caaactgttg agcaatcagc cgctggagct
4981 gaagccatcg ctcgcccaga agctggaggc caagaagccc aaatggatgc accacaatca
5041 cgcaccgctg acgctgcgca ccgcccgacc gcgggcagag gcctactacg agggcaagat
5101 gcgacggcgt tgcctcaagt tcatgttgag catcgagcaa ggcagacagc cagcaatacc
5161 accagcacag gcgcctgccg agcaggagga ctggtccatc accgagcagc agcagcagca
5221 gcagcatcag catcagcagc agcacgaaag tgagccatcc gaggagctgc aagaggaaga
5281 ggaacaacca caagaacgcg tggggcagca ggaagagcag gagccattga acgactcatc
5341 ggaggaacaa gtttggtttg gagagcccgg ctacgacgtc gatgagggtc cgcaatggag
5401 ccaaacgcgt cgctcgctct ccggcagcgt ggtgatgcgt cctgcaaaca atgcggccac
5461 gcggatcgaa catccaacgc atcaatggcc gtattcaggt cagtcgcagc ggcatgacga
5521 gcgcccgagt cagccgagtc ccacatacga tccctactat cgtgtggacg cggaccactg
5581 gccgccagaa cagcaagccg cccattggga gacgcctcct taccacccga agcagcaatc
5641 gccacagcag cagcagcagc aacagccgcc actcaatcag aattatcagg agccacagtc
5701 tcagcgacag acttcacgcc ggccacaggc gcagccgcag ccacagccac agccgcagcc
5761 acagcggcaa gcacagccgc agccacagaa actgccccag cggcagccac agactgagcg
5821 gcagccacag cgccaggagc agccacagcg ccaggagcag ccacagcgcc aatcacagcg
5881 ccagccacgg catgagcagc agcagccaca gcctgaacgc cagcaagatc ctcatcttca
5941 tgcccatccc ccccatcccc atgcccatgc ccatcgtcag cctcagcctc agcctcatgt
6001 tcagcgccag acgccgcttc cgcccccgcc acagccggac attaatgcca gatggcattg
6061 gcgtataccg agatctgtgt cagagaccag acccttctac gacgaggagg aggagcaaga
6121 ctatgccccg caagaggagc gacagccaac caattggcag aagcatgcac cgccgccgtc
6181 gtacatgcag ccaatgcatc accacggcgt ggcacgttat tctgctgact ctggccgggc
6241 ggctgcaagc cgtagtcccc cccgtcgagc cagcagacac cgggcgccgc agcggcaggg
6301 cggtggtgcg gccggtgcca tcaccgcagc cgcactggga ctgaggggcc tggtccaagc
6361 tctgggcggg cccagatatg cgagaaacct ctcgacggtt tccggcgagt ccgcagaggg
6421 gggaggcgac catggcaatg ccctcgattg cggccagcct cagcccatgg acttctcaag
6481 ggagcagctt cagttcacgg tggaaccggc aacagcggta catcgcggcg caccgctcgg
6541 cgccacggag cagacagcac caggggcagt ccgcagctat ggctgtgcgt tgggcagaga
6601 gcagcgggcc cggacatacg ctgtgcagaa aagaccttcg gcatttctgg cccgctccag
6661 ccgggtcaat ggctactcca agccgccatc cacggacaac gagcggagct ctcccctgtc
6721 gctgttcgcc gtgcccacat ccagcagcaa tctggcgtgg cgagccgaga cgagagccgc
6781 cagaacgata acgagcacgc gaatcccagg cggtcaggag gagcagccac gcatgccgcc
6841 tccagtctcg attctctccg atctggactt tcacacacat tccacatcac acaacagcat
6901 ctgctcgaca tcgggcggct atcgggcgcc gttaccgtct ctggcactgt ccgaggatgg
6961 cacttcgtcc ctggccgtat tcctggacaa tcttcgagcc agaaatgtac tgctgctgcc
7021 cggtaagccc agccctctgg cggagttacc gcccagcgga agtgcagcaa agtgcagcgg
7081 aaagcagcat cagcagcagc aggaggagga ggaggaggag gaggaggaga aggagaaggt
7141 ggaacccacg caaggacgtc gctggtttcc agctacaccg ctggtcatcc cgcccaccta
7201 caatgccctg ctggaggagc aggtacttca ggaatatctg cccacgccca aatgcagacg
7261 cttcaagact tgaatgttcc aacgatccca cagcagcaat caatcctcca agtagccata
7321 gcccggatag ctcagcgttt cacatcaaga accaataaat agctcagaat ttcccttaaa