PREDICTED: Drosophila obscura pleiotropic regulator 1
LOCUS XM_022361372 1614 bp mRNA linear INV 14-MAY-2021
(LOC111070663), mRNA.
ACCESSION XM_022361372
VERSION XM_022361372.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022361372.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..1614
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..1614
/gene="LOC111070663"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 4 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 17 samples with support for all annotated
introns"
/db_xref="GeneID:111070663"
CDS 78..1493
/gene="LOC111070663"
/codon_start=1
/product="pleiotropic regulator 1"
/protein_id="XP_022217064.1"
/db_xref="GeneID:111070663"
/translation="MEDVQKHSVHTLIFRSLKRTHDLFVSNQGNLPDVDDRLDKQRRS
IKAKDTYGSLMDRVVTANEPNRTPSGHSVERPLAITDGSTDTASSLMKFNADARRPDT
VLSIPGNNSTSLVRASMPSNGPGSNGASNLQLIPKKAPTIPKPKWHAPWKLSRVISGH
LGWVRCIAVEPGNEWFATGAGDRVIKIWDLASGKLKLSLTGHVSTVRGVAVSTKHPYL
FSCGEDRQVKCWDLEYNKVIRHYHGHLSAVYSLALHPTIDVLATSGRDSTARIWDMRT
KANVHTLTGHTNTVASVVAQATNPQIITGSHDSTVRLWDLAAGKSVCTLTNHKKSVRS
VVLHPSLYMFASASPDNIKQWRCPEGNFVQNISGHTAIVNCMAANNDGVLVSGGDNGT
MFFWDWRTGYNFQRFQAPVQPGSMDSEAGIFAMCFDHSGTRLITAEADKTIKVYKEDD
EASEETHPVNWRPDLLKRRKF"
misc_feature 534..1418
/gene="LOC111070663"
/note="WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions including
adaptor/regulatory modules in signal transduction,
pre-mRNA processing and cytoskeleton assembly; typically
contains a GH dipeptide 11-24 residues from...; Region:
WD40; cd00200"
/db_xref="CDD:238121"
misc_feature order(555..557,609..611,621..623,639..644,678..683,
735..737,747..749,765..770,804..809,858..860,873..875,
891..896,933..935,984..986,999..1001,1017..1022,
1056..1061,1113..1115,1125..1127,1140..1145,1179..1184,
1230..1232,1245..1247,1263..1268,1302..1307,1383..1385,
1395..1397,1413..1418)
/gene="LOC111070663"
/note="structural tetrad [active]"
/db_xref="CDD:238121"
misc_feature 573..680
/gene="LOC111070663"
/note="WD40 repeat [structural motif]; Region: WD40
repeat"
/db_xref="CDD:293791"
misc_feature 696..806
/gene="LOC111070663"
/note="WD40 repeat [structural motif]; Region: WD40
repeat"
/db_xref="CDD:293791"
misc_feature 819..929
/gene="LOC111070663"
/note="WD40 repeat [structural motif]; Region: WD40
repeat"
/db_xref="CDD:293791"
misc_feature 948..1055
/gene="LOC111070663"
/note="WD40 repeat [structural motif]; Region: WD40
repeat"
/db_xref="CDD:293791"
misc_feature 1071..1178
/gene="LOC111070663"
/note="WD40 repeat [structural motif]; Region: WD40
repeat"
/db_xref="CDD:293791"
misc_feature 1200..1322
/gene="LOC111070663"
/note="WD40 repeat [structural motif]; Region: WD40
repeat"
/db_xref="CDD:293791"
misc_feature 1341..1415
/gene="LOC111070663"
/note="WD40 repeat [structural motif]; Region: WD40
repeat"
/db_xref="CDD:293791"
ORIGIN
1 tgcagcccta gttgtgactg accggactgt tttgccgcat ttttgtttag tttatctcgg
61 atccatttaa ttgatcaatg gaggacgtac aaaagcacag cgtgcacacg cttatatttc
121 gctccctaaa gcgcacacac gatctgttcg tctccaatca aggaaacctg cccgatgtcg
181 acgaccgact ggacaaacaa cggcgatcga tcaaggccaa ggatacctat ggctcgttaa
241 tggacagggt tgtgacagcc aacgagccaa ataggactcc atcaggacac tccgtggaaa
301 ggccattggc catcacagac gggagcacag acactgccag cagcctgatg aaattcaatg
361 ccgatgcgcg tcgccccgac acggtgctgt ctataccggg taacaatagc acatccctag
421 tgcgggcatc catgcccagc aatggacctg gatctaatgg cgccagcaat ctgcagctca
481 ttccaaagaa ggcacccacc ataccgaagc ccaagtggca tgcaccctgg aagctgtctc
541 gcgtcatttc cggacatttg ggttgggtgc gttgcatagc cgtggagccg ggcaatgagt
601 ggttcgccac cggtgccggg gatcgtgtca ttaaaatctg ggatttggcc agcggtaagc
661 tgaaactctc gctcaccggg cacgtgagca ctgtacgggg agtggcagtc agcaccaagc
721 atccgtatct gttcagctgc ggcgaggatc gtcaggtgaa gtgctgggat ctggagtaca
781 acaaagtgat ccgacactat catggacatc tgtctgccgt ctactcgctg gccctccacc
841 caaccatcga tgtgttggcc acctctgggc gcgactccac ggctcgcatc tgggacatga
901 ggaccaaggc caatgtgcac acgctgaccg gccacaccaa tacggtggct agtgtggtgg
961 cccaggccac caatccacag atcatcaccg gctcccatga ttccaccgtg cggctgtggg
1021 atctagctgc cggcaagagc gtctgcaccc tgaccaacca taagaagtct gtgcgcagcg
1081 ttgtgctgca tccatcgctg tacatgttcg cctccgcctc gcccgacaac atcaaacagt
1141 ggcgctgtcc cgagggcaac tttgtgcaga acatctccgg ccacacggcc attgtcaact
1201 gtatggcggc caataacgat ggggtgctgg tctctggcgg cgacaacggc accatgttct
1261 tctgggactg gcgcaccggc tataatttcc agcgtttcca ggcccccgtc cagcccggct
1321 ccatggacag cgaggcgggc atatttgcca tgtgctttga ccattcgggc acgcgtctaa
1381 tcaccgccga ggcggacaag accatcaagg tgtacaagga ggatgacgaa gccagcgagg
1441 agacgcatcc agtcaattgg cggccagatc tgttgaagcg tcgcaagttc taagttctat
1501 ccggtagtct gtagtccaca aaaatatcta atgtataatt taagcgtaaa actgcccccg
1561 gaaacagatt tcgattaaat aatggtttgt tccaatgcaa ttctacatca aata