PREDICTED: Drosophila obscura uncharacterized LOC111072415
LOCUS XM_022364261 2083 bp mRNA linear INV 14-MAY-2021
(LOC111072415), mRNA.
ACCESSION XM_022364261
VERSION XM_022364261.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022364261.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..2083
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..2083
/gene="LOC111072415"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 1 Protein, and 100% coverage of
the annotated genomic feature by RNAseq alignments"
/db_xref="GeneID:111072415"
CDS 158..1567
/gene="LOC111072415"
/codon_start=1
/product="uncharacterized protein LOC111072415"
/protein_id="XP_022219953.1"
/db_xref="GeneID:111072415"
/translation="MRVCQLAVAVGCLHVLLSCSASNHWVLKEDEITQKLDSPFHMRE
PQNLIAFLEQIRYNEYVERSYLDLLRKREQIVEHLRFSMRFGEDLAEQAKCAMDYYML
EKRMSYLKITPDQLKRINLPEQSDVNQPHGKSGTGTKQRTLEPICSNYHKLSVGPATY
DHLESFQPKIMDSAYVEREHDTNIGTATVELTRRFAVDGLQHHPASWKFHTLSSYYWR
MRGNAREALPCARLATLLAPPIFKDIPLLSLGTILFRMGRLIDADLILTAAVEHAPRV
AENHVVLASALAMKHDFNRSLQHFDEAERLDPSTLPRTQQVRNFISCLENLTKKTSKM
YSYVKYMKNEVKEFKKLKFHISQNHERLVQQQLPLGARRFLDSKSSNDDLHRRGQYCS
TRTPNGSDEPVLFCDFYSDMQMRLESKDVDIDVLERDLKANTDAVIRQVSTEIRKQFN
LEQLKAAKAQMPAKATKST"
misc_feature 815..1090
/gene="LOC111072415"
/note="Type IV pilus assembly protein PilF/PilW [Cell
motility, Extracellular structures]; Region: PilF;
COG3063"
/db_xref="CDD:442297"
misc_feature 893..970
/gene="LOC111072415"
/note="TPR repeat [structural motif]; Region: TPR repeat"
/db_xref="CDD:276809"
misc_feature 983..1075
/gene="LOC111072415"
/note="TPR repeat [structural motif]; Region: TPR repeat"
/db_xref="CDD:276809"
ORIGIN
1 tttgatcggt aactccgcgg tcacactgct ggctatcgcc gcagtttatt taccaaattt
61 gtttagatgt tttttctatt atcctctgtg ctgtttgctt gttaaccggc gtgcagatta
121 tagttacgcc ctttgtggga ccatactcca atgtaatatg cgtgtctgtc agctcgccgt
181 ggccgtcggc tgcctgcacg tgctcctcag ctgctcggcc agcaatcatt gggtgctgaa
241 ggaagacgaa attacccaaa agctggactc gccctttcac atgcgcgagc cccagaacct
301 gattgccttc ctggagcaga tacgctacaa tgagtacgtt gagcgctcct acctggacct
361 gttgcgaaag cgggaacaga ttgtcgagca cttgcgcttc tccatgcgct tcggggaaga
421 cctggccgag caggccaagt gtgcgatgga ctactacatg ctggagaagc gaatgtccta
481 tctgaagatc acacccgatc aactgaaacg catcaatctg ccggagcaga gcgacgtgaa
541 tcagccgcac ggaaagtctg gcactgggac aaaacaacgt accctggagc ccatctgcag
601 caactatcat aagctgagtg ttggcccggc cacctacgac catctcgaaa gcttccagcc
661 aaagattatg gactcggcat acgtggagcg ggagcatgac accaatatcg ggacggctac
721 cgtcgaactg acacgacggt ttgccgttga cggactgcag catcatccgg cgtcatggaa
781 attccacacg ctcagctcct actactggcg catgcgaggc aatgctcgcg aggctctgcc
841 ctgcgcccgt ctggccacat tgctggcccc gcccatcttc aaggacatac cgctgctcag
901 tctgggcaca atactctttc gcatgggtcg cctgatcgat gccgatctaa tactgacggc
961 tgccgtcgag catgccccca gggtggcgga gaatcatgtg gtactcgcct cagcgctggc
1021 catgaagcat gacttcaatc ggtccctgca gcactttgat gaggccgaac gcctcgatcc
1081 cagcaccctg ccgcgcaccc agcaagtgcg caacttcatc agctgcctgg agaatctcac
1141 caagaagacc tccaaaatgt acagctacgt gaagtacatg aaaaacgagg tgaaggagtt
1201 caagaagctc aagtttcata tatcccagaa ccatgagcga ctcgtgcagc agcagctgcc
1261 gctgggtgca cggcgcttcc tcgactcaaa gtccagcaac gatgatctgc atcgtcgggg
1321 ccaatactgc agtacgcgca cacccaacgg ctccgacgag ccggtgctct tctgtgattt
1381 ctactcggac atgcagatgc ggctggagag caaggacgtg gatatcgatg tgctcgagcg
1441 ggatctgaag gccaacactg atgcggttat tcggcaggtg tcgacggaaa tacgcaagca
1501 attcaatctc gaacagctga aggcagccaa ggcccaaatg cccgccaagg cgacaaaatc
1561 aacgtagaaa agcaaccgaa gacgggtaga gcagaggagc ggagcggaga ggagaccacc
1621 aaatgcattc cctaacaaca agtagcagta gtagtagtat atatatatat atatatatat
1681 atatatattt cggatcacaa attagttttt aatctgttca atagtattta cgaccgtgtg
1741 tgtgtctgtc cctgtgtgtg tggctgggaa catttatgtg tacatatatt gataaagata
1801 ttcaataagt catcaatcag gtaatagctt attaatcggc atctaaggct aggcatgtaa
1861 gatgagtctg aatttttcac ataccccaat gctctcgtgc atttgaatat tcgatcaaag
1921 gagagcacag cgcgcagagc cgtgactaag tatcgcgcgt cttatgttac gattaggcat
1981 ataaagattt atatatatat atgtatgtat gtatgtatat ttttttcgtt tcgtttcata
2041 ttttttgtac acataagagc agccagtaag ttatgcatta aac