LOCUS XM_022367753 2108 bp mRNA linear INV 14-MAY-2021
DEFINITION PREDICTED: Drosophila obscura vacuolar protein sorting-associated
protein 4 (LOC111074802), transcript variant X2, mRNA.
ACCESSION XM_022367753
VERSION XM_022367753.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022367753.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..2108
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..2108
/gene="LOC111074802"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 5 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 17 samples with support for all annotated
introns"
/db_xref="GeneID:111074802"
CDS 171..1496
/gene="LOC111074802"
/codon_start=1
/product="vacuolar protein sorting-associated protein 4"
/protein_id="XP_022223445.1"
/db_xref="GeneID:111074802"
/translation="MASGTTLQKAIDLVTKATEEDRNKNYAEALRLYEHGVEYFLHTI
KYEAQGEKAKDSIRAKCLQYLDRAEKLKEYLKKGKKKPIKEGDESNSKDDKDKKSDSD
DEDDDPEKKKLQAKLEGAIVIEKPQVQWSDVAGLDAAKEALKEAVILPIKFPQLFTGK
RIPWKGILLFGPPGTGKSYLAKAVATEANRSTFFSVSSSDLMSKWLGESEKLVKNLFE
LARQHKPSIIFIDEIDSMCSARSDNENDSVRRIKTEFLVQMQGVGNDTDGILVLGATN
IPWVLDSAIRRRFEKRIYIPLPEAHARLVMFKIHLGNTTHVLTEQDLKEMAGKTEGYS
GADISIVVRDALMEPVRKVQTATHFKKVTGPSPSNKDETVDDLLIPCSPGDAGAVEMN
WMDVPSDKLFEPAVTMRDMLKSLSRTKPTVNEDDLKKLRKFTEDFGQEG"
misc_feature 186..377
/gene="LOC111074802"
/note="MIT: domain contained within Microtubule
Interacting and Trafficking molecules. This sub-family of
MIT domains is found in intracellular protein transport
proteins of the AAA-ATPase family. The molecular function
of the MIT domain is unclear; Region: MIT_VPS4; cd02678"
/db_xref="CDD:239141"
misc_feature 543..1055
/gene="LOC111074802"
/note="ATPase domain of vacuolar protein
sorting-associated protein 4; Region: RecA-like_VPS4;
cd19521"
/db_xref="CDD:410929"
misc_feature order(546..548,603..605,615..617,624..629,639..641,
651..653,687..692,702..704,714..716,747..749,753..755,
759..767,774..776,798..800,861..866,873..875,888..890,
894..896,900..908,912..920,927..932,999..1001,1008..1010)
/gene="LOC111074802"
/note="hexamer interface [polypeptide binding]; other
site"
/db_xref="CDD:410929"
misc_feature order(564..572,576..578,687..707,861..863,996..998)
/gene="LOC111074802"
/note="ATP binding site [chemical binding]; other site"
/db_xref="CDD:410929"
misc_feature 1134..1253
/gene="LOC111074802"
/note="AAA+ lid domain; Region: AAA_lid_3; pfam17862"
/db_xref="CDD:465537"
misc_feature 1302..1484
/gene="LOC111074802"
/note="Vps4 C terminal oligomerization domain; Region:
Vps4_C; pfam09336"
/db_xref="CDD:462762"
ORIGIN
1 aaataaaaaa ataaaaaaaa caacaaaaca agacggagac gtcgtcattc gcagtttcta
61 tttctgttcc actcccagcc gcatccaaca gcccccaagc gtacgaccca ctgcatcctg
121 tgcgcggcaa gagtttagtt tttgggattt ggagccagac agcgtaaatc atggcatccg
181 gcaccacact acagaaggcc atcgacctgg tgaccaaggc caccgaggag gatcgcaaca
241 agaactatgc cgaggcactg cgcctctatg agcacggagt ggagtacttt ctgcatacga
301 ttaaatacga ggcgcagggc gagaaggcca aggactcgat cagagccaag tgcttgcagt
361 atttggatcg ggccgagaag ctcaaggagt acctgaagaa gggcaagaag aaaccgataa
421 aggagggcga cgagtccaat tccaaggacg acaaggacaa gaagagcgac agcgatgacg
481 aggacgacga cccggagaag aagaagctgc aggccaagct ggagggggcg attgtcatcg
541 agaagccgca ggtgcagtgg tccgatgtgg ccggtctgga cgccgccaag gaggccctca
601 aagaggccgt catcttgccc atcaagttcc cacagctgtt caccggcaag cgcataccct
661 ggaagggcat cctgctgttc ggcccacccg gtacgggcaa gtcctacctg gccaaggcag
721 tcgccaccga agccaaccgg tccacattct tctccgtgtc cagctccgat ctgatgtcca
781 aatggctggg cgagtccgag aagctggtca agaatctctt tgagctggcc cgccagcaca
841 agccatcgat catattcatc gacgagatcg attccatgtg ctcggcccgt tccgacaacg
901 agaacgacag tgtgcgtcgc atcaagaccg agttcctggt gcagatgcag ggcgtgggca
961 acgacacgga cggcatcctg gtcctgggcg ccaccaatat accctgggtc ctcgactctg
1021 ccatccggcg acgcttcgag aagcgcatct acattccgct gccggaggcg catgcccgcc
1081 ttgtcatgtt caagatacac ttgggcaaca ccacacacgt cctcaccgaa caggatctca
1141 aggagatggc tggcaaaacc gagggatact ctggtgcgga tatatcgatt gtggtgcgcg
1201 atgcactgat ggagcccgtg cgaaaggtcc aaacggccac gcacttcaag aaggtgacgg
1261 gacccagtcc ctcgaataag gacgagactg tcgatgacct gcttatccca tgctctccag
1321 gtgatgcggg cgccgtcgag atgaactgga tggatgtgcc tagcgacaag ctcttcgaac
1381 cggccgtgac catgcgcgac atgctaaaat cgctgtcgcg cacgaaaccc acagtcaacg
1441 aagatgacct gaagaagctg cgcaaattca cagaggactt tggccaggag ggctagggct
1501 agggctctcc cccagcccat tccccgccag cactccagca ccaccagcta attctaattg
1561 ttgcctattg cataaattca aacacgtttt attcgggctc tccattacct attttctttt
1621 tttttttaca tacatatata tatacgataa ataacatacg cgaacgatat atgtacggtt
1681 aaatgtttaa ttcaaatttc aatacgattg caagagaagg aagaaaagaa agtaaaagag
1741 taagagcaag gagaaaggag caaggagcag agagatggct atcaccagaa gcagcctgat
1801 gttgttgtaa tcacaaagtc aatgtatttt tgtactcgta atgttaagca atagaagaga
1861 tccaacccgt aatacccgta ataccatacc ccatgtacac gacggacgat acgtgtgtcg
1921 gagccaccag acagtgtgtg ttctagcgca atgatgcccg tatatgtagt atatgctagt
1981 aggacagctg cagaagtgat cagcagatga agaacgataa gtagtgagcg aagccaacca
2041 attcaagcaa ccaagcaaac ctaattcaaa tgtaacttat gaaaaataaa cgaaaagtat
2101 atactgta