LOCUS XM_022367752 2111 bp mRNA linear INV 14-MAY-2021
DEFINITION PREDICTED: Drosophila obscura vacuolar protein sorting-associated
protein 4 (LOC111074802), transcript variant X1, mRNA.
ACCESSION XM_022367752
VERSION XM_022367752.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022367752.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..2111
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..2111
/gene="LOC111074802"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 5 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 17 samples with support for all annotated
introns"
/db_xref="GeneID:111074802"
CDS 174..1499
/gene="LOC111074802"
/codon_start=1
/product="vacuolar protein sorting-associated protein 4"
/protein_id="XP_022223444.1"
/db_xref="GeneID:111074802"
/translation="MASGTTLQKAIDLVTKATEEDRNKNYAEALRLYEHGVEYFLHTI
KYEAQGEKAKDSIRAKCLQYLDRAEKLKEYLKKGKKKPIKEGDESNSKDDKDKKSDSD
DEDDDPEKKKLQAKLEGAIVIEKPQVQWSDVAGLDAAKEALKEAVILPIKFPQLFTGK
RIPWKGILLFGPPGTGKSYLAKAVATEANRSTFFSVSSSDLMSKWLGESEKLVKNLFE
LARQHKPSIIFIDEIDSMCSARSDNENDSVRRIKTEFLVQMQGVGNDTDGILVLGATN
IPWVLDSAIRRRFEKRIYIPLPEAHARLVMFKIHLGNTTHVLTEQDLKEMAGKTEGYS
GADISIVVRDALMEPVRKVQTATHFKKVTGPSPSNKDETVDDLLIPCSPGDAGAVEMN
WMDVPSDKLFEPAVTMRDMLKSLSRTKPTVNEDDLKKLRKFTEDFGQEG"
misc_feature 189..380
/gene="LOC111074802"
/note="MIT: domain contained within Microtubule
Interacting and Trafficking molecules. This sub-family of
MIT domains is found in intracellular protein transport
proteins of the AAA-ATPase family. The molecular function
of the MIT domain is unclear; Region: MIT_VPS4; cd02678"
/db_xref="CDD:239141"
misc_feature 546..1058
/gene="LOC111074802"
/note="ATPase domain of vacuolar protein
sorting-associated protein 4; Region: RecA-like_VPS4;
cd19521"
/db_xref="CDD:410929"
misc_feature order(549..551,606..608,618..620,627..632,642..644,
654..656,690..695,705..707,717..719,750..752,756..758,
762..770,777..779,801..803,864..869,876..878,891..893,
897..899,903..911,915..923,930..935,1002..1004,1011..1013)
/gene="LOC111074802"
/note="hexamer interface [polypeptide binding]; other
site"
/db_xref="CDD:410929"
misc_feature order(567..575,579..581,690..710,864..866,999..1001)
/gene="LOC111074802"
/note="ATP binding site [chemical binding]; other site"
/db_xref="CDD:410929"
misc_feature 1137..1256
/gene="LOC111074802"
/note="AAA+ lid domain; Region: AAA_lid_3; pfam17862"
/db_xref="CDD:465537"
misc_feature 1305..1487
/gene="LOC111074802"
/note="Vps4 C terminal oligomerization domain; Region:
Vps4_C; pfam09336"
/db_xref="CDD:462762"
ORIGIN
1 aaataaaaaa ataaaaaaaa caacaaaaca agacggagac gtcgtcattc gcagtttcta
61 tttctgttcc actcccagcc gcatccaaca gcccccaagc gtacgaccca ctgcatcctg
121 tgcgcggcaa gagtttagtt tttgggattt ggagccagac agcgtaaatc agcatggcat
181 ccggcaccac actacagaag gccatcgacc tggtgaccaa ggccaccgag gaggatcgca
241 acaagaacta tgccgaggca ctgcgcctct atgagcacgg agtggagtac tttctgcata
301 cgattaaata cgaggcgcag ggcgagaagg ccaaggactc gatcagagcc aagtgcttgc
361 agtatttgga tcgggccgag aagctcaagg agtacctgaa gaagggcaag aagaaaccga
421 taaaggaggg cgacgagtcc aattccaagg acgacaagga caagaagagc gacagcgatg
481 acgaggacga cgacccggag aagaagaagc tgcaggccaa gctggagggg gcgattgtca
541 tcgagaagcc gcaggtgcag tggtccgatg tggccggtct ggacgccgcc aaggaggccc
601 tcaaagaggc cgtcatcttg cccatcaagt tcccacagct gttcaccggc aagcgcatac
661 cctggaaggg catcctgctg ttcggcccac ccggtacggg caagtcctac ctggccaagg
721 cagtcgccac cgaagccaac cggtccacat tcttctccgt gtccagctcc gatctgatgt
781 ccaaatggct gggcgagtcc gagaagctgg tcaagaatct ctttgagctg gcccgccagc
841 acaagccatc gatcatattc atcgacgaga tcgattccat gtgctcggcc cgttccgaca
901 acgagaacga cagtgtgcgt cgcatcaaga ccgagttcct ggtgcagatg cagggcgtgg
961 gcaacgacac ggacggcatc ctggtcctgg gcgccaccaa tataccctgg gtcctcgact
1021 ctgccatccg gcgacgcttc gagaagcgca tctacattcc gctgccggag gcgcatgccc
1081 gccttgtcat gttcaagata cacttgggca acaccacaca cgtcctcacc gaacaggatc
1141 tcaaggagat ggctggcaaa accgagggat actctggtgc ggatatatcg attgtggtgc
1201 gcgatgcact gatggagccc gtgcgaaagg tccaaacggc cacgcacttc aagaaggtga
1261 cgggacccag tccctcgaat aaggacgaga ctgtcgatga cctgcttatc ccatgctctc
1321 caggtgatgc gggcgccgtc gagatgaact ggatggatgt gcctagcgac aagctcttcg
1381 aaccggccgt gaccatgcgc gacatgctaa aatcgctgtc gcgcacgaaa cccacagtca
1441 acgaagatga cctgaagaag ctgcgcaaat tcacagagga ctttggccag gagggctagg
1501 gctagggctc tcccccagcc cattccccgc cagcactcca gcaccaccag ctaattctaa
1561 ttgttgccta ttgcataaat tcaaacacgt tttattcggg ctctccatta cctattttct
1621 tttttttttt acatacatat atatatacga taaataacat acgcgaacga tatatgtacg
1681 gttaaatgtt taattcaaat ttcaatacga ttgcaagaga aggaagaaaa gaaagtaaaa
1741 gagtaagagc aaggagaaag gagcaaggag cagagagatg gctatcacca gaagcagcct
1801 gatgttgttg taatcacaaa gtcaatgtat ttttgtactc gtaatgttaa gcaatagaag
1861 agatccaacc cgtaataccc gtaataccat accccatgta cacgacggac gatacgtgtg
1921 tcggagccac cagacagtgt gtgttctagc gcaatgatgc ccgtatatgt agtatatgct
1981 agtaggacag ctgcagaagt gatcagcaga tgaagaacga taagtagtga gcgaagccaa
2041 ccaattcaag caaccaagca aacctaattc aaatgtaact tatgaaaaat aaacgaaaag
2101 tatatactgt a