LOCUS XM_022364238 4546 bp mRNA linear INV 14-MAY-2021
DEFINITION PREDICTED: Drosophila obscura probable cationic amino acid
transporter (LOC111072394), mRNA.
ACCESSION XM_022364238
VERSION XM_022364238.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022364238.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..4546
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..4546
/gene="LOC111072394"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 5 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 10 samples with support for all annotated
introns"
/db_xref="GeneID:111072394"
CDS 733..3192
/gene="LOC111072394"
/codon_start=1
/product="probable cationic amino acid transporter"
/protein_id="XP_022219930.2"
/db_xref="GeneID:111072394"
/translation="MKPSDLLLVLEKVKFRVPLPPGVSSTTLLPKLIRTKDVRQLQDG
NAQPQKPKLTKCLNTLDLTSLGIGSCCGTGMYLVAGMVAQKIAGPGVIISFIIAAIAS
IFSGACYAEFGVRVPHTSGSAYMYSYVAVGEFVAFIIGWNMILEYLIGTSACACALSS
SFDSLTGNAIARTISESIGTIFGKPPDFIAFGITLLMTCVLAMGASKSVIFNHSLNAV
NLATWVFVMAAGLFYVDTKTWSEHQGFLPYGWSGVFSGAATCFYAFIGFDIIATTGEE
AHNPQKSIPKAIVGSLVVVLIAYVSVSLVLTLVVPYDHINTGAALVQMWSYVNAPKCR
AVVAIGATAGLSVAMFGSMFPMPRVIYAMAQDGLIFRQLSQLWQRTNVPGLATIGSGL
AAALVALTIRLEILVEMMSIGTLLAYTLVSTCVLVLRYQPHSTSLVELLPAQLRTPVA
PGTVSDSHATPAEVLEVPNKLTIKRVTRGMSDSDDSFIDDSPEGYLGGRDDQFLVSDR
SENKFYGSVHGAPTGPTGQATAFDTMGLNFITRKIHDYAYLCPGFFPWINPGQATTDS
GMYVTKLVGIMFGLIFFLDLFAAIGWSGGLAVFIYLILFIGIFVILLIISRQPQNRYA
LAFLTPGLPFIPAIAITVNIYLIFKLSILTLVRFTVWMSLGLVMYFYYGITHSSLEQA
SDDLELHVDKDYSKNVEEKAVWDQQSYNQTHEPVWASKEVKQPSKQRYNYGKSSSSST
QSSSRNAQSNAATKPPGRSSTGTRPVAPLPHTGAGQAKGSGKGTGTGTGTGTGGPPME
RQYTGQFSMFVDEGQFPSWED"
misc_feature 886..2763
/gene="LOC111072394"
/note="cationic amino acid transport permease; Region:
2A0303; TIGR00906"
/db_xref="CDD:273330"
ORIGIN
1 acagagagag agatagggag acaggacagt gttgggcaga gacaaaaaac tgaaaacgaa
61 actgaaaaga aagtgaaaat ttgcctaacg tggacaagtg caaaagaaag gaggagaggg
121 gtgagatcct tcagacgtga aactgttgaa tatagaatga cgttgctgag agaaatcaat
181 gtggaaaaca acaaaaaagc aacagtagca gcaccaccag cacgagtaac tgtaaactta
241 tcctcaacaa cagcaacagc agcagcagaa gcagcagcag gcgaaggaac agtagcagca
301 ggagctgcga aaacaacaga agaagcaaca gcaacatcaa gaagctcacc acgcaagcag
361 agataaagcg aagaaggtca tcaagctgca aagcaaagaa taggaaagcg cctgaaagaa
421 gagttagaac tagtgggaaa agggcaacag ccagaaccgg aagaatccca aggaagcagc
481 agaagatcag caacacacaa atacagcaag cagcaccaca tacacacaca acaacaacaa
541 caacaaaaaa aacaattcat catctcactt tgccatttta ttggccactc tttcaccgtt
601 ccgttcgatt gttggtgctg ttgctgggtc ctccattgac tttaaataga agaagttgtg
661 ctcaacgcat ctgattgctg ttgctgcatt tggctagcgg gggtggaaag ggaagcaaga
721 aagaagaacg caatgaagcc atcggatttg ttgttggtgc tagagaaggt caagttcagg
781 gtgccattgc cgcctggcgt tagctcgacg acgctcctac cgaagctcat acgcaccaag
841 gatgtgaggc agctgcagga cggcaatgca cagccacaga agcccaagct gacgaaatgt
901 ctgaacactt tggacctgac ctcgctgggc atcggctcct gctgtggcac tggcatgtat
961 ctggtcgccg gcatggtggc ccagaagata gccggacccg gtgtgattat tagttttatt
1021 atagccgcca tagcgagtat tttctcaggc gcctgctatg cggaatttgg cgttcgtgtg
1081 ccacacacat ccggctcggc ctatatgtat tcatatgtgg cagttggcga atttgtagct
1141 tttataattg gttggaacat gatattggaa tatctaatag gaacaagtgc ctgtgcctgt
1201 gctttaagtt ctagcttcga ttccctgact ggcaatgcca ttgcgcgcac gataagcgag
1261 tccattggca caatattcgg caaaccaccg gactttatag cctttggcat aacgctactc
1321 atgacctgcg tattggcgat gggcgccagc aagtcggtca tctttaatca ttccctgaat
1381 gccgtcaatc tggccacatg ggtctttgtc atggctgccg gccttttcta tgtggacacg
1441 aagacgtggt cggaacatca gggattcctg ccctatggct ggagtggtgt attctctggg
1501 gccgccacat gcttctatgc atttatcgga ttcgacataa ttgcaacaac cggcgaggag
1561 gcgcataatc cacagaagag cataccgaag gccattgtgg gctccctggt tgtggttttg
1621 attgcctatg tcagcgtcag tctagtcctt actttagtcg tgccctatga tcacatcaac
1681 acgggagcgg ctctggtcca gatgtggtcg tatgtgaatg cacccaagtg ccgtgcggtg
1741 gtggcgattg gggccacagc tggtctgtcc gtggccatgt ttggctcaat gtttccgatg
1801 ccgcgcgtca tctatgccat ggcccaggat ggcctgattt tcagacaact ctcgcagctg
1861 tggcagcgca ccaatgtgcc tggtctggcc accattggca gtggactggc tgcggctttg
1921 gtcgccctca ccatacgcct ggagatactc gttgagatga tgtccattgg caccctgctg
1981 gcctataccc tggtctcgac atgtgtcctg gtgctgcgct accagccgca cagcacctcg
2041 ttggtggaac tgttgcccgc ccagctgcgt accccggtgg caccgggcac cgtcagcgat
2101 tcgcatgcaa cgcccgccga ggtgctggag gtgcccaata agctgaccat caagcgggtg
2161 acacgcggca tgtccgattc ggatgactcc ttcatcgatg acagccccga gggctatctg
2221 ggcgggcggg acgatcagtt tctggtgtcg gatcgctccg agaataagtt ctatggcagc
2281 gttcatggtg cacccaccgg acccacgggc caggcgaccg ccttcgacac gatgggcttg
2341 aactttatca cacgaaagat ccatgactat gcgtacctgt gccccggctt ctttccctgg
2401 atcaatcccg gccaggccac caccgacagt ggcatgtatg tcacgaaact agttggaatc
2461 atgttcggcc tcatattctt cttggatttg ttcgcggcca ttgggtggtc cggtggccta
2521 gccgtcttca tttatttaat tttgtttatc ggcatatttg tgatattatt gattatatcg
2581 agacaaccgc agaatagata tgcactggcc tttctaacgc ccggactacc cttcatacca
2641 gccattgcca tcaccgtgaa catctatctg atattcaaat taagcatcct gaccctggtc
2701 cggttcaccg tctggatgtc cctcggcctt gtcatgtact tctactatgg gatcacgcac
2761 agcagcctgg agcaggccag cgatgatctc gagctgcatg tggacaagga ttatagcaaa
2821 aacgtggagg agaaggccgt ctgggatcag cagtcgtata atcaaaccca cgagccagtc
2881 tgggccagca aggaggtgaa gcagccatca aagcaacgct acaactatgg aaagtcttcg
2941 tcgtcgtcta ctcagagcag ctccagaaat gcccaaagca atgcggcgac aaagccgcca
3001 ggccgcagct caacgggcac acggccagtg gcgccattgc cgcacacagg cgctggtcag
3061 gcgaagggtt cgggcaaggg aacgggaacg ggcacaggaa cgggaacggg aggaccgccc
3121 atggagcgtc aatataccgg gcagtttagc atgtttgtgg atgagggcca gttcccgtcg
3181 tgggaagact aaattggaat attcaaaccc ctcctcaccc cgaacccata gttgttttcc
3241 cttttcccaa tccccgctcc tggatcctcg gttgatatcc cctgatgtgt ccagcaggac
3301 tgctgatccc caaatcccaa atcccacgtt tcgaatcgga tagtcctggc aacaaaaaaa
3361 ctgggccgct cctcaaggcg tcgatctgtg atcgtagccg tgtacgcttg ggtgtccaga
3421 acggaaccag tcccccaggt agagctcact caacttgtat atacaattag agatgttaag
3481 tttaccggaa tatgtacatg tatacgtata ctcgttacat acatacatac atgttacggt
3541 tatgggaaac gtgacgagac gagagacgag aggagacgag cgagcgttta gcgctgaccg
3601 ccgttataca cttataccat tatacaacat tagtatatac agatatatat acatatacat
3661 acatacatat atatacctat gcatatacta tttacaaaac atagattggt tattagtgca
3721 aagttatcaa aaatcctgtg tatcctgtgt atgccatata ctacatattc tacgtcccct
3781 cccttccttt tgtgctcagc ttcgattggc gagcgttttg ggtgacgctg gtcaatcggc
3841 attcagcaca agggacaaca agggacaaca agaggacacg acgaccctgg gggacactgt
3901 accctggaaa cagtaggacg acagatgaat attattggga aaagtaagca aaaactgtgt
3961 gtgtaagagt gccgggtgtg cacaactgta cattttatac actcaatatt ctcacacaca
4021 cacacacaca cacatgataa tacatgcata catgcctaag cttatggaaa gagcgatgga
4081 tcagtgaaga ctttttttcc acccccacaa aaagatatgc gctgaccgaa aggaactttg
4141 cattaagctt aggatccaca cacagacaca gggaacacat ccaaaacact tatagagaca
4201 cacacacatg gacacacaac tcttatgtat ttatgattgt attgttgttc aaatatttat
4261 gctacctgct atgctatgca ttttatataa cattttgaaa tttgaccttt ttgatgtatt
4321 tatgtatgaa tacagttact actctctctc ccctctccaa gccacctccc cccatctcaa
4381 tctatccaca attttattgt ttcctgattc tagaggctcg cgctaagcta aaccacacat
4441 tttaaagcgg catttcgcat ttgttgtaat tcaataattc atttttctta acaaatacca
4501 aacaagaaaa caaaacaaaa aaaaaaacaa attaccgaag gaaaag
//