PREDICTED: Drosophila obscura amyloid protein-binding protein 2
LOCUS XM_022371716 3861 bp mRNA linear INV 14-MAY-2021
(LOC111077440), mRNA.
ACCESSION XM_022371716
VERSION XM_022371716.2
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
On May 14, 2021 this sequence version replaced XM_022371716.1.
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
FEATURES Location/Qualifiers
source 1..3861
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..3861
/gene="LOC111077440"
/note="Derived by automated computational analysis using
gene prediction method: Gnomon. Supporting evidence
includes similarity to: 2 Proteins, and 100% coverage of
the annotated genomic feature by RNAseq alignments,
including 17 samples with support for all annotated
introns"
/db_xref="GeneID:111077440"
CDS 306..2471
/gene="LOC111077440"
/codon_start=1
/product="amyloid protein-binding protein 2"
/protein_id="XP_022227408.2"
/db_xref="GeneID:111077440"
/translation="MANNNQLMNLIASAPANNNSPKPLYDLSLHASICSLDGNTVPAL
DRISRLPELPRNLLIDVYEMMSQRDSLQDTLLEELSRLEVFARLVRHAPARSKLLRIV
AALMANRKMLASRLSEAYVGRYSTKKEQDGDEDGEQPGEHQGRGGGGAEGQEQSQDSL
DERAKQLLQEYSQLPIPDEDMASTSSSASAMARARSSQTECPNHELELSVITTAMDQP
GSTTPNADAAAAAADDDLDELNGEELQEIDLGLRLGSFLSEAGWMQESISVLTCLNER
LKRLAPHKHWLVMRLDCLQRLLYAESAHCNFKLAQRTYNELIELKRSISDRVPSDLVA
MTYTQISAMFFARNEYNNSHMWSLHAMRRLKHSATPRIAIDVLRQAAKACVVKRDFAR
ANLLICQAVRRAREHFGRKHQKYGDTLLDYGFFLLNVDSVFQSVNVYKEALGVRRGIF
GNMNFHVAIAHEDLSYAYYVHEYSTGDFSCAQDHVDKAVGIMKKLVPSNHLMLASAKR
VKALLLEEIALDKMADGMDEEDLLLQSEELHNFALLLSLEVFGEVNVQTAKHYGNLGR
LYQTMNRFEEAERMHQKAIKIKTDLLGPFDYEVGLSIGHLASLYNYQMKKYREAEKLY
LRSIDISLRLFGLSYSGLEYDYLGLCHVYETLHDFEKYLQYAHTLENWQMLRGQNITQ
NKFSYPAIEQDYSIDEVKNKFYDTCESSSSSSPQKTEGK"
misc_feature 1836..>2303
/gene="LOC111077440"
/note="Predicted O-linked N-acetylglucosamine transferase,
SPINDLY family [Posttranslational modification, protein
turnover, chaperones]; Region: Spy; COG3914"
/db_xref="CDD:443119"
misc_feature 1971..2195
/gene="LOC111077440"
/note="Tetratricopeptide repeat; Region: TPR_12;
pfam13424"
/db_xref="CDD:315987"
misc_feature 1977..2063
/gene="LOC111077440"
/note="TPR repeat [structural motif]; Region: TPR repeat"
/db_xref="CDD:276809"
misc_feature order(1980..1982,1989..1994,2001..2006,2010..2012,
2106..2108,2115..2120,2127..2132,2139..2144,2235..2237,
2244..2249,2256..2261,2268..2270)
/gene="LOC111077440"
/note="putative protein binding surface [polypeptide
binding]; other site"
/db_xref="CDD:276809"
misc_feature 2100..2195
/gene="LOC111077440"
/note="TPR repeat [structural motif]; Region: TPR repeat"
/db_xref="CDD:276809"
ORIGIN
1 gcccttatcg agtaaaaaag ggaaaaaaga attcgagtgc tacagtgttt ttttacgctt
61 cacggcccga aaaacagcgg agttcctcca gagtggcccg cagtacacgt gagagagaaa
121 cttagatgca ttcgagtttg tagtgcaccc gagtgccacc ccgcccctgc tccacacgca
181 ccagcgagag acgtgagcgc agcagtgtcg ccgcagccca gccccagctc taactgctgc
241 ccccgagccc gagccaagta gcaaacggtg cagaggatca tcatcatcgt cgtcgtccca
301 tcgcaatggc caacaacaat caactgatga atctcatcgc gtcggccccc gccaacaaca
361 acagccccaa gcccctctac gacctgtccc tgcatgcatc gatctgcagc ctggatggga
421 acacggtgcc agcactggat cggattagcc ggttgccgga gctgccgcgc aatttgctta
481 tcgatgtcta cgaaatgatg tcgcagcggg attccctaca ggacaccctg ctggaggagc
541 tctcgcgcct ggaggtgttc gcacggctcg tgcgccacgc accagcacgc agcaaactgc
601 tgagaattgt ggccgccctg atggccaaca gaaagatgct cgctagccga cttagcgagg
661 cctatgtggg tcgctacagc acaaaaaagg agcaggatgg cgacgaggat ggggagcagc
721 caggcgaaca tcaggggaga ggaggaggag gtgccgaggg gcaagaacag tcgcaggaca
781 gtctggatga gcgggccaaa cagctgctgc aggagtacag ccagctgccg ataccggacg
841 aggatatggc cagcaccagc agcagtgcca gtgccatggc cagggccaga tccagccaaa
901 cggagtgccc caaccatgag ctcgaactca gtgtgattac tactgccatg gatcagccag
961 gatctacaac gccgaatgca gatgcagctg cagctgcagc cgatgacgac ctggacgaac
1021 tgaacgggga ggagctgcag gaaattgatc tgggtctgcg tctcggttcg ttcctctccg
1081 aggccggttg gatgcaggag agcatctcag tgctgacgtg cctcaacgaa agactaaagc
1141 gtctggcgcc ccacaagcac tggctggtga tgcgccttga ctgcctgcag cgcctcctgt
1201 atgcggaatc ggcccactgt aacttcaagc tggcccagcg cacctacaac gaactgatag
1261 agctgaagcg ctcaattagc gaccgggtgc cctcggatct ggtggccatg acctacacac
1321 agatatcggc catgttcttt gcccgcaacg agtacaacaa cagccacatg tggagcctgc
1381 atgcgatgcg tcgcctcaag cactcggcca caccgcgcat cgccatcgat gtgctgaggc
1441 aggcggccaa ggcctgtgtg gttaagcgag actttgcccg ggccaatctg ttaatctgtc
1501 aggctgtgcg acgcgctagg gaacactttg gccgcaagca tcaaaagtac ggggacaccc
1561 tcctggatta tggcttcttc ttgcttaatg tggactcggt gttccagtcg gtgaacgtgt
1621 acaaggaggc gctgggcgtc cgtcgcggca tctttggcaa catgaacttc catgtggcca
1681 ttgcccacga ggatctgtcg tatgcgtact acgtgcacga gtacagcacc ggcgacttta
1741 gctgtgccca ggaccatgtg gacaaggctg tgggcattat gaagaagctg gtgcccagca
1801 accatctaat gttggcctcg gccaagcgtg tgaaggccct gctgctggag gagattgcac
1861 tggacaagat ggccgatggc atggacgagg aggatctact gctgcagtcc gaggagctgc
1921 ataactttgc attgctgctc tcgctcgagg tattcggcga ggtgaacgtg cagacggcca
1981 agcactatgg caatctgggg cggctctatc agacaatgaa ccgcttcgag gaggccgaac
2041 gtatgcatca gaaggcaatc aagatcaaaa cggatctgct ggggccgttc gactacgagg
2101 tgggcctctc cattggccac ctggcctcgc tctacaacta ccaaatgaaa aagtaccgcg
2161 aggcggaaaa gctctacctg cgcagcatcg atataagtct acgtctgttc gggctgtcgt
2221 attccggcct ggagtacgac tatctggggc tgtgtcatgt ctacgagacg ctgcatgact
2281 ttgagaagta tctgcagtac gcgcacacgc tggagaactg gcagatgcta cgcggccaga
2341 acatcaccca gaataagttc agctaccccg ccatcgagca ggactactcc atcgatgagg
2401 tcaaaaacaa attctatgac acgtgcgaga gcagcagcag cagcagcccc cagaaaacgg
2461 agggaaagtg agccagggag ggagcatcgt catggagcag cagccgcact ggaggcggca
2521 tctgttgtat ggcatagcga tagggtttat ccacccaatg cactgtgtgc aatgcacccc
2581 cctccccctt gctctataca tacttttttt ttattgtaaa tattatgtcc tgtgtccagc
2641 atggacacac ggatgcagca gattgattag ccttgtattc ttttattttg tttgtcattt
2701 ccatcgaata agtccaaaac gtagttaaaa atatttagta gggcccccat acccaggagt
2761 cagagcagcc acgtagccag caagccagcc agtcagccag cagtaggtaa tcgtatcgta
2821 tcgtatccac tatccactat taagaaccat cgaatacgtc cagttccacg ccccccgtca
2881 gttagcagtc agcagcagta aggcggttac cccacacctc tcccccctgc cctaggctaa
2941 gttccaattg aaagttgaaa gttaggaagc aaaaaaagag tattgctagt ttttgctgat
3001 acgatatctt gatatctttg attttccaca catattttta ttcgtcttat ttggtttttg
3061 tttattgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgcgt gaatatttgt taaaaattca
3121 ctgaaaagca agcagttgat tatcgttctt aaaaagttct taattttgtt tttttgatgg
3181 tttatagtta attacacatg tgattgagat gatacgcaaa aaagatagcg atagctacag
3241 cagctttcca tatagagaaa atattgttta tttttttaag cgcatgcccg ggcaacaact
3301 gtcctcgatt aattagaacg attttgtagc acaatttgaa gtttaatttc gaggctaccc
3361 cgccaacgac caaccgccaa ttcccaaacc ctaaccgcca accgccaccg ccagtaaatg
3421 ctaactagaa gcgagaagaa aacaaaatgt tgtatagacg accggagaag gaataagcga
3481 aaaaaatgta tagttctaag cacaatcatt atccactatg attaatattt cataccgatg
3541 gtagctgatg cgtgcacaca tgtaacaagt atttgcagct aagtagtagg cacacccaga
3601 gccatcctca tccacgtatg cccacgtgga aaacaatgaa gtggacggaa ccggtgggag
3661 cgataatcgt acgccgtatg tggtagcggc aagaagccac cagtgacatc tacaaacatt
3721 tatgagtaaa tttcatacac aattgaaacc caatacacgt atacttatga acattgcagt
3781 tgcaagcaat ttttggcatt acttagcagt agcttaagag caacgtcgta aatctcaaaa
3841 taaatacaaa aaccaatcaa a