PREDICTED: Drosophila obscura protein ELYS homolog (LOC111072421),
LOCUS XM_041591981 6602 bp mRNA linear INV 14-MAY-2021
mRNA.
ACCESSION XM_041591981
VERSION XM_041591981.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq; corrected model; includes ab initio.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
##RefSeq-Attributes-START##
ab initio :: 1% of CDS bases
frameshifts :: corrected 1 indel
##RefSeq-Attributes-END##
PRIMARY REFSEQ_SPAN PRIMARY_IDENTIFIER PRIMARY_SPAN COMP
1-206 JAECWW010000165.1 1660426-1660631 c
207-839 JAECWW010000165.1 1659661-1660293 c
840-1147 JAECWW010000165.1 1659273-1659580 c
1148-1511 JAECWW010000165.1 1658849-1659212 c
1512-1702 JAECWW010000165.1 1658593-1658783 c
1703-2683 JAECWW010000165.1 1657547-1658527 c
2684-4316 JAECWW010000165.1 1655913-1657545 c
4317-6428 JAECWW010000165.1 1653760-1655871 c
6429-6602 JAECWW010000165.1 1653506-1653679 c
FEATURES Location/Qualifiers
source 1..6602
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..6602
/gene="LOC111072421"
/note="The sequence of the model RefSeq transcript was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon;
Derived by automated computational analysis using gene
prediction method: Gnomon. Supporting evidence includes
similarity to: 2 Proteins, and 98% coverage of the
annotated genomic feature by RNAseq alignments"
/db_xref="GeneID:111072421"
CDS 113..6511
/gene="LOC111072421"
/note="The sequence of the model RefSeq protein was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon"
/codon_start=1
/product="LOW QUALITY PROTEIN: protein ELYS homolog"
/protein_id="XP_041447915.1"
/db_xref="GeneID:111072421"
/translation="MDWYKVDVDERRSSKFPERAIPGYCQQSDDRTDYLGGIIYGGQW
GWMTNRYSKDATLLICSLSGGECILRHVFWREEANSGQRCSISCVEELLPGQRDAMIL
LAVCLETWSDKEIRPENCAKAKTQIAILSTYYNQVIRYIDLDGLYCSTLKYLDTQICQ
RTRLRNFDGCLAVGTDAGAVLLCDIKLQNLIENRSRNIRLPQNEIDCGQVVRRHSSEC
SIDNINTWLHECRDYNGHLAVEVDVGKSEVRCLIFVHLITGFAAGLKDGRIMIYDLDS
FHITAVLRPPSDLEVSVERICCVVPPDDPKPCLFILGMYGGATNTTTILHSIHYRHSY
ADDESYGVYFKNYKSAETCIRLLLDGGNCSVLGCTTASTCSFSGDNGTLLAILSWYSH
TEGRNKLVLFDMNQWYKDEMPNCLHLYEKPNFLSGYSLAGQAPGLGIHLNASKILHFI
SLQRYDEHFYPNSLTFDFSLLTAEGCQYYVHEGLQHRFLRNLQSEKASLFLVPEATHK
EIVQLRLLPQFSELNPDATFSKMAIYEEILSVALEHGCISLLKDCARNWIDGSYMCNL
RVPTALSLSTLTNWIVKRAAQIKARCSELCQGIFDYGGYPLDERERKELKVLNGQLGN
LDKLQSYIVSVGKRCLAHSLLTEIEANEQVIRTVYTYQRVLLFFIRWGLLPEGQQQLQ
QQEEEEDDEQQSPPGKQPHSALIQLRRLYADRRPSGRDNLYIGGLLRHISQESGGAAV
DLAYPPDTLQSIMQLMLSRTTDMDHKHEVLLYLLFDLDGVQMAKVKLSDRFKTAFGLQ
TQLVTRVKSLWALDHGDHSVSCEELQFNVCVVFNLPSDAHPIPPHAEMHETALELLVS
RNPYLNWHVELLVEALLSKGAISDAMSVVHRPPGPLSSLTRLKVLMASKNIPEAFDYA
RRNDNDETGRPLLEYFFRQSIEMQQFKALAQLCLRESEEELVLRLLRECKTPVTDSVQ
LILLLQKSKYIEAVSFMDEVAAEREIAAGDSSRDIIFAYRSTMNPVSQTLAGAYFRIR
ENLDVEPTEKVHPEPLSSQLIRDKKKGLRGGVFQSSALSAHWATLYSMDSTAKGQAIP
CHQVPFLRSSLYGLSQLPRRRRTVRPVPHQAVEKRVREQEESELVGCRGGRAGDSHTL
QPSKRRRMLSDEFVAGVCDFVGRKLNPIAEPDQTEVDQHQQANELLRVPQFLRPKKPP
LQQSINSPPVTILKRRSALESVASTPPVATSTTSMVEAKRFRFMQPTPLPCHGLNDSN
AMEVDSSLNAQEKDEEDEEEEEETDEIIVEIEPAVSISDSSQSDEDEFVSPLSTPNVS
VASPRLKASPEQRQQELSPPPMAPPAGPQPRSSLNQGVRSESSSGFGSFASVLTSETA
SHEFVPTVCSSKMGTQSIISTGTASVKISERTTICGEMDDPDSEPDSTAPDWYSSPPV
RLQEQEHKEPQMLDTTLGMSTYDVTAREEPLLLTTKNDDIMQVLEVEEVEEEEPWPLE
EIDEEIQLEEIEEDELMEDYLQEAQEQAQLQARPSSYNRTYMGSSDDPPQERSTSPSL
SLSSEMSNMSSSRTLPEAMQSADAMYSIVIESTGSITTSRSVTHTPTSFLPSDTNVSQ
NSSPRATRGTGGNGSPRALYRANSLETVDDLDTTKGSLEEEEEDDEDDCVIALDGTRV
GGYVARPSQSAPSSSAELFAFKDETHKENVPGDGPSLSVGATATDTSIVILDSSEDEV
EIKAEVEAEAGDRTGDASKLKSRVDSGSGSDSGSETGDSGSGSGPSSPSGSGSGSTSA
SESESSSSSKLSSSGAAAPAVNKLPLPPLPEIAEDVEVESPDEKSPQNVPEERKEKSE
SKDEEQKEQQQQQPQAEDEPQTTAHDDDSKHSLTLAFSDDEEERVAAVAPVAPRVLRS
RSKSRTSPVPVPVPPSPTKPTLRARLRSGDGSSSGGSGTPPVLTPKRRNGAHHKNALE
VIDEQKSLGSGPTVVLTRTRKQRSVEPEGTPTKSARAKRATSTTPLSQATTAEPRQLR
GISEPPVKSESRKQLRTSAAEEKLPEASGLESVPLVRRTSRRLNSGLSDQQPSPATTP
IREIAGRELRPRRTRTSSESVASPATPTPRQVSSEANVPLKKRGRSKPVDDGDGKSKK
"
misc_feature <602..1537
/gene="LOC111072421"
/note="beta-propeller of ELYS nucleoporin; Region:
ELYS-bb; pfam16687"
/db_xref="CDD:465233"
misc_feature 2276..3001
/gene="LOC111072421"
/note="Nuclear pore complex assembly; Region: ELYS;
pfam13934"
/db_xref="CDD:464050"
ORIGIN
1 ggtttttgat tgcattgctg gtcacactgc ttgaaatgta aaaattcgca caacaaaaaa
61 ttgctacata cgtatatata ttttgttaaa ttttccggta aatgatataa aaatggattg
121 gtataaagtg gatgtagatg agagacgcag ctcgaaattt ccagagcgcg ctattcccgg
181 ctactgccag cagagtgacg accgcaccga ttaccttggc ggcatcatct atggcggcca
241 gtggggatgg atgacaaaca gatactcaaa ggatgccact ctgctgatct gttcgctgag
301 cgggggcgag tgcattttgc ggcacgtatt ctggagagaa gaggctaaca gtggccagcg
361 ctgcagcatc agttgtgtgg aggaactcct tcctggccaa agagacgcca tgatattgct
421 cgctgtgtgc ctggagacgt ggagcgataa ggaaatccga cccgagaact gtgccaaggc
481 caagacacag attgccatcc tctcgacata ttacaatcag gtgattcgct acatcgacct
541 ggacggcctc tactgcagca ccctcaaata tctggacacc cagatctgcc aacgtacgcg
601 tcttcgcaac tttgatggct gcctggcggt gggcaccgat gcgggagctg ttctgctctg
661 cgacataaaa ttacaaaatt tgattgaaaa tcgcagtcgg aacatacgac tgccgcagaa
721 cgagatcgat tgtggacagg tggtccgaag gcactcctcc gagtgctcca tagacaacat
781 aaacacttgg ctgcacgaat gtcgcgacta caatggccac ttggcagtgg aggtggatgt
841 gggcaaaagc gaggtgcgtt gcctcatatt cgttcattta atcaccggct ttgccgctgg
901 cctcaaggat ggccgcatta tgatctacga tctggatagt ttccatatca ccgctgtttt
961 gcgtcccccg agtgacttgg aggtgagcgt ggagcgaata tgctgcgttg tgccgccaga
1021 tgatcccaag ccctgcctct tcattctcgg catgtacggt ggggccacca atacgaccac
1081 catactccat tccatccact acagacactc gtacgcggat gatgagagct acggagtcta
1141 ctttaagaac tataagagcg ccgaaacctg cattcgtctg ctgctcgatg gcggcaattg
1201 ctccgtgctc ggctgcacaa cggcctccac atgcagcttt tccggcgaca atggcaccct
1261 gctggccata ctcagctggt attcgcacac ggagggcagg aacaagcttg tgctctttga
1321 catgaatcag tggtacaagg acgagatgcc caactgcctg cacctgtacg agaagcccaa
1381 ctttctgtcc ggctactcac tggctgggca ggcgccaggc ctgggcattc acctgaatgc
1441 cagcaaaatt ctgcatttca tttcgttgca gcgctacgat gagcacttct atccgaactc
1501 cttgaccttc gatttctcac tgctaacagc tgagggttgc cagtactatg tgcacgaggg
1561 gttgcagcat cgctttctgc gcaacttgca gagcgagaag gcctcactct ttctggttcc
1621 ggaggccaca cacaaggaaa ttgttcagct gcgcctcctg ccccagttca gtgagctcaa
1681 tccggatgcc acattctcca agatggccat ctatgaggag atcctctcgg tggccctcga
1741 acacggctgt ataagcttgc tgaaggattg tgcacgaaac tggatagatg gcagctacat
1801 gtgcaacctg cgcgttccca cagccctctc cctgtcgacc cttaccaact ggatagtgaa
1861 gcgtgccgca cagattaagg cacgctgctc ggagctgtgc cagggcatct ttgactatgg
1921 cggctatccg ctggacgagc gcgagcgcaa ggagctcaag gtgctaaacg gccagttggg
1981 caatttggac aagctgcagt cgtacatcgt gagtgtgggc aagcgctgcc tggcgcactc
2041 gctcctcacc gaaatcgagg ccaatgaaca ggtgatccgc acagtctata cctatcagcg
2101 ggtgctcctg ttcttcataa ggtggggcct gctgccagag ggtcagcagc agctacaaca
2161 gcaggaggag gaggaggatg atgagcagca gtcgccgcct ggcaagcagc cgcattcggc
2221 tctgattcag ctgcgacgtc tctacgccga cagacgcccg tcggggcgtg ataatctcta
2281 catcggtggc ctgttgcggc atatttccca ggagagtggc ggcgcagccg tggacctggc
2341 ctatccgcca gacacgctgc agtccatcat gcagttgatg ttgtcccgga ccaccgatat
2401 ggatcacaag catgaggtcc tattgtactt gctcttcgat ctggacggtg tccagatggc
2461 gaaagtgaaa ctcagcgaca gatttaagac cgccttcggt ctgcaaacgc agctcgtgac
2521 gcgtgtgaag tccctgtggg ccctggatca cggcgatcat tcggtaagtt gcgaggaact
2581 acaatttaat gtgtgtgttg tgttcaatct cccatctgat gcccatccca ttcccccaca
2641 cgcagaaatg catgaaactg ctctcgagct ccttgtgtcc agaaatccct acctcaactg
2701 gcatgtggag ctgctggtgg aagctctcct gtccaagggc gccatctcgg atgccatgag
2761 cgtggtgcat cgaccaccgg ggccgttatc ctctctgacg cgcttgaagg tgcttatggc
2821 cagcaagaac atacccgagg ccttcgatta tgcacggcgc aatgacaacg acgaaacggg
2881 ccggcccctg ctggagtact ttttccggca gtccatcgag atgcagcagt tcaaggcact
2941 cgctcagctg tgcctgcgcg aatcggagga ggagctggtc ctgcgccttc tccgggaatg
3001 caagacaccg gtgacggata gcgttcagtt gattctgctg ctgcagaagt caaagtacat
3061 tgaggcggtc tccttcatgg atgaggtggc cgccgagcgg gagatcgccg cgggagattc
3121 cagccgtgac attatctttg cctaccgctc cacgatgaat cccgtgtccc aaacgcttgc
3181 cggcgcctac tttcgcatcc gcgagaatct ggatgtggag ccaaccgaga aggtgcatcc
3241 ggagccctta agcagccaac tgatccggga caagaagaag ggactgaggg gcggcgtctt
3301 ccaaagctct gccctcagcg cccactgggc gacgctctac agcatggact caacggcgaa
3361 aggacaggcg ataccctgcc accaggtacc cttcctgcgc agctcgctgt acggcctgag
3421 ccagctgccc cgtcgtcgac gcacggtgcg tcccgtgccc caccaggcgg tggagaagcg
3481 tgtacgcgag caggaggaat ccgagctggt cggctgccgc ggcggccgcg ctggcgactc
3541 ccacaccctg cagccatcca agcggcgccg tatgctgagc gatgagtttg tggcgggtgt
3601 gtgcgacttc gttgggagaa agttgaatcc catagctgaa ccagaccaga ccgaagtgga
3661 tcagcaccaa caggccaacg agctgctcag ggtgccgcag ttcctgcggc caaagaagcc
3721 gccgctacag caaagcatca attcaccgcc agtgaccata ctgaagcggc gatcggcact
3781 cgagtcagtg gccagcactc caccggtggc cacatccacc acgtccatgg tggaggcaaa
3841 gcggtttcgg ttcatgcagc caacgccact gccatgccat ggccttaacg atagcaatgc
3901 catggaggtg gacagctcat tgaatgccca ggagaaagat gaggaagatg aggaggagga
3961 ggaggagact gacgagataa ttgtggagat cgagccggca gtctccatct cggacagcag
4021 ccagtcggat gaggatgagt ttgtgtcgcc cctgagtaca cccaatgttt cggtggccag
4081 ccccagattg aaggcatctc cagagcaacg acaacaggaa ctgagtcccc cgcccatggc
4141 tccgcctgct ggacctcaac cacgcagctc gcttaaccag ggcgtaagga gtgagagtag
4201 cagcggattc ggtagctttg cctccgtcct cacgtcagag acggcatccc acgagtttgt
4261 gcccaccgtc tgctcctcaa aaatgggaac acagtcgata atttccaccg gcacagcctc
4321 tgttaagatc tctgagcgta ccacgatctg tggggagatg gacgatcccg attccgaacc
4381 agactcaacc gcaccggact ggtactcgtc gccaccagtt cgattgcagg agcaagagca
4441 caaggagccc caaatgctgg acaccacact gggcatgtcc acatacgatg tgacggcacg
4501 ggaggagccg cttctattga ccacgaaaaa tgacgatatt atgcaagtgc tggaggtgga
4561 ggaggtggag gaggaggagc catggccatt ggaggagatc gatgaggaga tacaactgga
4621 ggagatcgaa gaggatgaac tgatggagga ttatctgcag gaggcccagg agcaagccca
4681 gctgcaggcg aggcccagca gctacaatcg aacgtacatg ggcagctccg atgatccgcc
4741 ccaggagcgc agcacctcgc cctccctaag cctgagcagc gaaatgtcga acatgtcctc
4801 gtcccgcacc ctgccggagg ccatgcagag tgccgatgcc atgtactcga ttgtaatcga
4861 gtcgaccggc tccataacca catcgcgttc ggtcactcac acgcccacct ccttcctgcc
4921 cagcgacacg aatgtgtccc agaactcgag tccgcgggca acgcgcggca cgggcggcaa
4981 cggctcaccg cgtgccctgt accgggccaa tagcctggag accgtcgatg atctggatac
5041 caccaagggg tccctggagg aggaggagga ggatgatgag gatgactgtg tgatcgccct
5101 ggatggcact cgggtgggcg gctatgtggc ccgtccctcc caatcggcgc cctccagcag
5161 tgccgagctg ttcgccttca aggatgagac tcacaaggag aacgtccctg gcgacggtcc
5221 ctccctttcc gttggagcta cggctacgga tacatcaatt gtcattttgg acagcagcga
5281 agatgaggtt gaaataaagg ctgaggttga ggctgaggct ggggacagga ctggggatgc
5341 ttcgaagtta aagtccagag tggactctgg ttcgggatcc gattcaggat cagagacggg
5401 agattccggt tccggttcgg gtccgtcatc cccatcggga tctggttctg gatctacatc
5461 cgcatccgag tcagaatcca gttctagttc caaattgtca tccagtggcg ctgcagctcc
5521 agcggtaaat aagttgcctt tgcccccgct tcccgagatt gcggaggatg tggaggtcga
5581 gtcgccggat gaaaaatccc cgcagaacgt tcccgaagag aggaaggaga aatctgaatc
5641 gaaagacgag gagcagaagg aacagcagca gcagcagccg caggccgaag atgagccaca
5701 gacgacggcg catgatgacg actccaaaca cagtctcacg ttggccttct ccgacgatga
5761 ggaggagcgt gttgctgcag tcgccccagt tgcaccacgc gtccttcgct cgcgttccaa
5821 atctaggacc agtccagtcc cagtccctgt tccgccgtcg ccaacgaagc ctactctaag
5881 ggcgcgcctg cgcagcggtg acggttcgtc cagtggcggc agtggcacgc cacctgtgct
5941 gacgccaaag cgccgcaacg gagcgcacca caagaatgcg ctggaggtga tcgacgagca
6001 gaaatctctg ggctctggtc caacggtggt gctcacccgt acgcgcaaac aacgaagcgt
6061 ggagccggag ggcactccaa caaaatcagc gcgcgccaag cgagccacct cgacgactcc
6121 gctgtcgcag gcaacaaccg ctgagccccg ccagttgcgt ggcattagcg agccacccgt
6181 gaagtccgaa agcagaaagc agctcaggac gagcgcagcc gaggagaagt tgcccgaagc
6241 gtccgggctt gaatcggtgc ctctggtgcg acgcaccagc agacgactca actctgggct
6301 gagcgatcag cagccgtcgc ctgcgaccac ccccatacgg gaaatagctg ggcgcgagct
6361 tcgtcctcgt cgcactcgaa ccagttccga atctgttgcg tcgcccgcca caccaacgcc
6421 gagacaagtg agcagcgaag ccaatgtgcc cctcaagaaa cgcggaagat caaagccagt
6481 ggatgatggc gatggcaaat ccaagaagtg aatggaacac tagcattgca gcagacacta
6541 gaagactctt gtgttattgt atggccttac ctcacaccaa ctcccccatc catccccccc
6601 ta