PREDICTED: Drosophila obscura integrator complex subunit 6
LOCUS XM_041591908 5247 bp mRNA linear INV 14-MAY-2021
(LOC111070651), mRNA.
ACCESSION XM_041591908
VERSION XM_041591908.1
DBLINK BioProject: PRJNA728747
KEYWORDS RefSeq; corrected model.
SOURCE Drosophila obscura
ORGANISM Drosophila obscura
Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT MODEL REFSEQ: This record is predicted by automated computational
analysis. This record is derived from a genomic sequence
(NW_024542752.1) annotated using gene prediction method: Gnomon.
Also see:
Documentation of NCBI's Annotation Process
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Status :: Full annotation
Annotation Name :: Drosophila obscura Annotation
Release 101
Annotation Version :: 101
Annotation Pipeline :: NCBI eukaryotic genome annotation
pipeline
Annotation Software Version :: 8.6
Annotation Method :: Best-placed RefSeq; Gnomon
Features Annotated :: Gene; mRNA; CDS; ncRNA
##Genome-Annotation-Data-END##
##RefSeq-Attributes-START##
frameshifts :: corrected 1 indel
##RefSeq-Attributes-END##
PRIMARY REFSEQ_SPAN PRIMARY_IDENTIFIER PRIMARY_SPAN COMP
1-106 JAECWW010000165.1 394439-394544
107-835 JAECWW010000165.1 394622-395350
836-5247 JAECWW010000165.1 395352-399763
FEATURES Location/Qualifiers
source 1..5247
/organism="Drosophila obscura"
/mol_type="mRNA"
/isolate="BZ-5 IFL"
/db_xref="taxon:7282"
/chromosome="Unknown"
/sex="male"
/tissue_type="whole fly"
/dev_stage="Adult fly"
/geo_loc_name="Serbia: Babin Zub"
/collection_date="2017"
gene 1..5247
/gene="LOC111070651"
/note="The sequence of the model RefSeq transcript was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon;
Derived by automated computational analysis using gene
prediction method: Gnomon. Supporting evidence includes
similarity to: 1 Protein, and 99% coverage of the
annotated genomic feature by RNAseq alignments, including
17 samples with support for all annotated introns"
/db_xref="GeneID:111070651"
CDS 176..4369
/gene="LOC111070651"
/note="The sequence of the model RefSeq protein was
modified relative to its source genomic sequence to
represent the inferred CDS: deleted 1 base in 1 codon"
/codon_start=1
/product="LOW QUALITY PROTEIN: integrator complex subunit
6"
/protein_id="XP_041447842.1"
/db_xref="GeneID:111070651"
/translation="MTIILFLVDTSSSMCQKAYVNGVQKTYLDIAKGAVETFLKYRQR
TQDCLGDRYMLLTFEEPPANVKAGWKENHATFMNELKNLQSHGLTSMGESLRNAFDLL
NLNRMQSGIDTYGQGRCPFYLEPSVIIVITDGGRYSYRNGVHQEIILPLSNQIPGTKF
TKEPFRWDQRLFSLVLRMPGNKIDERVDGKVPHDDSPIERMCEVTGGRSYRVRSHYVL
NQCIESLVQKVQPGVVLQFEPLLPKDSASGSSSSGTNAGAGGGGGGSATQQSSSSSSS
SSLSEPTPDIVFQPIKKMIYVQKHITQKTFPIGYWPLPEPYWPDSKAITLPPRDAHPK
LKIMTPAVDEPQLVRSFPVDKYEIEGCPLTLQILNKREMNKCWQVIVTNGMHGFELPF
GYLKASPTFSQVHLYVLAYNYPALLPLLHDLIHKYNMSPPNDLMYKFNAYVRSIPPYY
CPFLRKALVNINVPYQLLQFLLPENVDNYLSPTIANQLKHIKNTAKQDQENLCMKVYK
QLKQAKPPYRQVETAKLCTGMALRRDLVRHPLLRDTFAKLHAEIEPVENYTIVVPQLT
HQASAKTYRNPFDIPRRDLVEEIARMRETFLRPTLLVAKDSGHCLPISEMGNYQEYLK
NKDNPLREIEPTNVRQHMFGNPYKKDKHMVMVDEADLSDVAPMKSPNGNGPGGSPPGS
SGSGSSGSSSSGTSSSGGPGGPGMGGGPGGMSGLMLGPHSASGLGKKLDGTSRGRKRK
AGPLPRGFEFRRASTDSRSSCSSPRPSSPSDSAPSSPNPAATAACDSSASTSATSTGS
GSGSGLTGSGASSGPVSGGRSSPSCFDEDSNSSFASSSSSTEPSSDSGLSLSHSERLG
GIGFDFSEEETSDQDTALNGHVHNFINGISDEASAADTEAPPGPGPSPVAPPPSNVLP
AAAAPEPIVTATFVPVVGTAAAAAAPQATASATTSGVVVASTVAASLVTPPPLPAVSS
TSNGGLGLSGGTGGSGGDTPCSSASANNSSGVLYVPLNGLTTTHAAYSSNRGGGTTCP
LGSHLSGLGPAGSHSHKTGISLMSLSGVLPMLNGYTHHGGTLSPPSPLAAVPPTPLAT
ATPTSATSSCTGAASGSGSSGSSYFTHNDINDASEISRILQTCHNGTSGSSGSGSGSG
GGGGSGSGGGSSSSTLNGNGSPEPDVGDDGESPLSLGIGNSFDALKRLAGGGAASSPH
FYWGSGLNHFINNNNNSGSGTAAATGNNNSSISTNNHRNHNNNSPMKTDIKSSPSCSS
SPTHNSNSNSNGEVSAAALPILSEEQREAARRHNIDLRLKIFRDIRRPGRDYSQLLDN
LGLVKGDHDMRSDFVEMCIGESRRFRRHRMVGSIQEWWDQWQQSQPAIDQQQQQQLQE
QSQQPQQQQQEAAAAATKS"
misc_feature 185..568
/gene="LOC111070651"
/note="von Willebrand factor type A domain; Region: VWA_2;
pfam13519"
/db_xref="CDD:463909"
misc_feature 4055..4240
/gene="LOC111070651"
/note="INTS6/SAGE1/DDX26B/CT45 C-terminus; Region:
INT_SG_DDX_CT_C; pfam15300"
/db_xref="CDD:464626"
ORIGIN
1 cagcagcagc agcataagca gagagtagaa gaagaagcgt aaccacgctg agaggagaac
61 agcagcagaa gcaagagaag gatagaaaat agagagtgaa attacgagag attcgcgaag
121 aaagatcaaa gagaattaag ttcaagaaag gacaaaatca aaatcaaaat acgccatgac
181 aatcatactc ttcctggtgg acacctcgtc gtccatgtgc cagaaggcgt atgtgaatgg
241 tgtgcaaaaa acttatctgg acattgccaa gggggcggtg gagacgttcc ttaagtatcg
301 gcagcgcaca caggattgcc tcggcgatcg ctacatgctg ctgaccttcg aggagccgcc
361 ggccaatgtg aaggccggct ggaaggagaa ccatgccaca tttatgaacg aactgaagaa
421 cctacagagc cacggcctca cctcgatggg tgaatcgctg cggaacgctt tcgatctgct
481 caacctgaac cgcatgcagt cgggcatcga tacgtacggg cagggcaggt gcccctttta
541 cctcgagccc tcggtgatca ttgtgatcac ggacgggggc cgctactcct accgcaacgg
601 tgtgcatcag gagatcatac tgccgctgag caatcaaata cccggcacaa agtttaccaa
661 ggagcccttc cgatgggacc agcgcctatt ctcgctggtc ctgcgcatgc ccggcaacaa
721 gatcgacgaa cgtgtcgatg gcaaggtgcc gcacgacgac tctcccatcg agaggatgtg
781 cgaggtgacc ggcggccgct cataccgcgt ccgcagtcac tatgtgctca atcaatgcat
841 cgagagcctc gttcagaagg tgcaaccggg cgtggtgctg cagtttgagc cgctactgcc
901 caaggatagt gccagcggca gcagcagcag tgggacgaac gccggcgcag gtggtggcgg
961 cggcggctcc gcgacacagc aatcgtcctc atcgtcatcc tcctcgtcct tgtcggagcc
1021 cacaccggac attgtcttcc agccaatcaa gaagatgatc tacgtgcaga agcacatcac
1081 acagaagacc tttcccattg gctactggcc gctgccggag ccctattggc cggactcgaa
1141 ggccatcaca ctgccgcccc gcgatgccca tcccaagctg aagatcatga cgccggcggt
1201 ggatgagccg cagctggtgc gcagctttcc ggtggacaag tacgagatcg agggctgccc
1261 cctgacgctg cagatcctca acaagcggga gatgaacaag tgctggcagg tgatcgtcac
1321 caatggtatg cacggctttg agctgccatt cgggtacctg aaggcgtcgc ccaccttctc
1381 gcaggtgcat ctgtatgtgc tcgcctacaa ctatccggcg ctgttgcccc tgctgcacga
1441 cctcatccac aagtacaaca tgagtccgcc caacgatctc atgtacaaat tcaatgccta
1501 tgtgcgctcc ataccgccgt actactgccc cttcctgcgc aaggcgctgg ttaacatcaa
1561 tgtgccgtac cagctgctgc agttcctgct gccggagaat gtcgacaact atctgtcgcc
1621 gaccatagcc aatcagctga agcacatcaa gaacaccgcc aagcaggacc aggagaacct
1681 ctgcatgaag gtctacaagc agctgaagca ggccaagccc ccctatcgcc aggtggagac
1741 ggccaaactc tgcacgggca tggccctgcg tcgagatctt gtgcgacatc cgctgctgcg
1801 ggacacgttc gccaagctgc atgcggagat cgagccggtg gagaattaca cgattgtggt
1861 gccacagctg acgcatcagg catcggccaa gacgtatcgc aatccgtttg atataccccg
1921 gcgggatctg gtggaggaga tagcccgcat gcgtgagacc ttcctacggc ccacgctgct
1981 ggtggccaag gactcgggcc actgtctgcc catctcggag atgggcaact accaggagta
2041 cctcaagaac aaggacaatc cgctacggga gatcgaaccg accaatgtgc ggcagcacat
2101 gttcggcaat ccctacaaga aggacaagca catggtcatg gtggacgagg cggatctcag
2161 cgatgtggcg cccatgaagt cgccgaatgg caacggtccc ggcggctcac cgcccggctc
2221 ttcgggctct ggttcgtccg gctcctcgtc gtccggcact tcatcgagtg gtgggcctgg
2281 gggtcccggc atgggcggcg gtcccggcgg catgtccggc ctaatgctgg gcccccattc
2341 ggccagcgga ctcggcaaga agttggacgg aacctcgcga ggacgcaagc ggaaagctgg
2401 tcccctgccg cgtggctttg agttccgtcg cgcctccacg gactcgcgtt catcctgctc
2461 ctcaccccgc ccctcgtccc cgtccgatag tgcacccagc tcgccgaatc cagcggcaac
2521 ggcggcctgc gactcatccg cgtctacgtc agcgaccagc acgggctctg ggtccggctc
2581 gggcttgacg ggatcgggtg ccagctctgg gccggtatcc ggcggccgtt ccagtcccag
2641 ctgcttcgat gaggacagca acagcagctt cgccagcagc tccagcagca cggaaccatc
2701 gagcgactcg ggcctgtcgc tttcgcactc ggagcgcctg gggggcattg gctttgactt
2761 tagcgaggag gagaccagcg accaggacac tgccctcaac ggccacgtgc acaacttcat
2821 caatgggatc agcgacgagg ccagtgccgc ggacacagag gccccgcccg ggcccggtcc
2881 aagccctgta gccccgccac ccagcaacgt cttgccagcc gctgccgcac ccgaacccat
2941 tgtgacggcc acatttgtgc cggtcgtggg cacagcagca gcagcagctg cgccccaggc
3001 aacggcaagc gctacgacat ccggagttgt agtagcatcg accgtggcag cctctttggt
3061 cacgcctccg ccgctgccgg ccgttagcag caccagcaac ggtggcctgg gactgtccgg
3121 cggtacgggt ggcagtggcg gcgacacgcc ctgctcctct gcgtcggcaa acaactcgag
3181 cggagtgctg tacgtgcccc tgaacggact gacgaccact cacgcggcgt acagcagcaa
3241 ccgaggcggc ggcaccacat gtccgttggg ctcccacctg agcggcctgg gaccggcggg
3301 gagtcacagc cacaagactg gcatctcgct gatgtcgctg tccggcgtac tgcccatgct
3361 caatggctac acgcatcatg gtggcaccct gtcgccaccc tcgccccttg ctgcggtccc
3421 gccaacgcca ctggcgactg caactccaac gtccgccact agcagctgca ccggtgcggc
3481 atcgggatcg ggatcctctg gctccagcta cttcacccac aatgacatca atgatgcctc
3541 tgagatctcc agaatactgc agacatgtca caatgggacc agcggctctt ccggctccgg
3601 ttcgggcagc ggaggaggag gcggcagtgg gagtggcggc ggcagcagca gcagcaccct
3661 gaatggcaac ggcagtccgg agccagatgt aggagacgat ggcgaatccc cgttgtccct
3721 gggtattgga aatagttttg atgccctgaa gcggctggca ggaggcggcg ctgcgagcag
3781 tccccatttt tactggggca gcggactcaa tcattttatc aacaacaaca acaatagcgg
3841 cagcggaaca gcagccgcca ctgggaataa caacagcagc attagcacca acaatcaccg
3901 gaatcacaac aacaatagcc ccatgaagac ggacatcaag tcgagtccaa gctgcagctc
3961 ctcgccgacg cacaacagca acagcaatag caacggggag gtgagtgccg cagcgctgcc
4021 catactgtcg gaggagcagc gggaggcggc gcgtaggcac aacattgatc tgcggctaaa
4081 gatctttcgg gatatacggc ggccgggacg cgactacagc caactgctgg acaacctggg
4141 gctggtgaag ggcgatcacg acatgaggtc cgatttcgtg gagatgtgca ttggggagtc
4201 gcgacggttc cggcggcacc gcatggtcgg cagtattcag gagtggtggg accagtggca
4261 gcagtcgcag ccggccatcg atcagcagca gcagcagcag ctgcaggagc agtcccagca
4321 gccacagcag caacagcagg aggcagccgc cgccgctacc aagagttaac gttggctctt
4381 ttggaactgc tgagcccgag cggtattcgc ctgggcttgg gcgtggtcga gggcgtggcc
4441 atggccaagg gcgaggccgt ggggcgccat cagtctgtga tgtcatcaaa agcaacaaca
4501 acaacaaaaa caacaacaag gagaaccgta taaaccgtat gataacaaat aggagaaaga
4561 aaccgtaaaa accgtacacg taaatcttaa cttaacaata gccgtaatga ataacagtaa
4621 aagtataaat taaacagtat aagagagaca cagcgagaga gagacagaga gagagagagc
4681 aagtacaacg gtaaacgaaa ggcattgaat tctcgtcaat ttccatttat gtaaataatt
4741 gccagaaatc ccctcccccg ttgtcggatc ttaatccgga gatataggcg agagcggaac
4801 cgcaaaaaaa atttgatgtg tatatggggc aatatctcta gttcgtaagg gacactctct
4861 cattctcgag acatatattt ctgagtaggc cagaacgtag aggaagggca aaggcgaacg
4921 gcaaggtata tattctctaa gaaaaaacaa aaaattgtaa ttatatcgta acgattttta
4981 gcctaagccg tagcttgaat actgtacaac ccttacaata ttttgcagtt tttttttttt
5041 attattatga ttattataca catatacaca tatatacata tatatatata gaggatatat
5101 ttaatataat gtaatcccct gatccaattc gtacacctaa ggtccccaaa gcaattacta
5161 taaatagact tataactcga tatagtgttt ataagaggaa gaagagaacg tgtaacattt
5221 gtataagaga gagagagaga gagagag