PREDICTED: Drosophila obscura protein sprint (LOC111066365),


LOCUS       XM_041591916            6094 bp    mRNA    linear   INV 14-MAY-2021
            transcript variant X1, mRNA.
ACCESSION   XM_041591916
VERSION     XM_041591916.1
DBLINK      BioProject: PRJNA728747
KEYWORDS    RefSeq.
SOURCE      Drosophila obscura
  ORGANISM  Drosophila obscura
            Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta;
            Pterygota; Neoptera; Endopterygota; Diptera; Brachycera;
            Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora.
COMMENT     MODEL REFSEQ:  This record is predicted by automated computational
            analysis. This record is derived from a genomic sequence
            (NW_024542752.1) annotated using gene prediction method: Gnomon.
            Also see:
                Documentation of NCBI's Annotation Process
            
            ##Genome-Annotation-Data-START##
            Annotation Provider         :: NCBI
            Annotation Status           :: Full annotation
            Annotation Name             :: Drosophila obscura Annotation
                                           Release 101
            Annotation Version          :: 101
            Annotation Pipeline         :: NCBI eukaryotic genome annotation
                                           pipeline
            Annotation Software Version :: 8.6
            Annotation Method           :: Best-placed RefSeq; Gnomon
            Features Annotated          :: Gene; mRNA; CDS; ncRNA
            ##Genome-Annotation-Data-END##
FEATURES             Location/Qualifiers
     source          1..6094
                     /organism="Drosophila obscura"
                     /mol_type="mRNA"
                     /isolate="BZ-5 IFL"
                     /db_xref="taxon:7282"
                     /chromosome="Unknown"
                     /sex="male"
                     /tissue_type="whole fly"
                     /dev_stage="Adult fly"
                     /geo_loc_name="Serbia: Babin Zub"
                     /collection_date="2017"
     gene            1..6094
                     /gene="LOC111066365"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Gnomon. Supporting evidence
                     includes similarity to: 10 Proteins, and 98% coverage of
                     the annotated genomic feature by RNAseq alignments"
                     /db_xref="GeneID:111066365"
     CDS             283..5778
                     /gene="LOC111066365"
                     /codon_start=1
                     /product="protein sprint isoform X1"
                     /protein_id="XP_041447850.1"
                     /db_xref="GeneID:111066365"
                     /translation="MSHDTATNVGGGAAGAGAGAGGAPSVVECSDASSKVEVIVYTKS
                     DKTLKQNMNAKDETLRKRTSLIIVPQQVDVDDDDDPNQNQMQTHGNATSKQEELLLEN
                     GNRNSIASMDSNQSSASSTRTTSSSSSSSSGSTTSEDAHAAYEDLQNGNAGPPPPRSI
                     SNLSSSSSESTSCSSSVDYSLREQLKNLANRSGAETADRHSLPTNAGGLQLLKAMPSA
                     TGPHCDALLSPQEAPLGRRYAEVSQFKAHGKARTSEPLASVAIGGVVTAAPQSNAAES
                     VNGGNSANTASTNNNNNNNNNNINSSNNNHNGQQSQKSQQQQQQQQQSNLQNGVNVGS
                     MTTSSGTSHGAAAAVAVESSNQNGDSRSHRHSSSLEPTTDIEDVVDMAQGGDVFEDDT
                     EALLPKCSRRSRDELSQSRTSLVSSSEGGILAEGETSSEDERDEETSRDSSDNSPPCD
                     LGLGERLLLTHPMWFLPGIQRSGAVHLLQGKEEGTFIVRGSSQPNTMAVSVRLPLDTG
                     PYIEHYLIQSHDNVLSLESSRFTFGSIPSLIAHYAQCCDELPVQLMLPRLLREANNRK
                     KLSSLALLGQEFWSYASSPAVLGPPTLVGPSRAAPQDTQSCDAKSPLSLTETSGLGTA
                     TFFSDAGPKPPATAPPAPGGNGNGSLFSPTGSGQLMGFFSQAGTPSDTTNSSMSSFTT
                     SGGQHMQLLSPNSVDSVILTMSPVEGQGPAGYLPPGGPPPPISCPLAGTAAAAAGIGQ
                     QMSTFKVLPIEGGGGGGAGAVDQVRPQRPKPPNTLNLKPPAPPLRWSKPHSPDQLANG
                     NGNGNGNGNSNFTVTTTVTFSMESGNGSSGGGGIGGNTGGKFVEVTTPASSNPFNALL
                     NGQASTFQTFAKRLSPEGECKDTLSSQGSSSTNDSRWQLHSSRDSSHRKILSPQTPNS
                     SSSGGGGGGGGKSRKSRAGKESQHYKESDILESPPLQYCASALSDKISDYEDVWSHDP
                     SDRASLLTSFKPTPQDGVLNRRPDLLAETPSTPTAQTLPHLTPCEEETIGDAAAAAAA
                     KDSSQQLLQFSGDAPPRSRAGLLLPNLSLVGSQATAPPPAMSQSLTEPGDDGGEITPT
                     AAQTPGSRSKQGSPFYAEPADALRQAGLTSAATAILRRQHRSQVMHANQRHSEPLKGG
                     LGGTTAALLLPSDLEKLAGSLDELKQKPQQQQQHQQQHQQQPTTKRARNRIDHWQLDS
                     SWEFMAKQDTTGEAYDASVQDWQEKENSLGRDKDGGLGKKRPPLTVHQIIAKRLPDLN
                     LPELVRCSTPPQTMAGNGTATGAAPLQPLVQGQERDGSGSQKSFQSQNVGCRLSSYDN
                     VFCQNSMFGGIDSAQSDDGTIFSEPWDSSQWDSFLPHDDTTLNSDTIHLSKCRPALSE
                     DDTIVEELSSTKDGSNSSCNQDTLKANRNGAGHHNHNHNHNNCSAKANSRPKVATILR
                     NPSMRDREVLCHPRNKMNIQSSGPGDSLRAYTLQLAQDPSSTFARNIENFISCTKESR
                     EAAPQVVMRNMRQFMSGMKNYLVKHGEGKFHAELESARGRLKADEFLNLDAMLETVMH
                     QLVVLPLREHLYGIFVDYYTRSEDIQLLAQNVRYACEREAADFGIRPTVTPPSQTALR
                     LIANLLWRLQEAELPLDKLELFLCVISTVFDATGCPRGQQLGADDFLPVLVYVVAKCG
                     FVGAEIEAEFMWGLLQPTLLNGEPGYYLTALCSAVQVLKTFMASEGESGSGSLDWRSS
                     CLPACSSVLRVIIPDECNGSLQTRTLPVRPHTTTREVCRIIAHKARITNPQDYALFKL
                     VDGEETLLSDAECPQDARLAAKGKHCMLAYKRIDAKIAWPTTQPTGH"
     misc_feature    1639..1932
                     /gene="LOC111066365"
                     /note="Src homology 2 (SH2) domain; Region: SH2; cl15255"
                     /db_xref="CDD:472789"
     misc_feature    order(1693..1695,1747..1749,1819..1821,1825..1827)
                     /gene="LOC111066365"
                     /note="phosphotyrosine binding pocket [polypeptide
                     binding]; other site"
                     /db_xref="CDD:198256"
     misc_feature    order(1822..1824,1903..1905)
                     /gene="LOC111066365"
                     /note="hydrophobic binding pocket [polypeptide binding];
                     other site"
                     /db_xref="CDD:198256"
     misc_feature    5116..5412
                     /gene="LOC111066365"
                     /note="Vacuolar sorting protein 9 (VPS9) domain; Region:
                     VPS9; pfam02204"
                     /db_xref="CDD:460489"
     misc_feature    5482..5736
                     /gene="LOC111066365"
                     /note="Ras-associating (RA) domain of Ras and Rab
                     interactor (Rin) protein family; Region: RA_Rin; cd01776"
                     /db_xref="CDD:340474"
ORIGIN      
        1 ggcttgtctc ttttgcgccg tcgttcggtt cgctcgtcgt cgcggtctct acgctcaatt
       61 ttcgatgcga tttcgcgaat tttgtgtttg caaactatga aaatgccagt gaaaatgtgc
      121 ttaatcatgc acaaggttgc attttaaaca attgatgaaa ccattttcgc ctgcaaaaga
      181 acttatcgtc agcgagaaag caggagagag agagggagag agagtaagcg agaaagagag
      241 aaccaacaca caagaaagaa aagctttttg ctcatcacga gaatgtcgca cgacacggcc
      301 acaaacgtcg gaggaggagc agcaggagca ggagccggag caggaggtgc tccatctgtg
      361 gtggagtgca gcgatgcctc ctccaaggtg gaggtgattg tctacacaaa gtcggacaag
      421 acactcaaac agaatatgaa cgccaaggat gagacgctca ggaagcgcac atccctgatc
      481 attgtgccac agcaggtgga tgtagacgat gatgatgatc caaatcagaa tcagatgcag
      541 acccatggca atgccacatc taaacaggag gaattgctgc tggagaatgg caatcgcaat
      601 tcgattgcca gcatggacag caatcagtca tcggcctcca gtacgaggac taccagtagc
      661 agcagcagca gcagcagcgg tagcaccact tcggaggatg cccatgccgc ctatgaggat
      721 ctacagaatg gcaacgctgg cccaccccca ccacgctcca tctccaactt gagctcctcc
      781 tcctccgagt ccacctcctg ctccagctcc gtggactact ccctgcggga gcagctcaag
      841 aacctagcca atcgcagtgg ggccgagacc gccgatcggc actcgctgcc caccaatgct
      901 ggtggcctac agctgctcaa ggccatgcca tcagccactg gcccgcactg cgatgctctg
      961 ctcagtcccc aggaggcgcc cctgggcaga cggtacgccg aagtgtcaca attcaaggca
     1021 cacggcaagg cgagaacaag tgaaccgcta gcatcggtgg ccattggtgg agttgtgaca
     1081 gccgcaccac agagcaacgc ggcggagtca gtgaacggcg gaaactctgc caacactgcc
     1141 agcaccaaca acaataacaa caacaacaac aataatatca acagcagcaa taacaatcac
     1201 aatggccaac agtcacaaaa atcccaacag cagcagcagc agcagcagca gtcgaacctt
     1261 cagaacggag taaacgttgg ctccatgacc acatcctccg gcacaagcca tggggcggcg
     1321 gcagcggtgg cggtggaatc atcgaatcag aacggagaca gccgcagcca tcggcacagc
     1381 agcagcctgg agccaacaac ggacatcgaa gatgttgttg acatggccca aggcggcgat
     1441 gtcttcgaag atgacacaga ggccctgctg cccaagtgtt cgcgccgatc ccgggatgag
     1501 ctcagtcagt cacgcacatc tctggtgtcg agctccgagg gcggcatcct ggccgagggc
     1561 gaaacctcca gcgaggatga acgcgatgag gagaccagtc gcgattccag cgacaattcg
     1621 ccgccctgcg atctgggtct gggcgagcgg ctgctgctca cccatcccat gtggttcctg
     1681 ccgggcatcc aacgttcggg ggccgttcat ctgctgcagg gcaaggagga agggaccttc
     1741 attgtgcgcg gctccagtca gcccaacacg atggccgtgt cggtgcgcct gcccctggac
     1801 acgggcccct acatcgagca ctatctgatc cagtcgcacg acaatgtgct cagcctggag
     1861 agctcgcggt tcacgttcgg ctcgataccc tcactgatcg cccactatgc ccagtgctgc
     1921 gacgagctgc ccgtccagct gatgctgcca cggctgctgc gcgaggccaa caaccgcaag
     1981 aagctctcct cgctggccct gctcggccag gagttctgga gctatgccag cagtcccgcc
     2041 gtcctcggtc cgcccacgct agtgggcccc tcaagggcag cgccacagga cacccaatct
     2101 tgcgatgcca agtcgccgct gtcccttacg gagacgagcg gtctgggcac ggcaactttc
     2161 ttcagcgatg cagggccaaa gcccccggcc acagcgcccc cggcacccgg tggtaatggc
     2221 aatggcagcc tctttagtcc cacgggcagc ggacagctga tgggcttctt cagccaggcg
     2281 ggcacgccct cggacacgac aaactccagc atgagctcgt tcaccaccag cgggggccag
     2341 cacatgcagc tgctgagccc caattccgtg gactcggtta tactgaccat gtcgcccgtg
     2401 gagggccagg ggccagccgg atatctgccg ccaggagggc cgccgccgcc catctcctgc
     2461 ccactggctg gaacagccgc agccgcagcc ggcatcggcc agcagatgag caccttcaag
     2521 gtgctgccca tcgagggagg aggaggagga ggggcaggtg ctgtcgatca ggtgcgtccg
     2581 caacgtccga agcctccaaa cacattgaat ctcaagccgc cggcgccacc gctgcgctgg
     2641 tccaagcccc actcgccgga tcagctggcg aatggcaatg gcaatggcaa tgggaatggg
     2701 aacagtaatt tcacggtgac caccaccgtc accttctcca tggagagcgg caatgggagc
     2761 agcggaggag gaggcattgg aggcaacact gggggcaagt ttgtcgaggt gacgacaccc
     2821 gcctccagca atcccttcaa cgctttgctc aatggccagg ccagcacctt tcagacgttc
     2881 gccaagcgcc tgtcgcccga gggcgagtgc aaggacacgc tgtcctcgca gggctcctcc
     2941 tccaccaacg acagccgctg gcagctgcac tcgagccgcg actccagcca caggaagatc
     3001 ctctcgccgc agacacccaa cagcagcagc agcggtggcg gcggcggcgg tggtggcaag
     3061 tcgcgcaaat cacgcgccgg caaggagtca cagcactaca aggagtcgga catactcgag
     3121 agcccgccgc tgcagtactg cgccagcgcc ctcagcgaca agatcagcga ctacgaggac
     3181 gtgtggtcgc acgaccccag cgacagggcc agtctgctga ccagcttcaa gccaacgccc
     3241 caggacgggg tgctgaaccg tcgacccgat ctgctggccg agacacccag cacacccaca
     3301 gcccaaacac tgccgcatct gacgccctgc gaggaggaga ccatcggcga cgctgccgcc
     3361 gccgccgccg ccaaggacag cagccaacag ctgttacagt tctctggcga tgcgccgcca
     3421 cgctctcggg ccggtctgct tttgccaaat ttaagcctag tcggcagcca ggcaacagcg
     3481 ccgccgccag ccatgagcca gtcactgacc gagccgggcg acgatggcgg tgagatcaca
     3541 ccgacagcgg cacagacacc gggatcgcgc agcaaacagg gcagcccctt ctacgcggag
     3601 ccagcggatg cactacgaca ggcgggcctg accagtgcgg ccacggccat attgcggcgc
     3661 cagcatcgca gccaggtgat gcacgccaat cagcggcact cggagccgct caagggagga
     3721 ctgggcggca caacggccgc cctgctgctg cccagcgacc tggagaagct ggcaggcagc
     3781 ctggacgagc tgaagcagaa gccacagcag cagcagcagc accagcagca acatcagcag
     3841 cagccaacga caaagcgggc acgcaaccgc atcgatcact ggcagctgga cagcagctgg
     3901 gagtttatgg ccaagcagga tacgacggga gaggcatacg acgcctcggt gcaggactgg
     3961 caggagaagg agaactcgct gggacgcgac aaggatgggg gactgggcaa gaagaggccc
     4021 ccactgacag tccaccagat cattgccaag cgactgcccg acctcaatct gccggaattg
     4081 gttcgctgtt cgacgccacc gcagacgatg gccggcaacg gcaccgccac cggtgccgcg
     4141 ccactgcagc cgctggttca gggccaggag agggatggca gcggcagcca gaagtcattc
     4201 cagagccaga acgtcggctg tcgcctctcc tcatacgaca atgtcttctg ccagaactca
     4261 atgttcggtg gcatcgattc ggcccagtcc gacgatggca ccatattctc agagccctgg
     4321 gactcatccc agtgggactc gttcctaccc catgacgaca ccaccctcaa ttcggacacc
     4381 attcatctgt ccaagtgcag gcccgccctg tccgaggacg ataccattgt cgaggaactc
     4441 agcagcacca aagacggcag caacagcagc tgcaaccagg acacgctcaa ggccaacagg
     4501 aatggtgctg gacaccacaa ccacaatcac aatcacaaca attgcagcgc caaggccaac
     4561 agtcgaccca aagtggccac cattctgcgg aatcccagca tgcgggatcg tgaagtctta
     4621 tgtcaccccc gcaacaagat gaacatccag agcagcggcc ccggcgactc actgcgcgcg
     4681 tacacgctgc agctggccca ggaccccagc tccacctttg cccgcaacat cgagaacttc
     4741 atcagctgca ccaaggagtc gcgcgaggcg gcgccccagg tggtcatgcg caacatgcga
     4801 cagttcatga gcggcatgaa gaactatctg gtgaagcacg gcgagggcaa gttccatgcc
     4861 gaactggagt cggcccgcgg ccgactgaag gccgacgagt tcctcaatct ggacgccatg
     4921 ctggagacgg tgatgcatca gctggtggtg ctgccgctgc gcgagcatct ctatggcatc
     4981 ttcgtggact actatacgcg cagcgaggac attcagctgc tggcccagaa cgtgcgctat
     5041 gcctgcgaac gggaggcggc cgactttggc atccggccca ccgtcacacc gccctcgcag
     5101 acggccctgc gactgattgc gaatttgctg tggcgcctgc aggaggccga gctgccgctg
     5161 gacaagctgg agctcttcct gtgcgtcatc tcgacggtgt tcgatgcgac gggctgtccc
     5221 cggggccagc agctgggcgc cgatgacttc ctgcccgtgc tcgtctatgt ggtggccaag
     5281 tgcggctttg ttggcgccga aatcgaggcg gagttcatgt ggggcctgct ccagccgacg
     5341 ctgctcaatg gggagccggg ctactacctc accgccctct gcagcgccgt tcaggtgctc
     5401 aagacgttca tggcctccga gggcgagagc ggcagtggat ccctcgactg gcgctccagc
     5461 tgcctgcccg cctgctccag cgtgctgcgg gtgatcatac cggatgaatg caacggctcg
     5521 ctgcagaccc ggacacttcc cgtgcggccg cacacgacga cgcgcgaggt gtgccgcatc
     5581 attgcccata aggcacggat caccaatccg caggactatg ccctgttcaa gctggtcgac
     5641 ggtgaggaga cgctgctcag cgatgccgag tgcccgcagg atgcacgcct ggcggccaag
     5701 ggcaagcact gtatgctcgc ctacaagcgg atcgatgcga agatcgcctg gcccaccaca
     5761 cagccgacag gacattgagc agcagcagca acagcaacag aatctgaaac tgaaactaaa
     5821 tctgaatcag catcctttcc tgccctgccc tgaccggccc taccctgccc catcttgctc
     5881 actaaccaag catatttgat taataattta ttattattac attaaaatat acttagtagc
     5941 ctgtaagtaa agctaattaa cgctaattaa ttagaaaaca tctaatgtgt actaaacaac
     6001 agtattgagt taagcgtcgt gtttttttta tcatcatcat caccctatcc ccaggaaccc
     6061 cctccctacc ccgttcctgc accccgttct tttt