Fungal Sequence Alignment Help



This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

Currently, this page displays other fungal sequences from Cliften et al. and Kellis et al.

ClustalW Protein Alignment and Sequence for YNR017W and Homologs


Choose two or more sequences for alignment:
Best Hits & Orthologs

Pick a sequence type:

Select or unselect multiple options for sequence
name by pressing the Control (PC) or Command
(Mac) key while clicking.

Align selected sequences
selected sequences (FASTA format)


Symbols:
* = identical
: = strong similarity
. = weak similarity

SGD_Scer_TIM23/YNR017W   1   MSWLFGDKTPTDDANAAVGGQDTTKPKELSLKQSLGFEPNINNIIS---G   47
MIT_Smik_c642_18742   1   MSWLFGNSTPADDANAAAGGQGTTKPKELSLKQSLGFEPNINNIIS---G   47
MIT_Spar_c258_19394   1   MSWLFGDKTPTNDANAAAGGQDTSKPKELSLKQSLGFEPNINNIIS---G   47
MIT_Suva_c32_21730   1   MSWFSGNNTPAADAAGDAG-----KPKELSLKQSLGFEPNINNIIA---G   42
WashU_Sbay_Contig655.33   1   MSWFSGNNTPAADAAGDAG-----KPKELSLKQSLGFEPNINNIIA---G   42
WashU_Scas_Contig625.18   1   MSWLFGGNN-NKDD-------TITSPDASSGSSTLGFDTSQLTNVTNIIT   42
WashU_Sklu_Contig2020.2   1   MSWLFGGKK-SNAEQQEQQTQGLDPANDSKLKQTLGFDPSQVTNVSNIIS   49
WashU_Skud_Contig1660.2   1   MSWLFGNNTPAADETAAAGGRDTSKPKELSLKQSLGFEPNINNIIS---G   47
WashU_Smik_Contig2483.3   1   MSWLFGNSTPADDANAAAGGQGTTKPKELSLKQSLGFEPNINNIIS---G   47
Symbols






***: *... .. . ..:***:.. . ::



SGD_Scer_TIM23/YNR017W   48   PGGMHVDTARLHPLAGLDKGVEYLDLEEEQLSSLEGSQGLIPSRGWTDDL   97
MIT_Smik_c642_18742   48   PGGMHVDTARLHPLAGLDKGVEYLDLEEEQLSSLEGSQGLIPSRGWTDDL   97
MIT_Spar_c258_19394   48   PGGMHVDSARLHPLAGLDKGVEYLDLEEEQLSSLEGSQGLIPSRGWTDDL   97
MIT_Suva_c32_21730   43   PGGLHVDSARLHPLAGLDKGVEYLDLEEEQLSSLEGSQGLIPSRGWTDDL   92
WashU_Sbay_Contig655.33   43   PGGLHVDSARLHPLAGLDKGVEYLDLEEEQLSSLEGSQGLIPSRGWTDDL   92
WashU_Scas_Contig625.18   43   GPHGGFDPARLHPLAGLDKGVEYLDLEEEQLSTMEGSQGLIPSRGWTDDL   92
WashU_Sklu_Contig2020.2   50   TPG-ALDTSRLHPLAGLERGVEYLDLEEEQLSTMTGSQGLIPSRGWTDDL   98
WashU_Skud_Contig1660.2   48   PGGMHVDTARLHPLAGLDKGVEYLDLEDEQLSSLEGSQGLIPSRGWTDDL   97
WashU_Smik_Contig2483.3   48   PGGMHVDTARLHPLAGLDKGVEYLDLEEEQLSSLEGSQGLIPSRGWTDDL   97
Symbols






.*.:********::********:****:: ***************



SGD_Scer_TIM23/YNR017W   98   CYGTGAVYLLGLGIGGFSGMMQGLQNIPPNSPGKLQLNTVLNHITKRGPF   147
MIT_Smik_c642_18742   98   CYGTGAVYLLGLGIGGFSGMMQGLQNIPPNSPGKLQLNTVLNHITKRGPF   147
MIT_Spar_c258_19394   98   CYGTGAVYLLGLGIGGFSGMMQGLQNIPPNSPGKLQLNTVLNHITKRGPF   147
MIT_Suva_c32_21730   93   CYGTGAVYLLGLGAGGLSGMMQGLKNIPPNSPAKLQLNTVLNHITKRGPF   142
WashU_Sbay_Contig655.33   93   CYGTGAVYLLGLGAGGLSGMMQGLKNIPPNSPAKLQLNTVLNHITKRGPF   142
WashU_Scas_Contig625.18   93   CYGTGAVYLLGLGFGGLSGFFQGIKNIPPNSPGKLQLNTILNSITKRGPF   142
WashU_Sklu_Contig2020.2   99   CYGTGAVYLLGLGTGGAYGFLEGLRNIPPNSPGKLQLNTILNHITRRGPF   148
WashU_Skud_Contig1660.2   98   CYGTGAVYLLGLGVGGVSGMMQGLQNIPANSPGKLQLNTVLNHITKRGPF   147
WashU_Smik_Contig2483.3   98   CYGTGAVYLLGLGIGGFSGMMQGLQNIPPNSPGKLQLNTVLNHITKRGPF   147
Symbols






************* ** *:::*::***.***.******:** **:****



SGD_Scer_TIM23/YNR017W   148   LGNNAGILALSYNIINSTIDALRGKHDTAGSIGAGALTGALFKSSKGLKP   197
MIT_Smik_c642_18742   148   LGNNAGILALSYNIVNSTIDALRGKHDTAGSIGAGALTGALFKSSKGLKP   197
MIT_Spar_c258_19394   148   LGNNAGILALSYNIVNSTIDALRGKHDTAGSIGAGALTGALFKSSKGLKP   197
MIT_Suva_c32_21730   143   LGNNAGILALSYNIVNSTIDAFRGKHDTAGSIAAGALTGAVFKSSKGLKP   192
WashU_Sbay_Contig655.33   143   LGNNAGILALSYNIVNSTIDAFRGKHDTAGSIAAGALTGAVFKSSKGLKP   192
WashU_Scas_Contig625.18   143   MGNNAGILALSYNLINSTIDSFRGKHDTPGAILAGGVTGAIFKSSKGLKP   192
WashU_Sklu_Contig2020.2   149   LGNNAGVLALTYNLINSTIDSVRGKHDAASSVAAGALTGALFKSSKGLKP   198
WashU_Skud_Contig1660.2   148   LGNNAGILALSYNIVNSTIDTLRGKHDAAGSVGAGALTGALFKSSKGLKP   197
WashU_Smik_Contig2483.3   148   LGNNAGILALSYNIVNSTIDALRGKHDTAGSIGAGALTGALFKSSKGLKP   197
Symbols






:*****:***:**::*****:.*****:..:: **.:***:*********



SGD_Scer_TIM23/YNR017W   198   MGYSSAMVAAACAVWCSVKKRLLEK-   222
MIT_Smik_c642_18742   198   MGYSSAMVAAACAVWCGVKKRLLDK-   222
MIT_Spar_c258_19394   198   MGYSSAMVAAACAVWCSVKKRLLEK-   222
MIT_Suva_c32_21730   193   MGYSSAMVAAACAVWCAVKKTLLEK-   217
WashU_Sbay_Contig655.33   193   MGYSSAMVAAACAVWCAVKKTLLEK-   217
WashU_Scas_Contig625.18   193   MAYASALMAAAAGTWGVAKKSVLE--   216
WashU_Sklu_Contig2020.2   199   MGYASGLMAGAAAAWCGFKSLVL---   221
WashU_Skud_Contig1660.2   198   MGYSSVMVAAACAVWCGVKKRLLQKN   223
WashU_Smik_Contig2483.3   198   MGYSSAMVAAACAVWCGVKKRLLDK-   222
Symbols






*.*:* ::*.*...* *. :*



Symbols:
* = identical
: = strong similarity
. = weak similarity


- Download all sequences in alignment, in FASTA format.
GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_TIM23/YNR017W:

SGD_Scer_TIM23/YNR017W  Length: 223  Mon Nov  7 16:30:34 2016  Type: P  Check: 7793  ..

       1  MSWLFGDKTP TDDANAAVGG QDTTKPKELS LKQSLGFEPN INNIISGPGG

      51  MHVDTARLHP LAGLDKGVEY LDLEEEQLSS LEGSQGLIPS RGWTDDLCYG

     101  TGAVYLLGLG IGGFSGMMQG LQNIPPNSPG KLQLNTVLNH ITKRGPFLGN

     151  NAGILALSYN IINSTIDALR GKHDTAGSIG AGALTGALFK SSKGLKPMGY

     201  SSAMVAAACA VWCSVKKRLL EK*

Protein Sequence for MIT_Smik_c642_18742:

MIT_Smik_c642_18742  Length: 223  Mon Nov  7 16:30:34 2016  Type: P  Check: 7464  ..

       1  MSWLFGNSTP ADDANAAAGG QGTTKPKELS LKQSLGFEPN INNIISGPGG

      51  MHVDTARLHP LAGLDKGVEY LDLEEEQLSS LEGSQGLIPS RGWTDDLCYG

     101  TGAVYLLGLG IGGFSGMMQG LQNIPPNSPG KLQLNTVLNH ITKRGPFLGN

     151  NAGILALSYN IVNSTIDALR GKHDTAGSIG AGALTGALFK SSKGLKPMGY

     201  SSAMVAAACA VWCGVKKRLL DK*

Protein Sequence for MIT_Spar_c258_19394:

MIT_Spar_c258_19394  Length: 223  Mon Nov  7 16:30:34 2016  Type: P  Check: 8080  ..

       1  MSWLFGDKTP TNDANAAAGG QDTSKPKELS LKQSLGFEPN INNIISGPGG

      51  MHVDSARLHP LAGLDKGVEY LDLEEEQLSS LEGSQGLIPS RGWTDDLCYG

     101  TGAVYLLGLG IGGFSGMMQG LQNIPPNSPG KLQLNTVLNH ITKRGPFLGN

     151  NAGILALSYN IVNSTIDALR GKHDTAGSIG AGALTGALFK SSKGLKPMGY

     201  SSAMVAAACA VWCSVKKRLL EK*

Protein Sequence for MIT_Suva_c32_21730:

MIT_Suva_c32_21730  Length: 218  Mon Nov  7 16:30:34 2016  Type: P  Check: 6283  ..

       1  MSWFSGNNTP AADAAGDAGK PKELSLKQSL GFEPNINNII AGPGGLHVDS

      51  ARLHPLAGLD KGVEYLDLEE EQLSSLEGSQ GLIPSRGWTD DLCYGTGAVY

     101  LLGLGAGGLS GMMQGLKNIP PNSPAKLQLN TVLNHITKRG PFLGNNAGIL

     151  ALSYNIVNST IDAFRGKHDT AGSIAAGALT GAVFKSSKGL KPMGYSSAMV

     201  AAACAVWCAV KKTLLEK*

Protein Sequence for WashU_Sbay_Contig655.33:

WashU_Sbay_Contig655.33  Length: 218  Mon Nov  7 16:30:34 2016  Type: P  Check: 6283  ..

       1  MSWFSGNNTP AADAAGDAGK PKELSLKQSL GFEPNINNII AGPGGLHVDS

      51  ARLHPLAGLD KGVEYLDLEE EQLSSLEGSQ GLIPSRGWTD DLCYGTGAVY

     101  LLGLGAGGLS GMMQGLKNIP PNSPAKLQLN TVLNHITKRG PFLGNNAGIL

     151  ALSYNIVNST IDAFRGKHDT AGSIAAGALT GAVFKSSKGL KPMGYSSAMV

     201  AAACAVWCAV KKTLLEK*

Protein Sequence for WashU_Scas_Contig625.18:

WashU_Scas_Contig625.18  Length: 217  Mon Nov  7 16:30:34 2016  Type: P  Check: 5922  ..

       1  MSWLFGGNNN KDDTITSPDA SSGSSTLGFD TSQLTNVTNI ITGPHGGFDP

      51  ARLHPLAGLD KGVEYLDLEE EQLSTMEGSQ GLIPSRGWTD DLCYGTGAVY

     101  LLGLGFGGLS GFFQGIKNIP PNSPGKLQLN TILNSITKRG PFMGNNAGIL

     151  ALSYNLINST IDSFRGKHDT PGAILAGGVT GAIFKSSKGL KPMAYASALM

     201  AAAAGTWGVA KKSVLE*

Protein Sequence for WashU_Sklu_Contig2020.2:

WashU_Sklu_Contig2020.2  Length: 222  Mon Nov  7 16:30:34 2016  Type: P  Check: 8644  ..

       1  MSWLFGGKKS NAEQQEQQTQ GLDPANDSKL KQTLGFDPSQ VTNVSNIIST

      51  PGALDTSRLH PLAGLERGVE YLDLEEEQLS TMTGSQGLIP SRGWTDDLCY

     101  GTGAVYLLGL GTGGAYGFLE GLRNIPPNSP GKLQLNTILN HITRRGPFLG

     151  NNAGVLALTY NLINSTIDSV RGKHDAASSV AAGALTGALF KSSKGLKPMG

     201  YASGLMAGAA AAWCGFKSLV L*

Protein Sequence for WashU_Skud_Contig1660.2:

WashU_Skud_Contig1660.2  Length: 224  Mon Nov  7 16:30:34 2016  Type: P  Check: 5355  ..

       1  MSWLFGNNTP AADETAAAGG RDTSKPKELS LKQSLGFEPN INNIISGPGG

      51  MHVDTARLHP LAGLDKGVEY LDLEDEQLSS LEGSQGLIPS RGWTDDLCYG

     101  TGAVYLLGLG VGGVSGMMQG LQNIPANSPG KLQLNTVLNH ITKRGPFLGN

     151  NAGILALSYN IVNSTIDTLR GKHDAAGSVG AGALTGALFK SSKGLKPMGY

     201  SSVMVAAACA VWCGVKKRLL QKN*

Protein Sequence for WashU_Smik_Contig2483.3:

WashU_Smik_Contig2483.3  Length: 223  Mon Nov  7 16:30:34 2016  Type: P  Check: 7464  ..

       1  MSWLFGNSTP ADDANAAAGG QGTTKPKELS LKQSLGFEPN INNIISGPGG

      51  MHVDTARLHP LAGLDKGVEY LDLEEEQLSS LEGSQGLIPS RGWTDDLCYG

     101  TGAVYLLGLG IGGFSGMMQG LQNIPPNSPG KLQLNTVLNH ITKRGPFLGN

     151  NAGILALSYN IVNSTIDALR GKHDTAGSIG AGALTGALFK SSKGLKPMGY

     201  SSAMVAAACA VWCGVKKRLL DK*