Fungal Sequence Alignment Help



This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

The other fungal sequences currently shown on this page come from Cliften et al. and Kellis et al.

ClustalW Protein Alignment and Sequence for YOL147C and Homologs




Symbols:
* = identical
: = strong similarity
. = weak similarity
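The three symbols can be derived column by column from the aligned residues. The sketch below is a minimal Python illustration, not the code behind this page. The strong and weak residue groups are the ones given in the ClustalW documentation as recalled here, so treat them as an assumption. The names column_symbol and consensus_line are hypothetical, and a column containing a gap is assumed to get a blank.

```python
# Residue groups assumed from the ClustalW documentation (not stated on this page).
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK", "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = ["CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
               "SNDEQK", "NDEQHK", "NEQHRK", "FVLIM", "HFY"]


def column_symbol(column):
    """Return '*', ':', '.' or ' ' for one alignment column (hypothetical helper)."""
    residues = set(column.upper())
    if "-" in residues:
        return " "  # assumption: gapped columns are left blank
    if len(residues) == 1:
        return "*"  # identical in every sequence
    if any(residues <= set(group) for group in STRONG_GROUPS):
        return ":"  # all residues fall inside one strong group
    if any(residues <= set(group) for group in WEAK_GROUPS):
        return "."  # all residues fall inside one weak group
    return " "


def consensus_line(rows):
    """Build the symbol line for equal-length aligned rows."""
    return "".join(column_symbol("".join(col)) for col in zip(*rows))


# First 20 columns of five of the rows shown in the alignment on this page.
rows = [
    "MVCDTLVYHPSVTRFVKFLD",  # SGD_Scer_PEX11/YOL147C
    "MVCDTVVYHPSVTKFVKFLD",  # MIT_Smik_c401_19868
    "MVCDTVVYHPSVTRFVKFLD",  # MIT_Spar_c314_19767
    "MVCDTIVYHPTITRLIKFFD",  # WashU_Scas_Contig716.43
    "MVCDTVVYHPTLTRLVKFLD",  # WashU_Sklu_Contig1876.4
]
print(consensus_line(rows))  # *****:****::*:::**:*
```

Checking set containment against each group keeps the rule easy to audit: a column is marked only when every residue in it belongs to a single group.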

SGD_Scer_PEX11/YOL147C   1   MVCDTLVYHPSVTRFVKFLDGSAGREKVLRLLQYLARFLAVQNSSLLARQ   50
MIT_Smik_c401_19868   1   MVCDTVVYHPSVTKFVKFLDGSAGREKILRLLQYLARFLAVQNSSVLARQ   50
MIT_Spar_c314_19767   1   MVCDTVVYHPSVTRFVKFLDGSAGREKILRLLQYLARFLAVQNSSILARK   50
MIT_Suva_c58_22202   1   MVCDTVVYHPSVTRFVKFLDGSAGREKILRLLQYLARFLAVQNSSALARQ   50
WashU_Sbay_Contig659.9   1   MVCDTVVYHPSVTRFVKFLDGSAGREKILRLLQYLARFLAVQNSSALARQ   50
WashU_Scas_Contig716.43   1   MVCDTIVYHPTITRLIKFFDAAAGREKVLRLLQYLCRFLSIEKG-GPTKQ   49
WashU_Sklu_Contig1876.4   1   MVCDTVVYHPTLTRLVKFLDSTAGREKALRLLQYLCRFLSFQYSSVLAKQ   50
WashU_Smik_Contig1875.2   1   MVCDTVVYHPSVTKFVKFLDGSAGREKILRLLQYLARFLAVQNSSVLARQ   50
Symbols   *****:****::*:::**:*.:***** *******.***:.: .   :::



SGD_Scer_PEX11/YOL147C   51   LQAQFTTVRKFLRFLKPLNHLQAAAKFYDNKLASDNVVRVCNVLKNIFFA   100
MIT_Smik_c401_19868   51   LQAQFTTVRKFLRFLKPLNHLQAAAKFYDNKLASDNVVRICNVLKNIFFA   100
MIT_Spar_c314_19767   51   LQVQFTTVRKFLRFLKPLNHLQAAAKFYDNKLASDNVVRICNVLKNIFFA   100
MIT_Suva_c58_22202   51   LQTQFTTVRKFLRFLKPLNHLQAAAKFYDNKLASDNVIRVCNILKNFFFA   100
WashU_Sbay_Contig659.9   51   LQTQFTTVRKFLRFLKPLNHLQAAAKFYDNKLASDNVIRVCNILKNFFFA   100
WashU_Scas_Contig716.43   50   LERQFLLIRKVLRFLKPLNYLKLASKVYDNKLAGDAFVRYCNVWKNLAFA   99
WashU_Sklu_Contig1876.4   51   LQGEFTTVRKILRFLKPLNHLQAASKFYDNKISGDAILRWGNIAKNIAYV   100
WashU_Smik_Contig1875.2   51   LQAQFTTVRKFLRFLKPLSHLQAAAKFYDNKLASDNVVRICNVLKNIFFA   100
Symbols   *: :*  :**.*******.:*: *:*.****::.* .:*  *: **: :.



SGD_Scer_PEX11/YOL147C   101   AYLSLDQVNLLRILKVIPVTVLTGKKIPRWSNWCWLFGLLSGLAMDLRKI   150
MIT_Smik_c401_19868   101   AYLSLDQINLLRILKVLPVTILTGKKIPRWSNWCWLFGLLSGLAMDLRKI   150
MIT_Spar_c314_19767   101   AYLSLDQVNLLRILKVIPVTILTGKKIPRWSNWCWLFGLLSGLAMDLRKI   150
MIT_Suva_c58_22202   101   AYLSLDQVNLLRILKVIPVTILTSKKVPRWSNWCWLFGLLSGLVMDLRKI   150
WashU_Sbay_Contig659.9   101   AYLSLDQVNLLRILKVIPVTILTSKKVPRWSNWCWLFGLLSGLVMDLRKI   150
WashU_Scas_Contig716.43   100   LYLALDQINLLRMLKVISSTSLTGKMIPKWTNQSWLLSLFLGILMNGRKI   149
WashU_Sklu_Contig1876.4   101   GYLSLDQVNLLRILRLVPVTKTTGTKVPRWTNWCWLAGLISGLVLDLRKI   150
WashU_Smik_Contig1875.2   101   AYLSLDQINLLRILKVLPVTILTGKKIPRWSNWCWLFGLLSGLAMDLRKI   150
Symbols    **:***:****:*:::. *  *.. :*:*:* .** .*: *: :: ***



SGD_Scer_PEX11/YOL147C   151   QTSHAQIAAFVKAKSQSQGD--EHEDHKKVLGKAYQDRYTALRRLFWDAA   198
MIT_Smik_c401_19868   151   QTSHVQITAFTNAKSQNQGG--EQEDHKKVLGKAYQDRYSALRRLFWDAA   198
MIT_Spar_c314_19767   151   QTSHAQISAFVKAKSQSQGD--EHEDHKKVLGKAYQDRYSALRRLFWDAA   198
MIT_Suva_c58_22202   151   QTSHSQIAAFVSAKSQGQGG--EKEDHKKVLGKAYQERYAALRRLFWDAA   198
WashU_Sbay_Contig659.9   151   QTSHSQIAAFVSAKSQGQGG--EKEDHKKVLGKAYQERYAALRRLFWDAA   198
WashU_Scas_Contig716.43   150   QIAQRHIDEIKNAKKEGSGKDTDKDEEKKVLATVTKERYSAIRKLLWDSL   199
WashU_Sklu_Contig1876.4   151   QLSQRRITSLVEDND---------ADEKKLLYKSYEERFQALRRLVWDSV   191
WashU_Smik_Contig1875.2   151   QTSHVQITAFTNAKSQNQGG--EQEDHKKVLGKAYQDRYCALRRLFWDAA   198
Symbols   * :: :*  : . :.          :.**:* .  ::*: *:*:*.**:



SGD_Scer_PEX11/YOL147C   199   DSFIVLNNLGYLSSNEEYVALSGVVTSILGMQDMWKAT-   236
MIT_Smik_c401_19868   199   DSFIVLNNLGYLTSSEEYVALSGVVTSILGMQDMWKMT-   236
MIT_Spar_c314_19767   199   DSFIVLNNLGYLSSNEEYVALSGVVTSILGMQDMWKVT-   236
MIT_Suva_c58_22202   199   DSFIVLNNLGYLSSNEEYVALSGVITSVFGMQDMWKATS   237
WashU_Sbay_Contig659.9   199   DSFIVLNNLGYLSSNEEYVALSGVITSVFGMQDMWKATS   237
WashU_Scas_Contig716.43   200   DSLIVMNNLSYLKLDDGYTGLIGITTSLLGMQDLWKATE   238
WashU_Sklu_Contig1876.4   192   DTFIVLNNLKFLNSQDGSVALAGVATSLFGLQDLWKGAM   230
WashU_Smik_Contig1875.2   199   DSFIVLNNLGYLTSSEEYVALSGVVTSILGMQDMWKMT-   236
Symbols   *::**:*** :*. .:  ..* *: **::*:**:** :
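Each sequence row in the alignment carries a name, a start position, the aligned residues, and an end position. The following Python sketch, offered only as an illustration, joins the rows of each sequence across blocks and reports percent identity for a pair. The function names are hypothetical, and the choice to skip columns where either sequence has a gap is an assumption, not something this page defines.

```python
def parse_alignment_rows(lines):
    """Concatenate aligned residues per sequence name across alignment blocks."""
    sequences = {}
    for line in lines:
        parts = line.split()
        # Keep only rows shaped like: name, start, residues, end.
        if len(parts) != 4 or not (parts[1].isdigit() and parts[3].isdigit()):
            continue
        name, _start, residues, _end = parts
        sequences[name] = sequences.get(name, "") + residues
    return sequences


def percent_identity(a, b):
    """Percent of identical residues over columns where neither row has a gap."""
    if len(a) != len(b):
        raise ValueError("aligned rows must have the same length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


# Two rows copied from the final alignment block on this page.
rows = [
    "SGD_Scer_PEX11/YOL147C   199   DSFIVLNNLGYLSSNEEYVALSGVVTSILGMQDMWKAT-   236",
    "MIT_Suva_c58_22202   199   DSFIVLNNLGYLSSNEEYVALSGVITSVFGMQDMWKATS   237",
]
aligned = parse_alignment_rows(rows)
identity = percent_identity(
    aligned["SGD_Scer_PEX11/YOL147C"], aligned["MIT_Suva_c58_22202"]
)
print(round(identity, 1))  # 92.1 for this single block, not the whole protein
```

Passing all rows of the alignment instead of one block gives the identity over the full length, because rows with the same name are appended in order.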





GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_PEX11/YOL147C:

SGD_Scer_PEX11/YOL147C  Length: 237  Mon Nov  7 16:34:59 2016  Type: P  Check: 9614  ..

       1  MVCDTLVYHP SVTRFVKFLD GSAGREKVLR LLQYLARFLA VQNSSLLARQ

      51  LQAQFTTVRK FLRFLKPLNH LQAAAKFYDN KLASDNVVRV CNVLKNIFFA

     101  AYLSLDQVNL LRILKVIPVT VLTGKKIPRW SNWCWLFGLL SGLAMDLRKI

     151  QTSHAQIAAF VKAKSQSQGD EHEDHKKVLG KAYQDRYTAL RRLFWDAADS

     201  FIVLNNLGYL SSNEEYVALS GVVTSILGMQ DMWKAT*

Protein Sequence for MIT_Smik_c401_19868:

MIT_Smik_c401_19868  Length: 237  Mon Nov  7 16:34:59 2016  Type: P  Check: 474  ..

       1  MVCDTVVYHP SVTKFVKFLD GSAGREKILR LLQYLARFLA VQNSSVLARQ

      51  LQAQFTTVRK FLRFLKPLNH LQAAAKFYDN KLASDNVVRI CNVLKNIFFA

     101  AYLSLDQINL LRILKVLPVT ILTGKKIPRW SNWCWLFGLL SGLAMDLRKI

     151  QTSHVQITAF TNAKSQNQGG EQEDHKKVLG KAYQDRYSAL RRLFWDAADS

     201  FIVLNNLGYL TSSEEYVALS GVVTSILGMQ DMWKMT*

Protein Sequence for MIT_Spar_c314_19767:

MIT_Spar_c314_19767  Length: 237  Mon Nov  7 16:34:59 2016  Type: P  Check: 387  ..

       1  MVCDTVVYHP SVTRFVKFLD GSAGREKILR LLQYLARFLA VQNSSILARK

      51  LQVQFTTVRK FLRFLKPLNH LQAAAKFYDN KLASDNVVRI CNVLKNIFFA

     101  AYLSLDQVNL LRILKVIPVT ILTGKKIPRW SNWCWLFGLL SGLAMDLRKI

     151  QTSHAQISAF VKAKSQSQGD EHEDHKKVLG KAYQDRYSAL RRLFWDAADS

     201  FIVLNNLGYL SSNEEYVALS GVVTSILGMQ DMWKVT*

Protein Sequence for MIT_Suva_c58_22202:

MIT_Suva_c58_22202  Length: 238  Mon Nov  7 16:34:59 2016  Type: P  Check: 488  ..

       1  MVCDTVVYHP SVTRFVKFLD GSAGREKILR LLQYLARFLA VQNSSALARQ

      51  LQTQFTTVRK FLRFLKPLNH LQAAAKFYDN KLASDNVIRV CNILKNFFFA

     101  AYLSLDQVNL LRILKVIPVT ILTSKKVPRW SNWCWLFGLL SGLVMDLRKI

     151  QTSHSQIAAF VSAKSQGQGG EKEDHKKVLG KAYQERYAAL RRLFWDAADS

     201  FIVLNNLGYL SSNEEYVALS GVITSVFGMQ DMWKATS*

Protein Sequence for WashU_Sbay_Contig659.9:

WashU_Sbay_Contig659.9  Length: 238  Mon Nov  7 16:34:59 2016  Type: P  Check: 488  ..

       1  MVCDTVVYHP SVTRFVKFLD GSAGREKILR LLQYLARFLA VQNSSALARQ

      51  LQTQFTTVRK FLRFLKPLNH LQAAAKFYDN KLASDNVIRV CNILKNFFFA

     101  AYLSLDQVNL LRILKVIPVT ILTSKKVPRW SNWCWLFGLL SGLVMDLRKI

     151  QTSHSQIAAF VSAKSQGQGG EKEDHKKVLG KAYQERYAAL RRLFWDAADS

     201  FIVLNNLGYL SSNEEYVALS GVITSVFGMQ DMWKATS*

Protein Sequence for WashU_Scas_Contig716.43:

WashU_Scas_Contig716.43  Length: 239  Mon Nov  7 16:34:59 2016  Type: P  Check: 7384  ..

       1  MVCDTIVYHP TITRLIKFFD AAAGREKVLR LLQYLCRFLS IEKGGPTKQL

      51  ERQFLLIRKV LRFLKPLNYL KLASKVYDNK LAGDAFVRYC NVWKNLAFAL

     101  YLALDQINLL RMLKVISSTS LTGKMIPKWT NQSWLLSLFL GILMNGRKIQ

     151  IAQRHIDEIK NAKKEGSGKD TDKDEEKKVL ATVTKERYSA IRKLLWDSLD

     201  SLIVMNNLSY LKLDDGYTGL IGITTSLLGM QDLWKATE*

Protein Sequence for WashU_Sklu_Contig1876.4:

WashU_Sklu_Contig1876.4  Length: 231  Mon Nov  7 16:34:59 2016  Type: P  Check: 6778  ..

       1  MVCDTVVYHP TLTRLVKFLD STAGREKALR LLQYLCRFLS FQYSSVLAKQ

      51  LQGEFTTVRK ILRFLKPLNH LQAASKFYDN KISGDAILRW GNIAKNIAYV

     101  GYLSLDQVNL LRILRLVPVT KTTGTKVPRW TNWCWLAGLI SGLVLDLRKI

     151  QLSQRRITSL VEDNDADEKK LLYKSYEERF QALRRLVWDS VDTFIVLNNL

     201  KFLNSQDGSV ALAGVATSLF GLQDLWKGAM *

Protein Sequence for WashU_Smik_Contig1875.2:

WashU_Smik_Contig1875.2  Length: 237  Mon Nov  7 16:34:59 2016  Type: P  Check: 262  ..

       1  MVCDTVVYHP SVTKFVKFLD GSAGREKILR LLQYLARFLA VQNSSVLARQ

      51  LQAQFTTVRK FLRFLKPLSH LQAAAKFYDN KLASDNVVRI CNVLKNIFFA

     101  AYLSLDQINL LRILKVLPVT ILTGKKIPRW SNWCWLFGLL SGLAMDLRKI

     151  QTSHVQITAF TNAKSQNQGG EQEDHKKVLG KAYQDRYCAL RRLFWDAADS

     201  FIVLNNLGYL TSSEEYVALS GVVTSILGMQ DMWKMT*
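The GCG-format records above can be read back into plain sequences with a few lines of Python. This is a rough sketch based only on the layout visible on this page: a header line with Length and Check fields ending in "..", followed by numbered lines of ten-residue groups. The function name parse_gcg_records and the dictionary keys are hypothetical. Length appears to count the trailing "*", so the sketch keeps that character in the sequence.

```python
import re

# Header layout as it appears on this page: name, "Length:", a timestamp,
# "Type:", "Check:", then "..". Other GCG variants are not handled.
HEADER_RE = re.compile(
    r"^(?P<name>\S+)\s+Length:\s*(?P<length>\d+)\b.*\bCheck:\s*(?P<check>\d+)\s+\.\.$"
)


def parse_gcg_records(text):
    """Return a list of dicts with name, length, check, and sequence."""
    records = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        header = HEADER_RE.match(line)
        if header:
            current = {
                "name": header["name"],
                "length": int(header["length"]),
                "check": int(header["check"]),
                "sequence": "",
            }
            records.append(current)
            continue
        parts = line.split()
        # Residue lines start with a position number, then residue groups.
        if current is not None and len(parts) >= 2 and parts[0].isdigit():
            current["sequence"] += "".join(parts[1:])
    return records


SAMPLE = """
Protein Sequence for SGD_Scer_PEX11/YOL147C:

SGD_Scer_PEX11/YOL147C  Length: 237  Mon Nov  7 16:34:59 2016  Type: P  Check: 9614  ..

       1  MVCDTLVYHP SVTRFVKFLD GSAGREKVLR LLQYLARFLA VQNSSLLARQ

      51  LQAQFTTVRK FLRFLKPLNH LQAAAKFYDN KLASDNVVRV CNVLKNIFFA

     101  AYLSLDQVNL LRILKVIPVT VLTGKKIPRW SNWCWLFGLL SGLAMDLRKI

     151  QTSHAQIAAF VKAKSQSQGD EHEDHKKVLG KAYQDRYTAL RRLFWDAADS

     201  FIVLNNLGYL SSNEEYVALS GVVTSILGMQ DMWKAT*
"""

for record in parse_gcg_records(SAMPLE):
    # The stated length should equal the number of characters read.
    print(record["name"], record["length"], len(record["sequence"]))
```

Comparing the stated Length with the number of characters actually read is a cheap way to catch a truncated or mis-copied record.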