Fungal Sequence Alignment Help



This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

The other fungal sequences currently shown come from Cliften et al. and Kellis et al.

ClustalW Protein Alignment and Sequence for YBR122C and Homologs




Symbols:
* = identical
: = strong similarity
. = weak similarity
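
The legend above can be made concrete with a small sketch that derives one symbol per alignment column. The strong and weak residue groups listed here are the ones commonly documented for ClustalW/ClustalX and are an assumption, since this page does not list them; `column_symbol` and `consensus_line` are hypothetical helper names. Columns containing a gap are left blank.

```python
# Residue groups as commonly documented for ClustalW/ClustalX (assumed, not stated on this page).
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK", "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = ["CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
               "SNDEQK", "NDEQHK", "NEQHRK", "FVLIM", "HFY"]


def column_symbol(column):
    """Return '*', ':', '.', or ' ' for one alignment column given as a string."""
    residues = set(column.upper())
    if "-" in residues:
        return " "  # any gap in the column: no symbol
    if len(residues) == 1:
        return "*"  # identical in every sequence
    if any(residues <= set(group) for group in STRONG_GROUPS):
        return ":"  # all residues fall inside one strong group
    if any(residues <= set(group) for group in WEAK_GROUPS):
        return "."  # all residues fall inside one weak group
    return " "


def consensus_line(rows):
    """Build the symbol line for equal-length aligned rows."""
    return "".join(column_symbol("".join(col)) for col in zip(*rows))


# The nine rows of the final alignment block on this page (positions 160 onward).
rows = [
    "SEQQIKSG-KLASKKRDKK-",  # SGD_Scer_MRPL36/YBR122C
    "GEQQIKSG-KLASKKRDKK-",  # MIT_Smik_c92_779
    "SEQQIKSG-KLASKKRDKK-",  # MIT_Spar_c205_1503
    "GEQQIKSG-KLANKKKDKK-",  # MIT_Suva_c7_1886
    "GEQQIKSG-KLANKKKDKK-",  # WashU_Sbay_Contig678.100
    "NSQQIKTG-KLAMKKKPKKK",  # WashU_Scas_Contig591.2
    "NALQVQSGGKLATKKKDKK-",  # WashU_Sklu_Contig2276.6
    "GEQQIKSG-KLASKKRDKK-",  # WashU_Skud_Contig1759.6
    "GEQQIKSG-KLASKKRDKK-",  # WashU_Smik_Contig1520.1
]
print(repr(consensus_line(rows)))
```

Under these assumed groups the sketch yields the same symbols as the Symbols line of that block, once column spacing is taken into account.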

SGD_Scer_MRPL36/YBR122C     1  -------------------------------------MLKSIFAKRFAST   13
MIT_Smik_c92_779            1  -------------------------------------MLKFLLGKRFAST   13
MIT_Spar_c205_1503          1  -------------------------------------MLKFIFAKRFAST   13
MIT_Suva_c7_1886            1  MNYTQLWNEKGIIISKVWKASKTSRFQRKITPNFPVSMLKFIFGKRFAST   50
WashU_Sbay_Contig678.100    1  -------------------------------------MLKFIFGKRFAST   13
WashU_Scas_Contig591.2      1  ----------------------------------MLNFMRSLVTKRCAST   16
WashU_Sklu_Contig2276.6     1  -------------------------------------MFKSLITKRFASG   13
WashU_Skud_Contig1759.6     1  -------------------------------------MLKFIFGKRFAST   13
WashU_Smik_Contig1520.1     1  -------------------------------------MLKFLLGKRFAST   13
Symbols                                                             ::: :. ** **



SGD_Scer_MRPL36/YBR122C    14  GSYPGSTRITLPRRPAKKIQLGKSRPAIYHQFNVKMELSDGSVVIRRSQY   63
MIT_Smik_c92_779           14  GSYPGSTRITLPRRPAKKIRLGKSRPAIYHQFDVKMELSDGSVIIRRSQY   63
MIT_Spar_c205_1503         14  GSYPGSTRITLPRRPAKKIRLGKSRPAIYHQFDVKMELSDGSVIIRKSQY   63
MIT_Suva_c7_1886           51  GSYPGSSRITLPRRPAKKIRLGKSRPAIYHQFDVKMELSDGSVVVRRSQY  100
WashU_Sbay_Contig678.100   14  GSYPGSSRITLPRRPAKKIRLGKSRPAIYHQFDVKMELSDGSVVVRRSQY   63
WashU_Scas_Contig591.2     17  GAYPGSTRISLPKRPMKKIRVGKARPAIYHQFNVKIEMSDGSVVLRRSQF   66
WashU_Sklu_Contig2276.6    14  G-YHGATQISLPKRPLKKIRLGKARPAIYHKFEVQVELSDGSVITRKSQF   62
WashU_Skud_Contig1759.6    14  GSYPGSTRIALPRRPAKKIRLGKSRPAIYHQFDVKMELSDGSVVIRRSQY   63
WashU_Smik_Contig1520.1    14  GSYPGSTRITLPRRPAKKIRLGKSRPAIYHQFDVKMELSDGSVIIRRSQY   63
Symbols                        * * *:::*:**:** ***::**:******:*:*::*:*****: *:**:



SGD_Scer_MRPL36/YBR122C    64  PKGEIRLIQDQRNNPLWNPSRDDLVVVDANSGGSLDRFNKRYSSLFSVDS  113
MIT_Smik_c92_779           64  PKSEIRLIQDQRNNPLWNPSRDDLVIVDANSGGSLDRFKKRYSALYSVDT  113
MIT_Spar_c205_1503         64  PKGEIRLIQDQRNNPLWNPSRDDLVVVDANSGGSLDRFKKRYSSLFSVDS  113
MIT_Suva_c7_1886          101  PKSEIRLIQDQRNNPLWNPSRDDLVIVDANSGGSLDKFKKRYSSMFSVDT  150
WashU_Sbay_Contig678.100   64  PKSEIRLIQDQRNNPLWNPSRDDLVIVDANSGGSLDKFKKRYSSMFSVDT  113
WashU_Scas_Contig591.2     67  PKDEIRLIQDQRNNPLWNPSRTDLVVLDANAGSSLDKFKQRYSSIFTLED  116
WashU_Sklu_Contig2276.6    63  PKGELRLIQDQRNNPLWNPSRDDLVVVDANAGGRMDKFKQKYSSMFSVEE  112
WashU_Skud_Contig1759.6    64  PKSEIRLIQDQRNNPLWNPSRDDLIIVDANSGGSLDRFKKRYSSLFSVDI  113
WashU_Smik_Contig1520.1    64  PKSEIRLIQDQRNNPLWNPSRDDLVIVDANSGGSLDRFKKRYSALYSVDT  113
Symbols                        **.*:**************** **:::***:*. :*:*:::**::::::



SGD_Scer_MRPL36/YBR122C   114  TTPN----SSSETVELSEENKKKTQIKKEEKEDVSEKAFGMDDYLSLLDD  159
MIT_Smik_c92_779          114  TPIN----SGSEESELPKESKEEAQIEKEEGKELSEKAFGMDDYLSLLDD  159
MIT_Spar_c205_1503        114  APIS----SGPEEPEISKESKKEAQVEKEEKKEVSEKTFGMDDYLSLLDD  159
MIT_Suva_c7_1886          151  ASAD----SGSGEPKVPEGSEKEAQVKKEEGKEVVEEAFGMDDYLSLLDD  196
WashU_Sbay_Contig678.100  114  ASAD----SGSGEPKVPEGSEKEAQVKKEEGKEVVEEAFGMDDYLSLLDD  159
WashU_Scas_Contig591.2    117  DSKQQEVPSPSKKEPISTKTTEEDAKIEPESELSDQDVFAADDYLSILDD  166
WashU_Sklu_Contig2276.6   113  PAKN---------APAEAKQDTKEETKTPVNEQEEEDEFGMDDYLTLLNT  153
WashU_Skud_Contig1759.6   114  ASAS----PGSVEPELPEESRKEAQVTKEEKENIPEGAFGMDDYLSLLDD  159
WashU_Smik_Contig1520.1   114  TPIN----SGSEESELPKESKEEAQIEKEEGKELSEKAFGMDDYLSLLDD  159
Symbols                         . .                  :        :   :  *. ****::*:



SGD_Scer_MRPL36/YBR122C   160  SEQQIKSG-KLASKKRDKK-  177
MIT_Smik_c92_779          160  GEQQIKSG-KLASKKRDKK-  177
MIT_Spar_c205_1503        160  SEQQIKSG-KLASKKRDKK-  177
MIT_Suva_c7_1886          197  GEQQIKSG-KLANKKKDKK-  214
WashU_Sbay_Contig678.100  160  GEQQIKSG-KLANKKKDKK-  177
WashU_Scas_Contig591.2    167  NSQQIKTG-KLAMKKKPKKK  185
WashU_Sklu_Contig2276.6   154  NALQVQSGGKLATKKKDKK-  172
WashU_Skud_Contig1759.6   160  GEQQIKSG-KLASKKRDKK-  177
WashU_Smik_Contig1520.1   160  GEQQIKSG-KLASKKRDKK-  177
Symbols                        .  *:::* *** **: **




GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_MRPL36/YBR122C:

SGD_Scer_MRPL36/YBR122C  Length: 178  Mon Nov  7 14:46:35 2016  Type: P  Check: 3283  ..

       1  MLKSIFAKRF ASTGSYPGST RITLPRRPAK KIQLGKSRPA IYHQFNVKME

      51  LSDGSVVIRR SQYPKGEIRL IQDQRNNPLW NPSRDDLVVV DANSGGSLDR

     101  FNKRYSSLFS VDSTTPNSSS ETVELSEENK KKTQIKKEEK EDVSEKAFGM

     151  DDYLSLLDDS EQQIKSGKLA SKKRDKK*

Protein Sequence for MIT_Smik_c92_779:

MIT_Smik_c92_779  Length: 178  Mon Nov  7 14:46:35 2016  Type: P  Check: 290  ..

       1  MLKFLLGKRF ASTGSYPGST RITLPRRPAK KIRLGKSRPA IYHQFDVKME

      51  LSDGSVIIRR SQYPKSEIRL IQDQRNNPLW NPSRDDLVIV DANSGGSLDR

     101  FKKRYSALYS VDTTPINSGS EESELPKESK EEAQIEKEEG KELSEKAFGM

     151  DDYLSLLDDG EQQIKSGKLA SKKRDKK*

Protein Sequence for MIT_Spar_c205_1503:

MIT_Spar_c205_1503  Length: 178  Mon Nov  7 14:46:35 2016  Type: P  Check: 1178  ..

       1  MLKFIFAKRF ASTGSYPGST RITLPRRPAK KIRLGKSRPA IYHQFDVKME

      51  LSDGSVIIRK SQYPKGEIRL IQDQRNNPLW NPSRDDLVVV DANSGGSLDR

     101  FKKRYSSLFS VDSAPISSGP EEPEISKESK KEAQVEKEEK KEVSEKTFGM

     151  DDYLSLLDDS EQQIKSGKLA SKKRDKK*

Protein Sequence for MIT_Suva_c7_1886:

MIT_Suva_c7_1886  Length: 215  Mon Nov  7 14:46:35 2016  Type: P  Check: 4494  ..

       1  MNYTQLWNEK GIIISKVWKA SKTSRFQRKI TPNFPVSMLK FIFGKRFAST

      51  GSYPGSSRIT LPRRPAKKIR LGKSRPAIYH QFDVKMELSD GSVVVRRSQY

     101  PKSEIRLIQD QRNNPLWNPS RDDLVIVDAN SGGSLDKFKK RYSSMFSVDT

     151  ASADSGSGEP KVPEGSEKEA QVKKEEGKEV VEEAFGMDDY LSLLDDGEQQ

     201  IKSGKLANKK KDKK*

Protein Sequence for WashU_Sbay_Contig678.100:

WashU_Sbay_Contig678.100  Length: 178  Mon Nov  7 14:46:35 2016  Type: P  Check: 9909  ..

       1  MLKFIFGKRF ASTGSYPGSS RITLPRRPAK KIRLGKSRPA IYHQFDVKME

      51  LSDGSVVVRR SQYPKSEIRL IQDQRNNPLW NPSRDDLVIV DANSGGSLDK

     101  FKKRYSSMFS VDTASADSGS GEPKVPEGSE KEAQVKKEEG KEVVEEAFGM

     151  DDYLSLLDDG EQQIKSGKLA NKKKDKK*

Protein Sequence for WashU_Scas_Contig591.2:

WashU_Scas_Contig591.2  Length: 186  Mon Nov  7 14:46:35 2016  Type: P  Check: 7319  ..

       1  MLNFMRSLVT KRCASTGAYP GSTRISLPKR PMKKIRVGKA RPAIYHQFNV

      51  KIEMSDGSVV LRRSQFPKDE IRLIQDQRNN PLWNPSRTDL VVLDANAGSS

     101  LDKFKQRYSS IFTLEDDSKQ QEVPSPSKKE PISTKTTEED AKIEPESELS

     151  DQDVFAADDY LSILDDNSQQ IKTGKLAMKK KPKKK*

Protein Sequence for WashU_Sklu_Contig2276.6:

WashU_Sklu_Contig2276.6  Length: 173  Mon Nov  7 14:46:35 2016  Type: P  Check: 7882  ..

       1  MFKSLITKRF ASGGYHGATQ ISLPKRPLKK IRLGKARPAI YHKFEVQVEL

      51  SDGSVITRKS QFPKGELRLI QDQRNNPLWN PSRDDLVVVD ANAGGRMDKF

     101  KQKYSSMFSV EEPAKNAPAE AKQDTKEETK TPVNEQEEED EFGMDDYLTL

     151  LNTNALQVQS GGKLATKKKD KK*

Protein Sequence for WashU_Skud_Contig1759.6:

WashU_Skud_Contig1759.6  Length: 178  Mon Nov  7 14:46:35 2016  Type: P  Check: 9063  ..

       1  MLKFIFGKRF ASTGSYPGST RIALPRRPAK KIRLGKSRPA IYHQFDVKME

      51  LSDGSVVIRR SQYPKSEIRL IQDQRNNPLW NPSRDDLIIV DANSGGSLDR

     101  FKKRYSSLFS VDIASASPGS VEPELPEESR KEAQVTKEEK ENIPEGAFGM

     151  DDYLSLLDDG EQQIKSGKLA SKKRDKK*

Protein Sequence for WashU_Smik_Contig1520.1:

WashU_Smik_Contig1520.1  Length: 178  Mon Nov  7 14:46:35 2016  Type: P  Check: 290  ..

       1  MLKFLLGKRF ASTGSYPGST RITLPRRPAK KIRLGKSRPA IYHQFDVKME

      51  LSDGSVIIRR SQYPKSEIRL IQDQRNNPLW NPSRDDLVIV DANSGGSLDR

     101  FKKRYSALYS VDTTPINSGS EESELPKESK EEAQIEKEEG KELSEKAFGM

     151  DDYLSLLDDG EQQIKSGKLA SKKRDKK*
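
The GCG-style records above can be turned back into plain sequences with a short parser. This is a minimal sketch under stated assumptions: it only handles the layout shown on this page (a header line ending in `..`, followed by numbered lines of space-separated 10-residue groups), and `parse_gcg_records` and `to_fasta` are hypothetical helper names, not part of any library. In the records on this page the declared `Length` counts the trailing `*`, so the sketch keeps that character while parsing and drops it only when writing FASTA.

```python
import re

# Header layout as it appears on this page: "<name>  Length: <n>  <date>  Type: P  Check: <n>  .."
HEADER = re.compile(r"^(\S+)\s+Length:\s*(\d+)\s.*\.\.\s*$")


def parse_gcg_records(text):
    """Yield (name, declared_length, sequence) for each GCG-style record in text."""
    name, length, parts = None, None, []
    for line in text.splitlines():
        match = HEADER.match(line.strip())
        if match:
            if name is not None:
                yield name, length, "".join(parts)
            name, length, parts = match.group(1), int(match.group(2)), []
        elif name is not None:
            fields = line.split()
            # Sequence lines start with a position number; skip anything else.
            if fields and fields[0].isdigit():
                parts.extend(fields[1:])
    if name is not None:
        yield name, length, "".join(parts)


def to_fasta(name, sequence, width=60):
    """Format one record as FASTA, dropping the trailing '*' stop marker."""
    residues = sequence.rstrip("*")
    chunks = [residues[i:i + width] for i in range(0, len(residues), width)]
    return ">" + name + "\n" + "\n".join(chunks)


sample = """\
SGD_Scer_MRPL36/YBR122C  Length: 178  Mon Nov  7 14:46:35 2016  Type: P  Check: 3283  ..

       1  MLKSIFAKRF ASTGSYPGST RITLPRRPAK KIQLGKSRPA IYHQFNVKME

      51  LSDGSVVIRR SQYPKGEIRL IQDQRNNPLW NPSRDDLVVV DANSGGSLDR

     101  FNKRYSSLFS VDSTTPNSSS ETVELSEENK KKTQIKKEEK EDVSEKAFGM

     151  DDYLSLLDDS EQQIKSGKLA SKKRDKK*
"""

for rec_name, rec_length, rec_seq in parse_gcg_records(sample):
    # Sanity check: the declared length should match what was parsed.
    assert len(rec_seq) == rec_length
    print(to_fasta(rec_name, rec_seq))
```

Comparing the parsed length with the declared `Length` is a cheap way to catch truncated or mis-copied records before the sequences are used elsewhere.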