Fungal Sequence Alignment



This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

Currently, this page displays other fungal sequences from Cliften et al. and Kellis et al.

ClustalW Protein Alignment and Sequence for YBL041W and Homologs



Symbols:
* = identical
: = strong similarity
. = weak similarity

SGD_Scer_PRE7/YBL041W   1   --------------------MATIASEYSSEASNTPIEHQFNPYGDNGGT   30
MIT_Smik_c134_1472   1   --------------------MTTIASEYSSEASNTPIEHQFNPYGDNGGT   30
MIT_Spar_c366_929   1   --------------------MTTIASEYSSEASNTPIEHQFNPYGDNGGT   30
MIT_Suva_c115_1032   1   --------------------MATIASEYSSEVSNTPIEHQFNPYGDNGGT   30
WashU_Sbay_Contig650.47   1   --------------------MATIASEYSSEVSNTPIEHQFNPYGDNGGT   30
WashU_Scas_Contig495.3   1   --------------------MATIASEYSSEVRNTPIEHQFNPYSDNGGT   30
WashU_Sklu_Contig2418.12   1   MYFPPFKKSTSKIVTSSSNMASTIASEYSNETKNVPIEHQFNPYSDNGGT   50
WashU_Skud_Contig1851.4   1   --------------------MTTIASEYSSEASNTPIEHQFNPYGDNGGT   30
WashU_Smik_Contig1215.1   1   --------------------MTTIASEYSSEASNTPIEHQFNPYGDNGGT   30
Symbols                        :*******.*. *.*********.*****



SGD_Scer_PRE7/YBL041W   31   ILGIAGEDFAVLAGDTRNITDYSINSRYEPKVFDCGDNIVMSANGFAADG   80
MIT_Smik_c134_1472   31   ILGIAGEDFAVLAGDTRNITDYSINSRYEPKVFDCGDNIVMSANGFAADG   80
MIT_Spar_c366_929   31   ILGIAGEDFAVLAGDTRNITDYSINSRYEPKVFDCGDNIVMSANGFAADG   80
MIT_Suva_c115_1032   31   ILGIAGEDFAVLAGDTRNITDYSINSRYEPKVFDCGDKIVMSANGFAADG   80
WashU_Sbay_Contig650.47   31   ILGIAGEDFAVLAGDTRNITDYSINSRYEPKVFDCGDKIVMSANGFAADG   80
WashU_Scas_Contig495.3   31   ILGIAGEDFAVLAGDTRHTTDYSINSRYEPKVFDCGDNIVISANGFAADG   80
WashU_Sklu_Contig2418.12   51   ILGIAGEDFAVLAGDTRHTTDYSINSRYEPKVFDCGDNILISANGFAADG   100
WashU_Skud_Contig1851.4   31   ILGIAGEDFAVLAGDTRNITDYSINSRYEPKVFDCGDNLVMSANGFAADG   80
WashU_Smik_Contig1215.1   31   ILGIAGEDFAVLAGDTRNITDYSINSRYEPKVFDCGDNIVMSANGFAADG   80
Symbols   *****************: ******************::::*********



SGD_Scer_PRE7/YBL041W   81   DALVKRFKNSVKWYHFDHNDKKLSINSAARNIQHLLYGKRFFPYYVHTII   130
MIT_Smik_c134_1472   81   DALVKRFKNSVKWYHFDHNDKKLSLNSAARNIQHLLYGKRFFPYYVHTII   130
MIT_Spar_c366_929   81   DALVKRFKNSVKWYHFDHNDKKLSINSAARNIQHLLYGKRFFPYYVHTII   130
MIT_Suva_c115_1032   81   DALVKRFKNSVKWYHFDHNDKKLSINSAARNIQHLLYGKRFFPYYVHTII   130
WashU_Sbay_Contig650.47   81   DALVKRFKNSVKWYHFDHNDKKLSINSAARNIQHLLYGKRFFPYYVHTII   130
WashU_Scas_Contig495.3   81   EALVKRFKNSLKWYHFDHNDKKLAMSSAARNIQHLLYGKRFFPYYVHTII   130
WashU_Sklu_Contig2418.12   101   EALVKRFQNSLKWYHFNHNDRKLSLSSAARNIQHLLYGKRFFPYYVHTII   150
WashU_Skud_Contig1851.4   81   DALVKRFKNSVKWYHFDHNDKKLSINSAARNIQHLLYGKRFFPYYVHTII   130
WashU_Smik_Contig1215.1   81   DALVKRFKNSVKWYHFDHNDKKLSLNSAARNIQHLLYGKRFFPYYVHTII   130
Symbols   :******:**:*****:***:**::.************************



SGD_Scer_PRE7/YBL041W   131   AGLDEDGKGAVYSFDPVGSYEREQCRAGGAAASLIMPFLDNQVNFKNQYE   180
MIT_Smik_c134_1472   131   AGLDENGKGAVYSFDPVGSYEREQCRAGGAAASLIMPFLDNQVNFKNQYE   180
MIT_Spar_c366_929   131   AGLDENGKGAVYSFDPVGSYEREQCRAGGAAASLIMPFLDNQVNFKNQYE   180
MIT_Suva_c115_1032   131   AGLDEDGKGAVYSFDPVGSYEREQCRAGGAAASLIMPFLDNQVNFKNQYE   180
WashU_Sbay_Contig650.47   131   AGLDEDGKGAVYSFDPVGSYEREQCRAGGAAASLIMPFLDNQVNFKNQYE   180
WashU_Scas_Contig495.3   131   AGLDEEGKGAVYSFDPVGSYEREQCRAGGAAASLIMPFLDNQVNFKNQYE   180
WashU_Sklu_Contig2418.12   151   AGLDEEGKGAVYSFDPVGSYEREQCRAGGAAASLIMPFLDNQVNFKNQYE   200
WashU_Skud_Contig1851.4   131   AGLDENGKGAVYSFDPVGSYEREQCRAGGAAASLIMPFLDNQVNFKNQYE   180
WashU_Smik_Contig1215.1   131   AGLDENGKGAVYSFDPVGSYEREQCRAGGAAASLIMPFLDNQVNFKNQYE   180
Symbols   *****:********************************************



SGD_Scer_PRE7/YBL041W   181   PGTNGKVKKPLKYLSVEEVIKLVRDSFTSATERHIQVGDGLEILIVTKDG   230
MIT_Smik_c134_1472   181   PGTNGKVKKPLKYLSVEEVIKLVRDSFTSATERHIQVGDGLEILIVTKDG   230
MIT_Spar_c366_929   181   PGTNGKVKKPLKYLSVEEVIKLVRDSFTSATERHIQVGDGLEILIVTKDG   230
MIT_Suva_c115_1032   181   PGTDGKVKKPLKYLSVEEVIKLVRDSFTSATERHIQVGDGLEILIVTKDG   230
WashU_Sbay_Contig650.47   181   PGTDGKVKKPLKYLSVEEVIKLVRDSFTSATERHIQVGDGLEILIVTKDG   230
WashU_Scas_Contig495.3   181   PGTDGKVKRPLKYLTIEEAIKLVRDAFTSATERHIHVGDGLEILIVTKDG   230
WashU_Sklu_Contig2418.12   201   PDSDGKKRREPKYLSIEQVIKLVRDAFTSATERHINVGDGLEILIVTKDG   250
WashU_Skud_Contig1851.4   181   PGTDGKVKKPLKYLSVEEVIKLVRDSFTSATERHIQVGDGLEILIVTKDG   230
WashU_Smik_Contig1215.1   181   PGTNGKVKKPLKYLSVEEVIKLVRDSFTSATERHIQVGEWN---------   221
Symbols   *.::** ::  ***::*:.******:*********:**:



SGD_Scer_PRE7/YBL041W   231   VRKEFYELKRD   241
MIT_Smik_c134_1472   231   VRKEFYELKRD   241
MIT_Spar_c366_929   231   VRKEFYELKRD   241
MIT_Suva_c115_1032   231   VRKEFYELKRD   241
WashU_Sbay_Contig650.47   231   VRKEFYELKRD   241
WashU_Scas_Contig495.3   231   VRKEFFELKRD   241
WashU_Sklu_Contig2418.12   251   VRKEYYDLKRD   261
WashU_Skud_Contig1851.4   231   VKKEFYELKRD   241
WashU_Smik_Contig1215.1   -----------
Symbols


GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_PRE7/YBL041W:

SGD_Scer_PRE7/YBL041W  Length: 242  Mon Nov  7 14:42:12 2016  Type: P  Check: 7131  ..

       1  MATIASEYSS EASNTPIEHQ FNPYGDNGGT ILGIAGEDFA VLAGDTRNIT

      51  DYSINSRYEP KVFDCGDNIV MSANGFAADG DALVKRFKNS VKWYHFDHND

     101  KKLSINSAAR NIQHLLYGKR FFPYYVHTII AGLDEDGKGA VYSFDPVGSY

     151  EREQCRAGGA AASLIMPFLD NQVNFKNQYE PGTNGKVKKP LKYLSVEEVI

     201  KLVRDSFTSA TERHIQVGDG LEILIVTKDG VRKEFYELKR D*


Protein Sequence for MIT_Smik_c134_1472:

MIT_Smik_c134_1472  Length: 242  Mon Nov  7 14:42:12 2016  Type: P  Check: 7533  ..

       1  MTTIASEYSS EASNTPIEHQ FNPYGDNGGT ILGIAGEDFA VLAGDTRNIT

      51  DYSINSRYEP KVFDCGDNIV MSANGFAADG DALVKRFKNS VKWYHFDHND

     101  KKLSLNSAAR NIQHLLYGKR FFPYYVHTII AGLDENGKGA VYSFDPVGSY

     151  EREQCRAGGA AASLIMPFLD NQVNFKNQYE PGTNGKVKKP LKYLSVEEVI

     201  KLVRDSFTSA TERHIQVGDG LEILIVTKDG VRKEFYELKR D*


Protein Sequence for MIT_Spar_c366_929:

MIT_Spar_c366_929  Length: 242  Mon Nov  7 14:42:12 2016  Type: P  Check: 7389  ..

       1  MTTIASEYSS EASNTPIEHQ FNPYGDNGGT ILGIAGEDFA VLAGDTRNIT

      51  DYSINSRYEP KVFDCGDNIV MSANGFAADG DALVKRFKNS VKWYHFDHND

     101  KKLSINSAAR NIQHLLYGKR FFPYYVHTII AGLDENGKGA VYSFDPVGSY

     151  EREQCRAGGA AASLIMPFLD NQVNFKNQYE PGTNGKVKKP LKYLSVEEVI

     201  KLVRDSFTSA TERHIQVGDG LEILIVTKDG VRKEFYELKR D*


Protein Sequence for MIT_Suva_c115_1032:

MIT_Suva_c115_1032  Length: 242  Mon Nov  7 14:42:12 2016  Type: P  Check: 7220  ..

       1  MATIASEYSS EVSNTPIEHQ FNPYGDNGGT ILGIAGEDFA VLAGDTRNIT

      51  DYSINSRYEP KVFDCGDKIV MSANGFAADG DALVKRFKNS VKWYHFDHND

     101  KKLSINSAAR NIQHLLYGKR FFPYYVHTII AGLDEDGKGA VYSFDPVGSY

     151  EREQCRAGGA AASLIMPFLD NQVNFKNQYE PGTDGKVKKP LKYLSVEEVI

     201  KLVRDSFTSA TERHIQVGDG LEILIVTKDG VRKEFYELKR D*


Protein Sequence for WashU_Sbay_Contig650.47:

WashU_Sbay_Contig650.47  Length: 242  Mon Nov  7 14:42:12 2016  Type: P  Check: 7220  ..

       1  MATIASEYSS EVSNTPIEHQ FNPYGDNGGT ILGIAGEDFA VLAGDTRNIT

      51  DYSINSRYEP KVFDCGDKIV MSANGFAADG DALVKRFKNS VKWYHFDHND

     101  KKLSINSAAR NIQHLLYGKR FFPYYVHTII AGLDEDGKGA VYSFDPVGSY

     151  EREQCRAGGA AASLIMPFLD NQVNFKNQYE PGTDGKVKKP LKYLSVEEVI

     201  KLVRDSFTSA TERHIQVGDG LEILIVTKDG VRKEFYELKR D*


Protein Sequence for WashU_Scas_Contig495.3:

WashU_Scas_Contig495.3  Length: 242  Mon Nov  7 14:42:12 2016  Type: P  Check: 5082  ..

       1  MATIASEYSS EVRNTPIEHQ FNPYSDNGGT ILGIAGEDFA VLAGDTRHTT

      51  DYSINSRYEP KVFDCGDNIV ISANGFAADG EALVKRFKNS LKWYHFDHND

     101  KKLAMSSAAR NIQHLLYGKR FFPYYVHTII AGLDEEGKGA VYSFDPVGSY

     151  EREQCRAGGA AASLIMPFLD NQVNFKNQYE PGTDGKVKRP LKYLTIEEAI

     201  KLVRDAFTSA TERHIHVGDG LEILIVTKDG VRKEFFELKR D*


Protein Sequence for WashU_Sklu_Contig2418.12:

WashU_Sklu_Contig2418.12  Length: 262  Mon Nov  7 14:42:12 2016  Type: P  Check: 7757  ..

       1  MYFPPFKKST SKIVTSSSNM ASTIASEYSN ETKNVPIEHQ FNPYSDNGGT

      51  ILGIAGEDFA VLAGDTRHTT DYSINSRYEP KVFDCGDNIL ISANGFAADG

     101  EALVKRFQNS LKWYHFNHND RKLSLSSAAR NIQHLLYGKR FFPYYVHTII

     151  AGLDEEGKGA VYSFDPVGSY EREQCRAGGA AASLIMPFLD NQVNFKNQYE

     201  PDSDGKKRRE PKYLSIEQVI KLVRDAFTSA TERHINVGDG LEILIVTKDG

     251  VRKEYYDLKR D*

Protein Sequence for WashU_Skud_Contig1851.4:

WashU_Skud_Contig1851.4  Length: 242  Mon Nov  7 14:42:12 2016  Type: P  Check: 7267  ..

       1  MTTIASEYSS EASNTPIEHQ FNPYGDNGGT ILGIAGEDFA VLAGDTRNIT

      51  DYSINSRYEP KVFDCGDNLV MSANGFAADG DALVKRFKNS VKWYHFDHND

     101  KKLSINSAAR NIQHLLYGKR FFPYYVHTII AGLDENGKGA VYSFDPVGSY

     151  EREQCRAGGA AASLIMPFLD NQVNFKNQYE PGTDGKVKKP LKYLSVEEVI

     201  KLVRDSFTSA TERHIQVGDG LEILIVTKDG VKKEFYELKR D*


Protein Sequence for WashU_Smik_Contig1215.1:

WashU_Smik_Contig1215.1  Length: 222  Mon Nov  7 14:42:12 2016  Type: P  Check: 4164  ..

       1  MTTIASEYSS EASNTPIEHQ FNPYGDNGGT ILGIAGEDFA VLAGDTRNIT

      51  DYSINSRYEP KVFDCGDNIV MSANGFAADG DALVKRFKNS VKWYHFDHND

     101  KKLSLNSAAR NIQHLLYGKR FFPYYVHTII AGLDENGKGA VYSFDPVGSY

     151  EREQCRAGGA AASLIMPFLD NQVNFKNQYE PGTNGKVKKP LKYLSVEEVI

     201  KLVRDSFTSA TERHIQVGEW N*
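

The GCG listings above interleave position numbers, ten-residue groups and a terminating `*`. The following is a minimal sketch of how one such listing could be flattened into a plain sequence and written as FASTA. The function names and the 60-column wrap width are assumptions made for illustration, not part of this page.

```python
def gcg_body_to_sequence(lines):
    """Join GCG sequence lines into one string.

    Only lines whose first field is a position number are used, so the
    'Name  Length: ...  Check: ...  ..' header and blank lines are skipped.
    The trailing '*' terminator is removed.
    """
    residues = []
    for line in lines:
        fields = line.split()
        if not fields or not fields[0].isdigit():
            continue
        residues.append("".join(fields[1:]))
    return "".join(residues).rstrip("*")


def to_fasta(name, sequence, width=60):
    """Format a sequence as a FASTA record wrapped at `width` columns."""
    wrapped = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + name + "\n" + "\n".join(wrapped)


# The last two sequence lines of the WashU_Smik_Contig1215.1 listing.
example = [
    "WashU_Smik_Contig1215.1  Length: 222  Mon Nov  7 14:42:12 2016  Type: P  Check: 4164  ..",
    "",
    "     151  EREQCRAGGA AASLIMPFLD NQVNFKNQYE PGTNGKVKKP LKYLSVEEVI",
    "",
    "     201  KLVRDSFTSA TERHIQVGEW N*",
]
print(to_fasta("WashU_Smik_Contig1215.1_partial", gcg_body_to_sequence(example)))
```

The Length values in the headers above appear to count the terminating `*` (for example, 242 for a sequence whose alignment rows end at position 241), so a sequence flattened this way is one residue shorter than the stated length.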