Fungal Sequence Alignment Help



This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

Currently, this page displays other fungal sequences from Cliften et al. and Kellis et al.

ClustalW Protein Alignment and Sequence for YEL063C and Homologs


Choose two or more sequences for alignment:
Best Hits & Orthologs

Pick a sequence type:

Select or unselect multiple options for sequence
name by pressing the Control (PC) or Command
(Mac) key while clicking.

Align selected sequences
selected sequences (FASTA format)


Symbols:
* = identical
: = strong similarity
. = weak similarity

SGD_Scer_CAN1/YEL063C   1   MTNSKEDADIEEKHMYNEPVTTLFHDVEASQTHHRRGSIPLKDEKSKELY   50
MIT_Smik_c202_5637   1   MTDPKQNTEIEEKHMYNEPVTTLFHDVEASQTHRRRGSLPLKDEKNKELY   50
MIT_Spar_c356_5976   1   MTNLKREEDIDEKHMYNEPVTTLFHDVEASQTHHRRGSIPLKDEKSKDLY   50
MIT_Suva_c280_5973   1   MTTPKQDSEVEEKHMYTEPAGTFFHDVEASQTHHRRGSLSLKDEKSKELY   50
WashU_Sbay_Contig676.50   1   MTTPKQDSEVEEKHMYTEPAGTFFHDVEASQTHHRRGSLSLKDEKSKELY   50
WashU_Smik_Contig2875.2   1   MTDPKQNTEIEEKHMYNEPVTTLFHDVEASQTHRRRGSLPLKDEKNKELY   50
Symbols






** *.: :::*****.**. *:**********:****:.*****.*:**



SGD_Scer_CAN1/YEL063C   51   PLRSFPTRVNGEDTFSMEDGIGDEDEGEVQNAEVKRELKQRHIGMIALGG   100
MIT_Smik_c202_5637   51   PLHSFSTRVDGEDTFSMGDVIRDEDQGEVRNAEVKRELKQRHIGMIALGG   100
MIT_Spar_c356_5976   51   PLRSFPTRVNGEDTFSMEDGIGDEDEGEVQNAEVKRELKQRHIGMIALGG   100
MIT_Suva_c280_5973   51   PLRSFPNGVNGQDTFSMEEDIQDEDEGEVQNAEVKRELKQRHIGMIALGG   100
WashU_Sbay_Contig676.50   51   PLRSFPNGVNGQDTFSMEEDIQDEDEGEVQNAEVKRELKQRHIGMIALGG   100
WashU_Smik_Contig2875.2   51   PLHSFSTRVDGEDTFSMGDVIRDEDQGEVRNAEVKRELKQRHIGMIALGG   100
Symbols






**:**.. *:*:***** : * ***:***:********************



SGD_Scer_CAN1/YEL063C   101   TIGTGLFIGLSTPLTNAGPVGALISYLFMGSLAYSVTQSLGEMATFIPVT   150
MIT_Smik_c202_5637   101   TIGTGLFIGLSKPLANAGPVGALISYLFMGSLAYSVTQSLGEMATFIPVT   150
MIT_Spar_c356_5976   101   TIGTGLFIGLSTPLTNAGPVGALISYLFMGSLAYSVTQSLGEMATFIPVT   150
MIT_Suva_c280_5973   101   TIGTGLFIGLSTPLTNAGPVGALISYLFMGSLAYSVTQSLGEMATFIPVT   150
WashU_Sbay_Contig676.50   101   TIGTGLFIGLSTPLTNAGPVGALISYLFMGSLAYSVTQSLGEMATFIPVT   150
WashU_Smik_Contig2875.2   101   TIGTGLFIGLSKPLANAGPVGALISYLFMGSLAYSVTQSLGEMATFIPVT   150
Symbols






***********.**:***********************************



SGD_Scer_CAN1/YEL063C   151   SSFTVFSQRFLSPAFGAANGYMYWFSWAITFALELSVVGQVIQFWTYKVP   200
MIT_Smik_c202_5637   151   SSFTVFSQRFLSPAFGAANGYMYWFSWAITFALELSVVGQVIEFWTNKVP   200
MIT_Spar_c356_5976   151   SSFTVFSQRFLSPAFGAANGYMYWFSWAITFALELSVVGQVIQFWTSKVP   200
MIT_Suva_c280_5973   151   SSFTVFSQRFLSPAFGAANGYMYWFSWAVTFALELSVVGQVIQFWTYAVP   200
WashU_Sbay_Contig676.50   151   SSFTVFSQRFLSPAFGAANGYMYWFSWAVTFALELSVVGQVIQFWTYAVP   200
WashU_Smik_Contig2875.2   151   SSFTVFSQRFLSPAFGAANGYMYWFSWAITFALELSVVGQVIEFWTNKVP   200
Symbols






****************************:*************:*** **



SGD_Scer_CAN1/YEL063C   201   LAAWISIFWVIITIMNLFPVKYYGEFEFWVASIKVLAIIGFLIYCFCMVC   250
MIT_Smik_c202_5637   201   LAAWISIFWVIITIMNLFPVKYYGEFEFWVASIKVLAIIGFLIYCFCMVC   250
MIT_Spar_c356_5976   201   LAAWISIFWVLITIMNLFPVKYYGEFEFWVASIKVLAIIGFLIYCFCMVC   250
MIT_Suva_c280_5973   201   LAAWISIFWVIITAMNMFPVKYYGEFEFWVASIKVIAIIGFLIYCFCMVC   250
WashU_Sbay_Contig676.50   201   LAAWISIFWVIITAMNMFPVKYYGEFEFWVASIKVIAIIGFLIYCFCMVC   250
WashU_Smik_Contig2875.2   201   LAAWISIFWVIITIMNLFPVKYYGEFEFWVASIKVLAIIGFLIYCFCMVC   250
Symbols






**********:** **:******************:**************



SGD_Scer_CAN1/YEL063C   251   GAGVTGPVGFRYWRNPGAWGPGIISKDKNEGRFLGWVSSLINAAFTFQGT   300
MIT_Smik_c202_5637   251   GAGVTGPVGFRYWRNPGAWGPGIISKDKNEGRFLGWVSSLINAAFTFQGT   300
MIT_Spar_c356_5976   251   GAGVTGPVGFRYWRNPGAWGPGIISKDKNEGRFLGWVSSLINAAFTFQGT   300
MIT_Suva_c280_5973   251   GAGVTGPVGFRYWRNPGAWGPGIISKNKNEARFLGWVSSLINAAFTFQGT   300
WashU_Sbay_Contig676.50   251   GAGVTGPVGFRYWRNPGAWGPGIISKNKNEARFLGWVSSLINAAFTFQGT   300
WashU_Smik_Contig2875.2   251   GAGVTGPVGFRYWRNPGAWGPGIISKDKNEGRFLGWVSSLINAAFTFQGT   300
Symbols






**************************:***.*******************



SGD_Scer_CAN1/YEL063C   301   ELVGITAGEAANPRKSVPRAIKKVVFRILTFYIGSLLFIGLLVPYNDPKL   350
MIT_Smik_c202_5637   301   ELVGITAGEAANPRKTVPRAIKKVVFRILTFYIGSLLFIGLLVPYNDPKL   350
MIT_Spar_c356_5976   301   ELVGITAGEAANPRKSVPRAIKKVVFRILTFYIGSLLFIGLLVPYNDPKL   350
MIT_Suva_c280_5973   301   ELVGITAGEAANPRKTVPRAIKKVVFRILTFYIGSLLFIGLLVPYNDPKL   350
WashU_Sbay_Contig676.50   301   ELVGITAGEAANPRKTVPRAIKKVVFRILTFYIGSLLFIGLLVPYNDPKL   350
WashU_Smik_Contig2875.2   301   ELVGITAGEAANPRKTVPRAIKKVVFRILTFYIGSLLFIGLLVPYNDPKL   350
Symbols






***************:**********************************



SGD_Scer_CAN1/YEL063C   351   TQSTSYVSTSPFIIAIENSGTKVLPHIFNAVILTTIISAANSNIYVGSRI   400
MIT_Smik_c202_5637   351   TTATSYVSVSPFIIAIENSGTKVLPHIFNAVILTTIISAANSNIYVGSRI   400
MIT_Spar_c356_5976   351   TESTSYVSTSPFIIAIENSGTKVLPHIFNAVILTTIISAANSNIYVGSRI   400
MIT_Suva_c280_5973   351   TSATSYVSTSPFIIAIENSGTKVLPHIFNAVILTTIISAANSNIYVGSRV   400
WashU_Sbay_Contig676.50   351   TSATSYVSTSPFIIAIENSGTKVLPHIFNAVILTTIISAANSNIYVGSRV   400
WashU_Smik_Contig2875.2   351   TTATSYVSVSPFIIAIENSGTKVLPHIFNAVILTTIISAANSNIYVGSRI   400
Symbols






* :*****.****************************************:



SGD_Scer_CAN1/YEL063C   401   LFGLSKNKLAPKFLSRTTKGGVPYIAVFVTAAFGALAYMETSTGGDKVFE   450
MIT_Smik_c202_5637   401   LFGLSKNKLAPKFLSRTTKGGVPYIAVFITAVFGALAYMETSTGGDKVFE   450
MIT_Spar_c356_5976   401   LFGLSKNKLAPKFLSKTTKGGVPYVAVFTTAVFGALAYMETSTGGDKVFE   450
MIT_Suva_c280_5973   401   LFGLSKSKLAPKFFSKTTKGGVPYIAVFATSIFGALAYMETSTGGGKVFA   450
WashU_Sbay_Contig676.50   401   LFGLSKSKLAPKFFSKTTKGGVPYIAVFATSIFGALAYMETSTGGGKVFA   450
WashU_Smik_Contig2875.2   401   LFGLSKNKLAPKFLSRTTKGGVPYIAVFITAVFGALAYMETSTGGDKVFE   450
Symbols






******.******:*:********:*** *: *************.***



SGD_Scer_CAN1/YEL063C   451   WLLNITGVAGFFAWLFISISHIRFMQALKYRGISRDELPFKAKLMPGLAY   500
MIT_Smik_c202_5637   451   WLLNITGVAGFFAWLLISISHIRFMQALKYRGISRDELPFKAKLMPGLAY   500
MIT_Spar_c356_5976   451   WLLNITGVAGFFAWLFISISHIRFMQALKYRGISRDELPFKAKLMPGLAY   500
MIT_Suva_c280_5973   451   WLLNITGVAGFFAWLFISVSHIRFMQALKYRGISRDELPFKAKLMPGLAY   500
WashU_Sbay_Contig676.50   451   WLLNITGVAGFFAWLFISVSHIRFMQALKYRGISRDELPFKAKLMPGLAY   500
WashU_Smik_Contig2875.2   451   WLLNITGVAGFFAWLLISISHIRFMQALKYRGISRDELPFKAKLMPGLAY   500
Symbols






***************:**:*******************************



SGD_Scer_CAN1/YEL063C   501   YAATFMTIIIIIQGFTAFAPKFNGVSFAAAYISIFLFLAVWILFQCIFRC   550
MIT_Smik_c202_5637   501   FAATFMIIIILIQGFTAFAPKFDGIDFLAAYISIFLFLAIWILFEIIFRC   550
MIT_Spar_c356_5976   501   YAATFMIIIIIIQGFTAFAPKFNGVSFAAAYISVFLFLAVWILFEAIFRC   550
MIT_Suva_c280_5973   501   YATFFMVVIILIQGFTSFTPTFNGVNFLAAYISVFLFLAIWILFELIFRC   550
WashU_Sbay_Contig676.50   501   YATFFMVVIILIQGFTSFTPTFNGVNFLAAYISVFLFLAIWILFELIFRC   550
WashU_Smik_Contig2875.2   501   FAATFMIIIILIQGFTAFAPKFDGIDFLAAYISIFLFLAIWILFEIIFRC   550
Symbols






:*: ** :**:*****:*:*.*:*:.* *****:*****:****: ****



SGD_Scer_CAN1/YEL063C   551   RFIWKIGDVDIDSDRRDIEAIVWEDHEPKTFWDKFWNVVA   590
MIT_Smik_c202_5637   551   RFIWKIEDVDIDSDRRDIEAIVWEDHEPKTFWDKFWNVVA   590
MIT_Spar_c356_5976   551   RFIWKIEDVDIDSDRRDIEAIVWEDHEPKTFWDKFWNVVA   590
MIT_Suva_c280_5973   551   RFVWKIEDVDIDSDRRDIEAIVWEEPEPKTLWDKFWNVVG   590
WashU_Sbay_Contig676.50   551   RFVWKIEDVDIDSDRRDIEAIVWEEPEPKTLWDKFWNVVG   590
WashU_Smik_Contig2875.2   551   RFIWKIEDVDIDSDRRDIEAIVWEDHEPKTFWDKFWNVVA   590
Symbols






**:*** *****************: ****:********.



Symbols:
* = identical
: = strong similarity
. = weak similarity


- Download all sequences in alignment, in FASTA format.
GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_CAN1/YEL063C:

SGD_Scer_CAN1/YEL063C  Length: 591  Mon Nov  7 15:13:12 2016  Type: P  Check: 4485  ..

       1  MTNSKEDADI EEKHMYNEPV TTLFHDVEAS QTHHRRGSIP LKDEKSKELY

      51  PLRSFPTRVN GEDTFSMEDG IGDEDEGEVQ NAEVKRELKQ RHIGMIALGG

     101  TIGTGLFIGL STPLTNAGPV GALISYLFMG SLAYSVTQSL GEMATFIPVT

     151  SSFTVFSQRF LSPAFGAANG YMYWFSWAIT FALELSVVGQ VIQFWTYKVP

     201  LAAWISIFWV IITIMNLFPV KYYGEFEFWV ASIKVLAIIG FLIYCFCMVC

     251  GAGVTGPVGF RYWRNPGAWG PGIISKDKNE GRFLGWVSSL INAAFTFQGT

     301  ELVGITAGEA ANPRKSVPRA IKKVVFRILT FYIGSLLFIG LLVPYNDPKL

     351  TQSTSYVSTS PFIIAIENSG TKVLPHIFNA VILTTIISAA NSNIYVGSRI

     401  LFGLSKNKLA PKFLSRTTKG GVPYIAVFVT AAFGALAYME TSTGGDKVFE

     451  WLLNITGVAG FFAWLFISIS HIRFMQALKY RGISRDELPF KAKLMPGLAY

     501  YAATFMTIII IIQGFTAFAP KFNGVSFAAA YISIFLFLAV WILFQCIFRC

     551  RFIWKIGDVD IDSDRRDIEA IVWEDHEPKT FWDKFWNVVA *


Protein Sequence for MIT_Smik_c202_5637:

MIT_Smik_c202_5637  Length: 591  Mon Nov  7 15:13:12 2016  Type: P  Check: 2248  ..

       1  MTDPKQNTEI EEKHMYNEPV TTLFHDVEAS QTHRRRGSLP LKDEKNKELY

      51  PLHSFSTRVD GEDTFSMGDV IRDEDQGEVR NAEVKRELKQ RHIGMIALGG

     101  TIGTGLFIGL SKPLANAGPV GALISYLFMG SLAYSVTQSL GEMATFIPVT

     151  SSFTVFSQRF LSPAFGAANG YMYWFSWAIT FALELSVVGQ VIEFWTNKVP

     201  LAAWISIFWV IITIMNLFPV KYYGEFEFWV ASIKVLAIIG FLIYCFCMVC

     251  GAGVTGPVGF RYWRNPGAWG PGIISKDKNE GRFLGWVSSL INAAFTFQGT

     301  ELVGITAGEA ANPRKTVPRA IKKVVFRILT FYIGSLLFIG LLVPYNDPKL

     351  TTATSYVSVS PFIIAIENSG TKVLPHIFNA VILTTIISAA NSNIYVGSRI

     401  LFGLSKNKLA PKFLSRTTKG GVPYIAVFIT AVFGALAYME TSTGGDKVFE

     451  WLLNITGVAG FFAWLLISIS HIRFMQALKY RGISRDELPF KAKLMPGLAY

     501  FAATFMIIII LIQGFTAFAP KFDGIDFLAA YISIFLFLAI WILFEIIFRC

     551  RFIWKIEDVD IDSDRRDIEA IVWEDHEPKT FWDKFWNVVA *


Protein Sequence for MIT_Spar_c356_5976:

MIT_Spar_c356_5976  Length: 591  Mon Nov  7 15:13:12 2016  Type: P  Check: 4385  ..

       1  MTNLKREEDI DEKHMYNEPV TTLFHDVEAS QTHHRRGSIP LKDEKSKDLY

      51  PLRSFPTRVN GEDTFSMEDG IGDEDEGEVQ NAEVKRELKQ RHIGMIALGG

     101  TIGTGLFIGL STPLTNAGPV GALISYLFMG SLAYSVTQSL GEMATFIPVT

     151  SSFTVFSQRF LSPAFGAANG YMYWFSWAIT FALELSVVGQ VIQFWTSKVP

     201  LAAWISIFWV LITIMNLFPV KYYGEFEFWV ASIKVLAIIG FLIYCFCMVC

     251  GAGVTGPVGF RYWRNPGAWG PGIISKDKNE GRFLGWVSSL INAAFTFQGT

     301  ELVGITAGEA ANPRKSVPRA IKKVVFRILT FYIGSLLFIG LLVPYNDPKL

     351  TESTSYVSTS PFIIAIENSG TKVLPHIFNA VILTTIISAA NSNIYVGSRI

     401  LFGLSKNKLA PKFLSKTTKG GVPYVAVFTT AVFGALAYME TSTGGDKVFE

     451  WLLNITGVAG FFAWLFISIS HIRFMQALKY RGISRDELPF KAKLMPGLAY

     501  YAATFMIIII IIQGFTAFAP KFNGVSFAAA YISVFLFLAV WILFEAIFRC

     551  RFIWKIEDVD IDSDRRDIEA IVWEDHEPKT FWDKFWNVVA *


Protein Sequence for MIT_Suva_c280_5973:

MIT_Suva_c280_5973  Length: 591  Mon Nov  7 15:13:12 2016  Type: P  Check: 5909  ..

       1  MTTPKQDSEV EEKHMYTEPA GTFFHDVEAS QTHHRRGSLS LKDEKSKELY

      51  PLRSFPNGVN GQDTFSMEED IQDEDEGEVQ NAEVKRELKQ RHIGMIALGG

     101  TIGTGLFIGL STPLTNAGPV GALISYLFMG SLAYSVTQSL GEMATFIPVT

     151  SSFTVFSQRF LSPAFGAANG YMYWFSWAVT FALELSVVGQ VIQFWTYAVP

     201  LAAWISIFWV IITAMNMFPV KYYGEFEFWV ASIKVIAIIG FLIYCFCMVC

     251  GAGVTGPVGF RYWRNPGAWG PGIISKNKNE ARFLGWVSSL INAAFTFQGT

     301  ELVGITAGEA ANPRKTVPRA IKKVVFRILT FYIGSLLFIG LLVPYNDPKL

     351  TSATSYVSTS PFIIAIENSG TKVLPHIFNA VILTTIISAA NSNIYVGSRV

     401  LFGLSKSKLA PKFFSKTTKG GVPYIAVFAT SIFGALAYME TSTGGGKVFA

     451  WLLNITGVAG FFAWLFISVS HIRFMQALKY RGISRDELPF KAKLMPGLAY

     501  YATFFMVVII LIQGFTSFTP TFNGVNFLAA YISVFLFLAI WILFELIFRC

     551  RFVWKIEDVD IDSDRRDIEA IVWEEPEPKT LWDKFWNVVG *


Protein Sequence for WashU_Sbay_Contig676.50:

WashU_Sbay_Contig676.50  Length: 591  Mon Nov  7 15:13:12 2016  Type: P  Check: 5909  ..

       1  MTTPKQDSEV EEKHMYTEPA GTFFHDVEAS QTHHRRGSLS LKDEKSKELY

      51  PLRSFPNGVN GQDTFSMEED IQDEDEGEVQ NAEVKRELKQ RHIGMIALGG

     101  TIGTGLFIGL STPLTNAGPV GALISYLFMG SLAYSVTQSL GEMATFIPVT

     151  SSFTVFSQRF LSPAFGAANG YMYWFSWAVT FALELSVVGQ VIQFWTYAVP

     201  LAAWISIFWV IITAMNMFPV KYYGEFEFWV ASIKVIAIIG FLIYCFCMVC

     251  GAGVTGPVGF RYWRNPGAWG PGIISKNKNE ARFLGWVSSL INAAFTFQGT

     301  ELVGITAGEA ANPRKTVPRA IKKVVFRILT FYIGSLLFIG LLVPYNDPKL

     351  TSATSYVSTS PFIIAIENSG TKVLPHIFNA VILTTIISAA NSNIYVGSRV

     401  LFGLSKSKLA PKFFSKTTKG GVPYIAVFAT SIFGALAYME TSTGGGKVFA

     451  WLLNITGVAG FFAWLFISVS HIRFMQALKY RGISRDELPF KAKLMPGLAY

     501  YATFFMVVII LIQGFTSFTP TFNGVNFLAA YISVFLFLAI WILFELIFRC

     551  RFVWKIEDVD IDSDRRDIEA IVWEEPEPKT LWDKFWNVVG *


Protein Sequence for WashU_Smik_Contig2875.2:

WashU_Smik_Contig2875.2  Length: 591  Mon Nov  7 15:13:12 2016  Type: P  Check: 2248  ..

       1  MTDPKQNTEI EEKHMYNEPV TTLFHDVEAS QTHRRRGSLP LKDEKNKELY

      51  PLHSFSTRVD GEDTFSMGDV IRDEDQGEVR NAEVKRELKQ RHIGMIALGG

     101  TIGTGLFIGL SKPLANAGPV GALISYLFMG SLAYSVTQSL GEMATFIPVT

     151  SSFTVFSQRF LSPAFGAANG YMYWFSWAIT FALELSVVGQ VIEFWTNKVP

     201  LAAWISIFWV IITIMNLFPV KYYGEFEFWV ASIKVLAIIG FLIYCFCMVC

     251  GAGVTGPVGF RYWRNPGAWG PGIISKDKNE GRFLGWVSSL INAAFTFQGT

     301  ELVGITAGEA ANPRKTVPRA IKKVVFRILT FYIGSLLFIG LLVPYNDPKL

     351  TTATSYVSVS PFIIAIENSG TKVLPHIFNA VILTTIISAA NSNIYVGSRI

     401  LFGLSKNKLA PKFLSRTTKG GVPYIAVFIT AVFGALAYME TSTGGDKVFE

     451  WLLNITGVAG FFAWLLISIS HIRFMQALKY RGISRDELPF KAKLMPGLAY

     501  FAATFMIIII LIQGFTAFAP KFDGIDFLAA YISIFLFLAI WILFEIIFRC

     551  RFIWKIEDVD IDSDRRDIEA IVWEDHEPKT FWDKFWNVVA *