Fungal Sequence Alignment Help



This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

Currently, this page displays other fungal sequences from Cliften et al. and Kellis et al.

ClustalW Protein Alignment and Sequence for YHR134W and Homologs


Choose two or more sequences for alignment:
Best Hits & Orthologs

Pick a sequence type:

Select or unselect multiple options for sequence
name by pressing the Control (PC) or Command
(Mac) key while clicking.

Align selected sequences
selected sequences (FASTA format)


Symbols:
* = identical
: = strong similarity
. = weak similarity

SGD_Scer_WSS1/YHR134W   1   MKTEGIKSPSAKYHDMAGSQRIPHKNPHIQKVAVLQSKPNKEDALNLIKE   50
MIT_Smik_c665_10133   1   --------------MTGFQGIATHKNPHIKKVAVLQCKPNQEGALNIIKE   36
MIT_Spar_c32_9877   1   --------------MAGFQRIVAHKNPHIRKVAVLQCKPNKEDALNLIKE   36
MIT_Suva_c182_10793   1   -----------------------------------------------MKE   3
WashU_Scas_Contig697.13   1   ----------------------MHENPHITKVAVLQRKPNNEYALQILQD   28
WashU_Sklu_Contig1881.2   1   -------------------MTVQKKNPHIGRIASLQGKPNKDDALALLED   31
WashU_Skud_Contig2014.13   1   --------------MTGSQRLATHNNPHIQKVAVLQRKPHQEDALLLIKK   36
WashU_Smik_Contig2708.2   1   --------------MTGFQGIATHKNPHIKKVAVLQCKPNQEGALNIIKE   36
Symbols






::.



SGD_Scer_WSS1/YHR134W   51   IAHKVSYLMKENHFKVTNLVEFYPRDQRLLGMNVNHGSKIMLRLRCSTDE   100
MIT_Smik_c665_10133   37   IARKVSFLMKENHLKVVSLVEFYPRDQRLLGMNVNHGLKIMLRLRCPTDE   86
MIT_Spar_c32_9877   37   IANKVSYLMKENNFKVVSLVEFYPRDQRLLGMNVNHGFKIMLRLRCPKDE   86
MIT_Suva_c182_10793   4   VAHKVSYLMRENHFKVVSLVEFYPHDQRLLGMNVNRGLKIMLRLRCPTDE   53
WashU_Scas_Contig697.13   29   ITKQVSYLMKEEKFKVQTLVEFYPKDKRLLGMNVNAGQKIMLRLRTPGDE   78
WashU_Sklu_Contig1881.2   32   IAHRVSYLMRENKFKVGELVEFYPRDKRLLGMNVNGGAKIMLRLRHPNDE   81
WashU_Skud_Contig2014.13   37   IAHKVSYLMKENHFKVVSLVEFYPRDQRLLGMNVNHGFKIMLRLRCPTDE   86
WashU_Smik_Contig2708.2   37   IARKVSFLMKENHLKVVSLVEFYPRDQRLLGMNVNHGLKIMLRLRCPTDE   86
Symbols






::.:**:**:*:::** ******:*:******** * ******* . **



SGD_Scer_WSS1/YHR134W   101   FQFLPMECIMGTMLHELTHNLFGPHDKKFYNKLDELIGRQWVIEQRGLYD   150
MIT_Smik_c665_10133   87   FQFLSMESILGTMLHELTHNLFGPHDKKFYDKLDELIGRQWVIEQRGLYD   136
MIT_Spar_c32_9877   87   FQFLPMESIMGTMLHELTHNVFGPHDKKFYDKLDDLIGRQWVIEQRGLYD   136
MIT_Suva_c182_10793   54   SQFLPMESIMGTMLHELTHNLFGPHDKKFYDKLDGLIGRQWVIEQMGLHD   103
WashU_Scas_Contig697.13   79   FQFLNREAILGTMLHELTHNLFGPHDRRFYEKLDQLSARQWVIEQQGLFD   128
WashU_Sklu_Contig1881.2   82   SQFLARESILGTMLHELTHNLFGPHDAKFYRKLDDLSGTQWVIEQRGLFD   131
WashU_Skud_Contig2014.13   87   FQFLPIESIMGTMLHELTHNLFGPHDKTFYDKLDALIGRQWVIEQRGLYD   136
WashU_Smik_Contig2708.2   87   FQFLSMESILGTMLHELTHNLFGPHDKKFYDKLDELIGRQWVIEQRGLYD   136
Symbols






*** *.*:**********:***** ** *** * . ****** **.*



SGD_Scer_WSS1/YHR134W   151   TFLGNGQRLGGRANLRSNRYPMTGISTNTGIVRKRGKGVKLGSLHP-EGI   199
MIT_Smik_c665_10133   137   TFLGNGHRLGGRSNLRSAGDPVTGMQTNTGIVRRRNKGVKLGTLNS-EGV   185
MIT_Spar_c32_9877   137   TFLGNGQRLGGRTNSCSNRYPMTGISTNTGIVRRRGKGVKLGSLHP-EGV   185
MIT_Suva_c182_10793   104   AFLGKGQRLGGRSNVLSNRYPMTGVSTNTGIMRRRGKGVKLGTLSLGAQP   153
WashU_Scas_Contig697.13   129   TFLGSGRRLGGSTRTLSNNRRVRSIIGRS----GKGRGRKLGTITN--RP   172
WashU_Sklu_Contig1881.2   132   SFVGRGRRLGCTPRSRIPPTERR--------------------LGTIEVV   161
WashU_Skud_Contig2014.13   137   TFLGNGKRLGGRSNVRSNRYPVTGISTDTERVRRRGKGIKLGSLSS-PGL   185
WashU_Smik_Contig2708.2   137   TFLGNGHRLGGRSNLRSAGDPVTGMQTNTGIVRRRNKGVKLGTLNS-EGV   185
Symbols






:*:* *:*** .. :



SGD_Scer_WSS1/YHR134W   200   SSIDRGNSPRELAAFAAERRYRDDRWCGETKNN----------KDQIISD   239
MIT_Smik_c665_10133   186   SSKNRGKSPREMAALAAERRYKDDRWCGETKNN----------KDQIISD   225
MIT_Spar_c32_9877   186   SSKDRGKSPRELAALAAERRYRDDRWCGEMKNN----------KDQIISD   225
MIT_Suva_c182_10793   154   SSPNRGKTPREMAALAAERRYKDDRWCGENKSS----------KNQIN-D   192
WashU_Scas_Contig697.13   173   SSTFEGKTPREMAAVAAERRYNDDKWCGEKNNL----------ENKKKLE   212
WashU_Sklu_Contig1881.2   162   SSNNRDKSPKRMAAAAAEKRARDTMWCGDLKRNKHVEPDAAELEYIILDD   211
WashU_Skud_Contig2014.13   186   SPMNRGKSPREMAALAAERRYKDDRWCGESKNN----------KDQIISD   225
WashU_Smik_Contig2708.2   186   SSKNRGKSPREMAALAAERRYKDDRWCGETKNN----------KDQIISD   225
Symbols






*. ..::*:.:** ***:* .* ***: : : :



SGD_Scer_WSS1/YHR134W   240   NISS---SLEVVILDDDDE------------VLPGDTLIEVIDLT   269
MIT_Smik_c665_10133   226   NNKD---SLEIVILDDDDEQ-----------ESPRDKLVEVIDLT   256
MIT_Spar_c32_9877   226   NNND---LLEVVILDDDEEEE----------MSQRDTSIEVIDLT   257
MIT_Suva_c182_10793   193   YNDD---LLEIVVLDDEEGS-------------PGPMRSEVIDLT   221
WashU_Scas_Contig697.13   213   PNQDDLREETIIILDDDDATTTPETQKENKDDDDDIRPIEIIDLT   257
WashU_Sklu_Contig1881.2   212   EDKDQARENGNSSILVVDLLNAHRDQNSCDRRRQGSGTPEIIDLT   256
WashU_Skud_Contig2014.13   226   NNDD---LLEVVVLEDEQEP--------------PRDTTEVIDLT   253
WashU_Smik_Contig2708.2   226   NNKD---SLEIVILDDDDEQ-----------ESPRDKLVEVIDLT   256
Symbols






.. : : *:****



Symbols:
* = identical
: = strong similarity
. = weak similarity


- Download all sequences in alignment, in FASTA format.
GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_WSS1/YHR134W:

SGD_Scer_WSS1/YHR134W  Length: 270  Mon Nov  7 15:37:35 2016  Type: P  Check: 9788  ..

       1  MKTEGIKSPS AKYHDMAGSQ RIPHKNPHIQ KVAVLQSKPN KEDALNLIKE

      51  IAHKVSYLMK ENHFKVTNLV EFYPRDQRLL GMNVNHGSKI MLRLRCSTDE

     101  FQFLPMECIM GTMLHELTHN LFGPHDKKFY NKLDELIGRQ WVIEQRGLYD

     151  TFLGNGQRLG GRANLRSNRY PMTGISTNTG IVRKRGKGVK LGSLHPEGIS

     201  SIDRGNSPRE LAAFAAERRY RDDRWCGETK NNKDQIISDN ISSSLEVVIL

     251  DDDDEVLPGD TLIEVIDLT*

Protein Sequence for MIT_Smik_c665_10133:

MIT_Smik_c665_10133  Length: 257  Mon Nov  7 15:37:35 2016  Type: P  Check: 5684  ..

       1  MTGFQGIATH KNPHIKKVAV LQCKPNQEGA LNIIKEIARK VSFLMKENHL

      51  KVVSLVEFYP RDQRLLGMNV NHGLKIMLRL RCPTDEFQFL SMESILGTML

     101  HELTHNLFGP HDKKFYDKLD ELIGRQWVIE QRGLYDTFLG NGHRLGGRSN

     151  LRSAGDPVTG MQTNTGIVRR RNKGVKLGTL NSEGVSSKNR GKSPREMAAL

     201  AAERRYKDDR WCGETKNNKD QIISDNNKDS LEIVILDDDD EQESPRDKLV

     251  EVIDLT*

Protein Sequence for MIT_Spar_c32_9877:

MIT_Spar_c32_9877  Length: 258  Mon Nov  7 15:37:35 2016  Type: P  Check: 9799  ..

       1  MAGFQRIVAH KNPHIRKVAV LQCKPNKEDA LNLIKEIANK VSYLMKENNF

      51  KVVSLVEFYP RDQRLLGMNV NHGFKIMLRL RCPKDEFQFL PMESIMGTML

     101  HELTHNVFGP HDKKFYDKLD DLIGRQWVIE QRGLYDTFLG NGQRLGGRTN

     151  SCSNRYPMTG ISTNTGIVRR RGKGVKLGSL HPEGVSSKDR GKSPRELAAL

     201  AAERRYRDDR WCGEMKNNKD QIISDNNNDL LEVVILDDDE EEEMSQRDTS

     251  IEVIDLT*

Protein Sequence for MIT_Suva_c182_10793:

MIT_Suva_c182_10793  Length: 222  Mon Nov  7 15:37:35 2016  Type: P  Check: 5129  ..

       1  MKEVAHKVSY LMRENHFKVV SLVEFYPHDQ RLLGMNVNRG LKIMLRLRCP

      51  TDESQFLPME SIMGTMLHEL THNLFGPHDK KFYDKLDGLI GRQWVIEQMG

     101  LHDAFLGKGQ RLGGRSNVLS NRYPMTGVST NTGIMRRRGK GVKLGTLSLG

     151  AQPSSPNRGK TPREMAALAA ERRYKDDRWC GENKSSKNQI NDYNDDLLEI

     201  VVLDDEEGSP GPMRSEVIDL T*

Protein Sequence for WashU_Scas_Contig697.13:

WashU_Scas_Contig697.13  Length: 258  Mon Nov  7 15:37:35 2016  Type: P  Check: 7298  ..

       1  MHENPHITKV AVLQRKPNNE YALQILQDIT KQVSYLMKEE KFKVQTLVEF

      51  YPKDKRLLGM NVNAGQKIML RLRTPGDEFQ FLNREAILGT MLHELTHNLF

     101  GPHDRRFYEK LDQLSARQWV IEQQGLFDTF LGSGRRLGGS TRTLSNNRRV

     151  RSIIGRSGKG RGRKLGTITN RPSSTFEGKT PREMAAVAAE RRYNDDKWCG

     201  EKNNLENKKK LEPNQDDLRE ETIIILDDDD ATTTPETQKE NKDDDDDIRP

     251  IEIIDLT*

Protein Sequence for WashU_Sklu_Contig1881.2:

WashU_Sklu_Contig1881.2  Length: 257  Mon Nov  7 15:37:35 2016  Type: P  Check: 4790  ..

       1  MTVQKKNPHI GRIASLQGKP NKDDALALLE DIAHRVSYLM RENKFKVGEL

      51  VEFYPRDKRL LGMNVNGGAK IMLRLRHPND ESQFLARESI LGTMLHELTH

     101  NLFGPHDAKF YRKLDDLSGT QWVIEQRGLF DSFVGRGRRL GCTPRSRIPP

     151  TERRLGTIEV VSSNNRDKSP KRMAAAAAEK RARDTMWCGD LKRNKHVEPD

     201  AAELEYIILD DEDKDQAREN GNSSILVVDL LNAHRDQNSC DRRRQGSGTP

     251  EIIDLT*

Protein Sequence for WashU_Skud_Contig2014.13:

WashU_Skud_Contig2014.13  Length: 254  Mon Nov  7 15:37:35 2016  Type: P  Check: 2328  ..

       1  MTGSQRLATH NNPHIQKVAV LQRKPHQEDA LLLIKKIAHK VSYLMKENHF

      51  KVVSLVEFYP RDQRLLGMNV NHGFKIMLRL RCPTDEFQFL PIESIMGTML

     101  HELTHNLFGP HDKTFYDKLD ALIGRQWVIE QRGLYDTFLG NGKRLGGRSN

     151  VRSNRYPVTG ISTDTERVRR RGKGIKLGSL SSPGLSPMNR GKSPREMAAL

     201  AAERRYKDDR WCGESKNNKD QIISDNNDDL LEVVVLEDEQ EPPRDTTEVI

     251  DLT*

Protein Sequence for WashU_Smik_Contig2708.2:

WashU_Smik_Contig2708.2  Length: 257  Mon Nov  7 15:37:35 2016  Type: P  Check: 5684  ..

       1  MTGFQGIATH KNPHIKKVAV LQCKPNQEGA LNIIKEIARK VSFLMKENHL

      51  KVVSLVEFYP RDQRLLGMNV NHGLKIMLRL RCPTDEFQFL SMESILGTML

     101  HELTHNLFGP HDKKFYDKLD ELIGRQWVIE QRGLYDTFLG NGHRLGGRSN

     151  LRSAGDPVTG MQTNTGIVRR RNKGVKLGTL NSEGVSSKNR GKSPREMAAL

     201  AAERRYKDDR WCGETKNNKD QIISDNNKDS LEIVILDDDD EQESPRDKLV

     251  EVIDLT*