Fungal Sequence Alignment

Help

This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

Currently, this page displays other fungal sequences from Cliften et al. and Kellis et al. We will soon include sequences from other fungal genomes from a variety of sources.

ClustalW Protein Alignment and Sequence for YGR058W and Homologs

Choose two or more sequences for alignment:
Best Hits & Orthologs

Pick a sequence type:

Select or unselect multiple options for sequence
name by pressing the Control (PC) or Command
(Mac) key while clicking.

Align selected sequences
selected sequences (FASTA format)


Symbols:
* = identical
: = strong similarity
. = weak similarity

SGD_Scer_PEF1/YGR058W   1   MCAKKLKYAAGDDFVRYATPKEAMEETRREFEKEKQRQQQ---IKVTQAQ   47
MIT_Sbay_c592_8988   1   MCAKKLKYAAGDDFVRYATPKEAMEETRREFDKERQRQQQ---MKTAQPQ   47
MIT_Smik_c521_8449   1   MCAKKLKYAAGDDFVRYATPKEAMEETRREFEKEKQRQQQ---MKTTQPQ   47
MIT_Spar_c7_8201   1   MCAKKLKYAAGDDFVRYATPKEAMEETRREFEKEKQRQQQ---TKVTQTQ   47
WashU_Sbay_Contig613.10   1   MCAKKLKYAAGDDFVRYATPKEAMEETRREFDKERQRQQQ---MKTAQPQ   47
WashU_Scas_Contig683.16   1   MGKKKLNYATGDDLQLYATPKEAMEESRRRALEEKIRKRQQYEAQMANEQ   50
WashU_Skud_Contig2031.3   1   MCAKKLKYAAGDDFVRYATPKEAMEETRREFEKEKQRQQQ---MKTAQVQ   47
Symbols






* ***:**:***: **********:**. :*: *::* : :: *



SGD_Scer_PEF1/YGR058W   48   TPNTRVHSAPIPLQTQYNKNRAENGHHSYGSPQSYSPRHTKTP-VDPRYN   96
MIT_Sbay_c592_8988   48   LPNTRIHSAPIPFQNQYVKNRVESGHQSYGSPQNFSPRHTKVPPIDPRYN   97
MIT_Smik_c521_8449   48   ATNTRIHSAPISLQTQYVKNRVESGHQSYGSPQNFSPRHTKTP-VDPRYN   96
MIT_Spar_c7_8201   48   TPNTRVHSAPIPLQNQYTKNRVENGHHPYGSPQSYSPRHTKTP-VDPRYN   96
WashU_Sbay_Contig613.10   48   LPNTRIHSAPIPFQNQYVKNRVESGHQSYGSPQNFSPRHTKVPPIDPRYN   97
WashU_Scas_Contig683.16   51   RRSGRISSAPVMTPPNFLNNNRPPPLQYQPPSQHNGSHHTIQETFISPNS   100
WashU_Skud_Contig2031.3   48   TGNTRIHSAPIPLQNQFVKNRVESEHQSYGSPQSFSPRHTKVP-IDPRYN   96
Symbols






. *: ***: :: :*. : ..* ..:** . . .



SGD_Scer_PEF1/YGR058W   97   VIAQKPAGRPIPPAPTHYNNLNTSAQRIASSPPP-LIHNQAVPAQLLKKV   145
MIT_Sbay_c592_8988   98   NIVQKPVSRPIPPPPKHYNSASSSSLRIATPPSI---------QNQPVPV   138
MIT_Smik_c521_8449   97   MIAQKPGGRPIPPIQNHCNSVSSSSQRIASSPPPPLMYNQTVPAQVSKKV   146
MIT_Spar_c7_8201   97   AIAQKPTGRPIPPTPSHYNNLNSSSQRIVSSPPP-LMHNQSVPAQLLKKV   145
WashU_Sbay_Contig613.10   98   NIVQKPVSRPIPPPPKHYNSASSSSLRIATPPSI---------QNQPVPV   138
WashU_Scas_Contig683.16   101   RPGYN--TRPSPPAQNMNVSNPMPIPHVGRTMNHPVPPYIRNSSSSPSSR   148
WashU_Skud_Contig2031.3   97   ILAQKTAGRPIPPAPNHYS--NSPSQRITSSPPPPLMHNQSLPASCKKKV   144
Symbols






: ** ** . . :: . .



SGD_Scer_PEF1/YGR058W   146   APASFDSREDVRDMQVATQLFHNHDVKGKNRLTAEELQNLLQNDDNSHFC   195
MIT_Sbay_c592_8988   139   VTAPFESKEDLRDMQVAVQLFHNHDIKGKNRLTAEELQNLLQNDDNSHFC   188
MIT_Smik_c521_8449   147   APVPFDNKEDVRDMQVAIQLFHNHDIKGKNRLTAEELQNLLQNDDNSHFC   196
MIT_Spar_c7_8201   146   APAPFDNREEVRDMQVAIQLFHNHDVKGKNRLTAEELQNLLQNDDNSHFC   195
WashU_Sbay_Contig613.10   139   VTAPFESKEDLRDMQVAVQLFHNHDIKGKNRLTAEELQNLLQNDDNSHFC   188
WashU_Scas_Contig683.16   149   QTSSNLQDPESKDIQVARKLFQNHDIKNRGRLTAEELQNLLQNDDNTHFC   198
WashU_Skud_Contig2031.3   145   TPASFDNKEDLRDVQVAIQLFHNHDIKGKNRLTAEELQNLLQNDDNSHFC   194
Symbols






. . . : :*:*** :**:***:*.:.****************:***



SGD_Scer_PEF1/YGR058W   196   ISSVDALINLFGASRFGTVNQAEFIALYKRVKSWRKVYVDNDINGSLTIS   245
MIT_Sbay_c592_8988   189   ISSVDALINLFGASRFGTVNQAEFISLYKRVKSWRKIYVDNDINGSLTIS   238
MIT_Smik_c521_8449   197   MSSVDALINLFGASRFGTVNQTEFIALYKRVKSWRKIYVDNDINRSLTIS   246
MIT_Spar_c7_8201   196   ISSVDALINLFGASRFGTVNQTEFIALYKRVKSWRKVYVDNDINGSLTIS   245
WashU_Sbay_Contig613.10   189   ISSVDALINLFGASRFGTVNQAEFISLYKRVKSWRKIYVDNDINGSLTIS   238
WashU_Scas_Contig683.16   199   ISSIDALINLFGATRFGTINQQEFVSLYKRVKIWRKVYVDNDINSSFTIT   248
WashU_Skud_Contig2031.3   195   ISSVDALINLFGASRFGTVNQAEFIALYKRVKSWRKIYVDNDINGSLTIS   244
Symbols






:**:*********:****:** **::****** ***:******* *:**:



SGD_Scer_PEF1/YGR058W   246   VSEFHNSLQELGYLIPFEVSEKTFDQYAEFINRNGTGKELKFDKFVEALV   295
MIT_Sbay_c592_8988   239   VSEFHNSLQELGYLIPFEVSEKTFDQYAEFINRSGTGKELKFDKFVEALV   288
MIT_Smik_c521_8449   247   VGEFHNSLQELGYLIPFEVSEKTFDQYAEFINRSGTGKELKFDKFVEALV   296
MIT_Spar_c7_8201   246   VSEFHNSLQELGYLIPFEVSEKTFDQYAEFINRNGTGKELKFDKFVEALV   295
WashU_Sbay_Contig613.10   239   VSEFHNSLQELGYLIPFEVSEKTFDQYAEFINRSGTGKELKFDKFVEALV   288
WashU_Scas_Contig683.16   249   VTEFHNSLQELQYLIPYEVSEKLFDQYAEFINENNNSKELKFDKFVEVLV   298
WashU_Skud_Contig2031.3   245   VSEFHNSLQELGYLIPFEVSEKTFDQYAEFINRSGTVKELKFDKFVEALV   294
Symbols






* ********* ****:***** *********.... **********.**



SGD_Scer_PEF1/YGR058W   296   WLMRLTKLFRKFDTNQEGIATIQYKDFIDATLYLGRFLPH   335
MIT_Sbay_c592_8988   289   WLMRLTKLFRKFDTNQEGVATIQYKDFIDSTLYLGRFLPH   328
MIT_Smik_c521_8449   297   WLMRLTKLFRKFDTKQEGIATIQYKDFIDATLYLGRFLPH   336
MIT_Spar_c7_8201   296   WLMRLTKLFRKFDTNQEGIATIQYKDFIDATLYLGRFLPH   335
WashU_Sbay_Contig613.10   289   WLMRLTKLFRKFDTNQEGVATIQYKDFIDSTLYLGRFLPH   328
WashU_Scas_Contig683.16   299   WLMRLTRMFRKFDTQQDGVANIHYKDFIDMTLYLGRFLPH   338
WashU_Skud_Contig2031.3   295   WLMRLTKLFRKFDSNQEGTATIQYKDFIDATLYLGRFLPH   334
Symbols






******::*****::*:* *.*:****** **********



Symbols:
* = identical
: = strong similarity
. = weak similarity


- Download all sequences in alignment, in FASTA format.
GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_PEF1/YGR058W:

SGD_Scer_PEF1/YGR058W  Length: 336  Sat Dec 10 10:52:48 2011  Type: P  Check: 2429  ..

       1  MCAKKLKYAA GDDFVRYATP KEAMEETRRE FEKEKQRQQQ IKVTQAQTPN

      51  TRVHSAPIPL QTQYNKNRAE NGHHSYGSPQ SYSPRHTKTP VDPRYNVIAQ

     101  KPAGRPIPPA PTHYNNLNTS AQRIASSPPP LIHNQAVPAQ LLKKVAPASF

     151  DSREDVRDMQ VATQLFHNHD VKGKNRLTAE ELQNLLQNDD NSHFCISSVD

     201  ALINLFGASR FGTVNQAEFI ALYKRVKSWR KVYVDNDING SLTISVSEFH

     251  NSLQELGYLI PFEVSEKTFD QYAEFINRNG TGKELKFDKF VEALVWLMRL

     301  TKLFRKFDTN QEGIATIQYK DFIDATLYLG RFLPH*

Protein Sequence for MIT_Sbay_c592_8988:

MIT_Sbay_c592_8988  Length: 329  Sat Dec 10 10:52:48 2011  Type: P  Check: 719  ..

       1  MCAKKLKYAA GDDFVRYATP KEAMEETRRE FDKERQRQQQ MKTAQPQLPN

      51  TRIHSAPIPF QNQYVKNRVE SGHQSYGSPQ NFSPRHTKVP PIDPRYNNIV

     101  QKPVSRPIPP PPKHYNSASS SSLRIATPPS IQNQPVPVVT APFESKEDLR

     151  DMQVAVQLFH NHDIKGKNRL TAEELQNLLQ NDDNSHFCIS SVDALINLFG

     201  ASRFGTVNQA EFISLYKRVK SWRKIYVDND INGSLTISVS EFHNSLQELG

     251  YLIPFEVSEK TFDQYAEFIN RSGTGKELKF DKFVEALVWL MRLTKLFRKF

     301  DTNQEGVATI QYKDFIDSTL YLGRFLPH*

Protein Sequence for MIT_Smik_c521_8449:

MIT_Smik_c521_8449  Length: 337  Sat Dec 10 10:52:48 2011  Type: P  Check: 5686  ..

       1  MCAKKLKYAA GDDFVRYATP KEAMEETRRE FEKEKQRQQQ MKTTQPQATN

      51  TRIHSAPISL QTQYVKNRVE SGHQSYGSPQ NFSPRHTKTP VDPRYNMIAQ

     101  KPGGRPIPPI QNHCNSVSSS SQRIASSPPP PLMYNQTVPA QVSKKVAPVP

     151  FDNKEDVRDM QVAIQLFHNH DIKGKNRLTA EELQNLLQND DNSHFCMSSV

     201  DALINLFGAS RFGTVNQTEF IALYKRVKSW RKIYVDNDIN RSLTISVGEF

     251  HNSLQELGYL IPFEVSEKTF DQYAEFINRS GTGKELKFDK FVEALVWLMR

     301  LTKLFRKFDT KQEGIATIQY KDFIDATLYL GRFLPH*

Protein Sequence for MIT_Spar_c7_8201:

MIT_Spar_c7_8201  Length: 336  Sat Dec 10 10:52:48 2011  Type: P  Check: 5857  ..

       1  MCAKKLKYAA GDDFVRYATP KEAMEETRRE FEKEKQRQQQ TKVTQTQTPN

      51  TRVHSAPIPL QNQYTKNRVE NGHHPYGSPQ SYSPRHTKTP VDPRYNAIAQ

     101  KPTGRPIPPT PSHYNNLNSS SQRIVSSPPP LMHNQSVPAQ LLKKVAPAPF

     151  DNREEVRDMQ VAIQLFHNHD VKGKNRLTAE ELQNLLQNDD NSHFCISSVD

     201  ALINLFGASR FGTVNQTEFI ALYKRVKSWR KVYVDNDING SLTISVSEFH

     251  NSLQELGYLI PFEVSEKTFD QYAEFINRNG TGKELKFDKF VEALVWLMRL

     301  TKLFRKFDTN QEGIATIQYK DFIDATLYLG RFLPH*

Protein Sequence for WashU_Sbay_Contig613.10:

WashU_Sbay_Contig613.10  Length: 329  Sat Dec 10 10:52:48 2011  Type: P  Check: 719  ..

       1  MCAKKLKYAA GDDFVRYATP KEAMEETRRE FDKERQRQQQ MKTAQPQLPN

      51  TRIHSAPIPF QNQYVKNRVE SGHQSYGSPQ NFSPRHTKVP PIDPRYNNIV

     101  QKPVSRPIPP PPKHYNSASS SSLRIATPPS IQNQPVPVVT APFESKEDLR

     151  DMQVAVQLFH NHDIKGKNRL TAEELQNLLQ NDDNSHFCIS SVDALINLFG

     201  ASRFGTVNQA EFISLYKRVK SWRKIYVDND INGSLTISVS EFHNSLQELG

     251  YLIPFEVSEK TFDQYAEFIN RSGTGKELKF DKFVEALVWL MRLTKLFRKF

     301  DTNQEGVATI QYKDFIDSTL YLGRFLPH*

Protein Sequence for WashU_Scas_Contig683.16:

WashU_Scas_Contig683.16  Length: 339  Sat Dec 10 10:52:48 2011  Type: P  Check: 155  ..

       1  MGKKKLNYAT GDDLQLYATP KEAMEESRRR ALEEKIRKRQ QYEAQMANEQ

      51  RRSGRISSAP VMTPPNFLNN NRPPPLQYQP PSQHNGSHHT IQETFISPNS

     101  RPGYNTRPSP PAQNMNVSNP MPIPHVGRTM NHPVPPYIRN SSSSPSSRQT

     151  SSNLQDPESK DIQVARKLFQ NHDIKNRGRL TAEELQNLLQ NDDNTHFCIS

     201  SIDALINLFG ATRFGTINQQ EFVSLYKRVK IWRKVYVDND INSSFTITVT

     251  EFHNSLQELQ YLIPYEVSEK LFDQYAEFIN ENNNSKELKF DKFVEVLVWL

     301  MRLTRMFRKF DTQQDGVANI HYKDFIDMTL YLGRFLPH*

Protein Sequence for WashU_Skud_Contig2031.3:

WashU_Skud_Contig2031.3  Length: 335  Sat Dec 10 10:52:48 2011  Type: P  Check: 7553  ..

       1  MCAKKLKYAA GDDFVRYATP KEAMEETRRE FEKEKQRQQQ MKTAQVQTGN

      51  TRIHSAPIPL QNQFVKNRVE SEHQSYGSPQ SFSPRHTKVP IDPRYNILAQ

     101  KTAGRPIPPA PNHYSNSPSQ RITSSPPPPL MHNQSLPASC KKKVTPASFD

     151  NKEDLRDVQV AIQLFHNHDI KGKNRLTAEE LQNLLQNDDN SHFCISSVDA

     201  LINLFGASRF GTVNQAEFIA LYKRVKSWRK IYVDNDINGS LTISVSEFHN

     251  SLQELGYLIP FEVSEKTFDQ YAEFINRSGT VKELKFDKFV EALVWLMRLT

     301  KLFRKFDSNQ EGTATIQYKD FIDATLYLGR FLPH*