Fungal Sequence Alignment

Help

This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

Currently, this page displays other fungal sequences from Cliften et al. and Kellis et al. We will soon include sequences from other fungal genomes from a variety of sources.

ClustalW Protein Alignment and Sequence for YPL054W and Homologs

Choose two or more sequences for alignment:
Best Hits & Orthologs

Pick a sequence type:

Select or unselect multiple options for sequence
name by pressing the Control (PC) or Command
(Mac) key while clicking.

Align selected sequences
selected sequences (FASTA format)


Symbols:
* = identical
: = strong similarity
. = weak similarity

SGD_Scer_LEE1/YPL054W   1   MDAFENMSVSNHPGGNARRNSQSANEMLASQIQDFQNIPRSFNDSNANVN   50
MIT_Sbay_c678_25080   1   MDAFDNMSVLNRPGSNARRSSQSSNEMFAPQIPDLQNIPRSFNNSNTTTN   50
MIT_Smik_c1102_22086   1   MDAYDNMSVLNHPGSNARRSSQSANEIFAPQIQDFQNIPRSFNANNTKLN   50
MIT_Spar_c79_23616   1   MDAFENMSVSNHPGSNARRSSQSAGEMFAPQIQDFQNIPRSFNNNNATVN   50
WashU_Sbay_Contig521.4   1   ------MSVLNRPGSNARRSSQSSNEMFAPQIPDLQNIPRSFNNSNTTTN   44
WashU_Scas_Contig567.6   1   ------------------------------------MMSHPTNTNNQTLN   14
Symbols






:.:. * .* . *



SGD_Scer_LEE1/YPL054W   51   LSKNCTVGNQLPFSSRQQKIIMEHLLITKNNSQQ-QKDYSHVPCKFFKMG   99
MIT_Sbay_c678_25080   51   INLPR---NQLPFSSHQQKIIMEHLLITKNNTQQ-QKDYSHVPCKFFKMG   96
MIT_Smik_c1102_22086   51   FPKNHNAANQLPFSSHQQKIIMEHLLITKNNSQQ-QKDYSHVPCKFFKMG   99
MIT_Spar_c79_23616   51   LSKNYNAANQLPFSSHQQKIIMEHLLITKNNSQQ-QKDYSHVPCKFFKMG   99
WashU_Sbay_Contig521.4   45   INLPR---NQLPFSSHQQKIIMEHLLITKNNTQQ-QKDYSHVPCKFFKMG   90
WashU_Scas_Contig567.6   15   D--------------EQKKLIIKHIEFTKQQSMANHKNYSHVPCKFFKQG   50
Symbols






.*:*:*::*: :**::: :*:********** *



SGD_Scer_LEE1/YPL054W   100   NCQAGSSCPFSHSPDIISSANNLPCKYFAKGNCKFGNKCVNAHVLPNG--   147
MIT_Sbay_c678_25080   97   NCQAGPSCPFSHSPDIINSANNLPCKYFAKGNCKFGNKCVNAHILPNG--   144
MIT_Smik_c1102_22086   100   NCQAGTSCPFSHSPDIISSANNLPCKYFAKGNCKFGNKCVNAHTLPNG--   147
MIT_Spar_c79_23616   100   NCQAGSSCPFSHSPDIISSANNLPCKYFAKGNCKFGNKCVNAHVLPNG--   147
WashU_Sbay_Contig521.4   91   NCQAGPSCPFSHSPDIINSANNLPCKYFAKGNCKFGNKCVNAHILPNG--   138
WashU_Scas_Contig567.6   51   NCQAGNTCPFSHSLDIN-KANSTPCKYFKLGNCKFGSKCANAHILPDGTI   99
Symbols






***** :****** ** .**. ***** ******.**.*** **:*



SGD_Scer_LEE1/YPL054W   148   --------------FKMNSKEPIDITPPSQNNYLSHARSASFSTYTS---   180
MIT_Sbay_c678_25080   145   --------------SRMNSKGPIEIAPSSNNNYFSHTRSASFSTYMS---   177
MIT_Smik_c1102_22086   148   --------------FKMNGKDPIDIASPSKNNYPSHTRSASFSTFMS---   180
MIT_Spar_c79_23616   148   --------------FKMNSREPIEITPPSQNNYLSHARSASFSTYMS---   180
WashU_Sbay_Contig521.4   139   --------------SRMNSKGPIEIAPSSNNNYFSHTRSASFSTYMS---   171
WashU_Scas_Contig567.6   100   IQYNNNNNRQRQNKFKHNNNQPSPHTISQQYTNYLNLNNMNTQYYTSNAP   149
Symbols






: *.. * : ..: . : .. . . : *



SGD_Scer_LEE1/YPL054W   181   ---------------------------------------PPLSAQTEFSH   191
MIT_Sbay_c678_25080   178   ---------------------------------------PPMSANTDISH   188
MIT_Smik_c1102_22086   181   ---------------------------------------PPLSVHTEFSN   191
MIT_Spar_c79_23616   181   ---------------------------------------PPLSAQTEFSN   191
WashU_Sbay_Contig521.4   172   ---------------------------------------PPMSANTDISH   182
WashU_Scas_Contig567.6   150   ISDPTPLQQQQHFNRSTSYSMERNQTNYPLHPFVVFSSKAPPPSQTQTQN   199
Symbols






.* . :*: .:



SGD_Scer_LEE1/YPL054W   192   SASNANYFSSQYLMYSPQKSPEALYTEFFSPPSSSSSYINYSYNNS--NI   239
MIT_Sbay_c678_25080   189   SASSTNYFTPQYPLSPPQKGLDALHSDFFSPPSTSSSYVNYNYSNANSNA   238
MIT_Smik_c1102_22086   192   SASNANYFPSQYPMSSPQKSPGVLHTEFFSPPSSSSSYINYNYN----KI   237
MIT_Spar_c79_23616   192   SASNANHFSSQYLMSSPQKSPEALNTEFFSPPSSSSSYINYNYNNS--NL   239
WashU_Sbay_Contig521.4   183   SASSTNYFTPQYPLSPPQKGLDALHSDFFSPPSTSSSYVNYNYSNANSNA   232
WashU_Scas_Contig567.6   200   SASTYNYTSSLYSALGDPSNTNTNNNYIYNAPTSNSNSNYNNINWSLSNN   249
Symbols






***. *: .. * .. . . ::..*::.*. . . :



SGD_Scer_LEE1/YPL054W   240   NAYSPVSSSSSNIWQEQGQTTLSNPSVNQNLRYRTGPAIQEESDNEIEDL   289
MIT_Sbay_c678_25080   239   STYSPVSSSSSNIWQEQGQTTLSNSSMSQNTRYCTSPVIQEESDNEIEEM   288
MIT_Smik_c1102_22086   238   TAYSPVSSSSSNIWQEQGQTTLSNPSMNQNLKYRTGPAIQEESDDEIEEL   287
MIT_Spar_c79_23616   240   NAYSPVSSSSSNIWQEQGQTTLSNPSVNQNLRHRTGPAIQEESDNEIEEL   289
WashU_Sbay_Contig521.4   233   STYSPVSSSSSNIWQEQGQTTLSNSSMSQNTRYCTSPVIQEESDNEIEEM   282
WashU_Scas_Contig567.6   250   LLNMKITDDVIEDTIPKNNQWYTNNNQNHDLDMIQDDDQNLDDDDEDADF   299
Symbols






::.. : :.: :* . .:: . : :.*:* ::



SGD_Scer_LEE1/YPL054W   290   LIHNFNSRYCHE----   301
MIT_Sbay_c678_25080   289   LIHDFNARCRQE----   300
MIT_Smik_c1102_22086   288   LIHDFNSRYCQE----   299
MIT_Spar_c79_23616   290   LIHNFNSRYCHE----   301
WashU_Sbay_Contig521.4   283   LIHDFNARCRQE----   294
WashU_Scas_Contig567.6   300   KYYSRETRIILDDMKS   315
Symbols






:. ::* :



Symbols:
* = identical
: = strong similarity
. = weak similarity


- Download all sequences in alignment, in FASTA format.
GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_LEE1/YPL054W:

SGD_Scer_LEE1/YPL054W  Length: 302  Sun Dec 11 01:55:25 2011  Type: P  Check: 30  ..

       1  MDAFENMSVS NHPGGNARRN SQSANEMLAS QIQDFQNIPR SFNDSNANVN

      51  LSKNCTVGNQ LPFSSRQQKI IMEHLLITKN NSQQQKDYSH VPCKFFKMGN

     101  CQAGSSCPFS HSPDIISSAN NLPCKYFAKG NCKFGNKCVN AHVLPNGFKM

     151  NSKEPIDITP PSQNNYLSHA RSASFSTYTS PPLSAQTEFS HSASNANYFS

     201  SQYLMYSPQK SPEALYTEFF SPPSSSSSYI NYSYNNSNIN AYSPVSSSSS

     251  NIWQEQGQTT LSNPSVNQNL RYRTGPAIQE ESDNEIEDLL IHNFNSRYCH

     301  E*

Protein Sequence for MIT_Sbay_c678_25080:

MIT_Sbay_c678_25080  Length: 301  Sun Dec 11 01:55:25 2011  Type: P  Check: 1448  ..

       1  MDAFDNMSVL NRPGSNARRS SQSSNEMFAP QIPDLQNIPR SFNNSNTTTN

      51  INLPRNQLPF SSHQQKIIME HLLITKNNTQ QQKDYSHVPC KFFKMGNCQA

     101  GPSCPFSHSP DIINSANNLP CKYFAKGNCK FGNKCVNAHI LPNGSRMNSK

     151  GPIEIAPSSN NNYFSHTRSA SFSTYMSPPM SANTDISHSA SSTNYFTPQY

     201  PLSPPQKGLD ALHSDFFSPP STSSSYVNYN YSNANSNAST YSPVSSSSSN

     251  IWQEQGQTTL SNSSMSQNTR YCTSPVIQEE SDNEIEEMLI HDFNARCRQE

     301  *

Protein Sequence for MIT_Smik_c1102_22086:

MIT_Smik_c1102_22086  Length: 300  Sun Dec 11 01:55:25 2011  Type: P  Check: 3548  ..

       1  MDAYDNMSVL NHPGSNARRS SQSANEIFAP QIQDFQNIPR SFNANNTKLN

      51  FPKNHNAANQ LPFSSHQQKI IMEHLLITKN NSQQQKDYSH VPCKFFKMGN

     101  CQAGTSCPFS HSPDIISSAN NLPCKYFAKG NCKFGNKCVN AHTLPNGFKM

     151  NGKDPIDIAS PSKNNYPSHT RSASFSTFMS PPLSVHTEFS NSASNANYFP

     201  SQYPMSSPQK SPGVLHTEFF SPPSSSSSYI NYNYNKITAY SPVSSSSSNI

     251  WQEQGQTTLS NPSMNQNLKY RTGPAIQEES DDEIEELLIH DFNSRYCQE*


Protein Sequence for MIT_Spar_c79_23616:

MIT_Spar_c79_23616  Length: 302  Sun Dec 11 01:55:25 2011  Type: P  Check: 8440  ..

       1  MDAFENMSVS NHPGSNARRS SQSAGEMFAP QIQDFQNIPR SFNNNNATVN

      51  LSKNYNAANQ LPFSSHQQKI IMEHLLITKN NSQQQKDYSH VPCKFFKMGN

     101  CQAGSSCPFS HSPDIISSAN NLPCKYFAKG NCKFGNKCVN AHVLPNGFKM

     151  NSREPIEITP PSQNNYLSHA RSASFSTYMS PPLSAQTEFS NSASNANHFS

     201  SQYLMSSPQK SPEALNTEFF SPPSSSSSYI NYNYNNSNLN AYSPVSSSSS

     251  NIWQEQGQTT LSNPSVNQNL RHRTGPAIQE ESDNEIEELL IHNFNSRYCH

     301  E*

Protein Sequence for WashU_Sbay_Contig521.4:

WashU_Sbay_Contig521.4  Length: 295  Sun Dec 11 01:55:25 2011  Type: P  Check: 5680  ..

       1  MSVLNRPGSN ARRSSQSSNE MFAPQIPDLQ NIPRSFNNSN TTTNINLPRN

      51  QLPFSSHQQK IIMEHLLITK NNTQQQKDYS HVPCKFFKMG NCQAGPSCPF

     101  SHSPDIINSA NNLPCKYFAK GNCKFGNKCV NAHILPNGSR MNSKGPIEIA

     151  PSSNNNYFSH TRSASFSTYM SPPMSANTDI SHSASSTNYF TPQYPLSPPQ

     201  KGLDALHSDF FSPPSTSSSY VNYNYSNANS NASTYSPVSS SSSNIWQEQG

     251  QTTLSNSSMS QNTRYCTSPV IQEESDNEIE EMLIHDFNAR CRQE*


Protein Sequence for WashU_Scas_Contig567.6:

WashU_Scas_Contig567.6  Length: 316  Sun Dec 11 01:55:25 2011  Type: P  Check: 8454  ..

       1  MMSHPTNTNN QTLNDEQKKL IIKHIEFTKQ QSMANHKNYS HVPCKFFKQG

      51  NCQAGNTCPF SHSLDINKAN STPCKYFKLG NCKFGSKCAN AHILPDGTII

     101  QYNNNNNRQR QNKFKHNNNQ PSPHTISQQY TNYLNLNNMN TQYYTSNAPI

     151  SDPTPLQQQQ HFNRSTSYSM ERNQTNYPLH PFVVFSSKAP PPSQTQTQNS

     201  ASTYNYTSSL YSALGDPSNT NTNNNYIYNA PTSNSNSNYN NINWSLSNNL

     251  LNMKITDDVI EDTIPKNNQW YTNNNQNHDL DMIQDDDQNL DDDDEDADFK

     301  YYSRETRIIL DDMKS*