Fungal Sequence Alignment Help



This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

Currently, this page displays other fungal sequences from Cliften et al. and Kellis et al.

ClustalW Protein Alignment and Sequence for YOL142W and Homologs


Choose two or more sequences for alignment:
Best Hits & Orthologs

Pick a sequence type:

Select or unselect multiple options for sequence
name by pressing the Control (PC) or Command
(Mac) key while clicking.

Align selected sequences
selected sequences (FASTA format)


Symbols:
* = identical
: = strong similarity
. = weak similarity

SGD_Scer_RRP40/YOL142W   1   ------MSTFIFPGDSFPVDPTTP---VKLGPGIYCDPNTQEIRPVNTGV   41
MIT_Smik_c401_19854   1   ------MSTFTFPGDKLPVDPILP---VKLGPGIYCDPDDQEIRPVNTGI   41
MIT_Spar_c314_19785   1   ------MSTFIFPGDSLSVDPTVP---VKLGPGIYCDPNSQEIRPVNAGI   41
MIT_Suva_c58_22185   1   ------MSTFIFPGDALPVDSTVP---IKLGPGIYCDPNSQEVRPVNTGI   41
WashU_Sbay_Contig659.14   1   ------MSTFIFPGDALPVDSTVP---IKLGPGIYCDPNSQEVRPVNTGI   41
WashU_Scas_Contig589.1   1   ---MQPQTVVVLPGDKLPLSPSSSQSALTLGPGIYCNPQTHEIIPTQAGI   47
WashU_Sklu_Contig2319.4   1   MKHQKIMASLTIPSDSLHIDLSQP---ASIGPGVYCDPKTQELKPVNAGL   47
WashU_Skud_Contig1459.5   1   ------MSSFVFPGDKLPVDPTVP---IKLGPGIYCDPISQEVRPVNTGI   41
WashU_Smik_Contig2099.3   1   ------MSTFTFPGDKLPVDPILP---VKLGPGIYCDPDDQEIRPVNTGI   41
Symbols






: . :*.* : :. . .:***:**:* :*: *.::*:



SGD_Scer_RRP40/YOL142W   42   LHVSAKGKSGVQTAYIDYSSKRYIPSVNDFVIGVIIGTFS-DSYKVSLQN   90
MIT_Smik_c401_19854   42   LHVSTKGKSGVQAVYIDYSSRRYVPSVNDFVIGIITGTFS-DSYKVSLQN   90
MIT_Spar_c314_19785   42   LHVSTKGKSGVQAAYIDYSSKRYIPSVNDFVIGVITGTFS-DSYKVSLQN   90
MIT_Suva_c58_22185   42   LHVSTKGKGGAQAVYIDYSSKRYVPMVNDFVIGTITGTFS-DSYKVLLQN   90
WashU_Sbay_Contig659.14   42   LHVSTKGKGGAQAVYIDYSSKRYVPMVNDFVIGTITGTFS-DSYKVLLQN   90
WashU_Scas_Contig589.1   48   AHITH-PKPHRTQLYVDYDSKRYIPCVGDLVIGCIVGTQGGDSYRVSLSN   96
WashU_Sklu_Contig2319.4   48   EVISHTKKG--QSVYIDYNSKRYIPAVGDFVVGVITGTFS-DSYRVSLAD   94
WashU_Skud_Contig1459.5   42   LHVSTKGKGGVQAAYIDYFSKRYVPSVNDFVIGTISGTFS-DSYKVSLQD   90
WashU_Smik_Contig2099.3   42   LHVSTKGKSGVQAVCIDYSSRRYVPSVNDFVIGIITGTFS-DSYKVSLQN   90
Symbols






:: * :** *:**:* *.*:*:* * ** . ***:* * :



SGD_Scer_RRP40/YOL142W   91   FSSSVSLSYMAFPNASKKNRPTLQVGDLVYARVCTAEKELEAEIECFDST   140
MIT_Smik_c401_19854   91   FSSSVSLSYMAFPNASKKNRPTLQVGDLVYARVCTAEKELEAEIECFDST   140
MIT_Spar_c314_19785   91   FSSSVSLSYMAFPNASKKNRPTLQVGDLVYARVCTAEKELEAEIECFDST   140
MIT_Suva_c58_22185   91   FSSSVSLSYMAFPNASKKNRPTLQVGDLVYARVCTAEKELEAEIECVDSA   140
WashU_Sbay_Contig659.14   91   FSSSVSLSYMAFPNASKKNRPTLQVGDLVYARVCTAEKELEAEIECVDSA   140
WashU_Scas_Contig589.1   97   FSNDVSLPYMAFPNASKKNRPTLVKGDLVYAKVASAEKELEATLECLD--   144
WashU_Sklu_Contig2319.4   95   FSSSVTLSYMAFPNASKKNRPSLKIGDLVYARVCRAEKELEAEIECMDSV   144
WashU_Skud_Contig1459.5   91   FSPSVSLSYMAFPNASKKNRPTLQVGDLVYARVCTAEKELEAEIECFDST   140
WashU_Smik_Contig2099.3   91   FSSSVSLSYMAFPNASKKNRPTLQVGDLVYARVCTAEKELEAEIECFDST   140
Symbols






** .*:*.*************:* ******:*. ******* :**.*



SGD_Scer_RRP40/YOL142W   141   TGRDAGFGILEDGMIIDVNLNFARQLLFN--------NDFPLLKVLAAHT   182
MIT_Smik_c401_19854   141   TGRDAGFGLLEDGMTIDVSLNFARQLLFN--------NDFPLLKVLASHT   182
MIT_Spar_c314_19785   141   TGRDAGFGLLEDGMIIDVNLNFARQLLFN--------NDFPLLKVLAAHA   182
MIT_Suva_c58_22185   141   TGRDAGFGLLEDGMIIDVNLNFTRQLLFN--------KDFPLLKVLAAHT   182
WashU_Sbay_Contig659.14   141   TGRDAGFGLLEDGMIIDVNLNFTRQLLFN--------KDFPLLKVLAAHT   182
WashU_Scas_Contig589.1   145   ----PGFGILEDGMVLDCPLRFARMLLFEGDDKQKKEGTSSLLRLLARHT   190
WashU_Sklu_Contig2319.4   145   TGKDAGFGLLDGGMVINVSLAFARELLFN--------NEFSLLQILAEYT   186
WashU_Skud_Contig1459.5   141   TGRDAGFGQLEDGTIIDVNLNFARQLLFN--------NDFPLLKVLAAHT   182
WashU_Smik_Contig2099.3   141   TGRDAGFGLLEDGMTIDVSLNFARQLLFN--------NDFPLLKVLASHT   182
Symbols






.*** *:.* :: * *:* ***: .**::** ::



SGD_Scer_RRP40/YOL142W   183   KFEVAIGLNGKIWVKCEELSNTLACYRTIMECCQKNDTAAFKDIAKRQFK   232
MIT_Smik_c401_19854   183   KYEIAIGLNGKIWVKCDDLSNTLACYRTIMESCQKNDMTTFKDIAKKQFE   232
MIT_Spar_c314_19785   183   KFEIAIGLNGKIWVKCEELSNTLACYRTITECCQRNDAAAFKDIAKKQFK   232
MIT_Suva_c58_22185   183   KFEIAIGLNGKIWVKCDELSNTLACYRTILECCQKHDVATFKDVAKRQFE   232
WashU_Sbay_Contig659.14   183   KFEIAIGLNGKIWVKCDELSNTLACYRTILECCQKHDVATFKDVAKRQFE   232
WashU_Scas_Contig589.1   191   KFEIAIGINGKIWIKCDTVSDTLACYRIVLECCK-SPVSQYDSIVKRHFK   239
WashU_Sklu_Contig2319.4   187   QFEVAIGVNGKVWIRSEEVKNTLACYRSIMQCQQ-SPASEFKNIIKKYFR   235
WashU_Skud_Contig1459.5   183   KFEVAIGLNGKIWVKCKEVSNTLACYRTIKECCEKNDTATFKDIAKKQFE   232
WashU_Smik_Contig2099.3   183   KYEIAIGLNGKIWVKCDDLSNTLACYRTIMESCEKNDMTTFKDIAKKQFE   232
Symbols






::*:***:***:*::.. :.:****** : :. : : :..: *: *.



SGD_Scer_RRP40/YOL142W   233   EILTVKEE   240
MIT_Smik_c401_19854   233   EVLTVKEE   240
MIT_Spar_c314_19785   233   EVLTVKEE   240
MIT_Suva_c58_22185   233   EVLSVKEE   240
WashU_Sbay_Contig659.14   233   EVLSVKEE   240
WashU_Scas_Contig589.1   240   EVAKVVEE   247
WashU_Sklu_Contig2319.4   236   EVTNFSE-   242
WashU_Skud_Contig1459.5   233   EVLNVNEE   240
WashU_Smik_Contig2099.3   233   EVLTVKEE   240
Symbols






*: .. *



Symbols:
* = identical
: = strong similarity
. = weak similarity


- Download all sequences in alignment, in FASTA format.
GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_RRP40/YOL142W:

SGD_Scer_RRP40/YOL142W  Length: 241  Mon Nov  7 16:34:51 2016  Type: P  Check: 8800  ..

       1  MSTFIFPGDS FPVDPTTPVK LGPGIYCDPN TQEIRPVNTG VLHVSAKGKS

      51  GVQTAYIDYS SKRYIPSVND FVIGVIIGTF SDSYKVSLQN FSSSVSLSYM

     101  AFPNASKKNR PTLQVGDLVY ARVCTAEKEL EAEIECFDST TGRDAGFGIL

     151  EDGMIIDVNL NFARQLLFNN DFPLLKVLAA HTKFEVAIGL NGKIWVKCEE

     201  LSNTLACYRT IMECCQKNDT AAFKDIAKRQ FKEILTVKEE *


Protein Sequence for MIT_Smik_c401_19854:

MIT_Smik_c401_19854  Length: 241  Mon Nov  7 16:34:51 2016  Type: P  Check: 1577  ..

       1  MSTFTFPGDK LPVDPILPVK LGPGIYCDPD DQEIRPVNTG ILHVSTKGKS

      51  GVQAVYIDYS SRRYVPSVND FVIGIITGTF SDSYKVSLQN FSSSVSLSYM

     101  AFPNASKKNR PTLQVGDLVY ARVCTAEKEL EAEIECFDST TGRDAGFGLL

     151  EDGMTIDVSL NFARQLLFNN DFPLLKVLAS HTKYEIAIGL NGKIWVKCDD

     201  LSNTLACYRT IMESCQKNDM TTFKDIAKKQ FEEVLTVKEE *


Protein Sequence for MIT_Spar_c314_19785:

MIT_Spar_c314_19785  Length: 241  Mon Nov  7 16:34:51 2016  Type: P  Check: 7149  ..

       1  MSTFIFPGDS LSVDPTVPVK LGPGIYCDPN SQEIRPVNAG ILHVSTKGKS

      51  GVQAAYIDYS SKRYIPSVND FVIGVITGTF SDSYKVSLQN FSSSVSLSYM

     101  AFPNASKKNR PTLQVGDLVY ARVCTAEKEL EAEIECFDST TGRDAGFGLL

     151  EDGMIIDVNL NFARQLLFNN DFPLLKVLAA HAKFEIAIGL NGKIWVKCEE

     201  LSNTLACYRT ITECCQRNDA AAFKDIAKKQ FKEVLTVKEE *


Protein Sequence for MIT_Suva_c58_22185:

MIT_Suva_c58_22185  Length: 241  Mon Nov  7 16:34:51 2016  Type: P  Check: 9749  ..

       1  MSTFIFPGDA LPVDSTVPIK LGPGIYCDPN SQEVRPVNTG ILHVSTKGKG

      51  GAQAVYIDYS SKRYVPMVND FVIGTITGTF SDSYKVLLQN FSSSVSLSYM

     101  AFPNASKKNR PTLQVGDLVY ARVCTAEKEL EAEIECVDSA TGRDAGFGLL

     151  EDGMIIDVNL NFTRQLLFNK DFPLLKVLAA HTKFEIAIGL NGKIWVKCDE

     201  LSNTLACYRT ILECCQKHDV ATFKDVAKRQ FEEVLSVKEE *


Protein Sequence for WashU_Sbay_Contig659.14:

WashU_Sbay_Contig659.14  Length: 241  Mon Nov  7 16:34:51 2016  Type: P  Check: 9749  ..

       1  MSTFIFPGDA LPVDSTVPIK LGPGIYCDPN SQEVRPVNTG ILHVSTKGKG

      51  GAQAVYIDYS SKRYVPMVND FVIGTITGTF SDSYKVLLQN FSSSVSLSYM

     101  AFPNASKKNR PTLQVGDLVY ARVCTAEKEL EAEIECVDSA TGRDAGFGLL

     151  EDGMIIDVNL NFTRQLLFNK DFPLLKVLAA HTKFEIAIGL NGKIWVKCDE

     201  LSNTLACYRT ILECCQKHDV ATFKDVAKRQ FEEVLSVKEE *


Protein Sequence for WashU_Scas_Contig589.1:

WashU_Scas_Contig589.1  Length: 248  Mon Nov  7 16:34:51 2016  Type: P  Check: 6556  ..

       1  MQPQTVVVLP GDKLPLSPSS SQSALTLGPG IYCNPQTHEI IPTQAGIAHI

      51  THPKPHRTQL YVDYDSKRYI PCVGDLVIGC IVGTQGGDSY RVSLSNFSND

     101  VSLPYMAFPN ASKKNRPTLV KGDLVYAKVA SAEKELEATL ECLDPGFGIL

     151  EDGMVLDCPL RFARMLLFEG DDKQKKEGTS SLLRLLARHT KFEIAIGING

     201  KIWIKCDTVS DTLACYRIVL ECCKSPVSQY DSIVKRHFKE VAKVVEE*


Protein Sequence for WashU_Sklu_Contig2319.4:

WashU_Sklu_Contig2319.4  Length: 243  Mon Nov  7 16:34:51 2016  Type: P  Check: 3172  ..

       1  MKHQKIMASL TIPSDSLHID LSQPASIGPG VYCDPKTQEL KPVNAGLEVI

      51  SHTKKGQSVY IDYNSKRYIP AVGDFVVGVI TGTFSDSYRV SLADFSSSVT

     101  LSYMAFPNAS KKNRPSLKIG DLVYARVCRA EKELEAEIEC MDSVTGKDAG

     151  FGLLDGGMVI NVSLAFAREL LFNNEFSLLQ ILAEYTQFEV AIGVNGKVWI

     201  RSEEVKNTLA CYRSIMQCQQ SPASEFKNII KKYFREVTNF SE*


Protein Sequence for WashU_Skud_Contig1459.5:

WashU_Skud_Contig1459.5  Length: 241  Mon Nov  7 16:34:51 2016  Type: P  Check: 8806  ..

       1  MSSFVFPGDK LPVDPTVPIK LGPGIYCDPI SQEVRPVNTG ILHVSTKGKG

      51  GVQAAYIDYF SKRYVPSVND FVIGTISGTF SDSYKVSLQD FSPSVSLSYM

     101  AFPNASKKNR PTLQVGDLVY ARVCTAEKEL EAEIECFDST TGRDAGFGQL

     151  EDGTIIDVNL NFARQLLFNN DFPLLKVLAA HTKFEVAIGL NGKIWVKCKE

     201  VSNTLACYRT IKECCEKNDT ATFKDIAKKQ FEEVLNVNEE *


Protein Sequence for WashU_Smik_Contig2099.3:

WashU_Smik_Contig2099.3  Length: 241  Mon Nov  7 16:34:51 2016  Type: P  Check: 9805  ..

       1  MSTFTFPGDK LPVDPILPVK LGPGIYCDPD DQEIRPVNTG ILHVSTKGKS

      51  GVQAVCIDYS SRRYVPSVND FVIGIITGTF SDSYKVSLQN FSSSVSLSYM

     101  AFPNASKKNR PTLQVGDLVY ARVCTAEKEL EAEIECFDST TGRDAGFGLL

     151  EDGMTIDVSL NFARQLLFNN DFPLLKVLAS HTKYEIAIGL NGKIWVKCDD

     201  LSNTLACYRT IMESCEKNDM TTFKDIAKKQ FEEVLTVKEE *