Fungal Sequence Alignment Help



This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

The other fungal sequences currently shown come from Cliften et al. and Kellis et al.

ClustalW Protein Alignment and Sequence for YGL100W and Homologs




Symbols:
* = identical
: = strong similarity
. = weak similarity
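
As a rough illustration of how these symbols relate to the aligned columns, the sketch below assigns one symbol per column. It assumes the standard ClustalW "strong" and "weak" residue groups and leaves any column that contains a gap unmarked. The function names are hypothetical, and this is not the code that produced this page.

```python
# Residue groups ClustalW uses for ":" (strong) and "." (weak) columns.
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK",
                 "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = ["CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
               "SNDEQK", "NDEQHK", "NEQHRK", "FVLIM", "HFY"]


def column_symbol(column):
    """Return '*', ':', '.' or ' ' for one alignment column."""
    residues = set(column)
    if "-" in residues:
        return " "  # assumption: columns with gaps are left unmarked
    if len(residues) == 1:
        return "*"  # identical in every sequence
    if any(residues <= set(group) for group in STRONG_GROUPS):
        return ":"  # all residues fall within one strong group
    if any(residues <= set(group) for group in WEAK_GROUPS):
        return "."  # all residues fall within one weak group
    return " "


def consensus_line(rows):
    """Build the symbol line for equal-length aligned rows."""
    return "".join(column_symbol(col) for col in zip(*rows))


# Final alignment block on this page: seven rows read AQQ, one reads SNS.
rows = ["AQQ"] * 7 + ["SNS"]
print(consensus_line(rows))  # ::.
```

Checking the strong groups before the weak ones matters, because several weak groups are supersets of strong ones.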

SGD_Scer_SEH1/YGL100W   1   ---MQPFDSGHDDLVHDVVYDFYGRHVATCSSDQHIKVFKLDKDTSNWEL   47
MIT_Smik_c276_7953   1   ---MQPFDSGHDDLVHDVVYDFYGRHVATCSSDQHIKVFKLDKETSNWEL   47
MIT_Spar_c22_8716   1   ---MQPFDSGHDDLVHDVVYDFYGRHVATCSSDQHIKVFKLDKDTSNWEL   47
MIT_Suva_c392_8102   1   ---MQPFDSGHDDLVHDVVYDFYGRHVATCSSDQHIKVFKLDKETSNWEL   47
WashU_Sbay_Contig557.9   1   ---MQPFDSGHDDLVHDVVYDFYGRHVATCSSDQHIKVFKLDKETSNWEL   47
WashU_Scas_Contig652.16   1   MAGMKPFNSGHEDLIHDVVYDFYGRHVATCSSDQHIKVFKLDKETSEWEL   50
WashU_Skud_Contig1658.5   1   ---MQPFDSGHDDLVHDVVYDFYGRHVATCSSDQHIKVFKLDKETSNWEL   47
WashU_Smik_Contig2528.2   1   ---MQPFDSGHDDLVHDVVYDFYGRHVATCSSDQHIKVFKLDKETSNWEL   47
Symbols      *:**:***:**:****************************:**:***



SGD_Scer_SEH1/YGL100W   48   SDSWRAHDSSIVAIDWASPEYGRIIASASYDKTVKLWEEDPDQEECSGRR   97
MIT_Smik_c276_7953   48   SDSWRAHDSSIVSIDWASPEYGRIIASASYDKTVKLWEEDPDQEECSGRR   97
MIT_Spar_c22_8716   48   SDSWRAHDSSIVAIDWASPEYGRIIASASYDKTVKLWEEDPDQEECSGRR   97
MIT_Suva_c392_8102   48   SDSWRAHDSSIVAIDWASPEYGRIIASASYDKTVKLWEEDPDQEECSGRR   97
WashU_Sbay_Contig557.9   48   SDSWRAHDSSIVAIDWASPEYGRIIASASYDKTVKLWEEDPDQEECSGRR   97
WashU_Scas_Contig652.16   51   SDSWKAHDSSIVSVDWASPEYGRIIVSASYDKTVKLWEEDPDQPEGSGRR   100
WashU_Skud_Contig1658.5   48   SDSWRAHDSSIVAIDWASPEYGRIIASASYDKTVKLWEEDPDQEECSGRR   97
WashU_Smik_Contig2528.2   48   SDSWRAHDSSIVSIDWASPEYGRIIASASYDKTVKLWEEDPDQEECSGRR   97
Symbols   ****:*******::***********.***************** * ****



SGD_Scer_SEH1/YGL100W   98   WNKLCTLNDSKGSLYSVKFAPAHLGLKLACLGNDGILRLYDALEPSDLRS   147
MIT_Smik_c276_7953   98   WNKLCTLNDPKGSLYSVKFAPAHLGLKLACLGNDGILRIYDALEPSDLRS   147
MIT_Spar_c22_8716   98   WSKLCTLNDSKGSLYSVKFAPAHLGLKLACLGNDGILRLYDALEPSDLRS   147
MIT_Suva_c392_8102   98   WNRLCTLNDSKGSLYSVKFAPAHLGLKVACIGNDGTLRIYDALEPSDLRS   147
WashU_Sbay_Contig557.9   98   WNRLCTLNDSKGSLYSVKFAPAHLGLKVACIGNDGTLRIYDALEPSDLRS   147
WashU_Scas_Contig652.16   101   WTKLCTLNDSKGSLYTVKFAPPHLGLKLACIGNDATLRIYEALEPSDLRS   150
WashU_Skud_Contig1658.5   98   WNKLCTLNDSKGSLYSAKFAPAHLGLKLACLGNDGILRIYDALEPSDLRS   147
WashU_Smik_Contig2528.2   98   WNKLCTLNDPKGSLYSVKFAPAHLGLKLACLGNDGILRIYDALEPSDLRS   147
Symbols   *.:******.*****:.****.*****:**:***. **:*:*********



SGD_Scer_SEH1/YGL100W   148   WTLTSEMKVLSIPPANHLQSDFCLSWCPSRFSPEKLAVSALEQAIIYQRG   197
MIT_Smik_c276_7953   148   WTLTSEMRVLSIPPANHLQSDFCLSWCPSRFSPEKLAVSALEQAIIYQRG   197
MIT_Spar_c22_8716   148   WTLTSEMKVLSIPPANHLQSDFCLSWCPSRFSPEKLAVSALEQAIIYQRG   197
MIT_Suva_c392_8102   148   WTLTSEIKVLSIPPANHLQSDFCLSWCPSRFSPEKLAVSSLEQAIIYQRG   197
WashU_Sbay_Contig557.9   148   WTLTSEIKVLSIPPANHLQSDFCLSWCPSRFSPEKLAVSSLEQAIIYQRG   197
WashU_Scas_Contig652.16   151   WTLTSEVKVLPVPPANHLQSDFCIAWCPSRFSPEKLVVSTLDQASIYQRG   200
WashU_Skud_Contig1658.5   148   WTLTSEMKVLSIPPANHLQSDFCLSWCPSRFSPEKLAVSALEQAIIYQRG   197
WashU_Smik_Contig2528.2   148   WTLTSEMRVLSIPPANHLQSDFCLSWCPSRFSPEKLAVSALEQAIIYQRG   197
Symbols   ******::**.:***********::***********.**:*:** *****



SGD_Scer_SEH1/YGL100W   198   KDGKLHVAAKLPGHKSLIRSISWAPSIGRWYQLIATGCKDGRIRIFKITE   247
MIT_Smik_c276_7953   198   KDGKIHIAARLPGHKSLIRSISWAPSIGRWYQLIATGCKDGKIRIFKVTE   247
MIT_Spar_c22_8716   198   KDGKLHIAARLSGHKSLIRSISWAPSIGRWYQLIATGCKDGKIRIFKITE   247
MIT_Suva_c392_8102   198   KDGKLHIAARLPGHKSLIRSVSWAPSIGRWYQLIATGCKDGKIRIFKITE   247
WashU_Sbay_Contig557.9   198   KDGKLHIAARLPGHKSLIRSVSWAPSIGRWYQLIATGCKDGKIRIFKITE   247
WashU_Scas_Contig652.16   201   KDGKLYIVAKLNGHKGLIRDISWAPSIGRWYHLIATGCKDGKLRIFRLVE   250
WashU_Skud_Contig1658.5   198   KDGKLHVAARLPGHKSLIRSISWAPSIGRWYQLIATGCKDGKIRIFKITE   247
WashU_Smik_Contig2528.2   198   KDGKIHIAARLPGHKSLIRSISWAPSIGRWYQLIATGCKDGKIRIFKVTE   247
Symbols   ****:::.*:* ***.***.:**********:*********::***::.*



SGD_Scer_SEH1/YGL100W   248   KLS-PLASEESLTNSNMFDNSADVDMDAQGRSDSNTEEKAELQSNLQVEL   296
MIT_Smik_c276_7953   248   KLS-PLASEESSNNSKITDNGADIDMDAQGRSDSNTEEKSELQSSLKVEL   296
MIT_Spar_c22_8716   248   KLS-PLVSEESLTNSNIFDNSADVDMDAQDKSDPNTEEKSELQSNLKVEL   296
MIT_Suva_c392_8102   248   KLLGALTSEESSNNSNLFDNGTDVDMDGQVRPSSNNEEKGELQSSLEVEL   297
WashU_Sbay_Contig557.9   248   KLLGALTSEESSNNSNLFDNGTDVDMDGQVRPSSNNEEKGELQSSLEVEL   297
WashU_Scas_Contig652.16   251   KLS--------DNSSKDAINDSYDDEDVDMEDIAENKEKSLLGSSVSVEL   292
WashU_Skud_Contig1658.5   248   KLS-PLDSEESSNNSNLFDSGTDVDMDRQGRSDSNNEEKAELQSSLMVEL   296
WashU_Smik_Contig2528.2   248   KLS-PLASEESSNNSKITDNGADIDMDAQGRSDSNTEEKSELQSSLKVEL   296
Symbols   **          ..*:   ..:  * * : .  .:.:**. * *.: ***



SGD_Scer_SEH1/YGL100W   297   LSEHDDHNGEVWSVSWNLTGTILSSAGDDGKVRLWKATYSNEFKCMSVIT   346
MIT_Smik_c276_7953   297   LSEHDDHNGEVWSVSWNLTGTILSSAGDDGKVRLWKATYSNEFKCMSVIT   346
MIT_Spar_c22_8716   297   LSEHDDHNGEVWSVSWNLTGTILSSAGDDGKVRLWKATYSNEFKCMSVIT   346
MIT_Suva_c392_8102   298   LSEHDDHNGEVWSVSWNLTGTILSSAGDDGKVRLWKATYSNEFKCMSVIT   347
WashU_Sbay_Contig557.9   298   LSEHDDHNGEVWSVSWNLTGTILSSAGDDGKVRLWKATYSNEFKCMSVIT   347
WashU_Scas_Contig652.16   293   LSEHDDHNAEIWSVSWNLTGTILSSAGDDGKVRLWKSTYSNEFKCMSVIT   342
WashU_Skud_Contig1658.5   297   LSEHDDHNGEIWSVSWNLTGTILSSAGDDGKVRLWKATYSNEFKCMSVIT   346
WashU_Smik_Contig2528.2   297   LSEHDDHNGEVWSVSWNLTGTILSSAGDDGKVRLWKATYSNEFKCMSVIT   346
Symbols   ********.*:*************************:*************



SGD_Scer_SEH1/YGL100W   347   AQQ   349
MIT_Smik_c276_7953   347   AQQ   349
MIT_Spar_c22_8716   347   AQQ   349
MIT_Suva_c392_8102   348   AQQ   350
WashU_Sbay_Contig557.9   348   AQQ   350
WashU_Scas_Contig652.16   343   SNS   345
WashU_Skud_Contig1658.5   347   AQQ   349
WashU_Smik_Contig2528.2   347   AQQ   349
Symbols   ::.
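
Each alignment row above lists a sequence name, the position of its first residue in the block, the aligned residues (with "-" for gaps), and the position of its last residue. The sketch below shows one way to check that these coordinates are consistent, assuming whitespace-separated fields and that gaps do not advance the position count. The helper names are hypothetical, and a row made up entirely of gaps is not handled.

```python
def parse_alignment_row(line):
    """Split a row into (name, start, aligned residues, end)."""
    name, start, aligned, end = line.split()
    return name, int(start), aligned, int(end)


def coordinates_consistent(line):
    """True if end == start + (number of non-gap residues) - 1."""
    _, start, aligned, end = parse_alignment_row(line)
    ungapped = len(aligned.replace("-", ""))
    return start + ungapped - 1 == end


# Two rows copied from the alignment on this page.
scer = ("SGD_Scer_SEH1/YGL100W   1   "
        "---MQPFDSGHDDLVHDVVYDFYGRHVATCSSDQHIKVFKLDKDTSNWEL   47")
scas = ("WashU_Scas_Contig652.16   251   "
        "KLS--------DNSSKDAINDSYDDEDVDMEDIAENKEKSLLGSSVSVEL   292")

print(coordinates_consistent(scer))  # True
print(coordinates_consistent(scas))  # True
```

The gapped rows explain why the numbering differs between sequences: the S. castellii row covers only 42 residues in that block, while most other rows cover 49.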




GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_SEH1/YGL100W:

SGD_Scer_SEH1/YGL100W  Length: 350  Mon Nov  7 15:23:17 2016  Type: P  Check: 1289  ..

       1  MQPFDSGHDD LVHDVVYDFY GRHVATCSSD QHIKVFKLDK DTSNWELSDS

      51  WRAHDSSIVA IDWASPEYGR IIASASYDKT VKLWEEDPDQ EECSGRRWNK

     101  LCTLNDSKGS LYSVKFAPAH LGLKLACLGN DGILRLYDAL EPSDLRSWTL

     151  TSEMKVLSIP PANHLQSDFC LSWCPSRFSP EKLAVSALEQ AIIYQRGKDG

     201  KLHVAAKLPG HKSLIRSISW APSIGRWYQL IATGCKDGRI RIFKITEKLS

     251  PLASEESLTN SNMFDNSADV DMDAQGRSDS NTEEKAELQS NLQVELLSEH

     301  DDHNGEVWSV SWNLTGTILS SAGDDGKVRL WKATYSNEFK CMSVITAQQ*


Protein Sequence for MIT_Smik_c276_7953:

MIT_Smik_c276_7953  Length: 350  Mon Nov  7 15:23:17 2016  Type: P  Check: 601  ..

       1  MQPFDSGHDD LVHDVVYDFY GRHVATCSSD QHIKVFKLDK ETSNWELSDS

      51  WRAHDSSIVS IDWASPEYGR IIASASYDKT VKLWEEDPDQ EECSGRRWNK

     101  LCTLNDPKGS LYSVKFAPAH LGLKLACLGN DGILRIYDAL EPSDLRSWTL

     151  TSEMRVLSIP PANHLQSDFC LSWCPSRFSP EKLAVSALEQ AIIYQRGKDG

     201  KIHIAARLPG HKSLIRSISW APSIGRWYQL IATGCKDGKI RIFKVTEKLS

     251  PLASEESSNN SKITDNGADI DMDAQGRSDS NTEEKSELQS SLKVELLSEH

     301  DDHNGEVWSV SWNLTGTILS SAGDDGKVRL WKATYSNEFK CMSVITAQQ*


Protein Sequence for MIT_Spar_c22_8716:

MIT_Spar_c22_8716  Length: 350  Mon Nov  7 15:23:17 2016  Type: P  Check: 1071  ..

       1  MQPFDSGHDD LVHDVVYDFY GRHVATCSSD QHIKVFKLDK DTSNWELSDS

      51  WRAHDSSIVA IDWASPEYGR IIASASYDKT VKLWEEDPDQ EECSGRRWSK

     101  LCTLNDSKGS LYSVKFAPAH LGLKLACLGN DGILRLYDAL EPSDLRSWTL

     151  TSEMKVLSIP PANHLQSDFC LSWCPSRFSP EKLAVSALEQ AIIYQRGKDG

     201  KLHIAARLSG HKSLIRSISW APSIGRWYQL IATGCKDGKI RIFKITEKLS

     251  PLVSEESLTN SNIFDNSADV DMDAQDKSDP NTEEKSELQS NLKVELLSEH

     301  DDHNGEVWSV SWNLTGTILS SAGDDGKVRL WKATYSNEFK CMSVITAQQ*


Protein Sequence for MIT_Suva_c392_8102:

MIT_Suva_c392_8102  Length: 351  Mon Nov  7 15:23:17 2016  Type: P  Check: 4419  ..

       1  MQPFDSGHDD LVHDVVYDFY GRHVATCSSD QHIKVFKLDK ETSNWELSDS

      51  WRAHDSSIVA IDWASPEYGR IIASASYDKT VKLWEEDPDQ EECSGRRWNR

     101  LCTLNDSKGS LYSVKFAPAH LGLKVACIGN DGTLRIYDAL EPSDLRSWTL

     151  TSEIKVLSIP PANHLQSDFC LSWCPSRFSP EKLAVSSLEQ AIIYQRGKDG

     201  KLHIAARLPG HKSLIRSVSW APSIGRWYQL IATGCKDGKI RIFKITEKLL

     251  GALTSEESSN NSNLFDNGTD VDMDGQVRPS SNNEEKGELQ SSLEVELLSE

     301  HDDHNGEVWS VSWNLTGTIL SSAGDDGKVR LWKATYSNEF KCMSVITAQQ

     351  *

Protein Sequence for WashU_Sbay_Contig557.9:

WashU_Sbay_Contig557.9  Length: 351  Mon Nov  7 15:23:17 2016  Type: P  Check: 4419  ..

       1  MQPFDSGHDD LVHDVVYDFY GRHVATCSSD QHIKVFKLDK ETSNWELSDS

      51  WRAHDSSIVA IDWASPEYGR IIASASYDKT VKLWEEDPDQ EECSGRRWNR

     101  LCTLNDSKGS LYSVKFAPAH LGLKVACIGN DGTLRIYDAL EPSDLRSWTL

     151  TSEIKVLSIP PANHLQSDFC LSWCPSRFSP EKLAVSSLEQ AIIYQRGKDG

     201  KLHIAARLPG HKSLIRSVSW APSIGRWYQL IATGCKDGKI RIFKITEKLL

     251  GALTSEESSN NSNLFDNGTD VDMDGQVRPS SNNEEKGELQ SSLEVELLSE

     301  HDDHNGEVWS VSWNLTGTIL SSAGDDGKVR LWKATYSNEF KCMSVITAQQ

     351  *

Protein Sequence for WashU_Scas_Contig652.16:

WashU_Scas_Contig652.16  Length: 346  Mon Nov  7 15:23:17 2016  Type: P  Check: 8053  ..

       1  MAGMKPFNSG HEDLIHDVVY DFYGRHVATC SSDQHIKVFK LDKETSEWEL

      51  SDSWKAHDSS IVSVDWASPE YGRIIVSASY DKTVKLWEED PDQPEGSGRR

     101  WTKLCTLNDS KGSLYTVKFA PPHLGLKLAC IGNDATLRIY EALEPSDLRS

     151  WTLTSEVKVL PVPPANHLQS DFCIAWCPSR FSPEKLVVST LDQASIYQRG

     201  KDGKLYIVAK LNGHKGLIRD ISWAPSIGRW YHLIATGCKD GKLRIFRLVE

     251  KLSDNSSKDA INDSYDDEDV DMEDIAENKE KSLLGSSVSV ELLSEHDDHN

     301  AEIWSVSWNL TGTILSSAGD DGKVRLWKST YSNEFKCMSV ITSNS*


Protein Sequence for WashU_Skud_Contig1658.5:

WashU_Skud_Contig1658.5  Length: 350  Mon Nov  7 15:23:17 2016  Type: P  Check: 958  ..

       1  MQPFDSGHDD LVHDVVYDFY GRHVATCSSD QHIKVFKLDK ETSNWELSDS

      51  WRAHDSSIVA IDWASPEYGR IIASASYDKT VKLWEEDPDQ EECSGRRWNK

     101  LCTLNDSKGS LYSAKFAPAH LGLKLACLGN DGILRIYDAL EPSDLRSWTL

     151  TSEMKVLSIP PANHLQSDFC LSWCPSRFSP EKLAVSALEQ AIIYQRGKDG

     201  KLHVAARLPG HKSLIRSISW APSIGRWYQL IATGCKDGKI RIFKITEKLS

     251  PLDSEESSNN SNLFDSGTDV DMDRQGRSDS NNEEKAELQS SLMVELLSEH

     301  DDHNGEIWSV SWNLTGTILS SAGDDGKVRL WKATYSNEFK CMSVITAQQ*


Protein Sequence for WashU_Smik_Contig2528.2:

WashU_Smik_Contig2528.2  Length: 350  Mon Nov  7 15:23:17 2016  Type: P  Check: 601  ..

       1  MQPFDSGHDD LVHDVVYDFY GRHVATCSSD QHIKVFKLDK ETSNWELSDS

      51  WRAHDSSIVS IDWASPEYGR IIASASYDKT VKLWEEDPDQ EECSGRRWNK

     101  LCTLNDPKGS LYSVKFAPAH LGLKLACLGN DGILRIYDAL EPSDLRSWTL

     151  TSEMRVLSIP PANHLQSDFC LSWCPSRFSP EKLAVSALEQ AIIYQRGKDG

     201  KIHIAARLPG HKSLIRSISW APSIGRWYQL IATGCKDGKI RIFKVTEKLS

     251  PLASEESSNN SKITDNGADI DMDAQGRSDS NTEEKSELQS SLKVELLSEH

     301  DDHNGEVWSV SWNLTGTILS SAGDDGKVRL WKATYSNEFK CMSVITAQQ*
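
The GCG-style records above interleave position numbers and spaces with the residues, and each ends with a "*" terminator. The sketch below shows one way to recover the plain sequence and write it out as FASTA. It assumes the header ends at the ".." marker and that the declared Length counts the trailing "*", which is consistent with the records shown here. The function names are hypothetical.

```python
import re


def parse_gcg_record(text):
    """Return (name, declared length, residues incl. trailing '*')."""
    header, _, body = text.partition("..")
    name = header.split()[0]
    declared = int(re.search(r"Length:\s*(\d+)", header).group(1))
    # Keep only letters and '*'; digits and spaces are layout.
    residues = "".join(re.findall(r"[A-Za-z*]+", body))
    return name, declared, residues


def to_fasta(name, residues, width=60):
    """Format residues as FASTA, dropping the '*' terminator."""
    seq = residues.rstrip("*")
    lines = [seq[i:i + width] for i in range(0, len(seq), width)]
    return "\n".join([">" + name] + lines)


# First record from this page, copied verbatim.
record = """SGD_Scer_SEH1/YGL100W  Length: 350  Mon Nov  7 15:23:17 2016  Type: P  Check: 1289  ..

       1  MQPFDSGHDD LVHDVVYDFY GRHVATCSSD QHIKVFKLDK DTSNWELSDS
      51  WRAHDSSIVA IDWASPEYGR IIASASYDKT VKLWEEDPDQ EECSGRRWNK
     101  LCTLNDSKGS LYSVKFAPAH LGLKLACLGN DGILRLYDAL EPSDLRSWTL
     151  TSEMKVLSIP PANHLQSDFC LSWCPSRFSP EKLAVSALEQ AIIYQRGKDG
     201  KLHVAAKLPG HKSLIRSISW APSIGRWYQL IATGCKDGRI RIFKITEKLS
     251  PLASEESLTN SNMFDNSADV DMDAQGRSDS NTEEKAELQS NLQVELLSEH
     301  DDHNGEVWSV SWNLTGTILS SAGDDGKVRL WKATYSNEFK CMSVITAQQ*
"""

name, declared, residues = parse_gcg_record(record)
print(name, declared, len(residues))
print(to_fasta(name, residues))
```

Comparing the declared length with the number of characters parsed is a cheap check that no line of a record was lost when copying it.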