Fungal Sequence Alignment

This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

Currently, the other fungal sequences shown on this page come from Cliften et al. and Kellis et al. We will soon include sequences from additional fungal genomes drawn from a variety of sources.

ClustalW Protein Alignment and Sequence for YGR208W and Homologs


Symbols:
* = identical
: = strong similarity
. = weak similarity
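
The legend above can be illustrated with a minimal sketch of how a consensus symbol might be derived for one alignment column. It assumes the page uses ClustalW's default "strong" and "weak" residue groups (listed here from the ClustalW documentation as best recalled) and that any column containing a gap gets a blank; the names `column_symbol` and `consensus_line` are hypothetical helpers, not part of the site.

```python
# Residue groups assumed to match ClustalW's defaults (an assumption, not
# something stated on this page).
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK",
                 "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = ["CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
               "SNDEQK", "NDEQHK", "NEQHRK", "FVLIM", "HFY"]


def column_symbol(column):
    """Return '*', ':', '.', or ' ' for one column of aligned residues."""
    residues = set(column.upper())
    if "-" in residues:
        return " "            # gapped columns are left unmarked
    if len(residues) == 1:
        return "*"            # identical in every sequence
    if any(residues <= set(group) for group in STRONG_GROUPS):
        return ":"            # all residues fall in one strong group
    if any(residues <= set(group) for group in WEAK_GROUPS):
        return "."            # all residues fall in one weak group
    return " "


def consensus_line(aligned):
    """Build the symbol line for a list of equal-length aligned sequences."""
    return "".join(column_symbol("".join(col)) for col in zip(*aligned))


# Four residues (alignment columns 7-10) taken from four of the rows below.
print(repr(consensus_line(["FVIT", "FVVT", "IVIT", "YVIT"])))
```

In this toy input the second and fourth columns are identical, the third mixes I and V (one strong group), and the first mixes F, I, and Y, which share no single group.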

SGD_Scer_SER2/YGR208W   1   ---MSKFVITCIAHGENLPKETIDQIAKEITESSAKDVSINGTKKLSARA   47
MIT_Sbay_c699_9759   1   ---MTKFVVTCIAHGENLPQDAIDQIAKEIVKSSSQKVTVKSTKKLSARA   47
MIT_Smik_c993_9351   1   ---MSRFVITCIAHGESLPQETIDQLANEIIECSKKEISISSTKKLSTRA   47
MIT_Spar_c290_9140   1   ---MSNYVLTCIAHGENLSKETVDQIVKEVVESSEKEISINSTKKLSARA   47
WashU_Sbay_Contig653.12   1   ---MTKFVVTCIAHGENLPQDAIDQIAKEIVKSSSQKVTVKSTKKLSARA   47
WashU_Scas_Contig463.3   1   MTNSEKIVITLIAHGSKLSAELVQQLSRDIEN--TLRCQIKDTKPLSDRA   48
WashU_Sklu_Contig786.1   1   -MTTSPYVITAISHSATFPKG-FQEGFLQFLNQGHEQLTLDSHNTLSTRA   48
WashU_Skud_Contig1744.5   1   ---MSKFVITCIAHGKNLPQETIDQIAKEIIKSSTKETSIKNIKKLSTRA   47
WashU_Smik_Contig2850.8   1   ---MSRFVITCIAHGESLPQETIDQLANEIIECSKKEISISSTKKLSTRA   47
Symbols   *:* *:*. .:. .:: :. : :.. : ** **

SGD_Scer_SER2/YGR208W   48   TDIFIEVAGSIVQKDLKNKLTNVIDS-HNDVDVIVSVDNEYRQAKKLFVF   96
MIT_Sbay_c699_9759   48   TDLFVEVAGSIVQKDFKNELIGIIEG-QDDVDVIVSADNEYRQAKKLFVF   96
MIT_Smik_c993_9351   48   TDIYVEVAESVSQNNLKNELMGMIES-HSDVDVIISADNEYRQAKKLFVF   96
MIT_Spar_c290_9140   48   TDIFLKVRGPIVQKDLKNELMNVIDS-YDDVDVIVSVDNEYRQAKKLFVF   96
WashU_Sbay_Contig653.12   48   TDLFVEVAGSIVQKDFKNELIGIIEG-QDDVDVIVSADNEYRQAKKLFVF   96
WashU_Scas_Contig463.3   49   LDIYLEDVVSSTVEETRGLLSPFIEANSEIVDIIVQRNDQYRKNKKVVVF   98
WashU_Sklu_Contig786.1   49   IDYFVGAPSLDEVKATVAEYT-----NHQEVDLVFQKNDEFRRNKKLVVF   93
WashU_Skud_Contig1744.5   48   TDIFVEVAESVLQKTFRNELMCLIEG-RDDVDLIVSADNEYRQAKKLFVF   96
WashU_Smik_Contig2850.8   48   TDIYVEVAESVSQNNLKNELMGMIES-HSDVDVIISADNEYRQAKKLFVF   96
Symbols   * :: : . **::.. ::::*: **:.**

SGD_Scer_SER2/YGR208W   97   DMDSTLIYQEVIELIAAYAGVEEQVHEITERAMNNELDFKESLRERVKLL   146
MIT_Sbay_c699_9759   97   DMDSTLIYQEVIELIAAYAGVEEQVHEITERAMNNELDFKESLRERVKLL   146
MIT_Smik_c993_9351   97   DMDSTLIYQEVIELIAAYAGVEKQVHEITERAMNNELDFKESLRERVKLL   146
MIT_Spar_c290_9140   97   DMDSTLIYQEVIELIAAYAGVEEQVHAITERAMNNELDFKESLRERVKLL   146
WashU_Sbay_Contig653.12   97   DMDSTLIYQEVIELIAAYAGVEEQVHEITERAMNNELDFKESLRERVKLL   146
WashU_Scas_Contig463.3   99   DMDSTLIYQEVIELIAAYADVEPQVKAITDRAMNNEIDFKESLRERVALL   148
WashU_Sklu_Contig786.1   94   DMDSTLIYQEVIELIAAYAGVEDKVAEITNRAMNNELDFVQSLQARVALL   143
WashU_Skud_Contig1744.5   97   DMDSTLIYQEVIELIAAYAGVEKQVHDITERAMNNELDFKESLRERVKLL   146
WashU_Smik_Contig2850.8   97   DMDSTLIYQEVIELIAAYASVEKQVHEITERAMNNELDFKESLRERVKLL   146
Symbols   *******************.** :* **:******:** :**: ** **

SGD_Scer_SER2/YGR208W   147   QGLQVDTLYDEIKQKLEVTKGVPELCKFLHKKNCKLAVLSGGFIQFAGFI   196
MIT_Sbay_c699_9759   147   KGLQIDTLYDEIKQKLEITKGVPELCKFLHDKGCKLAVLSGGFIQFASFI   196
MIT_Smik_c993_9351   147   QGLQIDTLYDEIKQKLEITKGVPELCKFLHAKNCKLAVLSGGFIQFASFI   196
MIT_Spar_c290_9140   147   KGLQIDTLYDEIKQKLVITKGVPELCKFLHDKNCKLAVLSGGFIQFASFI   196
WashU_Sbay_Contig653.12   147   KGLQIDTLYDEIKQKLEITKGVPELCKFLHDKGCKLAVLSGGFIQFASFI   196
WashU_Scas_Contig463.3   149   EGLKIDTLYDEIKGKLKITKGVHELCKVLSAEGSKLAVLSGGFIQFASFI   198
WashU_Sklu_Contig786.1   144   KGIKTKTLYDEIKDKLLVTEGVPELTKGLGRTGCKLAVLSGGFTPFANYM   193
WashU_Skud_Contig1744.5   147   KGLQIDTLYDEIKQKLEITKGVPELCKFLHDKDCKLAVLSGGFIQFASFI   196
WashU_Smik_Contig2850.8   147   QGLQIDTLYDEIKQKLEITKGVPELCKFLHAKNCKLAVLSGGFIQFASFI   196
Symbols   :*:: .******* ** :*:** ** * * ..********* **.::

SGD_Scer_SER2/YGR208W   197   KDQLGLDFCKANLLEVDTDG-----KLTGKTLGPIVDGQCKSETLLQLCN   241
MIT_Sbay_c699_9759   197   KDQLNLDFCKANLLEVDAEG-----KLTGKTLGATVDGQCKSETLLQLCT   241
MIT_Smik_c993_9351   197   KNQLGLDFCKANLLEVDAEG-----KLTGKTLGPIVDGECKSETLLQLCK   241
MIT_Spar_c290_9140   197   KDQLRLDFCKANLLEVDADG-----KLTGKTLGPIVDGQCKSETLLQLCN   241
WashU_Sbay_Contig653.12   197   KDQLNLDFCKANLLEVDAEG-----KLTGKTLGATVDGQCKSETLLQLCT   241
WashU_Scas_Contig463.3   199   AKELKFDVAKANTLEMDTEG-----KLTGKVLGDIVDGQCKAETLLELCE   243
WashU_Sklu_Contig786.1   194   KEILHLDYARANFLATEIDPTTGEEVLSGHTIGDIVDGQCKAKTLLQLAK   243
WashU_Skud_Contig1744.5   197   KDQLNLDFCKANLLEVGAGG-----KLTGKTLGPIVDGQCKSETLLQLCN   241
WashU_Smik_Contig2850.8   197   KNQLGLDFCKANLLEVDAEG-----KLTGKTLGPIVDGECKSETLLQLCK   241
Symbols   . * :* .:** * *:*:.:* ***:**::***:*.

SGD_Scer_SER2/YGR208W   242   DYNVPVEASCMVGDGGNDLPAMATAGFGIAWNAKPKVQKAAPCKLNTKSM   291
MIT_Sbay_c699_9759   242   EFKVPVESSCMVGDGGNDLPAMGTAGFGIAWNAKPKVQKAAPCKLNTKSM   291
MIT_Smik_c993_9351   242   DYKVPVESSCMVGDGGNDLPAMATAGFGIAWNAKPKVQKAAPCKLNTKSM   291
MIT_Spar_c290_9140   242   DYKVPVEASCMVGDGGNDLPAMATAGFGIAWNAKPKVQKAAPCKLNTKSM   291
WashU_Sbay_Contig653.12   242   EFKVPVESSCMVGDGGNDLPAMGTAGFGIAWNAKPKVQKAAPCKLNTKSM   291
WashU_Scas_Contig463.3   244   KYQCPVEASCMVGDGGNDLPAMSVAGFGIAWNAKPRVQKAAPCRLNTDSL   293
WashU_Sklu_Contig786.1   244   EYDVPVESTVMVGDGGNDLPAMGVAGFGIAWNAKPKVQEAAPCKLNTKSL   293
WashU_Skud_Contig1744.5   242   DYTVPVEASCMVGDGGNDLPAMAAAGFGIAWNAKPKVQNAAPCRLNTESM   291
WashU_Smik_Contig2850.8   242   DYKVPVESSCMVGDGGNDLPAMATAGFGIAWNAKPKVQKAAPCKLNTKSM   291
Symbols   .: ***:: ************..***********:**:****:***.*:

SGD_Scer_SER2/YGR208W   292   TDILYILGYTDDEIYNRQ----   309
MIT_Sbay_c699_9759   292   TDILYILGYTDDEIYGRQ----   309
MIT_Smik_c993_9351   292   KDILYILGYTDDEIYNKQ----   309
MIT_Spar_c290_9140   292   TDILYILGYTDEEIYDRQ----   309
WashU_Sbay_Contig653.12   292   TDILYILGYTDDEIYGRQ----   309
WashU_Scas_Contig463.3   294   RDALYIFGYTDSEIESIISKGN   315
WashU_Sklu_Contig786.1   294   RDAFYVFGYADDEIVQLLK---   312
WashU_Skud_Contig1744.5   292   TDILYILGYTDDEIYDRQ----   309
WashU_Smik_Contig2850.8   292   KDILYILGYTDDEIYNKQ----   309
Symbols   * :*::**:*.**

GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_SER2/YGR208W:

SGD_Scer_SER2/YGR208W  Length: 310  Sat Dec 10 11:53:00 2011  Type: P  Check: 6509  ..

       1  MSKFVITCIA HGENLPKETI DQIAKEITES SAKDVSINGT KKLSARATDI

      51  FIEVAGSIVQ KDLKNKLTNV IDSHNDVDVI VSVDNEYRQA KKLFVFDMDS

     101  TLIYQEVIEL IAAYAGVEEQ VHEITERAMN NELDFKESLR ERVKLLQGLQ

     151  VDTLYDEIKQ KLEVTKGVPE LCKFLHKKNC KLAVLSGGFI QFAGFIKDQL

     201  GLDFCKANLL EVDTDGKLTG KTLGPIVDGQ CKSETLLQLC NDYNVPVEAS

     251  CMVGDGGNDL PAMATAGFGI AWNAKPKVQK AAPCKLNTKS MTDILYILGY

     301  TDDEIYNRQ*

Protein Sequence for MIT_Sbay_c699_9759:

MIT_Sbay_c699_9759  Length: 310  Sat Dec 10 11:53:00 2011  Type: P  Check: 6317  ..

       1  MTKFVVTCIA HGENLPQDAI DQIAKEIVKS SSQKVTVKST KKLSARATDL

      51  FVEVAGSIVQ KDFKNELIGI IEGQDDVDVI VSADNEYRQA KKLFVFDMDS

     101  TLIYQEVIEL IAAYAGVEEQ VHEITERAMN NELDFKESLR ERVKLLKGLQ

     151  IDTLYDEIKQ KLEITKGVPE LCKFLHDKGC KLAVLSGGFI QFASFIKDQL

     201  NLDFCKANLL EVDAEGKLTG KTLGATVDGQ CKSETLLQLC TEFKVPVESS

     251  CMVGDGGNDL PAMGTAGFGI AWNAKPKVQK AAPCKLNTKS MTDILYILGY

     301  TDDEIYGRQ*

Protein Sequence for MIT_Smik_c993_9351:

MIT_Smik_c993_9351  Length: 310  Sat Dec 10 11:53:00 2011  Type: P  Check: 6642  ..

       1  MSRFVITCIA HGESLPQETI DQLANEIIEC SKKEISISST KKLSTRATDI

      51  YVEVAESVSQ NNLKNELMGM IESHSDVDVI ISADNEYRQA KKLFVFDMDS

     101  TLIYQEVIEL IAAYAGVEKQ VHEITERAMN NELDFKESLR ERVKLLQGLQ

     151  IDTLYDEIKQ KLEITKGVPE LCKFLHAKNC KLAVLSGGFI QFASFIKNQL

     201  GLDFCKANLL EVDAEGKLTG KTLGPIVDGE CKSETLLQLC KDYKVPVESS

     251  CMVGDGGNDL PAMATAGFGI AWNAKPKVQK AAPCKLNTKS MKDILYILGY

     301  TDDEIYNKQ*

Protein Sequence for MIT_Spar_c290_9140:

MIT_Spar_c290_9140  Length: 310  Sat Dec 10 11:53:00 2011  Type: P  Check: 8187  ..

       1  MSNYVLTCIA HGENLSKETV DQIVKEVVES SEKEISINST KKLSARATDI

      51  FLKVRGPIVQ KDLKNELMNV IDSYDDVDVI VSVDNEYRQA KKLFVFDMDS

     101  TLIYQEVIEL IAAYAGVEEQ VHAITERAMN NELDFKESLR ERVKLLKGLQ

     151  IDTLYDEIKQ KLVITKGVPE LCKFLHDKNC KLAVLSGGFI QFASFIKDQL

     201  RLDFCKANLL EVDADGKLTG KTLGPIVDGQ CKSETLLQLC NDYKVPVEAS

     251  CMVGDGGNDL PAMATAGFGI AWNAKPKVQK AAPCKLNTKS MTDILYILGY

     301  TDEEIYDRQ*

Protein Sequence for WashU_Sbay_Contig653.12:

WashU_Sbay_Contig653.12  Length: 310  Sat Dec 10 11:53:00 2011  Type: P  Check: 6317  ..

       1  MTKFVVTCIA HGENLPQDAI DQIAKEIVKS SSQKVTVKST KKLSARATDL

      51  FVEVAGSIVQ KDFKNELIGI IEGQDDVDVI VSADNEYRQA KKLFVFDMDS

     101  TLIYQEVIEL IAAYAGVEEQ VHEITERAMN NELDFKESLR ERVKLLKGLQ

     151  IDTLYDEIKQ KLEITKGVPE LCKFLHDKGC KLAVLSGGFI QFASFIKDQL

     201  NLDFCKANLL EVDAEGKLTG KTLGATVDGQ CKSETLLQLC TEFKVPVESS

     251  CMVGDGGNDL PAMGTAGFGI AWNAKPKVQK AAPCKLNTKS MTDILYILGY

     301  TDDEIYGRQ*

Protein Sequence for WashU_Scas_Contig463.3:

WashU_Scas_Contig463.3  Length: 316  Sat Dec 10 11:53:00 2011  Type: P  Check: 13  ..

       1  MTNSEKIVIT LIAHGSKLSA ELVQQLSRDI ENTLRCQIKD TKPLSDRALD

      51  IYLEDVVSST VEETRGLLSP FIEANSEIVD IIVQRNDQYR KNKKVVVFDM

     101  DSTLIYQEVI ELIAAYADVE PQVKAITDRA MNNEIDFKES LRERVALLEG

     151  LKIDTLYDEI KGKLKITKGV HELCKVLSAE GSKLAVLSGG FIQFASFIAK

     201  ELKFDVAKAN TLEMDTEGKL TGKVLGDIVD GQCKAETLLE LCEKYQCPVE

     251  ASCMVGDGGN DLPAMSVAGF GIAWNAKPRV QKAAPCRLNT DSLRDALYIF

     301  GYTDSEIESI ISKGN*

Protein Sequence for WashU_Sklu_Contig786.1:

WashU_Sklu_Contig786.1  Length: 313  Sat Dec 10 11:53:00 2011  Type: P  Check: 4725  ..

       1  MTTSPYVITA ISHSATFPKG FQEGFLQFLN QGHEQLTLDS HNTLSTRAID

      51  YFVGAPSLDE VKATVAEYTN HQEVDLVFQK NDEFRRNKKL VVFDMDSTLI

     101  YQEVIELIAA YAGVEDKVAE ITNRAMNNEL DFVQSLQARV ALLKGIKTKT

     151  LYDEIKDKLL VTEGVPELTK GLGRTGCKLA VLSGGFTPFA NYMKEILHLD

     201  YARANFLATE IDPTTGEEVL SGHTIGDIVD GQCKAKTLLQ LAKEYDVPVE

     251  STVMVGDGGN DLPAMGVAGF GIAWNAKPKV QEAAPCKLNT KSLRDAFYVF

     301  GYADDEIVQL LK*

Protein Sequence for WashU_Skud_Contig1744.5:

WashU_Skud_Contig1744.5  Length: 310  Sat Dec 10 11:53:00 2011  Type: P  Check: 5206  ..

       1  MSKFVITCIA HGKNLPQETI DQIAKEIIKS STKETSIKNI KKLSTRATDI

      51  FVEVAESVLQ KTFRNELMCL IEGRDDVDLI VSADNEYRQA KKLFVFDMDS

     101  TLIYQEVIEL IAAYAGVEKQ VHDITERAMN NELDFKESLR ERVKLLKGLQ

     151  IDTLYDEIKQ KLEITKGVPE LCKFLHDKDC KLAVLSGGFI QFASFIKDQL

     201  NLDFCKANLL EVGAGGKLTG KTLGPIVDGQ CKSETLLQLC NDYTVPVEAS

     251  CMVGDGGNDL PAMAAAGFGI AWNAKPKVQN AAPCRLNTES MTDILYILGY

     301  TDDEIYDRQ*

Protein Sequence for WashU_Smik_Contig2850.8:

WashU_Smik_Contig2850.8  Length: 310  Sat Dec 10 11:53:00 2011  Type: P  Check: 6666  ..

       1  MSRFVITCIA HGESLPQETI DQLANEIIEC SKKEISISST KKLSTRATDI

      51  YVEVAESVSQ NNLKNELMGM IESHSDVDVI ISADNEYRQA KKLFVFDMDS

     101  TLIYQEVIEL IAAYASVEKQ VHEITERAMN NELDFKESLR ERVKLLQGLQ

     151  IDTLYDEIKQ KLEITKGVPE LCKFLHAKNC KLAVLSGGFI QFASFIKNQL

     201  GLDFCKANLL EVDAEGKLTG KTLGPIVDGE CKSETLLQLC KDYKVPVESS

     251  CMVGDGGNDL PAMATAGFGI AWNAKPKVQK AAPCKLNTKS MKDILYILGY

     301  TDDEIYNKQ*
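
The GCG-style records above can be turned back into plain sequences with a short parser. The sketch below is illustrative only: it assumes each record is a header line containing `Length: N` followed by numbered lines of ten-residue groups, and that the declared length counts the trailing `*` (which matches the records shown here). The names `parse_gcg_record` and `to_fasta` are hypothetical.

```python
import re

# First record from this page, copied verbatim.
RECORD = """\
SGD_Scer_SER2/YGR208W  Length: 310  Sat Dec 10 11:53:00 2011  Type: P  Check: 6509  ..

       1  MSKFVITCIA HGENLPKETI DQIAKEITES SAKDVSINGT KKLSARATDI

      51  FIEVAGSIVQ KDLKNKLTNV IDSHNDVDVI VSVDNEYRQA KKLFVFDMDS

     101  TLIYQEVIEL IAAYAGVEEQ VHEITERAMN NELDFKESLR ERVKLLQGLQ

     151  VDTLYDEIKQ KLEVTKGVPE LCKFLHKKNC KLAVLSGGFI QFAGFIKDQL

     201  GLDFCKANLL EVDTDGKLTG KTLGPIVDGQ CKSETLLQLC NDYNVPVEAS

     251  CMVGDGGNDL PAMATAGFGI AWNAKPKVQK AAPCKLNTKS MTDILYILGY

     301  TDDEIYNRQ*
"""


def parse_gcg_record(text):
    """Return (name, sequence) from one GCG-style record as shown above."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    header, body = lines[0], lines[1:]
    name = header.split()[0]
    declared = int(re.search(r"Length:\s*(\d+)", header).group(1))
    # Drop position numbers and spacing; keep residues and the stop '*'.
    raw = "".join(re.sub(r"[\d\s]", "", ln) for ln in body)
    if len(raw) != declared:
        raise ValueError(f"{name}: expected {declared} symbols, got {len(raw)}")
    return name, raw.rstrip("*")


def to_fasta(name, seq, width=60):
    """Format a sequence as FASTA with lines of at most `width` residues."""
    chunks = [seq[i:i + width] for i in range(0, len(seq), width)]
    return "\n".join([">" + name] + chunks)


name, seq = parse_gcg_record(RECORD)
print(to_fasta(name, seq))
```

The length check guards against residues lost in copying; the `Check:` value in the header is not verified in this sketch.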