Fungal Sequence Alignment Help



This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

Currently, this page displays other fungal sequences from Cliften et al. and Kellis et al.

ClustalW Protein Alignment and Sequence for YKR020W and Homologs


Choose two or more sequences for alignment:
Best Hits & Orthologs

Pick a sequence type:

Select or unselect multiple options for sequence
name by pressing the Control (PC) or Command
(Mac) key while clicking.

Align selected sequences
selected sequences (FASTA format)


Symbols:
* = identical
: = strong similarity
. = weak similarity

SGD_Scer_VPS51/YKR020W   1   MAEQISHKKSLRVSSLNKDRRLLLREFYNLENEPNKGRQEARIGEKASEA   50
MIT_Smik_c115_13743   1   MAEQISHKKSLRVSSLNKDRRLLLREFYNLKNEPNKGEEDAQIEGTANKS   50
MIT_Spar_c405_13419   1   MADQISHKKSLRVSSLNKDRRLLLREFYNLENEPDKDGKDARIGEKVSKA   50
MIT_Suva_c294_15199   1   MAEQISHKKSLRINSLNKDRRLLLREFYNLKNDPNEGKQDASTEGAASGP   50
WashU_Sbay_Contig624.35   1   MAEQISHKKSLRINSLNKDRRLLLREFYNLKNDPNEGKQDASTEGAASGP   50
WashU_Scas_Contig675.33   1   MAEQISHKKSLRVNRLNKDRRLLLKEYYKLEEG--------------TEP   36
WashU_Skud_Contig2000.11   1   MAEQISHKKSLRVSSLNKDRRLLLREFYNLENDPNKGEQDAHIEGAAGKS   50
WashU_Smik_Contig2605.4   1   MAEQISHKKSLRVSSLNKDRRLLLREFYNLKNEPNKGEEDAQIEGTANKS   50
Symbols






**:*********:. *********:*:*:*:: .



SGD_Scer_VPS51/YKR020W   51   HSGEEQVTDVNIDTEANTEKPVKDDELSATEEDLKEGSEDAEEEIKNLPF   100
MIT_Smik_c115_13743   51   HSDEGQVAAGNLNAEGNREKQAKKDELKVAEEDPKEESEDTNEEIGNLPF   100
MIT_Spar_c405_13419   51   HSEEGQVTGVNVDTEGNTEKPVKKDELSAAEEDPKEGSEDAEEEIKNLPF   100
MIT_Suva_c294_15199   51   HSPEEHGADVNESVEENADELAQKNRLGAAVEDRNEGKQQVSEEIEQLPF   100
WashU_Sbay_Contig624.35   51   HSPEEHGADVNESVEENADELAQKNRLGAAVEDRNEGKQQVSEEIEQLPF   100
WashU_Scas_Contig675.33   37   EPEPEINGETKEGQEQEQKSQQRIPSTVDTPEPTVVATPMEDKPVSSLTF   86
WashU_Skud_Contig2000.11   51   YTEDGQNPDVSASTDGSTGELVKKDALGAAEENQKKGTEDTDEEIQNLPF   100
WashU_Smik_Contig2605.4   51   HSDEGQVAAGNLNAEGNREKQAKKDELKVAEEDPKEESEDTNEEIGNLPF   100
Symbols






. . . : . . : : * . .: : .*.*



SGD_Scer_VPS51/YKR020W   101   KRLVQIHNKLLGKETETNNSIKNTIYENYYDLIKVNDLLKEITNANEDQI   150
MIT_Smik_c115_13743   101   KRLVQIHNKLLGKETETNNSIKNTIYENYYDLIKVNDLLKEITNANEDKV   150
MIT_Spar_c405_13419   101   KRLVQIHNKLLGKETETNNSIKNTIYENYYDLIKVNDLLKEITNANEDQI   150
MIT_Suva_c294_15199   101   KRLVQIHNKLLGKETETNNSIKNTIYENYYDLIKVNDLLKEITNANKDQV   150
WashU_Sbay_Contig624.35   101   KRLVQIHNKLLGKETETNNSIKNTIYENYYDLIKVNDLLKEITNANKDQV   150
WashU_Scas_Contig675.33   87   KELIQIHNKLLSKETETNNTIKNTIYENYYDLIKVNDLLKEVRHAKSDEV   136
WashU_Skud_Contig2000.11   101   KKLVQIHNKLLGKETETNNSIKNTIYENYYDLIKVNNLLKEITNANEDQV   150
WashU_Smik_Contig2605.4   101   KRLVQIHNKLLGKETETNNSIKNTIYENYYDLIKVNDLLKEITNANEDKV   150
Symbols






*.*:*******.*******:****************:****: :*:.*::



SGD_Scer_VPS51/YKR020W   151   NKLKQTVESLIKEL   164
MIT_Smik_c115_13743   151   GKLKQTVESLIKEL   164
MIT_Spar_c405_13419   151   RTLKQTVESLIKEL   164
MIT_Suva_c294_15199   151   GKLRQTVETLIKEL   164
WashU_Sbay_Contig624.35   151   GKLRQTVETLIKEL   164
WashU_Scas_Contig675.33   137   AQLKQCVDLLRDEF   150
WashU_Skud_Contig2000.11   151   SKLKQTVETLIKEL   164
WashU_Smik_Contig2605.4   151   GKLKQTVESLIKEL   164
Symbols






*:* *: * .*:



Symbols:
* = identical
: = strong similarity
. = weak similarity


- Download all sequences in alignment, in FASTA format.
GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_VPS51/YKR020W:

SGD_Scer_VPS51/YKR020W  Length: 165  Mon Nov  7 15:58:31 2016  Type: P  Check: 6161  ..

       1  MAEQISHKKS LRVSSLNKDR RLLLREFYNL ENEPNKGRQE ARIGEKASEA

      51  HSGEEQVTDV NIDTEANTEK PVKDDELSAT EEDLKEGSED AEEEIKNLPF

     101  KRLVQIHNKL LGKETETNNS IKNTIYENYY DLIKVNDLLK EITNANEDQI

     151  NKLKQTVESL IKEL*

Protein Sequence for MIT_Smik_c115_13743:

MIT_Smik_c115_13743  Length: 165  Mon Nov  7 15:58:31 2016  Type: P  Check: 7055  ..

       1  MAEQISHKKS LRVSSLNKDR RLLLREFYNL KNEPNKGEED AQIEGTANKS

      51  HSDEGQVAAG NLNAEGNREK QAKKDELKVA EEDPKEESED TNEEIGNLPF

     101  KRLVQIHNKL LGKETETNNS IKNTIYENYY DLIKVNDLLK EITNANEDKV

     151  GKLKQTVESL IKEL*

Protein Sequence for MIT_Spar_c405_13419:

MIT_Spar_c405_13419  Length: 165  Mon Nov  7 15:58:31 2016  Type: P  Check: 6695  ..

       1  MADQISHKKS LRVSSLNKDR RLLLREFYNL ENEPDKDGKD ARIGEKVSKA

      51  HSEEGQVTGV NVDTEGNTEK PVKKDELSAA EEDPKEGSED AEEEIKNLPF

     101  KRLVQIHNKL LGKETETNNS IKNTIYENYY DLIKVNDLLK EITNANEDQI

     151  RTLKQTVESL IKEL*

Protein Sequence for MIT_Suva_c294_15199:

MIT_Suva_c294_15199  Length: 165  Mon Nov  7 15:58:31 2016  Type: P  Check: 8001  ..

       1  MAEQISHKKS LRINSLNKDR RLLLREFYNL KNDPNEGKQD ASTEGAASGP

      51  HSPEEHGADV NESVEENADE LAQKNRLGAA VEDRNEGKQQ VSEEIEQLPF

     101  KRLVQIHNKL LGKETETNNS IKNTIYENYY DLIKVNDLLK EITNANKDQV

     151  GKLRQTVETL IKEL*

Protein Sequence for WashU_Sbay_Contig624.35:

WashU_Sbay_Contig624.35  Length: 165  Mon Nov  7 15:58:31 2016  Type: P  Check: 8001  ..

       1  MAEQISHKKS LRINSLNKDR RLLLREFYNL KNDPNEGKQD ASTEGAASGP

      51  HSPEEHGADV NESVEENADE LAQKNRLGAA VEDRNEGKQQ VSEEIEQLPF

     101  KRLVQIHNKL LGKETETNNS IKNTIYENYY DLIKVNDLLK EITNANKDQV

     151  GKLRQTVETL IKEL*

Protein Sequence for WashU_Scas_Contig675.33:

WashU_Scas_Contig675.33  Length: 151  Mon Nov  7 15:58:31 2016  Type: P  Check: 4607  ..

       1  MAEQISHKKS LRVNRLNKDR RLLLKEYYKL EEGTEPEPEP EINGETKEGQ

      51  EQEQKSQQRI PSTVDTPEPT VVATPMEDKP VSSLTFKELI QIHNKLLSKE

     101  TETNNTIKNT IYENYYDLIK VNDLLKEVRH AKSDEVAQLK QCVDLLRDEF

     151  *

Protein Sequence for WashU_Skud_Contig2000.11:

WashU_Skud_Contig2000.11  Length: 165  Mon Nov  7 15:58:31 2016  Type: P  Check: 7211  ..

       1  MAEQISHKKS LRVSSLNKDR RLLLREFYNL ENDPNKGEQD AHIEGAAGKS

      51  YTEDGQNPDV SASTDGSTGE LVKKDALGAA EENQKKGTED TDEEIQNLPF

     101  KKLVQIHNKL LGKETETNNS IKNTIYENYY DLIKVNNLLK EITNANEDQV

     151  SKLKQTVETL IKEL*

Protein Sequence for WashU_Smik_Contig2605.4:

WashU_Smik_Contig2605.4  Length: 165  Mon Nov  7 15:58:31 2016  Type: P  Check: 7055  ..

       1  MAEQISHKKS LRVSSLNKDR RLLLREFYNL KNEPNKGEED AQIEGTANKS

      51  HSDEGQVAAG NLNAEGNREK QAKKDELKVA EEDPKEESED TNEEIGNLPF

     101  KRLVQIHNKL LGKETETNNS IKNTIYENYY DLIKVNDLLK EITNANEDKV

     151  GKLKQTVESL IKEL*