Fungal Sequence Alignment



This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

The other fungal sequences currently shown on this page come from Cliften et al. and Kellis et al.

ClustalW Protein Alignment and Sequence for YKL099C and Homologs




Symbols:
* = identical
: = strong similarity
. = weak similarity
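The legend above can be illustrated with a minimal Python sketch that assigns a symbol to each alignment column. The residue groups listed here are the strong and weak conservation groups commonly documented for ClustalW; they are an assumption, not taken from this page, and the function names are hypothetical.

```python
# Conservation groups as commonly documented for ClustalW (assumed here).
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK",
                 "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = ["CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
               "SNDEQK", "NDEQHK", "NEQHRK", "FVLIM", "HFY"]


def consensus_symbol(column):
    """Return '*', ':', '.' or ' ' for one alignment column."""
    residues = set(column)
    if "-" in residues:
        return " "  # any gap in the column: no symbol
    if len(residues) == 1:
        return "*"  # identical in every sequence
    if any(residues <= set(group) for group in STRONG_GROUPS):
        return ":"  # all residues fall within one strong group
    if any(residues <= set(group) for group in WEAK_GROUPS):
        return "."  # all residues fall within one weak group
    return " "


def consensus_line(rows):
    """Build the symbol line for equal-length aligned rows."""
    return "".join(consensus_symbol(col) for col in zip(*rows))


if __name__ == "__main__":
    rows = ["MAKLVHDVQK", "MAKLVHDVQK", "MAKLVHNIQK"]
    print(consensus_line(rows))
```

With the first ten columns of the SGD_Scer, WashU_Scas and WashU_Sklu rows as input, this sketch yields `******::**`, which agrees with the start of the first Symbols line in the alignment.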

SGD_Scer_UTP11/YKL099C   1   MAKLVHDVQKKQHRERSQLTSRSRYGFLEKHKDYVKRAQDFHRKQSTLKV   50
MIT_Smik_c702_13288   1   MAKLVHDVQKKQHRERSQLTSRTRFGFLEKHKDYVKRAQDFHRKQSTLKV   50
MIT_Spar_c321_13861   1   MAKLVHDVQKKQHRERSQLTSRTRYGFLEKHKDYVKRAQDFHRKQSTLKV   50
MIT_Suva_c259_14375   1   MAKLVHDVQKKQHRERSQLTSRTRYGFLEKHKDYVKRAQDFHRKQSSLKI   50
WashU_Sbay_Contig593.20   1   MAKLVHDVQKKQHRERSQLTSRTRYGFLEKHKDYVKRAQDFHRKQSSLKI   50
WashU_Scas_Contig616.11   1   MAKLVHDVQKRQHRERSQLTGRSRLGFLEKHKDYVKRAQDYHKKEATLKI   50
WashU_Sklu_Contig1933.4   1   MAKLVHNIQKKQHRERSQVSERSRFGFLEKHKDYVKRARDYHKKQVTLKV   50
WashU_Smik_Contig1444.2   1   MAKLVHDVQKKQHRERSQLTSRTRFGFLEKHKDYVKRAQDFHRKQSTLKV   50
Symbols   ******::**:*******:: *:* *************:*:*:*: :**:



SGD_Scer_UTP11/YKL099C   51   LREKAKERNPDEYYHAMHSRKTDAKGLLISSRHGDEEDESLSMDQVKLLK   100
MIT_Smik_c702_13288   51   LREKAKERNPDEYYHAMHSRKTDAKGLLISSRHGDEEDESLSMDQVKLLK   100
MIT_Spar_c321_13861   51   LREKAKERNPDEYYHAMHSRKTDTKGLLISSRHGNEEDESLSMDQVKLLK   100
MIT_Suva_c259_14375   51   LREKVKERNPDEYYHAMHSRKTDAKGLLITSRHGEDEDESLSMDQVKLLK   100
WashU_Sbay_Contig593.20   51   LREKVKERNPDEYYHAMHSRKTDAKGLLITSRHGEDEDESLSMDQVKLLK   100
WashU_Scas_Contig616.11   51   LRSKVTERNPDEYYHGMHSRKVDAKGLLVTSRRGEDEDESLSMDQVKLLK   100
WashU_Sklu_Contig1933.4   51   LRSKAKERNPDEYYHGMNTRKLDSKGLLIKSRHAEGEDPSLTMDQVKLLK   100
WashU_Smik_Contig1444.2   51   LREKAKERNPDEYYHAMHSRKTDAKGLLISSRHGDEEDESLSMDQVKLLK   100
Symbols   **.*..*********.*::** *:****:.**:.: ** **:********



SGD_Scer_UTP11/YKL099C   101   TQDSNYVRTLRQIELKKLEKGAKQLMFKSSGNHTIFVDSREKMNEFTPEK   150
MIT_Smik_c702_13288   101   TQDSNYVRTLRQIELKKLEKRSKELMFKSSGKHTIFVDSRERMVDFAPEK   150
MIT_Spar_c321_13861   101   TQDSNYVRTLRQIELKKLEKGSKQLMFKSSGNHTIFVDSREKMEDFAPEK   150
MIT_Suva_c259_14375   101   TQDSNYVRTLRQLELKKLEKRSKELMFKSSGNHTIFVDSREKMEDFAPEK   150
WashU_Sbay_Contig593.20   101   TQDSNYVRTLRQLELKKLEKRSKELMFKSSGNHTIFVDSREKMEDFAPEK   150
WashU_Scas_Contig616.11   101   SQDSNYVRTLRQMELKKLENKTKTLMFGSNGQHTVFVDDRQQLEDFSPEE   150
WashU_Sklu_Contig1933.4   101   TQDSNYVRTLRQIELKKLERSSKELMFKSSGNHTVFVDDNKEMRDFSPEQ   150
WashU_Smik_Contig1444.2   101   TQDSNYVRTLRQIELKKLEKRSKELMFKSSGKHTIFVDSRERMVDFAPEK   150
Symbols   :***********:******. :* *** *.*:**:***..:.: :*:**:



SGD_Scer_UTP11/YKL099C   151   FFNTTSEMVNRSENRLTKDQLAQDISNNR-----NASSIMPKESLDKKKL   195
MIT_Smik_c702_13288   151   FFNTTSEMVNRSENRLTKDQLTQDIFNNK-----TASSIMPKESLDKKKL   195
MIT_Spar_c321_13861   151   FFNTTSEMVNRSENRLTKDQLTQEILNNK-----NASSIMPKESLDKKKL   195
MIT_Suva_c259_14375   151   FFNTTSEMVNRSENRLTKDQLTQDVLNNK-----SASSIMPKESLDKKKL   195
WashU_Sbay_Contig593.20   151   FFNTTSEMVNRSENRLTKDQLTQDVLNNK-----SASSIMPKESLDKKKL   195
WashU_Scas_Contig616.11   151   YFNTTTELLNRNENRLTRDQLAATALSGSKSRAASASFIMPKESLDKKKL   200
WashU_Sklu_Contig1933.4   151   YFKTTTEMLQRRENRLTKDQLSSNQGEYLN--LNSEDVVMPKESLEKKKL   198
WashU_Smik_Contig1444.2   151   FFNTTSEMVNRSENRLTKDQLTQDIFNNK-----TASSIMPKESLDKKKL   195
Symbols   :*:**:*:::* *****:***:    .       . . :******:****



SGD_Scer_UTP11/YKL099C   196   KKFKQVKQHLQRETQLKQVQQRMDAQRELLKKGSKKKIVDSSGKISFKWK   245
MIT_Smik_c702_13288   196   RKFKQVKQHVQRETQLKEVQQRMDSQRELLKKGSKKKIVDPTGKSSFKWK   245
MIT_Spar_c321_13861   196   KKFKQVKQHLQRETQLKQVQQRMDAQRELLKKGSKKKIVDPSGNTSFKWK   245
MIT_Suva_c259_14375   196   KKFKQVKQHIQRETQLKQVQQRMDAQRELLKKGSKKKIVDSSGKSSFKWK   245
WashU_Sbay_Contig593.20   196   KKFKQVKQHIQRETQLKQVQQRMDAQRELLKKGSKKKIVDSSGKSSFKWK   245
WashU_Scas_Contig616.11   201   KKFKIVKQHLERETQLKEVQQRMDLQREVMKKGSKKKVVDKKGNITFKWK   250
WashU_Sklu_Contig1933.4   199   KKYKLVQQRLEREKQLKQVQQRMDIQREVMKKGSKKKIVDTKGNVSFKWK   248
WashU_Smik_Contig1444.2   196   RKFKQVKQHVQRETQLKEVQQRMDSQRELLKKGSKKKIVDPTGKSSFKWK   245
Symbols   :*:* *:*:::**.***:****** ***::*******:** .*: :****



SGD_Scer_UTP11/YKL099C   246   KQRKR   250
MIT_Smik_c702_13288   246   KQRKR   250
MIT_Spar_c321_13861   246   KQRKR   250
MIT_Suva_c259_14375   246   KQRKR   250
WashU_Sbay_Contig593.20   246   KQRKR   250
WashU_Scas_Contig616.11   251   KQRKR   255
WashU_Sklu_Contig1933.4   249   KQRKR   253
WashU_Smik_Contig1444.2   246   KQRKR   250
Symbols   *****
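Aligned rows such as those above can also be compared pairwise. The following is a minimal sketch, assuming percent identity is counted only over columns where neither sequence has a gap; other definitions exist, and the function name is hypothetical.

```python
def percent_identity(row_a, row_b):
    """Percent identity between two aligned rows, ignoring gapped columns."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(row_a, row_b):
        if a == "-" or b == "-":
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


if __name__ == "__main__":
    # Toy input: one gapped column is skipped, 4 of 5 remaining columns match.
    print(percent_identity("MAKL-V", "MAKIQV"))
```

To compare full-length sequences, concatenate the row segments of each sequence across all alignment blocks before calling the function.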





GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_UTP11/YKL099C:

SGD_Scer_UTP11/YKL099C  Length: 251  Mon Nov  7 15:55:07 2016  Type: P  Check: 8737  ..

       1  MAKLVHDVQK KQHRERSQLT SRSRYGFLEK HKDYVKRAQD FHRKQSTLKV

      51  LREKAKERNP DEYYHAMHSR KTDAKGLLIS SRHGDEEDES LSMDQVKLLK

     101  TQDSNYVRTL RQIELKKLEK GAKQLMFKSS GNHTIFVDSR EKMNEFTPEK

     151  FFNTTSEMVN RSENRLTKDQ LAQDISNNRN ASSIMPKESL DKKKLKKFKQ

     201  VKQHLQRETQ LKQVQQRMDA QRELLKKGSK KKIVDSSGKI SFKWKKQRKR

     251  *

Protein Sequence for MIT_Smik_c702_13288:

MIT_Smik_c702_13288  Length: 251  Mon Nov  7 15:55:07 2016  Type: P  Check: 9060  ..

       1  MAKLVHDVQK KQHRERSQLT SRTRFGFLEK HKDYVKRAQD FHRKQSTLKV

      51  LREKAKERNP DEYYHAMHSR KTDAKGLLIS SRHGDEEDES LSMDQVKLLK

     101  TQDSNYVRTL RQIELKKLEK RSKELMFKSS GKHTIFVDSR ERMVDFAPEK

     151  FFNTTSEMVN RSENRLTKDQ LTQDIFNNKT ASSIMPKESL DKKKLRKFKQ

     201  VKQHVQRETQ LKEVQQRMDS QRELLKKGSK KKIVDPTGKS SFKWKKQRKR

     251  *

Protein Sequence for MIT_Spar_c321_13861:

MIT_Spar_c321_13861  Length: 251  Mon Nov  7 15:55:07 2016  Type: P  Check: 8651  ..

       1  MAKLVHDVQK KQHRERSQLT SRTRYGFLEK HKDYVKRAQD FHRKQSTLKV

      51  LREKAKERNP DEYYHAMHSR KTDTKGLLIS SRHGNEEDES LSMDQVKLLK

     101  TQDSNYVRTL RQIELKKLEK GSKQLMFKSS GNHTIFVDSR EKMEDFAPEK

     151  FFNTTSEMVN RSENRLTKDQ LTQEILNNKN ASSIMPKESL DKKKLKKFKQ

     201  VKQHLQRETQ LKQVQQRMDA QRELLKKGSK KKIVDPSGNT SFKWKKQRKR

     251  *

Protein Sequence for MIT_Suva_c259_14375:

MIT_Suva_c259_14375  Length: 251  Mon Nov  7 15:55:07 2016  Type: P  Check: 8624  ..

       1  MAKLVHDVQK KQHRERSQLT SRTRYGFLEK HKDYVKRAQD FHRKQSSLKI

      51  LREKVKERNP DEYYHAMHSR KTDAKGLLIT SRHGEDEDES LSMDQVKLLK

     101  TQDSNYVRTL RQLELKKLEK RSKELMFKSS GNHTIFVDSR EKMEDFAPEK

     151  FFNTTSEMVN RSENRLTKDQ LTQDVLNNKS ASSIMPKESL DKKKLKKFKQ

     201  VKQHIQRETQ LKQVQQRMDA QRELLKKGSK KKIVDSSGKS SFKWKKQRKR

     251  *

Protein Sequence for WashU_Sbay_Contig593.20:

WashU_Sbay_Contig593.20  Length: 251  Mon Nov  7 15:55:07 2016  Type: P  Check: 8624  ..

       1  MAKLVHDVQK KQHRERSQLT SRTRYGFLEK HKDYVKRAQD FHRKQSSLKI

      51  LREKVKERNP DEYYHAMHSR KTDAKGLLIT SRHGEDEDES LSMDQVKLLK

     101  TQDSNYVRTL RQLELKKLEK RSKELMFKSS GNHTIFVDSR EKMEDFAPEK

     151  FFNTTSEMVN RSENRLTKDQ LTQDVLNNKS ASSIMPKESL DKKKLKKFKQ

     201  VKQHIQRETQ LKQVQQRMDA QRELLKKGSK KKIVDSSGKS SFKWKKQRKR

     251  *

Protein Sequence for WashU_Scas_Contig616.11:

WashU_Scas_Contig616.11  Length: 256  Mon Nov  7 15:55:07 2016  Type: P  Check: 9594  ..

       1  MAKLVHDVQK RQHRERSQLT GRSRLGFLEK HKDYVKRAQD YHKKEATLKI

      51  LRSKVTERNP DEYYHGMHSR KVDAKGLLVT SRRGEDEDES LSMDQVKLLK

     101  SQDSNYVRTL RQMELKKLEN KTKTLMFGSN GQHTVFVDDR QQLEDFSPEE

     151  YFNTTTELLN RNENRLTRDQ LAATALSGSK SRAASASFIM PKESLDKKKL

     201  KKFKIVKQHL ERETQLKEVQ QRMDLQREVM KKGSKKKVVD KKGNITFKWK

     251  KQRKR*

Protein Sequence for WashU_Sklu_Contig1933.4:

WashU_Sklu_Contig1933.4  Length: 254  Mon Nov  7 15:55:07 2016  Type: P  Check: 8152  ..

       1  MAKLVHNIQK KQHRERSQVS ERSRFGFLEK HKDYVKRARD YHKKQVTLKV

      51  LRSKAKERNP DEYYHGMNTR KLDSKGLLIK SRHAEGEDPS LTMDQVKLLK

     101  TQDSNYVRTL RQIELKKLER SSKELMFKSS GNHTVFVDDN KEMRDFSPEQ

     151  YFKTTTEMLQ RRENRLTKDQ LSSNQGEYLN LNSEDVVMPK ESLEKKKLKK

     201  YKLVQQRLER EKQLKQVQQR MDIQREVMKK GSKKKIVDTK GNVSFKWKKQ

     251  RKR*

Protein Sequence for WashU_Smik_Contig1444.2:

WashU_Smik_Contig1444.2  Length: 251  Mon Nov  7 15:55:07 2016  Type: P  Check: 9060  ..

       1  MAKLVHDVQK KQHRERSQLT SRTRFGFLEK HKDYVKRAQD FHRKQSTLKV

      51  LREKAKERNP DEYYHAMHSR KTDAKGLLIS SRHGDEEDES LSMDQVKLLK

     101  TQDSNYVRTL RQIELKKLEK RSKELMFKSS GKHTIFVDSR ERMVDFAPEK

     151  FFNTTSEMVN RSENRLTKDQ LTQDIFNNKT ASSIMPKESL DKKKLRKFKQ

     201  VKQHVQRETQ LKEVQQRMDS QRELLKKGSK KKIVDPTGKS SFKWKKQRKR

     251  *
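The GCG-style blocks above can be reduced to plain residue strings with a short parser. This is a minimal sketch, assuming each block has a header line ending in `..`, followed by numbered lines of space-separated residue groups and a trailing `*` terminator, as in the listings on this page. The function name is hypothetical.

```python
import re


def gcg_to_sequence(block):
    """Extract the plain residue string from one GCG-style text block."""
    in_body = False
    parts = []
    for line in block.splitlines():
        if not in_body:
            # The header line ends with '..'; residues start after it.
            in_body = line.rstrip().endswith("..")
            continue
        match = re.match(r"\s*\d+\s+(\S.*)$", line)
        if match:
            # Drop the position number and the spaces between groups.
            parts.append(match.group(1).replace(" ", ""))
    # Remove the trailing '*' terminator.
    return "".join(parts).rstrip("*")


if __name__ == "__main__":
    example = (
        "WashU_Scas_Contig616.11  Length: 256  Type: P  Check: 9594  ..\n"
        "\n"
        "     201  KKFKIVKQHL ERETQLKEVQ\n"
        "\n"
        "     251  KQRKR*\n"
    )
    print(gcg_to_sequence(example))
```

In the listings on this page, the reported Length appears to count the `*` terminator, so the extracted string is one residue shorter than that value.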