Fungal Sequence Alignment



This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with its identified orthologs in other fungal species.

The other fungal sequences currently shown come from Cliften et al. and Kellis et al.

ClustalW Protein Alignment and Sequence for YBR070C and Homologs




Symbols:
* = identical
: = strong similarity
. = weak similarity
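
The legend above can be illustrated with a small sketch that assigns one symbol to each alignment column. The residue groups below are the "strong" and "weak" conservation groups commonly documented for ClustalW; the function names are hypothetical, and the rule that a column containing a gap gets no symbol is an assumption of this sketch, not something stated on this page.

```python
# Conservation groups as commonly documented for ClustalW (assumed here).
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK",
                 "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = ["CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
               "SNDEQK", "NDEQHK", "NEQHRK", "FVLIM", "HFY"]


def column_symbol(column):
    """Return '*', ':', '.' or ' ' for one alignment column."""
    residues = set(column)
    if "-" in residues:          # assumption: gapped columns get no symbol
        return " "
    if len(residues) == 1:       # every sequence has the same residue
        return "*"
    if any(residues <= set(group) for group in STRONG_GROUPS):
        return ":"
    if any(residues <= set(group) for group in WEAK_GROUPS):
        return "."
    return " "


def consensus_line(aligned):
    """Build the symbol line for equal-length aligned sequences."""
    return "".join(column_symbol(col) for col in zip(*aligned))


# The three distinct variants of one conserved stretch in the second block.
segment = ["FVFLGSGGHTGEM", "FIFLGSGGHTGEM", "FVYLGSGGHTGEM"]
print(consensus_line(segment))
```

For this stretch the sketch prints `*::**********`, which agrees with the corresponding run in the Symbols line of the second alignment block.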

SGD_Scer_ALG14/YBR070C   1   ---------MKTAYLASLVLIVSTAYVIRLIAILPFFHTQAGTEKDTKDG   41
MIT_Smik_c155_1010   1   ---------MKTAYLASLVLIVLTAYVIRLVTILPFFHPQTDTQRDTEDE   41
MIT_Spar_c199_1274   1   ---------MNTAYLASLVLIVMTAYVIRLVTILPFFHTQTSLKKDTKDE   41
MIT_Suva_c110_860   1   MSEVESAYNMNTAYLASLVVIVTALYVVRLVTILPLFHTQTKVRKDTRDE   50
WashU_Sbay_Contig678.154   1   ---------MNTAYLASLVVIVTALYVVRLVTILPLFHTQTKVRKDTRDE   41
WashU_Scas_Contig569.10   1   ---------MDIFYLTSAILLILTAYIFRFISILPFYRTHPNDR------   35
WashU_Sklu_Contig2230.4   1   --------MHYATILGTTALVLITTLAVSVISTLPIFRYN----------   32
WashU_Skud_Contig1650.4   1   ---------MKTAYLASLILIVLTVYVFRLVAILPFFHTQTDTEKNTRNE   41
WashU_Smik_Contig2810.7   1   ---------MKTAYLASLVLIVLTAYVIRLVTILPFFHPQTDTQRDTEDE   41
Symbols   * : ::: : . .:: **::: :



SGD_Scer_ALG14/YBR070C   42   VNLLKIRKSSKKPLKIFVFLGSGGHTGEMIRLLENYQDLLLG-KSIVYLG   90
MIT_Smik_c155_1010   42   VSVLRIQKPSKKPLKVFIFLGSGGHTGEMIRLLENYKDLLLN-GSIIYLG   90
MIT_Spar_c199_1274   42   ASILRIQKPSKKPLRIFVFLGSGGHTGEMIRLLENYKDLLLN-ESIVYLG   90
MIT_Suva_c110_860   51   ASVFRTQKSSKKRLKIFVFLGSGGHTGEMLRLLQNYKDLLLD-ESILYVG   99
WashU_Sbay_Contig678.154   42   ASVFRTQKSSKKRLKIFVFLGSGGHTGEMLRLLQNYKDLLLD-ESILYVG   90
WashU_Scas_Contig569.10   36   --ILTAHKGTNGPLHIFVFLGSGGHTGEMLRILQNYKETLLNQDNVLYVG   83
WashU_Sklu_Contig2230.4   33   -SRARLLHSDNGPVHCFVYLGSGGHTGEMLRLLNNYNAVFFRKDNVLDVG   81
WashU_Skud_Contig1650.4   42   ASVLRMHKPSKKPLKIFVFLGSGGHTGEMLRLLQNYNDLLLD-KSTVYVG   90
WashU_Smik_Contig2810.7   42   VSVLRIQKPSKKPLKVFIFLGSGGHTGEMIRLLENYKDLLLN-GSIIYLG   90
Symbols   : : :: *::**********:*:*:**: :: . : :*



SGD_Scer_ALG14/YBR070C   91   YSDEASRQRFA-HFIKKFGHCKVKYYEFMKAREVKATLLQSVKTIIGTLV   139
MIT_Smik_c155_1010   91   YSDEASKQRFV-DFIKNFTRCKVQYYEFMKAREVKANLVQSVKTIIGTLM   139
MIT_Spar_c199_1274   91   YSDEASRQRFA-SFIKKFSRCKVQYSGFMKAREVKATFLQSVKTIVGTLV   139
MIT_Suva_c110_860   100   YSDQASRQRFC-SLLQNFPRCQVRYYEFMKAREVKATLLQSVKTIIGTLV   148
WashU_Sbay_Contig678.154   91   YSDQASRQRFC-SLLQNFPRCQVRYYEFMKAREVKATLLQSVKTIIGTLV   139
WashU_Scas_Contig569.10   84   YSDIDSRNKFS-KLLQ--SACKVEYIEFKKAREVNSGLLASLKSIFLTLM   130
WashU_Sklu_Contig2230.4   82   YSDEASLAKFKRMDFKNKENVHINYHEFLKAREVNATKTQSLKSILYTLW   131
WashU_Skud_Contig1650.4   91   YSDQASKQKFA-RLMKNFGHCKVQYYEFMKAREVKATLLQSVKSIIGTLV   139
WashU_Smik_Contig2810.7   91   YSDEASKQRFV-DFIKNFTRCKVQYYEFMKAREVKANLVQSVKTIIGTLM   139
Symbols   *** * :* :: ::.* * *****:: *:*:*. **



SGD_Scer_ALG14/YBR070C   140   QSFVHVVRIRFAMCGSPHLFLLNGPGTCCIISFWLKIMELLLPLLGSSHI   189
MIT_Smik_c155_1010   140   QSFIHVIKIRFAMCGSPHLFLLNGPGTCCIISFWLKFIELVVLFVDSSHI   189
MIT_Spar_c199_1274   140   QSFVHVVRIRFAMCGSPHLFLLNGPGTCCIISFWLKIIELVVPLLGSSHI   189
MIT_Suva_c110_860   149   QSLVHVVRIRLSMCGSPHLFLLNGPGTCCIITFWLKIMELVLLFLDSSHI   198
WashU_Sbay_Contig678.154   140   QSLVHVVRIRLSMCGSPHLFLLNGPGTCCIITFWLKIMELVLLFLDSSHI   189
WashU_Scas_Contig569.10   131   TSLLNVIRIRKSIAFKPHLILLNGPGTCCILVLWFKLLEWILLFSSSSNI   180
WashU_Sklu_Contig2230.4   132   NSTLVILKMKISTLGKSHLILLNGPGTCCIIAVLFKVLQIVSCTP--SKI   179
WashU_Skud_Contig1650.4   140   QSFVHVIQIRFAMCGSPHLFLLNGPGTCCIISFWLKLIELIVLFLDSSHI   189
WashU_Smik_Contig2810.7   140   QSFIHVIKIRFAMCGSPHLFLLNGPGTCCIISFWLKFIELVVLFVDSSHI   189
Symbols   * : ::::: : ..**:**********: . :*.:: : *:*



SGD_Scer_ALG14/YBR070C   190   VYVESLARINTPSLTGKILYWVVDEFIVQWQELRDNYLPRSKWFGILV   237
MIT_Smik_c155_1010   190   VYVESLARINTPSLTGKVLYWLVDEFIVQWQELRDNCLPRSKWFGILV   237
MIT_Spar_c199_1274   190   VYVESLARINTPSLTGKILYWVVDEFIVQWQELRDNCLPRSKWFGILV   237
MIT_Suva_c110_860   199   VYVESLARINTPSLTGKILYWVVDEFVVQWRELRDDCLPRSKWFGILV   246
WashU_Sbay_Contig678.154   190   VYVESLARINTPSLTGKILYWVVDEFVVQWRELRDDCLPRSKWFGILV   237
WashU_Scas_Contig569.10   181   IYIESLARINSLSLTGKIVYWMADEFIVQWKELELSCAPRAKYFGILT   228
WashU_Sklu_Contig2230.4   180   VYVESLARIDSLSLTGRILYLLVDEFVVQWDELCK-RYPRAKCYGILI   226
WashU_Skud_Contig1650.4   190   VYVESLARINTPSLTGKILYWMVDEFIVQWQELRDNCLPRSKWFGILV   237
WashU_Smik_Contig2810.7   190   VYVESLARINTPSLTGKVLYWLVDEFIVQWQELRDNCLPRSKWFGILV   237
Symbols   :*:******:: ****:::* :.***:*** ** **:* :***
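
In the alignment rows above, the numbers on either side of each segment appear to be the coordinates of the first and last residue of that sequence within the block, so gap characters do not advance the count. The sketch below checks that reading against two rows copied from the alignment; `parse_row` and `expected_end` are hypothetical helper names, and the whitespace-separated row layout is an assumption based on how the rows are printed here.

```python
def parse_row(row):
    """Split an alignment row into (name, start, segment, end)."""
    name, start, segment, end = row.split()
    return name, int(start), segment, int(end)


def expected_end(start, segment):
    """End coordinate implied by the start and the non-gap residues."""
    residues = sum(1 for ch in segment if ch != "-")
    return start + residues - 1


# Two rows for the same sequence, copied from the first and second blocks.
ROWS = [
    "WashU_Scas_Contig569.10   1   "
    "---------MDIFYLTSAILLILTAYIFRFISILPFYRTHPNDR------   35",
    "WashU_Scas_Contig569.10   36   "
    "--ILTAHKGTNGPLHIFVFLGSGGHTGEMLRILQNYKETLLNQDNVLYVG   83",
]

for row in ROWS:
    name, start, segment, end = parse_row(row)
    # True when the printed end matches the count of non-gap residues.
    print(name, start, end, end == expected_end(start, segment))
```

The same check can be run over every row to spot coordinates that drifted during copying or reformatting.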





GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_ALG14/YBR070C:

SGD_Scer_ALG14/YBR070C  Length: 238  Mon Nov  7 14:45:20 2016  Type: P  Check: 482  ..

       1  MKTAYLASLV LIVSTAYVIR LIAILPFFHT QAGTEKDTKD GVNLLKIRKS

      51  SKKPLKIFVF LGSGGHTGEM IRLLENYQDL LLGKSIVYLG YSDEASRQRF

     101  AHFIKKFGHC KVKYYEFMKA REVKATLLQS VKTIIGTLVQ SFVHVVRIRF

     151  AMCGSPHLFL LNGPGTCCII SFWLKIMELL LPLLGSSHIV YVESLARINT

     201  PSLTGKILYW VVDEFIVQWQ ELRDNYLPRS KWFGILV*

Protein Sequence for MIT_Smik_c155_1010:

MIT_Smik_c155_1010  Length: 238  Mon Nov  7 14:45:20 2016  Type: P  Check: 2834  ..

       1  MKTAYLASLV LIVLTAYVIR LVTILPFFHP QTDTQRDTED EVSVLRIQKP

      51  SKKPLKVFIF LGSGGHTGEM IRLLENYKDL LLNGSIIYLG YSDEASKQRF

     101  VDFIKNFTRC KVQYYEFMKA REVKANLVQS VKTIIGTLMQ SFIHVIKIRF

     151  AMCGSPHLFL LNGPGTCCII SFWLKFIELV VLFVDSSHIV YVESLARINT

     201  PSLTGKVLYW LVDEFIVQWQ ELRDNCLPRS KWFGILV*

Protein Sequence for MIT_Spar_c199_1274:

MIT_Spar_c199_1274  Length: 238  Mon Nov  7 14:45:20 2016  Type: P  Check: 2710  ..

       1  MNTAYLASLV LIVMTAYVIR LVTILPFFHT QTSLKKDTKD EASILRIQKP

      51  SKKPLRIFVF LGSGGHTGEM IRLLENYKDL LLNESIVYLG YSDEASRQRF

     101  ASFIKKFSRC KVQYSGFMKA REVKATFLQS VKTIVGTLVQ SFVHVVRIRF

     151  AMCGSPHLFL LNGPGTCCII SFWLKIIELV VPLLGSSHIV YVESLARINT

     201  PSLTGKILYW VVDEFIVQWQ ELRDNCLPRS KWFGILV*

Protein Sequence for MIT_Suva_c110_860:

MIT_Suva_c110_860  Length: 247  Mon Nov  7 14:45:20 2016  Type: P  Check: 9463  ..

       1  MSEVESAYNM NTAYLASLVV IVTALYVVRL VTILPLFHTQ TKVRKDTRDE

      51  ASVFRTQKSS KKRLKIFVFL GSGGHTGEML RLLQNYKDLL LDESILYVGY

     101  SDQASRQRFC SLLQNFPRCQ VRYYEFMKAR EVKATLLQSV KTIIGTLVQS

     151  LVHVVRIRLS MCGSPHLFLL NGPGTCCIIT FWLKIMELVL LFLDSSHIVY

     201  VESLARINTP SLTGKILYWV VDEFVVQWRE LRDDCLPRSK WFGILV*


Protein Sequence for WashU_Sbay_Contig678.154:

WashU_Sbay_Contig678.154  Length: 238  Mon Nov  7 14:45:20 2016  Type: P  Check: 6900  ..

       1  MNTAYLASLV VIVTALYVVR LVTILPLFHT QTKVRKDTRD EASVFRTQKS

      51  SKKRLKIFVF LGSGGHTGEM LRLLQNYKDL LLDESILYVG YSDQASRQRF

     101  CSLLQNFPRC QVRYYEFMKA REVKATLLQS VKTIIGTLVQ SLVHVVRIRL

     151  SMCGSPHLFL LNGPGTCCII TFWLKIMELV LLFLDSSHIV YVESLARINT

     201  PSLTGKILYW VVDEFVVQWR ELRDDCLPRS KWFGILV*

Protein Sequence for WashU_Scas_Contig569.10:

WashU_Scas_Contig569.10  Length: 229  Mon Nov  7 14:45:20 2016  Type: P  Check: 3157  ..

       1  MDIFYLTSAI LLILTAYIFR FISILPFYRT HPNDRILTAH KGTNGPLHIF

      51  VFLGSGGHTG EMLRILQNYK ETLLNQDNVL YVGYSDIDSR NKFSKLLQSA

     101  CKVEYIEFKK AREVNSGLLA SLKSIFLTLM TSLLNVIRIR KSIAFKPHLI

     151  LLNGPGTCCI LVLWFKLLEW ILLFSSSSNI IYIESLARIN SLSLTGKIVY

     201  WMADEFIVQW KELELSCAPR AKYFGILT*

Protein Sequence for WashU_Sklu_Contig2230.4:

WashU_Sklu_Contig2230.4  Length: 227  Mon Nov  7 14:45:20 2016  Type: P  Check: 8436  ..

       1  MHYATILGTT ALVLITTLAV SVISTLPIFR YNSRARLLHS DNGPVHCFVY

      51  LGSGGHTGEM LRLLNNYNAV FFRKDNVLDV GYSDEASLAK FKRMDFKNKE

     101  NVHINYHEFL KAREVNATKT QSLKSILYTL WNSTLVILKM KISTLGKSHL

     151  ILLNGPGTCC IIAVLFKVLQ IVSCTPSKIV YVESLARIDS LSLTGRILYL

     201  LVDEFVVQWD ELCKRYPRAK CYGILI*

Protein Sequence for WashU_Skud_Contig1650.4:

WashU_Skud_Contig1650.4  Length: 238  Mon Nov  7 14:45:20 2016  Type: P  Check: 1833  ..

       1  MKTAYLASLI LIVLTVYVFR LVAILPFFHT QTDTEKNTRN EASVLRMHKP

      51  SKKPLKIFVF LGSGGHTGEM LRLLQNYNDL LLDKSTVYVG YSDQASKQKF

     101  ARLMKNFGHC KVQYYEFMKA REVKATLLQS VKSIIGTLVQ SFVHVIQIRF

     151  AMCGSPHLFL LNGPGTCCII SFWLKLIELI VLFLDSSHIV YVESLARINT

     201  PSLTGKILYW MVDEFIVQWQ ELRDNCLPRS KWFGILV*

Protein Sequence for WashU_Smik_Contig2810.7:

WashU_Smik_Contig2810.7  Length: 238  Mon Nov  7 14:45:20 2016  Type: P  Check: 2834  ..

       1  MKTAYLASLV LIVLTAYVIR LVTILPFFHP QTDTQRDTED EVSVLRIQKP

      51  SKKPLKVFIF LGSGGHTGEM IRLLENYKDL LLNGSIIYLG YSDEASKQRF

     101  VDFIKNFTRC KVQYYEFMKA REVKANLVQS VKTIIGTLMQ SFIHVIKIRF

     151  AMCGSPHLFL LNGPGTCCII SFWLKFIELV VLFVDSSHIV YVESLARINT

     201  PSLTGKVLYW LVDEFIVQWQ ELRDNCLPRS KWFGILV*
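
The GCG-style records above can be turned back into plain sequences or FASTA with a few lines of parsing. The sketch below is a minimal illustration, assuming each record consists of one header line containing `Length:` followed by numbered lines of space-separated residue groups, as printed on this page; `parse_gcg_record` and `to_fasta` are hypothetical names, and the sketch does not recompute the `Check` value.

```python
import re

# One record copied from this page (header line plus numbered sequence lines).
RECORD = """\
SGD_Scer_ALG14/YBR070C  Length: 238  Mon Nov  7 14:45:20 2016  Type: P  Check: 482  ..

       1  MKTAYLASLV LIVSTAYVIR LIAILPFFHT QAGTEKDTKD GVNLLKIRKS

      51  SKKPLKIFVF LGSGGHTGEM IRLLENYQDL LLGKSIVYLG YSDEASRQRF

     101  AHFIKKFGHC KVKYYEFMKA REVKATLLQS VKTIIGTLVQ SFVHVVRIRF

     151  AMCGSPHLFL LNGPGTCCII SFWLKIMELL LPLLGSSHIV YVESLARINT

     201  PSLTGKILYW VVDEFIVQWQ ELRDNYLPRS KWFGILV*
"""


def parse_gcg_record(text):
    """Return (name, declared_length, sequence) for one record."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    header = lines[0]
    name = header.split()[0]
    length = int(re.search(r"Length:\s*(\d+)", header).group(1))
    residues = []
    for ln in lines[1:]:
        # The first token on each line is the coordinate; the rest are residues.
        residues.extend(ln.split()[1:])
    return name, length, "".join(residues)


def to_fasta(name, sequence, width=60):
    """Format a sequence as FASTA, dropping the trailing '*' terminator."""
    sequence = sequence.rstrip("*")
    body = "\n".join(sequence[i:i + width]
                     for i in range(0, len(sequence), width))
    return f">{name}\n{body}\n"


name, length, sequence = parse_gcg_record(RECORD)
print(name, length, len(sequence))
print(to_fasta(name, sequence))
```

For this record the declared length of 238 equals the number of parsed characters including the trailing `*`, so the protein itself is 237 residues long, matching the final coordinate of 237 in the alignment.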