Fungal Sequence Alignment

This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

The other fungal sequences currently shown on this page come from Cliften et al. and Kellis et al. Sequences from additional fungal genomes, drawn from a variety of sources, will be included soon.

ClustalW Protein Alignment and Sequence for YOR334W and Homologs


Symbols:
* = identical
: = strong similarity
. = weak similarity
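
As an illustration of how the three symbols relate to the aligned columns, the following is a minimal Python sketch. The residue groups are the "strong" and "weak" groups as commonly documented for ClustalW; they are not stated on this page and are an assumption here, as is the choice to leave any column containing a gap unmarked. The function names are hypothetical.

```python
# Residue groups as commonly documented for ClustalW (assumed, not taken from this page).
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK", "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = ["CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
               "SNDEQK", "NDEQHK", "NEQHRK", "FVLIM", "HFY"]


def column_symbol(column):
    """Return the symbol for one alignment column given as a string of residues."""
    residues = set(column.upper())
    if "-" in residues:
        return " "  # assumption: a column with any gap gets no symbol
    if len(residues) == 1:
        return "*"  # identical
    if any(residues <= set(group) for group in STRONG_GROUPS):
        return ":"  # strong similarity
    if any(residues <= set(group) for group in WEAK_GROUPS):
        return "."  # weak similarity
    return " "


def consensus_line(rows):
    """Build the symbol line for a list of equal-length aligned rows."""
    if len({len(row) for row in rows}) != 1:
        raise ValueError("aligned rows must all have the same length")
    return "".join(column_symbol("".join(col)) for col in zip(*rows))


# Three distinct row prefixes taken from positions 101-110 of the alignment.
rows = ["IDNSSIDIIP", "IDTSTIDIIP", "IVRSSINIIP"]
print(repr(consensus_line(rows)))
```

Note that the symbol line depends on column position, so it only lines up with the sequences when whitespace is preserved exactly.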

SGD_Scer_MRS2/YOR334W   1   MNRRLLVRSISCFQPLSRITFGRPNTPFLRKYADTSTAANTNSTILRKQL   50
MIT_Sbay_c774_24296   1   MNRRLLVRSISGFKPLSRITLARPLTAFPRHYSDTATTAKTNGTILRKQL   50
MIT_Smik_c507_20743   1   MNRRLLLRSIFGFQPLSRITFGRPNKPFLRHHTDISATTKTNGTILRKQL   50
MIT_Spar_c261_21519   1   MNRRLLVRSISCFQPLSRITLGRPNTAFLRSYADTFATAKTNGTILRKQL   50
WashU_Sbay_Contig673.61   1   MNRRLLVRSISGFKPLSRITLARPLTAFPRHYSDTATTAKTNGTILRKQL   50
WashU_Sklu_Contig1667.2   1   -MPSVFASGIRRLSRINWVSGTLAPLSLARTLSVRQLSQSLP-PPQPQQL   48
WashU_Skud_Contig2005.3   1   MNRRLLVRSICGFQRLSRITLRRQNFPLLRHYSDTSTTAKTNGTILRKQL   50
WashU_Smik_Contig2613.2   1   MNRRLLLRSIFGFQPLSRITFGRPNKPFLRHHTDISATTKTNGTILRKQL   50
Symbols   :: .* :. :. :: .: * : : . . :**

SGD_Scer_MRS2/YOR334W   51   LSLKPISASDSLFISCTVFNSKGNIISMSEKFPKWSFLTEHSLFPRDLRK   100
MIT_Sbay_c774_24296   51   LSLKPISPSDSLFISCTVFNSEGNIISMSEKFSKWSFLTEHSLFPRDLRK   100
MIT_Smik_c507_20743   51   LSLKPISPSDSLFISCTVFNSRGNIISMSEKFPKWSFLTEHSLFPRDLRK   100
MIT_Spar_c261_21519   51   LSLKPISPSDSLFISCTVFNSKGNIISMSEKFPKWSFLTEHSLFPRDLRK   100
WashU_Sbay_Contig673.61   51   LSLKPISPSDSLFISCTVFNSEGNIISMSEKFSKWSFLTEHSLFPRDLRK   100
WashU_Sklu_Contig1667.2   49   LSVKPITPN-DAYVSCTLLNSKGDVTAVSQKFPKWTFLRDHGLYPRDLRK   97
WashU_Skud_Contig2005.3   51   LSLKPISPSDSLFISCTVFNSKGNIIARYEKFRKWAFLTEHSLFARDLRK   100
WashU_Smik_Contig2613.2   51   LSLKPISPSDSLFISCTVFNSRGNIISMSEKFPKWSFLTEHSLFPRDLRK   100
Symbols   **:***:.. . ::***::**.*:: : :** **:** :*.*:.*****

SGD_Scer_MRS2/YOR334W   101   IDNSSIDIIPTIMCKPNCIVINLLHIKALIERDKVYVFDTTNPSAAAKLS   150
MIT_Sbay_c774_24296   101   IDNSSIDIIPTIMCKPDCIVINLLHIKALIERDKVYVFDTTNPSSAAKLS   150
MIT_Smik_c507_20743   101   IDNSSIDIIPTIMCKPNCIVINLLHIKALIERDKVYVFDTTNPSAAAKLS   150
MIT_Spar_c261_21519   101   IDNSSIDIIPTIMCKPNCIVINLLHIKALIERDKVYVFDTTNPSAAAKLS   150
WashU_Sbay_Contig673.61   101   IDNSSIDIIPTIMCKPDCIVINLLHIKALIERDKVYVFDTTNPSSAAKLS   150
WashU_Sklu_Contig1667.2   98   IDTSTIDIIPSIVVKPTCILINLLHIKALIQKNQIFVFDTSNPEAAMKLG   147
WashU_Skud_Contig2005.3   101   IVRSSINIIP----------------------------------------   110
WashU_Smik_Contig2613.2   101   IDNSSIDIIPTIMCKPNCIVINLLHIKALIERDKVYVFDTTNPSAAAKLS   150
Symbols   * *:*:***

SGD_Scer_MRS2/YOR334W   151   VLMYDLESKLSSTKNN----SQFYEHRALESIFINVMSALETDFKLHSQI   196
MIT_Sbay_c774_24296   151   VLMYDLESKLSSTKNN----SQFYEHRALESIFINVMSALETDFKLHSQI   196
MIT_Smik_c507_20743   151   VLMYDLESKLSYTKNN----SQFYEHRALESIFINVMSALETDFKLHSQI   196
MIT_Spar_c261_21519   151   VLMYDLESKLSSTKNN----SQFYEHRALESIFINVMSALETDFKLHSQI   196
WashU_Sbay_Contig673.61   151   VLMYDLESKLSSTKNN----SQFYEHRALESIFINVMSALETDFKLHSQI   196
WashU_Sklu_Contig1667.2   148   VLMYDLESKLSQTNLTPHLTAQLYEHKALESILINVMTCLETEYKQHYSI   197
WashU_Skud_Contig2005.3   111   --AHDVQAELYSYQLT----TYQGSHRTRQSLRFRHHX------------   142
WashU_Smik_Contig2613.2   151   VLMYDLESKLSYTKNN----SQFYEHRALESIFINVMSALETDFKLHSQI   196
Symbols   :*::::* : . : .*:: :*: :.

SGD_Scer_MRS2/YOR334W   197   CIQILNDLENEVNRLKLRHLLIKSKDLTLFYQKTLLIRDLLDELLENDDD   246
MIT_Sbay_c774_24296   197   CIQILNDLENEVNRLKLRHLLIKSKDLTLFYQKTLLIRDLLDELLENDDD   246
MIT_Smik_c507_20743   197   CIQILNDLENEVNRLKLRRLLIKSKDLTLFYQKTLLIRDLLDELLENDDD   246
MIT_Spar_c261_21519   197   CIQILNDLENEVNRLKLRHLLIKSKDLTLFYQKTLLIRDLLDELLENDDD   246
WashU_Sbay_Contig673.61   197   CIQILNDLENEVNRLKLRHLLIKSKDLTLFYQKTLLIRDLLDELLENDDD   246
WashU_Sklu_Contig1667.2   198   CGQILNELEDQIDRDKLRDLLIRSKNLTSFYQKSLLIRDVLDELLESDED   247
WashU_Skud_Contig2005.3   --------------------------------------------------
WashU_Smik_Contig2613.2   197   CIQILNDLENEVNRLKLRRLLIKSKDLTLFYQKTLLIRDLLDELLENDDD   246
Symbols

SGD_Scer_MRS2/YOR334W   247   LANMYLTVKKSPKDNFSDLEMLIETYYTQCDEYVQQSESLIQDIKSTEEI   296
MIT_Sbay_c774_24296   247   LANMYLTVKKSPKDNFSDLEMLIETYYTQCDEYVQQSESLIQDIKSTEEI   296
MIT_Smik_c507_20743   247   LANMYLTVKKSPKDNFSDLEMLIETYYTQCDEYVQQSESLIQDIKSTEEI   296
MIT_Spar_c261_21519   247   LANMYLTVKKSPKDNFSDLEMLIETYYTQCDEYVQQSESLIQDIKSTEEI   296
WashU_Sbay_Contig673.61   247   LANMYLTVKKSPKDNFSDLEMLIETYYTQCDEYVQQSESLIQDIKSTEEI   296
WashU_Sklu_Contig1667.2   248   LASMYLSEQKTEADDXADLEMLLETYYKQCDEYRQQSESLIQDIKSTEEI   297
WashU_Skud_Contig2005.3   --------------------------------------------------
WashU_Smik_Contig2613.2   247   LANMYLTVKKSPKDNFSDLEMLIETYYTQCDEYVQQSESLIQDIKSTEEI   296
Symbols

SGD_Scer_MRS2/YOR334W   297   VNIILDANRNSLMLLELKVTIYTLGFTVASVLPAFYGMNLKNFIEESEWG   346
MIT_Sbay_c774_24296   297   VNIILDANRNSLMLLELKVTIYTLGFTVASVLPAFYGMNLKNFIEESEWG   346
MIT_Smik_c507_20743   297   VNIILDANRNSLMLLELKVTIYTLGFTVATVLPAFYGMNLKNFIEESEWG   346
MIT_Spar_c261_21519   297   VNIILDANRNSLMLLELKVTIYTLGFTVASVLPAFYGMNLKNFIEESEWG   346
WashU_Sbay_Contig673.61   297   VNIILDANRNSLMLLELKVTIYTLGFTVASVLPAFYGMNLKNFIEESEWG   346
WashU_Sklu_Contig1667.2   298   VNIILDANRNSLMLFELKVTIYTLGFTVATMVPAFYGMNLKNFIEESELG   347
WashU_Skud_Contig2005.3   --------------------------------------------------
WashU_Smik_Contig2613.2   297   VNIILDANRNSLMLLELKVTIYTLGFTVATVLPAFYGMNLKNFIEESEWG   346
Symbols

SGD_Scer_MRS2/YOR334W   347   FTSVAVFSIVSALYITKKNFNSLRSVTKMTMYPNSPANSSVYPKTSASIA   396
MIT_Sbay_c774_24296   347   FTSVVVFSIMSALYITKKNFNSLRSVTKMTMYPNSTTNSSSFPKTSISTS   396
MIT_Smik_c507_20743   347   FTSVVVFSIVSGLYITKKNFNSLRSVTKMTMYPNSPVNSSGYSKTSASIS   396
MIT_Spar_c261_21519   347   FTSVVMFSIVSALYITKKNFNSLRSVTKMTMYPNSPANSTAYLKTPASIS   396
WashU_Sbay_Contig673.61   347   FTSVVVFSIMSALYITKKNFNSLRSVTKMTMYPNSTTNSSSFPKTSISTS   396
WashU_Sklu_Contig1667.2   348   FASVVVFSIISAGLVSVANFRALRSVTRLTLMNNHTGDKTTKHIQNAKLA   397
WashU_Skud_Contig2005.3   --------------------------------------------------
WashU_Smik_Contig2613.2   347   FTSVVVFSIVSGLYITKKNFNSLRSVTKMTMYPNSPVNSSGYSKTSASIS   396
Symbols

SGD_Scer_MRS2/YOR334W   397   LTNKLKRRRKWWKSTKQRLGVLLYGSSYTNKANLSNNKINKGFSKVKKFN   446
MIT_Sbay_c774_24296   397   VASKLRERRTWWKTTRQRLGILLYGSNYYNAAMSNNKGNKGFLKLKKFNM   446
MIT_Smik_c507_20743   397   SPNKLRRRRTWWTSTKQRLGILFYGSSYYNKSTLSKNRINKGFSKVKKFN   446
MIT_Spar_c261_21519   397   LSNKLRRRRNWWKSTKQRLGVLLYGSSYYNEANLSNNKINKGLSKLKKFN   446
WashU_Sbay_Contig673.61   397   VASKLRERRTWWKTTRQRLGILLYGSNYYNAAMSNNKGNKGFLKLKKFNM   446
WashU_Sklu_Contig1667.2   398   VDREVPTLWARWKHGARVIWSGNTNYTTVGDGKRRDMIWKWLVDDNKK--   445
WashU_Skud_Contig2005.3   --------------------------------------------------
WashU_Smik_Contig2613.2   397   SPNKLRRRRTWWTSTKQRLGILFYGSSYYNKSTLSKNRINKGFSKVKKFN   446
Symbols

SGD_Scer_MRS2/YOR334W   447   MENDIKNKQNRDMIWKWLIEDKKN   470
MIT_Sbay_c774_24296   447   ENDLKNKQNRDMIWRWLIEDKKN-   469
MIT_Smik_c507_20743   447   MENDIKNKQNRDMIWKWLIEDKKN   470
MIT_Spar_c261_21519   447   MENDIKNKQNRDMIWKWLIEDKKN   470
WashU_Sbay_Contig673.61   447   ENDLKNKQNRDMIWRWLIEDKKN-   469
WashU_Sklu_Contig1667.2   ------------------------
WashU_Skud_Contig2005.3   ------------------------
WashU_Smik_Contig2613.2   447   MENDIKNKQNRDMIWKWLIEDKKN   470
Symbols


All sequences in the alignment can be downloaded in FASTA format.
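
For readers who want to derive FASTA text from the alignment rows themselves, here is a hedged Python sketch. It assumes each row has the form "name start sequence end", that gap-only rows carry a name and dashes but no coordinates, and that coordinates count residues only (gaps do not advance them). The function names `parse_rows` and `to_fasta` are hypothetical and are not part of this site.

```python
def parse_rows(lines):
    """Collect aligned segments per sequence name from 'name start seq end' rows."""
    segments = {}
    for line in lines:
        fields = line.split()
        if len(fields) == 4 and fields[1].isdigit() and fields[3].isdigit():
            name, start, seq, end = fields
            residues = seq.replace("-", "")
            # Coordinates count residues only, so gaps must not advance them.
            if int(end) - int(start) + 1 != len(residues):
                raise ValueError(f"coordinate mismatch in row for {name}")
        elif len(fields) == 2 and set(fields[1]) == {"-"}:
            name, seq = fields  # gap-only row without coordinates
        else:
            continue  # skip symbol lines, legends and blank lines
        segments.setdefault(name, []).append(seq)
    return segments


def to_fasta(segments, width=60):
    """Join each sequence's segments, drop gaps, and wrap the result as FASTA."""
    records = []
    for name, parts in segments.items():
        seq = "".join(parts).replace("-", "")
        wrapped = "\n".join(seq[i:i + width] for i in range(0, len(seq), width))
        records.append(f">{name}\n{wrapped}")
    return "\n".join(records) + "\n"


# Three consecutive rows of one sequence from the alignment (residues 101-142,
# followed by a gap-only row); dashes are generated to avoid miscounting.
example = [
    "WashU_Skud_Contig2005.3   101   IVRSSINIIP" + "-" * 40 + "   110",
    "WashU_Skud_Contig2005.3   111   --AHDVQAELYSYQLT----TYQGSHRTRQSLRFRHHX"
    + "-" * 12 + "   142",
    "WashU_Skud_Contig2005.3   " + "-" * 50,
]
print(to_fasta(parse_rows(example)), end="")
```

Because only three rows are supplied, the example output covers residues 101 to 142 of that sequence; feeding in every block of the alignment would yield the full-length records.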
GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_MRS2/YOR334W:

SGD_Scer_MRS2/YOR334W  Length: 471  Sun Dec 11 01:08:30 2011  Type: P  Check: 7677  ..

       1  MNRRLLVRSI SCFQPLSRIT FGRPNTPFLR KYADTSTAAN TNSTILRKQL

      51  LSLKPISASD SLFISCTVFN SKGNIISMSE KFPKWSFLTE HSLFPRDLRK

     101  IDNSSIDIIP TIMCKPNCIV INLLHIKALI ERDKVYVFDT TNPSAAAKLS

     151  VLMYDLESKL SSTKNNSQFY EHRALESIFI NVMSALETDF KLHSQICIQI

     201  LNDLENEVNR LKLRHLLIKS KDLTLFYQKT LLIRDLLDEL LENDDDLANM

     251  YLTVKKSPKD NFSDLEMLIE TYYTQCDEYV QQSESLIQDI KSTEEIVNII

     301  LDANRNSLML LELKVTIYTL GFTVASVLPA FYGMNLKNFI EESEWGFTSV

     351  AVFSIVSALY ITKKNFNSLR SVTKMTMYPN SPANSSVYPK TSASIALTNK

     401  LKRRRKWWKS TKQRLGVLLY GSSYTNKANL SNNKINKGFS KVKKFNMEND

     451  IKNKQNRDMI WKWLIEDKKN *

Protein Sequence for MIT_Sbay_c774_24296:

MIT_Sbay_c774_24296  Length: 470  Sun Dec 11 01:08:30 2011  Type: P  Check: 7749  ..

       1  MNRRLLVRSI SGFKPLSRIT LARPLTAFPR HYSDTATTAK TNGTILRKQL

      51  LSLKPISPSD SLFISCTVFN SEGNIISMSE KFSKWSFLTE HSLFPRDLRK

     101  IDNSSIDIIP TIMCKPDCIV INLLHIKALI ERDKVYVFDT TNPSSAAKLS

     151  VLMYDLESKL SSTKNNSQFY EHRALESIFI NVMSALETDF KLHSQICIQI

     201  LNDLENEVNR LKLRHLLIKS KDLTLFYQKT LLIRDLLDEL LENDDDLANM

     251  YLTVKKSPKD NFSDLEMLIE TYYTQCDEYV QQSESLIQDI KSTEEIVNII

     301  LDANRNSLML LELKVTIYTL GFTVASVLPA FYGMNLKNFI EESEWGFTSV

     351  VVFSIMSALY ITKKNFNSLR SVTKMTMYPN STTNSSSFPK TSISTSVASK

     401  LRERRTWWKT TRQRLGILLY GSNYYNAAMS NNKGNKGFLK LKKFNMENDL

     451  KNKQNRDMIW RWLIEDKKN*

Protein Sequence for MIT_Smik_c507_20743:

MIT_Smik_c507_20743  Length: 471  Sun Dec 11 01:08:30 2011  Type: P  Check: 439  ..

       1  MNRRLLLRSI FGFQPLSRIT FGRPNKPFLR HHTDISATTK TNGTILRKQL

      51  LSLKPISPSD SLFISCTVFN SRGNIISMSE KFPKWSFLTE HSLFPRDLRK

     101  IDNSSIDIIP TIMCKPNCIV INLLHIKALI ERDKVYVFDT TNPSAAAKLS

     151  VLMYDLESKL SYTKNNSQFY EHRALESIFI NVMSALETDF KLHSQICIQI

     201  LNDLENEVNR LKLRRLLIKS KDLTLFYQKT LLIRDLLDEL LENDDDLANM

     251  YLTVKKSPKD NFSDLEMLIE TYYTQCDEYV QQSESLIQDI KSTEEIVNII

     301  LDANRNSLML LELKVTIYTL GFTVATVLPA FYGMNLKNFI EESEWGFTSV

     351  VVFSIVSGLY ITKKNFNSLR SVTKMTMYPN SPVNSSGYSK TSASISSPNK

     401  LRRRRTWWTS TKQRLGILFY GSSYYNKSTL SKNRINKGFS KVKKFNMEND

     451  IKNKQNRDMI WKWLIEDKKN *

Protein Sequence for MIT_Spar_c261_21519:

MIT_Spar_c261_21519  Length: 471  Sun Dec 11 01:08:30 2011  Type: P  Check: 6166  ..

       1  MNRRLLVRSI SCFQPLSRIT LGRPNTAFLR SYADTFATAK TNGTILRKQL

      51  LSLKPISPSD SLFISCTVFN SKGNIISMSE KFPKWSFLTE HSLFPRDLRK

     101  IDNSSIDIIP TIMCKPNCIV INLLHIKALI ERDKVYVFDT TNPSAAAKLS

     151  VLMYDLESKL SSTKNNSQFY EHRALESIFI NVMSALETDF KLHSQICIQI

     201  LNDLENEVNR LKLRHLLIKS KDLTLFYQKT LLIRDLLDEL LENDDDLANM

     251  YLTVKKSPKD NFSDLEMLIE TYYTQCDEYV QQSESLIQDI KSTEEIVNII

     301  LDANRNSLML LELKVTIYTL GFTVASVLPA FYGMNLKNFI EESEWGFTSV

     351  VMFSIVSALY ITKKNFNSLR SVTKMTMYPN SPANSTAYLK TPASISLSNK

     401  LRRRRNWWKS TKQRLGVLLY GSSYYNEANL SNNKINKGLS KLKKFNMEND

     451  IKNKQNRDMI WKWLIEDKKN *

Protein Sequence for WashU_Sbay_Contig673.61:

WashU_Sbay_Contig673.61  Length: 470  Sun Dec 11 01:08:30 2011  Type: P  Check: 7749  ..

       1  MNRRLLVRSI SGFKPLSRIT LARPLTAFPR HYSDTATTAK TNGTILRKQL

      51  LSLKPISPSD SLFISCTVFN SEGNIISMSE KFSKWSFLTE HSLFPRDLRK

     101  IDNSSIDIIP TIMCKPDCIV INLLHIKALI ERDKVYVFDT TNPSSAAKLS

     151  VLMYDLESKL SSTKNNSQFY EHRALESIFI NVMSALETDF KLHSQICIQI

     201  LNDLENEVNR LKLRHLLIKS KDLTLFYQKT LLIRDLLDEL LENDDDLANM

     251  YLTVKKSPKD NFSDLEMLIE TYYTQCDEYV QQSESLIQDI KSTEEIVNII

     301  LDANRNSLML LELKVTIYTL GFTVASVLPA FYGMNLKNFI EESEWGFTSV

     351  VVFSIMSALY ITKKNFNSLR SVTKMTMYPN STTNSSSFPK TSISTSVASK

     401  LRERRTWWKT TRQRLGILLY GSNYYNAAMS NNKGNKGFLK LKKFNMENDL

     451  KNKQNRDMIW RWLIEDKKN*

Protein Sequence for WashU_Sklu_Contig1667.2:

WashU_Sklu_Contig1667.2  Length: 446  Sun Dec 11 01:08:30 2011  Type: P  Check: 8097  ..

       1  MPSVFASGIR RLSRINWVSG TLAPLSLART LSVRQLSQSL PPPQPQQLLS

      51  VKPITPNDAY VSCTLLNSKG DVTAVSQKFP KWTFLRDHGL YPRDLRKIDT

     101  STIDIIPSIV VKPTCILINL LHIKALIQKN QIFVFDTSNP EAAMKLGVLM

     151  YDLESKLSQT NLTPHLTAQL YEHKALESIL INVMTCLETE YKQHYSICGQ

     201  ILNELEDQID RDKLRDLLIR SKNLTSFYQK SLLIRDVLDE LLESDEDLAS

     251  MYLSEQKTEA DDXADLEMLL ETYYKQCDEY RQQSESLIQD IKSTEEIVNI

     301  ILDANRNSLM LFELKVTIYT LGFTVATMVP AFYGMNLKNF IEESELGFAS

     351  VVVFSIISAG LVSVANFRAL RSVTRLTLMN NHTGDKTTKH IQNAKLAVDR

     401  EVPTLWARWK HGARVIWSGN TNYTTVGDGK RRDMIWKWLV DDNKK*


Protein Sequence for WashU_Skud_Contig2005.3:

WashU_Skud_Contig2005.3  Length: 142  Sun Dec 11 01:08:30 2011  Type: P  Check: 6983  ..

       1  MNRRLLVRSI CGFQRLSRIT LRRQNFPLLR HYSDTSTTAK TNGTILRKQL

      51  LSLKPISPSD SLFISCTVFN SKGNIIARYE KFRKWAFLTE HSLFARDLRK

     101  IVRSSINIIP AHDVQAELYS YQLTTYQGSH RTRQSLRFRH HX


Protein Sequence for WashU_Smik_Contig2613.2:

WashU_Smik_Contig2613.2  Length: 471  Sun Dec 11 01:08:30 2011  Type: P  Check: 439  ..

       1  MNRRLLLRSI FGFQPLSRIT FGRPNKPFLR HHTDISATTK TNGTILRKQL

      51  LSLKPISPSD SLFISCTVFN SRGNIISMSE KFPKWSFLTE HSLFPRDLRK

     101  IDNSSIDIIP TIMCKPNCIV INLLHIKALI ERDKVYVFDT TNPSAAAKLS

     151  VLMYDLESKL SYTKNNSQFY EHRALESIFI NVMSALETDF KLHSQICIQI

     201  LNDLENEVNR LKLRRLLIKS KDLTLFYQKT LLIRDLLDEL LENDDDLANM

     251  YLTVKKSPKD NFSDLEMLIE TYYTQCDEYV QQSESLIQDI KSTEEIVNII

     301  LDANRNSLML LELKVTIYTL GFTVATVLPA FYGMNLKNFI EESEWGFTSV

     351  VVFSIVSGLY ITKKNFNSLR SVTKMTMYPN SPVNSSGYSK TSASISSPNK

     401  LRRRRTWWTS TKQRLGILFY GSSYYNKSTL SKNRINKGFS KVKKFNMEND

     451  IKNKQNRDMI WKWLIEDKKN *
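
The GCG-style records above can be reduced to plain sequence strings with a few lines of Python. The sketch below is only an illustration: it assumes the header ends at the ".." divider and that everything after it is position numbers, spacing and residues. The function name `parse_gcg` is hypothetical.

```python
import re


def parse_gcg(text):
    """Parse one GCG-style record into (name, declared_length, sequence)."""
    header, _, body = text.partition("..")
    match = re.search(r"(\S+)\s+Length:\s*(\d+)", header)
    if match is None:
        raise ValueError("no 'Length:' field found before the '..' divider")
    name, declared = match.group(1), int(match.group(2))
    # Drop position numbers and spacing; keep letters and a trailing '*'.
    sequence = re.sub(r"[^A-Za-z*]", "", body)
    return name, declared, sequence


record = """WashU_Skud_Contig2005.3  Length: 142  Sun Dec 11 01:08:30 2011  Type: P  Check: 6983  ..

       1  MNRRLLVRSI CGFQRLSRIT LRRQNFPLLR HYSDTSTTAK TNGTILRKQL

      51  LSLKPISPSD SLFISCTVFN SKGNIIARYE KFRKWAFLTE HSLFARDLRK

     101  IVRSSINIIP AHDVQAELYS YQLTTYQGSH RTRQSLRFRH HX
"""

name, declared, sequence = parse_gcg(record)
print(name, declared, len(sequence))
```

In the records that end with "*", the declared Length appears to count that terminator as one position (for example, Length: 471 for 470 residues plus "*"), so compare `len(sequence)` rather than the residue count alone against the header.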