Fungal Sequence Alignment

This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

The other fungal sequences currently shown come from Cliften et al. (the WashU_ entries) and Kellis et al. (the MIT_ entries). Sequences from additional fungal genomes, drawn from a variety of sources, will be included soon.

ClustalW Protein Alignment and Sequence for YEL042W and Homologs



Symbols:
* = identical
: = strong similarity
. = weak similarity
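
The legend above follows the usual Clustal convention. As a rough illustration, the following Python sketch derives a symbol line from aligned rows. It assumes the strong and weak residue groups given in the ClustalW documentation and treats any column containing a gap as unmarked; the function names are illustrative, and this is not necessarily how this page computes its symbol lines.

```python
# Residue groups as listed in the ClustalW documentation (assumed here).
STRONG_GROUPS = ("STA", "NEQK", "NHQK", "NDEQ", "QHRK",
                 "MILV", "MILF", "HY", "FYW")
WEAK_GROUPS = ("CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
               "SNDEQK", "NDEQHK", "NEQHRK", "FVLIM", "HFY")


def column_symbol(column):
    """Return '*', ':', '.' or ' ' for one alignment column."""
    residues = set(column.upper())
    if "-" in residues:
        return " "  # assumption: gapped columns are left unmarked
    if len(residues) == 1:
        return "*"  # identical in every sequence
    if any(residues <= set(group) for group in STRONG_GROUPS):
        return ":"  # all residues fall within one strong group
    if any(residues <= set(group) for group in WEAK_GROUPS):
        return "."  # all residues fall within one weak group
    return " "


def consensus(rows):
    """Build the symbol line for equal-length aligned rows."""
    return "".join(column_symbol("".join(col)) for col in zip(*rows))


# First 12 columns of the seven aligned sequences shown on this page.
example_rows = [
    "MAPIFRNYRFAI",  # SGD_Scer_GDA1/YEL042W
    "MAPIFRNYRFAI",  # MIT_Sbay_c281_6076
    "MAPIFRNYRFAI",  # MIT_Smik_c281_5717
    "MAPIFRNYRFAI",  # MIT_Spar_c355_5892
    "MAPIFRNYRFAI",  # WashU_Sbay_Contig676.29
    "MPSFFRSYRFII",  # WashU_Scas_Contig608.7
    "MAPIFRNYRFAI",  # WashU_Skud_Contig1931.1
]
print(repr(consensus(example_rows)))
```

For these twelve columns the sketch reproduces the start of the first symbol line in the alignment.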

SGD_Scer_GDA1/YEL042W   1   MAPIFRNYRFAIGAFAVIMLILLIKTS-SIGPPSIARTVTPNASIPKTPE   49
MIT_Sbay_c281_6076   1   MAPIFRNYRFAITAFAVIMLILLIKTS-TTN-IDIARKVSPTATIPKTPE   48
MIT_Smik_c281_5717   1   MAPIFRNYRFAIGAFAVIMLILLIKTS-SVGPNSIARTVSPTAEIPKTPD   49
MIT_Spar_c355_5892   1   MAPIFRNYRFAIGAFAVIMLILLIKTS-SMGPSSIARTVATTASIPKTPE   49
WashU_Sbay_Contig676.29   1   MAPIFRNYRFAITAFAVIMLILLIKTS-TTN-IDIARKVSPTATIPKTPE   48
WashU_Scas_Contig608.7   1   MPSFFRSYRFIIGAFAAIMLLLLIRSSTTESTIDIARTVASIASRPSTPQ   50
WashU_Skud_Contig1931.1   1   MAPIFRNYRFAIGAFAAIMLILLIKTS-STGSLSISRTVSPTASVPKTSG   49
Symbols   *..:**.*** * ***.***:***::* : . .*:*.*:. * *.*.



SGD_Scer_GDA1/YEL042W   50   DISILPVNDEPGYLQDSKTEQNYPELADAVKSQTS--QTCSEEHKYVIMI   97
MIT_Sbay_c281_6076   49   DVSILPISDKPGYIDDKKTEQNDPDLADAVKSQTS--STCKKDHKYVIMI   96
MIT_Smik_c281_5717   50   DASISPIDGTPDYIQNPKTEENFPGLADVVESQTG--QTCTKEHRYVIMI   97
MIT_Spar_c355_5892   50   DVSISPINDEPGYIHDPKTEQNYPELADAVKSQTS--QTCSEEHKYVIMI   97
WashU_Sbay_Contig676.29   49   DVSILPISDKPGYIDDKKTEQNDPDLADAVKSQTS--STCKKDHKYVIMI   96
WashU_Scas_Contig608.7   51   DVNALPISNEPGYVSDSKTEQNNPEVATLVEESTNKKSKCNKDHEYVVMI   100
WashU_Skud_Contig1931.1   50   DVSTLPFGDKPGYIGNPKAEQDYPEMADAVKSQTS--QKCSEEHRYVVMI   97
Symbols   * . *... *.*: : *:*:: * :* *:..*. ..*.::*.**:**



SGD_Scer_GDA1/YEL042W   98   DAGSTGSRVHIYKFDVCTSPPTLLDEKFDMLEPGLSSFDTDSVGAANSLD   147
MIT_Sbay_c281_6076   97   DGGSSGSRIHVYEFDICTSPPTLVNETFKMLEPGLSSFDNDSAGAAESLD   146
MIT_Smik_c281_5717   98   DAGSTGSRIHVYEFDVCTSPPTLIKEKFEMLEPGLSSFDTDSVSAAKSLN   147
MIT_Spar_c355_5892   98   DAGSTGSRIHIYEFDVCTSPPTLLYEKFEMLEPGLSSFDTDSVGAANSLD   147
WashU_Sbay_Contig676.29   97   DGGSSGSRIHVYEFDICTSPPTLVNETFKMLEPGLSSFDNDSAGAAESLD   146
WashU_Scas_Contig608.7   101   DAGSTGSRVHVYEFDVCSQPPALINESFKMLKPGLSSFDTDAEGAAKSLD   150
WashU_Skud_Contig1931.1   98   DAGSSGSRVHVYEFDVCTSPPTLINEEFKMLTPGLSSYDTDAAGAAESLD   147
Symbols   *.**:***:*:*:**:*:.**:*: * *.** *****:*.*: .**:**:



SGD_Scer_GDA1/YEL042W   148   PLLKVAMNYVPIKARSCTPVAVKATAGLRLLGDAKSSKILSAVRDHLEKD   197
MIT_Sbay_c281_6076   147   PLLTAAMAAVPVKARRCTPVSVKATAGLRLLGEAKSAKILKAIRDHLEKD   196
MIT_Smik_c281_5717   148   PLLEVAMEFVPSKAKKCTPIAVKATAGLRLLGTAKSSKILSAVRDHLEKK   197
MIT_Spar_c355_5892   148   PLLEVAMKYVPLKARSCTPVAVKATAGLRLLGDAKSSKILSAVRDHLEKD   197
WashU_Sbay_Contig676.29   147   PLLTAAMAAVPVKARRCTPVSVKATAGLRLLGEAKSAKILKAIRDHLEKD   196
WashU_Scas_Contig608.7   151   PLLQVALDAVPEKKRSCTPVAVKATAGLRLLGDTKSAKILQAVRSHLEKD   200
WashU_Skud_Contig1931.1   148   SLLDFAVDNVPLKARGCTPVAVRATAGLRIIGDAKSKKILTAVTNHLEKD   197
Symbols   .** *: ** * : ***::*:******::* :** *** *: .****.



SGD_Scer_GDA1/YEL042W   198   YPFPVVEGDGVSIMGGDEEGVFAWITTNYLLGNIGANGPKLPTAAVFDLG   247
MIT_Sbay_c281_6076   197   YPFPVVEGDGISIMGGDEEGVFAWITTNYLLGNIGTAGSKLPTSAVFDLG   246
MIT_Smik_c281_5717   198   YPFPVVEDDGISIMSGEEEGVFAWITTNYLLGNIGTDGPKLPTAAIFDLG   247
MIT_Spar_c355_5892   198   YPFPVVEKDGVSIMGGDEEGVFAWITTNYLLGNIGTNGPKLPTAAVFDLG   247
WashU_Sbay_Contig676.29   197   YPFPVVEGDGISIMGGDEEGVFAWITTNYLLGNIGTAGSKLPTSAVFDLG   246
WashU_Scas_Contig608.7   201   YPFAVVDGDGISIMSGDEEGVYAWVTTNYLLGNIGTG-SKLATSAVFDLG   249
WashU_Skud_Contig1931.1   198   YPFPIAEG-SVSIMDGDEEGVFAWITTNYLLKNIGTEGAKLPTAAVFDLG   246
Symbols   ***.:.: .:***.*:****:**:****** ***: .**.*:*:****



SGD_Scer_GDA1/YEL042W   248   GGSTQIVFEPTFPINEKMVDGEHKFDLKFGDENYTLYQFSHLGYGLKEGR   297
MIT_Sbay_c281_6076   247   GGSTQIVFEPTFPPNEKMVDGEHKFDLNFGGEKYTLYQFSHLGYGLNQVR   296
MIT_Smik_c281_5717   248   GGSTQIVFEPTYSPNEKMIDGEHKYDLKFGGKNYTLYQFSHLAYGLKEGR   297
MIT_Spar_c355_5892   248   GGSTQIVFEPTFSANEKMVDGEHKFDLKFGDENYTLYQFSHLGYGLKEGR   297
WashU_Sbay_Contig676.29   247   GGSTQIVFEPTFPPNEKMVDGEHKFDLNFGGEKYTLYQFSHLGYGLNQVR   296
WashU_Scas_Contig608.7   250   GGSTQIVFEPTFPPNEEMVDGEHKYELRFGGQDYSLYQFSHLGYGLMEGR   299
WashU_Skud_Contig1931.1   247   GGSTQIVFEPTFPENEKMVEGEHKYDLNFGGKIYTLYQFSHLRYGLMEGR   296
Symbols   ***********:. **:*::****::*.**.: *:******* *** : *



SGD_Scer_GDA1/YEL042W   298   NKVNSVLVENALKDGKILKGDNTKTHQLSSPCLPPKVNATNEKVTLESKE   347
MIT_Sbay_c281_6076   297   NKINSVLVENALKEGTILNGDVSTAHNLSSPCLPPKVNALKEKVKLDSGE   346
MIT_Smik_c281_5717   298   NKINSVLVEKALKNGEIKEGDNERTHTLLSPCLPPKTNATSEVVKLSSKK   347
MIT_Spar_c355_5892   298   NKVNSVLLENAIKDGRILKGDNTKTHELLSPCLPPKVNATKEKVTLESKE   347
WashU_Sbay_Contig676.29   297   NKINSVLVENALKEGTILNGDVSTAHNLSSPCLPPKVNALKEKVKLDSGE   346
WashU_Scas_Contig608.7   300   NKINQLLVETAIKAGTIKKGDYTPSVALHSPCLPPNVNVTQEKVKLSSKE   349
WashU_Skud_Contig1931.1   297   KRINSVLVQNAIKDGKITKGDGSKTHKIMSPCLPPKVNSSNEKVELAAGE   346
Symbols   :::*.:*::.*:* * * :** : : ******:.* .* * * : :



SGD_Scer_GDA1/YEL042W   348   TYTIDFIGPDEPSGAQCRFLTDEILNKDAQCQSPPCSFNGVHQPSLVRTF   397
MIT_Sbay_c281_6076   347   TYIIDFIGPEVPSGPQCRFLADSILNKDAECKSPPCSFNGAHQPSLVRTF   396
MIT_Smik_c281_5717   348   TYTIDFIGPDEPTGTLCRSLTDQILNKDAACQTPPCSFNGIHQPSLVRTF   397
MIT_Spar_c355_5892   348   TYTIDFIGPDEPSGAQCRFLTDQILNKDAECQFPPCSFNGVHQPSLVRTF   397
WashU_Sbay_Contig676.29   347   TYIIDFIGPEVPSGPQCRFLADSILNKDAECKSPPCSFNGAHQPSLVRTF   396
WashU_Scas_Contig608.7   350   TYVVDFIGPKVASGAQCRFLSDQILNKDAKCTTKPCSFNGVHQPSLVHTF   399
WashU_Skud_Contig1931.1   347   TYTVDFIGPDVPTGTQCRFLTDQILNKDAKCQSPPCSFNGVHQPSMVRTF   396
Symbols   ** :*****. .:*. ** *:*.****** * ****** ****:*:**



SGD_Scer_GDA1/YEL042W   398   KESNDIYIFSYFYDRTRPLGMPLSFTLNELNDLARIVCKGEETWNSVFSG   447
MIT_Sbay_c281_6076   397   KETNDLYIFSYFYDRTHPLGMPLTFTLNELMDLARTVCNGEEIWKSVFTG   446
MIT_Smik_c281_5717   398   KESNDMYIFSYFYDRTRPLGMPLSFTLKELWDLTSAVCKGKETWKSVFGS   447
MIT_Spar_c355_5892   398   KESNDIYIFSYFYDRTRPLGMPLSFTLNELKDLARTVCNGEETWKSVFGG   447
WashU_Sbay_Contig676.29   397   KETNDLYIFSYFYDRTHPLGMPLTFTLNELMDLARTVCNGEEIWKSVFTG   446
WashU_Scas_Contig608.7   400   KETNDLYIFSYFYDRTHTLGMPLSFTLNELADLAKMVCDGEDTWESVLSE   449
WashU_Skud_Contig1931.1   397   KELNDIYIFSFFYDRTHPLGMPSSFTLNELMDLTRTVCSGEETWKSVFSG   446
Symbols   ** **:****:*****:.**** :***:** **: **.*:: *:**:



SGD_Scer_GDA1/YEL042W   448   IAGSLDELESDSHFCLDLSFQVSLLHTGYDIPLQRELRTGKKIANKEIGW   497
MIT_Sbay_c281_6076   447   IEGSLDKLRSDPHFCMDLSFQVSLLHTGYDIPLNRELKTAKTLAKNEIGW   496
MIT_Smik_c281_5717   448   IEGSLDALESDPHFCLDLSFQLSLLHTGYDIPLERELKTAEKIAGKEIGW   497
MIT_Spar_c355_5892   448   IAGSLDELESDSHFCLDLSFQVSLLHTGYDIPLQRELRTGEKIANKEIGW   497
WashU_Sbay_Contig676.29   447   IEGSLDKLRSDPHFCMDLSFQVSLLHTGYDIPLNRELKTAKTLAKNEIGW   496
WashU_Scas_Contig608.7   450   IDGSLDALVKDPYFCQDLSFQVSLLHTGYDIPLHRELKTAETIAGNELGW   499
WashU_Skud_Contig1931.1   447   IEGSLDELKSDPHYCLDLSFQVSLLHTGYDIPLYRELRTAEKIDDTEIGW   496
Symbols   * **** * .*.::* *****:*********** ***:*.:.: .*:**



SGD_Scer_GDA1/YEL042W   498   CLGASLPLLKADNWKCKIQSA-   518
MIT_Sbay_c281_6076   497   CLGASLPLLESDNWKCKLSQIE   518
MIT_Smik_c281_5717   498   CLGASLPLLESDNWKCKVSLVE   519
MIT_Spar_c355_5892   498   CLGASLPLLKPDNWKCKLSQIE   519
WashU_Sbay_Contig676.29   497   CLGASLPLLESDNWKCKLSQIE   518
WashU_Scas_Contig608.7   500   CLGASLPLLESDNWKCRVDKLQ   521
WashU_Skud_Contig1931.1   497   SLGASLSLLES-DFECKVSQIE   517
Symbols   .*****.**:. :::*::.
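
Each sequence row in the alignment gives a name, a start coordinate, the aligned residues, and an end coordinate. A small Python sketch for reading such a row and checking that the coordinates agree with the number of ungapped residues might look like this. The helper names are hypothetical, and the sketch assumes whitespace-separated fields with "-" as the only gap character.

```python
def parse_row(line):
    """Split an alignment row into (name, start, residues, end)."""
    name, start, residues, end = line.split()
    return name, int(start), residues, int(end)


def coordinates_consistent(line):
    """True if end - start + 1 equals the count of non-gap residues."""
    _, start, residues, end = parse_row(line)
    ungapped = residues.replace("-", "")
    return end - start + 1 == len(ungapped)


# Two rows copied from the last block of the alignment.
scer_row = "SGD_Scer_GDA1/YEL042W   498   CLGASLPLLKADNWKCKIQSA-   518"
skud_row = "WashU_Skud_Contig1931.1   497   SLGASLSLLES-DFECKVSQIE   517"

print(parse_row(scer_row))
print(coordinates_consistent(scer_row), coordinates_consistent(skud_row))
```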





GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_GDA1/YEL042W:

SGD_Scer_GDA1/YEL042W  Length: 519  Sat Dec 10 06:11:56 2011  Type: P  Check: 4631  ..

       1  MAPIFRNYRF AIGAFAVIML ILLIKTSSIG PPSIARTVTP NASIPKTPED

      51  ISILPVNDEP GYLQDSKTEQ NYPELADAVK SQTSQTCSEE HKYVIMIDAG

     101  STGSRVHIYK FDVCTSPPTL LDEKFDMLEP GLSSFDTDSV GAANSLDPLL

     151  KVAMNYVPIK ARSCTPVAVK ATAGLRLLGD AKSSKILSAV RDHLEKDYPF

     201  PVVEGDGVSI MGGDEEGVFA WITTNYLLGN IGANGPKLPT AAVFDLGGGS

     251  TQIVFEPTFP INEKMVDGEH KFDLKFGDEN YTLYQFSHLG YGLKEGRNKV

     301  NSVLVENALK DGKILKGDNT KTHQLSSPCL PPKVNATNEK VTLESKETYT

     351  IDFIGPDEPS GAQCRFLTDE ILNKDAQCQS PPCSFNGVHQ PSLVRTFKES

     401  NDIYIFSYFY DRTRPLGMPL SFTLNELNDL ARIVCKGEET WNSVFSGIAG

     451  SLDELESDSH FCLDLSFQVS LLHTGYDIPL QRELRTGKKI ANKEIGWCLG

     501  ASLPLLKADN WKCKIQSA*

Protein Sequence for MIT_Sbay_c281_6076:

MIT_Sbay_c281_6076  Length: 519  Sat Dec 10 06:11:56 2011  Type: P  Check: 1977  ..

       1  MAPIFRNYRF AITAFAVIML ILLIKTSTTN IDIARKVSPT ATIPKTPEDV

      51  SILPISDKPG YIDDKKTEQN DPDLADAVKS QTSSTCKKDH KYVIMIDGGS

     101  SGSRIHVYEF DICTSPPTLV NETFKMLEPG LSSFDNDSAG AAESLDPLLT

     151  AAMAAVPVKA RRCTPVSVKA TAGLRLLGEA KSAKILKAIR DHLEKDYPFP

     201  VVEGDGISIM GGDEEGVFAW ITTNYLLGNI GTAGSKLPTS AVFDLGGGST

     251  QIVFEPTFPP NEKMVDGEHK FDLNFGGEKY TLYQFSHLGY GLNQVRNKIN

     301  SVLVENALKE GTILNGDVST AHNLSSPCLP PKVNALKEKV KLDSGETYII

     351  DFIGPEVPSG PQCRFLADSI LNKDAECKSP PCSFNGAHQP SLVRTFKETN

     401  DLYIFSYFYD RTHPLGMPLT FTLNELMDLA RTVCNGEEIW KSVFTGIEGS

     451  LDKLRSDPHF CMDLSFQVSL LHTGYDIPLN RELKTAKTLA KNEIGWCLGA

     501  SLPLLESDNW KCKLSQIE*

Protein Sequence for MIT_Smik_c281_5717:

MIT_Smik_c281_5717  Length: 520  Sat Dec 10 06:11:56 2011  Type: P  Check: 2931  ..

       1  MAPIFRNYRF AIGAFAVIML ILLIKTSSVG PNSIARTVSP TAEIPKTPDD

      51  ASISPIDGTP DYIQNPKTEE NFPGLADVVE SQTGQTCTKE HRYVIMIDAG

     101  STGSRIHVYE FDVCTSPPTL IKEKFEMLEP GLSSFDTDSV SAAKSLNPLL

     151  EVAMEFVPSK AKKCTPIAVK ATAGLRLLGT AKSSKILSAV RDHLEKKYPF

     201  PVVEDDGISI MSGEEEGVFA WITTNYLLGN IGTDGPKLPT AAIFDLGGGS

     251  TQIVFEPTYS PNEKMIDGEH KYDLKFGGKN YTLYQFSHLA YGLKEGRNKI

     301  NSVLVEKALK NGEIKEGDNE RTHTLLSPCL PPKTNATSEV VKLSSKKTYT

     351  IDFIGPDEPT GTLCRSLTDQ ILNKDAACQT PPCSFNGIHQ PSLVRTFKES

     401  NDMYIFSYFY DRTRPLGMPL SFTLKELWDL TSAVCKGKET WKSVFGSIEG

     451  SLDALESDPH FCLDLSFQLS LLHTGYDIPL ERELKTAEKI AGKEIGWCLG

     501  ASLPLLESDN WKCKVSLVE*

Protein Sequence for MIT_Spar_c355_5892:

MIT_Spar_c355_5892  Length: 520  Sat Dec 10 06:11:56 2011  Type: P  Check: 3016  ..

       1  MAPIFRNYRF AIGAFAVIML ILLIKTSSMG PSSIARTVAT TASIPKTPED

      51  VSISPINDEP GYIHDPKTEQ NYPELADAVK SQTSQTCSEE HKYVIMIDAG

     101  STGSRIHIYE FDVCTSPPTL LYEKFEMLEP GLSSFDTDSV GAANSLDPLL

     151  EVAMKYVPLK ARSCTPVAVK ATAGLRLLGD AKSSKILSAV RDHLEKDYPF

     201  PVVEKDGVSI MGGDEEGVFA WITTNYLLGN IGTNGPKLPT AAVFDLGGGS

     251  TQIVFEPTFS ANEKMVDGEH KFDLKFGDEN YTLYQFSHLG YGLKEGRNKV

     301  NSVLLENAIK DGRILKGDNT KTHELLSPCL PPKVNATKEK VTLESKETYT

     351  IDFIGPDEPS GAQCRFLTDQ ILNKDAECQF PPCSFNGVHQ PSLVRTFKES

     401  NDIYIFSYFY DRTRPLGMPL SFTLNELKDL ARTVCNGEET WKSVFGGIAG

     451  SLDELESDSH FCLDLSFQVS LLHTGYDIPL QRELRTGEKI ANKEIGWCLG

     501  ASLPLLKPDN WKCKLSQIE*

Protein Sequence for WashU_Sbay_Contig676.29:

WashU_Sbay_Contig676.29  Length: 519  Sat Dec 10 06:11:56 2011  Type: P  Check: 1977  ..

       1  MAPIFRNYRF AITAFAVIML ILLIKTSTTN IDIARKVSPT ATIPKTPEDV

      51  SILPISDKPG YIDDKKTEQN DPDLADAVKS QTSSTCKKDH KYVIMIDGGS

     101  SGSRIHVYEF DICTSPPTLV NETFKMLEPG LSSFDNDSAG AAESLDPLLT

     151  AAMAAVPVKA RRCTPVSVKA TAGLRLLGEA KSAKILKAIR DHLEKDYPFP

     201  VVEGDGISIM GGDEEGVFAW ITTNYLLGNI GTAGSKLPTS AVFDLGGGST

     251  QIVFEPTFPP NEKMVDGEHK FDLNFGGEKY TLYQFSHLGY GLNQVRNKIN

     301  SVLVENALKE GTILNGDVST AHNLSSPCLP PKVNALKEKV KLDSGETYII

     351  DFIGPEVPSG PQCRFLADSI LNKDAECKSP PCSFNGAHQP SLVRTFKETN

     401  DLYIFSYFYD RTHPLGMPLT FTLNELMDLA RTVCNGEEIW KSVFTGIEGS

     451  LDKLRSDPHF CMDLSFQVSL LHTGYDIPLN RELKTAKTLA KNEIGWCLGA

     501  SLPLLESDNW KCKLSQIE*

Protein Sequence for WashU_Scas_Contig608.7:

WashU_Scas_Contig608.7  Length: 522  Sat Dec 10 06:11:56 2011  Type: P  Check: 6991  ..

       1  MPSFFRSYRF IIGAFAAIML LLLIRSSTTE STIDIARTVA SIASRPSTPQ

      51  DVNALPISNE PGYVSDSKTE QNNPEVATLV EESTNKKSKC NKDHEYVVMI

     101  DAGSTGSRVH VYEFDVCSQP PALINESFKM LKPGLSSFDT DAEGAAKSLD

     151  PLLQVALDAV PEKKRSCTPV AVKATAGLRL LGDTKSAKIL QAVRSHLEKD

     201  YPFAVVDGDG ISIMSGDEEG VYAWVTTNYL LGNIGTGSKL ATSAVFDLGG

     251  GSTQIVFEPT FPPNEEMVDG EHKYELRFGG QDYSLYQFSH LGYGLMEGRN

     301  KINQLLVETA IKAGTIKKGD YTPSVALHSP CLPPNVNVTQ EKVKLSSKET

     351  YVVDFIGPKV ASGAQCRFLS DQILNKDAKC TTKPCSFNGV HQPSLVHTFK

     401  ETNDLYIFSY FYDRTHTLGM PLSFTLNELA DLAKMVCDGE DTWESVLSEI

     451  DGSLDALVKD PYFCQDLSFQ VSLLHTGYDI PLHRELKTAE TIAGNELGWC

     501  LGASLPLLES DNWKCRVDKL Q*

Protein Sequence for WashU_Skud_Contig1931.1:

WashU_Skud_Contig1931.1  Length: 518  Sat Dec 10 06:11:56 2011  Type: P  Check: 7295  ..

       1  MAPIFRNYRF AIGAFAAIML ILLIKTSSTG SLSISRTVSP TASVPKTSGD

      51  VSTLPFGDKP GYIGNPKAEQ DYPEMADAVK SQTSQKCSEE HRYVVMIDAG

     101  SSGSRVHVYE FDVCTSPPTL INEEFKMLTP GLSSYDTDAA GAAESLDSLL

     151  DFAVDNVPLK ARGCTPVAVR ATAGLRIIGD AKSKKILTAV TNHLEKDYPF

     201  PIAEGSVSIM DGDEEGVFAW ITTNYLLKNI GTEGAKLPTA AVFDLGGGST

     251  QIVFEPTFPE NEKMVEGEHK YDLNFGGKIY TLYQFSHLRY GLMEGRKRIN

     301  SVLVQNAIKD GKITKGDGSK THKIMSPCLP PKVNSSNEKV ELAAGETYTV

     351  DFIGPDVPTG TQCRFLTDQI LNKDAKCQSP PCSFNGVHQP SMVRTFKELN

     401  DIYIFSFFYD RTHPLGMPSS FTLNELMDLT RTVCSGEETW KSVFSGIEGS

     451  LDELKSDPHY CLDLSFQVSL LHTGYDIPLY RELRTAEKID DTEIGWSLGA

     501  SLSLLESDFE CKVSQIE*
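
The GCG-format records above interleave position numbers and spaces with the residues. A minimal Python sketch for recovering the plain sequence and emitting FASTA could look like the following. It assumes that the header is the first line ending in ".." and that everything after it is sequence; the function names are illustrative. In the records shown here, the declared Length appears to count the trailing "*".

```python
import textwrap


def gcg_to_sequence(record):
    """Return the residues (and any trailing '*') of a GCG-style record."""
    lines = record.splitlines()
    for index, line in enumerate(lines):
        if line.rstrip().endswith(".."):
            body = lines[index + 1:]  # sequence lines follow the header
            break
    else:
        raise ValueError("no header line ending in '..' found")
    # Keep letters and the terminal '*'; drop position numbers and spaces.
    return "".join(ch for line in body for ch in line
                   if ch.isalpha() or ch == "*")


def to_fasta(name, sequence, width=60):
    """Format a sequence as a FASTA entry wrapped at `width` columns."""
    return ">" + name + "\n" + "\n".join(textwrap.wrap(sequence, width))


# Truncated excerpt: header plus the first sequence line only.
record = (
    "SGD_Scer_GDA1/YEL042W  Length: 519  Sat Dec 10 06:11:56 2011"
    "  Type: P  Check: 4631  ..\n"
    "\n"
    "       1  MAPIFRNYRF AIGAFAVIML ILLIKTSSIG PPSIARTVTP NASIPKTPED\n"
)
sequence = gcg_to_sequence(record)
print(to_fasta("SGD_Scer_GDA1/YEL042W", sequence))
```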