Fungal Sequence Alignment

Help

This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

Currently, this page displays other fungal sequences from Cliften et al. and Kellis et al. We will soon include sequences from other fungal genomes from a variety of sources.

ClustalW Protein Alignment and Sequence for YGR059W and Homologs

Choose two or more sequences for alignment:
Best Hits & Orthologs

Pick a sequence type:

Select or unselect multiple options for sequence
name by pressing the Control (PC) or Command
(Mac) key while clicking.

Align selected sequences
selected sequences (FASTA format)


Symbols:
* = identical
: = strong similarity
. = weak similarity

SGD_Scer_SPR3/YGR059W   1   -MKSKGSRLSTDCPVEFPKIVSGFAEEVKIRRQSSQGQYAVDSHPPKSPE   49
MIT_Sbay_c592_8991   1   -MKPKEGGLPTDCPVEFSKIISGFSEEAKVRKQASQG-KHVDPYQPKSPK   48
MIT_Smik_c521_8455   1   -MKPKGSRLSSDCPVEFPKIVTGFAEEVKIRRQSSQG-QNVDSCEPKSPE   48
MIT_Spar_c7_8195   1   -MKSKGSRLSTDCPVEFPKIISEFAEEVKIRRQSSQG-QNVDSYHATSPE   48
WashU_Sbay_Contig613.9   1   -MKPKEGGLPTDCPVEFSKIISGFSEEAKVRKQASQG-KHVDPYQPKSPK   48
WashU_Scas_Contig683.17   1   MSKTTSKLKKDHIPENFQILVNGFFQES---------------------Q   29
WashU_Skud_Contig2031.2   1   -MKPKGGRLSTDCPVEFPTIISGFSDEVKARKQASQG-QHLDAYQPKSPE   48
Symbols






*.. . * :* ::. * :* :



SGD_Scer_SPR3/YGR059W   50   LKHRRQRSSSFVNGKCRNRDLPLLDNKKAQEINTNSHGQDIGIKNLPRQR   99
MIT_Sbay_c592_8991   49   SRSIRQRSSSFVNGKCKSKETPLSENYGVEEMNSNSNGRDIGIRNIPRQR   98
MIT_Smik_c521_8455   49   LKSRRQRSSSFVNGKYRNKDLTLLENKKIEENNSNSRGLDIGIRYLPRQR   98
MIT_Spar_c7_8195   49   LKHRRQRSSSFVNGKYRSRDIPLLDNKNAEEISSNSHGQDIGIRNLPRQR   98
WashU_Sbay_Contig613.9   49   SRSIRQRSSSFVNGKCKSKETPLSENYGVEEMNSNSNGRDIGIRNIPRQR   98
WashU_Scas_Contig683.17   30   LKIKRLLKENGLDGRVTHKSVSKMISYGKPIISN----YKIGLENLPKQV   75
WashU_Skud_Contig2031.2   49   SRHRRQRSSSFVNGKCRNRDAPFADDKNSPSLN----GQGVGIENLPRQR   94
Symbols






: * ... ::*: :. . . . :*:. :*:*



SGD_Scer_SPR3/YGR059W   100   ELLNAKNGIDFTLMVAGQSGLGKTTFINSLFSTSLIDDDIKENK----PI   145
MIT_Sbay_c592_8991   99   ELLNAKNGIHFTLMVAGQSGLGKTTFINSLFATSLIDDNIKENR----PI   144
MIT_Smik_c521_8455   99   ELLNAKNGIDFTLMVAGQSGLGKTTFINSLFSTSLIDDEIEENK----PI   144
MIT_Spar_c7_8195   99   ELLNAKNGIHFTLMVAGQSGLGKTTFINSLFSTSLIDDNIKENK----PI   144
WashU_Sbay_Contig613.9   99   ELLNAKNGIHFTLMVAGQSGLGKTTFINSLFATSLIDDNIKENR----PI   144
WashU_Scas_Contig683.17   76   ELIKAQKGFDFTVMVAGQSGVGKSTFINTLFGESLVEKEIHDEKDIGKSI   125
WashU_Skud_Contig2031.2   95   ELLNAKNGIHFTLMVAGQSGLGKTTFINSLFSASLIDEGIKEDK----PI   140
Symbols






**::*::*:.**:*******:**:****:**. **::. *.::: .*



SGD_Scer_SPR3/YGR059W   146   IRYKSIVEGDGTHLNFNVIDTPGFGNNMDNAFTWRTMVNYIDEEIRSYIF   195
MIT_Sbay_c592_8991   145   VRYKNIIEGDGTHLRFGVIDTPNFGNDMDNAFTWRSMVNYIDEEIRSYIF   194
MIT_Smik_c521_8455   145   VRHKSVIEGDGTHLNFNVIETPGFGNNMDNAFTWRTMVNYIDEEIRSYIF   194
MIT_Spar_c7_8195   145   VRYKSVVEGDGTHLNFNVIDTPGFGNNMDNAFTWRTMVNYIDEEIRSYIF   194
WashU_Sbay_Contig613.9   145   VRYKNIIEGDGTHLRFGVIDTPNFGNDMDNAFTWRSMVNYIDEEIRSYIF   194
WashU_Scas_Contig683.17   126   IKRKFHIQGEGTELRFSVLETPDYGNKVNNSFVWVPLESYIDEQLRSFIF   175
WashU_Skud_Contig2031.2   141   VRYKSVIEGDETHLRLSVIDTPGFGNNMDNAFTWRTMVNYIDEEIRSYIF   190
Symbols






:: * ::*: *.*.:.*::**.:**.::*:*.* .: .****::**:**



SGD_Scer_SPR3/YGR059W   196   QEEQPDRTKMVDNRVHCCLYFLRPSNKGIDTLDVVTMKKLAKRVNLIPVI   245
MIT_Sbay_c592_8991   195   QEEQPDRTKMIDDRVHCCLYFLKPSNKGIDALDVLTMKELARRVNLIPVI   244
MIT_Smik_c521_8455   195   QEEQPDRVKMVDDRVHCCLYFLRPSNKGIDTLDVVTMKKLAKRVNLIPVI   244
MIT_Spar_c7_8195   195   QEEQPDRAKMVDDRVHCCLYFLKPTNKGIDALDVVTMKKLAKRVNLIPVI   244
WashU_Sbay_Contig613.9   195   QEEQPDRTKMIDDRVHCCLYFLKPSNKGIDALDVLTMKELARRVNLIPVI   244
WashU_Scas_Contig683.17   176   QEEQPQRECINDTRIHCCVYLFEPTNKGIKALDIVTMKELSKKVNLVPII   225
WashU_Skud_Contig2031.2   191   QEEQPDRTKMIDDRVHCCLYFLKPTGKGIDTLDVVTMKKLATRVNLIPVI   240
Symbols






*****:* : * *:***:*::.*:.***.:**::***:*: :***:*:*



SGD_Scer_SPR3/YGR059W   246   AKSDLLTKEELKNFKTQVREIIRVQDIPVCFFFGDE---VLNATQDIFQK   292
MIT_Sbay_c592_8991   245   AKADLLTKYELNNFKMEIREIIRVQNISVCCFFDNN---VLDATQNIFQK   291
MIT_Smik_c521_8455   245   AKADLLTKDELKEFKKEIREIIRVQDVPVCFLFGND---VLNATQDIFER   291
MIT_Spar_c7_8195   245   AKADLLTKEELKNFKMEIREIIRVQDIPVCFFFGND---VLNATQDIFQK   291
WashU_Sbay_Contig613.9   245   AKADLLTKYELNNFKMEIREIIRVQNISVCCFFDNN---VLDATQNIFQK   291
WashU_Scas_Contig683.17   226   SKIDIYSIEERQKLKLLVKNLIKVHNIEVCKLIKDYGFFEEDNVEQFCTD   275
WashU_Skud_Contig2031.2   241   AKADLLTKDELRNFKTEIREIIRVQDISICYFFGNH---VSNATQDIFQQ   287
Symbols






:* *: : * .::* ::::*:*::: :* :: : : .:::



SGD_Scer_SPR3/YGR059W   293   YPFSIIASNEYIFNEKGEKVKGRQYKWGAVDIENEKYCDFKILQKTIFDW   342
MIT_Sbay_c592_8991   292   FPFSIIASKEYVLNKKGKKVKGREYKWGTVEIENERHCDFKILQRILFDR   341
MIT_Smik_c521_8455   292   YPYSIIASNEYIINEKGKRVKGRQYEWGNVDIENEKYCDFKILQKTLFDW   341
MIT_Spar_c7_8195   292   YPFSIIASNEYIFNEKGERVKGRQYKWGAVDIENEKYCDFKILQKTLFDW   341
WashU_Sbay_Contig613.9   292   FPFSIIASKEYVLNKKGKKVKGREYKWGTVEIENERHCDFKILQRILFDR   341
WashU_Scas_Contig683.17   276   CPYGVVTSDKHILSLSEDLVLGRSFNGNKIEVENEEHSDFLKIRNFLLRD   325
WashU_Skud_Contig2031.2   288   YPFSIIASNEYVLNDKGEKVKGRQYKWGTVDIENEKYCDFKILQKTLFDW   337
Symbols






*:.:::*.::::. . . * **.:: . :::***.:.** ::. ::



SGD_Scer_SPR3/YGR059W   343   NLIDLVESTEDYYEKCRSEMLRTRLLKARDCLTTKSVDITEEQRKFLEEE   392
MIT_Sbay_c592_8991   342   HLIDFVESTEDYYEKCRSEMLRTRLLKARDCLTTNTVEITDEQRKFLQEE   391
MIT_Smik_c521_8455   342   HLIDLVESTEEYYEKCRSEMLRTRLLKARDCLATKSVEITEEQKKFLEEE   391
MIT_Spar_c7_8195   342   HLIDLVESTEEYYEKCRSEMLRTRLLKARDCLTTKSVDLTEEQKKFLEEE   391
WashU_Sbay_Contig613.9   342   HLIDFVESTEDYYEKCRSEMLRTRLLKARDCLTTNTVEITDEQRKFLQEE   391
WashU_Scas_Contig683.17   326   NLIDLVNSTTAYYEKCRAEMLKSRIAKTQELVLKD--LNAEHTKPELFRN   373
WashU_Skud_Contig2031.2   338   HLIDLVETTEEYYEKCRSEMLRTRLLKARDCLTTNSVELTEKQKKFLQEE   387
Symbols






:***:*::* ******:***::*: *::: : .. ::. : * .:



SGD_Scer_SPR3/YGR059W   393   MNFDEIEENKLKNYKCYEIINKTVMDKVATEWDPEFITRQLEAKKKFNEL   442
MIT_Sbay_c592_8991   392   MNFEDIEENKLKNYKCYEIINKVIMDKVATEWDPEFITRQLETKKRFNEI   441
MIT_Smik_c521_8455   392   MNFDDIEENKLKNYRCYEIINKTVMDKVATEWDPEFITRQLEAKRKFSEL   441
MIT_Spar_c7_8195   392   MNFDELEENKLKNYKCYEIINKTIMDKVATEWDPEFITRQLEAKKKFNEL   441
WashU_Sbay_Contig613.9   392   MNFEDIEENKLKNYKCYEIINKVIMDKVATEWDPEFITRQLETKKRFNEI   441
WashU_Scas_Contig683.17   374   FNFENPDENGLRNYICYQLFNKNAMNKPIDVWCPDLLERQLNFKKKYDDL   423
WashU_Skud_Contig2031.2   388   MNFEDIEENKLKNYKCYEIINKAIMDKVATEWDPEFITRQLEAKRKFNEL   437
Symbols






:**:: :** *:** **:::** *:* * *::: ***: *:::.::



SGD_Scer_SPR3/YGR059W   443   SNREISKFRDWKKSLFMEQENFNQEIEQLNHKLENLQLECQDLEYKLLIG   492
MIT_Sbay_c592_8991   442   SNQEIRKIKEWKKNLFMKQENFNQEIEDLNNNLENLQLECQDLEYKLLMD   491
MIT_Smik_c521_8455   442   SNREISKFRDWKKSLFMEQDNFNHEIEQLNHKLENLQLECQDLEYKLLIG   491
MIT_Spar_c7_8195   442   SNREISKFRDWKKSLFMEQENFNQEIEQLNHKLENLQLECQDLEYKLLIG   491
WashU_Sbay_Contig613.9   442   SNQEIRKIKEWKKNLFMKQENFNQEIEDLNNNLENLQLECQDLEYKLLMD   491
WashU_Scas_Contig683.17   424   LTVEEGKYQEWAQGLKKTQEEVNHDIKQMTETIQLLQLECEVLEDQLLNG   473
WashU_Skud_Contig2031.2   438   SNREICKFRDWKRSLFMEQENFNQEIEELNHKLENLQLECQDLEYKLLIG   487
Symbols






. * * ::* :.* *::.*::*:::...:: *****: ** :** .



SGD_Scer_SPR3/YGR059W   493   KSSNSHSTDS-ATLVNVHIKR   512
MIT_Sbay_c592_8991   492   KSSDHHSTDS-ATLVNVHIKR   511
MIT_Smik_c521_8455   492   KSSNNHSTDS-ATLVNVHIKR   511
MIT_Spar_c7_8195   492   KSSNNHSTDS-ATLVNVHIRR   511
WashU_Sbay_Contig613.9   492   KSSDHHSTDS-ATLVNVHIKR   511
WashU_Scas_Contig683.17   474   KRTRIYPEDSNVTLVGYCHKK   494
WashU_Skud_Contig2031.2   488   KSANNHSTDS-ATLVNVHVKR   507
Symbols






* : :. ** .***. ::



Symbols:
* = identical
: = strong similarity
. = weak similarity


- Download all sequences in alignment, in FASTA format.
GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_SPR3/YGR059W:

SGD_Scer_SPR3/YGR059W  Length: 513  Sat Dec 10 10:53:11 2011  Type: P  Check: 8883  ..

       1  MKSKGSRLST DCPVEFPKIV SGFAEEVKIR RQSSQGQYAV DSHPPKSPEL

      51  KHRRQRSSSF VNGKCRNRDL PLLDNKKAQE INTNSHGQDI GIKNLPRQRE

     101  LLNAKNGIDF TLMVAGQSGL GKTTFINSLF STSLIDDDIK ENKPIIRYKS

     151  IVEGDGTHLN FNVIDTPGFG NNMDNAFTWR TMVNYIDEEI RSYIFQEEQP

     201  DRTKMVDNRV HCCLYFLRPS NKGIDTLDVV TMKKLAKRVN LIPVIAKSDL

     251  LTKEELKNFK TQVREIIRVQ DIPVCFFFGD EVLNATQDIF QKYPFSIIAS

     301  NEYIFNEKGE KVKGRQYKWG AVDIENEKYC DFKILQKTIF DWNLIDLVES

     351  TEDYYEKCRS EMLRTRLLKA RDCLTTKSVD ITEEQRKFLE EEMNFDEIEE

     401  NKLKNYKCYE IINKTVMDKV ATEWDPEFIT RQLEAKKKFN ELSNREISKF

     451  RDWKKSLFME QENFNQEIEQ LNHKLENLQL ECQDLEYKLL IGKSSNSHST

     501  DSATLVNVHI KR*

Protein Sequence for MIT_Sbay_c592_8991:

MIT_Sbay_c592_8991  Length: 512  Sat Dec 10 10:53:12 2011  Type: P  Check: 3483  ..

       1  MKPKEGGLPT DCPVEFSKII SGFSEEAKVR KQASQGKHVD PYQPKSPKSR

      51  SIRQRSSSFV NGKCKSKETP LSENYGVEEM NSNSNGRDIG IRNIPRQREL

     101  LNAKNGIHFT LMVAGQSGLG KTTFINSLFA TSLIDDNIKE NRPIVRYKNI

     151  IEGDGTHLRF GVIDTPNFGN DMDNAFTWRS MVNYIDEEIR SYIFQEEQPD

     201  RTKMIDDRVH CCLYFLKPSN KGIDALDVLT MKELARRVNL IPVIAKADLL

     251  TKYELNNFKM EIREIIRVQN ISVCCFFDNN VLDATQNIFQ KFPFSIIASK

     301  EYVLNKKGKK VKGREYKWGT VEIENERHCD FKILQRILFD RHLIDFVEST

     351  EDYYEKCRSE MLRTRLLKAR DCLTTNTVEI TDEQRKFLQE EMNFEDIEEN

     401  KLKNYKCYEI INKVIMDKVA TEWDPEFITR QLETKKRFNE ISNQEIRKIK

     451  EWKKNLFMKQ ENFNQEIEDL NNNLENLQLE CQDLEYKLLM DKSSDHHSTD

     501  SATLVNVHIK R*

Protein Sequence for MIT_Smik_c521_8455:

MIT_Smik_c521_8455  Length: 512  Sat Dec 10 10:53:12 2011  Type: P  Check: 4906  ..

       1  MKPKGSRLSS DCPVEFPKIV TGFAEEVKIR RQSSQGQNVD SCEPKSPELK

      51  SRRQRSSSFV NGKYRNKDLT LLENKKIEEN NSNSRGLDIG IRYLPRQREL

     101  LNAKNGIDFT LMVAGQSGLG KTTFINSLFS TSLIDDEIEE NKPIVRHKSV

     151  IEGDGTHLNF NVIETPGFGN NMDNAFTWRT MVNYIDEEIR SYIFQEEQPD

     201  RVKMVDDRVH CCLYFLRPSN KGIDTLDVVT MKKLAKRVNL IPVIAKADLL

     251  TKDELKEFKK EIREIIRVQD VPVCFLFGND VLNATQDIFE RYPYSIIASN

     301  EYIINEKGKR VKGRQYEWGN VDIENEKYCD FKILQKTLFD WHLIDLVEST

     351  EEYYEKCRSE MLRTRLLKAR DCLATKSVEI TEEQKKFLEE EMNFDDIEEN

     401  KLKNYRCYEI INKTVMDKVA TEWDPEFITR QLEAKRKFSE LSNREISKFR

     451  DWKKSLFMEQ DNFNHEIEQL NHKLENLQLE CQDLEYKLLI GKSSNNHSTD

     501  SATLVNVHIK R*

Protein Sequence for MIT_Spar_c7_8195:

MIT_Spar_c7_8195  Length: 512  Sat Dec 10 10:53:12 2011  Type: P  Check: 3364  ..

       1  MKSKGSRLST DCPVEFPKII SEFAEEVKIR RQSSQGQNVD SYHATSPELK

      51  HRRQRSSSFV NGKYRSRDIP LLDNKNAEEI SSNSHGQDIG IRNLPRQREL

     101  LNAKNGIHFT LMVAGQSGLG KTTFINSLFS TSLIDDNIKE NKPIVRYKSV

     151  VEGDGTHLNF NVIDTPGFGN NMDNAFTWRT MVNYIDEEIR SYIFQEEQPD

     201  RAKMVDDRVH CCLYFLKPTN KGIDALDVVT MKKLAKRVNL IPVIAKADLL

     251  TKEELKNFKM EIREIIRVQD IPVCFFFGND VLNATQDIFQ KYPFSIIASN

     301  EYIFNEKGER VKGRQYKWGA VDIENEKYCD FKILQKTLFD WHLIDLVEST

     351  EEYYEKCRSE MLRTRLLKAR DCLTTKSVDL TEEQKKFLEE EMNFDELEEN

     401  KLKNYKCYEI INKTIMDKVA TEWDPEFITR QLEAKKKFNE LSNREISKFR

     451  DWKKSLFMEQ ENFNQEIEQL NHKLENLQLE CQDLEYKLLI GKSSNNHSTD

     501  SATLVNVHIR R*

Protein Sequence for WashU_Sbay_Contig613.9:

WashU_Sbay_Contig613.9  Length: 512  Sat Dec 10 10:53:12 2011  Type: P  Check: 3483  ..

       1  MKPKEGGLPT DCPVEFSKII SGFSEEAKVR KQASQGKHVD PYQPKSPKSR

      51  SIRQRSSSFV NGKCKSKETP LSENYGVEEM NSNSNGRDIG IRNIPRQREL

     101  LNAKNGIHFT LMVAGQSGLG KTTFINSLFA TSLIDDNIKE NRPIVRYKNI

     151  IEGDGTHLRF GVIDTPNFGN DMDNAFTWRS MVNYIDEEIR SYIFQEEQPD

     201  RTKMIDDRVH CCLYFLKPSN KGIDALDVLT MKELARRVNL IPVIAKADLL

     251  TKYELNNFKM EIREIIRVQN ISVCCFFDNN VLDATQNIFQ KFPFSIIASK

     301  EYVLNKKGKK VKGREYKWGT VEIENERHCD FKILQRILFD RHLIDFVEST

     351  EDYYEKCRSE MLRTRLLKAR DCLTTNTVEI TDEQRKFLQE EMNFEDIEEN

     401  KLKNYKCYEI INKVIMDKVA TEWDPEFITR QLETKKRFNE ISNQEIRKIK

     451  EWKKNLFMKQ ENFNQEIEDL NNNLENLQLE CQDLEYKLLM DKSSDHHSTD

     501  SATLVNVHIK R*

Protein Sequence for WashU_Scas_Contig683.17:

WashU_Scas_Contig683.17  Length: 495  Sat Dec 10 10:53:12 2011  Type: P  Check: 8920  ..

       1  MSKTTSKLKK DHIPENFQIL VNGFFQESQL KIKRLLKENG LDGRVTHKSV

      51  SKMISYGKPI ISNYKIGLEN LPKQVELIKA QKGFDFTVMV AGQSGVGKST

     101  FINTLFGESL VEKEIHDEKD IGKSIIKRKF HIQGEGTELR FSVLETPDYG

     151  NKVNNSFVWV PLESYIDEQL RSFIFQEEQP QRECINDTRI HCCVYLFEPT

     201  NKGIKALDIV TMKELSKKVN LVPIISKIDI YSIEERQKLK LLVKNLIKVH

     251  NIEVCKLIKD YGFFEEDNVE QFCTDCPYGV VTSDKHILSL SEDLVLGRSF

     301  NGNKIEVENE EHSDFLKIRN FLLRDNLIDL VNSTTAYYEK CRAEMLKSRI

     351  AKTQELVLKD LNAEHTKPEL FRNFNFENPD ENGLRNYICY QLFNKNAMNK

     401  PIDVWCPDLL ERQLNFKKKY DDLLTVEEGK YQEWAQGLKK TQEEVNHDIK

     451  QMTETIQLLQ LECEVLEDQL LNGKRTRIYP EDSNVTLVGY CHKK*


Protein Sequence for WashU_Skud_Contig2031.2:

WashU_Skud_Contig2031.2  Length: 508  Sat Dec 10 10:53:12 2011  Type: P  Check: 7326  ..

       1  MKPKGGRLST DCPVEFPTII SGFSDEVKAR KQASQGQHLD AYQPKSPESR

      51  HRRQRSSSFV NGKCRNRDAP FADDKNSPSL NGQGVGIENL PRQRELLNAK

     101  NGIHFTLMVA GQSGLGKTTF INSLFSASLI DEGIKEDKPI VRYKSVIEGD

     151  ETHLRLSVID TPGFGNNMDN AFTWRTMVNY IDEEIRSYIF QEEQPDRTKM

     201  IDDRVHCCLY FLKPTGKGID TLDVVTMKKL ATRVNLIPVI AKADLLTKDE

     251  LRNFKTEIRE IIRVQDISIC YFFGNHVSNA TQDIFQQYPF SIIASNEYVL

     301  NDKGEKVKGR QYKWGTVDIE NEKYCDFKIL QKTLFDWHLI DLVETTEEYY

     351  EKCRSEMLRT RLLKARDCLT TNSVELTEKQ KKFLQEEMNF EDIEENKLKN

     401  YKCYEIINKA IMDKVATEWD PEFITRQLEA KRKFNELSNR EICKFRDWKR

     451  SLFMEQENFN QEIEELNHKL ENLQLECQDL EYKLLIGKSA NNHSTDSATL

     501  VNVHVKR*