Fungal Sequence Alignment

This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

The orthologous fungal sequences shown here currently come from Cliften et al. and Kellis et al. Sequences from other fungal genomes, drawn from a variety of sources, will be added soon.

ClustalW Protein Alignment and Sequence for YAL034C and Homologs


Symbols:
* = identical
: = strong similarity
. = weak similarity
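
The legend above can be illustrated with a minimal sketch that derives the symbol for each alignment column. The residue groupings below are the "strong" and "weak" conservation groups commonly documented for ClustalW; they are an assumption here, since this page does not list them, and the function names are hypothetical.

```python
# Conservation groups as commonly documented for ClustalW (assumed, not
# stated on this page).
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK",
                 "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = ["CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
               "SNDEQK", "NDEQHK", "NEQHRK", "FVLIM", "HFY"]


def column_symbol(column):
    """Return '*', ':', '.' or ' ' for one column of aligned residues."""
    residues = {ch.upper() for ch in column}
    if "-" in residues:
        return " "  # any gap in the column: no symbol
    if len(residues) == 1:
        return "*"  # identical in every sequence
    if any(residues <= set(group) for group in STRONG_GROUPS):
        return ":"  # all residues fall inside one strong group
    if any(residues <= set(group) for group in WEAK_GROUPS):
        return "."  # all residues fall inside one weak group
    return " "


def consensus_line(rows):
    """Build the symbol line for a block of equal-length aligned rows."""
    if len({len(row) for row in rows}) != 1:
        raise ValueError("all aligned rows must have the same length")
    return "".join(column_symbol(col) for col in zip(*rows))


if __name__ == "__main__":
    # Final block of the YAL034C alignment, in the order shown on the page.
    last_block = [
        "KVGWLQDSNFTKYL",
        "KVGWLRDSNFTKYL",
        "KVGWLQDSNFTKYL",
        "KVGWLQDSNFTKFL",
        "KVGWLRDSNFKQWV",
        "KVGWLRDSNFTKYL",
    ]
    print(consensus_line(last_block))
```

Applied to the final block of the alignment, this sketch reproduces the symbol line shown for that block.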

SGD_Scer_FUN19/YAL034C   1   MGLYSPESEKSQLNMNYIGKDDSQSIFRRLNQNLKASNNNNDSNKNGLNM   50
MIT_Sbay_c942_172   1   MGLCSPESEKSQLSMDYSLKNGSQSIFKRLNQHLKLNNNDNNNDNN--NM   48
MIT_Smik_c1291_153   1   MGLYSPKSEKSQLNMNYNPKNNSQSIFRRLNQNLKVNNNSNDNDRSGMNM   50
MIT_Spar_c219_338   1   MGLYSPESEKSQLNMDYTAKDDSQSIFRRLNRNLKASNNNNDNNRSGLNM   50
WashU_Scas_Contig449.2   1   MQLYSPQTKHTNLGSSRAASGSKNNNTNEGNDKALLELLNQRVLIATNGV   50
WashU_Skud_Contig1685.7   1   MGLYSPESEKSQLNMDYIPKNDSQSIFKSLNQNVKLNDNNSNNNRSHLNM   50
Symbols   * * **::::::*. . ....:. . * : . .. .:

SGD_Scer_FUN19/YAL034C   51   SDYSNNSPYGRSYDVRINQNSQNNGNGCFSGSIDSLVDEHIIPSPPLSPK   100
MIT_Sbay_c942_172   49   DVYSNHSPYGPSYDERVNQNSQNSGAGCFSGSIDSLMDEHIIPSPPLSPK   98
MIT_Smik_c1291_153   51   SDFSNASPYGRSYNVRINQNSQNNGNGCFSGSIDSLVDEHIIPSPPLSPK   100
MIT_Spar_c219_338   51   SDYSNNSPYGRSYDVRINQNSQNNGNGCFSGSIDSLVDEHIIPSPPLSPK   100
WashU_Scas_Contig449.2   51   KNKNSTLGDGSSSIGIGIRNSNSE-----LDHMDTILDEHLIPSPPMSPK   95
WashU_Skud_Contig1685.7   51   SDYGNSSPYGRSYDARINQNPQSNGNGRFSGSIDSLVDEHIIPSPPLSPK   100
Symbols   . .. * * :*.:.. . :*:::***:*****:***

SGD_Scer_FUN19/YAL034C   101   LESKISHNG-SPRMASSVLVGSTPKGAVENVLFVKPVWPNGLSRKRYRYA   149
MIT_Sbay_c942_172   99   EESDFQHDA-SSHISSSALVGATPKGPVENVLTVKPAWPNGLTRKRYRYA   147
MIT_Smik_c1291_153   101   PESELHHNG-ISQISSSALVESTPKGLVEDVLFVKPAWPNGLSRKRYRYA   149
MIT_Spar_c219_338   101   LESKIDHNG-SPRMLSSALVGSTPKGPVENVLFVKPVWPNGLSRKRYRYA   149
WashU_Scas_Contig449.2   96   FSTIHRDED-------------------NSDILLKPIWQPGLTVKRYRHA   126
WashU_Skud_Contig1685.7   101   LESKFQQHDESSHVSSPALVGAAPKGPAENVLFVRPAWPNGLTRKRYRYA   150
Symbols   .: .. :. : ::* * **: ****:*

SGD_Scer_FUN19/YAL034C   150   TYGFLSQYKIFSNLAQPYSKNIINRYNNLAYNARHKYSKYNDDMTPPPLP   199
MIT_Sbay_c942_172   148   TYGFLSQYKMFSNLAQPYSKNIINRYNNLAYNNRYKHSKYNDELTPSSS-   196
MIT_Smik_c1291_153   150   TYGFLSQYKIFSNLVQPYSKNIINRYNNLAYNARQKYSKFNDEMTPSSS-   198
MIT_Spar_c219_338   150   TYGFLSQYKIFSNLAQPYSKNIINRYNNLAYNARHKYSKYNDEMTPPPS-   198
WashU_Scas_Contig449.2   127   THGFLSQYKIFD-------DSIINRRQGGYYNTSNRSRRYNYSDVRKYP-   168
WashU_Skud_Contig1685.7   151   TYGFLSQYKIFSNLAHPYSKNIINRYNNLAYNARHKYFRYDDEVTPSSS-   199
Symbols   *:*******:*. ..**** :. ** : ::: . .

SGD_Scer_FUN19/YAL034C   200   SSSSRLPSPLASPNLNRQARYNMRKQALYNNNLGKFESDTEWIPRKRKVY   249
MIT_Sbay_c942_172   197   ----RLPSPLASPNLNRQSRYNMRKQVLYNNNLGKFESDTEWVSRERKVY   242
MIT_Smik_c1291_153   199   ----RLPSPIASPNLNRQARYDMRKQAFYNNNLEKFESDTEWIPRKRKVY   244
MIT_Spar_c219_338   199   --SSRLPSPLASPNLNRQARYNMRKQALYNNNLGKFESDTEWIPRKRKVY   246
WashU_Scas_Contig449.2   169   ----YIHDDDLLGGRHKRPSRRAGRPSHYDDGKYVSSVYGTDDTDWDSYN   214
WashU_Skud_Contig1685.7   200   ----RLPSPLASPNLNRQSRYNMRKQVFFNNNLSKFESDTEWMPRKRKVY   245
Symbols   : . . :::. : :::. . . .

SGD_Scer_FUN19/YAL034C   250   SPQRRTMTTSPHRAKKFSPSASTPHTNIASIEAIHDAPQYIPNVSWKKLP   299
MIT_Sbay_c942_172   243   SPQRRTMTTSPHRAKKFSPSASAPHTNIASMEAIHEAPQYVPNISWKKLP   292
MIT_Smik_c1291_153   245   SPQRRTMTTSPHRVKKFSPSASAPHTNIASIEAIHEAPQYIPNVSWKKLP   294
MIT_Spar_c219_338   247   SPQRRTMTTSPHRAKKFSPSASAPHTNIASIEAIHDAPQYIPNVSWKKLP   296
WashU_Scas_Contig449.2   215   PILSKRPSTPIHRIKKSTPAVSSP---LASALAIHSAPQYVPNMSWEKLP   261
WashU_Skud_Contig1685.7   246   SPQRRTMTTSPHRSKKFSPSASAPHTNIASIEAIHEAPQYIPNVSWKKLP   295
Symbols   . : :*. ** ** :*:.*:* :** ***.****:**:**:***

SGD_Scer_FUN19/YAL034C   300   DYSPPLSTLPTDSNKSLKIEWKGSPMDLSTDPLRNELHPAELVLAQTLRL   349
MIT_Sbay_c942_172   293   DFSPPLSTLPADTNKSLKIEWKGSPMDLSTDPLRDELHPAELVLAQTLRL   342
MIT_Smik_c1291_153   295   DLSPSLTTLPADSNKSLKIEWKGSPMDLSADPLRDELHPAELMLAQTLRL   344
MIT_Spar_c219_338   297   DFTPPLSTLPADSNKSLKIEWKGSPMDLSTDPLRNELHPAELVLAQTLRL   346
WashU_Scas_Contig449.2   262   DYSPPLSTLPADNTKSLKVEWKGSSMDLSHDPLKKHLHPAELQLASILRL   311
WashU_Skud_Contig1685.7   296   DFSPSLSTLPADTNKSLKIEWKGSPMDLSLDPLKNELHPAELILAQTLRL   345
Symbols   * :*.*:***:*..****:*****.**** ***:..****** **. ***

SGD_Scer_FUN19/YAL034C   350   PCDLYLDSKRRLFLEKVYRLKKGLPFRRTDAQKACRIDVNKASRLFQAFE   399
MIT_Sbay_c942_172   343   PCDLYLDSKKRLFLEKVYRLRKGLPFRRTDAQKACRIDVNKASRLFQAFE   392
MIT_Smik_c1291_153   345   PCDLYLDSKRRLFLEKVYRLRKGLPFRRTDAQKACRIDVNKASRLFQAFE   394
MIT_Spar_c219_338   347   PCDLYLDSKRRLFLEKVYRLKKGLPFRRTDAQKACRIDVNKASRLFQAFE   396
WashU_Scas_Contig449.2   312   PCDLYLDSKRRLFLEKVHRLKKGLPFRRTDAQKACRIDVNKASRLFAAFE   361
WashU_Skud_Contig1685.7   346   PCDLYLDSKRRLFLEKVYRLRKGLPFRRTDAQKACRIDVNKASRLFQAFE   395
Symbols   *********:*******:**:************************* ***

SGD_Scer_FUN19/YAL034C   400   KVGWLQDSNFTKYL   413
MIT_Sbay_c942_172   393   KVGWLRDSNFTKYL   406
MIT_Smik_c1291_153   395   KVGWLQDSNFTKYL   408
MIT_Spar_c219_338   397   KVGWLQDSNFTKFL   410
WashU_Scas_Contig449.2   362   KVGWLRDSNFKQWV   375
WashU_Skud_Contig1685.7   396   KVGWLRDSNFTKYL   409
Symbols   *****:****.:::


GCG format sequences are displayed below.
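
For readers who want plain sequences, the GCG-style records can be converted with a short script. The following is a minimal sketch, assuming each record starts at its header line (the one ending in "..") and that the body consists of position numbers followed by blocks of ten residues, as on this page. The function names parse_gcg_record and to_fasta are hypothetical.

```python
import re


def parse_gcg_record(text):
    """Split one GCG-style record into (name, declared_length, sequence).

    `text` must begin at the header line, i.e. the line that carries
    "Length:" and ends with "..".
    """
    header, _, body = text.partition("..")
    name = header.split()[0]
    match = re.search(r"Length:\s*(\d+)", header)
    declared = int(match.group(1)) if match else None
    # Keep residues and the terminal "*"; drop position numbers and spaces.
    sequence = "".join(ch for ch in body if ch.isalpha() or ch == "*")
    return name, declared, sequence


def to_fasta(name, sequence, width=60):
    """Format a sequence as FASTA, dropping the trailing stop symbol."""
    residues = sequence.rstrip("*")
    lines = [residues[i:i + width] for i in range(0, len(residues), width)]
    return ">" + name + "\n" + "\n".join(lines) + "\n"


if __name__ == "__main__":
    # A shortened, made-up record in the same layout as the ones below.
    record = (
        "demo  Length: 14  Type: P  Check: 0  ..\n"
        "\n"
        "       1  MGLYSPESEK SQL*\n"
    )
    name, declared, seq = parse_gcg_record(record)
    print(name, declared, len(seq))
    print(to_fasta(name, seq), end="")
```

In the records on this page, the Length value is one greater than the last coordinate in the alignment (for example 414 versus 413 for SGD_Scer_FUN19/YAL034C), which is consistent with the trailing "*" being counted as part of the sequence.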




Protein Sequence for SGD_Scer_FUN19/YAL034C:

SGD_Scer_FUN19/YAL034C  Length: 414  Fri Dec  9 20:30:32 2011  Type: P  Check: 1725  ..

       1  MGLYSPESEK SQLNMNYIGK DDSQSIFRRL NQNLKASNNN NDSNKNGLNM

      51  SDYSNNSPYG RSYDVRINQN SQNNGNGCFS GSIDSLVDEH IIPSPPLSPK

     101  LESKISHNGS PRMASSVLVG STPKGAVENV LFVKPVWPNG LSRKRYRYAT

     151  YGFLSQYKIF SNLAQPYSKN IINRYNNLAY NARHKYSKYN DDMTPPPLPS

     201  SSSRLPSPLA SPNLNRQARY NMRKQALYNN NLGKFESDTE WIPRKRKVYS

     251  PQRRTMTTSP HRAKKFSPSA STPHTNIASI EAIHDAPQYI PNVSWKKLPD

     301  YSPPLSTLPT DSNKSLKIEW KGSPMDLSTD PLRNELHPAE LVLAQTLRLP

     351  CDLYLDSKRR LFLEKVYRLK KGLPFRRTDA QKACRIDVNK ASRLFQAFEK

     401  VGWLQDSNFT KYL*

Protein Sequence for MIT_Sbay_c942_172:

MIT_Sbay_c942_172  Length: 407  Fri Dec  9 20:30:32 2011  Type: P  Check: 5309  ..

       1  MGLCSPESEK SQLSMDYSLK NGSQSIFKRL NQHLKLNNND NNNDNNNMDV

      51  YSNHSPYGPS YDERVNQNSQ NSGAGCFSGS IDSLMDEHII PSPPLSPKEE

     101  SDFQHDASSH ISSSALVGAT PKGPVENVLT VKPAWPNGLT RKRYRYATYG

     151  FLSQYKMFSN LAQPYSKNII NRYNNLAYNN RYKHSKYNDE LTPSSSRLPS

     201  PLASPNLNRQ SRYNMRKQVL YNNNLGKFES DTEWVSRERK VYSPQRRTMT

     251  TSPHRAKKFS PSASAPHTNI ASMEAIHEAP QYVPNISWKK LPDFSPPLST

     301  LPADTNKSLK IEWKGSPMDL STDPLRDELH PAELVLAQTL RLPCDLYLDS

     351  KKRLFLEKVY RLRKGLPFRR TDAQKACRID VNKASRLFQA FEKVGWLRDS

     401  NFTKYL*

Protein Sequence for MIT_Smik_c1291_153:

MIT_Smik_c1291_153  Length: 409  Fri Dec  9 20:30:32 2011  Type: P  Check: 3508  ..

       1  MGLYSPKSEK SQLNMNYNPK NNSQSIFRRL NQNLKVNNNS NDNDRSGMNM

      51  SDFSNASPYG RSYNVRINQN SQNNGNGCFS GSIDSLVDEH IIPSPPLSPK

     101  PESELHHNGI SQISSSALVE STPKGLVEDV LFVKPAWPNG LSRKRYRYAT

     151  YGFLSQYKIF SNLVQPYSKN IINRYNNLAY NARQKYSKFN DEMTPSSSRL

     201  PSPIASPNLN RQARYDMRKQ AFYNNNLEKF ESDTEWIPRK RKVYSPQRRT

     251  MTTSPHRVKK FSPSASAPHT NIASIEAIHE APQYIPNVSW KKLPDLSPSL

     301  TTLPADSNKS LKIEWKGSPM DLSADPLRDE LHPAELMLAQ TLRLPCDLYL

     351  DSKRRLFLEK VYRLRKGLPF RRTDAQKACR IDVNKASRLF QAFEKVGWLQ

     401  DSNFTKYL*

Protein Sequence for MIT_Spar_c219_338:

MIT_Spar_c219_338  Length: 411  Fri Dec  9 20:30:32 2011  Type: P  Check: 6305  ..

       1  MGLYSPESEK SQLNMDYTAK DDSQSIFRRL NRNLKASNNN NDNNRSGLNM

      51  SDYSNNSPYG RSYDVRINQN SQNNGNGCFS GSIDSLVDEH IIPSPPLSPK

     101  LESKIDHNGS PRMLSSALVG STPKGPVENV LFVKPVWPNG LSRKRYRYAT

     151  YGFLSQYKIF SNLAQPYSKN IINRYNNLAY NARHKYSKYN DEMTPPPSSS

     201  RLPSPLASPN LNRQARYNMR KQALYNNNLG KFESDTEWIP RKRKVYSPQR

     251  RTMTTSPHRA KKFSPSASAP HTNIASIEAI HDAPQYIPNV SWKKLPDFTP

     301  PLSTLPADSN KSLKIEWKGS PMDLSTDPLR NELHPAELVL AQTLRLPCDL

     351  YLDSKRRLFL EKVYRLKKGL PFRRTDAQKA CRIDVNKASR LFQAFEKVGW

     401  LQDSNFTKFL *

Protein Sequence for WashU_Scas_Contig449.2:

WashU_Scas_Contig449.2  Length: 376  Fri Dec  9 20:30:32 2011  Type: P  Check: 2641  ..

       1  MQLYSPQTKH TNLGSSRAAS GSKNNNTNEG NDKALLELLN QRVLIATNGV

      51  KNKNSTLGDG SSSIGIGIRN SNSELDHMDT ILDEHLIPSP PMSPKFSTIH

     101  RDEDNSDILL KPIWQPGLTV KRYRHATHGF LSQYKIFDDS IINRRQGGYY

     151  NTSNRSRRYN YSDVRKYPYI HDDDLLGGRH KRPSRRAGRP SHYDDGKYVS

     201  SVYGTDDTDW DSYNPILSKR PSTPIHRIKK STPAVSSPLA SALAIHSAPQ

     251  YVPNMSWEKL PDYSPPLSTL PADNTKSLKV EWKGSSMDLS HDPLKKHLHP

     301  AELQLASILR LPCDLYLDSK RRLFLEKVHR LKKGLPFRRT DAQKACRIDV

     351  NKASRLFAAF EKVGWLRDSN FKQWV*

Protein Sequence for WashU_Skud_Contig1685.7:

WashU_Skud_Contig1685.7  Length: 410  Fri Dec  9 20:30:32 2011  Type: P  Check: 7022  ..

       1  MGLYSPESEK SQLNMDYIPK NDSQSIFKSL NQNVKLNDNN SNNNRSHLNM

      51  SDYGNSSPYG RSYDARINQN PQSNGNGRFS GSIDSLVDEH IIPSPPLSPK

     101  LESKFQQHDE SSHVSSPALV GAAPKGPAEN VLFVRPAWPN GLTRKRYRYA

     151  TYGFLSQYKI FSNLAHPYSK NIINRYNNLA YNARHKYFRY DDEVTPSSSR

     201  LPSPLASPNL NRQSRYNMRK QVFFNNNLSK FESDTEWMPR KRKVYSPQRR

     251  TMTTSPHRSK KFSPSASAPH TNIASIEAIH EAPQYIPNVS WKKLPDFSPS

     301  LSTLPADTNK SLKIEWKGSP MDLSLDPLKN ELHPAELILA QTLRLPCDLY

     351  LDSKRRLFLE KVYRLRKGLP FRRTDAQKAC RIDVNKASRL FQAFEKVGWL

     401  RDSNFTKYL*