Fungal Sequence Alignment Help



This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

Currently, this page displays other fungal sequences from Cliften et al. and Kellis et al.

ClustalW Protein Alignment and Sequence for YBR026C and Homologs


Choose two or more sequences for alignment:
Best Hits & Orthologs

Pick a sequence type:

Select or unselect multiple options for sequence
name by pressing the Control (PC) or Command
(Mac) key while clicking.

Align selected sequences
selected sequences (FASTA format)


Symbols:
* = identical
: = strong similarity
. = weak similarity

SGD_Scer_ETR1/YBR026C   1   -----------------MLPTFKRYMSSSAHQIPKHFKSLIYSTHEVEDC   33
MIT_Smik_c146_1193   1   -----------------MLPTLKRFMSSSTHQIPKQFKSLIYSAHEVEDC   33
MIT_Spar_c197_1100   1   -----------------MLPTLKRFMSSSAHQIPKQFKSVIYSTHEVEDC   33
MIT_Suva_c809_21444   1   -----------------MLPTLKRFMSSSGHQIPKQFKSIIYSTHEVEDC   33
WashU_Sbay_Contig532.4   1   -------------------------MSSSGHQIPKQFKSIIYSTHEVEDC   25
WashU_Scas_Contig593.14   1   -----------------MIPTFKRMMSSK-PTIPKQFKSLVYSSHSVEDP   32
WashU_Sklu_Contig1124.1   1   ----------------MLPSLRLTSKRFMSSRMPNTFKSVVYADHDSQDC   34
WashU_Skud_Contig2061.4   1   MNLYLRSNSSLNLEFNKMLTTLKRFMSSPAHQIPKQFKSIIYSTHEVEDC   50
WashU_Smik_Contig2208.1   1   -----------------MLPTLKRFMSSSTHQIPKQFKSLIYSAHEVEDC   33
Symbols






:*: ***::*: *. :*



SGD_Scer_ETR1/YBR026C   34   TKVLSVKNYTPKQDLSQSIVLKTLAFPINPSDINQLQGVYPSRPEKTYDY   83
MIT_Smik_c146_1193   34   SKVLSVKNYTPKQNLSQSIVLKTLAFPINPSDINQLQGVYPSRPEKTYDY   83
MIT_Spar_c197_1100   34   AKVLSVKNYTPKQDLSQSIVLKTLAFPINPSDINQLQGVYPSRPEKTYDY   83
MIT_Suva_c809_21444   34   TKVLSVKNYTPKQDLSQSIVLKTLAFPINPSDINQLQGVYPSRPEKTYDY   83
WashU_Sbay_Contig532.4   26   TKVLSVKNYTPKQDLSQSIVLKTLAFPINPSDINQLQGVYPSRPEKTYDY   75
WashU_Scas_Contig593.14   33   TSVLTLQRYTPKEDLTKSIVLRSLAFPINPSDINQLQGVYPSLPEKTLDY   82
WashU_Sklu_Contig1124.1   35   TNVLSVHNYTPKTPPEESIVLRTLAFPINPSDINQLEGVYPSKPEKTLDY   84
WashU_Skud_Contig2061.4   51   TKVLSVKNYTPKQDLFKSIVLKTLAFPINPSDVNQLQGVYPSRPEKTYDY   100
WashU_Smik_Contig2208.1   34   SKVLSVKNYTPKQNLSQSIVLKTLAFPINPSDINQLQGVYPSRPEKTYDY   83
Symbols






:.**:::.**** :****::*********:***:***** **** **



SGD_Scer_ETR1/YBR026C   84   STDEPAAIAGNEGVFEVVSLPSGSSKGDLKLG-DRVIPLQANQGTWSNYR   132
MIT_Smik_c146_1193   84   STDEPAAIAGNEGVFEVVYLPPSSSNGDLKLG-DRVIPLQANQGTWSDYR   132
MIT_Spar_c197_1100   84   STDEPAAIAGNEGVFEVVSLPSGSSKGNLKLG-DRVIPLQANQGTWSNYR   132
MIT_Suva_c809_21444   84   STDEPAAIAGNEGVFEVVSLPSDGSKGKLKLG-DRVIPLQANQGTWSNYR   132
WashU_Sbay_Contig532.4   76   STDEPAAIAGNEGVFEVVSLPSDGSKGKLKLG-DRVIPLQANQGTWSNYR   124
WashU_Scas_Contig593.14   83   STKAPSAIAGNEGVFEVVSIPEG--ETDLVQG-DWVIPLEANQGTWSDYR   129
WashU_Sklu_Contig1124.1   85   DTERPSAIAGNEGVFEVVHVPLGVENANGVREGDVVIPLQANFGTWSDYR   134
WashU_Skud_Contig2061.4   101   STDEPSAIAGNEGVFEVVSLPSGNSKGELKLG-DHVIPLQANQGTWSNYR   149
WashU_Smik_Contig2208.1   84   STDEPAAIAGNEGVFEVVYLPPSSSNGDLKLG-DRVIPLQANQGTWSNYR   132
Symbols






.*. *:************ :* . : . * ****:** ****:**



SGD_Scer_ETR1/YBR026C   133   VFSSSSDLIKVNDLDLFSAATVSVNGCTGFQLVSDYIDWNSN--GNEWII   180
MIT_Smik_c146_1193   133   VFSSSSELIKVNDLDLFSAATVSVNGCTGFQLVSDYIDWNSG--GNEWII   180
MIT_Spar_c197_1100   133   VFSNSSELIKVNDLDLFSAATVSVNGCTGFQLVSDYISWNNG--GNEWII   180
MIT_Suva_c809_21444   133   VFSDSSELIKVNDLDLFSAATVSVNGCTGFQLVSDFIDWNKG--GNEWII   180
WashU_Sbay_Contig532.4   125   VFSDSSELIKVNDLDLFSAATVSVNGCTGFQLVSDFIDWNKG--GNEWII   172
WashU_Scas_Contig593.14   130   VFANSSDLIKVNDLDLYTAATISVNGCTAYQLVKNYVNWDVQSLGNEWLV   179
WashU_Sklu_Contig1124.1   135   TCSRASDLVKVPGIDLIAAATIAVNACTAYQMVNNYVKWDGS---NEWIV   181
WashU_Skud_Contig2061.4   150   VFSNSSDLIRVNDLDLFSAATVSVNGCTGFQLVSDYIDWNRG--ANEWII   197
WashU_Smik_Contig2208.1   133   VFSSSSELIKVNDLDLFSAATVSVNGCTGFQLVSDYIDWNSG--GNEWII   180
Symbols






. : :*:*::* .:** :***::**.**.:*:*.:::.*: ***::



SGD_Scer_ETR1/YBR026C   181   QNAGTSSVSKIVTQVAKAKGIKTLSVIRDRDNFDEVAKVLEDKYGATKVI   230
MIT_Smik_c146_1193   181   QNAGTSGVSKIVSQVAKAKGINTLSVIRDRDNFDEVAKVLEDKYGATKVI   230
MIT_Spar_c197_1100   181   QNAGTSSVSKIVSQVAKAKGIKTLSVIRDRNNFDEVAKVLEDKYGATKVI   230
MIT_Suva_c809_21444   181   QNAGTSGVSKIVSQVAKAKGIKTLSVIRDRDNFDEVAKVLEDKYGATKVI   230
WashU_Sbay_Contig532.4   173   QNAGTSGVSKIVSQVAKAKGIKTLSVIRDRDNFDEVAKVLEDKYGATKVI   222
WashU_Scas_Contig593.14   180   QNAGTSGVSKMVSQIAKANGIKTLSVIRDRDDFDEVASTLEKKYGATKVI   229
WashU_Sklu_Contig1124.1   182   QNAGNSSVSRIVSQIAKAQGIKTLSVVRDREDFEQLANELETKYGATKVI   231
WashU_Skud_Contig2061.4   198   QNAGTSGVSKIVSQVAKAKGIKTLSVIRDRDNFDEVAKVLENEYGATKVI   247
WashU_Smik_Contig2208.1   181   QNAGTSGVSKIVSQVAKAKGINTLSVIRDRDNFDEVAKVLEDKYGATKVI   230
Symbols






****.*.**::*:*:***:**:****:***::*:::*. ** :*******



SGD_Scer_ETR1/YBR026C   231   SESQNNDKTFAKEVLSKILGENARVRLALNSVGGKSSASIARKLENNALM   280
MIT_Smik_c146_1193   231   SESQNNNKTFAKEVLTKVLGENARVRLALNSVGGKSSASIARKLEKNALM   280
MIT_Spar_c197_1100   231   SESQNNDKTFAKEVLAKVLGENARVRLALNSVGGKSSASIARKLENNALM   280
MIT_Suva_c809_21444   231   SESQNNDKAFAKEVLAKVLGENARVKLALNSVGGKSSASIARKLEKNALM   280
WashU_Sbay_Contig532.4   223   SESQNNDKAFAKEVLAKVLGENARVKLALNSVGGKSSASIARKLEKNALM   272
WashU_Scas_Contig593.14   230   SETQNNDKQFNKEELPKILGSHARVRLALNSVGGKSSSAIARKLENDALM   279
WashU_Sklu_Contig1124.1   232   SEGENNDKQFAKEVLPQILGPNAQVKLALNSVGGKSSASIARKLSRDATM   281
WashU_Skud_Contig2061.4   248   SESQNNDKTFAKEVLPKVLGENARVKLALNSVGGKSSASIARKLEKNALM   297
WashU_Smik_Contig2208.1   231   SESQNNNKTFAKEVLTKVLGENARVRLALNSVGGKSSASIARKLEKNALM   280
Symbols






** :**:* * ** *.::** :*:*:***********::*****..:* *



SGD_Scer_ETR1/YBR026C   281   LTYGGMSKQPVTLPTSLHIFKGLTSKGYWVTEKNKKNPQSKIDTISDFIK   330
MIT_Smik_c146_1193   281   LTYGGMSKQPVTLPTSLHIFKGLTSKGYWVTEKNKENPQSKIDTINNFIE   330
MIT_Spar_c197_1100   281   LTYGGMSKQPVTLPTSLHIFKGLTSKGYWVTEKNKKNPQSKIDTISDFIK   330
MIT_Suva_c809_21444   281   LTYGGMSKQPVTLPTSLHIFKGLTSKGYWVTEKNKENPQSKIDTINDFIK   330
WashU_Sbay_Contig532.4   273   LTYGGMSKQPVTLPTSLHIFKGLTSKGYWVTEKNKENPQSKIDTINDFIK   322
WashU_Scas_Contig593.14   280   LTYGGMSKQPVTLPTSLHIFKGLTSKGFWITKNVKERPQDKIDAVQNITE   329
WashU_Sklu_Contig1124.1   282   LTYGGMSKMPVTLPTSLHIFRGLKSMGFWVTENSKKDPQSKIDTINALLK   331
WashU_Skud_Contig2061.4   298   LTYGGMSKQPVTLPTSLHIFKGLTSKGYWVTEKNKANPQTKIDTVNGFIK   347
WashU_Smik_Contig2208.1   281   LTYGGMSKQPVTLPTSLHIFKGLTSKGYWVTEKNKENPQSKIDTINNFIE   330
Symbols






******** ***********:**.* *:*:*:: * ** ***::. : :



SGD_Scer_ETR1/YBR026C   331   MYNYGHIISPRDEIETLTWNTNTTTDEQLLELVKKGITGKGKKKMVVLEW   380
MIT_Smik_c146_1193   331   MYNAGQIISPKDEIETLTWNTNTTTDEQLLELIKKGITGKGKKKLVVLEW   380
MIT_Spar_c197_1100   331   MYNDGHIISPRDEVETLIWNTNTTTDEQLLELVKKGITEKGKKKMVVLEW   380
MIT_Suva_c809_21444   331   MYNEGQIISPKDEIQTLTWDTNTMTDEQLLDIVKKGITDKGKKKLVILEW   380
WashU_Sbay_Contig532.4   323   MYNEGQIISPKDEIQTLTWDTNTMTDEQLLDIVKKGITDKGKKKLVILEW   372
WashU_Scas_Contig593.14   330   MYKDGTFLSPRDEIEKLQWDVRRASDNDILELVKAGIRGKGKKKVVALQW   379
WashU_Sklu_Contig1124.1   332   LYANGDIVSPKDEVNVVEWDVNAASDDQVLELVKCGITKKGKKNVVVLKW   381
WashU_Skud_Contig2061.4   348   MYNHGQIISPKDEIETLTWNTNSTTDEELLELVKRGITAKGKKKLVVLEW   397
WashU_Smik_Contig2208.1   331   MYNAGQIISPKDEIETLTWNTNTTTDEQLLELIKKGITGKGKKKLVVLEW   380
Symbols






:* * ::**:**:: : *:.. :*:::*:::* ** ****::* *:*



Symbols:
* = identical
: = strong similarity
. = weak similarity


- Download all sequences in alignment, in FASTA format.
GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_ETR1/YBR026C:

SGD_Scer_ETR1/YBR026C  Length: 381  Mon Nov  7 14:44:23 2016  Type: P  Check: 1427  ..

       1  MLPTFKRYMS SSAHQIPKHF KSLIYSTHEV EDCTKVLSVK NYTPKQDLSQ

      51  SIVLKTLAFP INPSDINQLQ GVYPSRPEKT YDYSTDEPAA IAGNEGVFEV

     101  VSLPSGSSKG DLKLGDRVIP LQANQGTWSN YRVFSSSSDL IKVNDLDLFS

     151  AATVSVNGCT GFQLVSDYID WNSNGNEWII QNAGTSSVSK IVTQVAKAKG

     201  IKTLSVIRDR DNFDEVAKVL EDKYGATKVI SESQNNDKTF AKEVLSKILG

     251  ENARVRLALN SVGGKSSASI ARKLENNALM LTYGGMSKQP VTLPTSLHIF

     301  KGLTSKGYWV TEKNKKNPQS KIDTISDFIK MYNYGHIISP RDEIETLTWN

     351  TNTTTDEQLL ELVKKGITGK GKKKMVVLEW *

Protein Sequence for MIT_Smik_c146_1193:

MIT_Smik_c146_1193  Length: 381  Mon Nov  7 14:44:23 2016  Type: P  Check: 807  ..

       1  MLPTLKRFMS SSTHQIPKQF KSLIYSAHEV EDCSKVLSVK NYTPKQNLSQ

      51  SIVLKTLAFP INPSDINQLQ GVYPSRPEKT YDYSTDEPAA IAGNEGVFEV

     101  VYLPPSSSNG DLKLGDRVIP LQANQGTWSD YRVFSSSSEL IKVNDLDLFS

     151  AATVSVNGCT GFQLVSDYID WNSGGNEWII QNAGTSGVSK IVSQVAKAKG

     201  INTLSVIRDR DNFDEVAKVL EDKYGATKVI SESQNNNKTF AKEVLTKVLG

     251  ENARVRLALN SVGGKSSASI ARKLEKNALM LTYGGMSKQP VTLPTSLHIF

     301  KGLTSKGYWV TEKNKENPQS KIDTINNFIE MYNAGQIISP KDEIETLTWN

     351  TNTTTDEQLL ELIKKGITGK GKKKLVVLEW *

Protein Sequence for MIT_Spar_c197_1100:

MIT_Spar_c197_1100  Length: 381  Mon Nov  7 14:44:23 2016  Type: P  Check: 1515  ..

       1  MLPTLKRFMS SSAHQIPKQF KSVIYSTHEV EDCAKVLSVK NYTPKQDLSQ

      51  SIVLKTLAFP INPSDINQLQ GVYPSRPEKT YDYSTDEPAA IAGNEGVFEV

     101  VSLPSGSSKG NLKLGDRVIP LQANQGTWSN YRVFSNSSEL IKVNDLDLFS

     151  AATVSVNGCT GFQLVSDYIS WNNGGNEWII QNAGTSSVSK IVSQVAKAKG

     201  IKTLSVIRDR NNFDEVAKVL EDKYGATKVI SESQNNDKTF AKEVLAKVLG

     251  ENARVRLALN SVGGKSSASI ARKLENNALM LTYGGMSKQP VTLPTSLHIF

     301  KGLTSKGYWV TEKNKKNPQS KIDTISDFIK MYNDGHIISP RDEVETLIWN

     351  TNTTTDEQLL ELVKKGITEK GKKKMVVLEW *

Protein Sequence for MIT_Suva_c809_21444:

MIT_Suva_c809_21444  Length: 381  Mon Nov  7 14:44:23 2016  Type: P  Check: 6841  ..

       1  MLPTLKRFMS SSGHQIPKQF KSIIYSTHEV EDCTKVLSVK NYTPKQDLSQ

      51  SIVLKTLAFP INPSDINQLQ GVYPSRPEKT YDYSTDEPAA IAGNEGVFEV

     101  VSLPSDGSKG KLKLGDRVIP LQANQGTWSN YRVFSDSSEL IKVNDLDLFS

     151  AATVSVNGCT GFQLVSDFID WNKGGNEWII QNAGTSGVSK IVSQVAKAKG

     201  IKTLSVIRDR DNFDEVAKVL EDKYGATKVI SESQNNDKAF AKEVLAKVLG

     251  ENARVKLALN SVGGKSSASI ARKLEKNALM LTYGGMSKQP VTLPTSLHIF

     301  KGLTSKGYWV TEKNKENPQS KIDTINDFIK MYNEGQIISP KDEIQTLTWD

     351  TNTMTDEQLL DIVKKGITDK GKKKLVILEW *

Protein Sequence for WashU_Sbay_Contig532.4:

WashU_Sbay_Contig532.4  Length: 373  Mon Nov  7 14:44:23 2016  Type: P  Check: 7015  ..

       1  MSSSGHQIPK QFKSIIYSTH EVEDCTKVLS VKNYTPKQDL SQSIVLKTLA

      51  FPINPSDINQ LQGVYPSRPE KTYDYSTDEP AAIAGNEGVF EVVSLPSDGS

     101  KGKLKLGDRV IPLQANQGTW SNYRVFSDSS ELIKVNDLDL FSAATVSVNG

     151  CTGFQLVSDF IDWNKGGNEW IIQNAGTSGV SKIVSQVAKA KGIKTLSVIR

     201  DRDNFDEVAK VLEDKYGATK VISESQNNDK AFAKEVLAKV LGENARVKLA

     251  LNSVGGKSSA SIARKLEKNA LMLTYGGMSK QPVTLPTSLH IFKGLTSKGY

     301  WVTEKNKENP QSKIDTINDF IKMYNEGQII SPKDEIQTLT WDTNTMTDEQ

     351  LLDIVKKGIT DKGKKKLVIL EW*

Protein Sequence for WashU_Scas_Contig593.14:

WashU_Scas_Contig593.14  Length: 380  Mon Nov  7 14:44:23 2016  Type: P  Check: 8336  ..

       1  MIPTFKRMMS SKPTIPKQFK SLVYSSHSVE DPTSVLTLQR YTPKEDLTKS

      51  IVLRSLAFPI NPSDINQLQG VYPSLPEKTL DYSTKAPSAI AGNEGVFEVV

     101  SIPEGETDLV QGDWVIPLEA NQGTWSDYRV FANSSDLIKV NDLDLYTAAT

     151  ISVNGCTAYQ LVKNYVNWDV QSLGNEWLVQ NAGTSGVSKM VSQIAKANGI

     201  KTLSVIRDRD DFDEVASTLE KKYGATKVIS ETQNNDKQFN KEELPKILGS

     251  HARVRLALNS VGGKSSSAIA RKLENDALML TYGGMSKQPV TLPTSLHIFK

     301  GLTSKGFWIT KNVKERPQDK IDAVQNITEM YKDGTFLSPR DEIEKLQWDV

     351  RRASDNDILE LVKAGIRGKG KKKVVALQW*

Protein Sequence for WashU_Sklu_Contig1124.1:

WashU_Sklu_Contig1124.1  Length: 382  Mon Nov  7 14:44:23 2016  Type: P  Check: 4450  ..

       1  MLPSLRLTSK RFMSSRMPNT FKSVVYADHD SQDCTNVLSV HNYTPKTPPE

      51  ESIVLRTLAF PINPSDINQL EGVYPSKPEK TLDYDTERPS AIAGNEGVFE

     101  VVHVPLGVEN ANGVREGDVV IPLQANFGTW SDYRTCSRAS DLVKVPGIDL

     151  IAAATIAVNA CTAYQMVNNY VKWDGSNEWI VQNAGNSSVS RIVSQIAKAQ

     201  GIKTLSVVRD REDFEQLANE LETKYGATKV ISEGENNDKQ FAKEVLPQIL

     251  GPNAQVKLAL NSVGGKSSAS IARKLSRDAT MLTYGGMSKM PVTLPTSLHI

     301  FRGLKSMGFW VTENSKKDPQ SKIDTINALL KLYANGDIVS PKDEVNVVEW

     351  DVNAASDDQV LELVKCGITK KGKKNVVVLK W*

Protein Sequence for WashU_Skud_Contig2061.4:

WashU_Skud_Contig2061.4  Length: 398  Mon Nov  7 14:44:23 2016  Type: P  Check: 5070  ..

       1  MNLYLRSNSS LNLEFNKMLT TLKRFMSSPA HQIPKQFKSI IYSTHEVEDC

      51  TKVLSVKNYT PKQDLFKSIV LKTLAFPINP SDVNQLQGVY PSRPEKTYDY

     101  STDEPSAIAG NEGVFEVVSL PSGNSKGELK LGDHVIPLQA NQGTWSNYRV

     151  FSNSSDLIRV NDLDLFSAAT VSVNGCTGFQ LVSDYIDWNR GANEWIIQNA

     201  GTSGVSKIVS QVAKAKGIKT LSVIRDRDNF DEVAKVLENE YGATKVISES

     251  QNNDKTFAKE VLPKVLGENA RVKLALNSVG GKSSASIARK LEKNALMLTY

     301  GGMSKQPVTL PTSLHIFKGL TSKGYWVTEK NKANPQTKID TVNGFIKMYN

     351  HGQIISPKDE IETLTWNTNS TTDEELLELV KRGITAKGKK KLVVLEW*


Protein Sequence for WashU_Smik_Contig2208.1:

WashU_Smik_Contig2208.1  Length: 381  Mon Nov  7 14:44:23 2016  Type: P  Check: 967  ..

       1  MLPTLKRFMS SSTHQIPKQF KSLIYSAHEV EDCSKVLSVK NYTPKQNLSQ

      51  SIVLKTLAFP INPSDINQLQ GVYPSRPEKT YDYSTDEPAA IAGNEGVFEV

     101  VYLPPSSSNG DLKLGDRVIP LQANQGTWSN YRVFSSSSEL IKVNDLDLFS

     151  AATVSVNGCT GFQLVSDYID WNSGGNEWII QNAGTSGVSK IVSQVAKAKG

     201  INTLSVIRDR DNFDEVAKVL EDKYGATKVI SESQNNNKTF AKEVLTKVLG

     251  ENARVRLALN SVGGKSSASI ARKLEKNALM LTYGGMSKQP VTLPTSLHIF

     301  KGLTSKGYWV TEKNKENPQS KIDTINNFIE MYNAGQIISP KDEIETLTWN

     351  TNTTTDEQLL ELIKKGITGK GKKKLVVLEW *