Fungal Sequence Alignment Help



This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

Currently, this page displays other fungal sequences from Cliften et al. and Kellis et al.

ClustalW Protein Alignment and Sequence for YDR253C and Homologs


Choose two or more sequences for alignment:
Best Hits & Orthologs

Pick a sequence type:

Select or unselect multiple options for sequence
name by pressing the Control (PC) or Command
(Mac) key while clicking.

Align selected sequences
selected sequences (FASTA format)


Symbols:
* = identical
: = strong similarity
. = weak similarity

SGD_Scer_MET32/YDR253C   1   MEDQDAAFIKQATEAIVDVSLNIDNIDPIIKELLERVRNRQNRLQNKKPA   50
MIT_Smik_c237_3987   1   MEDEDAAFIKQATEAIVDVSLNMDNIDPIIKELLERVRKRRNTSRNGKTS   50
MIT_Spar_c115_4678   1   MEDQDAAFIKQATEAIVDVSLNVDNIDPIIKELLERVRNRQNTSQNKKPS   50
MIT_Suva_c1002_4696   1   MEDQDSAFIKQATEAIVDISLDINNIDPIIKELLQRVKNTRNTSRHRKRP   50
WashU_Sbay_Contig630.43   1   MEDQDSAFIKQATEAIVDISLDINNIDPIIKELLQRVKNTRNTSRHRKRP   50
WashU_Scas_Contig568.5   1   -MNEESTFFRLAAEAIVATSLNANNVDPTIRELLNRINYNDNMATARIPG   49
WashU_Skud_Contig1915.9   1   MEDQDSAFIKQATEAIVDVSLNVDSIDPIIKELLQRVRNMQNKSQNRKNS   50
WashU_Smik_Contig1237.2   1   MEDEDAAFIKQATEAIVDVSLNMDNIDPIIKELLERVRKRRNTSRNGKTS   50
Symbols






:::::*:: *:**** **: :.:** *:***:*:. *



SGD_Scer_MET32/YDR253C   51   LIPAENGVDIN---------------------SQGGNIKVKKENALPKPP   79
MIT_Smik_c237_3987   51   LKPGKNSIDIS---------------------SYGGAVEIEKETELERIS   79
MIT_Spar_c115_4678   51   LIQAENGVNIN---------------------SQNGNMNVKKENELQKPP   79
MIT_Suva_c1002_4696   51   VASTENGIKTG---------------------THNGNVKVKKEDVGQGLT   79
WashU_Sbay_Contig630.43   51   VASTENGIKTG---------------------THNGNVKVKKEDVGQGLT   79
WashU_Scas_Contig568.5   50   MLDVTGVSNHANLNTTAHLSPKSLPSMQTVELSHIPSSQSTSSSLHTHTD   99
WashU_Skud_Contig1915.9   51   LTSAENSVDNS---------------------NHSGNVKIEKEHSS---S   76
WashU_Smik_Contig1237.2   51   LKPGKNSIDIS---------------------SYGGAVEIEKETELERIS   79
Symbols






: . . . : ..



SGD_Scer_MET32/YDR253C   80   KSSKSKPQDRRNSTGEKRFKCAKCSLEFSRSSDLRRHEKTHFAILPNICP   129
MIT_Smik_c237_3987   80   KPSESKSQDRRNSTGEKRFKCAKCSLEFSRSSDLRRHEKTHFAILPNICP   129
MIT_Spar_c115_4678   80   KPSKSKSQDRRNSTGEKRFKCAKCLLEFSRSSDLRRHEKTHFAILPNICP   129
MIT_Suva_c1002_4696   80   KPNESKSQDRRNSNGGKQFRCAKCSLEFSRSSDLRRHEKTLLTYYLTYA-   128
WashU_Sbay_Contig630.43   80   KPNESKSQDRRNSNGGKQFRCAKCSLEFSRSSDLRRHEKTHFAILPNICP   129
WashU_Scas_Contig568.5   100   KSNKVAKRVRKDSYDNKRYPCSKCELIFLRSSDLRRHEKAHLLVLPHICS   149
WashU_Skud_Contig1915.9   77   KTSKSKSQDRRDSTGEKKFRCAKCSLEFSRSSDLRRHEKTHFAILPNICP   126
WashU_Smik_Contig1237.2   80   KPSESKSQDRRNSTGEKKFKCAKCSLEFSRSSDLRRHEKTHFAILPNICP   129
Symbols






*..: : *::* . *:: *:** * * **********: : .



SGD_Scer_MET32/YDR253C   130   QCGKGFARKDALKRHYDTLTCRRNRTKLLTAGGEGINELLKKVKQSNIVH   179
MIT_Smik_c237_3987   130   QCGKGFARKDALKRHYDTLTCRRNRTKLLTAGGEGINELLRKVKQSNIVN   179
MIT_Spar_c115_4678   130   QCGKGFARKDALKRHYDTLTCRRNRTKLLTAGGEGINELLKKVKQSNIVH   179
MIT_Suva_c1002_4696   
   --------------------------------------------------   
WashU_Sbay_Contig630.43   130   QCGKGFARKDALKRHYDTLTCRRNRSKLLSAGGEGINELLKKVKQSNIAN   179
WashU_Scas_Contig568.5   150   QCGKGFARKDALKRHFNTQTCQRNRKKLMEIAGGNINELLERARVNGTSL   199
WashU_Skud_Contig1915.9   127   QCGKGFARKDALKRHYDTLTCRRNRTKLLTAGGESINELLKKVKQSNIVN   176
WashU_Smik_Contig1237.2   130   QCGKGFARKDALKRHYDTLTCRRNRTKLLTAGGEGINELLRKVKQSNIVN   179
Symbols










SGD_Scer_MET32/YDR253C   180   RQDNNHNGSSNG--   191
MIT_Smik_c237_3987   180   SQKSNSNSSSSSNG   193
MIT_Spar_c115_4678   180   RQDNNQNSSSNG--   191
MIT_Suva_c1002_4696   
   --------------   
WashU_Sbay_Contig630.43   180   REESNKNNGWKG--   191
WashU_Scas_Contig568.5   
   --------------   
WashU_Skud_Contig1915.9   177   RQENNNNSTND---   187
WashU_Smik_Contig1237.2   180   SQKSNSNSSSSSNG   193
Symbols










Symbols:
* = identical
: = strong similarity
. = weak similarity


- Download all sequences in alignment, in FASTA format.
GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_MET32/YDR253C:

SGD_Scer_MET32/YDR253C  Length: 192  Mon Nov  7 15:05:24 2016  Type: P  Check: 2258  ..

       1  MEDQDAAFIK QATEAIVDVS LNIDNIDPII KELLERVRNR QNRLQNKKPA

      51  LIPAENGVDI NSQGGNIKVK KENALPKPPK SSKSKPQDRR NSTGEKRFKC

     101  AKCSLEFSRS SDLRRHEKTH FAILPNICPQ CGKGFARKDA LKRHYDTLTC

     151  RRNRTKLLTA GGEGINELLK KVKQSNIVHR QDNNHNGSSN G*


Protein Sequence for MIT_Smik_c237_3987:

MIT_Smik_c237_3987  Length: 194  Mon Nov  7 15:05:24 2016  Type: P  Check: 9207  ..

       1  MEDEDAAFIK QATEAIVDVS LNMDNIDPII KELLERVRKR RNTSRNGKTS

      51  LKPGKNSIDI SSYGGAVEIE KETELERISK PSESKSQDRR NSTGEKRFKC

     101  AKCSLEFSRS SDLRRHEKTH FAILPNICPQ CGKGFARKDA LKRHYDTLTC

     151  RRNRTKLLTA GGEGINELLR KVKQSNIVNS QKSNSNSSSS SNG*


Protein Sequence for MIT_Spar_c115_4678:

MIT_Spar_c115_4678  Length: 192  Mon Nov  7 15:05:24 2016  Type: P  Check: 4137  ..

       1  MEDQDAAFIK QATEAIVDVS LNVDNIDPII KELLERVRNR QNTSQNKKPS

      51  LIQAENGVNI NSQNGNMNVK KENELQKPPK PSKSKSQDRR NSTGEKRFKC

     101  AKCLLEFSRS SDLRRHEKTH FAILPNICPQ CGKGFARKDA LKRHYDTLTC

     151  RRNRTKLLTA GGEGINELLK KVKQSNIVHR QDNNQNSSSN G*


Protein Sequence for MIT_Suva_c1002_4696:

MIT_Suva_c1002_4696  Length: 128  Mon Nov  7 15:05:24 2016  Type: P  Check: 2315  ..

       1  MEDQDSAFIK QATEAIVDIS LDINNIDPII KELLQRVKNT RNTSRHRKRP

      51  VASTENGIKT GTHNGNVKVK KEDVGQGLTK PNESKSQDRR NSNGGKQFRC

     101  AKCSLEFSRS SDLRRHEKTL LTYYLTYA

Protein Sequence for WashU_Sbay_Contig630.43:

WashU_Sbay_Contig630.43  Length: 192  Mon Nov  7 15:05:24 2016  Type: P  Check: 4798  ..

       1  MEDQDSAFIK QATEAIVDIS LDINNIDPII KELLQRVKNT RNTSRHRKRP

      51  VASTENGIKT GTHNGNVKVK KEDVGQGLTK PNESKSQDRR NSNGGKQFRC

     101  AKCSLEFSRS SDLRRHEKTH FAILPNICPQ CGKGFARKDA LKRHYDTLTC

     151  RRNRSKLLSA GGEGINELLK KVKQSNIANR EESNKNNGWK G*


Protein Sequence for WashU_Scas_Contig568.5:

WashU_Scas_Contig568.5  Length: 200  Mon Nov  7 15:05:24 2016  Type: P  Check: 2872  ..

       1  MNEESTFFRL AAEAIVATSL NANNVDPTIR ELLNRINYND NMATARIPGM

      51  LDVTGVSNHA NLNTTAHLSP KSLPSMQTVE LSHIPSSQST SSSLHTHTDK

     101  SNKVAKRVRK DSYDNKRYPC SKCELIFLRS SDLRRHEKAH LLVLPHICSQ

     151  CGKGFARKDA LKRHFNTQTC QRNRKKLMEI AGGNINELLE RARVNGTSL*


Protein Sequence for WashU_Skud_Contig1915.9:

WashU_Skud_Contig1915.9  Length: 188  Mon Nov  7 15:05:24 2016  Type: P  Check: 967  ..

       1  MEDQDSAFIK QATEAIVDVS LNVDSIDPII KELLQRVRNM QNKSQNRKNS

      51  LTSAENSVDN SNHSGNVKIE KEHSSSKTSK SKSQDRRDST GEKKFRCAKC

     101  SLEFSRSSDL RRHEKTHFAI LPNICPQCGK GFARKDALKR HYDTLTCRRN

     151  RTKLLTAGGE SINELLKKVK QSNIVNRQEN NNNSTND*

Protein Sequence for WashU_Smik_Contig1237.2:

WashU_Smik_Contig1237.2  Length: 194  Mon Nov  7 15:05:24 2016  Type: P  Check: 8927  ..

       1  MEDEDAAFIK QATEAIVDVS LNMDNIDPII KELLERVRKR RNTSRNGKTS

      51  LKPGKNSIDI SSYGGAVEIE KETELERISK PSESKSQDRR NSTGEKKFKC

     101  AKCSLEFSRS SDLRRHEKTH FAILPNICPQ CGKGFARKDA LKRHYDTLTC

     151  RRNRTKLLTA GGEGINELLR KVKQSNIVNS QKSNSNSSSS SNG*