Fungal Sequence Alignment



This page shows a ClustalW alignment of a Saccharomyces cerevisiae protein with its identified orthologs in other fungal species.

The other fungal sequences currently shown come from Cliften et al. and Kellis et al.

ClustalW Protein Alignment and Sequence for YMR138W and Homologs


Symbols:
* = identical
: = strong similarity
. = weak similarity
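
As an illustration of how these symbols can be derived, the sketch below classifies each alignment column using the "strong" and "weak" residue groups documented for ClustalW. That this page uses those groups unchanged is an assumption, and the function names (column_symbol, consensus_line) are hypothetical. The example rows are the aligned region of four of the nine sequences in the first block.

```python
# Residue groups as documented for ClustalW conservation symbols.
# Assumption: the alignment on this page uses these groups unchanged.
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK",
                 "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = ["CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
               "SNDEQK", "NDEQHK", "NEQHRK", "FVLIM", "HFY"]


def column_symbol(column):
    """Return '*', ':', '.', or ' ' for one alignment column."""
    residues = set(column.upper())
    if "-" in residues:
        return " "  # any gap in the column means no symbol
    if len(residues) == 1:
        return "*"  # identical in every sequence
    if any(residues <= set(group) for group in STRONG_GROUPS):
        return ":"  # all residues fall within one strong group
    if any(residues <= set(group) for group in WEAK_GROUPS):
        return "."  # all residues fall within one weak group
    return " "


def consensus_line(rows):
    """Build the symbol line for equal-length aligned rows."""
    return "".join(column_symbol("".join(col)) for col in zip(*rows))


# Aligned region (last 23 columns) of four rows from the first block.
rows = [
    "MGLLSIIRKQKLRDKEIRCLILG",  # SGD_Scer_CIN4/YMR138W
    "MGLLSIIRKQKLKDREIRCLILG",  # MIT_Smik_c299_16767
    "MGLLTIIKKQKRKDKELKCLILG",  # WashU_Scas_Contig680.21
    "MGLLTVIKKQKMKDKELRSLVLG",  # WashU_Sklu_Contig1358.1
]
print(consensus_line(rows))  # prints ****::*:*** :*:*::.*:**
```

For these four rows the output matches the symbol string shown under the first alignment block. Columns where any sequence has a gap receive a blank.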

SGD_Scer_CIN4/YMR138W   1   ---------------------------MGLLSIIRKQKLRDKEIRCLILG   23
MIT_Smik_c299_16767   1   ---------------------------MGLLSIIRKQKLKDREIRCLILG   23
MIT_Spar_c164_16765   1   MTKSKNPLHYFIEIIRGKPQGVGKQEYMGLLSIIRKQKLKDKEIRCLILG   50
MIT_Suva_c655_18676   1   ---------------------------MGLLSIIRKQKLKDREIRCLILG   23
WashU_Sbay_Contig657.44   1   ---------------------------MGLLSIIRKQKLKDREIRCLILG   23
WashU_Scas_Contig680.21   1   ---------------------------MGLLTIIKKQKRKDKELKCLILG   23
WashU_Sklu_Contig1358.1   1   -MECTSSSSSFNAIVKKKQVKQVKQQNMGLLTVIKKQKMKDKELRSLVLG   49
WashU_Skud_Contig1802.2   1   ---------------------------MGLLSIIRKQKLKDREIRCLILG   23
WashU_Smik_Contig1743.2   1   ---------------------------MGLLSIIRKQKLKDREIRCLILG   23
Symbols   ****::*:*** :*:*::.*:**

SGD_Scer_CIN4/YMR138W   24   LDNSGKSTIVNKLLPKDEQNNDGIMPTVGFQIHSLMIKDVTISLWDIGGQ   73
MIT_Smik_c299_16767   24   LDNSGKSTIVNKLLPEDEQNANDIMPTVGFQIHSLMIRDVMVSLWDVGGQ   73
MIT_Spar_c164_16765   51   LDNSGKSTIVNKLLPEDEQSKNGIMPTVGFQIHSLIIRDVTISLWDIGGQ   100
MIT_Suva_c655_18676   24   LDNSGKSTIVNKLLPENEQSANGITPTVGFQIHSLLIRDIMVSLWDIGGQ   73
WashU_Sbay_Contig657.44   24   LDNSGKSTIVNKLLPENEQSANGITPTVGFQIHSLLIRDIMVSLWDIGGQ   73
WashU_Scas_Contig680.21   24   LDNSGKSTLVNKLLPEEERSQVEITPTIGFQIVNFNHGGYTISMWDIGGQ   73
WashU_Sklu_Contig1358.1   50   LDNSGKSTVVDWLLERGEKRSR-ITPTVGFRIHTIEYAGHNVQLWDIGGQ   98
WashU_Skud_Contig1802.2   24   LDNSGKSTIVNKLLPEDEQKVNGIMPTVGFQIHSLVTRDVMVSLWDIGGQ   73
WashU_Smik_Contig1743.2   24   LDNSGKSTIVNKLLPEDEQNANDIMPTVGFQIHSLMIRDVMVSLWDVGGQ   73
Symbols   ********:*: ** . *: * **:**:* .: . :.:**:***

SGD_Scer_CIN4/YMR138W   74   RTLRPFWDNYFDKTQAMIWCIDVSLSMRFDETLQELKELINRD---ENRI   120
MIT_Smik_c299_16767   74   RTLRPFWDNYFDKTQVMIWCIDVSLMMRLDETLQELSELVNRD---ENRI   120
MIT_Spar_c164_16765   101   RTLRPFWDNYFDKTQVMIWCIDVSLSMRFDETLQELRELVNRD---ENRI   147
MIT_Suva_c655_18676   74   RTLRPFWDNYFDKTQVMIWCIDVSLLVRFDETMQELEELVNRD---ESRM   120
WashU_Sbay_Contig657.44   74   RTLRPFWDNYFDKTQVMIWCIDVSLLVRFDETMQELEELVNRD---ESRM   120
WashU_Scas_Contig680.21   74   TTLRPFWDNYFDKMEALVWCVDVSAPSRFQESLRELSQLLNLDRTVESGE   123
WashU_Sklu_Contig1358.1   99   RTLRPFWDNYFDKTDVLLWVIDVTARSRFSESFAELEKLLQDR----DRL   144
WashU_Skud_Contig1802.2   74   RTLRQFWDNYFDKTQVMIWCIDVSLLMRFGETIRELRELINRD---ENRI   120
WashU_Smik_Contig1743.2   74   RTLRPFWDNYFDKTQVMIWCIDVSLMMRLDETLQELSELVNRD---ENRI   120
Symbols   *** ******** :.::* :**: *: *:: ** :*:: .

SGD_Scer_CIN4/YMR138W   121   GYECAVIVVLNKIDLVED--------KSELHRRCLLVESELKCLFKPDIR   162
MIT_Smik_c299_16767   121   GYECAVIIVLNKIDLVEN--------KSELCQRYASVESKLKRLFKPDIR   162
MIT_Spar_c164_16765   148   GYECAVIIVLNKIDLVES--------KSELHQRYILVESKLKCLFKPDIR   189
MIT_Suva_c655_18676   121   GYECAVIIALNKIDLVED--------KTELGRRCSLVESELKCLFKPGIR   162
WashU_Sbay_Contig657.44   121   GYECAVIIALNKIDLVED--------KTELGRRCSLVESELKCLFKPGIR   162
WashU_Scas_Contig680.21   124   TLPFKLIVVLNKIDLVDW--------NSVEGQLDSLLSTHLEGIEYVSLA   165
WashU_Sklu_Contig1358.1   145   GYRCKMIVLLNKMDLIDEDESVTDDVRRAVIERFSLVNTXXXXXXXXXXX   194
WashU_Skud_Contig1802.2   121   GYQCAVIIALNKTDLVED--------KSELCRRQKLVELELKSLFKPDIR   162
WashU_Smik_Contig1743.2   121   GYECAVIIVLNKIDLVEN--------KSELCQRYASVESKLKRLFKPDIR   162
Symbols   :*: *** **:: . . :.

SGD_Scer_CIN4/YMR138W   163   IELVKCSGVTGEGIDNLRDRLVESCHFTQ   191
MIT_Smik_c299_16767   163   VALVQCSGITGEGIDNLRDGLVEACHFT-   190
MIT_Spar_c164_16765   190   ITLVRCSGITGGGIDDLRDHLVESCYFTQ   218
MIT_Suva_c655_18676   163   LALVQCSGITGEGLDDLRDRLVEFSRFAQ   191
WashU_Sbay_Contig657.44   163   LALVQCSGITGEGLDDLRDRLVEFSRFAQ   191
WashU_Scas_Contig680.21   166   VSALNGQGTMELATLLLP-----------   183
WashU_Sklu_Contig1358.1   195   XXXVPCSGLTGFALGKLLEVVVSQ-----   218
WashU_Skud_Contig1802.2   163   IAVVQCSGITGEGIEDLRDRLVESCHFPQ   191
WashU_Smik_Contig1743.2   163   VALVQCSGITGEGIDNLRDGLVEACHFT-   190
Symbols   : .* . *

GCG format sequences are displayed below.
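
For readers who want these records in FASTA form, the following is a minimal sketch of a converter, assuming each record has the layout shown on this page: a header line ending in "..", followed by numbered lines of residues in blocks of ten, with a trailing "*". The function names parse_gcg and to_fasta are hypothetical and are not part of any GCG tool. The example input is the first record below.

```python
import re


def parse_gcg(record):
    """Split one GCG-style record into name, declared length, and residues."""
    # Everything before the first ".." is the header; the rest is sequence.
    # Assumption: the sequence name itself never contains "..".
    header, _, body = record.partition("..")
    name = header.split()[0]
    declared_length = int(re.search(r"Length:\s*(\d+)", header).group(1))
    # Keep letters only: drops position numbers, spaces, and the final '*'.
    residues = "".join(re.findall(r"[A-Za-z]", body))
    return name, declared_length, residues


def to_fasta(name, residues, width=60):
    """Format a name and residue string as a FASTA entry."""
    lines = [residues[i:i + width] for i in range(0, len(residues), width)]
    return ">" + name + "\n" + "\n".join(lines) + "\n"


record = """SGD_Scer_CIN4/YMR138W  Length: 192  Mon Nov  7 16:18:16 2016  Type: P  Check: 3309  ..

       1  MGLLSIIRKQ KLRDKEIRCL ILGLDNSGKS TIVNKLLPKD EQNNDGIMPT

      51  VGFQIHSLMI KDVTISLWDI GGQRTLRPFW DNYFDKTQAM IWCIDVSLSM

     101  RFDETLQELK ELINRDENRI GYECAVIVVL NKIDLVEDKS ELHRRCLLVE

     151  SELKCLFKPD IRIELVKCSG VTGEGIDNLR DRLVESCHFT Q*
"""

name, declared_length, residues = parse_gcg(record)
print(to_fasta(name, residues))
```

In this record the parser recovers 191 residues while the header declares Length: 192, which suggests that the Length field counts the terminating "*". The sketch does not recompute or verify the Check value.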




Protein Sequence for SGD_Scer_CIN4/YMR138W:

SGD_Scer_CIN4/YMR138W  Length: 192  Mon Nov  7 16:18:16 2016  Type: P  Check: 3309  ..

       1  MGLLSIIRKQ KLRDKEIRCL ILGLDNSGKS TIVNKLLPKD EQNNDGIMPT

      51  VGFQIHSLMI KDVTISLWDI GGQRTLRPFW DNYFDKTQAM IWCIDVSLSM

     101  RFDETLQELK ELINRDENRI GYECAVIVVL NKIDLVEDKS ELHRRCLLVE

     151  SELKCLFKPD IRIELVKCSG VTGEGIDNLR DRLVESCHFT Q*


Protein Sequence for MIT_Smik_c299_16767:

MIT_Smik_c299_16767  Length: 191  Mon Nov  7 16:18:16 2016  Type: P  Check: 4189  ..

       1  MGLLSIIRKQ KLKDREIRCL ILGLDNSGKS TIVNKLLPED EQNANDIMPT

      51  VGFQIHSLMI RDVMVSLWDV GGQRTLRPFW DNYFDKTQVM IWCIDVSLMM

     101  RLDETLQELS ELVNRDENRI GYECAVIIVL NKIDLVENKS ELCQRYASVE

     151  SKLKRLFKPD IRVALVQCSG ITGEGIDNLR DGLVEACHFT *


Protein Sequence for MIT_Spar_c164_16765:

MIT_Spar_c164_16765  Length: 219  Mon Nov  7 16:18:16 2016  Type: P  Check: 5811  ..

       1  MTKSKNPLHY FIEIIRGKPQ GVGKQEYMGL LSIIRKQKLK DKEIRCLILG

      51  LDNSGKSTIV NKLLPEDEQS KNGIMPTVGF QIHSLIIRDV TISLWDIGGQ

     101  RTLRPFWDNY FDKTQVMIWC IDVSLSMRFD ETLQELRELV NRDENRIGYE

     151  CAVIIVLNKI DLVESKSELH QRYILVESKL KCLFKPDIRI TLVRCSGITG

     201  GGIDDLRDHL VESCYFTQ*

Protein Sequence for MIT_Suva_c655_18676:

MIT_Suva_c655_18676  Length: 192  Mon Nov  7 16:18:16 2016  Type: P  Check: 4397  ..

       1  MGLLSIIRKQ KLKDREIRCL ILGLDNSGKS TIVNKLLPEN EQSANGITPT

      51  VGFQIHSLLI RDIMVSLWDI GGQRTLRPFW DNYFDKTQVM IWCIDVSLLV

     101  RFDETMQELE ELVNRDESRM GYECAVIIAL NKIDLVEDKT ELGRRCSLVE

     151  SELKCLFKPG IRLALVQCSG ITGEGLDDLR DRLVEFSRFA Q*


Protein Sequence for WashU_Sbay_Contig657.44:

WashU_Sbay_Contig657.44  Length: 192  Mon Nov  7 16:18:16 2016  Type: P  Check: 4397  ..

       1  MGLLSIIRKQ KLKDREIRCL ILGLDNSGKS TIVNKLLPEN EQSANGITPT

      51  VGFQIHSLLI RDIMVSLWDI GGQRTLRPFW DNYFDKTQVM IWCIDVSLLV

     101  RFDETMQELE ELVNRDESRM GYECAVIIAL NKIDLVEDKT ELGRRCSLVE

     151  SELKCLFKPG IRLALVQCSG ITGEGLDDLR DRLVEFSRFA Q*


Protein Sequence for WashU_Scas_Contig680.21:

WashU_Scas_Contig680.21  Length: 184  Mon Nov  7 16:18:16 2016  Type: P  Check: 7818  ..

       1  MGLLTIIKKQ KRKDKELKCL ILGLDNSGKS TLVNKLLPEE ERSQVEITPT

      51  IGFQIVNFNH GGYTISMWDI GGQTTLRPFW DNYFDKMEAL VWCVDVSAPS

     101  RFQESLRELS QLLNLDRTVE SGETLPFKLI VVLNKIDLVD WNSVEGQLDS

     151  LLSTHLEGIE YVSLAVSALN GQGTMELATL LLP*

Protein Sequence for WashU_Sklu_Contig1358.1:

WashU_Sklu_Contig1358.1  Length: 219  Mon Nov  7 16:18:16 2016  Type: P  Check: 2804  ..

       1  MECTSSSSSF NAIVKKKQVK QVKQQNMGLL TVIKKQKMKD KELRSLVLGL

      51  DNSGKSTVVD WLLERGEKRS RITPTVGFRI HTIEYAGHNV QLWDIGGQRT

     101  LRPFWDNYFD KTDVLLWVID VTARSRFSES FAELEKLLQD RDRLGYRCKM

     151  IVLLNKMDLI DEDESVTDDV RRAVIERFSL VNTXXXXXXX XXXXXXXVPC

     201  SGLTGFALGK LLEVVVSQ*

Protein Sequence for WashU_Skud_Contig1802.2:

WashU_Skud_Contig1802.2  Length: 192  Mon Nov  7 16:18:16 2016  Type: P  Check: 4941  ..

       1  MGLLSIIRKQ KLKDREIRCL ILGLDNSGKS TIVNKLLPED EQKVNGIMPT

      51  VGFQIHSLVT RDVMVSLWDI GGQRTLRQFW DNYFDKTQVM IWCIDVSLLM

     101  RFGETIRELR ELINRDENRI GYQCAVIIAL NKTDLVEDKS ELCRRQKLVE

     151  LELKSLFKPD IRIAVVQCSG ITGEGIEDLR DRLVESCHFP Q*


Protein Sequence for WashU_Smik_Contig1743.2:

WashU_Smik_Contig1743.2  Length: 191  Mon Nov  7 16:18:16 2016  Type: P  Check: 4189  ..

       1  MGLLSIIRKQ KLKDREIRCL ILGLDNSGKS TIVNKLLPED EQNANDIMPT

      51  VGFQIHSLMI RDVMVSLWDV GGQRTLRPFW DNYFDKTQVM IWCIDVSLMM

     101  RLDETLQELS ELVNRDENRI GYECAVIIVL NKIDLVENKS ELCQRYASVE

     151  SKLKRLFKPD IRVALVQCSG ITGEGIDNLR DGLVEACHFT *