Fungal Sequence Alignment

Help

This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

Currently, this page displays other fungal sequences from Cliften et al. and Kellis et al. We will soon include sequences from other fungal genomes from a variety of sources.

ClustalW Protein Alignment and Sequence for YOL006C and Homologs

Choose two or more sequences for alignment:
Pick a sequence type:
Best Hits & Orthologs"Other" Hits

Select or unselect multiple options for sequence
name by pressing the Control (PC) or Command
(Mac) key while clicking.

Align selected sequences
selected sequences (FASTA format)


Symbols:
* = identical
: = strong similarity
. = weak similarity

SGD_Scer_TOP1/YOL006C   1   MTIADASKVNHELSSDDDDDVPLSQTLKKRKVASMNSASLQDEAEPYDSD   50
MIT_Spar_c373_20279   1   MTIADASKAKHELPSDDDDDVPLSQTFKKRKVESTNSASLQDDADPYDSD   50
WashU_Scas_Contig644.14   1   MTTDYQGRKRDEADEADEENVPLSQVSKRRKLKNEDVKDESESDDESLAS   50
WashU_Sklu_Contig2419.8   1   ----MTVNTKHEISGDEEEDVPLSSILPKKRRKTNIKVEESEN-------   39
Symbols






. ..* ::::****. ::: . . .:.



SGD_Scer_TOP1/YOL006C   51   EAISKISKKKTKKIKTEPVQSSSLPSPPAKKSATSKPKKIKKEDGDVKVK   100
MIT_Spar_c373_20279   51   EAISKISKKKTKKIKTEPLPSSSSSPPPVKKSAVSKPKKIKKEDGDVKLK   100
WashU_Scas_Contig644.14   51   IFKKKSKEKLKKVKLEEKDEDATADGKETVKKVTKTSKKVKKE-------   93
WashU_Sklu_Contig2419.8   40   ----NVKPKVKTVKKDQEYDSDEVLSKAVKKNRKKTTKPVAKDEDD----   81
Symbols






: . * .. : . . *. ...* : *:



SGD_Scer_TOP1/YOL006C   101   TTKKEEQENEKKKRE-EEEEEDKKAKEEEEEYKWWEKENEDDTIKWVTLK   149
MIT_Spar_c373_20279   101   TTKKEDQENEKKKKKKQEEEEDKKAKEEEEEYKWWEKENEDDTIKWVTLR   150
WashU_Scas_Contig644.14   94   ----------KKTVKDESEQDQNDEEEENGEYKWWENENEDDSIKWTTLK   133
WashU_Sklu_Contig2419.8   82   ---------EEEEKAILKKEDQDAEERESDEFKWWEQENDDDSIKWTTLR   122
Symbols






:: .::::. :.*. *:****:**:**:***.**:



SGD_Scer_TOP1/YOL006C   150   HNGVIFPPPYQPLPSHIKLYYDGKPVDLPPQAEEVAGFFAALLESDHAKN   199
MIT_Spar_c373_20279   151   HNGVIFPPPYQPLPSHIKLYYDGKPVDLPPQAEEVAGFFAALLESDHAKN   200
WashU_Scas_Contig644.14   134   HNGVVIPPPYEPLPSHIKLYYDGKPVDLPPEAEEVAGFYGAMLETDHAKN   183
WashU_Sklu_Contig2419.8   123   HNGVIFPPEYEPLPLHVKLYYDGKPVDLPPQAEEVAGFFAALLESDHAKN   172
Symbols






****::** *:*** *:*************:*******:.*:**:*****



SGD_Scer_TOP1/YOL006C   200   PVFQKNFFNDFLQVLKESGGPLNGIEIKEFSRCDFTKMFDYFQLQKEQKK   249
MIT_Spar_c373_20279   201   PVFQQNFFNDFLQVLKESGGPLNEIEIKEFSRCDFTKMFDYFQLQKEQKK   250
WashU_Scas_Contig644.14   184   PVFQKNFFQDFLEVLKTAGGTKNGITIEKFEKCDFTKMFDYFELLKEEKK   233
WashU_Sklu_Contig2419.8   173   PVFQKNFFNDFLEVLKEEGGTGNGITIESLDKCDFTKMFDHFQLEKEQKK   222
Symbols






****:***:***:*** **. * * *:.:.:********:*:* **:**



SGD_Scer_TOP1/YOL006C   250   QLTSQEKKQIRLEREKFEEDYKFCELDGRREQVGNFKVEPPDLFRGRGAH   299
MIT_Spar_c373_20279   251   QLTSQEKKQIRLEREKFEEDYKFCELDGRREQVGNFKVEPPDLFRGRGAH   300
WashU_Scas_Contig644.14   234   KLTSQEKKQIRLDREKAEEKYKFCELDGRKEQVGNFKVEPPGLFRGRGAH   283
WashU_Sklu_Contig2419.8   223   QLSSQEKKQLKLEKEKLEEPFKFCYLDGRREQVGNFRIEPAGLFRGRGAH   272
Symbols






:*:******::*::** ** :*** ****:******::**..********



SGD_Scer_TOP1/YOL006C   300   PKTGKLKRRVNPEDIVLNLSKDAPVPPAPEGHKWGEIRHDNTVQWLAMWR   349
MIT_Spar_c373_20279   301   PKTGKLKRRVNPEDIVLNLSKDAPIPPAPEGHKWGEIRHDNTVQWLAMWR   350
WashU_Scas_Contig644.14   284   PKTGKLKRRVQPEDIILNLDKDAPIPEAPEGHHWGEIRNDNSVQWLAMWR   333
WashU_Sklu_Contig2419.8   273   PKTGKLKRRVYPEDVVLNLDKDAPIPEPPTGHKWGEIRHDNTVQWLAMWR   322
Symbols






********** ***::***.****:* .* **:*****:**:********



SGD_Scer_TOP1/YOL006C   350   ENIFNSFKYVRLAANSSLKGQSDYKKFEKARQLKSYIDAIRRDYTRNLKS   399
MIT_Spar_c373_20279   351   ENIFNSFKYVRLAANSSLKGQSDYKKFEKARQLKSYIDAIRRDYTRNLKS   400
WashU_Scas_Contig644.14   334   ENIFNSVKYVRLAANSSLKGQSDYKKFEKARQLKDHIDAIRKDYKKQLKS   383
WashU_Sklu_Contig2419.8   323   ENISNSFKYVRFAASSSLKGISDYKKFEKARQLKDHIDAVRKDYNKQLKS   372
Symbols






*** **.****:**.***** *************.:***:*:**.::***



SGD_Scer_TOP1/YOL006C   400   KVMLERQKAVAIYLIDVFALRAGGEKSEDEADTVGCCSLRYEHVTLKPPN   449
MIT_Spar_c373_20279   401   KVMLERQKAVAIYLIDVFALRAGGEKSEDEADTVGCCSLRYEHVTLKPPN   450
WashU_Scas_Contig644.14   384   EVMLERQKAVAIYLIDVFALRAGGEKSEDEADTVGCCSLRYEHVTLKPPN   433
WashU_Sklu_Contig2419.8   373   KVMLDRQMAVATYLIDVFALRAGGEKSEDEADTVGCCSLRYEHVTLKPPN   422
Symbols






:***:** *** **************************************



SGD_Scer_TOP1/YOL006C   450   TVIFDFLGKDSIRFYQEVEVDKQVFKNLTIFKRPPKQPGHQLFDRLDPSI   499
MIT_Spar_c373_20279   451   TVIFDFLGKDSIRFYQEVEVDKQVFKNLTIFKRPPKQPGHQLFDRLDPSI   500
WashU_Scas_Contig644.14   434   TVVFDFLGKDSIRFYQEVEVDKKVFKNLKIFKRPPKQPGHQLFDRLDPST   483
WashU_Sklu_Contig2419.8   423   TVIFDFLGKDSIRFYQEVQVDKQVFKNLTIFKRPPKQPGHQLFDRLDPSI   472
Symbols






**:***************:***:*****.********************



SGD_Scer_TOP1/YOL006C   500   LNKYLQNYMPGLTAKVFRTYNASKTMQDQLDLIPNKGSVAEKILKYNAAN   549
MIT_Spar_c373_20279   501   LNKYLQNYMPGLTAKVFRTYNASKTMQDQLDLIPNKGSVAEKILKYNAAN   550
WashU_Scas_Contig644.14   484   LNKHLQNYMPGLSAKVFRTYNASKTMQDQLDLIPNEGSVAEKLLRYNAAN   533
WashU_Sklu_Contig2419.8   473   LNKHLQNYMPGLTAKVFRTYNASKTMQDQLDLIPNEGTVAEKMLRYNAAN   522
Symbols






***:********:**********************:*:****:*:*****



SGD_Scer_TOP1/YOL006C   550   RTVAILCNHQRTVTKGHAQTVEKANNRIQELEWQKIRCKRAILQLDKDLL   599
MIT_Spar_c373_20279   551   RTVAILCNHQRTVTKGHAQTVEKANNRIQELEWQKVRCKRAILQLDKDLL   600
WashU_Scas_Contig644.14   534   RTVAILCNHQRTVTKGHAQSVQKANDKIKELEWQKIRLKKALLQLEPERL   583
WashU_Sklu_Contig2419.8   523   RTVAILCNHQRTVTKGHAAAVQKANDKIEELEWQKIRYKRAILQLDKNEL   572
Symbols






****************** :*:***::*:******:* *:*:***: : *



SGD_Scer_TOP1/YOL006C   600   KKEPKYFEEIDDLTKEDEATIHKRIIDREIEKYQRKFVRENDKRKFEKEE   649
MIT_Spar_c373_20279   601   KKEPKYFEEIDDLTKEDEAAIHKRIIEREIEKYQRKFVRENDKRKFEKEE   650
WashU_Scas_Contig644.14   584   KKNPAYFEEIEDMTKEDEAEIHRRIIEREIEKYHKKFARENDKRKFEKEE   633
WashU_Sklu_Contig2419.8   573   KKNLKYFEEISDLTKEEESAIHKRVIEREREKYNRKFLRENEKRKHEKQE   622
Symbols






**: *****.*:***:*: **:*:*:** ***::** ***:***.**:*



SGD_Scer_TOP1/YOL006C   650   LLPESQLKEWLEKVDEKKQEFEKELKTGEVELKSSWNSVEKIKAQVEKLE   699
MIT_Spar_c373_20279   651   LLPESQLKEWLEKVDEKKQEFEKELETGEIELKSTWNSVEKIKAQVEKLE   700
WashU_Scas_Contig644.14   634   LLPESQLKEWLDGVDELKKQYKEELKTGVVELKPTMNTVEKIEKQIERLD   683
WashU_Sklu_Contig2419.8   623   LLPEAQLTEWMTQVDELKEQYEKELETGDIELKPTLQSVDKLKVQIEKLE   672
Symbols






****:**.**: *** *:::::**:** :***.: ::*:*:: *:*:*:



SGD_Scer_TOP1/YOL006C   700   QRIQTSSIQLKDKEENSQVSLGTSKINYIDPRLSVVFCKKYDVPIEKIFT   749
MIT_Spar_c373_20279   701   QRIQTSSIQLKDKEENSQVSLGTSKINYIDPRLSVVFCKKYDVPIEKIFT   750
WashU_Scas_Contig644.14   684   NRIQTNSIQLKDKEENSQVSLGTSKINYIDPRLTVVFCKKYDVPIEKIFT   733
WashU_Sklu_Contig2419.8   673   QRIRTSTVQLKDREDNSQVALGTSKINYIDPRLSVVFCKKYNVPIEKVFT   722
Symbols






:**:*.::****:*:****:*************:*******:*****:**



SGD_Scer_TOP1/YOL006C   750   KTLREKFKWAIESVDENWRF   769
MIT_Spar_c373_20279   751   KTLREKFKWAIESVDENWRF   770
WashU_Scas_Contig644.14   734   KTLREKFKWAIESVDENWRF   753
WashU_Sklu_Contig2419.8   723   KSLREKFKWAIESADENWRF   742
Symbols






*:***********.******



Symbols:
* = identical
: = strong similarity
. = weak similarity


- Download all sequences in alignment, in FASTA format.
GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_TOP1/YOL006C:

SGD_Scer_TOP1/YOL006C  Length: 770  Fri Jan  6 19:49:53 2012  Type: P  Check: 5952  ..

       1  MTIADASKVN HELSSDDDDD VPLSQTLKKR KVASMNSASL QDEAEPYDSD

      51  EAISKISKKK TKKIKTEPVQ SSSLPSPPAK KSATSKPKKI KKEDGDVKVK

     101  TTKKEEQENE KKKREEEEEE DKKAKEEEEE YKWWEKENED DTIKWVTLKH

     151  NGVIFPPPYQ PLPSHIKLYY DGKPVDLPPQ AEEVAGFFAA LLESDHAKNP

     201  VFQKNFFNDF LQVLKESGGP LNGIEIKEFS RCDFTKMFDY FQLQKEQKKQ

     251  LTSQEKKQIR LEREKFEEDY KFCELDGRRE QVGNFKVEPP DLFRGRGAHP

     301  KTGKLKRRVN PEDIVLNLSK DAPVPPAPEG HKWGEIRHDN TVQWLAMWRE

     351  NIFNSFKYVR LAANSSLKGQ SDYKKFEKAR QLKSYIDAIR RDYTRNLKSK

     401  VMLERQKAVA IYLIDVFALR AGGEKSEDEA DTVGCCSLRY EHVTLKPPNT

     451  VIFDFLGKDS IRFYQEVEVD KQVFKNLTIF KRPPKQPGHQ LFDRLDPSIL

     501  NKYLQNYMPG LTAKVFRTYN ASKTMQDQLD LIPNKGSVAE KILKYNAANR

     551  TVAILCNHQR TVTKGHAQTV EKANNRIQEL EWQKIRCKRA ILQLDKDLLK

     601  KEPKYFEEID DLTKEDEATI HKRIIDREIE KYQRKFVREN DKRKFEKEEL

     651  LPESQLKEWL EKVDEKKQEF EKELKTGEVE LKSSWNSVEK IKAQVEKLEQ

     701  RIQTSSIQLK DKEENSQVSL GTSKINYIDP RLSVVFCKKY DVPIEKIFTK

     751  TLREKFKWAI ESVDENWRF*

Protein Sequence for MIT_Spar_c373_20279:

MIT_Spar_c373_20279  Length: 771  Fri Jan  6 19:49:53 2012  Type: P  Check: 5853  ..

       1  MTIADASKAK HELPSDDDDD VPLSQTFKKR KVESTNSASL QDDADPYDSD

      51  EAISKISKKK TKKIKTEPLP SSSSSPPPVK KSAVSKPKKI KKEDGDVKLK

     101  TTKKEDQENE KKKKKKQEEE EDKKAKEEEE EYKWWEKENE DDTIKWVTLR

     151  HNGVIFPPPY QPLPSHIKLY YDGKPVDLPP QAEEVAGFFA ALLESDHAKN

     201  PVFQQNFFND FLQVLKESGG PLNEIEIKEF SRCDFTKMFD YFQLQKEQKK

     251  QLTSQEKKQI RLEREKFEED YKFCELDGRR EQVGNFKVEP PDLFRGRGAH

     301  PKTGKLKRRV NPEDIVLNLS KDAPIPPAPE GHKWGEIRHD NTVQWLAMWR

     351  ENIFNSFKYV RLAANSSLKG QSDYKKFEKA RQLKSYIDAI RRDYTRNLKS

     401  KVMLERQKAV AIYLIDVFAL RAGGEKSEDE ADTVGCCSLR YEHVTLKPPN

     451  TVIFDFLGKD SIRFYQEVEV DKQVFKNLTI FKRPPKQPGH QLFDRLDPSI

     501  LNKYLQNYMP GLTAKVFRTY NASKTMQDQL DLIPNKGSVA EKILKYNAAN

     551  RTVAILCNHQ RTVTKGHAQT VEKANNRIQE LEWQKVRCKR AILQLDKDLL

     601  KKEPKYFEEI DDLTKEDEAA IHKRIIEREI EKYQRKFVRE NDKRKFEKEE

     651  LLPESQLKEW LEKVDEKKQE FEKELETGEI ELKSTWNSVE KIKAQVEKLE

     701  QRIQTSSIQL KDKEENSQVS LGTSKINYID PRLSVVFCKK YDVPIEKIFT

     751  KTLREKFKWA IESVDENWRF *

Protein Sequence for WashU_Scas_Contig644.14:

WashU_Scas_Contig644.14  Length: 754  Fri Jan  6 19:49:53 2012  Type: P  Check: 4001  ..

       1  MTTDYQGRKR DEADEADEEN VPLSQVSKRR KLKNEDVKDE SESDDESLAS

      51  IFKKKSKEKL KKVKLEEKDE DATADGKETV KKVTKTSKKV KKEKKTVKDE

     101  SEQDQNDEEE ENGEYKWWEN ENEDDSIKWT TLKHNGVVIP PPYEPLPSHI

     151  KLYYDGKPVD LPPEAEEVAG FYGAMLETDH AKNPVFQKNF FQDFLEVLKT

     201  AGGTKNGITI EKFEKCDFTK MFDYFELLKE EKKKLTSQEK KQIRLDREKA

     251  EEKYKFCELD GRKEQVGNFK VEPPGLFRGR GAHPKTGKLK RRVQPEDIIL

     301  NLDKDAPIPE APEGHHWGEI RNDNSVQWLA MWRENIFNSV KYVRLAANSS

     351  LKGQSDYKKF EKARQLKDHI DAIRKDYKKQ LKSEVMLERQ KAVAIYLIDV

     401  FALRAGGEKS EDEADTVGCC SLRYEHVTLK PPNTVVFDFL GKDSIRFYQE

     451  VEVDKKVFKN LKIFKRPPKQ PGHQLFDRLD PSTLNKHLQN YMPGLSAKVF

     501  RTYNASKTMQ DQLDLIPNEG SVAEKLLRYN AANRTVAILC NHQRTVTKGH

     551  AQSVQKANDK IKELEWQKIR LKKALLQLEP ERLKKNPAYF EEIEDMTKED

     601  EAEIHRRIIE REIEKYHKKF ARENDKRKFE KEELLPESQL KEWLDGVDEL

     651  KKQYKEELKT GVVELKPTMN TVEKIEKQIE RLDNRIQTNS IQLKDKEENS

     701  QVSLGTSKIN YIDPRLTVVF CKKYDVPIEK IFTKTLREKF KWAIESVDEN

     751  WRF*

Protein Sequence for WashU_Sklu_Contig2419.8:

WashU_Sklu_Contig2419.8  Length: 743  Fri Jan  6 19:49:53 2012  Type: P  Check: 9978  ..

       1  MTVNTKHEIS GDEEEDVPLS SILPKKRRKT NIKVEESENN VKPKVKTVKK

      51  DQEYDSDEVL SKAVKKNRKK TTKPVAKDED DEEEEKAILK KEDQDAEERE

     101  SDEFKWWEQE NDDDSIKWTT LRHNGVIFPP EYEPLPLHVK LYYDGKPVDL

     151  PPQAEEVAGF FAALLESDHA KNPVFQKNFF NDFLEVLKEE GGTGNGITIE

     201  SLDKCDFTKM FDHFQLEKEQ KKQLSSQEKK QLKLEKEKLE EPFKFCYLDG

     251  RREQVGNFRI EPAGLFRGRG AHPKTGKLKR RVYPEDVVLN LDKDAPIPEP

     301  PTGHKWGEIR HDNTVQWLAM WRENISNSFK YVRFAASSSL KGISDYKKFE

     351  KARQLKDHID AVRKDYNKQL KSKVMLDRQM AVATYLIDVF ALRAGGEKSE

     401  DEADTVGCCS LRYEHVTLKP PNTVIFDFLG KDSIRFYQEV QVDKQVFKNL

     451  TIFKRPPKQP GHQLFDRLDP SILNKHLQNY MPGLTAKVFR TYNASKTMQD

     501  QLDLIPNEGT VAEKMLRYNA ANRTVAILCN HQRTVTKGHA AAVQKANDKI

     551  EELEWQKIRY KRAILQLDKN ELKKNLKYFE EISDLTKEEE SAIHKRVIER

     601  EREKYNRKFL RENEKRKHEK QELLPEAQLT EWMTQVDELK EQYEKELETG

     651  DIELKPTLQS VDKLKVQIEK LEQRIRTSTV QLKDREDNSQ VALGTSKINY

     701  IDPRLSVVFC KKYNVPIEKV FTKSLREKFK WAIESADENW RF*