Fungal Sequence Alignment

Help

This page displays a Saccharomyces cerevisiae protein in a ClustalW alignment with identified orthologs in other fungal species.

Currently, this page displays other fungal sequences from Cliften et al. and Kellis et al. We will soon include sequences from other fungal genomes from a variety of sources.

ClustalW Protein Alignment and Sequence for YMR190C and Homologs

Choose two or more sequences for alignment:
Pick a sequence type:
Best Hits & Orthologs"Other" Hits

Select or unselect multiple options for sequence
name by pressing the Control (PC) or Command
(Mac) key while clicking.

Align selected sequences
selected sequences (FASTA format)


Symbols:
* = identical
: = strong similarity
. = weak similarity

SGD_Scer_SGS1/YMR190C   1   --MVTKPSHNLRREHKWLKETATLQEDKDFVFQAIQKHIANKRPKTNSPP   48
MIT_Spar_c161_16574   1   --MVTKPSHNLRREHKWLKETATLQEDRDFVFQAIQKHITNKRPKTNSPP   48
WashU_Sbay_Contig651.38   1   --MVTKPSHNLRREHKWLKETATLQEDKEFVFQAIQKHIANKKPKTNSPP   48
WashU_Scas_Contig720.57   1   MSARTKPSHNLRREHKWLKETNVLQHDRDLVLQVINKNITHKRIRSTADA   50
Symbols






***************** .**.*:::*:*.*:*:*::*: ::.: .



SGD_Scer_SGS1/YMR190C   49   TTPSKDECGPG--TTNFITSIPASGPTNTATKQHEVMQTLSNDTEWLSYT   96
MIT_Spar_c161_16574   49   TTPSRDEYGAG--ATNLLTSVPASASTNTATKQHEVLQTLSNDTEWLSYP   96
WashU_Sbay_Contig651.38   49   ITPSRDVYEAG--AINLLTSLPGSGSTSTATKQREGTQTLSNDSEWLSYP   96
WashU_Scas_Contig720.57   51   ITKQASPNFAGRNATNEADMIIPDSSSSIYQKDSNVVYSIDIDLEHNIPM   100
Symbols






* . . .* : * : ...:. *: : ::. * *



SGD_Scer_SGS1/YMR190C   97   ATSNQYADVPMVDIPASTSVVSNPRTPNGSKTHNFNTFRPHMASSLVEND   146
MIT_Spar_c161_16574   97   ATSNQYTDVPMVDIPASTSVVSNPRTPSGSKAHNFNTYPSHMGSSLVEND   146
WashU_Sbay_Contig651.38   97   ATSNQYTDIPIGDIPASTSIVSNPRTPSGSKTHSANMLRPPMASSLVENS   146
WashU_Scas_Contig720.57   101   LAPEQPLLPTITNMKPPQERVVNSTRKVSATIISNNLTELNIAQSSS---   147
Symbols






:.:* .: :: .. . * *. .:. . * :..*



SGD_Scer_SGS1/YMR190C   147   SSRNLGSRNNNKSVIDNSSIGKQLEN---DIKLEVIRLQGSLIMALKEQS   193
MIT_Spar_c161_16574   147   SSRNLDCRNNIKDLTDNSGTNKQFEN---NMKVEVIRLQSSLIAALKEQS   193
WashU_Sbay_Contig651.38   147   SSRNMDRRNNTNDVIDSASTSKQIEKHQDRLERDIVRLQGSLITVLKEQS   196
WashU_Scas_Contig720.57   148   ------------------------------TKDELISLQEALIKKLKDQS   167
Symbols






: ::: ** :** **:**



SGD_Scer_SGS1/YMR190C   194   KLLLQKCSIIESTSLSEDAKRLQLSRDIRPQLSNMSIRIDSLEKEIIKAK   243
MIT_Spar_c161_16574   194   KLLMQKCSIIESTSLSEDAKRLQLSRDIRPQLSNMSIRIDSLEKEIVKSK   243
WashU_Sbay_Contig651.38   197   KLLLQKCSIIESTSLSEDAKRLQLSRDIRPQLSNVSIKINSLEREIAQIK   246
WashU_Scas_Contig720.57   168   NTFSRKCSIMESTSLSEDNKKSQLSKEINPALIRLKSEISDLESRIEPLA   217
Symbols






: : :****:******** *: ***::*.* * .:. .*..** .*



SGD_Scer_SGS1/YMR190C   244   KDGMSKDQSKGRSQVSSQDDNIISSILPSPLEYNTSSRNSNLTSTTATTV   293
MIT_Spar_c161_16574   244   KDGTSKGHIKGRSQIPSQDDNIISSILPSPLENNTSSRNSNLTNTAATTV   293
WashU_Sbay_Contig651.38   247   KDAVSKSQTKDHSQPPSQNNNIISSILPSPLENNLSFKNTNIATTTAFTA   296
WashU_Scas_Contig720.57   218   RKAKTES-------------------------------------------   224
Symbols






:.. ::.



SGD_Scer_SGS1/YMR190C   294   TKALAITGAKQNITNNTGKNSNNDSNNDDLIQVLDDEDDIDCDPPVILKE   343
MIT_Spar_c161_16574   294   TKALVITATKQTISNNTNKNSNNNSNDDDLIQVLDDEDDVDYDPPITLKE   343
WashU_Sbay_Contig651.38   297   TMAVAAGAATAIPTKTPNSNANNENHDDDLIQVLDDEDDANNDPTVVIQK   346
WashU_Scas_Contig720.57   225   ------AHPIIESTGTPSNLQGQMRSTEGLIEVLEDNDETSNEEPTILSQ   268
Symbols






. : .... .: :.**:**:*:*: . : . :.:



SGD_Scer_SGS1/YMR190C   344   GAPHSP--------AFPHLHMTSEEQDELTRRRNMRSREPVNYRIPDRDD   385
MIT_Spar_c161_16574   344   GAPQSP--------AFPPLHMSSEEQDELIRSRSMRSREPVNYRIPDRDD   385
WashU_Sbay_Contig651.38   347   GLIDMPQNNIPISPSPPRLNMTTEEQDELTRRRNMRSREPVNYRIPERDD   396
WashU_Scas_Contig720.57   269   QSVIHN-----------------SIPTPPPQRPLRERHDRINYRIPEKDD   301
Symbols






. : . :: :*****::**



SGD_Scer_SGS1/YMR190C   386   PFDYVMGKSLRDDYPDVEREEDELTMEAEDDAHSSYMTTRDEEKEENELL   435
MIT_Spar_c161_16574   386   PFDYVMGKSLKDDYPDVEREEDELTMEAEEDAHSSYMTTRDEEKEESDLL   435
WashU_Sbay_Contig651.38   397   PFDYIMGKSLREDCPEIEREEDELTMEAEEDDHSSYMTTRDEEKEENDLL   446
WashU_Scas_Contig720.57   302   PFDYKLG----KLDATIESTQHENTMDAEDDDASSYMTTRDEERDER--L   345
Symbols






**** :* . . :* :.* **:**:* **********::* *



SGD_Scer_SGS1/YMR190C   436   NQSDFDFVVNDDLDPTQDTDYHDNMDVSANIQESSQEGDTRSTITLSQNK   485
MIT_Spar_c161_16574   436   NQSDFDFVVNDGLDPTQDTDYHDNLDVSANIQENSQEDDTRSTITLSQNK   485
WashU_Sbay_Contig651.38   447   NQSDFDFVVNDGLEATQDTDYHDNLDGSANIKEDPLEDDTRSTITLSQNK   496
WashU_Scas_Contig720.57   346   NQSDLEFVINDAPDQDAPYKDDSEGENIELINSSPLKELPMPAPTNDMVD   395
Symbols






****::**:** : . ..: : *:... : . .: * . .



SGD_Scer_SGS1/YMR190C   486   NVQVILSSPTAQSVPSNGQNQIGVEHIDLLEDDLEKDAILDDSMSFSFGR   535
MIT_Spar_c161_16574   486   NVQVILSSPTAQSASSNCQKLTGVEHIDLLEDDLEKDAILDDSMSFSFGH   535
WashU_Sbay_Contig651.38   497   NVQVILSSPTTQSDVSNRQRQTDVEHIDLLEDDLERDAILDDSISFSFGN   546
WashU_Scas_Contig720.57   396   HDEIINDS--------------DEPIIDLLDEEDELDIIIEDSMPNLHNT   431
Symbols






: ::* .* . ****::: * * *::**:. ..



SGD_Scer_SGS1/YMR190C   536   QHMPMSHSDLELIDSEKENEDFEEDNNNNGIEYLSDSDLERFDEERENRT   585
MIT_Spar_c161_16574   536   QHVPMSHSDLELIDSEKENGGYEEDNNNNDVEYLSDSDLERFDEERENRT   585
WashU_Sbay_Contig651.38   547   QNLPMSHSDLELIDSEKESGDYDEDNNNNNFENLSDSDLERFDEERENRT   596
WashU_Scas_Contig720.57   432   RTNNINNTQLSISHSDLELINSDENEDDDEEDNLSDSDLEQFDEERENGT   481
Symbols






: :.:::*.: .*: * . :*::::: : *******:******* *



SGD_Scer_SGS1/YMR190C   586   QVADIQELDNDLKIITERKLTGDNEHPPPSWSPKIKREKSSVSQK-----   630
MIT_Spar_c161_16574   586   QVADIQELDNDLKIITERKLTGDNNHRSPSWSPIIKMEKSSVCQK-----   630
WashU_Sbay_Contig651.38   597   QAADIQELDNDLKIITERKLTDDHDLQSPSWSPTIKTERSNASQKK----   642
WashU_Scas_Contig720.57   482   QMVNIEELDNDLKIIAERKLQEDVAKDWSPVKIKNEPKDTFISPTKNKLL   531
Symbols






* .:*:*********:**** * .. . : : : . .



SGD_Scer_SGS1/YMR190C   631   ----DEEDDFDDDFSLSDIVSKSNLSS---KTNGPTYPWSDEVLYRLHEV   673
MIT_Spar_c161_16574   631   ----GEEDDFDDDFSLSDIVSKSNLPS---KTNAPTYPWSDEVLYRLHEV   673
WashU_Sbay_Contig651.38   643   ----GEEDDFDDDFSLSDIVSRSKLSS---KAKGPTYPWSDEVLYRLHEI   685
WashU_Scas_Contig720.57   532   DFPDDDDDDDDDDFSLLNEIASDHTKKNHNSTTSPRHPWSDEVDFKLHEV   581
Symbols






.::** ****** : :: .: . .:..* :****** ::***:



SGD_Scer_SGS1/YMR190C   674   FKLPGFRPNQLEAVNATLQGKDVFVLMPTGGGKSLCYQLPAVVKSGKTHG   723
MIT_Spar_c161_16574   674   FKLPGFRPNQQEAVNATLQGKDVFVLMPTGGGKSLCYQLPAVVKSGKTHG   723
WashU_Sbay_Contig651.38   686   FKLPGFRPNQLEAVNATLQGKDVFVLMPTGGGKSLCYQLPAVVKLGKTQG   735
WashU_Scas_Contig720.57   582   FKLPGFRSNQLEAVDATLSGKDVFVLMPTGGGKSLCYQLPAVVKSGKTKG   631
Symbols






*******.** ***:***.************************* ***:*



SGD_Scer_SGS1/YMR190C   724   TTIVISPLISLMQDQVEHLLNKNIKASMFSSRGTAEQRRQTFNLFINGLL   773
MIT_Spar_c161_16574   724   TTIVISPLISLMQDQVEHLLNKNIKASMFSSRGTAEQRRQTFNLFINGLL   773
WashU_Sbay_Contig651.38   736   TTIVVSPLISLMQDQVEHLLNRNIKASMFSSKGTAEQRRQTFNLFIHGLL   785
WashU_Scas_Contig720.57   632   TTIVISPLISLMQDQVEHLLAKDIKACMFSSKGTAEQRKQTFNLFIHGLL   681
Symbols






****:*************** ::***.****:******:*******:***



SGD_Scer_SGS1/YMR190C   774   DLVYISPEMISASEQCKRAISRLYADGKLARIVVDEAHCVSNWGHDFRPD   823
MIT_Spar_c161_16574   774   DLVYISPEMISASEQCKRAISRLYADGKLARIVVDEAHCVSNWGHDFRPD   823
WashU_Sbay_Contig651.38   786   DLVYISPEMISASEQCKRAISRLYADGKLARIVVDEAHCVSNWGHDFRPD   835
WashU_Scas_Contig720.57   682   DLIYISPEMISASEQCKRAISKLHSDGKLARIVVDEAHCVSNWGHDFRPD   731
Symbols






**:******************:*::*************************



SGD_Scer_SGS1/YMR190C   824   YKELKFFKREYPDIPMIALTATASEQVRMDIIHNLELKEPVFLKQSFNRT   873
MIT_Spar_c161_16574   824   YKELKFFKREYPDIPMIALTATASEQVRMDIIHNLELKEPVFLKQSFNRT   873
WashU_Sbay_Contig651.38   836   YKELKFFKREYPDIPMIALTATASEQVRMDIIHNLELKDPVFLKQSFNRT   885
WashU_Scas_Contig720.57   732   YKELKYFKREYPDVPMIALTATASEQVRMDIIHNLELKDPVFLKQSFNRT   781
Symbols






*****:*******:************************:***********



SGD_Scer_SGS1/YMR190C   874   NLYYEVNKKTKNTIFEICDAVKSRFKNQTGIIYCHSKKSCEQTSAQMQRN   923
MIT_Spar_c161_16574   874   NLYYEVNKKTKNTIFEICDAVKSRFKNQTGIIYCHSKKSCEQTSAQMQRN   923
WashU_Sbay_Contig651.38   886   NLYYEVKKKTKNTIFEICDAVKLRFKNETGIIYCHSKKSCEQTSAQMQRN   935
WashU_Scas_Contig720.57   782   NLYYEVKKKTKNAIFEIIDSIKTKFRNQTGIIYCHSKNSCEQTSDKLQRQ   831
Symbols






******:*****:**** *::* :*:*:*********:****** ::**:



SGD_Scer_SGS1/YMR190C   924   GIKCAYYHAGMEPDERLSVQKAWQADEIQVICATVAFGMGIDKPDVRFVY   973
MIT_Spar_c161_16574   924   GIKCAYYHAGMEPDERLSVQKAWQADEIQVICATVAFGMGIDKPDVRFVY   973
WashU_Sbay_Contig651.38   936   GIKCAFYHAGMEPDERLSVQKAWQADEIQVICATVAFGMGIDKPDVRFVY   985
WashU_Scas_Contig720.57   832   GIKCAFYHAGMEPDDRLKVQKAWQADEIQVICATVAFGMGIDKPDVRFVY   881
Symbols






*****:********:**.********************************



SGD_Scer_SGS1/YMR190C   974   HFTVPRTLEGYYQETGRAGRDGNYSYCITYFSFRDIRTMQTMIQKDKNLD   1023
MIT_Spar_c161_16574   974   HFTVPRTLEGYYQETGRAGRDGKYSYCITYFSFRDIRTMQTMIQKDKNLD   1023
WashU_Sbay_Contig651.38   986   HFTVPRTLEGYYQETGRAGRDGNYSYCITYFSFRDIRTMQTMIQKDKNLD   1035
WashU_Scas_Contig720.57   882   HFTVPRTLEGYYQETGRAGRDGKFSYCITYFSFRDVRTMQTMIQKDENLD   931
Symbols






**********************::***********:**********:***



SGD_Scer_SGS1/YMR190C   1024   RENKEKHLNKLQQVMAYCDNVTDCRRKLVLSYFNEDFDSKLCHKNCDNCR   1073
MIT_Spar_c161_16574   1024   RENKEKHLNKLQQVMAYCDNVTDCRRKLVLSYFNEDFDSKSCHKNCDNCR   1073
WashU_Sbay_Contig651.38   1036   RENKEKHLNKLQQVMAYCDNVTDCRRKLVLSYFNEDFDSKLCHKNCDNCR   1085
WashU_Scas_Contig720.57   932   RQNKEKHLNKLQQVMSYCDNMTDCRRKLVLSYFNEDFDSKLCHKNCDNCK   981
Symbols






*:*************:****:******************* ********:



SGD_Scer_SGS1/YMR190C   1074   NSANVINEERDVTEPAKKIVKLVESIQNERVTIIYCQDVFKGSRSSKIVQ   1123
MIT_Spar_c161_16574   1074   NSANVINEERDVTEPAKKIVKLVESIQNERVTIIYCQDIFKGSRSSKIVQ   1123
WashU_Sbay_Contig651.38   1086   NSANVINEERDVTDPAKKIVKLVESIQNERVTIIYCQDIFKGSRTSKIVQ   1135
WashU_Scas_Contig720.57   982   NSAHVITEERDITEISKKIISLVEAIQNDRVTLIQCQDIVKGSRSSKIIQ   1031
Symbols






***:**.****:*: :***:.***:***:***:* ***:.****:***:*



SGD_Scer_SGS1/YMR190C   1124   ANHDTLEEHGIGKSMQKSEIERIFFHLITIRVLQEYSIMNNSGFASSYVK   1173
MIT_Spar_c161_16574   1124   ANHDTLEEHGLGKSMQKSEIERIFFHLITIRVLQEYSIMNNSGFASSYVK   1173
WashU_Sbay_Contig651.38   1136   ANHDTLDEHGLGKSMQKSEIERIFFHLITIRVLQEYSIMNNSGFASSYVK   1185
WashU_Scas_Contig720.57   1032   SGYTNLEQYGLGKKMSKSDIERIFFHLVTIRILQEYSIMNNGGFAASYVK   1081
Symbols






:.: .*:::*:**.*.**:********:***:*********.***:****



SGD_Scer_SGS1/YMR190C   1174   VGPNAKKLLTGKMEIKMQFTISAPNSRPSTSSSFQANEDNIPVIAQKSTT   1223
MIT_Spar_c161_16574   1174   VGPNAKKLLTGKMEIKMQFTISAPNSRPSTSSSFQPNEDNTPVIAQKSTT   1223
WashU_Sbay_Contig651.38   1186   VGPNARKLLSGKMEIKMQFTISAPNSRPSTSSSHQSSGDNTPAVAQRSTT   1235
WashU_Scas_Contig720.57   1082   VGPNSKKLLDGKLDVAMQFTSSSSAGTRTPSSNEASRPSPSRMSSSR---   1128
Symbols






****::*** **::: **** *:. . :.**. . . :.:



SGD_Scer_SGS1/YMR190C   1224   IGGNVAANPPRFISAKEHLRSYTYGGSTMGSSHPITLKNTSDLRSTQELN   1273
MIT_Spar_c161_16574   1224   IGGSVAANPPRFISAKEHLRSYTYGGSTMGASHPISLKNTNDLRSTQELN   1273
WashU_Sbay_Contig651.38   1236   IGGNTAANPARFISAKEHLRSYTYSSSTTETSHPISLKNNSELHSTQELN   1285
WashU_Scas_Contig720.57   1129   ---SEQMVPKPAFISAKQLSSFSYRGNNTEVARPISLQNNTESRSTQELI   1175
Symbols






. * : : ::* *::* ... ::**:*:*..: :*****



SGD_Scer_SGS1/YMR190C   1274   NLRMTYERLRELSLNLGNRMVPPVGNFMPDSILKKMAAILPMNDSAFATL   1323
MIT_Spar_c161_16574   1274   NLRMTYERLRELSLNLGNRMVPPVGNFMPDCILKKMAAILPMNDLAFATL   1323
WashU_Sbay_Contig651.38   1286   NLRMTYERLRELSLNLGNRMVPPVGNFMPDSILKKMATILPMSESAFASL   1335
WashU_Scas_Contig720.57   1176   DITAAYNKLREVSLGLGNRMNPPVMNFLSDVALKKLATILPVTPEQFSSI   1225
Symbols






:: :*::***:**.***** *** **:.* ***:*:***:. *:::



SGD_Scer_SGS1/YMR190C   1324   GTVEDKYRRRFKYFKATIADLSKKRSSEDHEKYDTILNDEFVNRAAASSN   1373
MIT_Spar_c161_16574   1324   GTVEDKYRRRFKYFKATIADLSKKRSSADHEKYDTILNEESANGASTSDS   1373
WashU_Sbay_Contig651.38   1336   GPVEDKYCRRFKYFKATIADLSNKRSSREHEKYDTILNEESVNRISGSTN   1385
WashU_Scas_Contig720.57   1226   PSIGERQRKKFKYFKSTILNLRKNRRNISNNKDSIVLSNGSSIPELNSQN   1275
Symbols






.: :: ::*****:** :* ::* . .::* . :*.: * .



SGD_Scer_SGS1/YMR190C   1374   GIAQSTGTKSKFFGANLNEAKENEQIINQIRQSQLPKNTTSSKSGTRSIS   1423
MIT_Spar_c161_16574   1374   GIVQSTGTRSKFFDANPNEARENEQIINQIRQSQLPKNATSSKSGTRAIC   1423
WashU_Sbay_Contig651.38   1386   QIIPTTGTRSRFFDANPNEAKENEQIINQIRESQLPNSSISAKASTRAIS   1435
WashU_Scas_Contig720.57   1276   ----SSGTKSHYFSLDPDEEQYNQAIVNQLRDAQFKSASNSAES---TQL   1318
Symbols






::**:*::*. : :* : *: *:**:*::*: . : *::: :



SGD_Scer_SGS1/YMR190C   1424   KSSKKSANGRRGFRNYRGHYRGRK--   1447
MIT_Spar_c161_16574   1424   KPVKKSTNGRRGFRNYRGHYRGRK--   1447
WashU_Sbay_Contig651.38   1436   KPTKKPANGKRGFKNYKGHYRRRK--   1459
WashU_Scas_Contig720.57   1319   RPKQNNYGSKRSYTNNYKSYNRKKKT   1344
Symbols






:. :: ..:*.: * *. :*



Symbols:
* = identical
: = strong similarity
. = weak similarity


- Download all sequences in alignment, in FASTA format.
GCG format sequences are displayed below.




Protein Sequence for SGD_Scer_SGS1/YMR190C:

SGD_Scer_SGS1/YMR190C  Length: 1448  Fri Jan  6 16:03:50 2012  Type: P  Check: 6103  ..

       1  MVTKPSHNLR REHKWLKETA TLQEDKDFVF QAIQKHIANK RPKTNSPPTT

      51  PSKDECGPGT TNFITSIPAS GPTNTATKQH EVMQTLSNDT EWLSYTATSN

     101  QYADVPMVDI PASTSVVSNP RTPNGSKTHN FNTFRPHMAS SLVENDSSRN

     151  LGSRNNNKSV IDNSSIGKQL ENDIKLEVIR LQGSLIMALK EQSKLLLQKC

     201  SIIESTSLSE DAKRLQLSRD IRPQLSNMSI RIDSLEKEII KAKKDGMSKD

     251  QSKGRSQVSS QDDNIISSIL PSPLEYNTSS RNSNLTSTTA TTVTKALAIT

     301  GAKQNITNNT GKNSNNDSNN DDLIQVLDDE DDIDCDPPVI LKEGAPHSPA

     351  FPHLHMTSEE QDELTRRRNM RSREPVNYRI PDRDDPFDYV MGKSLRDDYP

     401  DVEREEDELT MEAEDDAHSS YMTTRDEEKE ENELLNQSDF DFVVNDDLDP

     451  TQDTDYHDNM DVSANIQESS QEGDTRSTIT LSQNKNVQVI LSSPTAQSVP

     501  SNGQNQIGVE HIDLLEDDLE KDAILDDSMS FSFGRQHMPM SHSDLELIDS

     551  EKENEDFEED NNNNGIEYLS DSDLERFDEE RENRTQVADI QELDNDLKII

     601  TERKLTGDNE HPPPSWSPKI KREKSSVSQK DEEDDFDDDF SLSDIVSKSN

     651  LSSKTNGPTY PWSDEVLYRL HEVFKLPGFR PNQLEAVNAT LQGKDVFVLM

     701  PTGGGKSLCY QLPAVVKSGK THGTTIVISP LISLMQDQVE HLLNKNIKAS

     751  MFSSRGTAEQ RRQTFNLFIN GLLDLVYISP EMISASEQCK RAISRLYADG

     801  KLARIVVDEA HCVSNWGHDF RPDYKELKFF KREYPDIPMI ALTATASEQV

     851  RMDIIHNLEL KEPVFLKQSF NRTNLYYEVN KKTKNTIFEI CDAVKSRFKN

     901  QTGIIYCHSK KSCEQTSAQM QRNGIKCAYY HAGMEPDERL SVQKAWQADE

     951  IQVICATVAF GMGIDKPDVR FVYHFTVPRT LEGYYQETGR AGRDGNYSYC

    1001  ITYFSFRDIR TMQTMIQKDK NLDRENKEKH LNKLQQVMAY CDNVTDCRRK

    1051  LVLSYFNEDF DSKLCHKNCD NCRNSANVIN EERDVTEPAK KIVKLVESIQ

    1101  NERVTIIYCQ DVFKGSRSSK IVQANHDTLE EHGIGKSMQK SEIERIFFHL

    1151  ITIRVLQEYS IMNNSGFASS YVKVGPNAKK LLTGKMEIKM QFTISAPNSR

    1201  PSTSSSFQAN EDNIPVIAQK STTIGGNVAA NPPRFISAKE HLRSYTYGGS

    1251  TMGSSHPITL KNTSDLRSTQ ELNNLRMTYE RLRELSLNLG NRMVPPVGNF

    1301  MPDSILKKMA AILPMNDSAF ATLGTVEDKY RRRFKYFKAT IADLSKKRSS

    1351  EDHEKYDTIL NDEFVNRAAA SSNGIAQSTG TKSKFFGANL NEAKENEQII

    1401  NQIRQSQLPK NTTSSKSGTR SISKSSKKSA NGRRGFRNYR GHYRGRK*


Protein Sequence for MIT_Spar_c161_16574:

MIT_Spar_c161_16574  Length: 1448  Fri Jan  6 16:03:50 2012  Type: P  Check: 5626  ..

       1  MVTKPSHNLR REHKWLKETA TLQEDRDFVF QAIQKHITNK RPKTNSPPTT

      51  PSRDEYGAGA TNLLTSVPAS ASTNTATKQH EVLQTLSNDT EWLSYPATSN

     101  QYTDVPMVDI PASTSVVSNP RTPSGSKAHN FNTYPSHMGS SLVENDSSRN

     151  LDCRNNIKDL TDNSGTNKQF ENNMKVEVIR LQSSLIAALK EQSKLLMQKC

     201  SIIESTSLSE DAKRLQLSRD IRPQLSNMSI RIDSLEKEIV KSKKDGTSKG

     251  HIKGRSQIPS QDDNIISSIL PSPLENNTSS RNSNLTNTAA TTVTKALVIT

     301  ATKQTISNNT NKNSNNNSND DDLIQVLDDE DDVDYDPPIT LKEGAPQSPA

     351  FPPLHMSSEE QDELIRSRSM RSREPVNYRI PDRDDPFDYV MGKSLKDDYP

     401  DVEREEDELT MEAEEDAHSS YMTTRDEEKE ESDLLNQSDF DFVVNDGLDP

     451  TQDTDYHDNL DVSANIQENS QEDDTRSTIT LSQNKNVQVI LSSPTAQSAS

     501  SNCQKLTGVE HIDLLEDDLE KDAILDDSMS FSFGHQHVPM SHSDLELIDS

     551  EKENGGYEED NNNNDVEYLS DSDLERFDEE RENRTQVADI QELDNDLKII

     601  TERKLTGDNN HRSPSWSPII KMEKSSVCQK GEEDDFDDDF SLSDIVSKSN

     651  LPSKTNAPTY PWSDEVLYRL HEVFKLPGFR PNQQEAVNAT LQGKDVFVLM

     701  PTGGGKSLCY QLPAVVKSGK THGTTIVISP LISLMQDQVE HLLNKNIKAS

     751  MFSSRGTAEQ RRQTFNLFIN GLLDLVYISP EMISASEQCK RAISRLYADG

     801  KLARIVVDEA HCVSNWGHDF RPDYKELKFF KREYPDIPMI ALTATASEQV

     851  RMDIIHNLEL KEPVFLKQSF NRTNLYYEVN KKTKNTIFEI CDAVKSRFKN

     901  QTGIIYCHSK KSCEQTSAQM QRNGIKCAYY HAGMEPDERL SVQKAWQADE

     951  IQVICATVAF GMGIDKPDVR FVYHFTVPRT LEGYYQETGR AGRDGKYSYC

    1001  ITYFSFRDIR TMQTMIQKDK NLDRENKEKH LNKLQQVMAY CDNVTDCRRK

    1051  LVLSYFNEDF DSKSCHKNCD NCRNSANVIN EERDVTEPAK KIVKLVESIQ

    1101  NERVTIIYCQ DIFKGSRSSK IVQANHDTLE EHGLGKSMQK SEIERIFFHL

    1151  ITIRVLQEYS IMNNSGFASS YVKVGPNAKK LLTGKMEIKM QFTISAPNSR

    1201  PSTSSSFQPN EDNTPVIAQK STTIGGSVAA NPPRFISAKE HLRSYTYGGS

    1251  TMGASHPISL KNTNDLRSTQ ELNNLRMTYE RLRELSLNLG NRMVPPVGNF

    1301  MPDCILKKMA AILPMNDLAF ATLGTVEDKY RRRFKYFKAT IADLSKKRSS

    1351  ADHEKYDTIL NEESANGAST SDSGIVQSTG TRSKFFDANP NEARENEQII

    1401  NQIRQSQLPK NATSSKSGTR AICKPVKKST NGRRGFRNYR GHYRGRK*


Protein Sequence for WashU_Sbay_Contig651.38:

WashU_Sbay_Contig651.38  Length: 1460  Fri Jan  6 16:03:50 2012  Type: P  Check: 2909  ..

       1  MVTKPSHNLR REHKWLKETA TLQEDKEFVF QAIQKHIANK KPKTNSPPIT

      51  PSRDVYEAGA INLLTSLPGS GSTSTATKQR EGTQTLSNDS EWLSYPATSN

     101  QYTDIPIGDI PASTSIVSNP RTPSGSKTHS ANMLRPPMAS SLVENSSSRN

     151  MDRRNNTNDV IDSASTSKQI EKHQDRLERD IVRLQGSLIT VLKEQSKLLL

     201  QKCSIIESTS LSEDAKRLQL SRDIRPQLSN VSIKINSLER EIAQIKKDAV

     251  SKSQTKDHSQ PPSQNNNIIS SILPSPLENN LSFKNTNIAT TTAFTATMAV

     301  AAGAATAIPT KTPNSNANNE NHDDDLIQVL DDEDDANNDP TVVIQKGLID

     351  MPQNNIPISP SPPRLNMTTE EQDELTRRRN MRSREPVNYR IPERDDPFDY

     401  IMGKSLREDC PEIEREEDEL TMEAEEDDHS SYMTTRDEEK EENDLLNQSD

     451  FDFVVNDGLE ATQDTDYHDN LDGSANIKED PLEDDTRSTI TLSQNKNVQV

     501  ILSSPTTQSD VSNRQRQTDV EHIDLLEDDL ERDAILDDSI SFSFGNQNLP

     551  MSHSDLELID SEKESGDYDE DNNNNNFENL SDSDLERFDE ERENRTQAAD

     601  IQELDNDLKI ITERKLTDDH DLQSPSWSPT IKTERSNASQ KKGEEDDFDD

     651  DFSLSDIVSR SKLSSKAKGP TYPWSDEVLY RLHEIFKLPG FRPNQLEAVN

     701  ATLQGKDVFV LMPTGGGKSL CYQLPAVVKL GKTQGTTIVV SPLISLMQDQ

     751  VEHLLNRNIK ASMFSSKGTA EQRRQTFNLF IHGLLDLVYI SPEMISASEQ

     801  CKRAISRLYA DGKLARIVVD EAHCVSNWGH DFRPDYKELK FFKREYPDIP

     851  MIALTATASE QVRMDIIHNL ELKDPVFLKQ SFNRTNLYYE VKKKTKNTIF

     901  EICDAVKLRF KNETGIIYCH SKKSCEQTSA QMQRNGIKCA FYHAGMEPDE

     951  RLSVQKAWQA DEIQVICATV AFGMGIDKPD VRFVYHFTVP RTLEGYYQET

    1001  GRAGRDGNYS YCITYFSFRD IRTMQTMIQK DKNLDRENKE KHLNKLQQVM

    1051  AYCDNVTDCR RKLVLSYFNE DFDSKLCHKN CDNCRNSANV INEERDVTDP

    1101  AKKIVKLVES IQNERVTIIY CQDIFKGSRT SKIVQANHDT LDEHGLGKSM

    1151  QKSEIERIFF HLITIRVLQE YSIMNNSGFA SSYVKVGPNA RKLLSGKMEI

    1201  KMQFTISAPN SRPSTSSSHQ SSGDNTPAVA QRSTTIGGNT AANPARFISA

    1251  KEHLRSYTYS SSTTETSHPI SLKNNSELHS TQELNNLRMT YERLRELSLN

    1301  LGNRMVPPVG NFMPDSILKK MATILPMSES AFASLGPVED KYCRRFKYFK

    1351  ATIADLSNKR SSREHEKYDT ILNEESVNRI SGSTNQIIPT TGTRSRFFDA

    1401  NPNEAKENEQ IINQIRESQL PNSSISAKAS TRAISKPTKK PANGKRGFKN

    1451  YKGHYRRRK*

Protein Sequence for WashU_Scas_Contig720.57:

WashU_Scas_Contig720.57  Length: 1345  Fri Jan  6 16:03:50 2012  Type: P  Check: 9270  ..

       1  MSARTKPSHN LRREHKWLKE TNVLQHDRDL VLQVINKNIT HKRIRSTADA

      51  ITKQASPNFA GRNATNEADM IIPDSSSSIY QKDSNVVYSI DIDLEHNIPM

     101  LAPEQPLLPT ITNMKPPQER VVNSTRKVSA TIISNNLTEL NIAQSSSTKD

     151  ELISLQEALI KKLKDQSNTF SRKCSIMEST SLSEDNKKSQ LSKEINPALI

     201  RLKSEISDLE SRIEPLARKA KTESAHPIIE STGTPSNLQG QMRSTEGLIE

     251  VLEDNDETSN EEPTILSQQS VIHNSIPTPP PQRPLRERHD RINYRIPEKD

     301  DPFDYKLGKL DATIESTQHE NTMDAEDDDA SSYMTTRDEE RDERLNQSDL

     351  EFVINDAPDQ DAPYKDDSEG ENIELINSSP LKELPMPAPT NDMVDHDEII

     401  NDSDEPIIDL LDEEDELDII IEDSMPNLHN TRTNNINNTQ LSISHSDLEL

     451  INSDENEDDD EEDNLSDSDL EQFDEERENG TQMVNIEELD NDLKIIAERK

     501  LQEDVAKDWS PVKIKNEPKD TFISPTKNKL LDFPDDDDDD DDDDFSLLNE

     551  IASDHTKKNH NSTTSPRHPW SDEVDFKLHE VFKLPGFRSN QLEAVDATLS

     601  GKDVFVLMPT GGGKSLCYQL PAVVKSGKTK GTTIVISPLI SLMQDQVEHL

     651  LAKDIKACMF SSKGTAEQRK QTFNLFIHGL LDLIYISPEM ISASEQCKRA

     701  ISKLHSDGKL ARIVVDEAHC VSNWGHDFRP DYKELKYFKR EYPDVPMIAL

     751  TATASEQVRM DIIHNLELKD PVFLKQSFNR TNLYYEVKKK TKNAIFEIID

     801  SIKTKFRNQT GIIYCHSKNS CEQTSDKLQR QGIKCAFYHA GMEPDDRLKV

     851  QKAWQADEIQ VICATVAFGM GIDKPDVRFV YHFTVPRTLE GYYQETGRAG

     901  RDGKFSYCIT YFSFRDVRTM QTMIQKDENL DRQNKEKHLN KLQQVMSYCD

     951  NMTDCRRKLV LSYFNEDFDS KLCHKNCDNC KNSAHVITEE RDITEISKKI

    1001  ISLVEAIQND RVTLIQCQDI VKGSRSSKII QSGYTNLEQY GLGKKMSKSD

    1051  IERIFFHLVT IRILQEYSIM NNGGFAASYV KVGPNSKKLL DGKLDVAMQF

    1101  TSSSSAGTRT PSSNEASRPS PSRMSSSRSE QMVPKPAFIS AKQLSSFSYR

    1151  GNNTEVARPI SLQNNTESRS TQELIDITAA YNKLREVSLG LGNRMNPPVM

    1201  NFLSDVALKK LATILPVTPE QFSSIPSIGE RQRKKFKYFK STILNLRKNR

    1251  RNISNNKDSI VLSNGSSIPE LNSQNSSGTK SHYFSLDPDE EQYNQAIVNQ

    1301  LRDAQFKSAS NSAESTQLRP KQNNYGSKRS YTNNYKSYNR KKKT*