Protein abundance data, domains, shared domains with other proteins, protein sequence retrieval for various strains, sequence-based physico-chemical properties, protein modification sites, and external identifiers for the protein.
AlphaFold, developed by DeepMind, is an AI program that accurately predicts protein structures from amino acid sequences, enabling visualization of protein conformations. The predicted structures can be accessed through the Protein Data Bank (PDB) and AlphaFold Protein Structure Database.
Contains experimentally-derived protein half-life data obtained using stable isotope labeling by amino acids (SILAC) coupled with mass spectrometry. This section also contains protein abundance data for both untreated and treated cells obtained from over 20 studies. These data have been normalized and converted to a common unit of molecules per cell.
Collection of computationally identified domains and motifs, as determined by InterProScan analysis; includes protein coordinates for the domain, a domain Description, a Source and corresponding accession ID, and the number of S. cerevisiae genes that share the same domain.
Increase the total number of rows showing on this page by using the pull-down located below the table, or use the page scroll at the table's top right to browse through its pages; use the arrows to the right of a column header to sort by that column; filter the table using the "Filter" box at the top of the table.
Evidence ID | Analyze ID | Gene | Gene Systematic Name | Protein Coordinates | Accession ID | Description | Source | No. of Genes with Domain |
---|
Visual representation of the locations of the domains within the protein, as listed in the Domains and Classification table. Each row displays the domain(s) derived from a different Source, with domains color-coded according to this Source.
Scroll over a domain to view its exact coordinates and its Description.
Protein sequence for the given gene in S288C and other strains, when available. Use the pull-down menu under "Strain" to select the sequence for a specific strain. The displayed sequence can be downloaded in FASTA format as a .txt file. Amino acids displayed in blue represent modification sites. More detailed evidence for these modification sites is presented in the Post-translational Modifications table, located just below the protein sequence.
This locus is not translated into a protein.
1 MESQQLSNYP HISHGSACAS VTSKEVHTNQ DPLDVSASKI QEYDKASTKA NSQQTTTPAS
61 SAVPENPHHA SPQPASVPPP QNGPYPQQCM MTQNQANPSG WSFYGHPSMI PYTPYQMSPM
121 YFPPGPQSQF PQYPSSVGTP LSTPSPESGN TFTDSSSADS DMTSTKKYVR PPPMLTSPND
181 FPNWVKTYIK FLQNSNLGGI IPTVNGKPVR PITDDELTFL YNAFQIFAPS QFLPTWVKDI
241 LSVDYTDIMK ILSKSIEKMQ SDTQEANDIV TLANLQYNGS TPADAFETKV TNIIDRLNNN
301 GIHINNKVAC QLIMRGLSGE YKFLRYTRHR HLNMTVAELF LDIHAIYEEQ QGSRNSKPNY
361 RRNPSDEKND SRSYTNTTKP KVIARNPQKT NNSKSKTARA HNVSTSNNSP STDNDSISKS
421 TTEPIQLNNK HDLHLGQKLT ESTVNHTNHS DDELPGHLLL DSGASRTLIR SAHHIHSASS
481 NPDINVVDAQ KRNIPINAIG DLQFHFQDNT KTSLKVLHTP NIAYDLLSLN ELAAVDITAC
541 FTKNVLERSD GTVLAPIVKY GDFYWVSKKY LLPSNISVPT INNVHTSEST RKYPYPFIHR
601 MLAHANAQTI RYSLKNNTIT YFNESDVDWS SAIDYQCPDC LIGKSTKHRH IKGSRLKYQN
661 SYEPFQYLHT DIFGPVHNLP KSAPSYFISF TDETTKFRWV YPLHDRREDS ILDVFTTILA
721 FIKNQFQASV LVIQMDRGSE YTNRTLHKFL EKNGITPCYT TTADSRAHGV AERLNRTLLD
781 DCRTQLQCSG LPNHLWFSAI EFSTIVRNSL ASPKSKKSAR QHAGLAGLDI STLLPFGQPV
841 IVNDHNPNSK IHPRGIPGYA LHPSRNSYGY IIYLPSLKKT VDTTNYVILQ GKDSRLDQFN
901 YDALTFDEDL NRLTASYQSF IASNEIQQSN DLNIESDHDF QSDIELYPEQ PRNVLSKAVS
961 PTDSTPPSTH TEDSKRVSKT NIRAPREVDP NISESNILPS KKRSSTPQIS DIESTDSGGM
1021 HRLDVPLLAP MSQSNTHESS YASKSKDFRH SDSYSDNETN HTNVPISSTG GTNNKTVPQT
1081 SEQETEKRII HRFTSDRILP SSESNSLHHV VPIKTSDTCF KENTEESIIA DLPLPDLPPE
1141 PPTELSDSFK ELPPINSRQT NSSLGGIGDS NAYTTINSKK RSLEDNETEI KVSRDTWNTK
1201 NMRSLEPPRS KKRIHLIAAV KAVKSIKPIR TTLRYDEAIT YNKDIKEKEK YIEAYHKEVN
1261 QLLKMKTWDT DKYYDRKEID PKRVINSMFI FNRKRDGTHK ARFVARGDIQ HPDTYDSGMQ
1321 SNTVHHYALM TSLSLALDNN YYITQLDISS AYLYADIKEE LYIRPPPHLG MNDKLIRLKK
1381 SLYGLKQSGA NWYETIKSYL IKQCGMEEVR GWSCVFKNSQ VTICLFVDDM ILFSKDLNSN
1441 KRIIAKLKMQ YDTKIINLGE SDDEIQHDIL GLEIKYQRGK YMKLGMENSL TEKIPKLNVP
1501 LNPNGRKLGA PGQPGLYINQ QELELEEDDY KMKVHEMQKL IGLASYVGYK FRFDLLYYIN
1561 TLAQHILFPS KQVLDMTYEL IQFIWNTRDK QLIWHKSKPV KPTNKLVVIS DASYGNQPYY
1621 KSQIGNIYLL NGKVIGGKST KASLTCTSTT EAEIHAISES VPLLNNLSYL IQELDKKPIT
1681 KGLLTDSKST ISIIISNNEE KFRNRFFGTK AMRLRDEVSG NHLHVCYIET KKNIADVMTK
1741 PLPIKTFKLL TNKWIH*
* Blue amino acids indicate modification sites. More information below.
Modification sites for the protein in the selected strain, based on the presence of a residue in the specific strain, as inferred from experimental evidence.
3 entries for 3 sitesIncrease the total number of rows showing on this page by using the pull-down located below the table, or use the page scroll at the table's top right to browse through its pages; use the arrows to the right of a column header to sort by that column; filter the table using the "Filter" box at the top of the table.
Site | Modification | Modifier | Reference |
---|---|---|---|
K419 | ubiquitinylated lysine | Fang NN, et al. (2014) PMID: 25344756 | |
S450 | phosphorylated residue | Holt LJ, et al. (2009) PMID: 19779198 | |
K1732 | ubiquitinylated lysine | RSP5 | Fang NN, et al. (2014) PMID: 25344756 |
Calculated protein properties, including amino acid composition, length, coding region calculations, and atomic composition.
Sort table using the arrow to the right of a column header to sort by that column; download all properties as a .txt file using the "Download Properties" button.
Amino Acid | Frequency | Percentage |
---|---|---|
A | 80 | 4.56 |
C | 15 | 0.85 |
D | 106 | 6.04 |
E | 83 | 4.73 |
F | 53 | 3.02 |
G | 68 | 3.87 |
H | 60 | 3.42 |
I | 127 | 7.23 |
K | 129 | 7.35 |
L | 145 | 8.26 |
M | 32 | 1.82 |
N | 124 | 7.06 |
P | 111 | 6.32 |
Q | 75 | 4.27 |
R | 73 | 4.16 |
S | 179 | 10.19 |
T | 132 | 7.52 |
V | 73 | 4.16 |
W | 14 | 0.80 |
Y | 77 | 4.38 |
Length (a.a): | 1756 |
Molecular Weight (Da): | 198884.4 |
Isoelectric Point (pl): | 8.68 |
Formula: | C8804H13836N2437O2722S47 |
Aliphatic Index: | 73.77 |
Instability Index: | 46.14 |
Codon Bias: | 0.04 |
Codon Adaptation Index: | 0.14 |
Frequence of Optimal Codons: | 0.46 |
Hydropathicity of Protein: | -0.63 |
Aromaticity Score: | 0.08 |
ALL Cys residues appear as half cystines: | 192605.0 |
NO Cys residues appear as half cystines: | 191730.0 |
Sort table using the arrow to the right of a column header to sort by that column; download all properties as a .txt file using the "Download Properties" button.
Atom | Frequency | Percentage |
---|
Data not found or not available for S288C
List of external identifiers for the protein from various database sources.
18 entries for 6 sourcesIncrease the total number of rows showing on this page by using the pull-down located below the table, or use the page scroll at the table's top right to browse through its pages; use the arrows to the right of a column header to sort by that column; filter the table using the "Filter" box at the top of the table.
AlphaFold Protein Structure | Pfam domains | SUPERFAMILY | UniProtKB