New & Noteworthy
December 17, 2014
Have you ever wondered what’s happening to your favorite protein as it’s hanging out in the cell? SGD’s advanced search tool, YeastMine, now includes four new templates that can be used to find protein modification and abundance data.
The Gene -> Protein Modifications template retrieves phosphorylation, ubiquitination, succinylation, acetylation and methylation data, currently curated from the following 11 publications: Peng et al. 2003, Hitchcock et al. 2003, Seyfried et al. 2008, Vogtle et al. 2009, Ziv et al. 2011, Mommen et al. 2012, Henriksen et al. 2012, Swaney et al. 2013, Kolawa et al. 2013, Weinert et al. 2013, and Wang et al. 2014.
The Gene -> Experimental N-termini and N-terminal modifications template retrieves experimentally-determined amino-terminal sequence and acetylation data, currently curated from Vogtle et al. 2009 and Mommen et al. 2012.
Lastly, two new templates pull protein abundance data curated from Ghaemmaghami et al. 2003. Gene -> Protein Abundance retrieves molecules/cell counts for a gene or list of genes. The same data can be quickly filtered using the Retrieve -> Proteins in a given molecules/cell abundance range template.
Please explore these new YeastMine protein data templates, and send us your feedback.
December 8, 2014
At SGD, we are expanding our scope to provide annotation and comparative analyses of all major budding yeast strains, and are making progress in our move toward providing multiple reference genomes. To this end, the following new S. cerevisiae genomes have been incorporated into SGD as “Alternative References”: CEN.PK, D273-10B, FL100, JK9-3d, RM11-1a, SEY6210, SK1, Sigma1278b, W303, X2180-1A, Y55. These genomes are accessible via Sequence, Strain, and Contig pages, and are the genomes for which we have curated the most phenotype data, and for which we aim to curate specific functional information. It is important to emphasize that we are not abandoning a standard sequence; S288C is still in place as “The Reference Genome”. However, we do recognize that it is helpful for students and researchers to be able to ‘shift the reference’, selecting the genome that is most appropriate and informative for a specific area of study.
These new genome sequences have been also been added to SGD’s BLAST datasets, multiple sequence alignments, the Pattern Matching tool, and the Downloads site. Please explore these new genomes, and send us your feedback.
August 25, 2014
New Sequence pages are now available in SGD for virtually every yeast gene (e.g., HMRA1 Sequence page), and include genomic sequence annotations for the Reference Strain S288C, as well as several Alternative Reference Genomes from strains such as CEN.PK, RM11-1a, Sigma1278b, and W303 (more Alternative References coming soon). Each page includes an Overview section containing descriptive information, maps depicting genomic context in Reference Strain S288C (as shown below) and Alternative Reference strains, as well as chromosomal and relative coordinates in S288C.
The sequence itself includes display options for genomic DNA, coding DNA, or translated protein.
Also available on each Sequence page are links to redesigned S288C Chromosome pages, links to new Contig pages for Alternative Reference Genomes, and a Downloads menu for easy access to DNA sequences of several other industrial strains and environmental isolates. The new Sequence, Chromosome, and Contig pages make use of many of the features you enjoy on other new or redesigned pages at SGD, including graphical display of data, sortable tables, and responsive visualizations. The Sequence pages also provide seamless access to other tools at SGD such as BLAST and Web Primer. Please explore these new pages, accessible via the Sequence tab on your favorite Locus Summary page, and send us your feedback.
March 11, 2014
The SGD Gene Associations file (GAF; gene_association.sgd) contains Gene Ontology (GO) annotations for all yeast genes, in a standard file format specified by the GO Consortium. We are changing the taxon identifier in this file to be consistent with the reference genome sequence at GenBank and protein entries at UniProt.
Until now, the taxon identifier in column 13 of SGD’s GAF has been 4932, which refers to Saccharomyces cerevisiae in general rather than to a specific S. cerevisiae strain. Starting March 8th, 2014, we have changed this to taxon ID 559292, which is specific to the S288C strain used for the S. cerevisiae reference genome sequence.
Please note that the taxon ID 559292 merely reflects the sequence (genome) to which the geneIDs in column 2 are mapped. SGD will continue to capture gene functions (GO annotations) for all strains of S. cerevisiae. Please contact us if you have any questions.
The S. cerevisiae GO annotations (GAF) can be downloaded from SGD’s Downloads site.