Contents
This page describes how the systematic names of protein coding genes were assigned, and how systematic names of newly discovered protein coding ORFs are determined. Please note that new ORFs are assigned systematic names by SGD and MIPS. If you have discovered a new ORF, please contact SGD in order to have the proper systematic name assigned.
If you would like information on the systematic names of other types of genes (tRNAs, snoRNAs, ARSes, etc.), please see the Gene Nomenclature Conventions help page.

- The base name of the new ORF is the same as the closest centromere proximal ORF. The correct base names for the example new ORFs are in green below. Note that the closest centromere proximal ORF does not have to be on the same strand, although it can be. The new ORF may overlap an existing ORF. When this occurs, if any portion of the existing overlapping ORF is closer to the centromere than the new ORF, then the existing overlapping ORF is "centromere proximal" relative to the new ORF.
![]()
- The W/C suffix indicates the strandedness of the new ORF. The W/C suffix of the new ORF is independent of the strandedness of the centromere proximal ORF. The correct suffixes for the example new ORFs are in green below.
![]()
- An additional suffix, -[letter], is appended to the name of the new ORF. This distinguishes the new ORF from ORFs named in the original annotation. The letters are assigned in alphabetical order, per base name, in order of discovery (see additional examples below). The correct suffixes for the example new ORFs are in green below. If several neighboring new ORFs are added simultaneously, then the -[letter] suffix is assigned in alphabetical order, from the centromere to the telomere. However, since new neighboring new ORFs are not necessarily discovered simultaneously, the -[letter] suffix does not always indicate relative position.
![]()
Return to Saccharomyces Genome Database |
Send a Message to the SGD Curators ![]() |