SGD Paper Help



Wang Y, et al.  (2012) Interrogating noise in protein sequences from the perspective of protein-protein interactions prediction. J Theor Biol 315():64-70

Abstract: The past decades witnessed extensive efforts to study the relationship among proteins. Particularly, sequence-based protein-protein interactions (PPIs) prediction is fundamentally important in speeding up the process of mapping interactomes of organisms. High-throughput experimental methodologies make many model organism's PPIs known, which allows us to apply machine learning methods to learn understandable rules from the available PPIs. Under the machine learning framework, the composition vectors are usually applied to encode proteins as real-value vectors. However, the composition vector value might be highly correlated to the distribution of amino acids, i.e., amino acids which are frequently observed in nature tend to have a large value of composition vectors. Thus formulation to estimate the noise induced by the background distribution of amino acids may be needed during representations. Here, we introduce two kinds of denoising composition vectors, which were successfully used in construction of phylogenetic trees, to eliminate the noise. When validating these two denoising composition vectors on Escherichia coli (E. coli), Saccharomyces cerevisiae (S. cerevisiae) and human PPIs datasets, surprisingly, the predictive performance is not improved, and even worse than non-denoised prediction. These results suggest that the noise in phylogenetic tree construction may be valuable information in PPIs prediction.

Status: Published Type: Journal Article PubMed ID: 22999977

Topics addressed in this paper

  • To find other papers on a gene and topic, click on the colored ball in the appropriate box.
  • displays other papers with information about that topic for that gene.
  • displays other papers in SGD that are associated with that topic.
    The topic is addressed in these papers but does not describe a specific gene or chromosomal feature.
  • To go to the Locus page for a gene, click on the gene name.
Topics Topics not linked to Genes
Computational analysis yg ball
Omics yg ball

Author Searches

To find contact information or other publications by the authors of this paper, follow these three steps:
  1. (1) Choose an author,
  2. (2) Choose a search parameter,
  3. (3) Click to implement