Type: Journal Publication
Abstract: Motivation: Mass spectrometry is a complex technique used for large-scale protein profiling with clinical and pharmaceutical applications. While individual components in the system have been studied extensively, little work has been done to integrate various modules and evaluate them from a systems point of view. Results: In this work, we investigate this problem by putting together the different modules in a typical proteomics work flow, in order to capture and analyze key factors that impact the number of identified peptides and quantified proteins, protein quantification error, differential expression results, and classification performance. The proposed proteomics pipeline model can be used to optimize the work flow as well as to pinpoint critical bottlenecks worth investing time and resources into for improving performance. Using the model-based approach proposed here, one can study systematically the critical problem of proteomic biomarker discovery, by means of simulation using ground-truthed synthetic MS data.
Cited as: Y. Sun, U.M. Braga-Neto and E.R. Dougherty, "A systematic model of the LC-MS proteomics pipeline", BMC Genomics, GENSIPS'2011 Special Issue, Vol. 13, Suppl. 6, October 2012, p. S2, doi:10.1186/1471-2164-13-S6-S2