Researchers develop innovative, freely available software to purify mass spectrometry data

December 6, 2010 By Karin Fleming

Researchers at University of Toronto and the Samuel Lunenfeld Research Institute of Mount Sinai Hospital, as well as colleagues in Michigan and Scotland, have developed an innovative computational approach -- the first of its kind worldwide -- designed to analyze mass spectrometry data. The software, called SAINT (Significance Analysis of INTeractome), will allow researchers globally to quickly assess the reliability and accuracy of protein binding data, helping to further their studies of cancer and other illnesses.

SAINT is described online in the leading international journal , published Nov. 5.

The tool was developed by Professor Anne-Claude Gingras of molecular genetics and principal investigator and Lea Reichmann Research Chair in Cancer Proteomics at the Samuel Lunenfeld Research Institute and Professor Alexey Nesvizhskii of the University of Michigan, Ann Arbor, to meet a key challenge in the field of protein (a technology that helps researchers separate, identify and quantify specific proteins): namely, to identify and quantify “true” protein interactions gleaned from mass spectrometry data and filter them from protein-based contaminants in the sample data. Previously, other approaches to analyze mass spectrometry data have not allowed for a probability-based model to measure and account for errors in a data set.

“SAINT allows researchers to identify the real protein interactions in their sample and to exclude the false positives generated through contaminants,” said Gingras. “In effect, the software applies a much needed filter to purify the data and remove ‘noise.’”

SAINT was introduced earlier this year by Gingras, Nesvizhskii and Professor Mike Tyers of when their teams generated a comprehensive road map of the signalling proteins that control many aspects of cellular behaviour in yeast cells ( — a discovery reported in a May issue of Science. However, the updated approach can be applied to a wider variety of datasets of various sizes and levels of protein network density.

“The first version of SAINT was intended to help us analyze very large scale datasets, which is something that only a few laboratories worldwide are generating,” said Nesvizhskii. “We then realized that, with some modifications, the same approach could be extended to researchers specifically interested in knowing what a few proteins interact with inside the cell. This makes our approach very useful to most cancer biologists using mass spectrometry, as it enables them to quantify their interaction data.”

Gingras and Nesvizhskii are former research associates and have collaborated on various research projects over the past eight years and bring together their expertise in biology and computational modelling, respectively.

“We come from completely different directions but have focused on this problem together,” said Gingras.

Gingras has encouraged many other scientists to use SAINT and the software is being implemented at research institutes internationally. Gingras and Nesvizhskii, with study first author Hyungwon Choi (a post-doctoral research fellow in the Nesvizhskii lab), recently held a workshop at the Lunenfeld Research Institute at which almost 100 Toronto-based scientists and industry representatives learned of the advantages offered by SAINT.

“SAINT is a new and important software tool that — for the first time — allows us to assign a confidence value to every protein-protein interaction that we identify in our mass spectrometry studies,” said Professor Brian Raught of medical biophysics, Canada Research Chair in Proteomics and Molecular Medicine and a scientist at the Ontario Cancer Institute. Raught attended Gingras’ workshop and uses the tool in his research.

“It has been truly instrumental in our work on deciphering the protein-protein interactions within complex intracellular regulatory networks, and thus represents a major advance in our field.”

Explore further: Casting the molecular net

More information: The software is available for downloading at .

Related Stories

Casting the molecular net

June 14, 2007

Scientists at the Samuel Lunenfeld Research Institute of Mount Sinai Hospital (Canada), European Molecular Biology Laboratory (Germany), and Massachusetts Institute of Technology (USA) have created a new computational method ...

Model aids understanding of protein networks

June 25, 2007

An international team of researchers, including several from MIT, has developed a computational model that helps identify relationships between proteins and the enzymes that regulate them.

Searching for purpose in proteins

October 29, 2010

As scientists continue to acquire immense amounts of genomic and biochemical data from various organisms, they routinely find themselves confronted by proteins of known structure but enigmatic function—and resolving ...

Clues to thyroid cancer unlocked

November 16, 2010

Researchers at the University of Toronto and Mount Sinai Hospital have utilized proteomic technologies to discover proteins secreted by thyroid cancer cell lines to identify and characterize potential biomarkers for the future ...

Recommended for you

How the finch changes its tune

August 3, 2015

Like top musicians, songbirds train from a young age to weed out errors and trim variability from their songs, ultimately becoming consistent and reliable performers. But as with human musicians, even the best are not machines. ...

Machine Translates Thoughts into Speech in Real Time

December 21, 2009

( -- By implanting an electrode into the brain of a person with locked-in syndrome, scientists have demonstrated how to wirelessly transmit neural signals to a speech synthesizer. The "thought-to-speech" process ...


Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.