October 26, 2015

Scientists devise new method to solve significant variables conundrum

Scientists at Columbia University, the University of California, San Diego (UCSD) and Harvard University have presented an alternative method to address the challenge of using significant variables to make useful predictions in areas such as complex disease.

Shaw-Hwa Lo and Tian Zheng of Columbia, Adeline Lo of UCSD and Herman Chernoff of Harvard present findings in a paper to appear in Proceedings of the National Academy of Sciences on Monday, October 26, that demonstrates that statistically significant variables are not necessarily predictive. In addition, very predictive variables do not necessarily have to appear significant and thereby evade a researcher using statistical significance as a criterion to evaluate variables for prediction.

Statistical significance is a traditional, long-standing measure in any researcher's toolbox but thus far, scientists have been puzzled by the inability to use results of statistically significant variants in complex diseases to make predictions useful for personalized medicine. Why aren't significant variables leading to good prediction of outcomes? This conundrum affects both simple and complex data in a broad range of science and social science fields.

In their findings, the authors demonstrate that what makes variables good for prediction versus significance depends on different properties of the underlying distributions. They suggest that progress in prediction requires efforts toward a new research agenda of searching for a novel criterion to retrieve highly predictive variables rather than highly significant variables.

They also present an alternative approach, the Partition Retention method, which displays strong power in prediction. The researchers applied the method to a well-known breast cancer dataset, the van't Veer dataset, and reduced the prediction error rate from 30% to 8%, finding breast cancer genes that are highly predictive - and not significant.

Their results show that using their method to examine the top five interacting breast cancer genes they were able to find predicted breast cancer relapse effectively, when the outcome would not have appeared using significance measures. Previous methods were only 70% correct in predicting something as significant as breast cancer relapse. Using the new method and avoiding significance as a criterion, scientists correctly predicted such an outcome with 92% accuracy.

"What we're saying here is that using the previously very well-known methods might not be appropriate when we care about predicting important outcomes," says Professor Lo. "Our alternative approach seems to do very well in prediction, and is relevant for many scientific fields."

More information: Why significant variables aren't automatically good predictors, www.pnas.org/cgi/doi/10.1073/pnas.1518285112

Journal information: Proceedings of the National Academy of Sciences

Provided by Columbia University

Citation: Scientists devise new method to solve significant variables conundrum (2015, October 26) retrieved 10 May 2024 from https://phys.org/news/2015-10-scientists-method-significant-variables-conundrum.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

New algorithm aimed at combating science's reproducibility problem

41 shares

Feedback to editors

Scientists devise new method to solve significant variables conundrum

Scientists unlock key to breeding 'carbon gobbling' plants with a major appetite

Clues from deep magma reservoirs could improve volcanic eruption forecasts

Study shows AI conversational agents can help reduce interethnic prejudice during online interactions

NASA's Chandra notices the galactic center is venting

Wildfires in old-growth Amazon forest areas rose 152% in 2023, study shows

GoT-ChA: New tool reveals how gene mutations affect cells

Accelerating material characterization: Machine learning meets X-ray absorption spectroscopy

Life expectancy study reveals longest and shortest-lived cats

New research shows microevolution can be used to predict how evolution works on much longer timescales

Stable magnetic bundles achieved at room temperature and zero magnetic field

Relevant PhysicsForums posts

Formal definition of multiplication for real and complex numbers

Identity Theorem for power series

Probability - two possible points of view?

How to interpret Pascal's Triangle for negative numbers?

Tennis Probabilities Challenge

Scissor Blade Problem

New algorithm aimed at combating science's reproducibility problem

Drug sensitivity predicted computationally

New model better predicts breast cancer risk in African American women

Team develops prognostic test for E2F4 in breast cancer

New breast cancer risk prediction model more accurate current model

New method to predict increased risk of non-familial breast cancer

Random processes shape science and math: Researchers propose a unified, probabilistic framework

Study of new method used to preserve privacy with US census data suggests accuracy has suffered

New study is first to use statistical physics to corroborate 1940s social balance theory

Too many vehicles, slow reactions and reckless merging: New math model explains how traffic and bacteria move

Theoretical biologists test two modes of social reasoning and find surprising truths in simplicity

New algorithm cuts through 'noisy' data to better predict tipping points

Medical Xpress

Tech Xplore

Science X

Scientists devise new method to solve significant variables conundrum

Scientists unlock key to breeding 'carbon gobbling' plants with a major appetite

Clues from deep magma reservoirs could improve volcanic eruption forecasts

Study shows AI conversational agents can help reduce interethnic prejudice during online interactions

NASA's Chandra notices the galactic center is venting

Wildfires in old-growth Amazon forest areas rose 152% in 2023, study shows

GoT-ChA: New tool reveals how gene mutations affect cells

Accelerating material characterization: Machine learning meets X-ray absorption spectroscopy

Life expectancy study reveals longest and shortest-lived cats

New research shows microevolution can be used to predict how evolution works on much longer timescales

Stable magnetic bundles achieved at room temperature and zero magnetic field

Relevant PhysicsForums posts

Related Stories

New algorithm aimed at combating science's reproducibility problem

Drug sensitivity predicted computationally

New model better predicts breast cancer risk in African American women

Team develops prognostic test for E2F4 in breast cancer

New breast cancer risk prediction model more accurate current model

New method to predict increased risk of non-familial breast cancer

Recommended for you

Random processes shape science and math: Researchers propose a unified, probabilistic framework

Study of new method used to preserve privacy with US census data suggests accuracy has suffered

New study is first to use statistical physics to corroborate 1940s social balance theory

Too many vehicles, slow reactions and reckless merging: New math model explains how traffic and bacteria move

Theoretical biologists test two modes of social reasoning and find surprising truths in simplicity

New algorithm cuts through 'noisy' data to better predict tipping points

Newsletter sign up

Donate and enjoy an ad-free experience