Bayesian model selection shows extremely polarized behavior when the models are wrong

February 7, 2018, Chinese Academy of Sciences
Classification of Bayesian model-selection problems involving two equally right or equally wrong models. Credit: ZHU Tianqi

Scientists from University College London (UCL) and the Academy of Mathematics and Systems Science, Chinese Academy of Sciences (CAS, AMSS), have reported progress in understanding problems associated with Bayesian model selection. The research suggests that the Bayesian method tends to produce very high-posterior probabilities for estimated evolutionary trees even if the trees are clearly wrong, and offers a possible explanation for this phenomenon.

Model comparison is widely used in various branches of sciences in which scientific hypotheses are formulated as statistical models and tested using observed data. However, model comparison is a thorny issue in both classical statistics and Bayesian statistics.

In classical statistics, two nested models are compared. The framework does not work when the compared models are not nested. In contrast, Bayesian statistics compares different models by calculating their posterior probabilities, which indicates our confidence or belief in the model.

Not only do the two methodologies spring from drastically different philosophies, they may also produce opposite conclusions in the analysis of the same data. Bayesian model selection is known to converge to the true model if the true model is included among the models under consideration.

That is, when scientists collect more data, the posterior probability for the right model will increase and approach 100 percent, and they will thus be increasingly certain which is the true model.

However, if all the considered models are wrong, the behavior of the Bayesian method is unknown.

Scientists have characterized Bayesian model selection problems, and categorized them into three types, each of which shows a different behavior.

In the most scientifically interesting case, i.e., when the compared models are distinct and nearly equally wrong, Bayesian model selection shows problematic polarized behavior: It tends to support one model with full force in some datasets, but support another model in other datasets.

The result may be summarized using the following analogy: Suppose the world is gray, but we ask a sage whether it is black or white. He takes a deep look at the world and says it is black, with total confidence. But the next time we ask the same question, he says it is white, again with total confidence.

This study was motivated by problems in molecular phylogenetics, which is the science of working out the relationships among species using genetic data, represented by evolutionary trees.

These different trees are opposing statistical models in the Bayesian analysis of the data. Evolutionary biologists have long observed that the method tends to produce very high posterior probabilities for the estimated evolutionary trees (very often 100 percent), even if the are clearly wrong.

Our results provide a possible explanation for this unpleasant behavior. The implications of the results for the use of Bayesian selection in testing opposing scientific hypotheses in general are yet to be explored.

Explore further: Advances in Bayesian methods for big data

More information: Bayesian selection of misspecified models is overconfident and may cause spurious posterior probabilities for phylogenetic trees, PNAS, DOI: 10.1073/pnas.1712673115 , http://www.pnas.org/content/early/2018/02/02/1712673115

Related Stories

Advances in Bayesian methods for big data

June 2, 2017

In the Big Data era, many scientific and engineering domains are producing massive data streams, with petabyte and exabyte scales becoming increasingly common. Besides the explosive growth in volume, Big Data also has high ...

Bayesian method useful for noncompleters of 400-m walk

September 27, 2017

(HealthDay)—A Bayesian multiple imputation (MI) method is useful for calculating the speeds of those who are unable to complete the 400-m walk test within the time constraint (noncompleters), according to a study published ...

How to grow an evolutionary tree

December 8, 2016

You've seen them in popular science news, biology textbooks, wall plaques in museums, perhaps even as tattoos. Evolutionary trees are among the most instantly recognisable, ubiquitous and iconic images of science.

Recommended for you

Using organoids to understand how the brain wrinkles

February 20, 2018

A team of researchers working at the Weizmann Institute of Science has found that organoids can be used to better understand how the human brain wrinkles as it develops. In their paper published in the journal Nature Physics, ...

Pattern formation—the paradoxical role of turbulence

February 19, 2018

The formation of self-organizing molecular patterns in cells is a critical component of many biological processes. Researchers from Ludwig-Maximilians-Universitaet (LMU) in Munich have proposed a new theory to explain how ...

Bringing a hidden superconducting state to light

February 16, 2018

A team of scientists has detected a hidden state of electronic order in a layered material containing lanthanum, barium, copper, and oxygen (LBCO). When cooled to a certain temperature and with certain concentrations of barium, ...

0 comments

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.