March 7, 2018

Want to predict March Madness? New method identifies key statistics, outperforms others in accuracy

University of Illinois researchers have developed a method using causal inference for predicting upsets in the NCAA Men's Basketball Tournament that outperforms many other techniques. In addition to improved accuracy, the method stands out because it relies on publicly available data, making it reproducible and more accessible for others to use.

The paper reporting the method is published in the American Statistical Association (ASA) Journal of Quantitative Analysis in Sports (JQAS) by Sheldon H. Jacobson (University of Illinois at Urbana-Champaign), Jason J. Sauppe (University of Wisconsin La Crosse) and Shouvik Dutta (former University of Illinois graduate student). In short, the technique identifies potential upsets using only a small number of publicly available statistics by identifying match-ups in the current year that exhibit characteristics similar to those exhibited by historical round-of-64 upsets.

Using decision trees, machine learning, and causal inference, Jacobson and his collaborators analyzed 115 publicly available statistics to detect the 15 most important for identifying upsets in the first-round matchups between the teams seeded 2 and 15, 3 and 14, and 4 and 13. Among the most influential of the 15 were the effective possession ratio—the number of possessions and offensive rebounds minus the number of turnovers all divided by the number of possessions—the number of games played in the regular season and a measure of scoring chances per game.

The differences in those 15 statistics between the two teams in each historical upset are then used to build a profile of past upsets. Finally, the upset profiles can be compared to round-of-64 games in the current year to find match-ups that are most like historical upsets.

Jacobson and co-authors applied their approach to the NCAA tournament in each of the 13 years from 2003 to 2015. Of the 26 selected games, 10 (38.4%) were actual upsets, which is more than twice as many as the expected number of correct selections when using a weighted random selection method.
Identifying causal factors in the NCAA tournament is challenging for many reasons, one being that randomized controlled trials—an established method ideally suited for identifying causality—is not an option. "By approaching the problem as a causal inference problem using observational data," said Jacobson, "we were able to improve on forecasting upsets over pure random chance. "

Dubbed balance optimization subset selection (or BOSS), the framework can be applied to a broad array of data in the social sciences and medicine. The initial research for the BOSS idea was supported in part by the National Science Foundation. "The covariate balance approach taken by the authors is novel in the context of a sports application," said Mark Glickman (Harvard University), former editor-in-chief of JQAS who handled this manuscript. "It is refreshing to see causal inference play a prominent role in assessing factors that impact game upsets."

Jacobson's projected upsets for this year's tournament will be posted after Selection Sunday at http://bracketodds.cs.illinois.edu, a STEM learning laboratory focused on the statistics of March Madness.

"March Madness is a superb opportunity for all people, young and old, to enjoy a national sporting event while gaining an appreciation for how statistics and data science shed light on the tournament. Simply put, our research program on data analysis helps makes sense of the madness," said Jacobson.

Jacobson is a judge in the second annual Statsketball contest, hosted by This Is Statistics (http://thisisstatistics.org), the ASA's campaign to make students, teachers and parents aware of the many careers empowered by statistical thinking.

More information: Shouvik Dutta et al. Identifying NCAA tournament upsets using Balance Optimization Subset Selection, Journal of Quantitative Analysis in Sports (2017). DOI: 10.1515/jqas-2016-0062 , www.amstat.org/asa/files/pdfs/ … Upset-Prediction.pdf

Provided by American Statistical Association

Citation: Want to predict March Madness? New method identifies key statistics, outperforms others in accuracy (2018, March 7) retrieved 10 July 2024 from https://phys.org/news/2018-03-madness-method-key-statistics-outperforms.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Who makes the NCAA tournament? Researchers at the University of Illinois can help

47 shares

Feedback to editors

Want to predict March Madness? New method identifies key statistics, outperforms others in accuracy

A new species of extinct crocodile relative rewrites life on the Triassic coastline

New method achieves tenfold increase in quantum coherence time via destructive interference of correlated noise

Mars likely had cold and icy past, new study finds

Study: Nanoparticle vaccines enhance cross-protection against influenza viruses

New tools are needed to make water affordable, says study

Researchers demonstrate how to build 'time-traveling' quantum sensors

Lion with nine lives breaks record with longest swim in predator-infested waters

New multimode coupler design advances scalable quantum computing

High-speed electron camera uncovers new 'light-twisting' behavior in ultrathin material

Perceived warmth, competence predict callback decisions in meta-analysis of hiring experiments

Relevant PhysicsForums posts

Sharing Ratio

Understanding why πr^2 works for different area calculations

Implication vs Equivalence

P-adic numbers and the Ramanujan summation

What motivates famous mathematicians?

Innumeracy in public media today

Who makes the NCAA tournament? Researchers at the University of Illinois can help

Real March Madness is relying on seedings to determine Final 4

Expert: Bracket seedings irrelevant after Sweet Sixteen round

Odds are, seedings don't matter after Sweet 16, professor says

'Match' Madness: Picking upsets a losing strategy

NCAA tournament math: More than adding up ones, twos and threes

Merging AI and human efforts to tackle complex mathematical problems

New mathematical proof helps to solve equations with random components

Study finds cooperation can still evolve even with limited payoff memory

Study shows the power of social connections to predict hit songs

Wire-cut forensic examinations currently too unreliable for court, new study says

How can we make good decisions by observing others? A videogame and computational model have the answer

Medical Xpress

Tech Xplore

Science X

Want to predict March Madness? New method identifies key statistics, outperforms others in accuracy

A new species of extinct crocodile relative rewrites life on the Triassic coastline

New method achieves tenfold increase in quantum coherence time via destructive interference of correlated noise

Mars likely had cold and icy past, new study finds

Study: Nanoparticle vaccines enhance cross-protection against influenza viruses

New tools are needed to make water affordable, says study

Researchers demonstrate how to build 'time-traveling' quantum sensors

Lion with nine lives breaks record with longest swim in predator-infested waters

New multimode coupler design advances scalable quantum computing

High-speed electron camera uncovers new 'light-twisting' behavior in ultrathin material

Perceived warmth, competence predict callback decisions in meta-analysis of hiring experiments

Relevant PhysicsForums posts

Related Stories

Who makes the NCAA tournament? Researchers at the University of Illinois can help

Real March Madness is relying on seedings to determine Final 4

Expert: Bracket seedings irrelevant after Sweet Sixteen round

Odds are, seedings don't matter after Sweet 16, professor says

'Match' Madness: Picking upsets a losing strategy

NCAA tournament math: More than adding up ones, twos and threes

Recommended for you

Merging AI and human efforts to tackle complex mathematical problems

New mathematical proof helps to solve equations with random components

Study finds cooperation can still evolve even with limited payoff memory

Study shows the power of social connections to predict hit songs

Wire-cut forensic examinations currently too unreliable for court, new study says

How can we make good decisions by observing others? A videogame and computational model have the answer

Newsletter sign up

Donate and enjoy an ad-free experience