Researchers say Twitter analysis can help gamblers beat the spread on NFL games

Sep 12, 2013

Analyses of Twitter feeds have been used to track flu epidemics, predict stock market changes and do political polling, but now that the National Football League season is underway, the natural question is: Can Twitter help beat the spread on NFL games?

The answer, say at Carnegie Mellon University, is yes. Or, at least it can help a little bit at certain times during the season. They will report their findings Sept. 27 at the Machine Learning and Data Mining for Sports Analytics conference in Prague, Czech Republic.

The study began as a class project by then-student Kevin Gimpel, now a research assistant professor at Toyota Technological University at Chicago. It ultimately encompassed three NFL seasons, 2010-2012. The researchers used automated tools to sort through a stream of that averaged 42 million messages a day in 2012.

Out of those, the researchers plucked out messages with hashtags associated with individual NFL teams—#giants, #newyorkgiants, #nygiants, #steelers, #steelersnation, etc.—that were sent at least 12 hours after the start of the team's previous game and one hour before the start of its upcoming game. In 2012, more than a million such messages were identified.

The idea, explained Christopher Dyer, assistant professor in CMU's Language Technologies Institute, was to see what might be divined from the collective wisdom or sentiments of fans, as reflected by their tweets. Could simple measures, such as the volume of tweets, or the distribution of positive and negative words in tweets, provide insight into which teams would win, which teams would beat the point , or provide guidance on betting on the over/under line (the total number of points scored by both teams in a game)?

"It's an experiment every week," Dyer said.

What they found was that their analysis of tweets didn't help much when it came to predicting winners or the over/under score. But when it came to winning with the spread, the researchers said their method was 55 percent accurate. That doesn't offer a huge advantage, Dyer acknowledged, but it might be enough to be profitable.

When setting the spread, of course, sports books don't look only at a team's performance or factors such as home field advantage and weather predictions. The goal is to attract an even amount of bets for both teams, which minimizes the financial risk of the bookie. A certain amount of psychology factors into the spread, Dyer noted, which suggests why gauging the sentiments of team fans might offer some betting advantages.

But the researchers also developed a deep appreciation for the performance of the sports books and for how hard it is to beat the spread. "One thing that surprised us is how hard setting the point spread is to do well," Dyer said. "And the sports books are very, very good."

There were limits to what Twitter could reveal. Dyer said the Twitter analysis didn't work well for the first four weeks of the season. And, in the last few weeks of the season, when many teams begin altering their strategies in preparation for the post-season, the Twitter analysis wasn't useful either.

Though the advantage of Twitter analysis was modest, Dyer said improvement might be possible with a more sophisticated analysis of tweet content. Also, a common difficulty in Twitter analysis is the rapidly changing nature of the social network. "It's a moving target," Dyer explained. "More people are on Twitter this year than the year before and the year before that." The number of tweets analyzed by the researchers for the 2012 season was five times greater than the number of tweets in the 2010 season.

The sports books themselves might benefit from using Twitter analysis, if they aren't already using it, Dyer said. As for the researchers, who included Noah Smith, associate professor of language technologies and machine learning, and Shiladitya Sinha, a student majoring in mathematical sciences, the interest is purely in understanding how Twitter data can be analyzed and used.

"As far as I know," Dyer said, "none of us has actually placed a bet based on our findings."

Explore further: All public Twitter posts indexed by analytics firm

Related Stories

Twitter sees new record during Euro 2012 final

Jul 02, 2012

(AP) — Twitter says its users fired off more than 15,000 tweets per second when Spain scored its fourth goal in Sunday's European Championship final, setting a sports-related record on the social networking ...

Twitter targets ads based on interests

Aug 30, 2012

Twitter on Thursday began letting advertisers target users with "promoted tweets" based on interests expressed at the globally popular one-to-many texting service.

Recommended for you

User comments : 0

More news stories

Under some LED bulbs whites aren't 'whiter than white'

For years, companies have been adding whiteners to laundry detergent, paints, plastics, paper and fabrics to make whites look "whiter than white," but now, with a switch away from incandescent and fluorescent lighting, different ...

Continents may be a key feature of Super-Earths

Huge Earth-like planets that have both continents and oceans may be better at harboring extraterrestrial life than those that are water-only worlds. A new study gives hope for the possibility that many super-Earth ...

Researchers successfully clone adult human stem cells

(Phys.org) —An international team of researchers, led by Robert Lanza, of Advanced Cell Technology, has announced that they have performed the first successful cloning of adult human skin cells into stem ...