# Doing the math 'predicts' which movies will be box office hits

##### Aug 22, 2013

(Phys.org) —Researchers have devised a mathematical model which can be used to predict whether films will become blockbusters or flops at the box office – up to a month before the movie is released.

Their model is based on an analysis of the activity on Wikipedia pages about American films released in 2009 and 2010. They examined 312 , taking into account the number of page views for the movie's article, the number of human editors contributing to the article, the number of edits made and the diversity of online users.

The researchers from Oxford University, the Central European University at Budapest, and Budapest University of Technology and Economics have published their findings in the journal PLOS ONE.

The model was applied retrospectively so the researchers systematically charted the online buzz on Wikipedia around particular films and compared this with the box office takings from the first weekend after release. The results of the comparison between the predicted opening weekend revenue, using their mathematical model, and the actual figures (published in Internet Movie Database [IMDb]) showed a high degree of .

Their allowed them to predict box office revenues with an overall accuracy of around 77%. The study authors say this level of accuracy is higher than the best existing applied by marketing firms (which they estimate to be at around 57%). They could predict the box office takings of six out of 312 films with 99% accuracy where the predicted value was within 1% of the real value. Some 23 movies were predicted with 90% accuracy and 70 movies with an accuracy of 70% and above.

The more successful the film, the more accurately the researchers were able to predict box office takings. In the study, they explain that this is possibly due to the increased amount of online data generated by films that turn out to be successes. The model correctly forecast the commercial success of Iron Man 2, Alice in Wonderland, Toy Story 3 and Inception, but failed to accurately forecast the financial return on the less successful movies Never Let Me Go and Animal Kingdom.

Dr Taha Yasseri, from the Oxford Internet Institute at the University of Oxford, said: 'These results can be of great value to marketing firms but more importantly for us, we were able to demonstrate how we can use socially generated online data to predict a lot about future human behaviour. The predicting power of the Wikipedia-based model, despite its simplicity compared with Twitter, is that many of the editors of the Wikipedia pages about the movies are committed movie-goers who gather and edit relevant material well before the release date. By contrast, the "mass" production of tweets occurs very close to the release time, and often these can be spun by marketing agencies rather than reflecting the feelings of the public.'

Co-author Professor János Kertész, from the Central European University of Budapest, Hungary, said: 'We have demonstrated for the first time that Wikipedia edit statistics provide us with another tool to predict social events. We studied the problem of predicting the financial success of movies and concluded that, in some aspects, forecasting based on Wikipedia outperforms tweets as Wikipedia activity has a longer timescale which enables earlier predictions.'

The study suggests that the efficiency of the predictions might be improved by applying more sophisticated statistical methods, such as including the controversy measure of an article. The has not been applied yet to films that are not on release.

## Related Stories

#### Mathematical model predicts success of movies at the box office

Jun 14, 2012

A group of Japanese scientists have surprised themselves by being able to predict the success or failure of blockbuster movies at the box office using a set of mathematical models.

#### Wikipedia 'edit wars' show dynamics of conflict emergence and resolution

Jun 20, 2012

Wikipedia's crowd-sourced content generation has made it the world's largest encyclopedia, but this model also leads to "edit wars" when editors disagree. The dynamics of these conflicts provide an interesting window into ...

#### Research looks at how a box office success can translate internationally

May 23, 2013

Hollywood will have the box office heating up this summer with dozens of blockbuster films. But whether a movie is a worldwide box office bomb or a box office bonanza has a lot to do with the culture and release strategy ...

#### Researchers say Twitter algorithm can predict movie profits

Apr 06, 2010

Want to know how "Clash of the Titans" will fare at the box office this weekend? Check Twitter.

#### Netflix strikes movie deal with Weinstein Co.

Aug 20, 2013

(AP)—Netflix says it's reached a multi-year agreement with The Weinstein Co. that will give it the exclusive streaming rights to the company's first-run films starting in 2016.

#### English Wikipedia hosts three millionth article

Aug 17, 2009

The English version of user-generated online encylopedia Wikipedia hosted its three millionth article on Monday -- an entry about a Norwegian actress.

## Recommended for you

#### Strong teams attract crowds for international cricket

22 hours ago

The strength of the team—not the promise of a close contest—is the biggest draw to crowds in international cricket, new research has found.

#### Improving radiation therapies for cancer mathematically

Mar 05, 2014

In a paper published in December in the SIAM Journal on Scientific Computing, authors Li-Tien Cheng, Bin Dong, Chunhua Men, Xun Jia, and Steve Jiang propose a method to optimize radiation therapy treatments in cancer patien ...

#### Computational study finds maximum packing density of 55,000 different shapes

Mar 05, 2014

A team of researchers at the University of Michigan has used computational and analytical analysis to find the maximum packing density of 55,000 uniquely shaped particles. In their paper published in the ...

#### Secret to the perfect pancake is discovered

Mar 04, 2014

In a collaboration with Meadowhall Shopping Centre, students from the University's Maths Society (SUMS) developed, trialled and tested a formula which enables pancake-lovers across the world to rustle-up ...

#### New data shows baseball managers when to replace the starting pitcher

Feb 28, 2014

Last October, the Detroit Tigers won the first game of the American League Championship Series against the Boston Red Sox; the Tigers led the second game, 5-1, going into the eighth inning in Boston's Fenway ...

##### antialias_physorg
5 / 5 (1) Aug 22, 2013
The study authors say this level of accuracy is higher than the best existing predictive models applied by marketing firms (which they estimate to be at around 57%)

57% ? That's slightly better than a wild guess. These marketing firms aren't worth their money.
##### Modernmystic
1 / 5 (2) Aug 22, 2013
The choice of hits depends on many socio-psychological factors, for example at the time of economical crisis the frustrated people are getting more satisfied with simpler enjoyment providing movies and vice-versa. You cannot get these connections with analysis of movie content only.

Indeed. You can do your best but ultimately all models and algorithms are at the "mercy" of reality. To use a somewhat simplistic example; if you predict a movie is going to gross about a billion dollars and a huge solar flare knocks out power in North America and Europe on it's opening night and lasts for two weeks your model is going to be off by about 990 million dollars give or take....
##### technodiss
4.7 / 5 (3) Aug 22, 2013
if only they'd put this kind of effort into making GOOD movies, instead of these gazillion dollar "blockbusters" where all the money goes to big names, over the top cgi, nauseating 3d, and not to writing, directing, or decent camera work.
would anyone even recognize a good, well told story?
##### antialias_physorg
not rated yet Aug 22, 2013
would anyone even recognize a good, well told story?

That's probably why the money doesn't go to directing, writing and camera work anymore. They don't matter.

Case in point: Comic book adaptations are heralded as cinematic milestones.
Now I don't know about you, but Batman or the Avengers or even Watchmen (enjoyable as they are on their level) aren't Shakespeare. They aren't even George Lucas. They're intellectual tripe with big explosions and fancy costumes with sometimes pretentions to basic literacy and infantile philosophy.
There's a place for that in cinema, for sure. Heck, we all use movies sometimes to unwind - and you don't need Tolstoy type writing to do that.
But overhyping these movies into 'genius writing' and 'deep insights into the human psyche' is just ludicrous.
##### pauljpease
5 / 5 (1) Aug 22, 2013
"we were able to demonstrate how we can use socially generated online data to predict a lot about future human behaviour."

Reminds me of Asimov's "psychohistory". Apparently it doesn't require trillions of humans to be at least somewhat accurate...
##### thingsmith
not rated yet Aug 22, 2013
Ok, the developed a model based on 2009 and 2010. How well does the model predict 2011, 2012 and 2013?
##### alfie_null
not rated yet Aug 23, 2013
The single criterion for being a hit is making lots of money. Striving for that, producers focus on, spend lots of money on, do all sorts of things, that don't contribute to the quality of their movies.
##### teledyn
1 / 5 (1) Aug 26, 2013
The study authors say this level of accuracy is higher than the best existing predictive models applied by marketing firms (which they estimate to be at around 57%)

57% ? That's slightly better than a wild guess. These marketing firms aren't worth their money.

and you're just figuring this out now?

but just you wait a while, it will get worse: as soon as this report hits their radar, they'll start swarming the Wikipedia and stacking all the other buzz-routes ten times as hard, their eyes glassed over with the mistaken belief that this hit-maker relationship is bilateral. there goes the neighbourhood.
##### patnclaire
not rated yet Aug 26, 2013
I can predict box office hits more than 90% of the time.
One useful predictor is to choose the opposite of what "hi-brow" critics tout.
Another useful predictor is lack of competition at release time.
Still another, and probably the best predictor is a well written script. Gizmos and gadgets are neet but without a good plot and story line the movie is a Titanic waiting to happen.
Movies need the backing of a good Producer. They need the artistic craftsmanship of a good director who is trying to visually tell the story that the writer(s) intend(s). Acting is the necessary part which holds the other three together....the glue? Lastly, the movie needs a good support system such as all those unseen people who do the necessary things that make a film production go. Any one of these pieces, in and of themselves, are crucial but there is a synergy to the box office hit or lack of it in the box office dud.

## More news stories

#### Statue of Egypt pharoanic princess found in Luxor

(AP)—Egypt has announced that a team of European archaeologists have found a nearly 2-meter- (6 ½-foot-) tall alabaster statue of a pharoanic princess, dating from approximately 1350 B.C., outside the southern city of ...

#### Lose yourself to dance, know yourself better

Could managers gain a new kind of understanding about their interaction with colleagues and employees by 'dancing'? That's the question arising from new research published this month in the International Journal of Work Or ...

A second viewing in a police line-up may help more eyewitnesses identify the culprit, new research from Flinders University reveals.

#### Expiration of terrorism risk insurance act could hurt national security, study finds

Allowing the federal terrorism risk insurance act to expire could have negative consequences for U.S. national security, according to a new study from the RAND Corporation.

#### Women's widespread inequality and rape as a weapon of war

Women are more likely to experience mass rape and sexual torture in armed conflicts around the world, as a deliberate strategy to humiliate, intimidate and dominate them and their 'enemy' community.

#### Researchers suggest earthquake lightning may be due to cracks forming in Earth's surface

(Phys.org) —A team of four researchers from several universities in the U.S. has given a presentation at this year's American Physical Society meeting, outlining a theory they are developing to help explain ...

#### Bitcoin: the digital currency that became a target for speculators

Whether you see Bitcoin as the future of finance or a reckless gamble, the digital currency has become headline news over the past year, even as its origins remain shrouded in mystery.

#### Pre-term birth and asthma: Preterm birth may increase the risk of asthma and wheezing disorders during childhood

Researchers at Brigham and Women's Hospital (BWH) in Boston, Massachusetts, in collaboration with investigators at the Maastricht University Medical Centre and Maastricht University School of Public Health in the Netherlands ...

#### Software analyzes apps for malicious behavior

Last year at the end of July the Russian software company "Doctor Web" detected several malicious apps in the app store "Google Play". Downloaded on a smartphone, the malware installed—without the permission ...

#### Pre-op pain patterns affect stenosis surgery outcomes

(HealthDay)—For patients with spinal stenosis without degenerative spondylolisthesis, predominance of back pain (BP) versus leg pain (LP) is associated with worse surgical outcomes, according to a study ...