# Mathematically ranking ranking methods

##### May 24, 2011

In a world where everything from placement in a Google search result to World Cup eligibility depends on ranking and numerical ratings of some kind, it is becoming increasingly important to analyze the algorithms and techniques that underlie such ranking methods in order to ensure fairness, eliminate bias, and tailor them to specific applications.

In a paper published this month in a SIAM journal, authors Timothy Chartier, Erich Kreutzer, Amy Langville, and Kathryn Pedings mathematically analyze three commonly used ranking methods. "We studied the sensitivity and stability of three popular ranking methods: PageRank, which is the method Google has used to rank web pages, and the Colley and Massey methods, which have been used by the Bowl Championship Series to rank U.S. college football teams," explains Langville.

All three methods analyzed (the Colley and Massey ranking techniques and the Markov method, a generalized version of the PageRank web page ranking) are based on linear algebra and have simple, elegant formulations. Here, the authors apply a modified version of PageRank to a sports season.

"Both web page authors and teams sometimes try to game, or spam, ranking systems to achieve a higher ranking. For instance, web page authors try to modify their incoming and outgoing links while teams try to run up the score against weak opponents," says Langville, pointing out the significance of studying such methods. "Mathematically, such spamming can be viewed as changes to the input data required by the ranking method."

Most methods, including the aforementioned three, produce "ratings": numerical scores for each team that represent its playing ability. When sorted, these ratings produce ranks, integer values that simply list the teams in order of their ratings.
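The distinction between ratings and ranks can be illustrated with a few lines of Python. This is only a sketch: the team names and rating values below are made up for illustration and do not come from the paper.

```python
# Hypothetical ratings for four teams (illustrative values only).
ratings = {"A": 0.58, "B": 0.75, "C": 0.25, "D": 0.42}

# Sort team names from highest to lowest rating, then number them 1, 2, 3, ...
ordered = sorted(ratings, key=ratings.get, reverse=True)
ranks = {team: position for position, team in enumerate(ordered, start=1)}

print(ranks)  # {'B': 1, 'A': 2, 'D': 3, 'C': 4}
```

The ranks keep only the order of the teams; the size of the gaps between ratings is lost.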

In the first step of their analysis, the authors assume a simple rating scheme with a constant difference of 1 between scores and apply it to a perfect sports season. In a perfect season, each team plays every other team exactly once and there are no upsets. In such an ideal scenario, a higher-ranked team always beats a lower-ranked team. Thus, in a system with teams numbered 1 through 4 by rank, team 1 would beat all other teams; team 2 would beat teams 3 and 4, and lose to team 1; team 3 would beat team 4, losing to teams 1 and 2; and team 4 would lose to all other teams. The authors then compute the output ratings of each of the three methods and compare them to the input ratings.
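A perfect season of this kind is easy to generate. The sketch below builds the four-team schedule described above; the function name `perfect_season` is hypothetical, and the winning margin (set equal to the gap between the two teams' ranks) is an assumption made for illustration rather than a detail taken from the paper.

```python
def perfect_season(n):
    """Return games as (winner, loser, margin); team 1 is the strongest.

    Assumption: the winning margin equals the gap between the two teams'
    input ratings, which are spaced 1 apart.
    """
    return [(i, j, j - i) for i in range(1, n + 1) for j in range(i + 1, n + 1)]


games = perfect_season(4)

# Win-loss record of each team: no upsets, so team t wins 4 - t games.
wins = {t: sum(1 for w, _, _ in games if w == t) for t in range(1, 5)}
losses = {t: sum(1 for _, l, _ in games if l == t) for t in range(1, 5)}

for t in range(1, 5):
    print(f"team {t}: {wins[t]}-{losses[t]}")
# team 1: 3-0
# team 2: 2-1
# team 3: 1-2
# team 4: 0-3
```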

When applied to this ideal data, all three methods recover the input ranking. However, while the Colley and Massey methods produce uniformly spaced ratings, as would be desirable in a rating system, the Markov method produces non-uniformly spaced ratings.
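The difference in spacing can be reproduced on a four-team perfect season. The code below is a minimal sketch under stated assumptions, not the authors' implementation: it uses the standard textbook forms of the Colley and Massey linear systems, and a simple Markov variant in which each loser casts one vote per loss for the winner and an undefeated team spreads its vote uniformly. The article does not say which voting scheme or damping the paper uses, and all function names are hypothetical. Teams are indexed from 0, so index 0 is the article's team 1.

```python
from fractions import Fraction


def solve(A, b):
    """Solve A x = b exactly by Gauss-Jordan elimination over Fractions."""
    n = len(A)
    M = [[Fraction(v) for v in row] + [Fraction(rhs)] for row, rhs in zip(A, b)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[pivot] = M[pivot], M[col]
        pv = M[col][col]
        M[col] = [v / pv for v in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0:
                f = M[r][col]
                M[r] = [a - f * c for a, c in zip(M[r], M[col])]
    return [M[i][n] for i in range(n)]


def perfect_season(n):
    """Games as (winner, loser, margin). Assumption: margin = rank gap."""
    return [(i, j, j - i) for i in range(n) for j in range(i + 1, n)]


def colley_ratings(n, games):
    """Colley system: C[i][i] = 2 + games played, b[i] = 1 + (wins - losses)/2."""
    C = [[2 if i == j else 0 for j in range(n)] for i in range(n)]
    b = [Fraction(1)] * n
    for w, l, _ in games:
        C[w][w] += 1
        C[l][l] += 1
        C[w][l] -= 1
        C[l][w] -= 1
        b[w] += Fraction(1, 2)
        b[l] -= Fraction(1, 2)
    return solve(C, b)


def massey_ratings(n, games):
    """Massey system: M[i][i] = games played, p[i] = total point differential."""
    M = [[0] * n for _ in range(n)]
    p = [0] * n
    for w, l, margin in games:
        M[w][w] += 1
        M[l][l] += 1
        M[w][l] -= 1
        M[l][w] -= 1
        p[w] += margin
        p[l] -= margin
    # M is singular, so the last equation is replaced by sum(ratings) = 0.
    M[-1] = [1] * n
    p[-1] = 0
    return solve(M, p)


def markov_ratings(n, games):
    """Stationary vector of a chain in which losers vote for winners."""
    votes = [[Fraction(0)] * n for _ in range(n)]
    for w, l, _ in games:
        votes[l][w] += 1
    S = []
    for row in votes:
        total = sum(row)
        # An undefeated team casts no votes; give it a uniform row instead.
        S.append([v / total for v in row] if total else [Fraction(1, n)] * n)
    # Solve pi S = pi with the last equation replaced by sum(pi) = 1.
    A = [[S[j][i] - (1 if i == j else 0) for j in range(n)] for i in range(n)]
    A[-1] = [1] * n
    return solve(A, [0] * (n - 1) + [1])


def gaps(r):
    """Differences between consecutive ratings, as strings."""
    return [str(r[i] - r[i + 1]) for i in range(len(r) - 1)]


n = 4
games = perfect_season(n)
results = {
    "Colley": colley_ratings(n, games),
    "Massey": massey_ratings(n, games),
    "Markov": markov_ratings(n, games),
}
for name, r in results.items():
    print(name, [str(x) for x in r], "gaps:", gaps(r))
# Colley ['3/4', '7/12', '5/12', '1/4'] gaps: ['1/6', '1/6', '1/6']
# Massey ['3/2', '1/2', '-1/2', '-3/2'] gaps: ['1', '1', '1']
# Markov ['12/25', '6/25', '4/25', '3/25'] gaps: ['6/25', '2/25', '1/25']
```

In this small example all three methods order the teams correctly, but only the Colley and Massey ratings are evenly spaced; the Markov ratings bunch together toward the bottom of the ranking.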

The authors analyze the sensitivity of the methods to small perturbations and determine how much the ratings and rankings are affected by these changes. If, for instance, small changes in the input data cause large changes in the output ratings, the method is considered sensitive. Likewise, if small changes in the input cause large changes in the output ranking, the method is considered unstable.

The authors conclude that while the Colley and Massey methods are insensitive to small changes, the Markov method (or PageRank method) is highly sensitive to such changes, often resulting in anomalies in rankings. For instance, there are cases in which a single upset in a perfect season rearranges the rankings of all teams because of the Markov method's high sensitivity. In these cases, the Colley and Massey methods would have an isolated response, resulting in changes to the rankings of only the two teams in question.
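The contrast between an isolated and a widespread response can be seen even in a four-team season. The sketch below flips a single result (the last-place team beats the top team) and lists which teams' ratings change. It is an illustration under stated assumptions, not the paper's perturbation analysis: it uses the standard Colley system and a simple Markov variant in which losers cast one vote per loss for the winner, with a uniform row for an undefeated team. Function names are hypothetical, and index 0 is the article's team 1.

```python
from fractions import Fraction


def solve(A, b):
    """Solve A x = b exactly by Gauss-Jordan elimination over Fractions."""
    n = len(A)
    M = [[Fraction(v) for v in row] + [Fraction(rhs)] for row, rhs in zip(A, b)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[pivot] = M[pivot], M[col]
        pv = M[col][col]
        M[col] = [v / pv for v in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0:
                f = M[r][col]
                M[r] = [a - f * c for a, c in zip(M[r], M[col])]
    return [M[i][n] for i in range(n)]


def perfect_season(n):
    """Games as (winner, loser, margin). Assumption: margin = rank gap."""
    return [(i, j, j - i) for i in range(n) for j in range(i + 1, n)]


def colley_ratings(n, games):
    """Colley system: C[i][i] = 2 + games played, b[i] = 1 + (wins - losses)/2."""
    C = [[2 if i == j else 0 for j in range(n)] for i in range(n)]
    b = [Fraction(1)] * n
    for w, l, _ in games:
        C[w][w] += 1
        C[l][l] += 1
        C[w][l] -= 1
        C[l][w] -= 1
        b[w] += Fraction(1, 2)
        b[l] -= Fraction(1, 2)
    return solve(C, b)


def markov_ratings(n, games):
    """Stationary vector of a chain in which losers vote for winners."""
    votes = [[Fraction(0)] * n for _ in range(n)]
    for w, l, _ in games:
        votes[l][w] += 1
    S = []
    for row in votes:
        total = sum(row)
        # An undefeated team casts no votes; give it a uniform row instead.
        S.append([v / total for v in row] if total else [Fraction(1, n)] * n)
    # Solve pi S = pi with the last equation replaced by sum(pi) = 1.
    A = [[S[j][i] - (1 if i == j else 0) for j in range(n)] for i in range(n)]
    A[-1] = [1] * n
    return solve(A, [0] * (n - 1) + [1])


n = 4
perfect = perfect_season(n)
# One upset: the last-place team (index 3) beats the top team (index 0) by 1.
upset = [(3, 0, 1) if (w, l) == (0, 3) else (w, l, m) for w, l, m in perfect]

changed = {}
for name, method in [("Colley", colley_ratings), ("Markov", markov_ratings)]:
    before, after = method(n, perfect), method(n, upset)
    changed[name] = [i for i in range(n) if before[i] != after[i]]
    print(name, "ratings changed at team indices", changed[name])
# Colley ratings changed at team indices [0, 3]
# Markov ratings changed at team indices [0, 1, 2, 3]
```

In this toy season the Colley ratings move only for the two teams involved in the upset, while every Markov rating shifts. A whole-game upset among four teams is a much cruder change than the small perturbations studied in the paper, so the example shows the flavor of the result rather than its magnitude.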

In addition, the sensitivity of the PageRank or Markov method becomes more pronounced further down in the rankings. "The PageRank vector is quite sensitive to small changes in the input data. Further, this sensitivity increases as the rank position increases," Langville explains. "In other words, values in the tail (low-ranked positions) of the PageRank vector are extremely sensitive, which calls into question PageRank's use to produce a full ranking, as opposed to simply a top-k ranking. It also partially explains PageRank's susceptibility to spam. On the other hand, the Colley and Massey methods are stable throughout the entire ranking."

PageRank has recently expanded beyond web pages and is now used to rank entities ranging from species to social networks, underscoring the ubiquity of such ranking systems.

But the stability displayed by the Colley and Massey methods in this study suggests that these two methods, though originally conceived for sports rankings, might also be effective in ranking other entities, such as movies.

"As future work, we are exploring the use of the Colley and Massey methods in other settings beyond sports. For example, we have found that these two methods are more appropriate than PageRank for ranking in social networks such as Twitter," says Langville.

While ranking methods can be applied to a wide range of areas, a method often has to be modified to suit a specific application, which makes analyses of sensitivity and stability all the more important.
