Numbers follow a surprising law of digits, and scientists can't explain why

May 10, 2007 By Lisa Zyga

Does your house address start with a 1? According to a strange mathematical law, about 1/3 of house numbers have 1 as their first digit. The same holds true for many other areas that have almost nothing in common: the Dow Jones index history, size of files stored on a PC, the length of the world’s rivers, the numbers in newspapers’ front page headlines, and many more.

The law is called Benford’s law after its (second) founder, Frank Benford, who discovered it in 1935 as a physicist at General Electric. The law tells how often each number (from 1 to 9) appears as the first significant digit in a very diverse range of data sets.

Besides the number 1 consistently appearing about 1/3 of the time, number 2 appears with a frequency of 17.6%, number 3 at 12.5%, on down to number 9 at 4.6%. In mathematical terms, this logarithmic law is written as F(d) = log[1 + (1/d)], where F is the frequency and d is the digit in question.

If this sounds kind of strange, scientists Jesús Torres, Sonsoles Fernández, Antonio Gamero, and Antonio Sola from the Universidad de Cordoba also call the feature surprising. The scientists published a letter in the European Journal of Physics called “How do numbers begin? (The first digit law),” which gives a short historical review of the law. Their paper also includes useful applications and explains that no one has been able to provide an underlying reason for the consistent frequencies.

“The Benford law has been an intriguing question for me for years, ever since I read about it,” Torres, who specializes in plasma physics, told PhysOrg.com. “I have used it as a surprising example at statistical physics classes to arouse the curiosity of my pupils.”

Torres et al. explain that, before Benford, a highly esteemed astronomer named Simon Newcomb discovered the law in 1881, although Newcomb’s contemporaries did not pay much attention to his publication. Both Benford and Newcomb stumbled upon the law in the same way: while flipping through pages of a book of logarithmic tables, they noticed that the pages in the beginning of the book were dirtier than the pages at the end. This meant that their colleagues who shared the library preferred quantities beginning with the number one in their various disciplines.

Benford took this observation a step further than Newcomb, and began investigating other groups of numbers, finding that the “first digit law” emerged in groups as disparate as populations, death rates, physical and chemical constants, baseball statistics, the half-lives of radioactive isotopes, answers in a physics book, prime numbers, and Fibonacci numbers. In other words, just about any group of data obtained by using measurements satisfies the law.

On the other hand, data sets that are arbitrary and contain restrictions usually don’t follow Benford’s law. For example, lottery numbers, telephone numbers, gas prices, dates, and the weights or heights of a group of people are either random or arbitrarily assigned, and not obtained by measurement.

As Torres and his colleagues explain, scientists in the decades following Benford performed numerous studies, but discovered little more about the law other than racking up a wide variety of examples. However, scientists did discover a few curiosities. For one, when investigating second significant digits of data sets, the law still held, but with less importance. Similarly, for the third and fourth digits, the appearance of the numbers started becoming equal, leveling out at a uniform 10% for the fifth digit. A second discovery attracted even more scientific interest:

“In 1961, Pinkham discovered the first general relevant result, demonstrating that Benford’s law is scale invariant and is also the only law referring to digits which can have this scale invariance,” the scientists wrote in their letter. “That is to say, as the length of the rivers of the world in kilometers fulfill Benford’s law, it is certain that these same data expressed in miles, light years, microns or in any other length units will also fulfill it.”

Torres et al. also explain that in the last years of the 20th century, some important theoretical advances have been proven (base invariance, unicity, etc.), mainly by Ted Hill and other mathematicians. While some cases can be explained (for example, house addresses almost always start with 1’s, and lower numbers must occur before higher numbers), there is still no general justification for all examples. The scientists also explain that there is no a priori criteria that tells when a data set should or should not obey the law.

“Nowadays there are many theoretical results about the law, but some points remain in darkness,” said Torres. “Why do some numerical sets, like universal physical constants, follow the law so well? We need to know not only mathematical reasons for the law, but also characterize this set of experimental data. For example, what are their points of contact? Where they come from? Apparently, they are independent.

”I hope the general necessary and sufficient conditions will be discovered in the future—many people are interested in the law, especially economists—but I also know it could be not possible ever,” he added, mentioning Godel.

Nevertheless, scientists have been using the law for many practical applications. For example, because a year’s accounting data of a company should fulfill the law, economists can detect falsified data, which is very hard to manipulate to follow the law. (Interestingly, scientists found that numbers 5 and 6, rather than 1, are the most prevalent, suggesting that forgers try to “hide” data in the middle.)

Benford’s law has also been recently applied to electoral fraud in order to detect voting anomalies. Scientists found that the 2004 US presidential election showed anomalies in the state of Florida, as well as fraud in Venezuela in 2004 and Mexico in 2006.

“The story about how it was discovered—twice—from dirty pages … it is almost incredible,” said Torres. “Benford's law has undeniable applications, and this useful aspect was not clear when the law was discovered. It seemed to be only a math curiosity. For me, this is an example of how simplicity can be unexpectedly marvelous.”

For more details on Benford’s law, the highly readable letter is temporarily available at:
www.iop.org/EJ/abstract/0143-0807/28/3/N04 (with free registration).

Citation: Torres, J. Fernández, S., Gamero, A., Solar, A. “How do numbers begin? (The first digit law).” Eur. J. Phys. 28 (2007) L17-25.

All rights reserved. This material may not be published, broadcast, rewritten or redistributed in whole or part without the express written permission of PhysOrg.com.

Explore further: Could 'Jedi Putter' be the force golfers need?

Related Stories

Professor leading study on connection between student journalism, civic activity

44 minutes ago

While anecdotal evidence supports a link between journalism and civic activity, a University of Kansas professor is leading a study to determine whether that is in fact the case among high school journalists and to determine ...

Congress clashes over online domain name oversight

Apr 10, 2014

Republican opposition to Obama administration plans to spin off U.S. oversight of the Internet's domain name system is evolving into an election-year political fight, with lawmakers using it as the latest front in their attacks ...

When science and philosophy collide in a 'fine-tuned' universe

Apr 03, 2014

When renowned scientists now talk seriously about millions of multiverses, the old question "are we alone?" gets a whole new meaning. ...

Obama puts forward plan ending NSA bulk collection (Update)

Mar 27, 2014

President Barack Obama put forward a long-awaited plan Thursday to end the US government's bulk collection of telephone records, aiming to defuse a controversy over surveillance on millions of Americans.

Little-known scores rank consumers' value based on data from Web, apps, loyalty cards

Mar 26, 2014

If you've bought a house or car lately, chances are you know your credit score, or at least whether it's good or bad. But what about your customer loyalty score? Or your identity score? Or your health score?

Black markets for hackers are increasingly sophisticated, specialized and maturing

Mar 25, 2014

Black and gray markets for computer hacking tools, services and byproducts such as stolen credit card numbers continue to expand, creating an increasing threat to businesses, governments and individuals, according to a new ...

Recommended for you

Could 'Jedi Putter' be the force golfers need?

18 hours ago

Putting is arguably the most important skill in golf; in fact, it's been described as a game within a game. Now a team of Rice engineering students has devised a training putter that offers golfers audio, ...

Technologies for the optical characterization of materials at terahertz frequencies

18 hours ago

The noncontact measurement of material properties using light is used in a wide variety of applications, from airport security scanners to medical x-ray imaging and various analytical techniques. Some of ...

Better thermal-imaging lens from waste sulfur

Apr 17, 2014

Sulfur left over from refining fossil fuels can be transformed into cheap, lightweight, plastic lenses for infrared devices, including night-vision goggles, a University of Arizona-led international team ...

Information storage for the next generation of plastic computers

Apr 16, 2014

Inexpensive computers, cell phones and other systems that substitute flexible plastic for silicon chips may be one step closer to reality, thanks to research published on April 16 in the journal Nature Communications.

With neutrons, scientists can now look for dark energy in the lab

Apr 16, 2014

It does not always take a huge accelerator to do particle physics: First results from a low energy, table top alterative takes validity of Newtonian gravity down by five orders of magnitude and narrows the ...

At the origin of cell division: The features of living matter emerge from inanimate matter

Apr 16, 2014

Droplets of filamentous material enclosed in a lipid membrane: these are the models of a "simplified" cell used by the SISSA physicists Luca Giomi and Antonio DeSimone, who simulated the spontaneous emergence ...

gruff
not rated yet Aug 08, 2008
I think to say Benford's Law can't be explained by scientists is not just sensationalist, it's plain wrong. Benford's law may be unintuitive but it's been explained many times in various ways and is very well understood (but not by Torres clearly). Steve Smith's Digital Signal Processing analysis offers the most intuitive insight into it: it's an artefact produced by the operation of taking the first digit in the first place which is a logarithmic operation (in the number base used). Random numbers _may_ follow Benford's law depending on their probability distribution function.
gruff
not rated yet Aug 08, 2008
(or if you prefer, an artefact of expressing measurements in given base when they're taken across many orders magnitude in that base)
Nerdle
not rated yet May 08, 2009
I think gruffs pretty much got it there. Since we always put 1 at the start of a new base 10 (10-100-1000), there will inevitably be more numbers cantaing a 1 as the first digit.

More news stories

'Dressed' laser aimed at clouds may be key to inducing rain, lightning

The adage "Everyone complains about the weather but nobody does anything about it," may one day be obsolete if researchers at the University of Central Florida's College of Optics & Photonics and the University ...

Impurity size affects performance of emerging superconductive material

Research from North Carolina State University finds that impurities can hurt performance – or possibly provide benefits – in a key superconductive material that is expected to find use in a host of applications, ...

Researchers develop scalable methods for manufacturing metamaterials

(Phys.org) —Metamaterials, or materials that have had their matter rearranged so they interact with light in specific ways, could be key to making everything from super lenses for satellite surveillance to biosensors that ...

Technologies for the optical characterization of materials at terahertz frequencies

The noncontact measurement of material properties using light is used in a wide variety of applications, from airport security scanners to medical x-ray imaging and various analytical techniques. Some of ...

Could 'Jedi Putter' be the force golfers need?

Putting is arguably the most important skill in golf; in fact, it's been described as a game within a game. Now a team of Rice engineering students has devised a training putter that offers golfers audio, ...

New, more versatile version of Geckskin: Gecko-like adhesives now useful for real world surfaces

(Phys.org) —The ability to stick objects to a wide range of surfaces such as drywall, wood, metal and glass with a single adhesive has been the elusive goal of many research teams across the world, but ...

Astronomers discover first self-lensing binary star system

(Phys.org) —A pair of astronomers at the University of Washington has discovered the first known instance of a self-lensing binary-star system. In their paper published in the journal Science, Ethan Kruse ...

Impact glass stores biodata for millions of years

(Phys.org) —Bits of plant life encapsulated in molten glass by asteroid and comet impacts millions of years ago give geologists information about climate and life forms on the ancient Earth. Scientists ...

Going nuts? Turkey looks to pistachios to heat new eco-city

Pistachios are already a key ingredient in Turkish baklava, but the country may now have found a new way to exploit the nuts known as "green gold"—by using their shells to heat a new eco-city.

Researchers create methylation maps of Neanderthals and Denisovans, compare them to modern humans

(Phys.org) —A team of Israeli, Spanish and German researchers has for the first time created a map of gene expression in Neanderthals and Denisovans and has compared them with modern humans. In their paper ...