Computers unlock more secrets of the mysterious Indus Valley script

Aug 03, 2009
This is an Indus Valley seal. Credit: J. M. Kenoyer / harappa.com

Four-thousand years ago, an urban civilization lived and traded on what is now the border between Pakistan and India. During the past century, thousands of artifacts bearing hieroglyphics left by this prehistoric people have been discovered. Today, a team of Indian and American researchers are using mathematics and computer science to try to piece together information about the still-unknown script.

The team led by a University of Washington researcher has used computers to extract patterns in ancient Indus symbols. The study, published this week in the , shows distinct patterns in the symbols' placement in sequences and creates a for the unknown language.

"The statistical model provides insights into the underlying grammatical structure of the Indus script," said lead author Rajesh Rao, a UW associate professor of computer science. "Such a model can be valuable for decipherment, because any meaning ascribed to a symbol must make sense in the context of other symbols that precede or follow it."

Co-authors are Nisha Yadav and Mayank Vahia of the Tata Institute of Fundamental Research and Centre for Excellence in Basic Sciences in Mumbai; Hrishikesh Joglekar of Mumbai; R. Adhikari of the Institute of Mathematical Sciences in Chennai; and Iravatham Mahadevan of the Indus Research Centre in Chennai.

Despite dozens of attempts, nobody has yet deciphered the Indus script. The symbols are found on tiny seals, tablets and amulets, left by people inhabiting the Indus Valley from about 2600 to 1900 B.C. Each artifact is inscribed with a sequence that is typically five to six symbols long.

Some people have questioned whether the symbols represent a language at all, or are merely pictograms of political or religious icons.

The new study looks for mathematical patterns in the sequence of symbols. Calculations show that the order of symbols is meaningful; taking one symbol from a sequence found on an artifact and changing its position produces a new sequence that has a much lower probability of belonging to the hypothetical language. The authors said the presence of such distinct rules for sequencing symbols provides further support for the group's previous findings, reported earlier this year in the journal Science, that the unknown script might represent a language.

"These results give us confidence that there is a clear underlying logic in Indus writing," Vahia said.

Seals with sequences of Indus symbols have been found as far away as West Asia, in the region historically known as Mesopotamia and site of modern-day Iraq. The statistical results showed that the West-Asian sequences are ordered differently from sequences on artifacts found in the Indus valley. This supports earlier theories that the script may have been used by Indus traders in West Asia to represent different information compared to the Indus region.

"The finding that the Indus script may have been versatile enough to represent different subject matter in West Asia is provocative. This finding is hard to reconcile with the claim that the script merely represents religious or political symbols," Rao said.

The researchers used a Markov model, a statistical method that estimates the likelihood of a future event (such as inscribing a particular ) based on patterns seen in the past. The method was first developed by Russian mathematician Andrey Markov a century ago and is increasingly used in economics, genetics, speech-recognition and other fields.

"One of the main purposes of our paper is to introduce Markov models, and statistical models in general, as computational tools for investigating ancient scripts," Adhikari said.

One application described in the paper uses the statistical model to fill in missing symbols on damaged archaeological artifacts. Such filled-in texts can increase the pool of data available for deciphering the writings of ancient civilizations, Rao said.

Source: University of Washington (news : web)

Explore further: New frontier in error-correcting codes

add to favorites email to friend print save as pdf

Related Stories

Scientists trace how rivers change course

Dec 25, 2005

U.S. scientists have used laboratory techniques and sediment cores from the ocean to help explain the how rivers have changed course over millions of years.

Computerized treatment of manuscripts

Sep 06, 2007

Researchers at the UAB Computer Vision Centre working on the automatic recognition of manuscript documents have designed a new system that is more efficient and reliable than currently existing ones.

3,000-year-old writing found in Mexico

Sep 15, 2006

A stone slab with 3,000-year-old writing, perhaps the oldest script ever found in the Western Hemisphere, has been discovered in Mexico, reports say.

Recommended for you

New frontier in error-correcting codes

18 hours ago

Error-correcting codes are one of the glories of the information age: They're what guarantee the flawless transmission of digital information over the airwaves or through copper wire, even in the presence ...

Five ways the superintelligence revolution might happen

Sep 26, 2014

Biological brains are unlikely to be the final stage of intelligence. Machines already have superhuman strength, speed and stamina – and one day they will have superhuman intelligence. This is of course ...

User comments : 5

Adjust slider to filter visible comments by rank

Display comments: newest first

docknowledge
2.7 / 5 (3) Aug 04, 2009
It's hard to tell, as is soooooo often the case with newsbite articles, whether there's anything significant here. Linguists were using proximity to determine the structure of language ... predates computers by hundreds of years. What are they doing that's new? Maybe nothing at all.
Velanarris
3.3 / 5 (3) Aug 04, 2009
It's hard to tell, as is soooooo often the case with newsbite articles, whether there's anything significant here. Linguists were using proximity to determine the structure of language ... predates computers by hundreds of years. What are they doing that's new? Maybe nothing at all.

Yeah but the issue they face is that this language is so old as to be a possible "father language" that didn't evolve into anything currently used. It's very hard to decipher it when a) you don't know if it's actually a language, and b) you have very little knowledge or correlation to current symbols.

A computer can turn a thousand year decipher job into a few months or years.
DozerIAm
3.3 / 5 (4) Aug 04, 2009
a possible "father language" that didn't evolve into anything currently used.


If it didn't evolve into anything currently used, then it wasn't much of a father. Maybe it should be called a "uncle who never got married language" instead.
thales
1 / 5 (2) Aug 04, 2009
Yeah, this is your typical UWNGM language.
RayCherry
3 / 5 (1) Aug 05, 2009
I am glad their are so many Asian (and) Indian computer professionals in the world who have a genuine interest in their history and 'almost forgotten' ancestral culture.

If a computer can break the code of a human significance recording system, (written language), then it should find encrypted messages a breaze.

Are the Americans still using native American Indian languages for just the same (training) purpose?