June 2, 2014

New analysis contradicts findings published in Science

New research published in the June 2014 issue of Language presents evidence that statistical methods used in articles in prestigious international science journals to link undeciphered symbol systems to written language do not hold up under a more rigorous linguistic analysis. The Language article, "A statistical comparison of written language and non-linguistic symbol systems," was written by Richard Sproat, a research scientist at Google, and is based on work he previously did at the Oregon Health & Science University.

Sproat's analysis responds to a number of papers in high-profile science publications arguing that statistical analyses of symbol combinations can provide insights into the origins of written language. One paper, by Rajesh Rao (University of Washington), Iravatham Mahadevan (Indus Research Centre) and colleagues at the Tata Institute of Fundamental Research in Mumbai, India, appeared in 2009 in the journal Science. It argued that a particular statistical measure, bigram conditional entropy, showed that the Indus Valley symbols behave more like those in linguistic texts than those of non-linguistic systems. In another paper, in the Proceedings of the Royal Society, Rob Lee and colleagues (University of Exeter) claimed that a more sophisticated set of entropic measures put Pictish symbols in the same category as linguistic texts. Both papers (and subsequent papers by Rao and his colleagues) received a large amount of attention from the news media. In these popular accounts, the techniques were often presented as demonstrating that the symbol systems in question were written language, though this was not necessarily the intention of the authors.
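For readers unfamiliar with the measure, bigram conditional entropy asks how uncertain the next symbol is once the current symbol is known. The Python sketch below is a plain, unsmoothed estimate computed from adjacent-pair counts. It is an illustration only: the function name and the toy sequences are hypothetical, and the published studies may have used different smoothing or normalization.

```python
from collections import Counter
from math import log2


def bigram_conditional_entropy(texts):
    """Unsmoothed estimate of H(next symbol | current symbol), in bits.

    `texts` is an iterable of symbol sequences (strings or lists).
    Illustrative sketch only, not the estimator of any cited paper.
    """
    pair_counts = Counter()   # counts of adjacent pairs (a, b)
    first_counts = Counter()  # counts of a as the first member of a pair
    for text in texts:
        for a, b in zip(text, text[1:]):
            pair_counts[(a, b)] += 1
            first_counts[a] += 1

    total = sum(pair_counts.values())
    if total == 0:
        raise ValueError("need at least one adjacent pair of symbols")

    entropy = 0.0
    for (a, b), n in pair_counts.items():
        p_pair = n / total            # p(a, b)
        p_next = n / first_counts[a]  # p(b | a)
        entropy -= p_pair * log2(p_next)
    return entropy


# Toy data: a strictly alternating sequence and a less predictable one.
rigid = ["ABABABAB"]
mixed = ["AABBABBA"]
print(bigram_conditional_entropy(rigid))
print(bigram_conditional_entropy(mixed))
```

The strictly alternating sequence scores zero bits, since each symbol fully determines the next, while the second sequence scores higher. A rigid ordering yields low values and a near-random one yields high values; the earlier papers placed linguistic texts between those extremes.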

Understanding what statistical techniques for analyzing symbol systems do and do not show is of fundamental importance to language science, as there are many old or ancient symbol systems whose function is largely or completely unknown. Examples include the Easter Island rongorongo inscriptions (19th century), the Pictish symbols of Scotland (6th century onwards), and the Indus Valley symbols (Northern India, Pakistan, 3rd millennium BCE). As part of his work on the question of whether symbol systems such as these exemplify written language, Sproat developed large, structured collections of text, or corpora, from a variety of non-linguistic systems, both ancient and modern, including Mesopotamian deity symbols (Babylonia), totem poles (Pacific Northwest), Pennsylvania barn stars ("hex signs"), weather forecast icon sequences from http://www.wunderground.com, and Unicode characters for Asian emoticons. He compared these to corpora developed from fourteen languages representing a variety of writing-system types, both ancient and modern.

Judged by the measures proposed in the previous literature, all of the non-linguistic symbol systems in Sproat's corpora behaved the same as the linguistic systems. However, he also found that two other measures could accurately distinguish the linguistic from the non-linguistic systems: a novel measure of the amount of local repetition, and a version of one of Lee and colleagues' entropic measures with a different setting than they used. Moreover, his statistical procedure, unlike the earlier ones, classifies both the Pictish and Indus Valley symbols as non-linguistic.
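The article does not spell out how local repetition is measured. As a rough illustration of the idea, the Python sketch below computes one plausible version: of all the symbols in a text that repeat an earlier symbol, the fraction that repeat the symbol immediately before them. The function name and this definition are assumptions made here for illustration and should not be read as Sproat's exact formula.

```python
def local_repetition_rate(texts):
    """Fraction of repeated symbols that repeat the immediately preceding symbol.

    Hypothetical illustration of a "local repetition" statistic; the
    measure in the Language article may be defined differently.
    Returns 0.0 when no symbol is ever repeated within a text.
    """
    adjacent = 0  # repeats of the immediately preceding symbol
    repeated = 0  # repeats of any earlier symbol in the same text
    for text in texts:
        seen = set()
        previous = None
        for symbol in text:
            if symbol in seen:
                repeated += 1
                if symbol == previous:
                    adjacent += 1
            seen.add(symbol)
            previous = symbol
    return adjacent / repeated if repeated else 0.0


# Toy data: repeats that sit side by side versus repeats spread apart.
print(local_repetition_rate(["AAB", "CCD"]))
print(local_repetition_rate(["ABA", "CDC"]))
```

Under this definition, a system whose repeated symbols tend to sit next to each other scores near one, and a system whose repeats are spread through the text scores near zero.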

Despite these promising results, Sproat cautions against relying too heavily on statistical measures to analyze ancient symbol systems that have not been deciphered. All statistical measures are heavily influenced by, among other things, the size of the corpus, the length of texts, and what kind of text is involved. Shopping lists, for example, have statistical properties that distinguish them from running prose from a novel. He argues that a truly reliable demonstration that a collection of symbols exemplifies written language requires supporting empirical evidence, such as a credible decipherment or independent archeological evidence of a related culture of active literacy. What is clear, however, is that the previously proposed statistical methods simply do not work for the intended purpose.

More information: A pre-print version of the article is available for review at: http://www.linguisticsociety.org/document/language-vol-90-issue-2-june-2014-sproat.

Journal information: Language, Science, Proceedings of the Royal Society

Provided by Linguistic Society of America

Citation: New analysis contradicts findings published in Science (2014, June 2) retrieved 17 April 2024 from https://phys.org/news/2014-06-analysis-contradicts-published-science.html

