New algorithm found for learning languages

Sep 06, 2005

Cornell University and Tel Aviv University researchers say they've developed a method for enabling a computer program to learn languages.

The researchers said the algorithm they've formulated allows a computer to scan text in many languages, including English and Chinese, and autonomously and without previous information infer the underlying rules of grammar.

The rules can then be used to generate new and meaningful sentences, they said. The method also works for such data as sheet music or protein sequences.

"The algorithm -- the computational method -- for language learning and processing that we have developed can take a body of text, abstract from it a collection of recurring patterns or rules and then generate new material," explained Shimon Edelman, a computer scientist and professor of psychology at Cornell.

"This is the first time an unsupervised algorithm is shown capable of learning complex syntax, generating grammatical new sentences and proving useful in other fields that call for structure discovery from raw data, such as bioinformatics," he said.

The study appears in the Proceedings of the National Academy of Sciences.

Copyright 2005 by United Press International

Explore further: Supercomputers a hidden power center of Silicon Valley

Related Stories

Linguists tackle computational analysis of grammar

Feb 26, 2015

Children don't have to be told that "cat" and "cats" are variants of the same word—they pick it up just by listening. To a computer, though, they're as different as, well, cats and dogs. Yet it's computers ...

Recommended for you

Domino's taking orders via tweet

13 hours ago

Domino's on Wednesday launched perhaps the easiest way to satisfy a pizza craving this side of mind reading: order by tweet.

User comments : 0

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.