April 16, 2009

Dialect Detectives

by Dorothy Ryan

(PhysOrg.com) -- Technology under development by Pedro Torres-Carrasquillo and his colleagues at Lincoln Laboratory may lead to a dialect identification system that compensates for a translator's inexperience with multiple variants of a spoken language.

A law enforcement agency intercepts an international phone call alerting a suspected drug dealer to a new shipment. While the translator listening to the message is confident the caller's Spanish carries a South American accent, he cannot pinpoint a more specific region for agents to put under surveillance. But technology under development by Pedro Torres-Carrasquillo and his colleagues at Lincoln Laboratory may lead to a dialect identification system that compensates for a translator's inexperience with multiple variants of a spoken language.

Language identification systems that can recognize as many as 29 languages from written text are already marketed, and systems that can identify a spoken language from a prescribed range of choices also exist. So far, however, no system that automatically discriminates one spoken dialect from another is available.

Lincoln Laboratory's earlier work on dialect identification focused on building models that mapped the audiowave frequencies of phonemes - the individual sounds of a spoken language. Torres-Carrasquillo, an electrical engineer specializing in speech processing in the laboratory's Information Systems Technology Group, says his group has more recently moved from this phonetic-based approach to lower-level acoustic systems that use the basic spectral similarities of small pieces of spoken utterances. "We are not looking for the types of data linguists deal with - larger units such as phonemes and words," he says. "We're looking at the statistical distributions of basic frequency spectra of small pieces of sounds."

The laboratory researchers are building a model that classifies the training data, finding markers that discriminate the frequency characteristics of the data. Previously, Torres-Carrasquillo says, the approach was to "get a lot of examples, and then build a model that looks like your examples." But he is tackling the problem in a different way. "Our group's idea is that we don't need a model that looks like our data - we need a model that can classify our data," he explains. "We take very small pieces - snippets of speech - turn them into frequencies, add up all these contributions, and make a model that can tell them apart. We're looking for patterns from just milliseconds of speech."
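As a rough illustration of "turning snippets into frequencies," the sketch below cuts a signal into 20-millisecond frames, computes a magnitude spectrum for each, and averages them. It is not the laboratory's pipeline: the frame length, hop size, Hamming window, sample rate, and synthetic test tone are all assumptions chosen for the example, and the function names are hypothetical.

```python
import cmath
import math

SAMPLE_RATE = 8000  # assumed telephone-quality sampling rate, in Hz


def frame_signal(samples, frame_len, hop):
    """Split samples into overlapping frames of frame_len samples each."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def magnitude_spectrum(frame):
    """Magnitude of the DFT of one frame (non-negative frequency bins only)."""
    n = len(frame)
    # A Hamming window tapers the frame edges to reduce spectral leakage.
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    return [abs(sum(windowed[t] * cmath.exp(-2j * math.pi * k * t / n)
                    for t in range(n)))
            for k in range(n // 2 + 1)]


def average_spectrum(samples, frame_len=160, hop=80):
    """Average the per-frame magnitude spectra of a signal."""
    spectra = [magnitude_spectrum(f)
               for f in frame_signal(samples, frame_len, hop)]
    return [sum(s[k] for s in spectra) / len(spectra)
            for k in range(len(spectra[0]))]


# Synthetic stand-in for speech: 0.1 second of a 400 Hz tone.
tone = [math.sin(2 * math.pi * 400 * t / SAMPLE_RATE) for t in range(800)]
avg = average_spectrum(tone)

# Each bin spans SAMPLE_RATE / frame_len = 50 Hz, so 400 Hz falls in bin 8.
peak_bin = max(range(len(avg)), key=avg.__getitem__)
peak_hz = peak_bin * SAMPLE_RATE / 160
```

A real system would feed such per-frame spectra (or features derived from them) to a classifier instead of only averaging them; the averaging here just shows how many tiny snippets can be combined into one summary.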

The researchers are using two pattern recognition and classification methods, support vector machines (SVMs) and Gaussian mixture models (GMMs), both trained to emphasize the most distinctive fine-grained features in the frequency patterns of small pieces of the dialects in question. The trained GMMs have the edge in accuracy, but SVMs are "an order of magnitude faster than the GMM," according to Torres-Carrasquillo. Even more effective than either SVMs or GMMs alone, he says, is combining the two techniques. In a test to discriminate general American English from Indian-accented English, for example, the error rate was 10 percent when GMM was used alone, 15 percent for SVM alone - and only 7 percent for a fusion of GMM and SVM.
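The article does not say how the two systems are fused. One common way to combine classifiers is a weighted sum of their scores, and the sketch below assumes that approach. The scores and labels are made up for illustration and are not the laboratory's data; they are chosen so that the two systems make different mistakes, which is the situation in which fusion helps.

```python
def error_rate(scores, labels, threshold=0.0):
    """Fraction of trials whose thresholded score disagrees with the label.

    Labels are +1 (dialect A) or -1 (dialect B); a score above the
    threshold is read as a vote for dialect A.
    """
    wrong = sum(1 for s, y in zip(scores, labels)
                if (1 if s > threshold else -1) != y)
    return wrong / len(labels)


def fuse(scores_a, scores_b, weight_a=0.5):
    """Weighted sum of two systems' scores (an assumed fusion rule)."""
    return [weight_a * a + (1 - weight_a) * b
            for a, b in zip(scores_a, scores_b)]


# Hypothetical per-utterance scores from two systems on the same 8 trials.
labels = [+1, +1, +1, +1, -1, -1, -1, -1]
gmm_scores = [1.2, 0.8, -0.3, 0.9, -1.1, -0.7, 0.2, -0.9]
svm_scores = [0.5, -0.2, 0.6, 0.7, -0.4, 0.3, -0.8, -0.6]

fused_scores = fuse(gmm_scores, svm_scores)

gmm_error = error_rate(gmm_scores, labels)      # 2 of 8 trials wrong
svm_error = error_rate(svm_scores, labels)      # 2 of 8 trials wrong
fused_error = error_rate(fused_scores, labels)  # errors cancel in this toy set
```

In practice the fusion weight would be tuned on held-out data, and the gain depends on how independent the two systems' errors are.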

To be incorporated into an automatic machine translation system, a dialect identification system would have to be able to recognize a dialect without having to process lengthy strings of speech data. Torres-Carrasquillo's goal is to be able to determine a speaker's dialect by categorizing discrete, characteristic markers in the snippets, and then create a model without using large sets of training data. "We'd love to see a short-term spectrum characteristic that is a strong discriminator, is very pervasive in the dialect, and that could be reliably detected in a sample," he says.

Finding this characteristic is a tall order. "You're not going to have a single spectrum characteristic that gives away the identification," Torres-Carrasquillo says. The linguistic differences between dialects of a language are often small; for example, vowel sounds in Cuban Spanish are slightly longer than those of Puerto Rican Spanish. The subtle differences between the spectral pictures of dialects are difficult to detect, especially in the milliseconds of speech used in the laboratory's experiments. "But as you look at the data," says Torres-Carrasquillo, "the differences start to pile up and you have a profile." The laboratory's work to classify dialect differences, which Torres-Carrasquillo presented at a September 2008 speech communication and technology conference in Australia, may lead to the discovery of a strategy for any dialect problem - a global approach that could be exploited for various classes of dialects instead of a method that works only for specific dialects.
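The idea that small differences "pile up" can be illustrated with a toy calculation. The sketch below assumes each snippet yields a single number and that two dialects differ only slightly in the average of that number; the means, spread, sample count, and random seed are all hypothetical, and the laboratory's actual features are spectral and far richer. Each snippet contributes a small log-likelihood ratio, and the running total becomes decisive only after many snippets.

```python
import math
import random


def log_gauss(x, mean, std):
    """Log density of a one-dimensional Gaussian at x."""
    return -0.5 * math.log(2 * math.pi * std ** 2) - (x - mean) ** 2 / (2 * std ** 2)


def log_likelihood_ratio(x, mean_a, mean_b, std):
    """Evidence that x came from dialect A rather than dialect B."""
    return log_gauss(x, mean_a, std) - log_gauss(x, mean_b, std)


# Hypothetical models: the dialects differ by a quarter of a standard deviation.
MEAN_A, MEAN_B, STD = 100.0, 95.0, 20.0

rng = random.Random(0)  # fixed seed so the run is repeatable
snippets = [rng.gauss(MEAN_A, STD) for _ in range(2000)]  # speaker of dialect A

# Running total of per-snippet evidence.
cumulative = []
total = 0.0
for x in snippets:
    total += log_likelihood_ratio(x, MEAN_A, MEAN_B, STD)
    cumulative.append(total)

# A value halfway between the two means carries no evidence either way.
neutral = log_likelihood_ratio(97.5, MEAN_A, MEAN_B, STD)
```

With these assumed numbers a single snippet contributes on average about 0.03 to the total, so any one observation says almost nothing, while the sum over thousands of snippets points clearly toward dialect A.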

The Lincoln Laboratory research on dialect identification may contribute to approaches for language identification more generally, but Torres-Carrasquillo offers a caveat: "The differences one can exploit within two dialects are very specific - maybe too specific to be applicable to language ID." Still, when a universal machine translation system arrives on the scene in some future decade, it may well depend on Lincoln Laboratory research to ensure that nuances of meaning conveyed in dialects are not lost in translation.

Provided by Massachusetts Institute of Technology

Citation: Dialect Detectives (2009, April 16) retrieved 13 May 2024 from https://phys.org/news/2009-04-dialect.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
