What you see affects what you hear (Videos)

March 4, 2009

Understanding what a friend is saying in the hubbub of a noisy party can present a challenge - unless you can see the friend's face.

New research from Baylor College of Medicine in Houston and the City College of New York shows that the visual information you absorb when you see can improve your understanding of the spoken words by as much as sixfold.

Your brain uses the visual information derived from the person's face and lip movements to help you interpret what you hear, and this benefit increases when the sound quality rises to moderately noisy, said Dr. Wei Ji Ma, assistant professor of neuroscience at BCM and the report's lead author, in a report that appears online today in the open access journal PLoS ONE.

This video is not supported by your browser at this time.
Example of congruent AV stimuli (boot) - 12dB noise.

"Most people with normal hearing lip-read very well, even though they don't think so," said Ma. "At certain noise levels, lip-reading can increase word recognition performance from 10 to 60 percent correct."

However, when the environment is very noisy or when the voice you are trying to understand is very faint, lip-reading is difficult.

This video is not supported by your browser at this time.
Examples of congruent AV* stimuli (cheap) - 12dB noise

"We find that a minimum sound level is needed for lip-reading to be most effective," said Ma.

This research is the first to study word recognition in a natural setting, where people report freely what they believe is being said. Previous experiments only used limited lists of words for people to choose from.

The lip-reading data help scientists understand how the brain integrates two different kinds of stimuli to come to a conclusion.

Ma and his colleagues constructed a mathematical model that allowed them to predict how successful a person will be at integrating the visual and auditory information.

People actually combine the two stimuli close to optimally, Ma said. What they perceive depends on the reliability of the stimuli.

"Suppose you are a detective," he said. "You have two witnesses to a crime. One is very precise and believable. The other one is not as believable. You take information from both and weigh the believability of each in your determination of what happened."

In a way, lip-reading involves the same kind of integration of information in the brain, he said.

In experiments, videos of individuals were shown in which a person said a word. If the person is presented normally, the visual information provides a great benefit when it is integrated with the auditory information, especially when there is moderate background noise. Surprisingly, if the person is just a "cartoon" that does not truly mouth the word, then the visual information is still helpful, though not as much.

In another study, the person mouths one word but the audio projects another, and often the brain integrates the two stimuli into a totally different perceived word.

"The mathematical model can predict how often the person will understand the word correctly in all these contexts," Ma said.

More information: Wei Ji Ma, Xiang Zhou, Lars A. Ross, John J. Foxe, Lucas C. Parra, " Lip-reading aids word recognition most in moderate noise: a Bayesian explanation using high-dimensional feature space," PLoS ONE, in press, to appear March 2009. dx.plos.org/10.1371/journal.pone.0004638

Source: Baylor College of Medicine

Explore further: FDA allows marketing of vision aid via tongue for blind

Related Stories

FDA allows marketing of vision aid via tongue for blind

June 23, 2015

This month Wisconsin-based company Wicab announced that the US Food and Drug Administration cleared a nonsurgical vision aid for the profoundly blind. The safety and effectiveness of their product, BrainPort V100, were supported ...

Robots do kitchen duty with cooking video dataset

January 5, 2015

Now that we have robots that walk, gesture and talk, roboticists are interested in a next level: How can they learn more than they already know? The ability of these machines to learn actions from human demonstrations is ...

GPS-loaded helmet offers easier trip for motorcyclists

December 30, 2014

An Android-based motorcycle helmet with GPS and voice-control has won the confidence of project supporters. The company is preparing to roll out its creation next year. Russian startup Livemap is behind this tech helmet with ...

Recommended for you

How the finch changes its tune

August 3, 2015

Like top musicians, songbirds train from a young age to weed out errors and trim variability from their songs, ultimately becoming consistent and reliable performers. But as with human musicians, even the best are not machines. ...

Machine Translates Thoughts into Speech in Real Time

December 21, 2009

(PhysOrg.com) -- By implanting an electrode into the brain of a person with locked-in syndrome, scientists have demonstrated how to wirelessly transmit neural signals to a speech synthesizer. The "thought-to-speech" process ...

0 comments

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.