October 26, 2021

To better understand speech, focus on who is talking

Seeing a person's face as we are talking to them greatly improves our ability to understand their speech. While previous studies indicate that the timing of words-to-mouth movements across the senses is critical to this audio-visual speech benefit, whether it also depends on spatial alignment between faces and voices has been largely unstudied.

Researchers found matching the locations of faces with the speech sounds they are producing significantly improves our ability to understand them, especially in noisy areas where other talkers are present.

In the Journal of the Acoustical Society of America, researchers from Harvard University, University of Minnesota, University of Rochester, and Carnegie Mellon University outline a set of online experiments that mimicked aspects of distracting scenes to learn more about how we focus on one audio-visual talker and ignore others.

"If there's only one multisensory object in a scene, our group and others have shown that the brain is perfectly willing to combine sounds and visual signals that come from different locations in space," said author Justin Fleming. "It's when there's multisensory competition that spatial cues take on more importance."

The researchers first asked participants to pay attention to one talker's speech and ignore another talker, either when corresponding faces and voices originated from the same location or different locations. Participants performed significantly better when the face matched where the voice was coming from.

Next, they found task performance decreased when participants directed their gaze toward a voice trying to distract them.

Finally, the researchers showed spatial alignment between faces and voices was more important when the background noise was louder, suggesting the brain makes more use of audio-visual spatial cues in challenging sensory environments.

The pandemic forced the group to get creative about conducting such research with participants over the internet.

"We had to learn about—and, in some cases, create—several tasks to make sure participants were seeing and hearing the stimuli properly, wearing headphones, and following instructions," Fleming said.

Fleming hopes their findings will lead to improved designs for hearing devices and better handling of sound in virtual and augmented reality. They look to expand on their work by bringing additional real-world elements into the fold.

"Historically, we have learned a great deal about our sensory systems from studies involving simple flashes and beeps," he said. "However, this and other studies are now showing that when we make our tasks more complicated in ways that better simulate the real world, new patterns of results start to emerge."

More information: Justin T. Fleming et al, Spatial alignment between faces and voices improves selective attention to audio-visual speech,
Journal of the Acoustical Society of America (2021). doi.org/10.1121/10.0006415

Journal information: Journal of the Acoustical Society of America

Provided by American Institute of Physics

Citation: To better understand speech, focus on who is talking (2021, October 26) retrieved 11 July 2024 from https://phys.org/news/2021-10-speech-focus.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

How does eye position affect 'cocktail party' listening?

153 shares

Feedback to editors

To better understand speech, focus on who is talking

A new species of extinct crocodile relative rewrites life on the Triassic coastline

New method achieves tenfold increase in quantum coherence time via destructive interference of correlated noise

Mars likely had cold and icy past, new study finds

Study: Nanoparticle vaccines enhance cross-protection against influenza viruses

New tools are needed to make water affordable, says study

Researchers demonstrate how to build 'time-traveling' quantum sensors

Lion with nine lives breaks record with longest swim in predator-infested waters

New multimode coupler design advances scalable quantum computing

High-speed electron camera uncovers new 'light-twisting' behavior in ultrathin material

Perceived warmth, competence predict callback decisions in meta-analysis of hiring experiments

Relevant PhysicsForums posts

What will be the reading of this vernier calliper?

Any examples of naturally occurring holograms?

Why can't we photograph magnetic lines of force emitted by planetary and beyond objects (like we can for the Sun)?

Can anyone tell me what these formulas in electromagnetism are called?

How can concentration gradient field come into existence immediately?

Increasing tone while mixing sugar in water

How does eye position affect 'cocktail party' listening?

Study explains 'cocktail party effect' in hearing impairment

Does visual feedback of our tongues help in speech motor learning?

Face masks provide additional communication barrier for nonnative speech

Study finds blind people depend on timing cues for some spatial awareness

Training the brain to recognize voices

Physicists report first measured isomeric-ratio in multinucleon-transfer reactions: A doorway to access terra incognita

Searching for dark matter with the coldest quantum detectors in the world

Physicists explore how fluctuations shape transport networks

Scientists create world's most amazingly difficult maze with future potential to boost carbon capture

Moving beyond the 80-year-old solar cell equation

CERN's ATLAS experiment releases 65 TB of open data for research

Medical Xpress

Tech Xplore

Science X

To better understand speech, focus on who is talking

A new species of extinct crocodile relative rewrites life on the Triassic coastline

New method achieves tenfold increase in quantum coherence time via destructive interference of correlated noise

Mars likely had cold and icy past, new study finds

Study: Nanoparticle vaccines enhance cross-protection against influenza viruses

New tools are needed to make water affordable, says study

Researchers demonstrate how to build 'time-traveling' quantum sensors

Lion with nine lives breaks record with longest swim in predator-infested waters

New multimode coupler design advances scalable quantum computing

High-speed electron camera uncovers new 'light-twisting' behavior in ultrathin material

Perceived warmth, competence predict callback decisions in meta-analysis of hiring experiments

Relevant PhysicsForums posts

Related Stories

How does eye position affect 'cocktail party' listening?

Study explains 'cocktail party effect' in hearing impairment

Does visual feedback of our tongues help in speech motor learning?

Face masks provide additional communication barrier for nonnative speech

Study finds blind people depend on timing cues for some spatial awareness

Training the brain to recognize voices

Recommended for you

Physicists report first measured isomeric-ratio in multinucleon-transfer reactions: A doorway to access terra incognita

Searching for dark matter with the coldest quantum detectors in the world

Physicists explore how fluctuations shape transport networks

Scientists create world's most amazingly difficult maze with future potential to boost carbon capture

Moving beyond the 80-year-old solar cell equation

CERN's ATLAS experiment releases 65 TB of open data for research

Newsletter sign up

Donate and enjoy an ad-free experience