August 11, 2015 report

New metamaterial device solves the cocktail party problem

by Bob Yirka , Phys.org

(Phys.org)—A team of researchers at Duke University has found a way to solve what is known as the cocktail party problem, getting a computer to pick out different human voices among multiple speakers in a single room. In their paper, published in Proceedings of the National Academy of Sciences they describe the device they constructed and the algorithm that goes along with it.

Most people have the uncanny ability to stand among a group of people, many of whom are talking, and pick out the words that are being spoken by any given individual, at will—our brains are somehow able to combine all the necessary ingredients—pitch, tone, distance, etc. and perhaps most importantly, filtering, to allow us to process only the words being spoken by the person we are focusing our attention on. Getting a computer to accomplish the same feat has been difficult—most solutions rely on the placement of multiple microphones, though some newer approaches have relied on artificial intelligence systems. Unfortunately, most such efforts have not led to a computer being anywhere near as accurate as a human being, until now.

The device developed by the team at Duke is made of plastic and is approximately pizza sized and shaped, thought it is a bit thicker—it was also constructed using a 3D printer. It is made up of 36 pie slices, or wedges, each made of a honeycombed structured acoustic metamaterial. Openings around the edges channel the sound toward a microphone that is fixed in the center of the hub. The wedges cause sound that passes through to be modified slightly in a beneficial way (attenuating certain frequencies). The sound that is captured by the microphone is then processed by an algorithm running on a computer that is able to localize what has been heard and assign words to a given speaker.

Helping Siri hear through a cocktail party — This prototype sensor can separate simultaneous sounds coming from different directions using a unique distortion given by the slice of "pie" that it passes through. Credit: Steve Cummer, Duke University

In testing their system, which the team describes as combining acoustic metamaterials and compressive sensing, they found it to be 96.7 percent accurate when run with three overlapping sound sources. They believe their device could be used in speech recognition applications and perhaps sensing or acoustic scenarios as well—and with some modifications, even in hearing aids.

More information: Single-sensor multispeaker listening with acoustic metamaterials, Yangbo Xie, PNAS, DOI: 10.1073/pnas.1502276112

Abstract
Designing a "cocktail party listener" that functionally mimics the selective perception of a human auditory system has been pursued over the past decades. By exploiting acoustic metamaterials and compressive sensing, we present here a single-sensor listening device that separates simultaneous overlapping sounds from different sources. The device with a compact array of resonant metamaterials is demonstrated to distinguish three overlapping and independent sources with 96.67% correct audio recognition. Segregation of the audio signals is achieved using physical layer encoding without relying on source characteristics. This hardware approach to multichannel source separation can be applied to robust speech recognition and hearing aids and may be extended to other acoustic imaging and sensing applications.

Press release

Journal information: Proceedings of the National Academy of Sciences

Citation: New metamaterial device solves the cocktail party problem (2015, August 11) retrieved 10 May 2024 from https://phys.org/news/2015-08-metamaterial-device-cocktail-party-problem.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Soda can array revisited: It may not beat the diffraction limit after all

1146 shares

Feedback to editors

New metamaterial device solves the cocktail party problem

New phononics materials may lead to smaller, more powerful wireless devices

New 'forever chemical' cleanup strategy discovered

TESS discovers a rocky planet that glows with molten lava as it's squeezed by its neighbors

Ultrasound experiment identifies new superconductor

Quantum breakthrough sheds light on perplexing high-temperature superconductors

How climate change will affect malaria transmission

Topological phonons: Where vibrations find their twist

Looking for life on Enceladus: What questions should we ask?

Analysis of millions of posts shows that users seek out echo chambers on social media

NASA images help explain eating habits of massive black hole

Relevant PhysicsForums posts

How does phase of merging sines affect overall periodic tones?

Interactive visualization of the Hopf fibration

Too much energy -- thought experiment

Calculating vacuum -- These numbers do not make sense

Density fluctuations and the color of the sky

Circular motion as a result of the Lorentz force

Soda can array revisited: It may not beat the diffraction limit after all

Lip-reading technology promises to make hearing aids more human

New method provides direct SI traceability for sound pressure

World's first 3-D acoustic cloaking device hides objects from sound

Phone snooping via gyroscope to be detailed at Usenix

Computer student on gesture control: Start experimenting

New phononics materials may lead to smaller, more powerful wireless devices

Probing neptunium's atomic structure with laser spectroscopy

Possible evidence of glueballs found during Beijing Spectrometer III experiments

Advanced experimental setup expands the hunt for hidden dark matter particles

Scientists directly measure a key reaction in neutron star binaries

The BREAD Collaboration is searching for dark photons using a coaxial dish antenna

Medical Xpress

Tech Xplore

Science X

New metamaterial device solves the cocktail party problem

New phononics materials may lead to smaller, more powerful wireless devices

New 'forever chemical' cleanup strategy discovered

TESS discovers a rocky planet that glows with molten lava as it's squeezed by its neighbors

Ultrasound experiment identifies new superconductor

Quantum breakthrough sheds light on perplexing high-temperature superconductors

How climate change will affect malaria transmission

Topological phonons: Where vibrations find their twist

Looking for life on Enceladus: What questions should we ask?

Analysis of millions of posts shows that users seek out echo chambers on social media

NASA images help explain eating habits of massive black hole

Relevant PhysicsForums posts

Related Stories

Soda can array revisited: It may not beat the diffraction limit after all

Lip-reading technology promises to make hearing aids more human

New method provides direct SI traceability for sound pressure

World's first 3-D acoustic cloaking device hides objects from sound

Phone snooping via gyroscope to be detailed at Usenix

Computer student on gesture control: Start experimenting

Recommended for you

New phononics materials may lead to smaller, more powerful wireless devices

Probing neptunium's atomic structure with laser spectroscopy

Possible evidence of glueballs found during Beijing Spectrometer III experiments

Advanced experimental setup expands the hunt for hidden dark matter particles

Scientists directly measure a key reaction in neutron star binaries

The BREAD Collaboration is searching for dark photons using a coaxial dish antenna

Newsletter sign up

Donate and enjoy an ad-free experience