February 1, 2016

Software adapts speech to ambient noise level

Loudspeaker announcements at railway stations are often incomprehensible, since the surroundings are noisy. With new software, the clarity of such announcements can be considerably improved. A microphone picks up ambient noise and adjusts the spoken messages perfectly to the noise level. Even calls over mobile phones will be understood more easily with the help of this technology.

If a freight train rattles past, passengers usually only understand about half of an announcement such as "The train to Frankfurt am Main will be departing today from platform...". Researchers from the Oldenburg-based Project Group Hearing, Speech and Audio Technology of the Fraunhofer Institute for Digital Media Technology IDMT have developed a software that significantly improves the intelligibility of speech – even for the voices of speakers at conferences or conversations on mobile phones.

Microphone analyzes noise levels

The trick of the ADAPT DRC software is that the ambient noise is continually analyzed via a microphone, and the speech is adjusted to it in real time. "It is not enough to simply make the voice louder over the loudspeaker or mobile phone to drown out the noise," says project manager Dr. Jan Rennies-Hochmuth. Such technologies are already used today in car radios, making the voice louder, but not necessarily more easily understood, because, at high volumes, the speakers reach their limits and start to rattle. "Speech is much more complex," says Rennies-Hochmuth.

Firstly, it is important to reinforce certain pitches, the frequencies, in a targeted fashion. Vowels are relatively deep, long-drawn-out word components that are easy to understand. Consonants like "p", "t" and "k", however, are very short and have higher frequencies. Even though they are very important for understanding what is said, it is generally not easy to understand them as well in noisy environments. For example, the consonants influence whether a recipient who is listening to an announcement in German thinks he has heard the word "Kasse" or "Tasse" (in English, "checkout" or "cup"). "Our algorithms are able to prioritize certain frequencies and to reinforce, at the right time, precisely those which are particularly disturbed by the ambient noise," adds Rennies-Hochmuth.

Amplifying quiet speech components

Secondly, the software takes into account the parts of the speech signal which are of different volumes. Since spoken language is composed of loud and quiet parts, experts use the term "voice dynamics". Speech intelligibility increases particularly when loud parts are systematically subdued and quiet parts are specifically amplified. This technique is called Dynamic Range Compression (DRC). This is also of interest if, for example, you make a call using a mobile phone when you are on a noisy street.

The ADAPT DRC software has already been developed to the point of application maturity and is available to industrial partners. Since modern conference equipment or mobile phones already have built-in microphones, the devices already possess the technology which is necessary to be able to record the ambient noise. For speaker systems at railway stations or airports, additional microphones would first have to be installed.

Provided by Fraunhofer-Gesellschaft

Citation: Software adapts speech to ambient noise level (2016, February 1) retrieved 2 July 2024 from https://phys.org/news/2016-02-software-speech-ambient-noise.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Synthetic speech system puts a dampener on noisy announcements

9 shares

Feedback to editors

Software adapts speech to ambient noise level

Microphone analyzes noise levels

Amplifying quiet speech components

Infrared glow high in Jupiter's atmosphere may be dark matter particles colliding

German scientists investigate supernova remnant SNR G309.8+00.0 at high energies

The demonstration of vacuum levitation and motion control on an optical-electrostatic chip

True scale of carbon impact from long-distance travel revealed

Aboriginal ritual passed down over 12,000 years, cave find shows

Increased atmospheric moisture may dampen the 'seeds' of hurricanes

Researchers train sheep to complete awake MRI imaging

Research intern helps discover a new pulsar buried in a mountain of data

Genetic patterns of world's farmed, domesticated foxes revealed via historical deep-dive

Study finds one-third of Indonesia's deforested land left idle

Relevant PhysicsForums posts

Custom icon for specific file type in Nautilus on Ubuntu 22.04

Cyber security in the modern/post-modern internet

AI In Actual Use

Help! Old PC dog has to learn new Mac tricks

How can you trade non integer values of Bitcoin?

Help with my buggy TV/Streaming Services

Synthetic speech system puts a dampener on noisy announcements

Speech signal processing technology for smart devices to achieve multilingual speech translation service

Reduced noise allows clearer mobile phone conversations

Fujitsu develops new speech synthesis technology

Brain turns down volume of background noise in a busy cafe

Turn up the volume? Researchers find better way for public announcements

Google's challenge to game consoles to kick off in November

Technology streamlines computational science projects

New video game teaches teens about electricity

Travis the translator aims to make people understood

Windows 10 update set for October release

De-jargonizing program helps decode science speak

Medical Xpress

Tech Xplore

Science X

Software adapts speech to ambient noise level

Microphone analyzes noise levels

Amplifying quiet speech components

Infrared glow high in Jupiter's atmosphere may be dark matter particles colliding

German scientists investigate supernova remnant SNR G309.8+00.0 at high energies

The demonstration of vacuum levitation and motion control on an optical-electrostatic chip

True scale of carbon impact from long-distance travel revealed

Aboriginal ritual passed down over 12,000 years, cave find shows

Increased atmospheric moisture may dampen the 'seeds' of hurricanes

Researchers train sheep to complete awake MRI imaging

Research intern helps discover a new pulsar buried in a mountain of data

Genetic patterns of world's farmed, domesticated foxes revealed via historical deep-dive

Study finds one-third of Indonesia's deforested land left idle

Relevant PhysicsForums posts

Related Stories

Synthetic speech system puts a dampener on noisy announcements

Speech signal processing technology for smart devices to achieve multilingual speech translation service

Reduced noise allows clearer mobile phone conversations

Fujitsu develops new speech synthesis technology

Brain turns down volume of background noise in a busy cafe

Turn up the volume? Researchers find better way for public announcements

Recommended for you

Google's challenge to game consoles to kick off in November

Technology streamlines computational science projects

New video game teaches teens about electricity

Travis the translator aims to make people understood

Windows 10 update set for October release

De-jargonizing program helps decode science speak

Newsletter sign up

Donate and enjoy an ad-free experience