September 9, 2010

Bionic speech recognition

As speech recognition systems become more commonplace - on the computer desktop top, at the call centre and even in the car - it is increasingly important to ensure that the voice signal is as clear as possible before it is processed by a computer and acted upon. It could mean the difference between anything from a profitable financial deal to a safe vehicle or aircraft maneuver. Similarly, mobile phone conversations and even the clandestine recording of speech for security and law enforcement purposes could benefit.

Now, researchers at the University Campus in Tunis, Tunisia, have published details of a speech enhancement system that uses two distinct tools to reduce the noise from a recorded or sampled voice signal. Talbi Mourad, Salhi Lotfi, Abid Sabeur and Cherif Adnane of the Faculty of Sciences of Tunis, Laboratory of Signal Processing, explain how a bionic wavelet transform and a recurrent neural network can be used for speech enhancement in the International Journal of Signal and Imaging Systems Engineering.

"The presence of background noise in speech signal processing constitutes a very serious problem," the researchers explain. Noise affects the performance of speech recognition, coding and synthesis leading to failed voice commands and errors. There are three forms of noise that speech recognition systems must cope with: convolutive, multiplicative and additive. It is the latter, additive noise, that can have the most impact on speech recognition and it is this form of noise that the team addresses with their approach. Additive noise is often referred to as "white noise" and is commonly perceived as random background hiss on a sound recording.

"Our proposed technique consists of computing in an automatic manner the optimal threshold set to be employed to the bionic wavelet coefficients and this is performed by using an Elman neural network in the bionic wavelet domain," Mourad explains.

The team demonstrated the effectiveness of their approach against F16 fighter jet cockpit noise and the noise inside a Volvo car. "We have applied our hybrid method on several kinds of noises and noisy speech database and the obtained results show an increase in the signal to noise ratio from 5 dB to 12 dB," the team says. "In speech enhancement it is necessary to achieve a compromise between noise reduction and preserving intelligibility," adds Mourad.

More information: "Recurrent Neural Network and Bionic Wavelet Transform for speech enhancement" in Int. J. Signal and Imaging Systems Engineering, 2010, 3, 136-144

Provided by Inderscience Publishers

Citation: Bionic speech recognition (2010, September 9) retrieved 19 September 2024 from https://phys.org/news/2010-09-bionic-speech-recognition.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Reduced noise allows clearer mobile phone conversations

0 shares

Feedback to editors

New material with wavy layers of atoms exhibits unusual superconducting properties

45 minutes ago

Researchers build AI model database to find new alloys for nuclear fusion facilities

57 minutes ago

Greylag geese with similar personalities have higher hatching success, study suggests

57 minutes ago

Can captive tigers be part of the effort to save wild populations?

1 hour ago

Proteins in tooth enamel offer window into ancient and modern human wellness

2 hours ago

Mysteries of the bizarre 'pseudogap' in quantum physics finally untangled

2 hours ago

Are cows pickier than goats? Answers from innovative large-scale feeding experiments from 275 years ago

3 hours ago

Research predicts rise in tropical hydraulic failure

3 hours ago

Human genome stored on 'everlasting' memory crystal

3 hours ago

Scientists say there is enough evidence to agree to global action on microplastics

3 hours ago

Load comments (1)

Bionic speech recognition

New material with wavy layers of atoms exhibits unusual superconducting properties

Researchers build AI model database to find new alloys for nuclear fusion facilities

Greylag geese with similar personalities have higher hatching success, study suggests

Can captive tigers be part of the effort to save wild populations?

Proteins in tooth enamel offer window into ancient and modern human wellness

Mysteries of the bizarre 'pseudogap' in quantum physics finally untangled

Are cows pickier than goats? Answers from innovative large-scale feeding experiments from 275 years ago

Research predicts rise in tropical hydraulic failure

Human genome stored on 'everlasting' memory crystal

Scientists say there is enough evidence to agree to global action on microplastics

Relevant PhysicsForums posts

Container shrinks at certain screen widths (CSS)

Unsolvable python code bug? (finding the difference between two input strings)

User-Defined Functions in Sql Server SSMS

Can Fortran 77 Code Be Used to Debug Python Code for Solving ODEs Using Radau5?

Help solving a geometrical matching issue with Graph Neural Networks

Zipping identical iterables

Reduced noise allows clearer mobile phone conversations

Novel Ear-like Dual Microphone System Tunes Out Background Noise In Cellphones

Report Says Musicians Hear Better Than Non-Musicians

Carnegie Mellon engineering researchers to create speech recognition in silicon

Finding a Better Way to Quiet Noisy Environments

New brain findings on dyslexic children

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

Bionic speech recognition

New material with wavy layers of atoms exhibits unusual superconducting properties

Researchers build AI model database to find new alloys for nuclear fusion facilities

Greylag geese with similar personalities have higher hatching success, study suggests

Can captive tigers be part of the effort to save wild populations?

Proteins in tooth enamel offer window into ancient and modern human wellness

Mysteries of the bizarre 'pseudogap' in quantum physics finally untangled

Are cows pickier than goats? Answers from innovative large-scale feeding experiments from 275 years ago

Research predicts rise in tropical hydraulic failure

Human genome stored on 'everlasting' memory crystal

Scientists say there is enough evidence to agree to global action on microplastics

Relevant PhysicsForums posts

Related Stories

Reduced noise allows clearer mobile phone conversations

Novel Ear-like Dual Microphone System Tunes Out Background Noise In Cellphones

Report Says Musicians Hear Better Than Non-Musicians

Carnegie Mellon engineering researchers to create speech recognition in silicon

Finding a Better Way to Quiet Noisy Environments

New brain findings on dyslexic children

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience