Bionic speech recognition

Sep 09, 2010

As speech recognition systems become more commonplace - on the computer desktop, at the call centre and even in the car - it is increasingly important to ensure that the voice signal is as clear as possible before it is processed by a computer and acted upon. A clean signal could make the difference in anything from a profitable financial deal to a safe vehicle or aircraft maneuver. Mobile phone conversations and even the clandestine recording of speech for security and law enforcement purposes could benefit in the same way.

Now, researchers at the University Campus in Tunis, Tunisia, have published details of a speech enhancement system that uses two distinct tools to reduce the noise from a recorded or sampled voice signal. Talbi Mourad, Salhi Lotfi, Abid Sabeur and Cherif Adnane of the Faculty of Sciences of Tunis, Laboratory of Signal Processing, explain how a bionic wavelet transform and a recurrent neural network can be used for speech enhancement in the International Journal of Signal and Imaging Systems Engineering.

"The presence of in speech signal processing constitutes a very serious problem," the researchers explain. Noise affects the performance of speech recognition, coding and synthesis leading to failed voice commands and errors. There are three forms of noise that speech recognition systems must cope with: convolutive, multiplicative and additive. It is the latter, additive noise, that can have the most impact on and it is this form of noise that the team addresses with their approach. Additive noise is often referred to as "white noise" and is commonly perceived as random background hiss on a sound recording.

"Our proposed technique consists of computing in an automatic manner the optimal threshold set to be employed to the bionic wavelet coefficients and this is performed by using an Elman neural network in the bionic wavelet domain," Mourad explains.

The team demonstrated the effectiveness of their approach against F16 fighter jet cockpit noise and the noise inside a Volvo car. "We have applied our hybrid method on several kinds of noises and noisy speech database and the obtained results show an increase in the signal to noise ratio from 5 dB to 12 dB," the team says. "In speech enhancement it is necessary to achieve a compromise between noise reduction and preserving intelligibility," adds Mourad.
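For readers unfamiliar with the metric, the signal-to-noise ratio in decibels can be computed as in the sketch below, assuming a clean reference recording is available to compare against. The function name and arguments are illustrative assumptions, not taken from the paper.

```python
import math


def snr_db(clean, estimate):
    """SNR in dB: power of the clean signal over power of the residual error."""
    signal_power = sum(x * x for x in clean)
    noise_power = sum((x - y) ** 2 for x, y in zip(clean, estimate))
    return 10.0 * math.log10(signal_power / noise_power)
```

Because the scale is logarithmic, every additional 10 dB corresponds to a tenfold increase in the ratio of signal power to noise power.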

More information: "Recurrent Neural Network and Bionic Wavelet Transform for speech enhancement" in Int. J. Signal and Imaging Systems Engineering, 2010, 3, 136-144

User comments : 1

winthrom
not rated yet Sep 09, 2010
Sounds like a software "squelch" control
