W3C looks to improve speech recognition technology for web transactions

December 10, 2005 feature
A man surfs the internet in Valencia

W3C, the standards-setting body for the Internet (World Wide Web Consortium), has completed a draft for the important VoiceXML 3.0 - technology enabling voice identification verification. While normally associated with voice commands, it has the potential to greatly speed and improve the accuracy and positive proof of online transactions.

Some larger net businesses are even using it to confirm orders and verify identity. Many, however, have become increasingly worried about the reliability and security of these transactions with fraud and identity theft on the rise. Error rates have been around 1 to 2% - unacceptable for ironclad business transactions.

W3C does not actually make software but produces standards. They now have a working draft, said James Larson, co-chair of the W3C VBWG - Voice Browser Working Group.

The standard also addressed the issue of extending its Speech Synthesis Markup Language (SSML) functionality to certain languages including Mandarin Chinese, Japanese and Korean.

SSML is important because it allows software makers to control speech from pitch to volume to pronunciation. This insures the software will hear the right tones and pitches so critical in languages were a tiny change in pronunciation can affect the whole meaning of a word.

SSML is also used to tag areas of speech with different regional pronunciations. It is based on JSpeech Grammar Format (JSGF).

A technical description and how to use SSML version 1 on a web page can be found here: www.xml.com/pub/a/2004/10/20/ssml.html

Microsoft Agent website is another source for would be speech interface developers.
www.microsoft.com/MSAGENT/downloads/user.asp

Opera browsers can be programmed for speech recognition with some XHTML (Extended Hypertext Markup Language) extensions. my.opera.com/community/dev/voice/

Working with web-based speech applications can be frustrating. While the speech recognition software works well, poor quality microphones and PC speakers combined with slower Internet connections can put a damper on effectiveness. These issues will be difficult to address due to being largely beyond the control of the developer. New speech compression algorithms and simple responses like yes or no make the job much easier.

Trained systems – ones that are accustomed to the user’s voice – have been much more successful, but users typically do not have the patience to complete the training and the time factor makes it impractical.

Expect to find the first complex VoiceXML 3.0 technology mostly in telephone-connected and cell-phone activated systems – ones that have more controllable voice quality.

Hopefully, with the new W3C standards, companies can dedicate more to useful speech recognition and less to reinventing the wheel. Standards usually lead to software tool kits for programmers and these often end up in popular packages like Microsoft’s Frontpage and Adobe’s Macromedia Dreamweaver.

Amateur and professional web designers alike may soon find a compelling reason to upgrade to voice enabled web design suites.

Maybe one day you can toss that pesky keyboard and mouse and talk to your machine instead – a promise made since the late 1980s and not yet satisfactorily realized.

by Philip Dunn, Copyright 2005 PhysOrg.com

Explore further: Name that voice: Mathematica catches impersonations

Related Stories

Name that voice: Mathematica catches impersonations

December 2, 2014

Benedict Cumberbatch was recently invited to show off his voice impersonations of celebrities, from Christopher Walken to Taylor Swift. That video evidently inspired Wolfram's Rita Crook, marketing products manager, to ask ...

Build your own Siri: An open-source digital assistant

March 11, 2015

An open-source computing system you command with your voice like Apple's Siri is designed to spark a new generation of "intelligent personal assistants" for wearables and other devices. It could also lead to much-needed advancements ...

Recommended for you

How bees naturally vaccinate their babies

July 31, 2015

When it comes to vaccinating their babies, bees don't have a choice—they naturally immunize their offspring against specific diseases found in their environments. And now for the first time, scientists have discovered how ...

Image: Hubble sees a dying star's final moments

July 31, 2015

A dying star's final moments are captured in this image from the NASA/ESA Hubble Space Telescope. The death throes of this star may only last mere moments on a cosmological timescale, but this star's demise is still quite ...

Binary star system precisely timed with pulsar's gamma-rays

July 31, 2015

Pulsars are rapidly rotating compact remnants born in the explosions of massive stars. They can be observed through their lighthouse-like beams of radio waves and gamma-rays. Scientists at the Max Planck Institute for Gravitational ...

Exoplanets 20/20: Looking back to the future

July 31, 2015

Geoff Marcy remembers the hair standing up on the back of his neck. Paul Butler remembers being dead tired. The two men had just made history: the first confirmation of a planet orbiting another star.

Earth flyby of 'space peanut' captured in new video

July 31, 2015

NASA scientists have used two giant, Earth-based radio telescopes to bounce radar signals off a passing asteroid and produce images of the peanut-shaped body as it approached close to Earth this past weekend.

0 comments

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.