It's like software understands, um, language

Jan 21, 2009

(PhysOrg.com) -- EU researchers have taken speech recognition to a whole new level by creating software that can understand spontaneous language. It will, like, make human-machine interaction, um, work a lot more, er, smoothly.

Automated speech recognition has revolutionised customer relations for banks, allowing them to respond quickly, and with fewer staff, to a larger number of low-level queries. It has helped to enable online banking and the development of more advanced private and public services because machines can handle routine matters, leaving people to take care of more serious issues.

But this technology has its limits. The most common, very basic, voice system asks a series of questions or offers a series of options, slowly and fitfully narrowing down your problem or supplying the solution. It would be nice to just tell the service what you want.

Soon you will be able to, thanks to the work of the Luna project, a Europe-wide effort to dramatically advance the power and intelligence of speech recognition. The team is moving the system from isolated utterances, like ‘yes’, ‘no’ or ‘account’, to spontaneous speech, such as ‘I want to get the balance on my current account.’

Um, ah, and er…

This higher level of speech recognition is called spoken language understanding (SLU). Here the software understands the meaning of what you are saying and can filter out the irrelevant verbiage, like ‘um’, ‘ah’ and ‘er’.
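As a rough illustration of the filtering step only, the sketch below strips filler words from a transcribed utterance. It is not Luna's method: the fixed filler list and the function name `strip_fillers` are assumptions made for this example, and a real SLU system would be expected to learn disfluencies from annotated speech rather than rely on a word list.

```python
import re

# Hypothetical filler list, chosen for illustration only.
FILLERS = {"um", "uh", "ah", "er", "erm"}

def strip_fillers(utterance: str) -> str:
    """Lowercase the utterance, drop punctuation and remove filler words."""
    tokens = re.findall(r"[a-z']+", utterance.lower())
    return " ".join(t for t in tokens if t not in FILLERS)

if __name__ == "__main__":
    print(strip_fillers("Um, I want to, er, get the balance on my, ah, current account"))
```

A word list like this cannot tell a filler from a real word that happens to look the same, which is one reason statistical approaches are preferred for spontaneous speech.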

Luna’s work in several languages is even more impressive. It has developed the most advanced SLU for both Polish and Italian, languages that had no similar systems before.

It is a big job. “We had to spend a lot of time initially recording spontaneous conversations between people and between people and machines,” explains Silvia Mosso, coordinator of the EU-funded Luna.

These recordings form the corpora, the collections of words and phrases that give the software its basic language. Researchers then have to annotate the terms in a way that machines can understand, and finally they apply statistical language models.
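The collect, annotate and model sequence can be sketched in miniature. The example below is a toy under stated assumptions, not the project's system: the four annotated utterances and the intent labels are made up, and the model is a simple unigram naive Bayes classifier with add-one smoothing, swapped in here because the article does not say which statistical models Luna used.

```python
import math
from collections import Counter, defaultdict

# Toy annotated corpus of (utterance, intent label) pairs.
# Both the utterances and the labels are invented for illustration.
CORPUS = [
    ("i want the balance on my current account", "balance"),
    ("what is my account balance", "balance"),
    ("i have a problem with my printer", "it_support"),
    ("my printer does not work", "it_support"),
]

def train(corpus):
    """Count words per intent and utterances per intent."""
    word_counts = defaultdict(Counter)
    intent_counts = Counter()
    for text, intent in corpus:
        intent_counts[intent] += 1
        word_counts[intent].update(text.split())
    vocab = {w for counts in word_counts.values() for w in counts}
    return word_counts, intent_counts, vocab

def classify(text, model):
    """Return the intent with the highest smoothed log-probability."""
    word_counts, intent_counts, vocab = model
    total = sum(intent_counts.values())
    best_intent, best_score = None, -math.inf
    for intent, n in intent_counts.items():
        score = math.log(n / total)  # prior from annotation frequencies
        # Add-one smoothing; the extra +1 reserves mass for unseen words.
        denom = sum(word_counts[intent].values()) + len(vocab) + 1
        for word in text.split():
            score += math.log((word_counts[intent][word] + 1) / denom)
        if score > best_score:
            best_intent, best_score = intent, score
    return best_intent

MODEL = train(CORPUS)

if __name__ == "__main__":
    print(classify("my printer has a problem", MODEL))
```

Even at this scale the dependence on the corpus is visible: the classifier can only recognise intents that appear in the annotated conversations it was trained on.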

I have a problem …

“You can say things like ‘I have a problem with my printer’ and it will help you go through the options,” says Mosso.

The result is a system that can interact with people in a much more natural and fluid way. It will mean faster and more productive interactions with service centres, whether it's getting travel information from public transport, dealing with an IT problem or finding tourist information, the three areas where Luna applied its research.

“The advantage with these areas is that you can apply our work to any kind of help centre. But if you want to apply it to different areas, then you need to do the initial collection of the conversations, the corpora, again,” Mosso reveals.

Fundamental mechanics

The project's scientific work is perhaps even more important. It examined the fundamental mechanics of language and the development of SLU, work that has potential applications in robotics and other areas.

Luna presented its work at ICT 2008, Europe’s largest conference and exhibition for European Information and Communication Technology research, and its demonstration was well received. “We had an avatar presenting the project and talking to people about it, and it was very popular.”

The project's work is guaranteed to see practical use, with industrial partners like France Telecom, Loquendo and CSI Piemonte planning to incorporate the results into services run within public administrations.

And the project still has several months left before it ends. “We have released the baseline systems in three languages and we will be refining them over the last months of the project.”

And then people can look forward to telephone systems with a little more understanding.

The Luna project received funding from the ICT strand of the Sixth Framework Programme for research.

This is the first of a two-part feature on Luna.

Provided by ICT Results
