A search engine for TV programs

July 7, 2009

The journalist recalls more or less what Ulla Schmidt said regarding the health reform, but needs the exact wording to be able to cite her. A new speech recognition system helps to search TV broadcasts. It does not need to be updated and so does not entail any running costs.

When was the financial crisis first mentioned in the news? What was it that Angela Merkel said concerning the presidential elections? Until now, journalists, archivists and media observers have had to search painstakingly to find a specific section in a TV recording - or have had to invest heaps of money in speech recognition software.

The systems currently available to take over the search have to be regularly updated by specialists and therefore entail high running costs. These systems are based on a kind of thesaurus containing all the words they can recognize. However, new topics and personalities bring along new words like "financial crisis" or names such as "Obama". These terms need to be transferred to the thesaurus so that they can be found.

Researchers at the Fraunhofer Institute for Intelligent Analysis and Information Systems IAIS in Sankt Augustin have developed a system that does not require expensive updating measures. "Our system is based on a syllable thesaurus instead of a word thesaurus. Conventional speech recognizers can only discern a limited number of words, while the total number of words in existence is too vast to handle. The number of existing syllables, on the other hand, is manageable. With about 10,000 stored syllables we can make up any word," says IAIS scientist Daniel Schneider. The program can even acquire new words independently by composing them from the stored syllables: fi-nan-cial cri-sis. It does not need to be updated and so does not entail any running costs.

For each search, the programs are first of all split into segments. Whenever a new speaker starts to talk or a film contribution begins - in which case the content of the audio track changes - the program saves the following scene as a new segment. The user can then navigate from speaker to speaker, and can choose to watch only the contributions of one particular interview partner. In a second step, the individual words are analyzed by speech algorithms.

Users can apply the program just like a conventional search engine. You simply enter the search term, and a few milliseconds later the program has scanned 10,000 hours of processed data. Just like an Internet search engine, it displays the results in context in their given sentences. The user then simply clicks on a word to play back the relevant section of film material. The system can find over 85 percent of the spoken words in a program, and 99 out of a 100 located contributions are correct. A license model of the program is already available.

Source: Fraunhofer-Gesellschaft (news : web)

Explore further: NEC Develops Speech-to-Speech Translation Software for Mobile Phones

Related Stories

Answers.com sues Babylon

March 9, 2006

Search engine Answers.com has sued Babylon, creators of language-translation software, for copyright infringement and violation of intellectual property.

Hum a few bars and I’ll find it

January 25, 2007

A European research consortium hopes to make it much easier to find audio/visual content online. The new search approach will be driven by content or example rather than relying on key words and tags.

Turn off TV to teach toddlers new words

June 28, 2007

Toddlers learn their first words better from people than from Teletubbies, according to new research at Wake Forest University. The study was published in the June 21 issue of Media Psychology.

MIT develops lecture search engine to aid students

November 8, 2007

Imagine you are taking an introductory biology course. You're studying for an exam and realize it would be helpful to revisit the professor's explanation of RNA interference. Fortunately for you, a digital recording of the ...

Recommended for you

New computer vision algorithm predicts orientation of objects

February 11, 2016

Seen from any angle, a horse looks like a horse. But it doesn't look the same from every angle. Scientists at Disney Research have developed a method to help computer vision systems avoid the confusion associated with changes ...

Record for fastest data rate set

February 11, 2016

A new record for the fastest ever data rate for digital information has been set by UCL researchers in the Optical Networks Group. They achieved a rate of 1.125 Tb/s as part of research on the capacity limits of optical transmission ...

GPS tracking down to the centimeter

February 11, 2016

Researchers at the University of California, Riverside have developed a new, more computationally efficient way to process data from the Global Positioning System (GPS), to enhance location accuracy from the meter-level down ...

Math reveals unseen worlds of Star Wars

February 10, 2016

Using a new computer program, EPFL researchers offer unusual insight into the universe of Star Wars, which includes more than 20,000 characters spread among 640 communities over a period of 36,000 years.

1 comment

Adjust slider to filter visible comments by rank

Display comments: newest first

jso
not rated yet Aug 18, 2009
You can already search what is said on live TV at www.livedash.com. It indexes what is mentioned on TV across all national channels, and provides real-time results

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.