July 4, 2012

'Googling' through unique audio material: towards a better search result

by Netherlands Organisation for Scientific Research (NWO)

Searching audio archives can be improved if we look differently at the underlying technology and take into account how the results are used. This gives a clearer picture of the problems and of where improvements can be made. Laurens van der Werff demonstrated this in his PhD thesis 'Evaluation of Noisy Transcripts for Spoken Document Retrieval', which he will defend on 5 July at the University of Twente.

Van der Werff's research was carried out within the CHoral project, which focuses on making spoken audio material from the past accessible. Dutch archives and other heritage institutions look after many hundreds of thousands of hours of audio material, such as interviews with eyewitnesses of notable events and, for example, all broadcasts of national and regional radio stations.

If this unique audio material can be made properly accessible, it will make a valuable contribution to research on language use and dialect, regional and national politics, and history. CHoral is one of 18 projects in the NWO research programme CATCH (Continuous Access to Cultural Heritage), which has a total budget of more than 15 million euros and works on the accessibility of Dutch cultural heritage.

Improved evaluation of transcripts

Automatic speech recognition combined with search technology makes it possible to search through sound files: the spoken word is converted into written text (a transcript), which you can then search in the usual way. Many research labs worldwide are working hard on improving the quality of automatic speech recognition. However, for applications in search systems, and certainly for heritage collections, these improvements do not always deliver the greatest benefit.
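The idea of searching transcripts "as usual" can be illustrated with a minimal sketch. It assumes the speech recogniser has already produced one transcript per recording; the recording names and transcript texts below are invented for illustration, and the simple word index shown here is a generic technique, not the system used in CHoral.

```python
from collections import defaultdict

PUNCTUATION = ".,;:!?'\""

def build_index(transcripts):
    """Map each lower-cased word to the set of recording ids whose transcript contains it."""
    index = defaultdict(set)
    for recording_id, text in transcripts.items():
        for word in text.lower().split():
            index[word.strip(PUNCTUATION)].add(recording_id)
    return index

def search(index, query):
    """Return the ids of recordings whose transcript contains every query word."""
    words = [w.strip(PUNCTUATION) for w in query.lower().split()]
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result

# Hypothetical transcripts, standing in for speech recogniser output.
transcripts = {
    "tape_01": "the bombing of Rotterdam started in the afternoon",
    "tape_02": "radio broadcast about the harbour of Rotterdam",
    "tape_03": "the bombing was heard from far away",
}
index = build_index(transcripts)
hits = search(index, "bombing Rotterdam")
```

A recording is only found if the recogniser wrote down the query words correctly, which is why recognition errors in the words people actually search for matter most.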

For heritage collections, Van der Werff proposed a new way of evaluating the quality of automatically generated transcripts, one that pays more attention to how historians and other end users want to use the search results. This makes it easier to analyse where problems occur and provides leads for optimisation. Because the heritage sector has only a limited frame of reference on which to base optimisations, this approach is a most welcome step forward.
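Why an evaluation that looks at search results can differ from one that only counts recognition errors is shown in the sketch below. It is an illustration under stated assumptions, not the measure developed in the thesis: it compares the widely used word error rate with a simple overlap of search results, and all transcripts, names and the query are invented.

```python
def word_error_rate(reference, hypothesis):
    """Word-level edit distance divided by the number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, 1):
        cur = [i]
        for j, hyp_word in enumerate(hyp, 1):
            cur.append(min(prev[j] + 1,                           # deletion
                           cur[j - 1] + 1,                        # insertion
                           prev[j - 1] + (ref_word != hyp_word)))  # substitution
        prev = cur
    return prev[-1] / len(ref)

def matching_ids(transcripts, query):
    """Ids of recordings whose transcript contains every query word."""
    terms = query.lower().split()
    return {rid for rid, text in transcripts.items()
            if all(t in text.lower().split() for t in terms)}

def result_overlap(reference, automatic, query):
    """Jaccard overlap between results on manual and automatic transcripts."""
    ref_hits = matching_ids(reference, query)
    asr_hits = matching_ids(automatic, query)
    if not ref_hits and not asr_hits:
        return 1.0
    return len(ref_hits & asr_hits) / len(ref_hits | asr_hits)

# Hypothetical data: one manual transcript and two automatic versions of it.
reference = {"speech_01": "the queen spoke on radio oranje"}
asr_a = {"speech_01": "a queen spoke on radio oranje"}      # error in a function word
asr_b = {"speech_01": "the queen spoke on radio orange"}    # error in a search term

wer_a = word_error_rate(reference["speech_01"], asr_a["speech_01"])
wer_b = word_error_rate(reference["speech_01"], asr_b["speech_01"])
overlap_a = result_overlap(reference, asr_a, "radio oranje")
overlap_b = result_overlap(reference, asr_b, "radio oranje")
```

Both automatic transcripts contain exactly one wrong word, so their word error rates are equal, yet only the second one causes the recording to disappear from the results for this query.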

Specific challenges of heritage material

The audio material in heritage collections has a number of special characteristics. Many sound tapes have not been digitised, most have not been manually transcribed, and they have no metadata or only superficial metadata. Furthermore, the recordings often feature non-professional speakers with a lot of background noise. Many of the speakers occur in only a single sound fragment, so very little training material is available for a computer. This is a typical problem within cultural heritage, and it is exacerbated by the small geographic area in which Dutch is spoken. Another complicating factor is that this heritage data is mostly used in a highly specific manner. As a result of all of these special characteristics, an approach that works well with news data, for example, cannot be automatically applied to this unique material.

Applications of the optimised technology

The techniques from the CHoral project have been used, for example, on collections from the Rotterdam Municipal Archive (broadcasts of Radio Rijnmond; the website 'Brandgrens' with eyewitness accounts of the bombing of Rotterdam), the NIOD (Radio Oranje with speeches by Queen Wilhelmina during World War II; eyewitness accounts of survivors of Buchenwald) and the interview archive of Aletta/IAVV.

The knowledge and techniques from CHoral have also helped to lay the basis for the open source speech recognition package SHoUT (University of Twente), which has been further developed within the CATCH valorisation programme CATCHPlus (www.catchplus.nl). Using this software, each archive can now, in principle, make its audio sources accessible without needing its own in-house specialists. SHoUT is already being used for the national website 'Verteld Verleden' ['Spoken Past'], through which all audio sources in the Netherlands will be accessible in the future.

Further information: www.nwo.nl/catch and www.nwo.nl/catch/choral

Provided by Netherlands Organisation for Scientific Research (NWO)

Citation: 'Googling' through unique audio material: towards a better search result (2012, July 4) retrieved 14 August 2024 from https://phys.org/news/2012-07-googling-unique-audio-material-result.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
