Faster, easier way to access audiovisual assets

Jan 13, 2010
Faster, easier way to access audiovisual assets

(PhysOrg.com) -- Millions of hours of old shows sit collecting dust in the basements of TV and radio broadcasters. Digging through these audiovisual treasure troves is becoming faster and easier thanks to software developed by European researchers.

In recent years many public and private organisations have embarked on initiatives to digitise collections of recordings from decades past in an effort to gain new insights into history and preserve the audiovisual for posterity.

Sifting through these collections of analogue magnetic tapes they have uncovered long-lost footage of historical events and interviews with historical figures. But they have also encountered numerous problems.

“We don’t have the resources to digitise, describe and index all content in detail, so we need some sort of automated or semi-automated method,” says Jean-François Cosandier, the head of the documentation and archive department at Radio Suisse Romande, a radio station in the French-speaking part of Switzerland.

Not least among the challenges these archival archaeologists face is identifying what is included in the content of an old recording and cataloguing the digital copy for easy access and retrieval in the future.

“Some archives have collections of recordings that are well documented, but many do not,” says Philippe Scohy, a project manager at Memnon Archiving Services in Brussels. Some even have hundreds of thousands of hours of content without even knowing what’s in it.

Because of the lack of metadata information describing the content of old recordings it can take an archivist as long as five or six hours to catalogue a one-hour radio interview even though perhaps only a few minutes of that interview will be of interest.

“Given the amount of old media being digitised and the problems of identifying and cataloguing it, any tool that makes the archivist’s job easier is a welcome development,” Scohy notes.

Memnon is currently marketing a set of tools, IPI-Manager, intended to do just that. Developed in the EU-funded Memories project, the tools automate the more laborious aspects of the archiving process, helping archivists index and sort media collections faster and more easily. That in turn should lead to more historical content being made more accessible to more people, ensuring its preservation for future generations. For example, Radio Suisse Romande, a partner in the Memories project, plans to use the tools to help make its 80-year-old collection of audio recordings accessible and searchable online.

“We have digitised a quarter of our old analogue archive, so there is still a lot more work to be done,” Cosandier says. And, he adds, the development of new and more effective techniques is not justified solely by efforts to digitise old content. Nowadays archivists have to deal with a diverse range of audio documents, from radio programmes and speeches to conferences and university courses. With traditional methods of analysis and indexation it would be almost impossible to archive and make this content accessible.

Tools to dig up the past and the future

By analysing audio content, the Memories tools are able to identify different features of a recording. Used to catalogue a radio interview, for example, they detect when a question is asked and an answer given by recognising the exchange between speakers. The system then automatically tags each question and answer pair to let future listeners jump to different parts of the interview at the click of a button. Similarly, the Memories researchers developed a tool to automatically detect and tag the start, end and commercial breaks of different shows by recognising their trademark jingles.

“An old tape might be labelled with the shows that are on it, but more often than not an archivist is given no clue as to what order they are in or how long they run without watching the whole thing,” Scohy says. “Our tools provide that information.”

In the case of recordings of a person or people speaking, voice recognition technology can also be applied, which, with training, can automatically identify speakers, while a speech-to-text application turns the spoken content into text.

To provide search functionality, the Memories team developed a sophisticated search tool adapted from information-gathering methods that have been tried and tested in genetic and genomic applications. It is based on the statistical association of the occurrences of words.

In the case of music, the Memories researchers in Mist Technologies/Audionamix and Technion (Haifa) developed a tool to “unmix” the different channels that make up a song. Called Single Sensor Source Separation (SSSS), the software is able to differentiate between instruments, separating the sound of a trumpet from a piano, for example, and making it possible to identify different stages in a tune. The current version works best with mono recordings and can also be used to help digitally remaster them into stereo and surround sound, Scohy notes.

Open archives for future-proof content

The overall Memories architecture is based on the Open Archiving Information System (OAIS) model, a standard originally developed by the Consultative Committee for Space Data Systems (CCSDS) with the aim of future proofing digital content by storing and cataloguing it in such a way that it does not become obsolete and inaccessible as a result of technological progress.

“By adopting the OAIS approach we are trying to ensure that content is around for a very long time, not just years but thousands of years,” Scohy says.

With Memnon actively marketing products based on the work done in Memories and expecting its first sales imminently, preserving audiovisual memories for the future should be a little less of a challenge.

Explore further: New frontier in error-correcting codes

More information: Memories project: www.memories-project.eu/

Related Stories

Rich musical pickings with easier access to archives

Apr 22, 2009

(PhysOrg.com) -- Digital sound archives offer enormously rich resources but accessing them is currently difficult, and often arbitrary. European researchers believe they have developed a solution, one that offers compelling ...

P2P comes to the aid of audiovisual search (w/ Video)

Nov 18, 2009

(PhysOrg.com) -- Current methods of searching audiovisual content can be a hit-and-miss affair. Manually tagging online media content is time consuming, and costly. But new 'query by example' methods, built on peer-to-peer ...

UI develops free, easy-to-use web tool kit for archivists

Feb 19, 2008

Archivists at the University of Illinois Library believe they have built a better tool kit. Their new online collections management program called Archon has more than a few attractive features – not the least of which ...

Metadata bring order to digital chaos

Sep 10, 2008

MP3 files, video streams, digital images – the flood of multimedia data swells higher every day. New systems help the user to keep tabs on it all. At the International Broadcasting Convention IBC in Amsterdam on September ...

THESEUS - tool for internet services

Mar 03, 2009

(PhysOrg.com) -- The improved use and exploitation of digital knowledge - that is the aim of the THESEUS Project. In the future semantic technologies will be able to recognise the meaning of information content. Fraunhofer ...

Recommended for you

New frontier in error-correcting codes

10 hours ago

Error-correcting codes are one of the glories of the information age: They're what guarantee the flawless transmission of digital information over the airwaves or through copper wire, even in the presence of the corrupting ...

Five ways the superintelligence revolution might happen

Sep 26, 2014

Biological brains are unlikely to be the final stage of intelligence. Machines already have superhuman strength, speed and stamina – and one day they will have superhuman intelligence. This is of course ...

User comments : 0