Human-computer music performances use system that links music and musical gestures (w/ Video)

Nov 09, 2010 By Lisa Zyga feature
In this image from the video below, computers, robots, and humans perform together in the Machine Orchestra, which uses the captured musical gestures. Image credit: Ajay Kapur.

(PhysOrg.com) -- Every musical sound comes from a specific way that an instrument is played. With modern technology such as sensors, signal processing, and sometimes machine learning algorithms, researchers can determine the precise musical gesture used to produce a particular sound on an instrument. The ability to recreate musical gestures from sounds can be used for interactive human-computer music performances, music transcription, and other innovative applications.

In a new study, researchers have developed a method for capturing musical gestures and mapping them to sounds that overcomes some of the disadvantages of previous methods. Adam Tindale, Ajay Kapur, and George Tzanetakis – all trained musicians and computer scientists working at the University of Victoria in Victoria, Canada – have described the new method in a study to be published in IEEE Transactions on Multimedia. The method grew out of the authors' experiences in developing instruments for interactive human-computer music performances. At the time, Tindale and Kapur were both completing their PhDs at the University of Victoria. Tindale now works at the Alberta College of Art and Design and Kapur works at the California Institute of the Arts and the New Zealand School of Music.

As the researchers explain in their study, there are two main approaches for capturing musical gestures. One approach is direct acquisition, which involves attaching permanent to instruments to create “hyper-instruments.” However, this approach is often invasive to performers and requires modification of expensive instruments. The second approach is indirect acquisition, which involves using a microphone to capture the sound, as well as sophisticated and machine learning algorithms to extract gestures from the sounds, which requires large amounts of training.

The researchers' new method is somewhat of a hybrid of these two approaches. They temporarily attach sensors to an instrument to capture musical gestures and a microphone to capture sound. This data is analyzed, and the gesture-sound mappings are used to train machine learning models to extract gestures from sounds only. These machine models then create what the researchers call a “surrogate sensor,” which behaves like the original invasive sensor but is not attached to the instrument. The surrogate sensor can determine the musical gestures based only on the analyzed sound captured from the microphone.

This video is not supported by your browser at this time.
A demonstration of the gesture recognition system: in the second half of the video, the recognition system is parsing the events. When it detects a rim hit, it changes its pre-set processing on the physical modeling. Video credit: Adam Tindale.

“The main advantage of the method is that it allows large amounts of training data for machine learning algorithms to be acquired without human annotation simply by playing an instrument enhanced with sensors,” Tzanetakis told PhysOrg.com. “Once the surrogate sensor has been trained and its performance evaluated, then it can be used in place of the actual physical sensor on unmodified instruments of the same type.”

The system overcomes the previous difficulties in that it doesn't hinder the performers or their instruments when performing, and doesn't require large amounts of processing and analysis, greatly reducing time requirements. For instance, data from snare drum samples that took nearly a week of manual labor can be processed in less than an hour with the new method.

The researchers demonstrated the system with professional musicians playing electronic sitars and electronic snare drums. The results showed that the trained surrogate sensors can accurately determine musical gestures when different musicians were performing, and not only the performer who was used to train the surrogate sensor. Further, any kind of performing (such as improvisation or playing a song) could be used to train the surrogate sensor.


Video above: Computers, robots, and humans, including coauthor Ajay Kapur, perform together in the Machine Orchestra. Video credit: Ajay Kapur.

In the future, the researchers plan to make the system more sensitive to additional features, such as training it to recognize the type of mouthpiece used by a woodwind instrument and the string on which a note is played on a violin. Both Tindale and Kapur currently use the system during musical performances, as shown in the accompanying videos.

“Basically, the gesture information extracted is typically used in the following ways,” Tzanetakis explained. “(1) To map performance information to sound and a music generation algorithm that react in real-time expressively to the music; (2) to drive animations and visuals; (3) to synthesize sounds parametrically (for example, a synthesized drum sound might change based on where the snare drum is hit); and (4) to do analysis of the music played (for example, automatically tracking the tempo) in the context of computer-controlled robotic instruments interacting with humans.”

Explore further: What makes people click? Researchers analyze online news preferences

More information: -- Adam Tindale, et al. “Training surrogate sensors in musical gesture acquisition systems.” IEEE Transactions on Multimedia. To be published. DOI:10.1109/TMM.2010.2089786 .
-- www.adamtindale.com

3.9 /5 (10 votes)

Related Stories

Method developed to identify musical notes at any venue

Apr 23, 2010

A team of telecommunications engineers from the University of Jaen (UJA) has created a new method to automatically detect and identify the musical notes in an audio file and generate sheet music. The system ...

Neuroscientist: Think twice about cutting music in schools

Feb 21, 2010

At a press briefing today at the American Association for the Advancement of Science annual meeting, a Northwestern University neuroscientist will argue that music training has profound effects that shape the sensory system ...

Recommended for you

Seeing data

38 minutes ago

More data are being created, consumed, and transported than ever before, and in all areas of society, including business, government, health care, and science. The hope and promise is that this influx of ...

Making online translation accurate, reliable and efficient

Jun 13, 2013

European cooperation is based on our ability to understand each other. Given that there are presently 23 official EU languages, the availability of online tools to facilitate accurate translation is fundamentally ...

User comments : 0

More news stories

Seeing data

More data are being created, consumed, and transported than ever before, and in all areas of society, including business, government, health care, and science. The hope and promise is that this influx of ...

Dish won't submit revised bid for Sprint

Satellite TV operator Dish Network Corp. said Tuesday it would not submit a revised bid for Sprint, leaving the path open for the wireless carrier to accept what it already considers a superior offer from Japan's Softbank.

Diabetes key to transplant success, research finds

(Medical Xpress)—Better management of diabetes could dramatically improve outcomes for lung transplant patients, with new research showing that those without diabetes lived twice as long as transplant recipients ...