August 26, 2009

Strictly ballroom analysis: Computers get to know their rumba from their cha-cha-cha

Computer scientists in Taiwan have devised a neural network program that can successfully classify a computerized music file based on its beat and tempo. The system could be a boon for music archivists with large numbers of untagged recordings and for users searching through mislabeled mp3 libraries. Details of tests on ballroom dancing music are reported this month in the International Journal of Intelligent Information and Database Systems.

Mao-Yuan Kao and Chang-Biau Yang of National Sun Yat-sen University, in Kaohsiung and Shyue-Horng Shiau of the Chang Jung Christian University, in Tainan, explain that most music fans can put a tune into a particular genre even on a first listen. However, for archivists and others with large collections of unclassified music an automated approach that assigns the main genre to each tune would save a lot of time and effort.

Until now, there were two main approaches to classifying music - the Ellis and the Dixon methods - named for their inventors. These methods work reasonably well by analyzing the audio signal but Yang and colleagues hope to combine the strengths of each and to use a neural network to do the initial classification.

An artificial neural network, is a type of computer model that mimics the behavior of clusters of brain cells. The researchers "play" the music file to the neural network, which analyses the beat and tempo and outputs a general musical genre. Additional music files are played on after the other and genres assigned.

In this initial learning phase the researchers correct the misses and feed the hits back into the neural network so that it builds up an audio profile of how different music files sound in each different genre. Once the neural network has been trained it can then classify a whole collection of music files.

The next step is then to use the Ellis and Dixon methods to further confirm the genre of each neurally classified group of music files. These methods using different signal processing approaches to "listen" to the music file and to determine the position of peaks that correspond to the musical beat. They can be used to estimate tempo and beat pattern.

The Taiwanese team has so far tested their approach on a collection of several hundred ballroom dance music files. Their system has classified different music styles, such as cha-cha-cha, jive, quickstep, and tango, with varying degrees of success, the cha-cha-cha being the most accurately categorized in tests of dozens of music files. The American rumba, unfortunately failed classification. Paradoxically, the neural network could successfully classify a Viennese waltz with relative ease, but not a standard waltz.

Overall, the neural network approach is never 100% accurate, but in all musical genres tested it outperformed both the Ellis and Dixon methods. They suggest that further training of the neural network with classical, jazz and pop music files will ultimately allow them to classify much more diverse music collections automatically.

More information: "Tempo and beat tracking for audio signals with music genre classification" in Int. J. Intelligent Information and Database Systems, 2009, 3, 275-290; www.inderscience.com/offer.php?id=27687

Source: Inderscience Publishers (news : web)

Citation: Strictly ballroom analysis: Computers get to know their rumba from their cha-cha-cha (2009, August 26) retrieved 3 July 2024 from https://phys.org/news/2009-08-strictly-ballroom-analysis-rumba-cha-cha-cha.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Metadata bring order to digital chaos

0 shares

Feedback to editors

Compact cities found to have lower carbon emissions but poorer air quality, less green space and higher mortality rates

3 hours ago

New theory reveals fracture mechanism in soft materials

5 hours ago

Researchers uncover key mechanisms in chromosome structure development

5 hours ago

Energy landscape theory sheds light on evolution of foldable proteins

6 hours ago

Researchers discover photo-induced charge-transfer complex between amine and imide

7 hours ago

Why do you keep your house so cold? Study suggests childhood home temperature can predict adult thermostat settings

8 hours ago

Cryptocurrency investors are more likely to self-report 'Dark Tetrad' personality traits, study shows

8 hours ago

New study challenges drought theory for Cahokia exodus

8 hours ago

World's oldest artwork discovered in Indonesian cave

9 hours ago

How 36,000 buried tea bags help explain global decomposition

9 hours ago

Load comments (0)

Strictly ballroom analysis: Computers get to know their rumba from their cha-cha-cha

Compact cities found to have lower carbon emissions but poorer air quality, less green space and higher mortality rates

New theory reveals fracture mechanism in soft materials

Researchers uncover key mechanisms in chromosome structure development

Energy landscape theory sheds light on evolution of foldable proteins

Researchers discover photo-induced charge-transfer complex between amine and imide

Why do you keep your house so cold? Study suggests childhood home temperature can predict adult thermostat settings

Cryptocurrency investors are more likely to self-report 'Dark Tetrad' personality traits, study shows

New study challenges drought theory for Cahokia exodus

World's oldest artwork discovered in Indonesian cave

How 36,000 buried tea bags help explain global decomposition

Relevant PhysicsForums posts

Number of Multiplications in the FFT Algorithm

Newbie question about deep learning

Who can find the largest prime number with their own programmed code?

Math Major Trying to Learn CS

Parallelizing N-Queens

How to test locally hosted websites on mobile?

Metadata bring order to digital chaos

Nokia 6630 Music Edition

Next-generation hi-fi: deepening the musical experience

Semantic descriptors to help the hunt for music

Intelligent DJ Emerges from Fundamental Research

60-second review: Sansa slotRadio

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

Strictly ballroom analysis: Computers get to know their rumba from their cha-cha-cha

Compact cities found to have lower carbon emissions but poorer air quality, less green space and higher mortality rates

New theory reveals fracture mechanism in soft materials

Researchers uncover key mechanisms in chromosome structure development

Energy landscape theory sheds light on evolution of foldable proteins

Researchers discover photo-induced charge-transfer complex between amine and imide

Why do you keep your house so cold? Study suggests childhood home temperature can predict adult thermostat settings

Cryptocurrency investors are more likely to self-report 'Dark Tetrad' personality traits, study shows

New study challenges drought theory for Cahokia exodus

World's oldest artwork discovered in Indonesian cave

How 36,000 buried tea bags help explain global decomposition

Relevant PhysicsForums posts

Related Stories

Metadata bring order to digital chaos

Nokia 6630 Music Edition

Next-generation hi-fi: deepening the musical experience

Semantic descriptors to help the hunt for music

Intelligent DJ Emerges from Fundamental Research

60-second review: Sansa slotRadio

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience