April 1, 2008

Music file compressed 1,000 times smaller than mp3

Researchers at the University of Rochester have digitally reproduced music in a file nearly 1,000 times smaller than a regular MP3 file. The music, a 20-second clarinet solo, is encoded in less than a single kilobyte, and is made possible by two innovations: recreating in a computer both the real-world physics of a clarinet and the physics of a clarinet player.

The achievement, announced today at the International Conference on Acoustics Speech and Signal Processing held in Las Vegas, is not yet a flawless reproduction of an original performance, but the researchers say it's getting close.

"This is essentially a human-scale system of reproducing music," says Mark Bocko, professor of electrical and computer engineering and co-creator of the technology. "Humans can manipulate their tongue, breath, and fingers only so fast, so in theory we shouldn't really have to measure the music many thousands of times a second like we do on a CD. As a result, I think we may have found the absolute least amount of data needed to reproduce a piece of music."

In replaying the music, a computer literally reproduces the original performance based on everything it knows about clarinets and clarinet playing. Two of Bocko's doctoral students, Xiaoxiao Dong and Mark Sterling, worked with Bocko to measure every aspect of a clarinet that affects its soundfrom the backpressure in the mouthpiece for every different fingering, to the way sound radiates from the instrument. They then built a computer model of the clarinet, and the result is a virtual instrument built entirely from the real-world acoustical measurements.

The team then set about creating a virtual player for the virtual clarinet. They modeled how a clarinet player interacts with the instrument including the fingerings, the force of breath, and the pressure of the player's lips to determine how they would affect the response of the virtual clarinet. Then, says Bocko, it's a matter of letting the computer "listen" to a real clarinet performance to infer and record the various actions required to create a specific sound. The original sound is then reproduced by feeding the record of the player's actions back into the computer model.

At present the results are a very close, though not yet a perfect, representation of the original sound.

"We are still working on including 'tonguing,' or how the player strikes the reed with the tongue to start notes in staccato passages," says Bocko. "But in music with more sustained and connected notes the method works quite well and it's difficult to tell the synthesized sound from the original."

As the method is refined the researchers imagine that it may give computer musicians more intuitive ways to create expressive music by including the actions of a virtual musician in computer synthesizers. And although the human vocal tract is highly complex, Bocko says the method may in principle be extended to vocals as well.

The current method handles only a single instrument at a time, however in other work in the University's Music Research Lab with post-doctoral researcher Gordana Velikic and Dave Headlam, professor of music theory at the University of Rochester's Eastman School of Music, the team has produced a method of separating multiple instruments in a mix so the two methods can be combined to produce a very compact recording.

Bocko believes that the quality will continue to improve as the acoustic measurements and the resulting synthesis algorithms become more accurate, and he says this process may represent the maximum possible data compression of music.

"Maybe the future of music recording lies in reproducing performers and not recording them," says Bocko.

Source: University of Rochester

Citation: Music file compressed 1,000 times smaller than mp3 (2008, April 1) retrieved 19 September 2024 from https://phys.org/news/2008-04-music-compressed-smaller-mp3.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

National park wild boar contain five-times more toxic PFAS than humans allowed to eat, study finds

0 shares

Feedback to editors

Music file compressed 1,000 times smaller than mp3

New material with wavy layers of atoms exhibits unusual superconducting properties

Researchers build AI model database to find new alloys for nuclear fusion facilities

Greylag geese with similar personalities have higher hatching success, study suggests

Can captive tigers be part of the effort to save wild populations?

Proteins in tooth enamel offer window into ancient and modern human wellness

Mysteries of the bizarre 'pseudogap' in quantum physics finally untangled

Are cows pickier than goats? Answers from innovative large-scale feeding experiments from 275 years ago

Research predicts rise in tropical hydraulic failure

Human genome stored on 'everlasting' memory crystal

Scientists say there is enough evidence to agree to global action on microplastics

Relevant PhysicsForums posts

Laptop Shutting Down Unexpectedly: Dead Battery?

Whatever happened to Adobe PrintGear?

OpenAI introduces o1 Formerly known as Q

Creating Entropy For Cryptographic Purposes

Should I set up SPF, DKIM, DMARC, or forward email to Gmail account?

Are human database related jobs going to disappear?

National park wild boar contain five-times more toxic PFAS than humans allowed to eat, study finds

Time to build zero-debris satellites

Lemur communication shows how humans evolved to create music

For many urban residents, it's even hotter than their weather app says

Half of world's lakes are less resilient to disturbance than they used to be

Only 1 in 3 people enjoy talking about politics—researchers say the reasons are more social than political

Google's challenge to game consoles to kick off in November

Technology streamlines computational science projects

New video game teaches teens about electricity

Travis the translator aims to make people understood

Windows 10 update set for October release

De-jargonizing program helps decode science speak

Medical Xpress

Tech Xplore

Science X

Music file compressed 1,000 times smaller than mp3

New material with wavy layers of atoms exhibits unusual superconducting properties

Researchers build AI model database to find new alloys for nuclear fusion facilities

Greylag geese with similar personalities have higher hatching success, study suggests

Can captive tigers be part of the effort to save wild populations?

Proteins in tooth enamel offer window into ancient and modern human wellness

Mysteries of the bizarre 'pseudogap' in quantum physics finally untangled

Are cows pickier than goats? Answers from innovative large-scale feeding experiments from 275 years ago

Research predicts rise in tropical hydraulic failure

Human genome stored on 'everlasting' memory crystal

Scientists say there is enough evidence to agree to global action on microplastics

Relevant PhysicsForums posts

Related Stories

National park wild boar contain five-times more toxic PFAS than humans allowed to eat, study finds

Time to build zero-debris satellites

Lemur communication shows how humans evolved to create music

For many urban residents, it's even hotter than their weather app says

Half of world's lakes are less resilient to disturbance than they used to be

Only 1 in 3 people enjoy talking about politics—researchers say the reasons are more social than political

Recommended for you

Google's challenge to game consoles to kick off in November

Technology streamlines computational science projects

New video game teaches teens about electricity

Travis the translator aims to make people understood

Windows 10 update set for October release

De-jargonizing program helps decode science speak

Newsletter sign up

Donate and enjoy an ad-free experience