Speaking in song: New singalong software brings sweet melody to any cacophonous cry

Speaking in song
“We want to use our technology to help the average person sing well," the researcher says. Credit: Sergey Nivens

Whether you give it your best effort or your worst, voice synthesis software developed at Singapore's Agency for Science, Technology and Research (A*STAR) will make you sound like the melodious singer you've always wanted to be. Called I2R Speech2Singing, this software is the first to deliver high-quality singing automatically, while still preserving the original character of your natural voice.

"Many people like singing but they lack the skills to do so," says Minghui Dong, the project leader at A*STAR's Institute for Infocomm Research (I2R). "We want to use our technology to help the average person sing well."

Speech consists of three key elements: content, prosody and timbre. Content is conveyed using words; prosody, or in the case of singing, is expressed through rhythm and pitch; and timbre is the distinctive quality that makes a banjo sound different from a trumpet and one singer's different from another's. I2R Speech2Singing works by polishing melody while retaining the original content and timbre of a sound.

Existing technologies that focus on correcting melody try to align off-tune sounds to the closest note on the musical scale or to the exact note in the original score. The former works well for professional singers who may be only slightly out of tune but cannot fix those who are singing drastically off-key or simply reading out loud. The latter is better at correcting discordant tunes but ignores many other aspects of melody such as vibrato and vowel stretching.

I2R Speech2Singing uses recordings by professional singers as templates to correct the melody of a or to convert a speaking voice into a one. The detects the timing of each phonetic sound using and then stretches or compresses the duration of the signal using voice conversion technology to match the rhythm to that of a professional singer. A speech synthesizer then combines the time-corrected voice with pitch data and background music to produce a beautiful solo.

"When we compared the output with other currently available applications, we realized that our software generated a much better voice quality," says Dr Dong.

Singaporeans were first introduced to the software in 2013 through "Sing for Singapore", part of the official mobile app of National Day Parade 2013. And in 2014, I2R Speech2Singing won the award for best Show & Tell contribution at INTERSPEECH, a major global venue for research on the science and of speech communication.


Explore further

Voice synthesis software delivers high-quality singing automatically

Citation: Speaking in song: New singalong software brings sweet melody to any cacophonous cry (2016, June 17) retrieved 21 April 2021 from https://phys.org/news/2016-06-song-singalong-software-sweet-melody.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
8 shares

Feedback to editors

User comments