May 7, 2007 feature

Software allows amateurs to compose professional-looking music sports videos

By Lisa Zyga , Phys.org

Although currently the composition of music sports videos requires a tech-savvy professional with an artist’s touch, the future may enable any amateur to create their own personalized video using software in the works by a group of scientists. With its fully or semi-automatic modes, the system would turn a tedious, time-consuming, and skilled task into a hobbyist’s evening activity.

Scientists Jinjun Wang et al., representing different institutions in China and Singapore, have presented a novel approach to personalized music sports video composition. Their system can automatically detect events, players, or teams, and then smoothly integrate the scenes with music, all while maintaining the artistic quality of professional videos. They predict that home users will easily be able to customize sports videos for themselves, greatly increasing the production of these videos, as well as expanding the audience.

“We have introduced a real-time sports event detection and broadcasting system and tested it using FIFA World Cup 2006 games,” Wang told PhysOrg.com. “In addition, with our second-step system, we are able to utilize the detected events to provide value-added services. With the ability to generate music sports video, it is possible to have prelude/postlude, half-break commentary, and summary TV programs using the latest game scenes.”

In their paper published in IEEE Transactions on Multimedia, the scientists explain how they optimized the features for the intelligent automatic system. For example, a user can request certain events (such as goals in soccer videos or three-pointers in basketball) to include in a video.

To satisfy the request, the system uses “semantic content extraction,” meaning that it searches the text for key words associated with the events. Text not only includes closed captioning, but also web casts from sites such as the BBC and Yahoo, where text often involves very detailed information. Rather than simple word matching, the software (dtSearch) uses techniques to filter unwanted scenes (e.g. ignores “goal kick” when searching for “goal”) and other advanced options.

Then, to align the sports scenes with music, the system can automatically choose a song whose phrasing, beats, and lyrical structure matches with the dynamics of the scene shots.

“The more different types of events the user need for ‘editing,’ the more processing time is required by our system,” Wang explained. “In an extreme case where all the types of events are required for detection, the system needs around 90 minutes to process a 90-minute soccer game—near real-time. The second step, the ‘editing" step, is quite fast—usually less than one minute for typical pop music.”

If a user has editing preferences (for example, they want certain shots to align with certain parts of a song), the system can also work in semi-automatic mode. A user’s stipulations can become fairly complex, as well, overriding the system’s inherent rules.

“A computer program won't create new things unless it's been taught to,” Wang explained. “In fact, computers are more suitable for tedious or computational tasks, such as selecting precise in/out frames and alignment work. The human must tell the computer what tasks to do.

“The difference with our system is that it is able to use some high-level, abstract rules,” he continued. “For example, a user may say ‘I want the music "Hero" and Beckham's shooting scene from EPL [English Premier League] 2004 to compose a music video,’ and our system can do the rest of the work, finding Beckham's shooting events and aligning them to the music to achieve smooth shot transitions and understandable video content. The contribution of our system is that, since it is able to execute certain high-level rules for video editing, people can personalize this rule to produce their customized music video.”

Even when the system performed fully automatically, the artistic results were impressive to viewers, who consistently rated the system higher in all scoring categories compared with other similar systems. In addition to their sensible structure, the videos also demonstrated a high degree of artistic quality, which may be somewhat surprising for a completely computerized system.

“We think that, given limited material, a good selection must follow some rules,” Wang said. “Since we are not artists, we have to do some statistical work to discover these rules. If a music video can satisfy most of the predefined rules, its artistic attribute won't be bad. But of course, it is usually necessary to conduct subjective evaluation to see how much the predefined rules are suitable.”

Wang said that currently the system targets the broadcasting industry, but hopefully general users will benefit from it in the future.

“We are definitely doing investigations to support more application areas based on the technique,” he said. “A software program like ‘muvee’ [a currently available program] for the general public is surely one of the best options.”

Sample videos created with this software are temporarily available at: www.ntu.edu.sg/home5/Y020002/R … emo/Introduction.htm.

Citation: Wang, Jinjun, Chng, Engsiong, Xu, Changsheng, Lu, Hanqinq, and Tian, Qi. “Generation of Personalized Music Sports Video Using Multimodal Cues.” IEEE Transactions on Multimedia, Vol. 9, No. 3, April 2007.

Citation: Software allows amateurs to compose professional-looking music sports videos (2007, May 7) retrieved 25 April 2024 from https://phys.org/news/2007-05-software-amateurs-professional-looking-music-sports.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

How Christmas music in adverts and shops harnesses nostalgia to encourage you to spend more

0 shares

Feedback to editors

Ancient giant tortoise fossils found in Colombian Andes

26 minutes ago

Emperor penguins perish as ice melts to new lows: Study

33 minutes ago

Artificial intelligence helps scientists engineer plants to fight climate change

11 hours ago

Ultrasensitive photonic crystal detects single particles down to 50 nanometers

12 hours ago

Scientists map soil RNA to fungal genomes to understand forest ecosystems

13 hours ago

Researchers show it's possible to teach old magnetic cilia new tricks

13 hours ago

Mantle heat may have boosted Earth's crust 3 billion years ago

13 hours ago

Study suggests that cells possess a hidden communication system

13 hours ago

Researcher finds that wood frogs evolved rapidly in response to road salts

13 hours ago

Imaging technique shows new details of peptide structures

14 hours ago

Load comments (0)

Software allows amateurs to compose professional-looking music sports videos

Ancient giant tortoise fossils found in Colombian Andes

Emperor penguins perish as ice melts to new lows: Study

Artificial intelligence helps scientists engineer plants to fight climate change

Ultrasensitive photonic crystal detects single particles down to 50 nanometers

Scientists map soil RNA to fungal genomes to understand forest ecosystems

Researchers show it's possible to teach old magnetic cilia new tricks

Mantle heat may have boosted Earth's crust 3 billion years ago

Study suggests that cells possess a hidden communication system

Researcher finds that wood frogs evolved rapidly in response to road salts

Imaging technique shows new details of peptide structures

Relevant PhysicsForums posts

Passing variables in FORTRAN

My Website For Creating Interactive Visuals Linked To Equations

Number of Multiplications in the FFT Algorithm

Error logging in: onLoginSuccess is not a function

Latest Notable AI accomplishments

Building a homemade Long Short Term Memory with FSMs

How Christmas music in adverts and shops harnesses nostalgia to encourage you to spend more

Is age linked to the picture of the perfect partner?

Understanding the key to predicting heat events in Central Europe

Q&A: Finding more sustainable ways to use plastics in agriculture

Study highlights benefits of user-generated content to digital platform

How a drought led to the rise of skateboarding in 1970s California

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

Software allows amateurs to compose professional-looking music sports videos

Ancient giant tortoise fossils found in Colombian Andes

Emperor penguins perish as ice melts to new lows: Study

Artificial intelligence helps scientists engineer plants to fight climate change

Ultrasensitive photonic crystal detects single particles down to 50 nanometers

Scientists map soil RNA to fungal genomes to understand forest ecosystems

Researchers show it's possible to teach old magnetic cilia new tricks

Mantle heat may have boosted Earth's crust 3 billion years ago

Study suggests that cells possess a hidden communication system

Researcher finds that wood frogs evolved rapidly in response to road salts

Imaging technique shows new details of peptide structures

Relevant PhysicsForums posts

Related Stories

How Christmas music in adverts and shops harnesses nostalgia to encourage you to spend more

Is age linked to the picture of the perfect partner?

Understanding the key to predicting heat events in Central Europe

Q&A: Finding more sustainable ways to use plastics in agriculture

Study highlights benefits of user-generated content to digital platform

How a drought led to the rise of skateboarding in 1970s California

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience