Mistletoe hugs, kisses spotted by computer (w/ Video)

Dec 22, 2010
Capturing the motion of a hug.

(PhysOrg.com) -- Hugs and kisses exchanged under the mistletoe are among the human interactions which can now be automatically recognized by computers from video footage, thanks to new research.

The technology, developed at Oxford University, can also automatically recognize interactions such as handshakes and high fives. It is part of research to enable computers to automatically analyse the content of the vast amount of generated from sources such as TV, films, YouTube and CCTV.

"Human actions and activities are of central importance in ," said Alonso Patron-Perez of Oxford University’s Department of Engineering Science, who led the research. "This new work makes it possible to recognise two-person human interactions, such as hugs, kisses and hand-shakes, automatically. Once you can recognize these interactions the applications are numerous: for instance you could automatically search home videos and YouTube for kisses and handshakes or even fast forward CCTV to find incidents."

This video is not supported by your browser at this time.
An illustration of the method proposed is shown in this video.

The method, developed by an Oxford University team including Alonso Patron-Perez, Dr. Ian Reid, Dr. Marcin Marszalek, and Professor Andrew Zisserman, is built on algorithms from computer vision and machine learning.

Teaching computers to recognize the interactions involves a number of steps: first, humans are detected and tracked through the video footage; then, once the position of the humans in the video is established, different cues such as head orientation and relative motion of people’s bodies are computed to determine if an interaction occurs and, if it does, what kind of interaction it is.

All this information is computed for several examples of each interaction (the team has focused on four interactions so far: handshakes, high fives, hugs and kisses), and machine learning methods are then used to learn a model for each interaction from these examples.

Alonso Patron-Perez said: "Once a computer has learnt these models, human interactions can then be located and recognised in new videos, with the computer able to determine when in the video interactions occur, which people are interacting and what kind of interactions are involved. This work enables computers to make sense of how people are behaving in video footage in a way that has simply not been possible before."

Explore further: What makes people click? Researchers analyze online news preferences

More information: Project page: www.robots.ox.ac.uk/~alonso/projects/human_interactions_project.html

Related Stories

Researchers train computers to analyze fruit-fly behavior

Apr 08, 2009

Scientists at the California Institute of Technology (Caltech) have trained computers to automatically analyze aggression and courtship in fruit flies, opening the way for researchers to perform large-scale, high-throughput ...

YouTube adds online video editing tool

Jun 17, 2010

YouTube users can now edit their own videos online. The Google-owned video-sharing site added an online editing tool this week that allows YouTube users to combine multiple videos, shorten a video or add soundtracks ...

Training computers to classify pictures and videos

Oct 13, 2010

Spanish researchers have developed a new computer technique that allows to "train" computers to interpret the visual contents of a video or picture. This advance will allow to classify automatically pictures ...

Recommended for you

Seeing data

18 hours ago

More data are being created, consumed, and transported than ever before, and in all areas of society, including business, government, health care, and science. The hope and promise is that this influx of ...

Making online translation accurate, reliable and efficient

Jun 13, 2013

European cooperation is based on our ability to understand each other. Given that there are presently 23 official EU languages, the availability of online tools to facilitate accurate translation is fundamentally ...

User comments : 2

Adjust slider to filter visible comments by rank

Display comments: newest first

Marquette
5 / 5 (2) Dec 22, 2010
Coming soon to street corners in Iran.
that_guy
3 / 5 (2) Dec 22, 2010
oh HAL is going to love this.

More news stories

Multiview 3-D photography made simple

Computational photography is the use of clever light-gathering tricks and sophisticated algorithms to extract more information from the visual environment than traditional cameras can.

Microsoft mulled buying Nokia unit

Microsoft was in talks to boost its position in the mobile phone market by buying the devices business from Nokia but failed to seal a deal, the Wall Street Journal reported Wednesday.

LA to give every student an iPad; $30M order

Los Angeles' school system, the second largest in the United States, is ordering iPads for all its students, handing Apple a major success in its quest to make the tablet computer a replacement for textbooks.