November 10, 2009

New search technique for images and videos has broad applications

By Daniel Strain

(PhysOrg.com) -- Engineers at the University of California, Santa Cruz, have developed a powerful new approach to a fundamental problem in computer vision: how to program a computer to recognize or categorize what it "sees" in an image or video. Their software could change the way people search the Web for photos and videos, and it may have applications in many other areas as well, such as video surveillance and security systems.

Peyman Milanfar, a professor of electrical engineering in the Baskin School of Engineering at UCSC, and graduate student Hae Jong Seo were able to overcome a major drawback of existing methods for computer recognition of objects in images--the need for an extensive "training" phase using a large number of examples. With a single photograph or video clip as a template, their software can sift through thousands of images or videos to pull out the ones that look like the template.

"When you search Google Images, you type in a term and it gives you returns from pages that have that text in them. We want to be able to upload an image and use it as a model for finding similar images," Milanfar said.

Milanfar and Seo developed an algorithm that enables automated recognition of both objects in images and actions in videos. The software analyzes an image or short movie and characterizes the most important constituents of the object or action represented. It can then search for those constituents in image and video databases. The researchers presented their new methods at the IEEE International Conference on Computer Vision in September and in a recent paper published by the IEEE Transactions on Pattern Analysis and Machine Intelligence.

"When it comes to recognizing things in the visual world, humans have some uncanny abilities which, at least until now, well exceed the limits of what could be done by computer," Milanfar said. "In particular, we have the capacity to recognize an object after having seen it only once."

Existing technology can search for and distinguish individual objects in a database of images only after running through a time-consuming training phase. "If you're looking for images of bicycles, for instance, current algorithms have to be shown pictures of hundreds, if not thousands, of bicycles in order to be able to recognize a bicycle," Milanfar said.

With the new software, a single photo of a bicycle at night can be used as a template to locate pictures of bicycles in full sunlight, in the foreground or the background. It works across a wide range of image qualities and lighting conditions. The template image or the target image can be sharp or out-of-focus, clean or noisy. To Milanfar's software, a bicycle is a bicycle.

Similarly, a person riding a bicycle is a person riding a bicycle. Video of Lance Armstrong in the Tour de France can be used to find clips of men and women riding along an ordinary street.

But the potential applications for Milanfar's work go well beyond browsing for cyclists on YouTube. By using videos of aggressive behavior as templates, the technology could help surveillance systems learn to recognize potentially dangerous situations. If a man reached for a weapon on camera and that action matched a template of such behavior, surveillance software could alert a busy security guard.

A picture is a composite of thousands of pixels. Milanfar's software examines these pixels and their relation to one another. Specifically, it measures how similar each pixel is to its adjacent pixels in orientation, coloring, and shading. To find actions within videos, like a man riding a bicycle, the software applies the same procedure but also incorporates the way those pixel relationships change over time.
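To make the idea of pixel relationships concrete, here is a minimal sketch in Python. It is not the researchers' code. It assumes a grayscale image stored as a list of rows and compares only brightness, not orientation or color. The function name `local_similarity` and the `smoothing` parameter are hypothetical, chosen for illustration.

```python
import math


def local_similarity(image, row, col, radius=1, smoothing=0.2):
    """Describe how similar the pixel at (row, col) is to its neighbors.

    Returns one weight per pixel in the (2*radius+1) x (2*radius+1)
    neighborhood, in row-major order. The weights sum to 1, and a larger
    weight means that neighbor's brightness is closer to the center's.
    `smoothing` is an illustrative assumption, not a published value.
    """
    center = image[row][col]
    last_row = len(image) - 1
    last_col = len(image[0]) - 1
    weights = []
    for dr in range(-radius, radius + 1):
        for dc in range(-radius, radius + 1):
            # Clamp coordinates so border pixels reuse the nearest valid pixel.
            r = min(max(row + dr, 0), last_row)
            c = min(max(col + dc, 0), last_col)
            diff = image[r][c] - center
            # Gaussian falloff: identical brightness gives weight 1 before
            # normalization, and large differences give weights near 0.
            weights.append(math.exp(-(diff * diff) / (2 * smoothing ** 2)))
    total = sum(weights)
    return [w / total for w in weights]


# A tiny image with a vertical edge: dark on the left, bright on the right.
edge = [
    [0.0, 0.0, 1.0],
    [0.0, 0.0, 1.0],
    [0.0, 0.0, 1.0],
]
# The same scene, uniformly brighter.
brighter = [[value + 0.5 for value in line] for line in edge]

descriptor = local_similarity(edge, 1, 1)
descriptor_brighter = local_similarity(brighter, 1, 1)
```

Because the descriptor depends only on differences between neighboring pixels, the uniformly brighter copy of the scene yields the same weights. That gives a rough sense of why features built from pixel relationships can stay stable when lighting changes.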

The software analyzes the map of pixel relationships and determines the salient geometric features of the object or action. These components remain perceptually constant within an object regardless of image quality.

"The geometry of the bicycle is recognizable by the shape of the wheels and the way they are connected to the body, for example," Milanfar said. "We compute features from an image that are very stable. They are there even if we make the object bigger or smaller, change the background, or add noise."

Search engines can use this algorithm to detect similar patterns of pixel relationships in a whole database of photos. The software calculates the statistical likelihood that a candidate image contains the queried object. If the template is a bicycle, the result is a list of photographs containing bicycles of all shapes and sizes, ranked in order of similarity.
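The ranking step can be illustrated with a short Python sketch. It is a simplification, not the researchers' implementation: it scores each candidate with plain cosine similarity between feature vectors rather than the statistical likelihood the article describes. The feature values, file names, and function names are made up for the example.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        # An all-zero vector carries no information, so treat it as no match.
        return 0.0
    return dot / (norm_a * norm_b)


def rank_candidates(template, candidates):
    """Return (name, score) pairs sorted from most to least similar."""
    scored = [
        (name, cosine_similarity(template, features))
        for name, features in candidates.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical feature vectors; a real system would compute these from pixels.
template_features = [1.0, 0.0, 1.0, 0.0]
database = {
    "tree.jpg": [0.0, 1.0, 0.0, 1.0],
    "bicycle_blurry.jpg": [1.0, 0.2, 0.9, 0.1],
    "bicycle_daylight.jpg": [2.0, 0.0, 2.0, 0.0],
}

ranking = rank_candidates(template_features, database)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Cosine similarity ignores the overall magnitude of a feature vector, so in this toy data the daylight bicycle, whose features are simply a scaled copy of the template's, ranks first, followed by the blurry bicycle and then the tree.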

"This has been an area of research that has entertained people for many years, but the big successes have been few and far between," Milanfar said. "Our work is showing state-of-the-art performance with an accuracy as good as or better than any algorithm out there."

Provided by University of California, Santa Cruz

Citation: New search technique for images and videos has broad applications (2009, November 10) retrieved 16 April 2024 from https://phys.org/news/2009-11-technique-images-videos-broad-applications.html

