October 13, 2009

Seeing things: Researchers teach computers to recognize objects

by Larry Hardesty

(PhysOrg.com) -- If computers could recognize objects, they could automatically search through hours of video footage for a particular two-minute scene. A tourist strolling down a street in a strange city could take a cell-phone photo of an unmarked monument and immediately find out what it was. And an Internet image search on, say, "Shakespeare" would pull up pictures of Shakespeare, not pictures of Gwyneth Paltrow in the movie Shakespeare in Love. Though object recognition is one of the major research topics in computer vision, MIT researchers may have found a way to make it much more practical.

Typically, object recognition algorithms need to be "trained" using digital images in which objects have been outlined and labeled by hand. By looking at a million pictures of cars labeled "car," an algorithm can learn to recognize features shared by images of cars. The problem is that for every new class of objects — trees, buildings, telephone poles — the algorithm has to be trained all over again.

But Esther and Harold E. Edgerton Associate Professor of Electrical Engineering and Computer Science Antonio Torralba and Computer Science and Artificial Intelligence Lab graduate students Ce Liu, PhD '09, and Jenny Yuen have developed an object recognition system that doesn't require any training. Nonetheless, it still identifies objects with 50 percent greater accuracy than the best prior algorithm.

The system uses a modified version of a so-called motion estimation algorithm, a type of algorithm common in video processing. Since consecutive frames of video usually change very little, data compression schemes often store the unchanging aspects of a scene once, updating only the positions of moving objects. The motion estimation algorithm determines which objects have moved from one frame to the next. In a video, that's usually fairly easy to do: most objects don't move very far in one-30th of a second. Nor does the algorithm need to know what the object is; it just has to recognize, say, corners and edges, and how their appearance typically changes under different perspectives.

The MIT researchers' new system essentially treats unrelated images as if they were consecutive frames in a video sequence. When the modified motion estimation algorithm tries to determine which objects have "moved" between one image and the next, it usually picks out objects of the same type: it will guess, for instance, that the 2006 Infiniti in image two is the same object as the 1965 Chevy in image one.

If the first image comes from the type of database used to train computer vision systems, the Infiniti will already be labeled "car." The new system simply transfers the label to the Chevy.

Credits - Courtesy of Ce Liu
Credits - Courtesy of Ce Liu

The greater the resemblance of the labeled and unlabeled images, the better the algorithm works. Fortunately, Torralba's earlier work was largely directed toward amassing a huge database of labeled images. Torralba and his colleagues have developed a simple web-based system called LabelMe that lets online volunteers tag objects in digital images, and they also created a web site called 80 Million Tiny Images that sorts the images according to subject matter. When confronted with an unlabeled image, the new object recognition algorithm is likely to find something similar in Torralba's database. And as the database grows larger, that likelihood will only increase.

"It's a real commonsense solution to a fundamental problem in computer vision," says Marshall Tappen, a computer vision researcher at the University of Central Florida. "The results are great and better than you can get with much more complicated methods." Tappen adds that "a large database makes it possible to do lots of really interesting thing that no one's even envisioned. There are lots of interesting things it can do beyond just standard object recognition, so I think it's really going to enable a lot of innovation." Tappen points in particular to recent work on image editing and image completion done by Alyosha Efros at Carnegie Mellon University. "If you look at his last few Siggraph papers" — that is, papers presented at Siggraph, the major conference in the field of computer graphics — "they're all using LabelMe," Tappen says.

Provided by Massachusetts Institute of Technology (news : web)

Citation: Seeing things: Researchers teach computers to recognize objects (2009, October 13) retrieved 17 April 2024 from https://phys.org/news/2009-10-seeing-things-researchers-teach-computers.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Researchers develop new image-recognition software

0 shares

Feedback to editors

Study reveals how humanity could unite to address global challenges

4 hours ago

CO₂ worsens wildfires by helping plants grow, model experiments show

6 hours ago

Surf clams off the coast of Virginia reappear and rebound

7 hours ago

Yellowstone Lake ice cover unchanged despite warming climate

7 hours ago

The history of the young cold traps of the asteroid Ceres

8 hours ago

Researchers shine light on rapid changes in Arctic and boreal ecosystems

8 hours ago

New benzofuran synthesis method enables complex molecule creation

8 hours ago

Human odorant receptor for characteristic petrol note of Riesling wines identified

8 hours ago

Uranium-immobilizing bacteria in clay rock: Exploring how microorganisms can influence the behavior of radioactive waste

8 hours ago

Research team identifies culprit behind canned wine's rotten egg smell

8 hours ago

Load comments (1)

Seeing things: Researchers teach computers to recognize objects

Study reveals how humanity could unite to address global challenges

CO₂ worsens wildfires by helping plants grow, model experiments show

Surf clams off the coast of Virginia reappear and rebound

Yellowstone Lake ice cover unchanged despite warming climate

The history of the young cold traps of the asteroid Ceres

Researchers shine light on rapid changes in Arctic and boreal ecosystems

New benzofuran synthesis method enables complex molecule creation

Human odorant receptor for characteristic petrol note of Riesling wines identified

Uranium-immobilizing bacteria in clay rock: Exploring how microorganisms can influence the behavior of radioactive waste

Research team identifies culprit behind canned wine's rotten egg smell

Relevant PhysicsForums posts

Error logging in: onLoginSuccess is not a function

My Website For Creating Interactive Visuals Linked To Equations

Latest Notable AI accomplishments

Building a homemade Long Short Term Memory with FSMs

Most efficient way to randomly choose a word from a file with a list of words

Git, staging and committing files

Researchers develop new image-recognition software

Researchers use Web images to add realism to edited photos

New system estimates geographic location of photos

Stanford site advances science of turning 2-D images into 3-D models

Computer vision may not be as good as thought

New algorithm improves robot vision

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

Seeing things: Researchers teach computers to recognize objects

Study reveals how humanity could unite to address global challenges

CO₂ worsens wildfires by helping plants grow, model experiments show

Surf clams off the coast of Virginia reappear and rebound

Yellowstone Lake ice cover unchanged despite warming climate

The history of the young cold traps of the asteroid Ceres

Researchers shine light on rapid changes in Arctic and boreal ecosystems

New benzofuran synthesis method enables complex molecule creation

Human odorant receptor for characteristic petrol note of Riesling wines identified

Uranium-immobilizing bacteria in clay rock: Exploring how microorganisms can influence the behavior of radioactive waste

Research team identifies culprit behind canned wine's rotten egg smell

Relevant PhysicsForums posts

Related Stories

Researchers develop new image-recognition software

Researchers use Web images to add realism to edited photos

New system estimates geographic location of photos

Stanford site advances science of turning 2-D images into 3-D models

Computer vision may not be as good as thought

New algorithm improves robot vision

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience