New algorithm improves robot vision

Dec 07, 2005
robot

Except in fanciful movies like 2003's The Matrix Revolutions, where fearsome squid-like robots maneuvered with incredible ease, most robots are too clumsy to move around obstacles at high speeds. This is true in large part because they have trouble judging in the images they "see" just how far ahead obstacles are. This week, however, Stanford computer scientists will unveil a machine vision algorithm that gives robots the ability to approximate distances from single still images.

"Many people have said that depth estimation from a single monocular image is impossible," says computer science Assistant Professor Andrew Ng, who will present a paper on his research at the Neural Information Processing Systems Conference in Vancouver Dec. 5-8. "I think this work shows that in practical problems, monocular depth estimation not only works well, but can also be very useful."

With substantial sensor arrays and considerable investment, robots are gaining the ability to navigate adequately. Stanley, the Stanford robot car that drove a desert course in the DARPA Grand Challenge this past October, used lasers and radar as well as a video camera to scan the road ahead. Using the work of Ng and his students, robots that are too small to carry many sensors or that must be built cheaply could navigate with just one video camera. In fact, using a simplified version of the algorithm, Ng has enabled a radio-controlled car to drive autonomously for several minutes through a cluttered, wooded area before crashing.

Inferring depth

To give robots depth perception, Ng and graduate students Ashutosh Saxena and Sung H. Chung designed software capable of learning to spot certain depth cues in still images. The cues include variations in texture (surfaces that appear detailed are more likely to be close), edges (lines that appear to be converging, such as the sides of a path, indicate increasing distance) and haze (objects that appear hazy are likely farther).

To analyze such cues as thoroughly as possible, the software breaks images into sections and analyzes them both individually and in relationship to neighboring sections. This allows the software to infer how objects in the image appear relative to each other. The software also looks for cues in the image at varying levels of magnification to ensure that it doesn't miss details or prevailing trends—literally missing the forest for the trees.

Using the Stanford algorithm, robots were able to judge distances in indoor and outdoor locations with an average error of about 35 percent—in other words, a tree that is actually 30 feet away would be perceived as being between 20 and 40 feet away. A robot moving at 20 miles per hour and judging distances from video frames 10 times a second has ample time to adjust its path even with this uncertainty. Ng points out that compared to traditional stereo vision algorithms—ones that use two cameras and triangulation to infer depth—the new software was able to reliably detect obstacles five to 10 times farther away.

"The difficulty of getting visual depth perception to work at large distances has been a major barrier to getting robots to move and to navigate at high speeds," Ng says. "I'd like to build an aircraft that can fly through a forest, flying under the tree canopy and dodging around trees." Of course, that brings to mind another movie image: that of the airborne chase scene through the forest on the Ewok planet in Return of the Jedi. Ng wants to take that idea out of the realm of fiction and make it a reality.

Source: Stanford University

Explore further: Pandora posts in-line 1Q loss, upbeat sales

add to favorites email to friend print save as pdf

Related Stories

Teaming up with robots

Mar 27, 2013

Critical situations are occurring with greater frequency at industrial workplaces – situations that could lead to serious job-related accidents. With the "4Save" toolbox from Fraunhofer, these dangers do ...

Robotics 101 with NASA's Chris McQuin + Jaret Matthews

Jun 18, 2012

(Phys.org) -- When you hear the word "robot," you might think of Hollywood creations such as the Terminator, C-3PO or Megatron. Thankfully, the reality of current robotics isn't quite that sinister, emotional ...

Recommended for you

Pandora posts in-line 1Q loss, upbeat sales

1 hour ago

(AP)—Internet radio company Pandora reported higher-than-expected revenue in the latest quarter, with losses in line with analysts' forecasts, as the number of subscribers who pay for ad-free listening rose above 2.5 million.

Google Drive sports new view and scan enhancements

1 hour ago

(Phys.org) —Google Drive has a new look and functions. The makeover in Google Drive features scanning and interface enhancements that put the user into "card" mode. The enhancements make it easy for the ...

Inventor creates Card Beams with 3D printer

1 hour ago

What are card beams, you may ask? They are the building toy that allows you to build gravity-defying houses of cards with the help of friction, gravity, and two types of beams - the cap and the connector.

Solar Kettle allows for boiling water off the grid

3 hours ago

(Phys.org) —A company called Contemporary Energy has unveiled a new device it calls the Solar Kettle. It looks very much like a normal coffee thermos, but has flaps on one side that open to allow for collecting ...

Review: Google music plan solid, serendipitous

5 hours ago

Google's new music service offers a lot of eye candy to go with the tunes. The song selection of around 18 million tracks is comparable to popular services such as Spotify and Rhapsody, and a myriad of playlists ...

User comments : 0

More news stories

Google Drive sports new view and scan enhancements

(Phys.org) —Google Drive has a new look and functions. The makeover in Google Drive features scanning and interface enhancements that put the user into "card" mode. The enhancements make it easy for the ...

Solar Kettle allows for boiling water off the grid

(Phys.org) —A company called Contemporary Energy has unveiled a new device it calls the Solar Kettle. It looks very much like a normal coffee thermos, but has flaps on one side that open to allow for collecting ...

Pandora posts in-line 1Q loss, upbeat sales

(AP)—Internet radio company Pandora reported higher-than-expected revenue in the latest quarter, with losses in line with analysts' forecasts, as the number of subscribers who pay for ad-free listening rose above 2.5 million.

Future doctors unaware of their obesity bias

Two out of five medical students have an unconscious bias against obese people, according to a new study by researchers at Wake Forest Baptist Medical Center. The study is published online ahead of print in the Journal of ...

WHO: Scientific red tape mars efforts vs. virus

International efforts to combat a new pneumonia-like virus that has now killed 22 people are being slowed by unclear rules and competition for the potentially profitable rights to disease samples, the head ...