Robots learn to handle objects, understand places

September 2, 2011 By Bill Steele

After scanning a room, a robot points to the keyboard it was asked to locate. It uses context to identify objects, such as the fact that a keyboard is usually in front of a monitor.

(PhysOrg.com) -- Infants spend their first few months learning to find their way around and manipulate objects, and they are very flexible about it: Cups can come in different shapes and sizes, but they all have handles. So do pitchers, so we pick them up the same way.

Similarly, your personal robot in the future will need the ability to generalize -- for example, to handle your particular set of dishes and put them in your particular dishwasher.

In Cornell's Personal Robotics Laboratory, a team led by Ashutosh Saxena, assistant professor of computer science, is teaching robots to manipulate objects and find their way around in new environments. They reported two examples of their work at the 2011 Robotics: Science and Systems Conference June 27 at the University of Southern California.

A common thread running through the research is "machine learning" -- programming a computer to observe events and find commonalities. With the right programming, for example, a computer can look at a wide array of cups, find their common characteristics and then be able to identify cups in the future. A similar process can teach a robot to find a cup's handle and grasp it correctly.

Other researchers have gone this far, but Saxena's team has found that placing objects is harder than picking them up, because there are many options. A cup is placed upright on a table, but upside down in a dishwasher, so the robot must be trained to make those decisions.

"We just show the robot some examples and it learns to generalize the placing strategies and applies them to objects that were not seen before," Saxena explained. "It learns about stability and other criteria for good placing for plates and cups, and when it sees a new object -- a bowl -- it applies them."

In early tests they placed a plate, mug, martini glass, bowl, candy cane, disc, spoon and on a flat surface, on a hook, in a stemware holder, in a pen holder and on several different dish racks.

Placing dishes in a rack is a challenging task for a robot. It must identify empty spaces and place the plate in the correct upright position.

Surveying its environment with a 3-D camera, the robot randomly tests small volumes of space as suitable locations for placement. For some objects it will test for "caging" -- the presence of vertical supports that would hold an object upright. It also gives priority to "preferred" locations: A plate goes flat on a table, but upright in a dishwasher.
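
The sample-and-test idea can be illustrated with a minimal sketch. It is not the researchers' code: the one-dimensional rack, the function names, the gap size and the hand-set scores are all assumptions made for illustration, and the fixed scores stand in for criteria the robot would learn from examples.

```python
import random

def is_caged(x, supports, max_gap=0.05):
    """True if a vertical support lies within max_gap on each side of x."""
    left = any(0 < x - s <= max_gap for s in supports)
    right = any(0 < s - x <= max_gap for s in supports)
    return left and right

def placement_score(x, supports, preferred_range):
    """Hand-set score (an assumption): caging counts 1.0, a preferred spot 0.5."""
    low, high = preferred_range
    score = 1.0 if is_caged(x, supports) else 0.0
    if low <= x <= high:
        score += 0.5
    return score

def best_placement(supports, preferred_range, width=1.0, n_samples=500, seed=0):
    """Randomly sample candidate positions along the rack and keep the best one."""
    rng = random.Random(seed)
    candidates = [rng.uniform(0.0, width) for _ in range(n_samples)]
    return max(candidates, key=lambda x: placement_score(x, supports, preferred_range))
```

Called with supports at 0.40, 0.46, 0.80 and 0.86 and a preferred range of 0.3 to 0.6, the sketch settles on a slot between the first pair of supports, the only place that is both caged and preferred.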

After training, their robot placed most objects correctly 98 percent of the time when it had seen the objects and environments previously, and 95 percent of the time when working with new objects in a new environment. Performance could be improved, the researchers suggested, by longer training.

But first, the robot has to find the dish rack.

Just as we unconsciously catalog the objects in a room when we walk in, Saxena and colleague Thorsten Joachims, associate professor of computer science, have developed a system that enables a robot to scan a room and identify its objects. Pictures from the robot's 3-D camera are stitched together to form a 3-D image of an entire room that is then divided into segments, based on discontinuities and distances between objects. The goal is to label each segment.
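
One simple way to picture the segmentation step is to group 3-D points that lie close together, so that a gap between objects starts a new segment. The sketch below assumes that approach; the article does not describe the team's actual method, and `segment_points` and the `max_gap` threshold are hypothetical names chosen for illustration.

```python
import math
from collections import deque

def segment_points(points, max_gap=0.1):
    """Label each 3-D point with a segment index.

    Two points share a segment if a chain of neighbors, each no more than
    max_gap apart, connects them.
    """
    labels = [-1] * len(points)
    current = 0
    for start in range(len(points)):
        if labels[start] != -1:
            continue  # already assigned to an earlier segment
        labels[start] = current
        queue = deque([start])
        while queue:
            i = queue.popleft()
            for j in range(len(points)):
                if labels[j] == -1 and math.dist(points[i], points[j]) <= max_gap:
                    labels[j] = current
                    queue.append(j)
        current += 1
    return labels
```

This brute-force version compares every pair of points, which is fine for a toy example but would be far too slow for a full room scan.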

The researchers trained a robot by giving it 24 office scenes and 28 home scenes in which they had labeled most objects. The computer examines such features as color, texture and what is nearby and decides what characteristics all objects with the same label have in common. In a new environment, it compares each segment of its scan with the objects in its memory and chooses the ones with the best fit.
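
The "best fit" comparison can be sketched as a nearest-centroid classifier: average the features of every training example that shares a label, then give a new segment the label whose average is closest. This is a stand-in technique, not necessarily the one the researchers used, and the two-number feature vectors (say, a color value and a texture value) are invented for illustration.

```python
import math
from collections import defaultdict

def train_centroids(examples):
    """Average the feature vectors of all training examples sharing a label."""
    grouped = defaultdict(list)
    for features, label in examples:
        grouped[label].append(features)
    return {
        label: tuple(sum(column) / len(rows) for column in zip(*rows))
        for label, rows in grouped.items()
    }

def classify(features, centroids):
    """Return the label whose average feature vector is nearest to `features`."""
    return min(centroids, key=lambda label: math.dist(features, centroids[label]))
```

A real system would use many more features, including what lies nearby, but the train-then-compare structure is the same.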

"The novelty of this work is to learn the contextual relations in 3-D," Saxena said. "For identifying a keyboard it may be easier to locate the monitors first, because the keyboards are found below the monitors."

In tests, the robot correctly identified objects about 83 percent of the time in home scenes and 88 percent in offices. In a final test, it successfully located a keyboard in an unfamiliar room. Again, Saxena said, context gives this robot an advantage. The keyboard only shows up as a few pixels in the image, but the monitor is easily found, and the robot uses that information to locate the keyboard.
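
How context can tip the balance is easy to show in a small sketch. Here each segment carries an appearance score, and a segment that sits below and near an already-located monitor earns a bonus. The scoring scheme, the weights and the field names are assumptions made for illustration, not the model the researchers learned.

```python
import math

def context_bonus(segment_pos, monitor_pos, max_horizontal=0.6):
    """1.0 if the segment is lower than the monitor and horizontally close to it."""
    sx, sy, sz = segment_pos
    mx, my, mz = monitor_pos
    horizontal = math.hypot(sx - mx, sy - my)
    return 1.0 if sz < mz and horizontal <= max_horizontal else 0.0

def locate_keyboard(segments, monitor_pos, context_weight=0.5):
    """Pick the segment with the best combined appearance and context score."""
    def total(segment):
        bonus = context_bonus(segment["pos"], monitor_pos)
        return segment["appearance"] + context_weight * bonus
    return max(segments, key=total)
```

With a weak appearance score for the true keyboard and a slightly stronger one for a book across the room, appearance alone would pick the book; the bonus for sitting near the monitor reverses that choice.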

Robots still have a long way to go to learn like humans, the researchers admit. "I would be really happy if we could build a robot that would even act like a six-month-old baby," Saxena said.

Provided by Cornell University


hush1
Sep 02, 2011

Rank: 1 / 5 (1)
"I would be really happy if we could build a robot that would even act like a six-month-old baby," - Saxena


Or software replicating the exact growth of the first few hundred cell divisions of the embryo.
nxtr
Sep 02, 2011

Rank: 5 / 5 (1)
well since robots have near limitless data storage, 24 scenes could be 24000000 scenes and web-linked learning between models passed around.
Deadbolt
Sep 02, 2011

Rank: not rated yet
Would have been nice to see a video of this robot in action.
kochevnik
Sep 02, 2011

Rank: not rated yet
"Similarly, your personal robot in the future will need the ability to generalize -- for example, to handle your particular set of dishes and put them in your particular dishwasher."
This is deduction, not induction. It's working from a general set of knowledge to learn specificity. Training mode would be inductive, and operating mode would be deductive.
Eikka
Sep 05, 2011

Rank: not rated yet
What they still have is just symbol recognition. They don't recognize the objects at all.

If you show the robot different kinds of chairs, it will assign the symbol "chair" to a few common geometric and visual properties of chairs. The context of the chair is just an extension of this process - chairs are found near tables.

But it doesn't get "chair". If you put a large chair next to a small table, it will get them wrong. The "chairness" or "tableness" of these objects doesn't depend on how they are, but what they're used for, and since the machine only understands symbols and their relations, it has no idea about why a chair is a chair when you sit on it, but a table when you put your coffee mug on it. You may teach it this relationship as well, but take away the coffee mug and it's clueless again.

This is why I think the AI researchers are going at the problem the wrong way. They're not making intelligence, they're just making something that acts like it through clever programming.
Newbeak
Sep 05, 2011

Rank: 5 / 5 (1)
"Would have been nice to see a video of this robot in action."

Have you seen the video on this page? I find it very impressive.
http://www.physor...ity.html