Can cartoons be used to teach machines to understand the visual world?

An enormous gap exists between human abilities and machine performance when it comes to understanding the visual world from images and videos. Humans are still way out in front.

"People are the best vision systems we have," said Devi Parikh assistant professor in the Bradley Department of Electrical and Computer Engineering at Virginia Tech. "If we can figure out a way for people to effectively teach machines, machines will be much more intelligent than they are today."

In her research, Parikh is proposing to use visual abstractions or cartoons to teach machines. She works from the idea that concepts that are difficult to describe textually may be easier to illustrate. By having thousands of online crowd workers manipulate clipart images to mimic photographs, she seeks to teach a to understand the visual world like humans do.

Parikh has expertise in computing areas such as computer vision and pattern recognition. Based on her earlier successful creative work on how to learn from visual abstractions, Google has selected Parikh to receive one of its Faculty Research Awards.

Google's innovative award provides Parikh with $92,000 of unrestricted funds and allows her to work directly with Google researchers and engineers as they explore how to best learn from visual information.

Parikh, formerly a research assistant professor at the Toyota Technological Institute in Chicago, received her doctorate in electrical and computer engineering from Carnegie Mellon in 2009, is already a U.S. Army Research Office Young Investigator, working with the government on ways to reduce failures in computerized vision recognition programs.

"We need to build intelligent machines that can understand our visual world from images just as humans do. These must be capable of answering high-level semantic questions about an image such as what objects are present, where they are, and how they are interacting," Parikh said.

The Google award is for one year and only full-time, tenure-track university faculty members are eligible.

Provided by Virginia Tech

Citation: Can cartoons be used to teach machines to understand the visual world? (2014, September 26) retrieved 5 June 2023 from
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Researcher seeks to lessen failures in computerized visual recognition programs


Feedback to editors