September 26, 2014

Can cartoons be used to teach machines to understand the visual world?

An enormous gap exists between human abilities and machine performance when it comes to understanding the visual world from images and videos. Humans are still way out in front.

"People are the best vision systems we have," said Devi Parikh, assistant professor in the Bradley Department of Electrical and Computer Engineering at Virginia Tech. "If we can figure out a way for people to effectively teach machines, machines will be much more intelligent than they are today."

In her research, Parikh proposes using visual abstractions, or cartoons, to teach machines. She works from the idea that concepts that are difficult to describe in text may be easier to illustrate. By having thousands of online crowd workers manipulate clipart images to mimic photographs, she seeks to teach a computer to understand the visual world as humans do.

Parikh has expertise in computing areas such as computer vision and pattern recognition. Google has selected her to receive one of its Faculty Research Awards on the basis of her earlier, successful creative work on learning from visual abstractions.

Google's innovative award provides Parikh with $92,000 of unrestricted funds and allows her to work directly with Google researchers and engineers as they explore how to best learn from visual information.

Parikh, formerly a research assistant professor at the Toyota Technological Institute in Chicago, received her doctorate in electrical and computer engineering from Carnegie Mellon in 2009. She is already a U.S. Army Research Office Young Investigator, working with the government on ways to reduce failures in computerized vision recognition programs.

"We need to build intelligent machines that can understand our visual world from images just as humans do. These machines must be capable of answering high-level semantic questions about an image such as what objects are present, where they are, and how they are interacting," Parikh said.

The Google award runs for one year, and only full-time, tenure-track university faculty members are eligible.

Provided by Virginia Tech

Citation: Can cartoons be used to teach machines to understand the visual world? (2014, September 26) retrieved 17 July 2024 from https://phys.org/news/2014-09-cartoons-machines-visual-world.html

