Image processing: The human (still) beats the machine

October 31, 2011 By Emmanuel Barraud

Image processing: the human (still) beats the machine

Enlarge

(PhysOrg.com) -- A novel experiment conducted by researchers at Idiap Research Institute and Johns Hopkins University highlights some of the limitations of automatic image analysis systems. Their results were recently published in the early online edition of the Proceedings of the National Academy of Sciences.

Anyone with a relatively new digital camera has experienced it: the system that is supposed to automatically identify and smiles sometimes doesn’t work quite right. Patterns in a photo of a bookshelf or of leaves on a tree are often mistaken for faces.

Behind this nearly universal gadget are the results of years of “computer vision” research. When you frame a scene, the camera divides it into many small zones and tries to identify subtle differences in hue. A dark, vaguely horizontal band can indicate eyes and eyebrows – or the empty space above a series of books.

How can the camera make such glaring errors, mistakes that no human would ever commit? To try and grasp the mechanisms at work in the image analysis process, François Fleuret, Senior Scientist at EPFL and researcher at the Idiap research institute in Martigny, has developed, along with colleagues from Johns Hopkins University, a “simple” contest in which humans and machines compete. The experiment and its results have just been published in the advance online edition of PNAS ().

The candidates were presented with a series of small, square black and white images of random shapes, and asked to classify them into two “families,” discovering for themselves the classification criteria. For example, if one shape is inside another or if the two are side by side.

While the solution was often obvious for humans, who would understand the trick after just a few images, the computers frequently had to be shown several thousands of examples before reaching a satisfactory result. And even worse, one of the 24 puzzles couldn’t be figured out using machine analysis at all.

Image processing: the human (still) beats the machine
Enlarge

The two images on top belong to the 1st family; those below to the 2nd family. Humans quickly understand that the criterion is the position of the smaller shape; either it’s in the center of the other shape or it’s not.

“We should remember that humans have had decades of experiential learning, in which they’re perceiving dozens of images per second, not to mention their genetic background. The computers are basically “blank slates” in comparison,” Fleuret remarks. By simplifying the images as much as possible, the scientists wanted to identify the main weaknesses of machine learning. “What we found, in a general sense, was that humans jump immediately to a semantic level of image analysis,” he continues. “He or she will say which pair of images is more crowded than another pair, where the computer will compare, for example, numerical values associated with the pixel density in a given perimeter.”

The experiment gave the researchers a glimpse into the “black box” of how the intelligence of a supposedly self-taught machine develops. “It’s the first time that we have been able to precisely, and on an identical task, quantify and compare the performance of classical learning algorithms and humans,” adds Fleuret. The scientists were also able to confirm that the number and variety of measures made in the image, upon which learning depends, increased their success rate. “When classifying the image depends on the relative placement of shapes in the image, machine learning has a really hard time,” Fleuret comments. “This justifies the current trend in the field to invent algorithms that are designed to identify individual parts of the image and their relative position.”

The rapidity of the human brain, the fact that it can instantly “reconstruct” an entire object even when part of it is hidden, its ability to find connections between parameters that are extremely variable while taking into account the temporal dimension (clothing and gait, for example, instead of a face for recognizing a person) all give it a huge advantage over machines in the area of . At their own pace, however, electronic devices will continue to benefit from improving techniques and processor speeds to get even better at decoding the world.

More information: Comparing machines and humans on a visual categorization test, Published online before print October 17, 2011, doi:10.1073/pnas.1109168108

Provided by Ecole Polytechnique Federale de Lausanne

Filter


Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

Noumenon
Oct 31, 2011

Rank: 4.6 / 5 (30)
We should remember that humans have had decades of experiential learning, in which theyre perceiving dozens of images per second, not to mention their genetic background.


Algorithmic "learning" is the wrong approach if comparing to humans, and it has nothing to do with genes. The human mind has had MILLIONS of years, not just "decades", to evolve mechanisms to process images. It's not all about "learning",.. the mind has built-in mechanisms that process images a-priori to consciousness.

Computers are no where near matching humans in image processing because the require for that is that humans have an inkling of how the mind functions,... which is WAY different from "algorithms".
krundoloss
Oct 31, 2011

Rank: 5 / 5 (1)
In order to function competitively in our world, humans have incredibly fast image processing. Imagine what is going on when you track a fly, or play air hockey. This has allowed us to see animals that are camouflaged and hunt them for food. We also have "smooth eye tracking" which allows our eyes to move instead of our whole head (like a cat). And we play video games and watch TV, drive cars, all strengthen our image processing abilities to a level greater than any other animal. A computer cannot compete with that, yet. Not to mention we are able to "understand" what we see, which is even further off for computers.
Isaacsname
Oct 31, 2011

Rank: not rated yet
Woooo. I don't know about you guys/gals, but I have extreme visual snow/static, which when I read this:

" The rapidity of the human brain, the fact that it can instantly reconstruct an entire object even when part of it is hidden "

Made me smile somewhat, because I see heavy static on everything, I started looking it up a while back, found what's called " stochastic resonance in visual perception "

http://en.wikiped...biology)

http://www.youtub...duEEoCaA

Basically, it allows me to be able to see small details most people can't, so I wonder if somehow this could be used in any way for the problems described by the article in terms of object recognitions....
Deesky
Oct 31, 2011

Rank: 5 / 5 (1)
They speak of 'classical' learning algorithms, but I wonder how they would have gone had they used a vision system based on Jeff Hawkin's research (Numenta) implementing Hierarchical Temporal Memory (HTM) to simulate the neocortex.

Here's an interesting lecture: Advances in Modeling Neocortex and its Impact on Machine Intelligence:
http://www.youtub...iFOIbTkE
Rank 5 /5 (2 votes)
Relevant PhysicsForums posts
  • Ideas to mitigate risk of 911 calls being misdirected
    createdMay 24, 2012
  • Live scribe pen?
    createdMay 10, 2012
  • Shallow water flow simulation
    createdMay 07, 2012
  • Tablet for taking notes?
    createdMay 05, 2012
  • Best fit tablet for me?
    createdMay 05, 2012
  • Measure of Informaton
    createdMay 04, 2012
  • More from Physics Forums - Computing & Technology

More news stories

Browser wars flare in mobile space

The browser wars are heating up again, but this time the fight is for dominance of the mobile Internet.

Technology / Software

created 11 hours ago | popularity 5 / 5 (1) | comments 3

Probability of contamination from severe nuclear reactor accidents is higher than expected: study

Catastrophic nuclear accidents such as the core meltdowns in Chernobyl and Fukushima are more likely to happen than previously assumed. Based on the operating hours of all civil nuclear reactors and the number ...

Technology / Energy & Green Tech

created May 22, 2012 | popularity 3.6 / 5 (22) | comments 56 | with audio podcast

HyperSolar shows dirty water no barrier to power world

(Phys.org) -- The Santa Barbara, California, company, HyperSolar, is set to transparently share the ups and downs of its research experiences toward the company’s ultimate vision, successfully producing ...

Technology / Energy & Green Tech

created May 24, 2012 | popularity 4.8 / 5 (16) | comments 17 | with audio podcast report

SpotterRF debuts Radar Backpack Kit (w/ Video)

(Phys.org) -- SpotterRF has announced a special radar backpack kit designed to enhance situational awareness for soldiers on the ground. The company says its special radar is designed for warfighters as part ...

Technology / Hi Tech & Innovation

created May 26, 2012 | popularity 5 / 5 (5) | comments 13 | with audio podcast report

Tesla to launch electric sedan in US on June 22

Tesla Motors said Tuesday it would begin deliveries of "the world's first premium electric sedan" on June 22, slightly ahead of schedule.

Technology / Energy & Green Tech

created May 22, 2012 | popularity 4.5 / 5 (12) | comments 18


Land and sea species differ in climate change response: study

(Phys.org) -- Marine and terrestrial species will likely differ in their responses to climate warming, new research by Simon Fraser University and Australia’s University of Tasmania has found.

'Unzipped' carbon nanotubes could help energize fuel cells, batteries

Multi-walled carbon nanotubes riddled with defects and impurities on the outside could replace some of the expensive platinum catalysts used in fuel cells and metal-air batteries, according to scientists at ...

T cells 'hunt' parasites like animal predators seek prey, study shows

By pairing an intimate knowledge of immune-system function with a deep understanding of statistical physics, a cross-disciplinary team at the University of Pennsylvania has arrived at a surprising finding: T cells use a movement ...

Computer model used to pinpoint prime materials for efficient carbon capture

When power plants begin capturing their carbon emissions to reduce greenhouse gases – and to most in the electric power industry, it's a question of when, not if – it will be an expensive undertaking.

Change in developmental timing was crucial in the evolutionary shift from dinosaurs to birds: study

At first glance, it's hard to see how a common house sparrow and a Tyrannosaurus Rex might have anything in common. After all, one is a bird that weighs less than an ounce, and the other is a dinosaur that ...

Nvidia trumpets Tegra 3 phone design wins for 2012

(Phys.org) -- Nvidia’s competitive war paint has a name, Tegra 3. On the heels of Nvidia announcements about lowering costs of its Tegra 3 processors and Nvidia-enabled tablets running Android Ice Cream ...