October 1, 2015

Giving machine-learning systems "partial credit" during training improves image classification

by Larry Hardesty, Massachusetts Institute of Technology

Machine learning, which is the basis for most commercial artificial-intelligence systems, is intrinsically probabilistic. An object-recognition algorithm asked to classify a particular image, for instance, might conclude that it has a 60 percent chance of depicting a dog, but a 30 percent chance of depicting a cat.

At the Annual Conference on Neural Information Processing Systems in December, MIT researchers will present a new way of doing machine learning that enables semantically related concepts to reinforce each other. So, for instance, an object-recognition algorithm would learn to weigh the co-occurrence of the classifications "dog" and "Chihuahua" more heavily than it would the co-occurrence of "dog" and "cat."

In experiments, the researchers found that a machine-learning algorithm that used their training strategy did a better job of predicting the tags that human users applied to images on the Flickr website than it did when it used a conventional training strategy.

"When you have a lot of possible categories, the conventional way of dealing with it is that, when you want to learn a model for each one of those categories, you use only data associated with that category," says Chiyuan Zhang, an MIT graduate student in electrical engineering and computer science and one of the new paper's lead authors. "It's treating all other categories equally unfavorably. Because there are actually semantic similarities between those categories, we develop a way of making use of that semantic similarity to sort of borrow data from close categories to train the model."

Zhang is joined on the paper by his thesis advisor, Tomaso Poggio, the Eugene McDermott Professor in the Brain Sciences and Human Behavior, and by his fellow first author Charlie Frogner, also a graduate student in Poggio's group. Hossein Mobahi, a postdoc in the Computer Science and Artificial Intelligence Laboratory, and Mauricio Araya-Polo, a researcher with Shell Oil, round out the paper's co-authors.

Close counts

To quantify the notion of semantic similarity, the researchers wrote an algorithm that combed through Flickr images identifying tags that tended to co-occur—for instance, "sunshine," "water," and "reflection." The semantic similarity of two words was a function of how frequently they co-occurred.

Ordinarily, a machine-learning algorithm being trained to predict Flickr tags would try to identify visual features that consistently corresponded to particular tags. During training, it would be credited with every tag it got right but penalized for failed predictions.

The MIT researchers' system essentially gives the algorithm partial credit for incorrect tags that are semantically related to the correct tags. Say, for instance, that a waterscape was tagged, among other things, "water," "boat," and "sunshine." With conventional machine learning, a system that tagged that image "water," "boat," "summer" would get no more credit than one that tagged it "water," "boat," "rhinoceros." With the researchers' system, it would, and the credit would be a function of the likelihood that the tags "summer" and "sunshine" co-occur in the Flickr database.

The problem is that assigning partial credit involves much more complicated calculations than simply scoring predictions as true or false. How, for instance, does a system that gets none of the tags completely right—say, "lake," "sail," and "summer"—compare to one that makes only one enormous error—say, "water," "boat," and "rhinoceros"?

To perform this type of complicated evaluation, the researchers use a metric called the Wasserstein distance, which is a way of comparing probability distributions. That would have been prohibitively time-consuming even two years ago, but in 2014, Marco Cuturi of the University of Kyoto and Arnaud Doucet of Oxford University proposed a new algorithm for calculating the Wasserstein distance more efficiently. The MIT researchers believe that their paper is the first to use the Wasserstein distance as an error metric in supervised machine learning, where the system's performance is gauged against human annotations.

Human error

In experiments, the researchers' system outperformed a conventional machine-learning system even when the criterion of success was simply predicting the tags that Flickr users had applied to a given image. But the difference was even more acute when the criterion of success was the prediction of tags that were semantically similar to those applied by Flickr users.

That may sound circular: A system that factors in semantic similarity is better at predicting semantic similarity. But when a Web user is trying to find images online, a general thematic correspondence may well be more important than a precise intersection of keywords.

Moreover, the tags that users assign to any given Flickr image can be a motley assortment. Automatically generated tags clustered according to semantic similarity could be more useful than those applied by humans. One image in the researchers' test set, for instance, depicted a uniformed mountain biker wearing a crash helmet biking down a hilly trail. The actual tags were "spring," "race," and "training." But the trees in the image are bare, the grass is brown, and the tags "race" and "training" can't both be right. The researchers' system came up with "road," "bike," and "trail"; the conventional machine-learning algorithm produced "dog," "surf," and "bike."

Finally, if some other measure of the notion of semantic similarity proved better able to capture human intuition than co-occurrence of Flickr tags, then the MIT researchers' system could simply adopt it instead. Indeed, a longstanding and ongoing project in artificial-intelligence research is the assembly of "ontologies" that relate classification terms hierarchically—dogs are animals, collies are dogs, Lassie was a collie. In future work, the researchers hope to test their system using ontologies standard in machine-vision research.

"I think this work is very innovative because it uses the Wasserstein distance directly as a way to design learning machines," says Cuturi, who was not involved in the current work. "From a technical point of view, the authors had to deal with the problem of comparing unnormalized histograms"—rather than probability distributions, which is what the Wasserstein distance was designed for. "They proposed a very elegant solution that is well motivated and computationally very efficient."

More information: "Learning with a Wasserstein Loss." arxiv.org/pdf/1506.05439v1.pdf

Provided by Massachusetts Institute of Technology

This story is republished courtesy of MIT News (web.mit.edu/newsoffice/), a popular site that covers news about MIT research, innovation and teaching.

Citation: Giving machine-learning systems "partial credit" during training improves image classification (2015, October 1) retrieved 30 April 2024 from https://phys.org/news/2015-10-machine-learning-partial-credit-image-classification.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Learning language by playing games

55 shares

Feedback to editors

Study uncovers the secret of long-lived stem cells

2 hours ago

Scientists show that ancient village adapted to drought, rising seas

3 hours ago

How polyps of the moon jellyfish repel viral attacks on their microbiome

3 hours ago

Organic electrochemical transistors: Scientists solve chemical mystery at the interface of biology and technology

4 hours ago

Activity in a room stirs up nanoparticles left over from consumer sprays, study shows

4 hours ago

Study dispels myth that purebred dogs are more prone to health problems

5 hours ago

Study shows climate change and mercury pollution stressed plants for millions of years

6 hours ago

Exploiting disorder to harvest heat energy: The potentialities of 2D magnets for thermoelectric applications

6 hours ago

Citizen scientists help discover record-breaking exoplanet in binary star system

6 hours ago

Novel calculations peg age of 'baby' asteroid

6 hours ago

Load comments (0)

Giving machine-learning systems "partial credit" during training improves image classification

Close counts

Human error

Study uncovers the secret of long-lived stem cells

Scientists show that ancient village adapted to drought, rising seas

How polyps of the moon jellyfish repel viral attacks on their microbiome

Organic electrochemical transistors: Scientists solve chemical mystery at the interface of biology and technology

Activity in a room stirs up nanoparticles left over from consumer sprays, study shows

Study dispels myth that purebred dogs are more prone to health problems

Study shows climate change and mercury pollution stressed plants for millions of years

Exploiting disorder to harvest heat energy: The potentialities of 2D magnets for thermoelectric applications

Citizen scientists help discover record-breaking exoplanet in binary star system

Novel calculations peg age of 'baby' asteroid

Relevant PhysicsForums posts

User-Defined Functions in Sql Server SSMS

Parallel processing for loops and pointer defined outside the loop

Passing variables in FORTRAN

My Website For Creating Interactive Visuals Linked To Equations

Number of Multiplications in the FFT Algorithm

Error logging in: onLoginSuccess is not a function

Learning language by playing games

Technique enables pattern-recognition systems to convey what they learn to humans

Probabilistic programming does in 50 lines of code what used to take thousands

System designed to label visual scenes according to type learns to detect specific objects

Researchers create first image-recognition software that greatly improves web searches

Microsoft Research project can interpret, caption photos

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

Giving machine-learning systems "partial credit" during training improves image classification

Close counts

Human error

Study uncovers the secret of long-lived stem cells

Scientists show that ancient village adapted to drought, rising seas

How polyps of the moon jellyfish repel viral attacks on their microbiome

Organic electrochemical transistors: Scientists solve chemical mystery at the interface of biology and technology

Activity in a room stirs up nanoparticles left over from consumer sprays, study shows

Study dispels myth that purebred dogs are more prone to health problems

Study shows climate change and mercury pollution stressed plants for millions of years

Exploiting disorder to harvest heat energy: The potentialities of 2D magnets for thermoelectric applications

Citizen scientists help discover record-breaking exoplanet in binary star system

Novel calculations peg age of 'baby' asteroid

Relevant PhysicsForums posts

Related Stories

Learning language by playing games

Technique enables pattern-recognition systems to convey what they learn to humans

Probabilistic programming does in 50 lines of code what used to take thousands

System designed to label visual scenes according to type learns to detect specific objects

Researchers create first image-recognition software that greatly improves web searches

Microsoft Research project can interpret, caption photos

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience