April 17, 2014

Researcher seeks to lessen failures in computerized visual recognition programs

by Steven D Mackay, Virginia Tech

Computer programs that use facial or image recognition systems, be they security cameras or applications that search databases for everything from photographs of wanted criminals to images of bears, are like any other technological marvel. They may be fast and versatile, but they frequently fail and are limited to one-way communication, taking orders from the user.

Devi Parikh, an assistant professor with the Virginia Tech Bradley Department of Electrical and Computer Engineering, wants to change that by creating a two-way communication path between users and computer vision systems and algorithms. The two-way system won't directly prevent failures and faults, but it will help users better diagnose computer problems, correct errors, and prevent future occurrences.

"Models that characterize the failures of a system can then also be used to predict oncoming failure," said Parikh, whose research project is at the center of a $150,000 U.S. Army Research Office Young Investigators' Award and could well have future applications in a wide variety of artificial intelligence systems. "Such a warning signal can be valuable to a downstream application that uses the output of the machine perception system as input. These techniques are broadly applicable to many research and development efforts on intelligent and autonomous systems."

Using a computer vision program, or almost any computer system, that proves faulty or makes errors is much like talking with a small child who may be ill: The adult can tell from the child's behavior that something is wrong, but the child does not have the vocabulary to express why he or she is feeling ill. The parent must guess, seek help with a diagnosis, or both, or the child remains sick.

Computers act much the same way during a system or program crash or failure. When a facial recognition system fails to recognize or track a person's face, it may not be able to tell the user – likely law enforcement – why it is failing or even that it is failing. The user must guess if the program is failing because of, say, low or harsh light or because the subject has his or her face at an odd angle, askew from the lens.

Parikh wants to remove the guesswork, allowing the system or application to directly tell the user the cause of failure.

Once aware of the fault, the user can take action to correct the error, such as switching to a different camera to capture the person's face from another angle or narrowing the aperture of the lens to take in less light and avoid excessive glare, and so obtain a better, usable image.
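As a rough illustration of what such feedback might look like, the Python sketch below maps a few measurements of a camera frame to plain-language causes of failure and suggested fixes. It is not Parikh's method: the `ImageStats` fields, the `diagnose` function, and the numeric thresholds are all hypothetical, chosen only to mirror the lighting and face-angle examples described above.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ImageStats:
    """Hypothetical per-frame measurements (assumed, not from the article)."""
    mean_brightness: float   # 0.0 (black) to 1.0 (white)
    face_yaw_degrees: float  # 0 means the face points straight at the lens


def diagnose(stats: ImageStats) -> List[str]:
    """Return plain-language reasons why recognition is likely to fail.

    The thresholds are illustrative assumptions, not measured values.
    """
    reasons = []
    if stats.mean_brightness < 0.2:
        reasons.append("low light: add lighting or lengthen the exposure")
    elif stats.mean_brightness > 0.8:
        reasons.append("harsh light: narrow the aperture to reduce glare")
    if abs(stats.face_yaw_degrees) > 45:
        reasons.append("face turned away: switch to another camera angle")
    return reasons


if __name__ == "__main__":
    # An overexposed frame in which the subject looks well away from the lens.
    for reason in diagnose(ImageStats(mean_brightness=0.9, face_yaw_degrees=60)):
        print(reason)
```

A frame that passes every check returns an empty list, which a downstream application could treat as "no warning."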

In much the same way, if a computer is programmed to sort through thousands of images for photographs of bears, but its initial model is based only on images of a grizzly standing near a lake, the system may well treat the body of water as directly associated with a bear, and miss images of polar bears because it was instructed to search for only one type of bear. Parikh wants to create systems smart enough to ask the user questions that will avoid such errors or shortcomings, thus saving the user time and, likely, money.

"A semantic characterization of the failure modes of a system can thus allow us to design better systems in the future, as well as to make today's computer vision systems more usable even with their existing imperfections," Parikh wrote in her proposal.

Provided by Virginia Tech

Citation: Researcher seeks to lessen failures in computerized visual recognition programs (2014, April 17) retrieved 27 April 2024 from https://phys.org/news/2014-04-lessen-failures-computerized-visual-recognition.html

