November 1, 2006

Researchers teach computers how to name images by 'thinking'

Penn State researchers have "taught" computers how to interpret images using a vocabulary of up to 330 English words, so that a computer can describe a photograph of two polo players, for instance, as "sport," "people," "horse," "polo."

The new system, which can automatically annotate entire online collections of photographs as they are uploaded, means significant time savings for the millions of Internet users who now manually tag or identify their images. It also makes images easier to retrieve with search terms, said James Wang, associate professor in the Penn State College of Information Sciences and Technology, and one of the technology's two inventors.

The system is described in a paper, "Real-Time Computerized Annotation of Pictures," presented at the recent ACM Multimedia 2006 conference in Santa Barbara, Calif., and authored by Jia Li, associate professor, Department of Statistics, and Wang. Penn State has filed a provisional patent application on the invention.

Major search engines currently rely on text tags uploaded with images to describe them. While many collections are annotated, many are not. The result: Images without text tags are not accessible to Web searchers. Because it provides text tags, the ALIPR system (Automatic Linguistic Indexing of Pictures-Real Time) makes those images visible to Web users.

ALIPR does this by analyzing the pixel content of images and comparing that against a stored knowledge base of the pixel content of tens of thousands of image examples. The computer then suggests a list of 15 possible annotations or words for the image.
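The general idea of matching an image's pixel content against stored examples and returning a ranked word list can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions, not the ALIPR algorithm: it uses a coarse color histogram as the image signature, an L1 distance for comparison, and a tiny hand-built knowledge base. The function names `color_histogram`, `histogram_distance` and `suggest_tags` are hypothetical.

```python
from collections import Counter


def color_histogram(pixels, bins=4):
    """Quantize (r, g, b) pixels into bins**3 cells; return a normalized sparse histogram."""
    step = 256 // bins
    counts = Counter((r // step, g // step, b // step) for r, g, b in pixels)
    total = len(pixels)
    return {cell: n / total for cell, n in counts.items()}


def histogram_distance(h1, h2):
    """L1 distance between two normalized sparse histograms (0 = identical, 2 = disjoint)."""
    cells = set(h1) | set(h2)
    return sum(abs(h1.get(c, 0.0) - h2.get(c, 0.0)) for c in cells)


def suggest_tags(pixels, knowledge_base, top_k=15):
    """Rank candidate words by how closely the query resembles examples carrying each word.

    knowledge_base is a list of (histogram, tags) pairs: an assumed, simplified
    stand-in for a trained model built from many annotated images.
    """
    query = color_histogram(pixels)
    scores = {}
    for hist, tags in knowledge_base:
        similarity = 1.0 - histogram_distance(query, hist) / 2.0  # scaled to [0, 1]
        for tag in tags:
            # Keep the best similarity seen for each word.
            scores[tag] = max(scores.get(tag, 0.0), similarity)
    # Highest score first; ties broken alphabetically for a stable order.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:top_k]


# Toy knowledge base: one mostly green example and one mostly blue example.
grass_example = color_histogram([(20, 200, 30)] * 10)
sky_example = color_histogram([(30, 60, 220)] * 10)
knowledge_base = [(grass_example, ["grass", "field"]), (sky_example, ["sky"])]

new_image = [(25, 210, 40)] * 5  # an unseen, mostly green image
print(suggest_tags(new_image, knowledge_base))
```

A real system would use far richer signatures and tens of thousands of training images; the sketch only shows the compare-then-rank structure described above.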

"By inputting tens of thousands of images, we have trained computers to recognize certain objects and concepts and automatically annotate those new or unseen images," Wang said. "More than half the time, the computer's first tag out of the top 15 tags is correct."

In addition, for 98 percent of images tested, the system provided at least one correct annotation among its top 15 selected words. The system, which completes an annotation in about 1.4 seconds, also can be applied to other domains such as art collections, satellite imaging and pathology slides, Wang said.

The new system builds on the authors' previous invention, ALIP, which also analyzes image content. But unlike ALIP, which characterized images by incorporating computationally intensive spatial modeling, ALIPR characterizes images by modeling distributions of color and texture.
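The accuracy figures quoted in this article (a correct first tag more than half the time, and at least one correct tag among the top 15 for 98 percent of test images) can be read as "top-k hit rates." The article does not say exactly how the researchers scored their tests, so the following is only a sketch of one common way to compute such a figure. The function name `top_k_hit_rate` and the sample data are made up for illustration.

```python
def top_k_hit_rate(predictions, ground_truth, k):
    """Fraction of images with at least one correct tag among the first k suggestions.

    predictions:  one ranked tag list per image (best guess first).
    ground_truth: one set of acceptable tags per image, in the same order.
    """
    if not predictions:
        raise ValueError("need at least one image to score")
    hits = 0
    for ranked_tags, correct_tags in zip(predictions, ground_truth):
        if any(tag in correct_tags for tag in ranked_tags[:k]):
            hits += 1
    return hits / len(predictions)


# Hypothetical results for three test images (not data from the study).
predictions = [["horse", "sky"], ["car", "people"], ["tree", "water"]]
ground_truth = [{"horse", "polo"}, {"people"}, {"building"}]

print(top_k_hit_rate(predictions, ground_truth, k=1))  # first tag only
print(top_k_hit_rate(predictions, ground_truth, k=2))  # any of the top two tags
```

With k=1 the function measures how often the first suggested tag is right; with k=15 it would correspond to the "at least one correct annotation in the top 15" figure.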

The researchers acknowledge that computers trained with their algorithms have difficulty when photos are fuzzy or have low contrast or resolution; when objects are shown only partially; and when the photographer's angle presents an object differently from the examples the computer was trained on. Adding more training images and improving the training process may reduce these limitations; both are areas for future research.

Source: Penn State

Citation: Researchers teach computers how to name images by 'thinking' (2006, November 1) retrieved 22 September 2024 from https://phys.org/news/2006-11-images.html

