November 18, 2014

Researchers create first image-recognition software that greatly improves web searches

Dartmouth researchers and their colleagues have created an artificial intelligence software that uses photos to locate documents on the Internet with far greater accuracy than ever before.

The new system, which was tested on photos and is now being applied to videos, shows for the first time that a machine learning algorithm for image recognition and retrieval is accurate and efficient enough to improve large-scale document searches online. The system uses pixel data in images and potentially video - rather than just text—to locate documents. It learns to recognize the pixels associated with a search phrase by studying the results from text-based image search engines. The knowledge gleaned from those results can then be applied to other photos without tags or captions, making for more accurate document search results.

The findings appear in the journal PAMI (IEEE Transactions on Pattern Analysis and Machine Intelligence).

"Images abound on the Internet and our approach means they'll no longer be ignored during document retrieval," says Associate Professor Lorenzo Torresani, a co-author of the study. "Over the last 30 years, the Web has evolved from a small collection of mostly text documents to a modern, gigantic, fast-growing multimedia dataset, where nearly every page includes multiple pictures or videos. When a person looks at a Web page, she immediately gets the gist of it by looking at the pictures in it. Yet, surprisingly, all existing popular search engines, such as Google or Bing, strip away the information contained in the photos and use exclusively the text of Web pages to perform the document retrieval. Our study is the first to show that modern machine vision systems are accurate and efficient enough to make effective use of the information contained in image pixels to improve document search."

The researchers designed and tested a machine vision system - a type of artificial intelligence that allows computers to learn without being explicitly programmed—that extracts semantic information from the pixels of photos in Web pages. This information is used to enrich the description of the HTML page used by search engines for document retrieval. The researchers tested their approach using more than 600 search queries on a database of 50 million Web pages. They selected the text-retrieval search engine with the best performance and modified it to make use of the additional semantic information extracted by their method from the pictures of the Web pages. They found that this produced a 30 percent improvement in precision over the original search engine purely based on text. The new system was developed by researchers at Dartmouth College, Tecnalia Research & Innovation and Microsoft Research Cambridge.

Journal information: IEEE Transactions on Pattern Analysis and Machine Intelligence

Provided by Dartmouth College

Citation: Researchers create first image-recognition software that greatly improves web searches (2014, November 18) retrieved 17 April 2024 from https://phys.org/news/2014-11-image-recognition-software-greatly-web.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Software provides a clear overview in long documents

0 shares

Feedback to editors

Soil bacteria link their life strategies to soil conditions: Study

4 hours ago

Atom-by-atom: Imaging structural transformations in 2D materials

4 hours ago

Researchers identify genetic variant that helped shape human skull base evolution

4 hours ago

Two-dimensional nanomaterial sets expansion record

5 hours ago

Vibrations of granular materials: Theoretical physicists shed light on an everyday scientific mystery

6 hours ago

Global study reveals health impacts of airborne trace elements

6 hours ago

Researchers find lower grades given to students with surnames that come later in alphabetical order

6 hours ago

New model finds previous cell division calculations ignore drivers at the molecular scale

6 hours ago

Peptides on interstellar ice: Study finds presence of water molecules not a major obstacle for formation

7 hours ago

Honey bees experience multiple health stressors out in the field

7 hours ago

Load comments (1)

Researchers create first image-recognition software that greatly improves web searches

Soil bacteria link their life strategies to soil conditions: Study

Atom-by-atom: Imaging structural transformations in 2D materials

Researchers identify genetic variant that helped shape human skull base evolution

Two-dimensional nanomaterial sets expansion record

Vibrations of granular materials: Theoretical physicists shed light on an everyday scientific mystery

Global study reveals health impacts of airborne trace elements

Researchers find lower grades given to students with surnames that come later in alphabetical order

New model finds previous cell division calculations ignore drivers at the molecular scale

Peptides on interstellar ice: Study finds presence of water molecules not a major obstacle for formation

Honey bees experience multiple health stressors out in the field

Relevant PhysicsForums posts

Error logging in: onLoginSuccess is not a function

My Website For Creating Interactive Visuals Linked To Equations

Latest Notable AI accomplishments

Building a homemade Long Short Term Memory with FSMs

Most efficient way to randomly choose a word from a file with a list of words

Git, staging and committing files

Software provides a clear overview in long documents

The engines of change

How many scholarly papers are on the Web? At least 114 million, professor finds

The economics of database searching

Researchers Teach Computers to Search for Photos Based on Their Contents

Smarter video searching and indexing

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

Researchers create first image-recognition software that greatly improves web searches

Soil bacteria link their life strategies to soil conditions: Study

Atom-by-atom: Imaging structural transformations in 2D materials

Researchers identify genetic variant that helped shape human skull base evolution

Two-dimensional nanomaterial sets expansion record

Vibrations of granular materials: Theoretical physicists shed light on an everyday scientific mystery

Global study reveals health impacts of airborne trace elements

Researchers find lower grades given to students with surnames that come later in alphabetical order

New model finds previous cell division calculations ignore drivers at the molecular scale

Peptides on interstellar ice: Study finds presence of water molecules not a major obstacle for formation

Honey bees experience multiple health stressors out in the field

Relevant PhysicsForums posts

Related Stories

Software provides a clear overview in long documents

The engines of change

How many scholarly papers are on the Web? At least 114 million, professor finds

The economics of database searching

Researchers Teach Computers to Search for Photos Based on Their Contents

Smarter video searching and indexing

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience