February 8, 2016

Search engines will know what you want ... sooner

If you enter "Oklahoma" in a search engine, you might get a travelogue, news about the oil industry, Oklahoma State football scores or an article on Rodgers and Hammerstein musicals. What appears at the top of the list might – and should – depend on what you were actually looking for.

Web search engines, social media sites and retailers that offer you recommendations sometimes "personalize" the ranking of results by looking at your search history.

"If you buy something from Amazon tonight, when you come back tomorrow they may show you related products," explained Wenlei Xie, a graduate student in the field of computer science. "They have computed the rankings offline, based on your choice."

But now Xie and colleagues have refined the ranking algorithm (the underlying design of the computer program) to make it fast enough that search engines can become interactive, responding to your interests in real time. The new method is, they say, "breaking a decade-old performance barrier." The techniques could be applied in social media and private and commercial databases as well as in Web searches and recommendation systems.

Xie is first author of a paper describing the innovation, which was presented at the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining last summer in Sydney, Australia, where it received the Best Student Paper Award. He collaborated with Johannes Gehrke, the Tisch University Professor of Computer Science; David Bindel, assistant professor of computer science; and principal research scientist Alan Demers.

Your search history might be visualized as a "graph." In computer science, that's not a squiggly line that shows how your company's profits have been falling off, but rather a sort of concept map in which small circles called "nodes" represent items of information, connected by lines called "edges" that represent relationships. (A computer doesn't use pictures. It just stores the data items and links between them. Humans draw a graph to help in thinking about it.)
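As a rough illustration of "data items and links between them," a graph can be stored as two plain tables, one for nodes and one for edges. The sketch below is in Python; the node names, visit counts and relationship strengths are invented for the example and do not come from the researchers' work.

```python
# Hypothetical search-history graph. All names and numbers are made up.

# Node weights: how many times each item was visited.
node_weight = {
    "oklahoma_travel": 3,
    "osu_football": 5,
    "oil_news": 1,
}

# Edge weights: strength of the relationship between two items.
edge_weight = {
    ("oklahoma_travel", "osu_football"): 0.8,
    ("osu_football", "oil_news"): 0.1,
}


def neighbors(node):
    """Return {neighbor: weight} for every edge touching `node`.

    Edges are treated as undirected, so either endpoint may match.
    """
    result = {}
    for (a, b), w in edge_weight.items():
        if a == node:
            result[b] = w
        elif b == node:
            result[a] = w
    return result
```

Calling neighbors("osu_football") returns both items linked to that node along with the strength of each link.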

To examine your history, the computer does a "random walk" through the graph until it has read out all the information. To guide the walk, nodes and edges may be "weighted." Nodes may record how many times you have visited that website or looked at that product. Edges may show the importance of a relationship. In social media, for example, "spouse" is a stronger relationship than "co-worker."
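A weighted random walk of this kind can be sketched in a few lines of Python. This is a minimal illustration, not the researchers' code: the three-node graph and the weights 5.0 and 1.0 are assumptions chosen so that "spouse" is a stronger tie than "co-worker."

```python
import random
from collections import Counter

# Hypothetical social graph: graph[u][v] is the weight of the edge u -> v.
graph = {
    "you": {"spouse": 5.0, "co_worker": 1.0},
    "spouse": {"you": 5.0},
    "co_worker": {"you": 1.0},
}


def random_walk(graph, start, steps, rng):
    """Walk `steps` edges from `start`, choosing each next node with
    probability proportional to the edge weight. Returns visit counts."""
    counts = Counter()
    node = start
    for _ in range(steps):
        nbrs = list(graph[node])
        weights = [graph[node][n] for n in nbrs]
        node = rng.choices(nbrs, weights=weights, k=1)[0]
        counts[node] += 1
    return counts


# A seeded generator keeps the run repeatable.
visits = random_walk(graph, "you", 10_000, random.Random(0))
```

Because the walker follows heavier edges more often, "spouse" collects far more visits than "co_worker," which is how the weights steer the walk toward stronger relationships.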

With a "node-weighted" algorithm, a walker landing on low-rated nodes could "teleport" to others at random, ending up with information on just the most interesting nodes. But "edge weighting" works better, the Cornell researchers say.

On Twitter, they point out, ranking by how much two people have interests in common gives better results than just looking at the topics on which each user tweets.
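A walk that sometimes "teleports" is the idea behind personalized PageRank, and the edge-weighted version can be sketched with a simple repeated update. The Python below is a textbook-style power iteration, offered only as an illustration under stated assumptions: the function name, the damping value of 0.85, the three users and their weights are all hypothetical, and this is the slow baseline approach rather than the faster method the researchers describe.

```python
def personalized_pagerank(edges, teleport, alpha=0.85, iters=100):
    """Edge-weighted personalized PageRank by power iteration.

    edges:    {u: {v: weight}} for directed, weighted edges.
    teleport: {node: probability}, summing to 1; it must list every node.
    alpha:    probability of following an edge instead of teleporting.
    """
    nodes = list(teleport)
    rank = dict(teleport)
    for _ in range(iters):
        # Every node first receives its share of the teleporting walkers.
        nxt = {n: (1 - alpha) * teleport[n] for n in nodes}
        for u in nodes:
            out = edges.get(u, {})
            total = sum(out.values())
            if total == 0:
                # A node with no outgoing edges hands its score to teleport.
                for n in nodes:
                    nxt[n] += alpha * rank[u] * teleport[n]
            else:
                # Split the score among neighbors in proportion to weight.
                for v, w in out.items():
                    nxt[v] += alpha * rank[u] * w / total
        rank = nxt
    return rank


# Hypothetical example: alice shares more interests with bob than with carol.
edges = {
    "alice": {"bob": 3.0, "carol": 1.0},
    "bob": {"alice": 1.0},
    "carol": {"alice": 1.0},
}
teleport = {"alice": 1.0, "bob": 0.0, "carol": 0.0}
scores = personalized_pagerank(edges, teleport)
```

In this toy case the heavier alice-to-bob edge gives bob a higher score than carol, mirroring the point that shared interests between people can drive the ranking.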

There already are ranking algorithms available that use edge weights, but they're slow. To speed up the ranking, the researchers "reduce" the graph and make the walk faster – sort of like looking at a map of the United States that shows only interstate highways, not all the county roads and city streets.

The algorithm looks for nodes that are "correlated" – representing similar interests, and with strong connections between them. A high school student checking out colleges might visit a lot of university websites; these could be combined into one large and very important node in the simplified graph. "It's like we can collapse a million nodes into a hundred virtual nodes," Xie explained.
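The idea of collapsing many nodes into a few virtual ones can be pictured with a very simple merge step. The Python sketch below is only an analogy for the article's description, not the researchers' reduction technique: the grouping is supplied by hand rather than discovered from correlations, and the function name, site names and weights are hypothetical.

```python
from collections import defaultdict


def collapse(edges, group_of):
    """Merge nodes into virtual nodes and sum the edge weights between them.

    edges:    {(u, v): weight}
    group_of: {node: virtual_node}; nodes not listed stay as they are.
    """
    reduced = defaultdict(float)
    for (u, v), w in edges.items():
        gu = group_of.get(u, u)
        gv = group_of.get(v, v)
        if gu != gv:  # edges inside one virtual node disappear
            reduced[(gu, gv)] += w
    return dict(reduced)


# Hypothetical history of a student browsing university websites.
edges = {
    ("student", "cornell.edu"): 2.0,
    ("student", "mit.edu"): 1.0,
    ("student", "news_site"): 0.5,
    ("cornell.edu", "mit.edu"): 4.0,
}
group_of = {"cornell.edu": "universities", "mit.edu": "universities"}
reduced = collapse(edges, group_of)
```

Four edges shrink to two here: the student's links to the two university sites combine into a single, heavier link to the "universities" virtual node.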

The researchers tested their method on a database of scholarly publications and a blog search system and found that it worked five orders of magnitude faster than currently used methods. They also found that their reduced model sped up "learn to rank" systems, in which the computer notes which items in a list the user clicks on to get an idea of the user's preferences.

A way to make the results even more timely, the researchers suggested, might be to do the calculations on the client side, after downloading the reduced model to the client's computer. They would also like to update the reduced model continuously as new data comes in.

Provided by Cornell University

Citation: Search engines will know what you want ... sooner (2016, February 8) retrieved 16 April 2024 from https://phys.org/news/2016-02-sooner.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
