February 8, 2016

Search engines will know what you want ... sooner

If you enter "Oklahoma" in a search engine, you might get a travelogue, news about the oil industry, Oklahoma State football scores or an article on Rodgers and Hammerstein musicals. What appears at the top of the list might – and should – depend on what you were actually looking for.

Web search engines, social media sites and retailers that offer you recommendations sometimes "personalize" the ranking of results by looking at your search history.

"If you buy something from Amazon tonight, when you come back tomorrow they may show you related products," explained Wenlei Xie, a graduate student in the field of computer science. "They have computed the rankings offline, based on your choice."

But now Xie and colleagues have refined the algorithm (the underlying design of the computer program) to make it faster so search engines can become interactive, responding to your interests in real time. The new method is, they say, "breaking a decade-old performance barrier." The techniques could be applied in social media and private and commercial databases as well as in Web searches and recommendation systems.

Xie is first author of a paper describing the innovation presented at the 21st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, last summer in Sydney, Australia, where it received the Best Student Paper Award. He collaborated with Johannes Gehrke, the Tisch University Professor of Computer Science; David Bindel, assistant professor of computer science; and principal research scientist Alan Demers.

Your search history might be visualized as a "graph." In computer science, that's not a squiggly line that shows how your company's profits have been falling off, but rather a sort of concept map in which small circles called "nodes" represent items of information, connected by lines called "edges" that represent relationships. (A computer doesn't use pictures. It just stores the data items and links between them. Humans draw a graph to help in thinking about it.)

To examine your history, the computer does a "random walk" through the graph until it has read out all the information. To guide the walk, nodes and edges may be "weighted." Nodes may record how many times you have visited that website or looked at that product. Edges may show the importance of a relationship. In social media, for example, "spouse" is a stronger relationship than "co-worker."

With a "node-weighted" algorithm, a walker landing on low-rated nodes could "teleport" to others at random, ending up with information on just the most interesting nodes. But "edge weighting" works better, the Cornell researchers say.

On Twitter, they point out, ranking by how much two people have interests in common gives better results than just looking at the topics on which each user tweets.

There already are ranking algorithms available that use edge weights, but they're slow. To speed it up, the researchers "reduce" the graph and make the walk faster – sort of like looking at a map of the United States that shows only interstate highways, not all the county roads and city streets.

The algorithm looks for nodes that are "correlated" – representing similar interests, and with strong connections between them. A high school student checking out colleges might visit a lot of university websites; these could be combined into one large and very important node in the simplified graph. "It's like we can collapse a million nodes into a hundred virtual nodes," Xie explained.

The researchers tested their method on a database of scholarly publications and a blog search system and found that it worked five orders of magnitude faster than currently used methods. They also found that their reduced model speeded up "learn to rank" systems where the computer notes which items in a list the user clicks on to get an idea of the user's preferences.

A way to make the results even more timely, the researchers suggested, might be to do the calculations on the client side, after downloading the reduced model to the client's computer. They would also like to update the reduced model continuously as new data comes in.

Provided by Cornell University

Citation: Search engines will know what you want ... sooner (2016, February 8) retrieved 3 May 2024 from https://phys.org/news/2016-02-sooner.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Explained: Graphs

840 shares

Feedback to editors

Tweaking isotopes sheds light on promising approach to engineer semiconductors

just added

The Eta Aquarid meteor shower, debris of Halley's comet, peaks this weekend. Here's how to see it

1 hour ago

Lego-pushing bumblebees reveal insect collaboration dynamics

1 hour ago

Physicists create an optical tweezer array of individual polyatomic molecules for the first time

1 hour ago

Chemist explores the real-world science of Star Wars

1 hour ago

Nanostructured copper surface shows potential for transparent, antimicrobial surfaces in touch displays

1 hour ago

Novel triple drug combination effective against antibiotic-resistant bacteria

1 hour ago

Study finds microbiome changes dynamically and favors important host-relevant functions

1 hour ago

In medieval England, leprosy spread between red squirrels and people, genome evidence shows

1 hour ago

New discovery of a mechanism that controls cell division

2 hours ago

Load comments (3)

Search engines will know what you want ... sooner

Tweaking isotopes sheds light on promising approach to engineer semiconductors

The Eta Aquarid meteor shower, debris of Halley's comet, peaks this weekend. Here's how to see it

Lego-pushing bumblebees reveal insect collaboration dynamics

Physicists create an optical tweezer array of individual polyatomic molecules for the first time

Chemist explores the real-world science of Star Wars

Nanostructured copper surface shows potential for transparent, antimicrobial surfaces in touch displays

Novel triple drug combination effective against antibiotic-resistant bacteria

Study finds microbiome changes dynamically and favors important host-relevant functions

In medieval England, leprosy spread between red squirrels and people, genome evidence shows

New discovery of a mechanism that controls cell division

Relevant PhysicsForums posts

Parallel processing for loops and pointer defined outside the loop

Passing variables in FORTRAN

User-Defined Functions in Sql Server SSMS

Classifiers, threshold, and ROC curve

My Website For Creating Interactive Visuals Linked To Equations

Number of Multiplications in the FFT Algorithm

Explained: Graphs

Computer scientist claims to have solved the graph isomorphism problem

New approach to vertex connectivity could maximize networks' bandwidth

Short algorithm, long-range consequences

New algorithm identifies data subsets that will yield the most reliable predictions

New algorithm can dramatically streamline solutions to the 'max flow' problem

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

Search engines will know what you want ... sooner

Tweaking isotopes sheds light on promising approach to engineer semiconductors

The Eta Aquarid meteor shower, debris of Halley's comet, peaks this weekend. Here's how to see it

Lego-pushing bumblebees reveal insect collaboration dynamics

Physicists create an optical tweezer array of individual polyatomic molecules for the first time

Chemist explores the real-world science of Star Wars

Nanostructured copper surface shows potential for transparent, antimicrobial surfaces in touch displays

Novel triple drug combination effective against antibiotic-resistant bacteria

Study finds microbiome changes dynamically and favors important host-relevant functions

In medieval England, leprosy spread between red squirrels and people, genome evidence shows

New discovery of a mechanism that controls cell division

Relevant PhysicsForums posts

Related Stories

Explained: Graphs

Computer scientist claims to have solved the graph isomorphism problem

New approach to vertex connectivity could maximize networks' bandwidth

Short algorithm, long-range consequences

New algorithm identifies data subsets that will yield the most reliable predictions

New algorithm can dramatically streamline solutions to the 'max flow' problem

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience