How on earth does geotagging work?

January 9, 2017 by Katie Willis, University of Alberta
Davood Rafiei is a professor in the Department of Computing Sciences and expert in big data and information management. Credit: John Ulan

In an increasingly digital world, we don't always consider where on earth the information we find online comes from.

Now, computing science researchers at the University of Alberta are using automated geotagging models to put a place to online data and documents.

"With the proliferation of online content and the need for sharing it across the globe, it is important to correctly match names to the places they refer to," says Davood Rafiei, professor in the Department of Computing Sciences and expert in and information management.

"The potential applications are huge. Perhaps you want to find out about people, organizations, or events in a certain location. Or maybe you want to understand where your data sources are located. There are even applications for determining if two named entities are in fact referring to the same thing."

Using a two-part model, Rafiei and former master of science student Jiangwei Yu have developed a technique to automate geotagging for news articles and other online documents and data. The model integrates two competing hypotheses: inheritance and near-location.

According to the inheritance hypothesis, named entities are given the same geographical location as the document in which they are mentioned. "For example, every name mentioned in a Wall Street Journal article will inherit the geocentre of the article, which in this case will be New York City, New York, USA," explains Rafiei.

The near-location hypothesis links the named entities to geographical locations mentioned in nearby text—such as a person's name mentioned next to the phrase "Edmonton, Alberta" in an article.

"What happens in the real world though appears to be a mixture of the two forces," explains Rafiei. "Our data shows that the inheritance hypothesis holds in 72 percent of the cases, the near-location hypothesis holds in 67 percent of the cases, and at least one holds in close to 99 percent of the cases."

In addition to being highly accurate, the model is automated, cutting the cost of geotagging significantly.

"The power of geotagging is being better able to understand people, places, and things referenced in online documents," says Rafiei.

The paper, "Geotagging named entities in news and online documents", was presented at International Conference on Information and Knowledge Management, Proceedings.

Explore further: Hide your location on Twitter? We can still find you and that's not a bad thing in an emergency

More information: Geotagging named entities in news and online documents, DOI: 10.1145/2983323.2983795

Related Stories

Literature searches benefit from location tagging

October 31, 2014

Agricultural Research Service ecologist Jason Karl is creating new options for helping researchers to conduct literature searches that go beyond using traditional search terms such as keywords or authors. With the help of ...

Women are seen more than heard in online news

February 3, 2016

It has long been argued that women are under-represented and marginalised in relation to men in the world's news media. New research, using artificial intelligence (AI), has analysed over two million articles to find out ...

Recommended for you

Cryptocurrency rivals snap at Bitcoin's heels

January 14, 2018

Bitcoin may be the most famous cryptocurrency but, despite a dizzying rise, it's not the most lucrative one and far from alone in a universe that counts 1,400 rivals, and counting.

Top takeaways from Consumers Electronics Show

January 13, 2018

The 2018 Consumer Electronics Show, which concluded Friday in Las Vegas, drew some 4,000 exhibitors from dozens of countries and more than 170,000 attendees, showcased some of the latest from the technology world.

Finnish firm detects new Intel security flaw

January 12, 2018

A new security flaw has been found in Intel hardware which could enable hackers to access corporate laptops remotely, Finnish cybersecurity specialist F-Secure said on Friday.

0 comments

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.