Search gets smarter with identifiers

Mar 20, 2014
Search gets smarter with identifiers
Credit: CORDIS

The future of computing is based on Big Data. The vast collections of information available on the web and in the cloud could help prevent the next financial crisis, or even tell you exactly when your bus is due. The key lies in giving everything – whether it's a person, business or product – a unique identifier.

Imagine if everything you owned or used had a unique code that you could scan, and that would bring you a wealth of information. Creating a database of billions of unique identifiers could revolutionise the way we think about objects. For example, if every product that you buy can be traced through every step in the supply chain you can check whether your food has really come from an organic farm or whether your car is subject to an emergency recall.

But it's not just consumers who will benefit from a world of unique identifiers: governments and businesses are making use of them too. The company Okkam indexes unique identifiers for the vast amount of data available online. Okkam srl was created to commercialise technologies developed within the EU-funded research project OKKAM. One of their main customers is the regional government in Trentino which is using big data, as processed by Okkam srl, to improve their tax collecting activities.

"We are working with the regional government in Trentino to collect data about tax payers," says Paolo Bouquet, president of Okkam srl. "We use it to help discover tax evasion, which is, as you can imagine, a very hot topic in Italy."

The company is also working with the financial services industry to help prevent a deepening of the . By bringing together data about individual customers from banks, credit rating companies and the web, lenders can identify high-risk individuals and change their lending decisions accordingly. This will make it easier to prevent the high rates of defaulting that brought about the recent global financial meltdown.

Using identifiers to index the world

Okkam srl provides a centralised repository of identifiers (or labels) for people, organisations and things. Anyone can use these labels to index anything that might be of interest online or in private collections.

The difficulty with using is that the person or business named in one database might have a completely different name somewhere else. For example, news reports talk about Barack Obama, The US President, and The White House interchangeably. For a human being, it's easy to know that these names all refer to the same person, but computers don't know how to make these connections. To address the problem, Okkam has created a Global Open Naming System: essentially an index of unique entities like people, organisations and products, that lets people share data.

"We provide a very fast and effective way of discovering data about the same entities across a variety of sources. We do it very quickly," says Paolo Bouquet. "And we do it in a way that it is incremental so you never waste the work you've done. Okkam's entity naming system allows you to share the same identifiers across different projects, different companies, different data sets. You can always build on top of what you have done in the past."

The benefits of a unique name for everything

It's not just data that benefits from Okkam's unique name register. Real world objects like bus stops and newspapers are getting the unique identifier treatment. Using simple technologies like QR codes (the black and white 'messy chessboard' type of bar code) and Near Field Communication (a radio frequency that allows mobile devices pass information back and forth), Okkam has made it possible to tag real objects with online, up-to-the-minute data.

For example, the province of Trentino has equipped each of its bus stops with unique QR codes. This means that passengers can scan the code and get the latest information about travel disruption or download timetables. The future 'internet of things' takes a step closer with this technology.

Explore further: The new technologies needed for dealing with big data

More information: 'Enabling the Web of Entities. A scalable and sustainable solution for systematic and global identifier reuse in decentralized information environments' website: www.okkam.org/

add to favorites email to friend print save as pdf

Related Stories

Web of entities: prepare to 'Okkamise'!

Mar 07, 2008

Internet searching is something of an art form. The spaghetti-like tangle of documents and fragments resulting from what you thought were perfectly cogent keyword searches make the web a forbidding place. European researchers ...

The new technologies needed for dealing with big data

Feb 20, 2014

While much focus and discussion of the so-called "Big Data revolution" has been on the data itself and the exciting new applications it is enabling—from Google's self-driving cars through to CSIRO and University ...

QR codes pose internet security risk

Feb 19, 2014

Internet security experts from Murdoch University have raised concerns about the growing use of Quick Response codes, also known as QR codes.

Recommended for you

Forging a photo is easy, but how do you spot a fake?

Nov 21, 2014

Faking photographs is not a new phenomenon. The Cottingley Fairies seemed convincing to some in 1917, just as the images recently broadcast on Russian television, purporting to be satellite images showin ...

Algorithm, not live committee, performs author ranking

Nov 21, 2014

Thousands of authors' works enter the public domain each year, but only a small number of them end up being widely available. So how to choose the ones taking center-stage? And how well can a machine-learning ...

Professor proposes alternative to 'Turing Test'

Nov 19, 2014

(Phys.org) —A Georgia Tech professor is offering an alternative to the celebrated "Turing Test" to determine whether a machine or computer program exhibits human-level intelligence. The Turing Test - originally ...

Image descriptions from computers show gains

Nov 18, 2014

"Man in black shirt is playing guitar." "Man in blue wetsuit is surfing on wave." "Black and white dog jumps over bar." The picture captions were not written by humans but through software capable of accurately ...

Converting data into knowledge

Nov 17, 2014

When a movie-streaming service recommends a new film you might like, sometimes that recommendation becomes a new favorite; other times, the computer's suggestion really misses the mark. Yisong Yue, assistant ...

User comments : 0

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.