May 20, 2009

Grid browser finds the meaning of life

(PhysOrg.com) -- A web browser that can understand technical terms in life sciences and automatically find additional resources and services has been developed by European researchers. It could lead to a new generation of intelligent search engines.

The life sciences community has built numerous databases - covering gene sequencing data and information about diseases, for example - that are available to researchers as ‘grid’ services.

“Grid computing is essentially about building virtual organisations that are independent of the physical location where they reside,” says Michael Schroeder of Technische Universität Dresden.

The problem is how to link those services to other scientific information found on the web. Schroeder is coordinator of the EU-funded Sealife project, which has created a ‘semantic grid browser’ to make grid services for the life sciences much more accessible.

“We have the web on the one hand and then we have grid computing, with its many services, on the other,” he says. A semantic grid browser seamlessly integrates them.

“It tries to understand what it finds on web pages, interprets this content and then links it, on the fly, to services that might be useful to the user.”

A matter of semantics

The key to the Sealife browser is a ‘semantic hyperlink’ that shows up on the page to direct users to relevant services. The link is not put there by the website but by the browser itself.

How does it do that?

First, the browser needs to understand the content of the page and identify terms which could be linked to grid services. An example tested in the Sealife project is the naming of genes. Each human gene has an average of 5.5 names, Schroeder points out, but if it can be identified correctly, a link can be made to a wealth of information about that gene.
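
The gene-naming step can be pictured with a small Python sketch. It is not Sealife's code: the synonym table is a hand-picked illustration, `find_gene_links` is a hypothetical helper, and the `example.org` address stands in for a real grid service. It only shows how several names can resolve to one gene and one link.

```python
import re

# Illustrative synonym table (an assumption, not Sealife's data):
# several surface names map to one canonical gene symbol.
GENE_SYNONYMS = {
    "tp53": "TP53",
    "p53": "TP53",
    "lfs1": "TP53",
    "brca1": "BRCA1",
    "rnf53": "BRCA1",
}

# Hypothetical service endpoint; example.org is a placeholder domain.
SERVICE_URL = "https://example.org/gene/{symbol}"


def find_gene_links(text):
    """Return (matched_name, canonical_symbol, url) for each known gene name."""
    links = []
    for match in re.finditer(r"[A-Za-z0-9]+", text):
        name = match.group(0)
        symbol = GENE_SYNONYMS.get(name.lower())
        if symbol is not None:
            links.append((name, symbol, SERVICE_URL.format(symbol=symbol)))
    return links
```

Calling `find_gene_links("Mutations in p53 and RNF53 were studied.")` would tie both names to their canonical symbols, which is the point at which a browser could attach a semantic hyperlink.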

The browser must also be able to handle ambiguity. “If I see ‘Jaguar’ on a web page, what is it? Is it an animal? Is it a car? Is it the Mac operating system?” Sealife uses specialised algorithms to work out the context from other words on the page and correctly interpret the meaning.
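
The article does not describe Sealife's algorithms in detail, so the following Python sketch swaps in a much simpler technique: counting how many context words for each sense appear on the page. The word lists and the `disambiguate` function are assumptions made up for illustration.

```python
import re

# Hypothetical context vocabularies for each sense of "jaguar".
SENSE_CONTEXTS = {
    "animal": {"cat", "prey", "rainforest", "habitat", "species"},
    "car": {"engine", "driver", "sedan", "dealer", "speed"},
    "operating system": {"mac", "apple", "install", "software", "version"},
}


def disambiguate(page_text, sense_contexts=SENSE_CONTEXTS):
    """Pick the sense whose context words overlap most with the page."""
    words = set(re.findall(r"[a-z]+", page_text.lower()))
    scores = {sense: len(words & ctx) for sense, ctx in sense_contexts.items()}
    best = max(scores, key=scores.get)
    # Return None when no context word appears at all, to avoid guessing.
    return best if scores[best] > 0 else None
```

A page that mentions a cat stalking prey in the rainforest would score highest for the animal sense, while a page with no context words at all is left unresolved rather than guessed.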

It is still not an exact science, though. The Sealife team entered their algorithm in an international competition with 50 others to identify names of genes. They won, with an 81% success rate, though Schroeder says they have now got that up to 87%.

Background knowledge

The second challenge is the background knowledge that allows the browser to make sense of the identified terms. Such knowledge is formally known as an ‘ontology’, a systematic hierarchy of concepts and their relation to one another. Biology, with its extensive taxonomies, is an ideal field for semantic grid browsing.
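
As a rough picture of such a hierarchy, the Python sketch below stores a toy ontology as child-to-parent links and walks upwards to find broader concepts. The concepts, the `IS_A` table and both functions are illustrative assumptions. Real ontologies usually allow several parents per concept and more relation types than "is a".

```python
# Toy ontology: each concept points to its parent ("is a" relation).
IS_A = {
    "tuberculosis": "bacterial infection",
    "influenza": "viral infection",
    "bacterial infection": "infectious disease",
    "viral infection": "infectious disease",
    "infectious disease": "disease",
}


def ancestors(concept, is_a=IS_A):
    """Walk up the hierarchy and list every broader concept."""
    chain = []
    while concept in is_a:
        concept = is_a[concept]
        chain.append(concept)
    return chain


def is_kind_of(concept, broader, is_a=IS_A):
    """True if `broader` is the concept itself or one of its ancestors."""
    return concept == broader or broader in ancestors(concept, is_a)
```

With this background knowledge, a page about tuberculosis can be recognised as relevant to a search for infectious diseases even if the page never uses that phrase.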

“All these efforts of building hierarchical classification systems have been at the core of biology for centuries,” says Schroeder. “Biologists are used to it and there are many efforts to make information exchangeable.”

But outside the life sciences such systematic classification is not so well developed, and the Sealife project has created editors to build ontologies from published literature in any specific field of interest.

“We developed algorithms that grind through this data, identify the key concepts and then the ontology editor offers these concepts to you,” Schroeder explains. “If you agree, it then searches the web to find things that look like definitions. This whole process of building this background knowledge cannot be fully automated but you can ease the pain of doing this quite significantly.”
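
The workflow Schroeder describes, in which software proposes concepts and a person confirms them, might look something like the Python sketch below. It uses plain word frequency as a stand-in for the project's algorithms, which the article does not specify. The stopword list, `propose_concepts` and `review` are hypothetical.

```python
import re
from collections import Counter

# Small illustrative stopword list; a real system would use a fuller one.
STOPWORDS = {"the", "a", "an", "of", "in", "and", "is", "to", "by", "for", "are", "with"}


def propose_concepts(documents, top_n=3):
    """Suggest the most frequent non-stopword terms as candidate concepts."""
    counts = Counter()
    for doc in documents:
        for word in re.findall(r"[a-z]+", doc.lower()):
            if word not in STOPWORDS and len(word) > 2:
                counts[word] += 1
    return [term for term, _ in counts.most_common(top_n)]


def review(candidates, accept):
    """Keep only the candidates a human curator accepts."""
    return [term for term in candidates if accept(term)]
```

The `accept` callback represents the human in the loop: the software narrows the literature down to a shortlist, and the curator makes the final decision on each term.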

Different varieties of the Sealife browser build on work by partners in Edinburgh, Manchester, London and Sophia-Antipolis, as well as in Dresden. They have been tested in three scenarios: evidence-based medicine, mining of scientific and patent literature, and molecular biology. In each case, the focus has been on infectious diseases.

Browser that understands everything?

So successful has the project been that TU Dresden has spun off a new company, Transinsight, to exploit work done in Sealife. The company has sold semantic browsers to such major customers as BASF and Unilever and runs the GoPubMed search engine, which is linked to the respected PubMed archive of biomedical literature.

But there is no reason why a semantic browser should be confined to specialised academic areas. Could we have a browser that understands everything? Schroeder thinks that is not as far-fetched as it may seem. “The vision is to include every domain,” he says. “For example, if we were able to extract and formalise the knowledge in Wikipedia we would have this general background knowledge that covers all areas.”

Many researchers look forward to a next-generation search engine that can understand what the user is looking for and return much more relevant results than today’s engines can. “This will involve integrating information,” says Schroeder, “because very often answers to questions are not provided in one document as a single statement that I can pick up by keywords.

“In the future, we will need background knowledge and this is at the core of Sealife. If we build semantics into search, and make it scalable, then you will have the next-generation search engine.”

More information: www.biotec.tu-dresden.de/sealife/

Provided by ICT Results

Citation: Grid browser finds the meaning of life (2009, May 20) retrieved 20 September 2024 from https://phys.org/news/2009-05-grid-browser-life.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
