September 20, 2011

Novel high-performance hybrid system for semantic factoring of graph databases

by Pacific Northwest National Laboratory

Imagine trying to analyze all of the English entries in Wikipedia. Now imagine you've got 20 times as much information. That's the challenge scientists face when working with data sets that run to hundreds of gigabytes. Scientists at Pacific Northwest National Laboratory, Sandia National Laboratories and Cray, Inc. developed an application to take on such massive data analysis challenges. Their novel high-performance computing application uses semantic factoring to organize data, bringing out hidden connections and threads.

The team then used their application to analyze the massive datasets for the Billion Triple Challenge, an international competition focused on demonstrating capability and innovation in dealing with very large semantic graph databases, known as SGDs.

Why does it matter? Science. Security. In both areas, people must turn massive data sets into knowledge that can be used to save lives.

As SGD technology grows to handle extremely large data stores, it is becoming increasingly important to use high-performance computational resources for analysis, interpretation, and visualization, especially of a database's innate structure. However, understanding the semantic structure of a vast SGD still requires both a coherent methodology and a high-performance computing platform on which to run the necessary methods.

The team took advantage of the Cray XMT architecture, which allowed all 624 gigabytes of input data to be held in RAM. They were then able to scalably perform a variety of novel tasks for descriptive analysis of the inherent semantics in the dataset provided by the Billion Triple Challenge, including identifying the ontological structure, the sensitivity of connectivity within the relationships, and the interaction among different contributions to the dataset.
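To give a rough sense of what a descriptive analysis of a triple store can look like, here is a minimal Python sketch. It is not the team's method or code: it assumes triples are plain (subject, predicate, object) tuples held in memory, uses a hypothetical `summarize` function and made-up toy data, and simply counts how often each predicate occurs and which classes each predicate links.

```python
from collections import Counter, defaultdict

# Assumption: class membership is recorded with this predicate name.
RDF_TYPE = "rdf:type"


def summarize(triples):
    """Count predicates and (subject class, predicate, object class) patterns."""
    triples = list(triples)

    # First pass: record the declared classes of each node.
    types = defaultdict(set)
    for s, p, o in triples:
        if p == RDF_TYPE:
            types[s].add(o)

    # How often each predicate appears in the whole dataset.
    predicate_counts = Counter(p for _, p, _ in triples)

    # Second pass: lift each non-type triple to the class level.
    link_counts = Counter()
    for s, p, o in triples:
        if p == RDF_TYPE:
            continue
        for s_class in types.get(s, ()):
            for o_class in types.get(o, ()):
                link_counts[(s_class, p, o_class)] += 1

    return predicate_counts, link_counts


# Hypothetical toy data, invented for illustration only.
TOY_TRIPLES = [
    ("alice", "rdf:type", "Person"),
    ("bob", "rdf:type", "Person"),
    ("pnnl", "rdf:type", "Lab"),
    ("alice", "worksAt", "pnnl"),
    ("bob", "worksAt", "pnnl"),
    ("alice", "knows", "bob"),
]

predicate_counts, link_counts = summarize(TOY_TRIPLES)
for pattern, count in sorted(link_counts.items()):
    print(pattern, count)
```

On the toy data, the class-level counts show that people work at labs and know other people, which is a small-scale picture of a dataset's ontological structure. A dataset of hundreds of gigabytes would need the kind of large shared-memory platform the article describes rather than a single Python process.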

The semantic database system research team is developing a prototype that can be adapted to a variety of application domains and datasets. Prototype testing and evaluation will include work with the bio2rdf.org dataset and future Billion Triple Challenge datasets.

More information: Joslyn C, R Adolf, S al-Saffar, J Feo, E Goodman, D Haglin, G Mackey, and D Mizell. 2010. "High Performance Semantic Factoring of Giga-Scale Semantic Graph Databases." Semantic Web Challenge Billion Triple Challenge 2010. cass-mt.pnl.gov/btc2010/pnnl_btc.pdf

Provided by Pacific Northwest National Laboratory

Citation: Novel high-performance hybrid system for semantic factoring of graph databases (2011, September 20) retrieved 19 April 2024 from https://phys.org/news/2011-09-high-performance-hybrid-semantic-factoring-graph.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
