September 20, 2011

Novel high-performance hybrid system for semantic factoring of graph databases

by Pacific Northwest National Laboratory

Imagine trying to analyze all of the English entries in Wikipedia. Now imagine you've got 20 times as much information. That's the challenge scientists face when working with gigabyte data sets. Scientists at Pacific Northwest National Laboratory, Sandia National Laboratories and Cray, Inc. developed an application to take on such massive data analysis challenges. Their novel high-performance computing application uses semantic factoring to organize data, bringing out hidden connections and threads.

The team then used their applications to analyze the massive datasets for the Billion Triple Challenge, an international competition focused on demonstrating capability and innovation for dealing with very large semantic graph databases, known as SGDs.

Why it matters? Science. Security. In both areas, people must turn massive data sets into knowledge that can be used to save lives.

As SGD technology grows to address components from extremely large data stores, it is becoming increasingly important to be able to use high-performance computational resources for analysis, interpretation, and visualization, especially as it pertains to the innate structure. However, the ability to understand the semantic structure of a vast SGD still needs both a coherent methodology and the high-performance computing platform to exercise the necessary methods.

The team took advantage of the Cray XMT architecture, which allowed all 624 gigabytes of input data to be held in RAM. They were then able to scalably perform a variety of novel tasks for descriptive analysis of the inherent semantics in the dataset provided by the Billion Triple Challenge, including identifying the ontological structure, the sensitivity of connectivity within the relationships, and the interaction among different contributions to the dataset.

The semantic database system research team is developing a prototype that can be adapted to a variety of application domains and datasets, including working with the bio2rdf.org and future billion-triple-challenge datasets in prototype testing and evaluation.

More information: Joslyn C, R Adolf, S al-Saffar, J Feo, E Goodman, D Haglin, G Mackey, and D Mizell. 2010. "High Performance Semantic Factoring of Giga-Scale Semantic Graph Databases." Semantic Web Challenge Billion Triple Challenge 2010. cass-mt.pnl.gov/btc2010/pnnl_btc.pdf

Provided by Pacific Northwest National Laboratory

Citation: Novel high-performance hybrid system for semantic factoring of graph databases (2011, September 20) retrieved 6 August 2024 from https://phys.org/news/2011-09-high-performance-hybrid-semantic-factoring-graph.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

New application allows scientists easy access to important government data

0 shares

Feedback to editors

Researchers reveal atomic-scale details of catalysts' active sites

8 minutes ago

Sniff test for explosives detection extends its reach

9 minutes ago

Researchers dig deeper into stability challenges of nuclear fusion—with mayonnaise

1 hour ago

New X-ray world record: Looking inside a microchip with 4 nanometer precision

1 hour ago

Groundwater reserves in southwestern Europe more stable overall than previously thought

1 hour ago

Competition over millions of years preserves genetic diversity of three crustaceans

1 hour ago

Researchers discover optimum twilight time for plant growth

1 hour ago

Patents can help researchers understand wildlife trade trends, new study shows

1 hour ago

New technology protects crops by testing the air for the DNA of plant diseases

1 hour ago

Visiting an art exhibition can make you think more socially and openly—but for how long?

1 hour ago

Load comments (0)

Novel high-performance hybrid system for semantic factoring of graph databases

Researchers reveal atomic-scale details of catalysts' active sites

Sniff test for explosives detection extends its reach

Researchers dig deeper into stability challenges of nuclear fusion—with mayonnaise

New X-ray world record: Looking inside a microchip with 4 nanometer precision

Groundwater reserves in southwestern Europe more stable overall than previously thought

Competition over millions of years preserves genetic diversity of three crustaceans

Researchers discover optimum twilight time for plant growth

Patents can help researchers understand wildlife trade trends, new study shows

New technology protects crops by testing the air for the DNA of plant diseases

Visiting an art exhibition can make you think more socially and openly—but for how long?

Relevant PhysicsForums posts

Creating a minimal Windows 11 Bootable USB stick for my ROG Computer

Python Socket library to create a server and client scripts

Safe, free and unlimited xls to xlsx converter?

Help solving a geometrical matching issue with Graph Neural Networks

5 GHz PC WiFi connection Cybersecurity question

Help with some optimization code for Block Matrices

New application allows scientists easy access to important government data

Web interface defines new paradigm for life science data-sharing

UT's Remote Data Analysis and Visualization Center enters full production

Customizing supercomputers from the ground up

Tropical cyclone or ISU Cyclone? Semantic science search engine knows that there is a difference

Enter the semantic grid

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

Novel high-performance hybrid system for semantic factoring of graph databases

Researchers reveal atomic-scale details of catalysts' active sites

Sniff test for explosives detection extends its reach

Researchers dig deeper into stability challenges of nuclear fusion—with mayonnaise

New X-ray world record: Looking inside a microchip with 4 nanometer precision

Groundwater reserves in southwestern Europe more stable overall than previously thought

Competition over millions of years preserves genetic diversity of three crustaceans

Researchers discover optimum twilight time for plant growth

Patents can help researchers understand wildlife trade trends, new study shows

New technology protects crops by testing the air for the DNA of plant diseases

Visiting an art exhibition can make you think more socially and openly—but for how long?

Relevant PhysicsForums posts

Related Stories

New application allows scientists easy access to important government data

Web interface defines new paradigm for life science data-sharing

UT's Remote Data Analysis and Visualization Center enters full production

Customizing supercomputers from the ground up

Tropical cyclone or ISU Cyclone? Semantic science search engine knows that there is a difference

Enter the semantic grid

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience