New application allows scientists easy access to important government data

December 10, 2010
Rensselaer collaborated with Elsevier on the application. Credit: Rensselaer Polytechnic Institute

Government agencies around the world make billions of bits of raw data available to the public each day, but this data is often in difficult formats or so widely spread around the Web it is virtually unusable to the public and scientists who seek to use this valuable information in their research.

Computer scientists within the Tetherless World Research Constellation at Rensselaer Polytechnic Institute have developed an application to help solve the problem. A collaboration with scientific publisher Elsevier, the application utilizes the U.S. warehouse,, to provide scientists with easy and direct access to government data sets relevant to their research.

For Rensselaer, the work is the latest example of the renowned Web Science research group's efforts to enhance the hundreds of thousands of raw government datasets available on the website with advanced technology. Their work is bringing scientists and the public usable, relevant, searchable, and easy replicable datasets on topics from to public safety to the federal deficit.

The new application, called US Government Dataset Search, lives on Elsevier's SciVerse websites. SciVerse provides the global scientific research community with searchable access to the world's largest source of peer-reviewed scientific content. Such access is a vital component of the modern scientific process as scientists develop new discoveries by building off the findings of previous peer-reviewed publications.

"There is a growing movement to make data and content more open and accessible on the Web," said Tetherless World Research Constellation Professor James Hendler. "Elsevier's tool-based systems show a new way for publishers to join this movement without sacrificing copyrights. It should serve as a starting place to be emulated by others around the world."

Once selected from an application gallery by SciVerse users, the new application will display a customized list of government data sets most relevant to the topics for which the scientist is searching for articles. As an example, a climatologist searching SciVerse for peer-reviewed articles on climate change would be provided with a list of all relevant government data on ranging from the National Oceanic and Atmospheric Administration's massive collaborative weather observation networks to historical climate diaries and journals from the National Archives. This free and relevant data can then be used by the scientists to advance their research, often in totally new and unexpected ways, according to its developers.

In addition to providing direct access to raw government datasets, the application simultaneously searches the Linking Open Government Data (LOGD) portal at Rensselaer's Tetherless World Research Constellation. The portal hosts datasets that have been converted and enhanced with Semantic Web technologies. Semantic enhancements to the datasets make them much more usable and searchable to a variety of applications, enabling multiple data sets to be linked even when the underlying structure or format of each is different. Completely unseen to the average user, this semantic technology resides below the surface of the Web, augmenting rather than replacing traditional search engines. and developers can also take the semantic coding and utilize and enhance it independently.

"When we enhance data with semantics, we make it much more usable to a researcher than raw data," said the project lead for the application and Rensselaer research engineer John Erickson. "Through this application and others developed within the Tetherless World, we are empowering researchers with new tools for the basic practice of science by introducing semantics into the exploration of data."

Erickson was joined in the research by research scientist Li Ding, graduate student Dominic DiFranzo, as well the professors who lead the research group, Deborah McGuinness, Hendler, and Peter Fox.

"Using Semantic Web technologies, Tetherless World Research Constellation at Rensselaer has built innovative solutions leveraging open government datasets from," said Vice President of Product Management for Elsevier's Application Marketplace and Developer Network Rafael Sidi. "We are delighted to partner with them to bring government datasets to our users. The Dataset Search application built by Rensselaer illustrates how collaboration with the research community can lead to innovative applications that enhance scientists' productivity."

Explore further: Computer scientists lay out vision for a 'science of the Web'

More information: For more information on the Tetherless World Research Constellation work with, go to

Related Stories

Computer scientists lay out vision for a 'science of the Web'

August 10, 2006

Researchers need a clear agenda to harness the rapidly evolving potential of the World Wide Web, according to an article in the Aug. 11 issue of the journal Science. Calling for the creation of an interdisciplinary "science ...

White House launches open government initiative

May 21, 2009

The White House invited ordinary Americans on Thursday to contribute ideas on making government more open and unveiled a new website where raw federal data will be put online for public use.

Rensselaer team shows how to analyze raw government data

November 15, 2010

Who is the White House’s most frequent visitor? Which White House staffer has the most visitors? How do smoking quit rates, state by state, relate to unemployment, taxes, and violent crimes? How do politics influence ...

Recommended for you

The ethics of robot love

November 25, 2015

There was to have been a conference in Malaysia last week called Love and Sex with Robots but it was cancelled. Malaysian police branded it "illegal" and "ridiculous". "There is nothing scientific about sex with robots," ...

Glider pilots aim for the stratosphere

November 20, 2015

Talk about serendipity. Einar Enevoldson was strolling past a scientist's office in 1991 when he noticed a freshly printed image tacked to the wall. He was thunderstruck; it showed faint particles in the sky that proved something ...


Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.