June 3, 2013

Addressing biodiversity data quality is a community-wide effort

Improving data quality in large online data access facilities depends on a combination of automated checks and capturing expert knowledge, according to a paper published in the open-access journal ZooKeys. The authors, from the Atlas of Living Australia (ALA) and the Global Biodiversity Information Facility (GBIF), welcome a recent paper by Mesibov (2013) highlighting errors in millipede data, but argue that addressing such issues requires the joint efforts of 'aggregators' and the wider expert community.

The paper notes that aggregations of data openly exposed in facilities such as the ALA and GBIF will contain errors, and both organisations are fully committed to improving the quality of these data. Errors will arise in a multitude of ways. For example, an observation of a species may be misnamed, the name could have changed or the pre-GPS location could be in error. The card entry of this observation could then have been incorrectly transcribed into a digital record by a museum or herbarium. When the record was translated into a standard form for communication with the ALA or GBIF, other errors could have been introduced. At each step of the process, errors can be detected, introduced or corrected.

The authors argue that one of the most powerful outcomes of publishing digital data is that such problems are revealed, providing an opportunity for the whole community to detect and correct them. The paper points out that Mesibov's detection of data issues was only possible with convenient public exposure of a large volume of biological data through the ALA and GBIF.

The ALA and GBIF also run a comprehensive range of automated data checks, for example flagging records whose coordinates lie outside the stated country of the observation or specimen. Such automatic checks will not detect all errors. Specialist expertise therefore remains necessary to detect and correct a wide range of data issues.
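As an illustration of the kind of automated check described above, the following is a minimal sketch in Python, not the ALA's or GBIF's actual implementation. The `Record` fields, the flag names and the `COUNTRY_BOUNDS` table are all hypothetical, and the bounding boxes are rough values chosen for the example; a production check would presumably test against real country polygons.

```python
from dataclasses import dataclass

# Rough, illustrative bounding boxes: (min_lat, max_lat, min_lon, max_lon).
# These values are assumptions for the example, not authoritative borders.
COUNTRY_BOUNDS = {
    "AU": (-44.0, -9.0, 112.0, 154.0),
    "NZ": (-48.0, -34.0, 166.0, 179.0),
}


@dataclass
class Record:
    """Hypothetical occurrence record with only the fields the check needs."""
    record_id: str
    country_code: str
    latitude: float
    longitude: float


def coordinate_flags(record):
    """Return a list of flag names; an empty list means no issue was found."""
    flags = []
    # Coordinates that are not valid degrees cannot be tested any further.
    if not (-90 <= record.latitude <= 90 and -180 <= record.longitude <= 180):
        flags.append("COORDINATE_OUT_OF_RANGE")
        return flags
    bounds = COUNTRY_BOUNDS.get(record.country_code)
    if bounds is None:
        # No reference box for this country, so the check is skipped.
        flags.append("COUNTRY_NOT_CHECKED")
        return flags
    min_lat, max_lat, min_lon, max_lon = bounds
    inside = (min_lat <= record.latitude <= max_lat
              and min_lon <= record.longitude <= max_lon)
    if not inside:
        flags.append("COORDINATE_OUTSIDE_COUNTRY")
    return flags
```

A record near Hobart with the sign of its latitude accidentally flipped would be flagged by this sketch, while the correctly entered record would pass. A misidentified species at a plausible location would pass too, which is the kind of error the article says needs specialist review.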

Agencies such as GBIF and the ALA have infrastructure that simplifies error detection and correction. Aggregating many records of a species improves the chances of errors being detected. For example, one observation may be geographically isolated from other records. In the ALA, anyone can annotate an issue exposed in a record. Such annotations are sent to the data provider for evaluation and correction. Whether the record is then updated depends on the resources of the provider.
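The idea that aggregation exposes geographically isolated observations can be sketched as a nearest-neighbour test. The Python below is an assumption-laden illustration rather than a description of how either facility works: the function names and the 500 km threshold are hypothetical, and a flagged record is only a candidate for expert review, since it may be a genuine range extension.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) pairs in degrees."""
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    dlat = lat2 - lat1
    dlon = lon2 - lon1
    h = (math.sin(dlat / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def isolated_records(points, threshold_km=500.0):
    """Return indices of points whose nearest neighbour is beyond the threshold.

    `points` is a list of (lat, lon) pairs for one species. The threshold
    is an arbitrary illustrative value, not a recommended setting.
    """
    isolated = []
    for i, p in enumerate(points):
        others = [q for j, q in enumerate(points) if j != i]
        if not others:
            continue  # a single record has nothing to be isolated from
        nearest = min(haversine_km(p, q) for q in others)
        if nearest > threshold_km:
            isolated.append(i)
    return isolated
```

With three records clustered in Tasmania and one near Darwin, only the Darwin record is returned. The comparison is quadratic in the number of records, which is acceptable for a sketch but would need a spatial index at the scale of an aggregator.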

Identifying and correcting data issues is the responsibility of the whole community and not of any one agent such as the ALA. Expert knowledge and automated processes need to be integrated seamlessly and effectively, so that all amendments form part of a persistent digital knowledge base about species. Talented and committed individuals can make enormous progress in error detection and correction (as seen in Mesibov's paper), but how do we ensure that when an individual project like that on millipedes ceases, the data and all associated work are not lost? This implies standards for capturing and linking this information, and for maintaining the data with all amendments uniquely documented. To achieve this, the biodiversity research community needs to be motivated and empowered to work in a collaborative fashion.
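One way to picture "all amendments uniquely documented" is an append-only log in which each correction has its own identifier and the original value is never overwritten. The Python below is a hypothetical sketch under that assumption; the `Amendment` and `AmendmentLog` names and their fields are invented for illustration and do not come from the paper or from any ALA or GBIF schema.

```python
import uuid
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass(frozen=True)
class Amendment:
    """One immutable, uniquely identified correction to a record field."""
    record_id: str
    field_name: str
    old_value: str
    new_value: str
    author: str
    reason: str
    amendment_id: str = field(default_factory=lambda: str(uuid.uuid4()))
    created_at: str = field(
        default_factory=lambda: datetime.now(timezone.utc).isoformat())


class AmendmentLog:
    """Append-only store: amendments are added, never edited or deleted."""

    def __init__(self):
        self._entries = []

    def add(self, amendment):
        self._entries.append(amendment)
        return amendment.amendment_id

    def history(self, record_id):
        """All amendments for a record, in the order they were added."""
        return [a for a in self._entries if a.record_id == record_id]

    def current_value(self, record_id, field_name, original):
        """Replay the amendments over the original value of one field."""
        value = original
        for a in self.history(record_id):
            if a.field_name == field_name:
                value = a.new_value
        return value
```

Because the provider's original value and every later change are retained, the work of an individual expert would remain traceable after their own project ends.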

Data should be published in secure locations where they can be preserved and improved in perpetuity. The ALA and GBIF are moving beyond storage of data by individuals or institutions using stand-alone computers that do not have a strategy for enduring digital data integration, storage and access.

More information: Belbin L, Daly J, Hirsch T, Hobern D, La Salle J (2013) A specialist's audit of aggregated occurrence records: An 'aggregator's' perspective. ZooKeys 305: 67–76, doi: 10.3897/zookeys.305.5438

Journal information: ZooKeys

Provided by Pensoft Publishers

Citation: Addressing biodiversity data quality is a community-wide effort (2013, June 3) retrieved 23 April 2024 from https://phys.org/news/2013-06-biodiversity-quality-community-wide-effort.html

