Scientists propose tactics for ethical use of Twitter data in research studies

Jun 10, 2014 by Maureen Lawrence-Kuether.

Twitter has emerged as one of the most popular social networks in the world. Billions of tweets present an unfiltered view of public opinions and experiences. Researchers can deduce a lot from those 140 characters—from tracking disease outbreaks to predicting social unrest.

Unlike other , Twitter provides multiple application programming interfaces that allow for real-time data streaming. Metadata from the stream can even include geolocations that pinpoint the exact geographic coordinates of a user.

The vast amount of data available from tweets is extremely beneficial for , but handling that information requires an ethical framework.

Caitlin Rivers of Hanover, New Hampshire, a doctoral student in the Genetics, Bioinformatics, and Computational Biology Program, along with Bryan Lewis, a research scientist in the Network Dynamics and Simulation Science Laboratory at the Virginia Bioinformatics Institute, National Capital Region, have proposed a set of ethical guidelines for scientific researchers using Twitter data. Their opinion article first appeared in F1000 Research, an open science journal for life scientists.

Currently researchers can access a random sample of approximately 1 percent of all live-streamed tweets or retrospectively gather up to 3,200 tweets from a single user. In 2010, Twitter donated its entire historical record of tweets to the U.S. Library of Congress.

In February 2014, Twitter introduced a pilot program that would give researchers full access to Twitter's public and historical data. The variety of ways researchers can utilize Twitter as a research tool is likely to continue to grow.

By viewing online spaces with the same regard as physical spaces, there are social norms regarding privacy. However, Twitter users often do not expect their movements will be tracked or their personal information recorded. Even in public places, common standards of privacy are observed.

"Why should online spaces like Twitter not be subject to those same standards?" said Rivers, a Department of Defense Science, Mathematics, and Research for Transformation scholar.

Researchers interested in user-centric studies look at the Twitter activity of individuals, but doing so raises potential privacy concerns. Twitter data is public information, but many individuals claim a reasonable expectation of privacy. Even if a user's assumptions on privacy are incorrect, there are presumptions that because something is shared publicly, the user consents to it being used for research.

"Twitter is an amazing data source for scientists, but it's also a potential privacy risk," said Rivers. "Our guidelines are simple ways for researchers to respect the privacy of Twitter users while still conducting valuable research."

Modeled after the U.S. Consumer Privacy Bill of Rights, Rivers and Lewis outline six guidelines for the ethical use of Twitter data:

  • Make objectives, methodologies, and data handling practices transparent and easily accessible.
  • Protect the anonymity of tweet authors by not publishing identifiable information without consent.
  • Honor Twitter users' efforts to control their personal data by omitting private and deleted tweets.
  • Do not use tweet data to aggregate personal information from other sources.
  • Work collaboratively with the Institutional Review Board for study designs that may compromise and anonymity.
  • Respect the context in which a tweet was sent.

Lewis, a public health policy analyst in the National Capital Region, points out these tactics are "not to preclude using data in this manner, but only to encourage researchers to adhere to existing ethical standards."

The proposed guidelines are meant to inspire an open discussion among the research community.

"We are at a point in digital research where the technical advances are outpacing standards of ethical practice," Rivers said.

Explore further: Twitter buys data analytics partner Gnip

More information: The paper is available online: f1000research.com/articles/3-38/v1#reflist

add to favorites email to friend print save as pdf

Related Stories

Twitter buys data analytics partner Gnip

Apr 15, 2014

Twitter says it has bought its data partner Gnip, which provides analysis of the more than 500 million tweets its users share each day—to advertisers, academic institutions, politicians and other customers.

Recommended for you

Voice, image give clues in hunt for Foley's killer

Aug 21, 2014

Police and intelligence services are using image analysis and voice-recognition software, studying social media postings and seeking human tips as they scramble to identify the militant recorded on a video ...

Smartphone-loss anxiety disorder

Aug 21, 2014

The smart phone has changed our behavior, sometimes for the better as we are now able to connect and engage with many more people than ever before, sometimes for the worse in that we may have become over-reliant on the connectivity ...

Why conspiracy theorists won't give up on MH17 and MH370

Aug 20, 2014

A huge criminal investigation is underway in the Netherlands, following the downing of flight MH17. Ten Dutch prosecutors and 200 policemen are involved in collecting evidence to present at the International Criminal Court in the Hague. The inv ...

Here's how you find out who shot down MH17

Aug 20, 2014

More than a month has passed since Malaysia Airlines flight MH17 crashed with the loss of all 298 lives on board. But despite the disturbances at the crash site near the small town of Grabovo, near Donetsk ...

Assange talks of leaving embassy, sowing confusion

Aug 18, 2014

WikiLeaks founder Julian Assange sowed confusion Monday with an announcement that appeared to indicate he was leaving his embassy bolt hole, but his spokesman later clarified that that would not happen unless ...

User comments : 0