Study shows just how fast censorship can occur in social media

Mar 14, 2013 by Jade Boyd
A new study analyzes how controversial posts are deleted in near real time from the Twitter-like Chinese social media service called Weibo, which hosts about 100 million messages per day. Credit: Photos.com/Rice University

(Phys.org) —An analysis of censorship patterns on the Twitter-like Chinese social media service called Weibo gives the clearest picture yet of how the site's operator, Sina Weibo, finds and deletes controversial posts in near real time, despite a daily volume of 100 million messages. The study, which was conducted by an independent researcher and collaborators at Rice University and the University of New Mexico (UNM), is available online and undergoing peer review.

"Other people have explored on Weibo, but this work is focused on the speed at which censorship happens," said lead researcher Dan Wallach, professor of computer science at Rice and co-author of a forthcoming study that was recently posted on the pre-print site arxiv.org.

A team led by Wallach and UNM's Jed Crandall worked with the study's lead author, an independent researcher named Tao Zhu. Their analysis indicates that Weibo uses a combination of keyword- and human to monitor and delete potentially controversial posts on Weibo. By closely monitoring individuals who frequently post controversial messages, Sina Weibo is able to delete many objectionable posts in less than five minutes, the study found.

Launched three years ago, Weibo, like , allows users to post 140-character messages with usernames and hashtags. About 300 million people use Weibo, which is China's most popular microblogging service. Users post 100 million messages each day on Weibo.

For the study, researchers began by following 25 "sensitive" users that they had discovered by doing a search for people who had used words previously banned by Weibo. To broaden their search, the researchers added more than 3,000 users who had reposted one of the 25 sensitive users more than five times. They then followed this expanded group for a period of time and measured how often and how quickly their posts were deleted. Any user with more than five deleted posts was added to the pool of sensitive users.

After 15 days, the sensitive group included 3,567 users. The researchers found that on average, about 4,500 posts by the sensitive users were deleted each day, including about 1,500 that were deleted at the network level by Sina Weibo. The team's censorship-tracking software was able to track, within one minute, the amount of time a post remained online before it was deleted.

The researchers found that deletions happened most heavily in the first hour after an original post had been made, and nearly 90 percent of deletions occurred within 24 hours. The analysis also revealed a sophisticated mechanism to remove all reposts of deleted posts, often within five minutes of the original post's deletion. Deletion times were found to be significantly shorter for a subset of users who tended to post deleted content most often, an indication that Sina Weibo actively monitors the activity of some users.

"Roughly 12 percent of the total posts from our sensitive users were eventually deleted," Wallach said. "We have enough of these posts to be able to run topical analysis algorithms that let us extract the main subjects that Weibo's censors seemed concerned with on any given day."

To date, researchers have collected 470 million posts from the Weibo public timeline and 2.38 million posts from a user timeline.

"Measuring how censorship is practiced by Weibo allows us to examine industry practices that may become more widespread," said Crandall, assistant professor of computer science at UNM. "There has been considerable debate in the U.S. recently about extending copyright law enforcement to include various kinds of filtering online. China already has laws in place for companies within China to filter online content."

Wallach and Crandall said that Weibo, as one of China's largest social networking companies, faces the dual challenge of keeping its users engaged—and thereby watching advertisements and making money for Weibo—while keeping the content it hosts compliant with local laws.

"Weibo gives us a window into the future for what Internet censorship of around the world may look like," Wallach said. "Former Supreme Court Justice Louis Brandeis championed transparency a century ago when he wrote, 'Sunlight is said to be the best of disinfectants.' We hope that our research shines a light on how laws created by governments and implemented by the private sector can affect free speech everywhere, including here in the U.S."

Explore further: MU researchers develop more accurate Twitter analysis tools

More information: Research available for download:

The Velocity of Censorship: High-Fidelity Detection of Microblog Post Deletions
arxiv.org/abs/1303.0597
Tracking and Quantifying Censorship on a Chinese Microblogging Site
arxiv.org/pdf/1211.6166v1.pdf
An Analysis of Chinese Search Engine Filtering
arxiv.org/pdf/1107.3794.pdf

Related Stories

China's Tencent launches English microblog site

Oct 11, 2011

Chinese Internet giant Tencent has launched an English version of its Twitter-like microblogging service in a push to tap the overseas market, the official Xinhua news agency reported Tuesday.

China microblogging site to tighten controls

Sep 19, 2011

China's popular micoblogging site Weibo said it was tightening controls over its Twitter-like service, state press said Monday, amid concerns over growing government interference on the web.

China extends microblog rules to south: report

Dec 22, 2011

China is extending rules requiring microblog users to register under their real names to Guangdong, state media said Thursday, after a spate of violent protests in the southern province.

China's biggest microblog tops 200 million users

Aug 18, 2011

A popular social networking service used by Chinese people to vent their anger over a deadly July train crash now has more than 200 million users, owner Sina.com said Thursday.

Recommended for you

Avatars make the Internet sign to deaf people

10 hours ago

It is challenging for deaf people to learn a sound-based language, since they are physically not able to hear those sounds. Hence, most of them struggle with written language as well as with text reading ...

Chameleon: Cloud computing for computer science

Aug 26, 2014

Cloud computing has changed the way we work, the way we communicate online, even the way we relax at night with a movie. But even as "the cloud" starts to cross over into popular parlance, the full potential ...

User comments : 4

Adjust slider to filter visible comments by rank

Display comments: newest first

frajo
1 / 5 (1) Mar 14, 2013
It would have been more interesting to measure the speed of censorship on Youtube (by German GEMA intervention) or Western social media. Just post something deemed to be a terrorist plot or child pornography.
Don't try to tell me the Western camp (including Saudi Arabia and Bahrain) doesn't have vastly superior methods of censorship and other cyber interventions.
Doug_Huffman
1 / 5 (2) Mar 14, 2013
Western censorship may not be so straight forward, being a tool of repression, social, political, academic.
Shakescene21
not rated yet Mar 14, 2013
@frajo -- What do you mean by "the Western camp"? And, No, Saudi Arabia is no more Western than you are. Are you a Chinese Government tool?
CQT
not rated yet Mar 30, 2013
Censorship is archaic. False flagging is the superior method.
"Just post something..." that fulfills the function labeled libel.

Of course you are not the source of libel. The source of libel is hand picked, custom tailored...and rigged.

One of many "other cyber interventions"