You are what you tweet: Tracking public health trends with Twitter
Johns Hopkins computer scientists Mark Dredze, left, and Michael J. Paul, found that Twitter posts could yield useful public health information. Credit: Will Kirk/JHU
Twitter allows millions of social media fans to comment in 140 characters or less on just about anything: an actor's outlandish behavior, an earthquake's tragic toll or the great taste of a grilled cheese sandwich.
But by sifting through this busy flood of banter, is it possible to also track important public health trends? Two Johns Hopkins University computer scientists would respond with a one-word tweet: "Yes!"
Mark Dredze and Michael J. Paul fed 2 billion public tweets posted between May 2009 and October 2010 into computers, then used software to filter out the 1.5 million messages that referred to health matters. Identities of the tweeters were not collected by Dredze, a researcher at the university's Human Language Technology Center of Excellence and an assistant research professor of computer science, and Paul, a doctoral student.
"Our goal was to find out whether Twitter posts could be a useful source of public health information, " Dredze said. "We determined that indeed, they could. In some cases, we probably learned some things that even the tweeters' doctors were not aware of, like which over-the-counter medicines the posters were using to treat their symptoms at home."
By sorting these health-related tweets into electronic "piles," Dredze and Paul uncovered intriguing patterns about allergies, flu cases, insomnia, cancer, obesity, depression, pain and other ailments.
"There have been some narrow studies using Twitter posts, for example, to track the flu," Dredze said. "But to our knowledge, no one has ever used tweets to look at as many health issues as we did."
Dredze and Paul, who also are affiliated with the university's Center for Language and Speech Processing, have discussed some of their results in recent months at computer science conferences. They will present their complete study on July 18 in Barcelona, Spain, at the International Conference on Weblogs and Social Media, sponsored by the Association for the Advancement of Artificial Intelligence.
In addition to finding a range of health ailments in Twitter posts, the researchers were able to record many of the medications that ill tweeters consumed, thanks to posts such as: "Had to pop a Benadryl allergies are the worst."
Other tweets pointed to misuse of medicine. "We found that some people tweeted that they were taking antibiotics for the flu," Paul said. "But antibiotics don't work on the flu, which is a virus, and this practice could contribute to the growing antibiotic resistance problems. So these tweets showed us that some serious medical misperceptions exist out there."
Of course, the vast majority of daily tweets have nothing to do with an illness. While a simple approach would be to filter for words that are tied to illness, such as "headache" or "fever," this strategy fails on such tweets as "High price of gas is a headache for my business" or "Got a case of Bieber Fever. Love his new song."
To find the health-related posts among the billions of messages in their original pool, the Johns Hopkins researchers applied a filtering and categorization system they devised. With this tool, computers can be taught to disregard phrases that do not really relate to one's health, even though they contain a word commonly used in a health context.
Once the unrelated tweets were removed, the remaining results provided some surprising findings.
"When we started, I didn't even know if people talked about allergies on Twitter," Paul said. "But we found out that they do. And there was one thing I didn't expect: The system found two different types of allergies: the type that causes sniffling and sneezing and the kind that causes skin rashes and hives."
In about 200,000 of the health-related tweets, the researchers were able to draw on user-provided public information to identify the geographic state from which the message was sent. That allowed them to track some trends by time and place, such as when the allergy and flu seasons peaked in various parts of the country. "We were able to see from the tweets that the allergy season started earlier in the warmer states and later in the Midwest and the Northeast," Dredze said.
Dredze and Paul have already begun talking to public health scientists, including some affiliated with Johns Hopkins, who say that future studies of tweets could uncover even more useful data, not only about posters' medical problems but also about public perceptions concerning illnesses, medications and other health issues.
Still, Dredze and Paul cautioned that trying to take the nation's temperature by analyzing tweets has its limitations. For one thing, most Twitter users did not comment more than once on their particular ailment, making it tough to track how long the illness lasted and whether it recurred. In addition, most Twitter users tend to be young, which would exclude many senior citizens from a public health study. Also, at the moment, Twitter is dominated by users who are in the United States, making it less useful for research in other countries.
Although social media sites allow users to expose lots of personal information to friends and strangers, Twitter-based research may only reach a certain depth.
"In our study," Paul said, "we could only learn what people were willing to share. We think there's a limit to what people are willing to share on Twitter."
Nevertheless, Dredze says there is still plenty of useful data left to plumb from Twitter posts. "The people I've talked to have felt this is a really interesting research tool," he said, "and they have some great ideas about what they'd like to learn next from Twitter."
Provided by
Johns Hopkins University
-
From lemons to lemonade: Reaction uses carbon dioxide to make carbon-based semiconductor,
32 comments
-
Thioridazine kills cancer stem cells in human while avoiding toxic side-effects of conventional cancer treatments,
3 comments
-
SpaceX private rocket blasts off for space station (Update),
42 comments
-
Climate scientists say they have solved riddle of rising sea,
31 comments
-
SpaceX capsule has 'new car' smell, astronauts say (Update),
2 comments
-
Need a rigid insulation material???
15 hours ago
-
magnets or EMF in car bumpers to protect from fender bender
May 26, 2012
-
length of wire in a coil of known dimensions?
May 25, 2012
-
India Engineering Powerhouse
May 25, 2012
-
electromagnet core dereference between hard and soft iron
May 25, 2012
-
Measuring water pressure in an open tank
May 24, 2012
- More from Physics Forums - General Engineering
More news stories
Browser wars flare in mobile space
The browser wars are heating up again, but this time the fight is for dominance of the mobile Internet.
8 hours ago |
5 / 5 (1) |
3
Probability of contamination from severe nuclear reactor accidents is higher than expected: study
Catastrophic nuclear accidents such as the core meltdowns in Chernobyl and Fukushima are more likely to happen than previously assumed. Based on the operating hours of all civil nuclear reactors and the number ...
Technology / Energy & Green Tech
May 22, 2012 |
3.6 / 5 (22) |
56
|
SpotterRF debuts Radar Backpack Kit (w/ Video)
(Phys.org) -- SpotterRF has announced a special radar backpack kit designed to enhance situational awareness for soldiers on the ground. The company says its special radar is designed for warfighters as part ...
HyperSolar shows dirty water no barrier to power world
(Phys.org) -- The Santa Barbara, California, company, HyperSolar, is set to transparently share the ups and downs of its research experiences toward the companys ultimate vision, successfully producing ...
Tesla to launch electric sedan in US on June 22
Tesla Motors said Tuesday it would begin deliveries of "the world's first premium electric sedan" on June 22, slightly ahead of schedule.
Technology / Energy & Green Tech
May 22, 2012 |
4.5 / 5 (12) |
18
Change in developmental timing was crucial in the evolutionary shift from dinosaurs to birds: study
At first glance, it's hard to see how a common house sparrow and a Tyrannosaurus Rex might have anything in common. After all, one is a bird that weighs less than an ounce, and the other is a dinosaur that ...
Computer model used to pinpoint prime materials for efficient carbon capture
When power plants begin capturing their carbon emissions to reduce greenhouse gases and to most in the electric power industry, it's a question of when, not if it will be an expensive undertaking.
'Unzipped' carbon nanotubes could help energize fuel cells, batteries
Multi-walled carbon nanotubes riddled with defects and impurities on the outside could replace some of the expensive platinum catalysts used in fuel cells and metal-air batteries, according to scientists at ...
T cells 'hunt' parasites like animal predators seek prey, study shows
By pairing an intimate knowledge of immune-system function with a deep understanding of statistical physics, a cross-disciplinary team at the University of Pennsylvania has arrived at a surprising finding: T cells use a movement ...
Manufacturing genes to attack flu virus
An international research team has manufactured a new protein that can combat deadly flu epidemics.
Yale study concludes public apathy over climate change unrelated to science literacy
Are members of the public divided about climate change because they don't understand the science behind it? If Americans knew more basic science and were more proficient in technical reasoning, would public consensus match ...