July 30, 2018

Statistics, computer science students collaborate on real-world data problems through mini-think tanks

by Trevor Butterworth, American Statistical Association

What is the difference between statistics and data science—and, perhaps more importantly, why do we have two fields with what seems to be the same focus? The best way to understand the emergence of data science as a separate discipline, explains Herman "Gene" Ray, director of the Center for Statistics and Analytical Research at Kennesaw State University, is to see data science as the merger of computer science and statistics. "Most traditional statistics programs teach you a lot of theory and how to work out problems by hand," he says. "Computer applications are something of an afterthought. But businesses aren't going to analyze 100 million records by hand; they're dealing with huge convenience samples. And that's where data science steps in."

And that's where the academic infighting starts: Statisticians say data scientists lack the statistical or mathematical foundation to understand data collection and analysis, and data scientists roll their eyes at statisticians for their lack of programming savvy. This, says Ray, was the biggest obstacle they faced in creating one of the first US Ph.D. programs in analytics and data science: How do you combine statistics and computer science? "Each one thinks they can do it without the other," he says. "But the reality is that most statisticians are not very good programmers, and most computer scientists don't really understand some of the nuances of statistics. Our goal is to bridge that divide."

Their solution, in part, leveraged the increasing awareness among Atlanta-area businesses of the importance of data. The Analytics and Data Science Institute created nine sponsored research laboratories, each focused on data problems facing a business or public service or nonprofit, and each with one to four Ph.D. students led by a faculty member. "They're like miniature think tanks exploring real-world problems," says Ray. "And in doing so, students get to understand the problem from the computer science and the statistical perspective." A more traditionally minded statistics student might be encouraged by a colleague to explore neural networks, while a more traditionally minded computer science student might be encouraged to see why they have to use representative sampling over convenience sampling.

One recent project involved working with Cobb County Fire Department, a suburb of Atlanta, which was not meeting the national metrics for fire standards. "We took all their data for fire and ambulance events—the time of the first phone call to the time the ambulance left the firehouse to the time it took it to get to an event. We looked at the routes and traffic patterns, and then optimized response times using graft theory and Google Maps." Routes were changed, fire zones reallocated, and response times were cut. "The Cobb County fire chief is very data savvy," says Ray, "so he's implementing incremental changes and then seeing how the data updates."

The research laboratories also add another dimension—and an increasingly important one—to student experience: how to talk to people who aren't statisticians or data scientists.

"When I was trained, the expectation was that I would work with other statisticians and present at academic conferences," says Ray. "So, we all spoke the same language. Today, a data scientist could be speaking with an executive, or client, or policymaker, who has very little statistics background at all. They must be able to read this really quickly, and make sure the right message is still communicated at the appropriate level. That's one of the beautiful things about these labs—they force everyone to learn how to speak in a way for the lab to be successful."

More information: http://ww2.amstat.org/meetings/jsm/2018/onlineprogram/ActivityDetails.cfm?SessionID=215542

Provided by American Statistical Association

Citation: Statistics, computer science students collaborate on real-world data problems through mini-think tanks (2018, July 30) retrieved 6 July 2024 from https://phys.org/news/2018-07-statistics-science-students-collaborate-real-world.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

ASA issues statement on role of statistics in data science

61 shares

Feedback to editors

Statistics, computer science students collaborate on real-world data problems through mini-think tanks

High-selectivity graphene membranes enhance CO₂ capture efficiency

Exploring the possibility of probing fundamental spacetime symmetries via gravitational wave memory

Starlings' migratory behavior found to be inherited, not learned

Webb captures a staggering quasar-galaxy merger in the remote universe

Repurposed technology used to probe new regions of Mars' atmosphere

Evidence shows ancient Saudi Arabia had complex and thriving communities, not struggling people in a barren land

Research finds humpbacks were happier during pandemic pause

Webb admires bejeweled ring of the lensed quasar RX J1131-1231

Researchers demonstrate economical process for the synthesis and purification of ionic liquids

New probe reveals water-ice microstructures

Relevant PhysicsForums posts

P-adic numbers and the Ramanujan summation

Implication vs Equivalence

What motivates famous mathematicians?

Innumeracy in public media today

Shrinking a polygon -- calculation logic

Why are the axes taken as perpendicular to each other?

ASA issues statement on role of statistics in data science

Are statistics behind Pokémon Go's success?

More students earning statistics degrees; not enough to meet surging demand

Leading statisticians establish steps to convey statistics a science not toolbox

New training platform for big data analysis

Statistics is the fastest-growing undergraduate STEM degree

Merging AI and human efforts to tackle complex mathematical problems

New mathematical proof helps to solve equations with random components

Study finds cooperation can still evolve even with limited payoff memory

Study shows the power of social connections to predict hit songs

Wire-cut forensic examinations currently too unreliable for court, new study says

How can we make good decisions by observing others? A videogame and computational model have the answer

Medical Xpress

Tech Xplore

Science X

Statistics, computer science students collaborate on real-world data problems through mini-think tanks

High-selectivity graphene membranes enhance CO₂ capture efficiency

Exploring the possibility of probing fundamental spacetime symmetries via gravitational wave memory

Starlings' migratory behavior found to be inherited, not learned

Webb captures a staggering quasar-galaxy merger in the remote universe

Repurposed technology used to probe new regions of Mars' atmosphere

Evidence shows ancient Saudi Arabia had complex and thriving communities, not struggling people in a barren land

Research finds humpbacks were happier during pandemic pause

Webb admires bejeweled ring of the lensed quasar RX J1131-1231

Researchers demonstrate economical process for the synthesis and purification of ionic liquids

New probe reveals water-ice microstructures

Relevant PhysicsForums posts

Related Stories

ASA issues statement on role of statistics in data science

Are statistics behind Pokémon Go's success?

More students earning statistics degrees; not enough to meet surging demand

Leading statisticians establish steps to convey statistics a science not toolbox

New training platform for big data analysis

Statistics is the fastest-growing undergraduate STEM degree

Recommended for you

Merging AI and human efforts to tackle complex mathematical problems

New mathematical proof helps to solve equations with random components

Study finds cooperation can still evolve even with limited payoff memory

Study shows the power of social connections to predict hit songs

Wire-cut forensic examinations currently too unreliable for court, new study says

How can we make good decisions by observing others? A videogame and computational model have the answer

Newsletter sign up

Donate and enjoy an ad-free experience