July 11, 2016

When is big data too big? Making data-based models comprehensible

Data-driven mathematical modeling is having an enormous impact on the ability to organize and describe very large data sets, and make inferences and predictions about populations and situations based on sampling data. However, as these models become increasingly complex, the ability of users to understand and apply them represents a growing challenge. The article "A Framework for Considering Comprehensibility in Modeling", which describes this emerging dilemma and a strategy for developing solutions, is published in Big Data.

Michael Gleicher, University of Wisconsin-Madison, defines comprehensibility as "the ability of the various stakeholders to understand relevant aspects of the modeling process." He suggests that comprehensibility should be a key goal in model development. However, as models become more sophisticated, tradeoffs may be inevitable—even between understandability and accuracy—in some cases, improving comprehensibility may help achieve other goals in modeling.

"Gleicher provides a holistic framework of comprehensibility that considers what the various stakeholders in a data science project do and don't understand easily and their need for comprehensibility," says Big Data Editor-in-Chief Vasant Dhar, Professor at the Stern School of Business and the Center for Data Science at New York University. "More broadly, the article highlights comprehensibility from a human-centric standpoint, identifying the role and needs of humans in complex data science projects."

More information: Michael Gleicher, A Framework for Considering Comprehensibility in Modeling, Big Data (2016). DOI: 10.1089/big.2016.0007

Provided by Mary Ann Liebert, Inc

Citation: When is big data too big? Making data-based models comprehensible (2016, July 11) retrieved 10 May 2024 from https://phys.org/news/2016-07-big-data-based-comprehensible.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Understanding accents: Effective communication is about more than simply pronunciation

40 shares

Feedback to editors

Scientists unlock key to breeding 'carbon gobbling' plants with a major appetite

3 hours ago

Clues from deep magma reservoirs could improve volcanic eruption forecasts

3 hours ago

Study shows AI conversational agents can help reduce interethnic prejudice during online interactions

3 hours ago

NASA's Chandra notices the galactic center is venting

3 hours ago

Wildfires in old-growth Amazon forest areas rose 152% in 2023, study shows

4 hours ago

GoT-ChA: New tool reveals how gene mutations affect cells

5 hours ago

Accelerating material characterization: Machine learning meets X-ray absorption spectroscopy

5 hours ago

Life expectancy study reveals longest and shortest-lived cats

5 hours ago

New research shows microevolution can be used to predict how evolution works on much longer timescales

5 hours ago

Stable magnetic bundles achieved at room temperature and zero magnetic field

5 hours ago

Load comments (0)

When is big data too big? Making data-based models comprehensible

Scientists unlock key to breeding 'carbon gobbling' plants with a major appetite

Clues from deep magma reservoirs could improve volcanic eruption forecasts

Study shows AI conversational agents can help reduce interethnic prejudice during online interactions

NASA's Chandra notices the galactic center is venting

Wildfires in old-growth Amazon forest areas rose 152% in 2023, study shows

GoT-ChA: New tool reveals how gene mutations affect cells

Accelerating material characterization: Machine learning meets X-ray absorption spectroscopy

Life expectancy study reveals longest and shortest-lived cats

New research shows microevolution can be used to predict how evolution works on much longer timescales

Stable magnetic bundles achieved at room temperature and zero magnetic field

Relevant PhysicsForums posts

Most efficient way to randomly choose a word from a file with a list of words

Parallel processing for loops and pointer defined outside the loop

Links from navbar made with React don't work

Passing variables in FORTRAN

User-Defined Functions in Sql Server SSMS

Classifiers, threshold, and ROC curve

Understanding accents: Effective communication is about more than simply pronunciation

Novel type 2 diabetes risk model more accurately assesses disease trajectory

Real-time visualization tool reveals behavioral patterns in Bitcoin transactions

Large-scale analytics system for predicting major societal events described in Big Data Journal

Simulation tool uses FinTech quant techniques and big data to guide best health insurance plan

Big data is transforming healthcare—from diabetes to the ER to research

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

When is big data too big? Making data-based models comprehensible

Scientists unlock key to breeding 'carbon gobbling' plants with a major appetite

Clues from deep magma reservoirs could improve volcanic eruption forecasts

Study shows AI conversational agents can help reduce interethnic prejudice during online interactions

NASA's Chandra notices the galactic center is venting

Wildfires in old-growth Amazon forest areas rose 152% in 2023, study shows

GoT-ChA: New tool reveals how gene mutations affect cells

Accelerating material characterization: Machine learning meets X-ray absorption spectroscopy

Life expectancy study reveals longest and shortest-lived cats

New research shows microevolution can be used to predict how evolution works on much longer timescales

Stable magnetic bundles achieved at room temperature and zero magnetic field

Relevant PhysicsForums posts

Related Stories

Understanding accents: Effective communication is about more than simply pronunciation

Novel type 2 diabetes risk model more accurately assesses disease trajectory

Real-time visualization tool reveals behavioral patterns in Bitcoin transactions

Large-scale analytics system for predicting major societal events described in Big Data Journal

Simulation tool uses FinTech quant techniques and big data to guide best health insurance plan

Big data is transforming healthcare—from diabetes to the ER to research

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience