March 30, 2020

Projecting the outcomes of people's lives with AI isn't so simple

The machine learning techniques scientists use to predict outcomes from large datasets may fall short when it comes to projecting the outcomes of people's lives, according to a mass collaboration study led by researchers at Princeton University, working with researchers at many other institutions, including Virginia Tech.

This mass collaboration, called the Fragile Families Challenge, brought together a cohort of scientists who built statistical and machine-learning models to predict and measure life outcomes for children, parents, and households across the United States.

Published by 112 co-authors in the Proceedings of the National Academy of Sciences, the results suggest that sociologists and data scientists should exercise caution when applying predictive modeling, especially in the criminal justice system and social programs.

Even after using state-of-the-art modeling and a high-quality dataset containing 13,000 data points for more than 4,000 families, the best AI predictive models were not very accurate.

Brian J. Goode, a research scientist from Virginia Tech's Fralin Life Sciences Institute, was among the data and social scientists who took part in the Fragile Families Challenge.

"It's one effort to try to capture the complexities and intricacies that compose the fabric of a human life in data and models. But, it is compulsory to take the next step and contextualize models in terms of how they are going to be applied in order to better reason about expected uncertainties and limitations of a prediction. That's a very difficult problem to grapple with, and I think the Fragile Families Challenge shows that we need more research support in this area, particularly as machine learning has a greater impact on our everyday lives," said Goode.

Goode's modeling was conducted through the Discovery Analytics Center at Virginia Tech. There, he teamed up with Naren Ramakrishnan, the center's director and the Thomas L. Phillips Professor of Engineering, and Debanjan Datta, a Ph.D. student in the Department of Computer Science in the College of Engineering, who were instrumental in gathering and analyzing data.

The Virginia Tech team has also published research in a special issue of Socius, a new open-access journal from the American Sociological Association. In order to support additional research in this area, all the submissions to the Challenge—code, predictions and narrative explanations—are publicly available.

"The study also shows us that we have so much to learn, and mass collaborations like this are hugely important to the research community," said the PNAS study co-lead author Matt Salganik, professor of sociology at Princeton and interim director of the Center for Information Technology Policy, based at Princeton's Woodrow Wilson School of Public and International Affairs.

The project was inspired by Wikipedia, one of the world's first mass collaborations, which was created in 2001 as a shared encyclopedia. Salganik pondered what other scientific problems could be solved through a new form of collaboration, and that's when he joined forces with Sara McLanahan, the William S. Tod Professor of Sociology and Public Affairs at Princeton, as well as Princeton graduate students Ian Lundberg and Alex Kindel, both in the Department of Sociology.

McLanahan is principal investigator of the Fragile Families and Child Wellbeing Study based at Princeton and Columbia University, which has been studying a cohort of about 5,000 children born in large American cities between 1998 and 2000, with an oversampling of children born to unmarried parents. The longitudinal study was designed to understand the lives of children born into unmarried families.

Through surveys collected in six waves (when the child was born and then when the child reached ages 1, 3, 5, 9, and 15), the study has captured millions of data points on children and their families. Another wave will be captured at age 22.

At the time the researchers designed the challenge, data from age 15 (which the researchers call the "hold-out data" in the paper) had not yet been made publicly available. This created an opportunity to ask other scientists to predict life outcomes of the people in the study through a mass collaboration.

The co-organizers received 457 applications from 68 institutions from around the world, including from several teams based at Princeton. Using the Fragile Families data, participants were asked to predict one or more of the six life outcomes at age 15. These included child grade point average (GPA); child grit; household eviction; household material hardship; primary caregiver layoff; and primary caregiver participation in job training.

The challenge was based around the common task method, a research design used frequently in computer science but not in the social sciences. This method releases some but not all of the data, allowing people to use whatever technique they want to determine outcomes. The goal is to accurately predict the hold-out data, no matter how fancy a technique it takes to get there.
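The common task method described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Challenge's actual pipeline: the function names, the 25 percent hold-out fraction, the synthetic data, and the scoring rule (squared error compared with a baseline that always predicts the training mean) are all hypothetical choices made for the example.

```python
import random
from statistics import mean


def split_common_task(records, holdout_fraction=0.25, seed=0):
    """Organizer step: shuffle the records and withhold a fraction as hold-out.

    The hold-out fraction and seed are illustrative assumptions.
    """
    rng = random.Random(seed)
    shuffled = list(records)
    rng.shuffle(shuffled)
    n_holdout = int(len(shuffled) * holdout_fraction)
    return shuffled[n_holdout:], shuffled[:n_holdout]  # (training, hold-out)


def mean_squared_error(y_true, y_pred):
    return mean((t - p) ** 2 for t, p in zip(y_true, y_pred))


def score_submission(predict, training, holdout):
    """Organizer step: score a participant's model on the withheld outcomes.

    Returns 1 - MSE(model) / MSE(baseline), where the baseline always predicts
    the mean training outcome. 1.0 is a perfect score; 0.0 is no better than
    the baseline. This scoring rule is an assumption for the sketch.
    """
    baseline = mean(y for _, y in training)
    y_true = [y for _, y in holdout]
    model_mse = mean_squared_error(y_true, [predict(x) for x, _ in holdout])
    baseline_mse = mean_squared_error(y_true, [baseline] * len(holdout))
    return 1 - model_mse / baseline_mse


def fit_line(training):
    """Participant step: any technique is allowed; here, a least-squares line."""
    xs = [x for x, _ in training]
    ys = [y for _, y in training]
    x_bar, y_bar = mean(xs), mean(ys)
    slope = sum((x - x_bar) * (y - y_bar) for x, y in training) / sum(
        (x - x_bar) ** 2 for x in xs
    )
    intercept = y_bar - slope * x_bar
    return lambda x: slope * x + intercept


if __name__ == "__main__":
    # Synthetic (feature, outcome) pairs standing in for real survey data.
    noise = random.Random(1)
    records = [(x, 2 * x + noise.gauss(0, 1)) for x in range(100)]
    training, holdout = split_common_task(records)
    model = fit_line(training)  # participants only ever see `training`
    print("hold-out score:", score_submission(model, training, holdout))
```

The key design point is that participants never see the hold-out outcomes, so every submission is judged on the same unseen data regardless of the technique behind it.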

The team is currently applying for grants to continue research in this area.

The paper, "Measuring the predictability of life outcomes with a scientific mass collaboration," was published on March 30 by PNAS.

More information: Matthew J. Salganik et al., "Measuring the predictability of life outcomes with a scientific mass collaboration," PNAS (2020). www.pnas.org/cgi/doi/10.1073/pnas.1915006117

Journal information: Proceedings of the National Academy of Sciences

Provided by Virginia Tech

Citation: Projecting the outcomes of people's lives with AI isn't so simple (2020, March 30) retrieved 28 April 2024 from https://phys.org/news/2020-03-outcomes-people-ai-isnt-simple.html

