Highlighting the lakes identified by ReaLSAT and HydroLakes (blue), as well as the lakes present in ReaLSAT but not HydroLakes (red) for different regions in world: a) small reservoirs in India, b) water-intensive agriculture in Vietnam, c) natural lakes in the U.S. and wetlands in Venezuela and d) shallow lakes in Australia. Credit: ReaLSAT, University of Minnesota

An interdisciplinary team of researchers, led by University of Minnesota Twin Cities data scientists, has published a first-of-its-kind comprehensive global dataset of the lakes and reservoirs on Earth showing how they have changed over the last 30+ years.

The data will provide environmental researchers with new information about land and fresh water use as well as how lakes and are being impacted by humans and . The research is also a major advancement in machine learning techniques.

A paper highlighting the Reservoir and Lake Surface Area Timeseries (ReaLSAT) dataset was recently published in Scientific Data.

Highlights of the study include:

  • The ReaLSAT dataset contains the location and surface area variations of 681,137 lakes and reservoirs larger than 0.1 square kilometers (south of 50 degrees north latitude). The previous most comprehensive database, called HydroLAKES, had identified only 245,420 lakes and reservoirs for the part of the world and minimum size being considered in this study.
  • ReaLSAT provides data on the surface area of each body of water for each month from 1984 to 2015. This makes it possible to quantify changes in and reservoir area over time, which is key to understanding how changing climate and land use are altering bodies of fresh water. The HydroLAKES data contains only a static shape for each water body.
  • The ReaLSAT dataset is the culmination of eight years of research. It represents a major milestone in the application of new knowledge-guided machine learning for use in the environmental sciences. Unlike other existing efforts, this dataset can now be extended nearly automatically via machine learning and can be quickly replicated for a wide variety of earth observation data that are becoming available at increasingly better resolution.

An image of Minnesota lakes identified using the ReaLSAT dataset (red) is combined with a similar image of the area where lakes were identified in the previous HydroLAKES dataset (blue). The ReaLSAT dataset identifies almost three times as many lakes and reservoirs worldwide compared to HydroLAKES. Credit: ReaLSAT, University of Minnesota

"Around the world, we are seeing lakes and reservoirs changing rapidly with seasonal precipitation patterns, long-term changes in climate, and human management decisions," said Vipin Kumar, the senior author of the study and Regents Professor and William Norris Endowed Chair in the University of Minnesota Twin Cities Department of Computer Science and Engineering. "This new dataset greatly improves the ability of scientists to understand the impact of changing climate and human actions on our fresh water across the globe."

Building a global of lakes and reservoirs and how they are changing required a new type of machine learning algorithms that meld knowledge of the physical dynamics of water bodies with satellite imagery.

"ReaLSAT is a shining example where motivated a new class of knowledge-guided algorithms that are now being used in numerous scientific applications," Kumar said.

Scientists who study the environment agree that ReaLSAT will improve their work.

"The availability and quality of surface fresh water is central to sustainable use of our planet," said Paul C. Hanson, a Distinguished Research Professor at the University of Wisconsin-Madison Center for Limnology and a co-author of the study. "Because ReaLSAT shows changes in lakes and their boundaries, rather than just water pixels across the landscape, we can now connect ecosystem process about water quality with hundreds of thousands of lakes around the world."

More information: Ankush Khandelwal et al, ReaLSAT, a global dataset of reservoir and lake surface area variations, Scientific Data (2022). DOI: 10.1038/s41597-022-01449-5

Journal information: Scientific Data