Method to visualize hidden statistical structures in environmental data

January 29, 2018, King Abdullah University of Science and Technology
Method to visualize hidden statistical structures in environmental data
Huang Huang (left) and Ying Sun have developed a method for visualizing the spatio-temporal covariance properties of a dataset, which will help make sense of environmental data. Credit: KAUST

Prediction of climate and weather relies on statistical models that can capture variability at one location over time as well as the relationship with other geographical locations. Sometimes future conditions at one location can be predicted from the current conditions at another location, while in other cases there may be no such correlation. The assumption of whether two sites are 'covariant' in one way or another can have profound implications for the accuracy of the statistical model, and so the choice of space-time covariance is crucial.

Ying Sun and her student Huang Huang from KAUST have now developed a method for visualizing the spatio-temporal covariance properties of a dataset, greatly simplifying an important modeling step that previously demanded painstaking exploratory .

"We propose an easy and convenient way to visualize the properties of the covariance structure in the data, which will help practitioners choose appropriate statistical models for covariances," says Sun. "In particular, this method is useful for data that are observed sparse in space and dense in time, which is often the case for weather station observations for example."

Sun and Huang considered two key types of covariance-symmetry and separability. Symmetry implies that the spatial-temporal processes are reversible in time, while separability indicates that the correlation in time does not interact with that in space.

"Assuming a fully symmetric or a separable covariance leads to a much simpler model and thus fast computations," says Sun. "However, this assumption may be violated in many real applications, leading to less accurate estimation and prediction."

Huang and Sun used a functional data analysis approach to construct test functions from the covariances in time series data between pairs. These test functions effectively summarize the properties of separability or symmetry and can be displayed as boxplots that show the degree of non-separability or asymmetry.

"We applied this approach to meteorological observations and simulated weather data from some commonly used climate models," says Huang. "In the reported examples for a study area in the North Atlantic Ocean, this method showed that wind speed and surface temperature have different covariance structures in different seasons."

The visualization can be computed relatively quickly for a handful of monitoring stations, and the researchers note that the computational efficiency can be improved for larger numbers of stations by dividing the problem into sub-regions. Nevertheless, the method provides a valuable tool that will greatly assist practitioners.

Explore further: Dataset size counts for better climate and environmental predictions

More information: Visualization and Assessment of Spatio-temporal Covariance Properties.

Related Stories

Simple statistics can be good enough

November 7, 2017

Study of the mismatch between spatial environmental data and a commonly used statistical analysis suggests simpler statistics are sufficient in many cases.

Modeling where the wind blows

January 9, 2018

By incorporating geographical information into models for wind energy, researchers from KAUST have developed an innovative statistical tool that reduces the computational burden of locating global wind resources.

Improving connections for spatial analysis

March 7, 2017

A statistical model that accounts for common dependencies in spatial data yields more realistic results for studies of temperature, wind and pollution levels.

Recommended for you

Not all stem cells are created equal, study reveals

March 22, 2019

Researchers from the University of Toronto's Institute for Biomaterials and Biomedical Engineering (IBBME) and the Donnelly Centre have discovered a population of cells – dubbed to be "elite" – that play a key role in ...

Ancient birds out of the egg running

March 22, 2019

The ~125 million-year-old Early Cretaceous fossil beds of Los Hoyas, Spain, have long been known for producing thousands of petrified fish and reptiles (Fig. 1). However, researchers have uncovered an extremely rare, nearly ...

Making solar cells is like buttering bread

March 22, 2019

Formamidinium lead iodide is a very good material for photovoltaic cells, but getting the correct stable crystal structure is a challenge. The techniques developed so far have produced poor results. However, University of ...


Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.