The Pacific slope of Peru is greening, but it's not good news

Research led by physicists and geographers at the University of Cambridge has unveiled some large-scale changes in the vegetation in the South American Andes which may have dramatic impact on the environment and ecosystems ...

'Living fossil' may upend basic tenet of evolutionary theory

The field of evolutionary biology has seen its share of spirited debates. But if there's one principle that virtually every expert in the field agrees on, it's that natural selection occurs at the level of the genome.

12 matter particles suffice in nature

How many matter particles exist in nature? Particle physicists have been dealing with this question for a long time. The 12 matter particles contained in the standard model of particle physics? Or are there further particles ...

Gene transfer from transgenic crops: A more realistic picture

A new data-driven statistical model that incorporates the surrounding landscape in unprecedented detail describes the transfer of an inserted bacterial gene via pollen and seed dispersal in cotton plants more accurately than ...

Data scientists build more honest prediction models

On Nov. 3, 2020—and for many days after—millions of people kept a wary eye on the presidential election prediction models run by various news outlets. With such high stakes in play, every tick of a tally and twitch of ...

page 1 from 31

Statistical model

A statistical model is a set of mathematical equations which describe the behavior of an object of study in terms of random variables and their associated probability distributions. If the model has only one equation it is called a single-equation model, whereas if it has more than one equation, it is known as a multiple-equation model.

In mathematical terms, a statistical model is frequently thought of as a pair (Y,P) where Y is the set of possible observations and P the set of possible probability distributions on Y. It is assumed that there is a distinct element of P which generates the observed data. Statistical inference enables us to make statements about which element(s) of this set are likely to be the true one.

Three notions are sufficient to describe all statistical models.

One of the most basic models is the simple linear regression model which assumes a relationship between two random variables Y and X. For instance, one may want to linearly explain child mortality in a given country by its GDP. This is a statistical model because the relationship need not to be perfect and the model includes a disturbance term which accounts for other effects on child mortality other than GDP.

As a second example, Bayes theorem in its raw form may be intractable, but assuming a general model H allows it to become

which may be easier. Models can also be compared using measures such as Bayes factors or mean square error.

This text uses material from Wikipedia, licensed under CC BY-SA