Credit: CC0 Public Domain

Multi-disciplinary researchers at The University of Manchester have helped develop a powerful physics-based tool to map the pace of language development and human innovation over thousands of years—even stretching into pre-history before records were kept.

Tobias Galla, a professor in , and Dr. Ricardo Bermúdez-Otero, a specialist in , from The University of Manchester, have come together as part of an international team to share their diverse expertise to develop the new model, revealed in a paper entitled 'Geospatial distributions reflect temperatures of linguistic feature' authored by Henri Kauhanen, Deepthi Gopal, Tobias Galla and Ricardo Bermúdez-Otero, and published by the journal Science Advances.

Professor Galla has applied statistical physics—usually used to map atoms or nanoparticles—to help build a mathematically-based model that responds to the evolutionary dynamics of language. Essentially, the forces that drive can operate across thousands of years and leave a measurable "geospatial signature", determining how languages of different types are distributed over the surface of the Earth.

Dr. Bermúdez-Otero explained: "In our model each language has a collection of properties or features and some of those features are what we describe as 'hot' or 'cold'.

"So, if a language puts the object before the verb, then it is relatively likely to get stuck with that order for a long period of time—so that's a 'cold' feature. In contrast, markers like the English article 'the' come and go a lot faster: they may be here in one historical period, and be gone in the next. In that sense, definite articles are 'hot' features.

"The striking thing is that languages with 'cold' properties tend to form big clumps, whereas languages with 'hot' properties tend to be more scattered geographically."

This method therefore works like a thermometer, enabling researchers to retrospectively tell whether one linguistic property is more prone to change in historical time than another. This modelling could also provide a similar benchmark for the pace of change in other social behaviours or practices over time and space.

"For example, suppose that you have a map showing the spatial distribution of some variable cultural practice for which you don't have any —this could be be anything, like different rules on marriage or on the inheritance of possessions," added Dr. Bermúdez-Otero.

"Our method could, in principle, be used to ascertain whether one practice changes in the course of historical time faster than another, ie whether people are more innovative in one area than in another, just by looking at how the present-day variation is distributed in space."

The source data for the linguistic modelling comes from present-day languages and the team relied on The World Atlas of Language Structures (WALS). This records information of 2,676 contemporary languages.

Professor Galla explained: "We were interested in emergent phenomena, such as how large-scale effects, for example patterns in the distribution of language features arise from relatively simple interactions. This is a common theme in complex systems research.

"I was able to help with my expertise in the mathematical tools we used to analyse the model and in simulation techniques. I also contributed to setting up the model in the first place, and by asking questions that a linguist would perhaps not ask in the same way."

More information: Henri Kauhanen et al, Geospatial distributions reflect temperatures of linguistic features, Science Advances (2021). DOI: 10.1126/sciadv.abe6540

Journal information: Science Advances