March 3, 2020

How computational linguistics helps to understand how language works

by Universitat Pompeu Fabra - Barcelona

Distributional semantics obtains representations of the meaning of words by processing thousands of texts and extracting generalizations using computational algorithms. Despite the popularity of distributional semantics in such fields as computational linguistics and cognitive science, its impact on theoretical linguistics has so far been very limited.

Research by Gemma Boleda, head of the Computational Linguistics and Language Theory (COLT) research group and ICREA research professor in the Department of Translation and Language Sciences at UPF, published in the journal Annual Review of Linguistics, provides a critical review of the abundant literature on distributional semantics, with special emphasis on the results that are relevant for theoretical linguistics. Specifically, the review covers three areas: semantic change, polysemy and composition, and the grammar-semantics interface.

Boleda's research seeks to connect theoretical and computational approaches in order to advance our collective knowledge of how language works. One of the methods she has researched extensively is distributional semantics, which makes it possible to obtain representations of words automatically. These representations have been shown to reflect significant linguistic properties, such as how similar two words are: a person will tell you that "dog" and "puppy" are very similar, whereas "dog" and "democracy" are hardly similar at all. Distributional semantics will say the same, because it induces linguistic properties from texts written by people. In that sense, distributional semantics provides radically empirical representations.
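The "dog", "puppy" and "democracy" comparison can be sketched in a few lines of Python. This is only a minimal illustration, not the method used in the review: the six-sentence corpus is invented, the names `CORPUS`, `cooccurrence_vectors` and `cosine` are hypothetical, and raw co-occurrence counts stand in for the far larger corpora and more refined models used in practice.

```python
import math
from collections import Counter, defaultdict

# Toy corpus invented for illustration; real systems process thousands of texts.
CORPUS = [
    "the dog barked at the cat",
    "the puppy barked at the cat",
    "the dog chased the ball",
    "the puppy slept on the sofa",
    "democracy requires free elections",
    "citizens vote in a democracy",
]

def cooccurrence_vectors(sentences, window=2):
    """Map each word to counts of the words seen within `window` positions of it."""
    vectors = defaultdict(Counter)
    for sentence in sentences:
        tokens = sentence.split()
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[word][tokens[j]] += 1
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse count vectors (0.0 if either is empty)."""
    dot = sum(count * v[key] for key, count in u.items() if key in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

vectors = cooccurrence_vectors(CORPUS)
sim_dog_puppy = cosine(vectors["dog"], vectors["puppy"])
sim_dog_democracy = cosine(vectors["dog"], vectors["democracy"])
print(f"dog vs puppy:     {sim_dog_puppy:.2f}")
print(f"dog vs democracy: {sim_dog_democracy:.2f}")
```

In this toy corpus "dog" and "puppy" share context words such as "barked" and "at", so their vectors point in similar directions, while "dog" and "democracy" share no context words at all. Function words like "the" are left in for simplicity; a more careful sketch would down-weight them.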

Distributional semantics allows analysing the use of words and the evolution of their meaning

Distributional semantics provides an attractive framework that complements other, more traditional methods, not only because it is radically empirical but also because it provides multidimensional representations: two words can be similar along one dimension of meaning ("pizza" and "pasta" are both types of food) or along another ("pizza" and "wheel" are both round). To represent all aspects of meaning, multidimensional representations are needed. Distributional semantics can capture what the uses of two words have in common as well as what sets them apart.
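The point about dimensions can be made concrete with a deliberately simplified sketch. The two named features and their values below are hand-assigned assumptions for illustration only; the dimensions of real distributional vectors are learned from text and usually do not carry readable labels like these. `FEATURES`, `WORDS` and `closeness` are hypothetical names.

```python
# Hand-assigned, purely illustrative feature values (not learned from any corpus).
FEATURES = ("edible", "round")
WORDS = {
    "pizza": (0.9, 0.8),
    "pasta": (0.9, 0.1),
    "wheel": (0.0, 0.9),
}

def closeness(word_a, word_b, dimension):
    """Return 1 - |difference| on one named dimension (1.0 means identical values)."""
    i = FEATURES.index(dimension)
    return 1.0 - abs(WORDS[word_a][i] - WORDS[word_b][i])

for a, b in [("pizza", "pasta"), ("pizza", "wheel")]:
    for dim in FEATURES:
        print(f"{a} vs {b} on '{dim}': {closeness(a, b, dim):.1f}")
```

A single similarity score would have to average these two comparisons away; keeping several dimensions lets "pizza" be close to "pasta" in one respect and close to "wheel" in another.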

One of the important applications of distributional semantics in theoretical linguistics is the detection of changes in meaning. If language data from different periods are processed, such as books in English from 1900, 1950 and 1990, distributional semantics can be used to detect automatically which words have changed in meaning. For example, at the beginning of the last century the English word "gay" meant "happy," and it has since been used increasingly to mean "homosexual."
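A rough sketch of this kind of comparison across periods follows. Everything in it is an assumption made for illustration: the two tiny "period" corpora are invented, the word "mouse" is a stand-in example chosen for the toy data (with "cat" as a control word whose use stays the same), and `STOPWORDS`, `context_vector`, `cosine` and `stability` are hypothetical names. Count-based vectors are used because both periods then share the same dimensions (the context words) and can be compared directly.

```python
import math
from collections import Counter

# Invented toy "period" corpora; real studies use large dated text collections.
EARLY = [
    "the mouse ate the cheese",
    "the cat chased the mouse",
    "the cat ate the fish",
]
LATE = [
    "click the mouse button",
    "plug the mouse into the computer",
    "the cat ate the fish",
    "the cat chased the bird",
]
STOPWORDS = {"the", "into"}  # assumption: ignore function words as context

def context_vector(target, sentences, window=2):
    """Count the content words appearing within `window` positions of `target`."""
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.split()
        for i, token in enumerate(tokens):
            if token != target:
                continue
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i and tokens[j] not in STOPWORDS:
                    counts[tokens[j]] += 1
    return counts

def cosine(u, v):
    """Cosine similarity between two sparse count vectors (0.0 if either is empty)."""
    dot = sum(count * v[key] for key, count in u.items() if key in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def stability(word):
    """Similarity of a word's contexts across the two periods; low values suggest change."""
    return cosine(context_vector(word, EARLY), context_vector(word, LATE))

mouse_stability = stability("mouse")
cat_stability = stability("cat")
print(f"mouse: {mouse_stability:.2f}")
print(f"cat:   {cat_stability:.2f}")
```

Words whose contexts differ sharply between periods get a low score and become candidates for closer inspection; a word such as "cat", used the same way in both toy corpora, scores high. With so little data the scores are extreme, and a real analysis would need far more text and a threshold chosen with care.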

Aspects of research into distributional semantics that contribute to language theory

From her analysis of the studies reviewed, Boleda concludes that there is sufficient evidence for the solid results of distributional semantics to be imported directly into research in theoretical linguistics.

"There are at least four aspects of research in distributional semantics that can contribute to language theory. The first aspect is exploratory: distributional representations can be used to explore large-scale data, for example by examining the similarity of words. The second is as a tool to identify specific cases of linguistic phenomena. For example, words can be identified whose meanings have changed when comparing the representations obtained from texts from different periods. The third is as a test bench: evaluating different linguistic hypotheses in distributional terms. The fourth and most difficult is the discovery of new linguistic phenomena or relevant theoretical trends in the data," the author explains in her work.

More information: Gemma Boleda, Distributional Semantics and Linguistic Theory, Annual Review of Linguistics (2019). DOI: 10.1146/annurev-linguistics-011619-030303

Provided by Universitat Pompeu Fabra - Barcelona

Citation: How computational linguistics helps to understand how language works (2020, March 3) retrieved 12 September 2024 from https://phys.org/news/2020-03-linguistics-language.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
