While distributional semantic models that represent word meanings as high-dimensional vectors induced from large text corpora have been shown to successfully predict human behavior across a wide range of tasks, they have also received criticism from different directions. These include concerns over their interpretability (how can numbers specifying abstract, latent dimensions represent meaning?) and their ability to capture variation in meaning (how can a single vector representation capture multiple different interpretations for the same expression?). Here, we demonstrate that semantic vectors can indeed rise up to these challenges, by training a mapping system (a simple linear regression) that predicts inter-individual variation in relational interpretations for compounds such as wood brush (for example brush FOR wood, or brush MADE OF wood) from (compositional) semantic vectors representing the meanings of these compounds. These predictions consistently beat different random baselines, both for familiar compounds (moon light, Experiment 1) as well as novel compounds (wood brush, Experiment 2), demonstrating that distributional semantic vectors encode variations in qualitative interpretations that can be decoded using techniques as simple as linear regression.

Günther, F., Marelli, M. (2022). Patterns in CAOSS: Distributed representations predict variation in relational interpretations for familiar and novel compound words. COGNITIVE PSYCHOLOGY, 134(May 2022) [10.1016/j.cogpsych.2022.101471].

Patterns in CAOSS: Distributed representations predict variation in relational interpretations for familiar and novel compound words

Marelli, Marco
2022

Abstract

While distributional semantic models that represent word meanings as high-dimensional vectors induced from large text corpora have been shown to successfully predict human behavior across a wide range of tasks, they have also received criticism from different directions. These include concerns over their interpretability (how can numbers specifying abstract, latent dimensions represent meaning?) and their ability to capture variation in meaning (how can a single vector representation capture multiple different interpretations for the same expression?). Here, we demonstrate that semantic vectors can indeed rise up to these challenges, by training a mapping system (a simple linear regression) that predicts inter-individual variation in relational interpretations for compounds such as wood brush (for example brush FOR wood, or brush MADE OF wood) from (compositional) semantic vectors representing the meanings of these compounds. These predictions consistently beat different random baselines, both for familiar compounds (moon light, Experiment 1) as well as novel compounds (wood brush, Experiment 2), demonstrating that distributional semantic vectors encode variations in qualitative interpretations that can be decoded using techniques as simple as linear regression.
Articolo in rivista - Articolo scientifico
Compound words; Conceptual combination; Distributional semantics; Novel compounds; Relational information;
English
24-mar-2022
2022
134
May 2022
101471
none
Günther, F., Marelli, M. (2022). Patterns in CAOSS: Distributed representations predict variation in relational interpretations for familiar and novel compound words. COGNITIVE PSYCHOLOGY, 134(May 2022) [10.1016/j.cogpsych.2022.101471].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/422571
Citazioni
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 4
Social impact