In many languages, compounding is a fundamental process for the generation of novel words. When this process is productive (as, e.g., in English), native speakers can juxtapose two words to create novel compounds that can be readily understood by other speakers. The present paper proposes a large-scale, data-driven computational system for compound semantic processing based on distributional semantics, the CAOSS model (Compounding as Abstract Operation in Semantic Space). In CAOSS, word meanings are represented as vectors encoding their lexical co-occurrences in a reference corpus. Given two constituent words, their composed representation (the compound) is computed by using matrices representing the abstract properties of constituent roles (modifier vs. head). The matrices are also induced through examples of language usage. The model is then validated against behavioral results concerning the processing of novel compounds, and in particular relational effects on response latencies. The effects of relational priming and relational dominance are considered. CAOSS predictions are shown to pattern with previous results, in terms of both the impact of relational information and the dissociations related to the different constituent roles. The simulations indicate that relational information is implicitly reflected in language usage, suggesting that human speakers can learn these aspects from language experience and automatically apply them to the processing of new word combinations. The present model is flexible enough to emulate this procedure, suggesting that relational effects might emerge as a by-product of nuanced operations across distributional patterns.

Marelli, M., Gagné, C., Spalding, T. (2017). Compounding as Abstract Operation in Semantic Space: Investigating relational effects through a large-scale, data-driven computational model. COGNITION, 166, 207-224 [10.1016/j.cognition.2017.05.026].

Compounding as Abstract Operation in Semantic Space: Investigating relational effects through a large-scale, data-driven computational model

Marelli, M
;
2017

Abstract

In many languages, compounding is a fundamental process for the generation of novel words. When this process is productive (as, e.g., in English), native speakers can juxtapose two words to create novel compounds that can be readily understood by other speakers. The present paper proposes a large-scale, data-driven computational system for compound semantic processing based on distributional semantics, the CAOSS model (Compounding as Abstract Operation in Semantic Space). In CAOSS, word meanings are represented as vectors encoding their lexical co-occurrences in a reference corpus. Given two constituent words, their composed representation (the compound) is computed by using matrices representing the abstract properties of constituent roles (modifier vs. head). The matrices are also induced through examples of language usage. The model is then validated against behavioral results concerning the processing of novel compounds, and in particular relational effects on response latencies. The effects of relational priming and relational dominance are considered. CAOSS predictions are shown to pattern with previous results, in terms of both the impact of relational information and the dissociations related to the different constituent roles. The simulations indicate that relational information is implicitly reflected in language usage, suggesting that human speakers can learn these aspects from language experience and automatically apply them to the processing of new word combinations. The present model is flexible enough to emulate this procedure, suggesting that relational effects might emerge as a by-product of nuanced operations across distributional patterns.
Articolo in rivista - Articolo scientifico
Compound words; Conceptual combination; Distributional semantics; Novel compounds; Relational information
English
2017
166
207
224
none
Marelli, M., Gagné, C., Spalding, T. (2017). Compounding as Abstract Operation in Semantic Space: Investigating relational effects through a large-scale, data-driven computational model. COGNITION, 166, 207-224 [10.1016/j.cognition.2017.05.026].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/182585
Citazioni
  • Scopus 41
  • ???jsp.display-item.citation.isi??? 38
Social impact