Bicocca Open Archive

According to psycholinguistic theories, processing a compound word (“snowman”) involves automatically decomposing it into its constituents (“snow”, “man”), which are then linked by an implicit semantic relation (“made of”) to yield a plausible interpretation (“man made of snow”). However, the appropriate relation is often ambiguous and must be selected from a set of competing candidates. In this study, we investigated whether contextualized word embeddings (CWEs) capture human intuitions about compounds’ interpretations. We used BERT-base to obtain CWEs of compounds in context (e.g., “We built a [snowman] in our garden”). We then systematically replaced the compounds with paraphrase variants in which candidate relations were made explicit (e.g., “We built a [man made of snow] in our garden”) and computed the similarity between the CWE of the original compound and the CWEs of its variants. We found that these similarities predict participants’ interpretations (i.e., the probability of selecting a given relation) and their degree of conflict. Thus, we show that CWEs can be leveraged to generate semantic representations for linguistic units that are not directly observable in text but that influence compounds’ interpretation and processing.
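The comparison step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how multi-token spans are pooled or which similarity measure was used, so mean pooling and cosine similarity are assumptions here. The three-dimensional vectors are made-up stand-ins for BERT-base token embeddings, and every relation label other than "made of" is hypothetical.

```python
import math

def mean_pool(token_vectors):
    """Average the token vectors of a span into a single span vector."""
    n = len(token_vectors)
    dim = len(token_vectors[0])
    return [sum(vec[i] for vec in token_vectors) / n for i in range(dim)]

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def rank_relations(compound_vec, variant_vecs):
    """Sort candidate relations by similarity to the compound, highest first."""
    sims = {rel: cosine(compound_vec, vec) for rel, vec in variant_vecs.items()}
    return sorted(sims.items(), key=lambda item: item[1], reverse=True)

# Toy token vectors (assumed values, NOT real BERT output).
# Span "[snowman]" in "We built a [snowman] in our garden":
compound_vec = mean_pool([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])

# Paraphrase spans in the same sentence frame, one per candidate relation.
# Only "made of" comes from the abstract; the other labels are hypothetical.
variant_vecs = {
    "made of": mean_pool([[1.0, 1.0, 0.0], [1.0, 1.0, 0.0]]),
    "located in": mean_pool([[1.0, 0.0, 1.0], [1.0, 0.0, 1.0]]),
    "for": mean_pool([[0.0, 0.0, 1.0], [0.0, 0.0, 1.0]]),
}

ranking = rank_relations(compound_vec, variant_vecs)
for relation, similarity in ranking:
    print(f"{relation}: {similarity:.3f}")
```

In a real pipeline, the toy vectors would be replaced by hidden states extracted from the model for the bracketed span, and the resulting similarities would then be compared with participants' relation choices.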

Ciapparelli, M., Marelli, M. (2023). Modeling compound word relational interpretations with contextualized word embeddings. Paper presented at: Psycholinguistics in Flanders Conference, Ghent, Belgium.

Modeling compound word relational interpretations with contextualized word embeddings

Ciapparelli, M. (first author); Marelli, M. (last author)

2023



Type of contribution: oral presentation
Keywords: computational modelling; psycholinguistics; compound words; large language models
Content language: English
Conference name: Psycholinguistics in Flanders Conference
Conference year: 2023
Publication date: 2023
Full text: none
Appears in categories: 02 - Conference contribution


Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/467198
