In this paper, we present our results for DIACR-Ita, the EVALITA 2020 challenge on semantic change detection for the Italian language. Our approach is based on measuring the semantic distance across time-specific word vectors generated with Compass-aligned Distributional Embeddings (CADE). We first generate temporal embeddings with CADE, a strategy to align word embeddings that are specific to each time period; the quality of this alignment is the main asset of our proposal. We then measure the semantic shift of each word by combining two different semantic shift measures. Finally, we classify a word's meaning as changed or not changed by defining a threshold over the semantic distance across time.
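
The final thresholding step described above can be illustrated with a minimal sketch. It assumes the time-specific vectors have already been aligned (for example with CADE) and uses cosine distance as a single stand-in measure; the abstract does not name the two measures the authors combine, so this is not their exact method. The function names, the toy vectors, and the threshold value of 0.5 are all hypothetical.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / (norm_u * norm_v)


def classify_changes(vectors_t1, vectors_t2, threshold):
    """Label each word present in both periods: 1 = changed, 0 = not changed."""
    labels = {}
    for word in vectors_t1.keys() & vectors_t2.keys():
        distance = cosine_distance(vectors_t1[word], vectors_t2[word])
        labels[word] = 1 if distance > threshold else 0
    return labels


# Toy hand-written vectors standing in for aligned embeddings (hypothetical data).
vectors_t1 = {"rete": [1.0, 0.0], "casa": [0.0, 1.0]}
vectors_t2 = {"rete": [0.0, 1.0], "casa": [0.0, 2.0]}

# The threshold value is an assumption chosen only for this illustration.
labels = classify_changes(vectors_t1, vectors_t2, threshold=0.5)
```

In this toy setup "rete" points in an orthogonal direction in the second period and is labelled as changed, while "casa" only changes in magnitude and is labelled as not changed. In practice the threshold would have to be tuned on data.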

Belotti, F., Bianchi, F., Palmonari, M. (2020). UNIMIB @ DIACR-Ita: Aligning distributional embeddings with a compass for semantic change detection in the Italian language. In CEUR Workshop Proceedings. CEUR-WS.

UNIMIB @ DIACR-Ita: Aligning distributional embeddings with a compass for semantic change detection in the Italian language

Bianchi F.; Palmonari M.
2020

Type: paper
Keywords: Natural language processing, artificial intelligence, computational linguistics
Language: English
Conference: 7th Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop, EVALITA 2020 (2020)
Published in: CEUR Workshop Proceedings, volume 2765 (2020)
none

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/302419