Cross-lingual word representations allow us to analyse word meanings across diverse language settings. It is crucial in aiding cross-lingual knowledge transfer when constructing natural language processing (NLP) models for languages with limited resources. This survey presents a comprehensive classification of cross-lingual contextual embedding models. We assess their data requirements and objective functions, and we introduce a taxonomy for categorising these approaches. Then, we present a comprehensive table containing a set of hierarchical criteria to compare them better, along with information regarding the availability of code and data to enable replication of the research. Furthermore, we delve into the evaluation methodologies employed for cross-lingual embeddings, exploring their practical applications and addressing their current associated challenges.

Pallucchini, F., Malandri, L., Mercorio, F., Mezzanzanica, M. (2025). Lost in Alignment: A Survey on Cross-Lingual Alignment Methods for Contextualized Representation. ACM COMPUTING SURVEYS [10.1145/3764112].

Lost in Alignment: A Survey on Cross-Lingual Alignment Methods for Contextualized Representation

Pallucchini, Filippo;Malandri, Lorenzo;Mercorio, Fabio;Mezzanzanica, Mario
2025

Abstract

Cross-lingual word representations allow us to analyse word meanings across diverse language settings. It is crucial in aiding cross-lingual knowledge transfer when constructing natural language processing (NLP) models for languages with limited resources. This survey presents a comprehensive classification of cross-lingual contextual embedding models. We assess their data requirements and objective functions, and we introduce a taxonomy for categorising these approaches. Then, we present a comprehensive table containing a set of hierarchical criteria to compare them better, along with information regarding the availability of code and data to enable replication of the research. Furthermore, we delve into the evaluation methodologies employed for cross-lingual embeddings, exploring their practical applications and addressing their current associated challenges.
Articolo in rivista - Articolo scientifico
NLP; machine translation; word embeddings; AI
English
26-ago-2025
2025
none
Pallucchini, F., Malandri, L., Mercorio, F., Mezzanzanica, M. (2025). Lost in Alignment: A Survey on Cross-Lingual Alignment Methods for Contextualized Representation. ACM COMPUTING SURVEYS [10.1145/3764112].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/565724
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact