Cross-lingual word representations allow us to analyse word meanings across diverse language settings. It is crucial in aiding cross-lingual knowledge transfer when constructing natural language processing (NLP) models for languages with limited resources. This survey presents a comprehensive classification of cross-lingual contextual embedding models. We assess their data requirements and objective functions, and we introduce a taxonomy for categorising these approaches. Then, we present a comprehensive table containing a set of hierarchical criteria to compare them better, along with information regarding the availability of code and data to enable replication of the research. Furthermore, we delve into the evaluation methodologies employed for cross-lingual embeddings, exploring their practical applications and addressing their current associated challenges.

Pallucchini, F., Malandri, L., Mercorio, F., Mezzanzanica, M. (2026). Lost in Alignment: A Survey on Cross-Lingual Alignment Methods for Contextualized Representation. ACM COMPUTING SURVEYS, 58(5 (April 2026)), 1-34 [10.1145/3764112].

Lost in Alignment: A Survey on Cross-Lingual Alignment Methods for Contextualized Representation

Pallucchini, Filippo;Malandri, Lorenzo;Mercorio, Fabio;Mezzanzanica, Mario
2026

Abstract

Cross-lingual word representations allow us to analyse word meanings across diverse language settings. It is crucial in aiding cross-lingual knowledge transfer when constructing natural language processing (NLP) models for languages with limited resources. This survey presents a comprehensive classification of cross-lingual contextual embedding models. We assess their data requirements and objective functions, and we introduce a taxonomy for categorising these approaches. Then, we present a comprehensive table containing a set of hierarchical criteria to compare them better, along with information regarding the availability of code and data to enable replication of the research. Furthermore, we delve into the evaluation methodologies employed for cross-lingual embeddings, exploring their practical applications and addressing their current associated challenges.
Articolo in rivista - Articolo scientifico
cross-lingual alignment; Embedding alignment;
English
26-ago-2025
2026
58
5 (April 2026)
1
34
116
open
Pallucchini, F., Malandri, L., Mercorio, F., Mezzanzanica, M. (2026). Lost in Alignment: A Survey on Cross-Lingual Alignment Methods for Contextualized Representation. ACM COMPUTING SURVEYS, 58(5 (April 2026)), 1-34 [10.1145/3764112].
File in questo prodotto:
File Dimensione Formato  
Pallucchini et al-2026-ACM Computing Surveys-VoR.pdf

accesso aperto

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 1.32 MB
Formato Adobe PDF
1.32 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/565724
Citazioni
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 1
Social impact