Bicocca Open Archive

There are several ways of implementing multilingual NLP systems but little consensus as to whether different approaches exhibit similar effects. Are the trends that we observe when adding more languages the same as those we observe when sharing more parameters? We focus on encoder representations drawn from modular multilingual machine translation systems in an English-centric scenario, and study their quality from multiple aspects: how adequate they are for machine translation, how independent of the source language they are, and what semantic information they convey. Adding translation directions in English-centric scenarios does not conclusively lead to an increase in translation quality. Shared layers increase performance on zero-shot translation pairs and lead to more language-independent representations, but these improvements do not systematically align with more semantically accurate representations, from a monolingual standpoint.

Boggia, M., Gronroos, S., Loppi, N., Mickus, T., Raganato, A., Tiedemann, J., et al. (2023). Dozens of Translation Directions or Millions of Shared Parameters? Comparing Two Types of Multilinguality in Modular Machine Translation. In Proceedings of the 24th Nordic Conference on Computational Linguistics, NoDaLiDa 2023 (pp.238-247). University of Tartu Library.

Dozens of Translation Directions or Millions of Shared Parameters? Comparing Two Types of Multilinguality in Modular Machine Translation

Boggia M.;Gronroos S. -A.;Loppi N. A.;Mickus T.;Raganato A.;Tiedemann J.;Vazquez R.

2023

Abstract

There are several ways of implementing multilingual NLP systems but little consensus as to whether different approaches exhibit similar effects. Are the trends that we observe when adding more languages the same as those we observe when sharing more parameters? We focus on encoder representations drawn from modular multilingual machine translation systems in an English-centric scenario, and study their quality from multiple aspects: how adequate they are for machine translation, how independent of the source language they are, and what semantic information they convey. Adding translation directions in English-centric scenarios does not conclusively lead to an increase in translation quality. Shared layers increase performance on zero-shot translation pairs and lead to more language-independent representations, but these improvements do not systematically align with more semantically accurate representations, from a monolingual standpoint.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				paper
			
	Parole chiave
	
				Computational linguistics; Latent semantic analysis; Neural machine translation; Semantics
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				24th Nordic Conference on Computational Linguistics, NoDaLiDa 2023 - 22 May 2023 through 24 May 2023
			
	Anno del convegno
	
				2023
			
	Titolo degli atti
	
				Proceedings of the 24th Nordic Conference on Computational Linguistics, NoDaLiDa 2023
			
	ISBN del volume degli atti
	
				9789916219997
			
	Data di pubblicazione
	
				2023
			
	Pagina iniziale
	
				238
			
	Pagina finale
	
				247
			
	Fulltext
	
				open
			
	Citazione
	
				Boggia, M., Gronroos, S., Loppi, N., Mickus, T., Raganato, A., Tiedemann, J., et al. (2023). Dozens of Translation Directions or Millions of Shared Parameters? Comparing Two Types of Multilinguality in Modular Machine Translation. In Proceedings of the 24th Nordic Conference on Computational Linguistics, NoDaLiDa 2023 (pp.238-247). University of Tartu Library.
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

File	Dimensione	Formato
Boggia-2023-NoDaLiDa-VoR.pdf accesso aperto Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Licenza: Creative Commons Dimensione 526.71 kB Formato Adobe PDF Visualizza/Apri	526.71 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/553048

Citazioni

2

ND

Social impact