In this paper, we present the University of Helsinki submissions to the WMT 2019 shared news translation task in three language pairs: English-German, English-Finnish, and Finnish-English. This year, we focused first on cleaning and filtering the training data using multiple data-filtering approaches, resulting in much smaller and cleaner training sets. For English-German, we trained sentence-level transformer models and compared different document-level translation approaches. For Finnish-English and English-Finnish, we focused on different segmentation approaches, and we also included a rule-based system for English-Finnish.

Talman, A., Sulubacak, U., Vázquez, R., Scherrer, Y., Virpioja, S., Raganato, A., et al. (2019). The University of Helsinki Submissions to the WMT19 News Translation Task. In Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1) (pp.412-423) [10.18653/v1/W19-5347].

The University of Helsinki Submissions to the WMT19 News Translation Task

Raganato, Alessandro;
2019

Type: paper
Keywords: transformer; machine translation; data filtering
Language: English
Conference: Fourth Conference on Machine Translation
Conference year: 2019
Published in: Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1)
ISBN: 978-1-950737-27-7
Publication year: 2019
Pages: 412-423
Access: reserved

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/361569