Bicocca Open Archive

Word Sense Disambiguation models exist in many flavors. Even though supervised ones tend to perform best in terms of accuracy, they often lose ground to more flexible knowledge-based solutions, which do not require training by a word expert for every disambiguation target. To bridge this gap we adopt a different perspective and rely on sequence learning to frame the disambiguation problem: we propose and study in depth a series of end-to-end neural architectures directly tailored to the task, from bidirectional Long Short-Term Memory to encoder-decoder models. Our extensive evaluation over standard benchmarks and in multiple languages shows that sequence learning enables more versatile all-words models that consistently lead to state-of-the-art results, even against word experts with engineered features.

Raganato, A., Delli Bovi, C., Navigli, R. (2017). Neural sequence learning models for word sense disambiguation. In EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings (pp.1156-1167). Association for Computational Linguistics (ACL) [10.18653/v1/d17-1120].

Neural sequence learning models for word sense disambiguation

Raganato, Alessandro;Delli Bovi, Claudio;Navigli, Roberto

2017

Abstract

Word Sense Disambiguation models exist in many flavors. Even though supervised ones tend to perform best in terms of accuracy, they often lose ground to more flexible knowledge-based solutions, which do not require training by a word expert for every disambiguation target. To bridge this gap we adopt a different perspective and rely on sequence learning to frame the disambiguation problem: we propose and study in depth a series of end-to-end neural architectures directly tailored to the task, from bidirectional Long Short-Term Memory to encoder-decoder models. Our extensive evaluation over standard benchmarks and in multiple languages shows that sequence learning enables more versatile all-words models that consistently lead to state-of-the-art results, even against word experts with engineered features.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				paper
			
	Parole chiave
	
				Word Sense Disambiguation; Sequence Modeling; Neural Networks; Deep Learning;
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017 - 7-11 Settembre
			
	Anno del convegno
	
				2017
			
	Titolo degli atti
	
				EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings
			
	ISBN del volume degli atti
	
				978-1-945626-83-8
			
	Data di pubblicazione
	
				2017
			
	Numero del volume
	
				1
			
	Pagina iniziale
	
				1156
			
	Pagina finale
	
				1167
			
	DOI dell'intervento
	
				https://dx.doi.org/10.18653/v1/d17-1120
			
	URL alternativo
	
				https://aclweb.org/anthology/D/D17/D17-1121.pdf
			
	Fulltext
	
				reserved
			
	Citazione
	
				Raganato, A., Delli Bovi, C., Navigli, R. (2017). Neural sequence learning models for word sense disambiguation. In EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing, Proceedings (pp.1156-1167). Association for Computational Linguistics (ACL) [10.18653/v1/d17-1120].
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

File	Dimensione	Formato
Navigli_Neural_2017.pdf Solo gestori archivio Dimensione 887.41 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	887.41 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/361559

Citazioni

166

ND

Social impact