Bicocca Open Archive

Detecting irony and sarcasm in microblogs: The role of expressive signals and ensemble classifiers

Fersini, Elisabetta (first author); Pozzi, Federico Alberto (second author); Messina, Vincenzina (last author)

2015

Abstract

The automatic detection of sarcasm and irony in user-generated content is one of the most challenging tasks in Natural Language Processing. In this paper we address this problem by introducing Bayesian Model Averaging (BMA), an ensemble approach that combines several classifiers according to their reliabilities and their marginal probability predictions. The impact of the most commonly used expressive signals (pragmatic particles and POS tags) has been evaluated in baseline models (traditional classifiers and majority voting) as well as in the proposed BMA approach. Experimental results highlight two main findings: (1) not all features are equally able to characterize sarcasm and irony, and (2) BMA not only outperforms traditional state-of-the-art models but also ensures notable generalization capabilities on both ironic and sarcastic text.
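As a rough illustration of the idea described in the abstract, the sketch below contrasts majority voting with a reliability-weighted average of per-classifier label probabilities. It is not the authors' implementation: the function names (`weighted_average`, `majority_vote`), the label names, the example probabilities, and the use of a single reliability score per classifier (for instance, a validation-set F1 score) are all assumptions made for this example.

```python
from collections import Counter
from typing import Dict, List


def weighted_average(
    probabilities: List[Dict[str, float]], reliabilities: List[float]
) -> Dict[str, float]:
    """Average per-classifier label distributions, weighted by reliability.

    Each reliability is a hypothetical per-classifier score (e.g. validation F1);
    the scores are normalized so that the weights sum to 1.
    """
    total = sum(reliabilities)
    if total <= 0:
        raise ValueError("reliabilities must sum to a positive value")
    weights = [r / total for r in reliabilities]
    labels = probabilities[0].keys()
    return {
        label: sum(w * p[label] for w, p in zip(weights, probabilities))
        for label in labels
    }


def majority_vote(probabilities: List[Dict[str, float]]) -> str:
    """Baseline: each classifier votes for its own most probable label."""
    votes = Counter(max(p, key=p.get) for p in probabilities)
    return votes.most_common(1)[0][0]


# Hypothetical outputs of three classifiers for a single message.
probs = [
    {"ironic": 0.9, "not_ironic": 0.1},
    {"ironic": 0.4, "not_ironic": 0.6},
    {"ironic": 0.3, "not_ironic": 0.7},
]
# Hypothetical reliability scores, one per classifier.
reliab = [0.8, 0.5, 0.5]

combined = weighted_average(probs, reliab)
averaged_label = max(combined, key=combined.get)
voted_label = majority_vote(probs)
```

With these made-up numbers, two of the three classifiers vote for `not_ironic`, while the weighted average favors `ironic` because the most reliable classifier is also the most confident. This is the kind of disagreement an approach that accounts for both reliability and marginal probabilities can resolve differently from a plain vote.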

Contribution type: paper
Keywords: Artificial Intelligence; Information Systems and Management; Information Systems
Content language: English
Conference name: IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015, Oct 19-21
Conference year: 2015
Proceedings title: Proceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015
Proceedings volume ISBN: 9781467382731
Publication date: 2015
First page: 981
Last page: 988
Article number: 7344888
DOI: https://dx.doi.org/10.1109/DSAA.2015.7344888
Full text: none
Citation: Fersini, E., Pozzi, F., Messina, V. (2015). Detecting irony and sarcasm in microblogs: The role of expressive signals and ensemble classifiers. In Proceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015 (pp. 981-988). Institute of Electrical and Electronics Engineers Inc. [10.1109/DSAA.2015.7344888].
Appears in collections: 02 - Conference contribution

Files in this item:

There are no files associated with this item.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/135766
