Bicocca Open Archive

Fersini, E., Messina, V., Pozzi, F. (2014). Sentiment analysis: Bayesian Ensemble Learning. DECISION SUPPORT SYSTEMS, 68, 26-38 [10.1016/j.dss.2014.10.004].

Sentiment analysis: Bayesian Ensemble Learning

Fersini, Elisabetta; Messina, Vincenzina; Pozzi, Federico Alberto

2014

Abstract

The amount of textual data on the Web has grown rapidly in the last few years, creating unique content of massive scale. In a decision-making context, one of the most relevant tasks is polarity classification of a text source, which is usually performed through supervised learning methods. Most existing approaches select the single best classification model, leading to over-confident decisions that do not take into account the inherent uncertainty of natural language. In this paper, we pursue the paradigm of ensemble learning to reduce the noise sensitivity related to language ambiguity and thereby provide a more accurate prediction of polarity. The proposed ensemble method is based on Bayesian Model Averaging, in which both the uncertainty and the reliability of each single model are taken into account. We address the classifier selection problem by proposing a greedy approach that evaluates the contribution of each model with respect to the ensemble. Experimental results on gold standard datasets show that the proposed approach outperforms both traditional classification and ensemble methods.
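The two ideas named in the abstract, reliability-weighted averaging of model outputs and greedy selection of ensemble members, can be illustrated with a small sketch. This is not the authors' formulation: the function names, the use of validation accuracy as the selection criterion, the backward-elimination loop, and the toy probabilities and reliability weights are all assumptions made for this example.

```python
from typing import Dict, List

Dist = Dict[str, float]  # label -> probability


def dist(p_pos: float) -> Dist:
    """Build a two-label polarity distribution from P(pos)."""
    return {"pos": p_pos, "neg": 1.0 - p_pos}


def bma_combine(dists: List[Dist], weights: List[float]) -> Dist:
    """Average per-model distributions, weighting each by its reliability."""
    total = sum(weights)
    return {
        label: sum(w * d[label] for w, d in zip(weights, dists)) / total
        for label in dists[0]
    }


def accuracy(subset: List[int], preds: List[List[Dist]],
             weights: List[float], gold: List[str]) -> float:
    """Validation accuracy of the ensemble restricted to `subset`."""
    hits = 0
    for i, label in enumerate(gold):
        combined = bma_combine([preds[m][i] for m in subset],
                               [weights[m] for m in subset])
        hits += max(combined, key=combined.get) == label
    return hits / len(gold)


def greedy_select(preds: List[List[Dist]], weights: List[float],
                  gold: List[str]) -> List[int]:
    """Backward elimination (assumed strategy): drop one model at a time
    for as long as doing so strictly improves validation accuracy."""
    selected = list(range(len(preds)))
    best = accuracy(selected, preds, weights, gold)
    while len(selected) > 1:
        # Score every ensemble obtained by dropping exactly one model.
        score, drop = max(
            (accuracy([k for k in selected if k != m], preds, weights, gold), m)
            for m in selected
        )
        if score <= best:
            break
        selected.remove(drop)
        best = score
    return selected


# Toy validation set: preds[m][i] is model m's distribution for example i.
gold = ["pos", "neg", "pos", "neg"]
preds = [
    [dist(p) for p in (0.9, 0.2, 0.8, 0.3)],   # hypothetical strong model
    [dist(p) for p in (0.6, 0.4, 0.7, 0.6)],   # hypothetical average model
    [dist(p) for p in (0.2, 0.85, 0.6, 0.8)],  # hypothetical noisy model
]
weights = [0.8, 0.6, 0.6]  # hypothetical reliability scores

print(greedy_select(preds, weights, gold))  # [0, 1]
```

On this toy data the full ensemble misclassifies one of the four examples; removing the noisy third model removes that error, and no further removal helps, so the loop stops with the first two models.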

Subtype: Journal article - Scientific article
Keywords: Ensemble learning; Polarity classification; Sentiment analysis; Management Information Systems; Information Systems; Information Systems and Management; Arts and Humanities (miscellaneous); Developmental and Educational Psychology
Content language: English
Publication date: 2014
Journal: DECISION SUPPORT SYSTEMS
Volume number: 68
First page: 26
Last page: 38
Article DOI: https://dx.doi.org/10.1016/j.dss.2014.10.004
Fulltext: none
Appears in types: 01 - Journal article

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/59547
