Improving sentiment analysis using preprocessing techniques and lexical patterns

IRIS

Sentiment analysis has recently gained considerable attention, since the classification of the emotional content of a text (online reviews, blog messages etc.) may have a relevant impact on market research, political science and many other fields. In this paper, we focus on the importance of the text preprocessing phase, proposing a new technique we termed lexical pattern-based feature weighting (LPFW) that allows one to improve sentence-level sentiment analysis by increasing the relevance of the features contained in particular lexical patterns. This approach has been evaluated on two sentiment classification datasets. We show that a systematic optimisation of the preprocessing filters is important for obtaining good classification accuracy. Also, we show that LPFW is effective in different application domains and with different training set sizes.

Improving sentiment analysis using preprocessing techniques and lexical patterns / Cagnoni, S.; Ferrari, L.; Fornacciari, P.; Mordonini, M.; Sani, L.; Tomaiuolo, M.. - In: INTERNATIONAL JOURNAL OF DATA ANALYSIS TECHNIQUES AND STRATEGIES. - ISSN 1755-8050. - 13:3(2021), pp. 171-185. [10.1504/IJDATS.2021.118022]

Improving sentiment analysis using preprocessing techniques and lexical patterns

Cagnoni S.;Ferrari L.;Fornacciari P.;Mordonini M.;Sani L.;Tomaiuolo M.

2021-01-01

Abstract

Sentiment analysis has recently gained considerable attention, since the classification of the emotional content of a text (online reviews, blog messages etc.) may have a relevant impact on market research, political science and many other fields. In this paper, we focus on the importance of the text preprocessing phase, proposing a new technique we termed lexical pattern-based feature weighting (LPFW) that allows one to improve sentence-level sentiment analysis by increasing the relevance of the features contained in particular lexical patterns. This approach has been evaluated on two sentiment classification datasets. We show that a systematic optimisation of the preprocessing filters is important for obtaining good classification accuracy. Also, we show that LPFW is effective in different application domains and with different training set sizes.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2021
			
	Citazione
	
				Improving sentiment analysis using preprocessing techniques and lexical patterns / Cagnoni, S.; Ferrari, L.; Fornacciari, P.; Mordonini, M.; Sani, L.; Tomaiuolo, M.. - In: INTERNATIONAL JOURNAL OF DATA ANALYSIS TECHNIQUES AND STRATEGIES. - ISSN 1755-8050. - 13:3(2021), pp. 171-185. [10.1504/IJDATS.2021.118022]
			
	Appare nelle tipologie:
	
				1.1 Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
IJDATS130301 CAGNONI_205827.pdf solo utenti autorizzati Descrizione: Articolo Tipologia: Documento in Post-print Licenza: NON PUBBLICO - Accesso privato/ristretto Dimensione 768.05 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	768.05 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11381/2902789

Citazioni

ND

5

ND

social impact