This paper describes theShukran Sentiment Analysis system. TheShukran is a social network micro-blogging service that allows users posting photos or videos and descriptions of their daily life activities. This social network rapidly gained a large amount of users. It provides people from different cultures and countries the possibility to share in different languages their stories, ideas, opinions, and news from their real life, and makes the cultural diversity the center of relationships between its users. Sentiment analysis aims to extract the opinion of the public about some topic by processing text data. One of its several tasks, the polarity detection, aims at categorizing the elements in a dataset (sentences, posts, etc.) into classes such as positive, negative and neutral. In the system we propose, and that represents the sentiment analysis core engine of theShukran social network, we will detect the original language of users posts, translate them into English and evaluate their sentiment (whether positive, negative or neutral). We propose the use of a Naive Bayes classifier and SentiWordNet and SenticNet for the sentiment evaluation. The language detection and translation are performed using TextBlob, a Python library for processing textual data.
Language independent sentiment analysis of the Shukran social network using apache spark
IGUIDER, WALID;Reforgiato Recupero
2017-01-01
Abstract
This paper describes theShukran Sentiment Analysis system. TheShukran is a social network micro-blogging service that allows users posting photos or videos and descriptions of their daily life activities. This social network rapidly gained a large amount of users. It provides people from different cultures and countries the possibility to share in different languages their stories, ideas, opinions, and news from their real life, and makes the cultural diversity the center of relationships between its users. Sentiment analysis aims to extract the opinion of the public about some topic by processing text data. One of its several tasks, the polarity detection, aims at categorizing the elements in a dataset (sentences, posts, etc.) into classes such as positive, negative and neutral. In the system we propose, and that represents the sentiment analysis core engine of theShukran social network, we will detect the original language of users posts, translate them into English and evaluate their sentiment (whether positive, negative or neutral). We propose the use of a Naive Bayes classifier and SentiWordNet and SenticNet for the sentiment evaluation. The language detection and translation are performed using TextBlob, a Python library for processing textual data.File | Dimensione | Formato | |
---|---|---|---|
emsasw2017_paper_14.pdf
Solo gestori archivio
Tipologia:
versione pre-print
Dimensione
145.75 kB
Formato
Adobe PDF
|
145.75 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.