社交媒体内容的情感分析

论文标题

社交媒体内容的情感分析

Sentiment Analysis on Social Media Content

论文作者

Samuels, Antony, Mcgonical, John

论文摘要

如今，来自世界各地的人们使用社交媒体网站共享信息。例如，Twitter是一个平台，用户发送，读取称为Tweets的帖子并与不同社区进行交互。用户分享他们的日常生活，对品牌和地点等所有事物发表意见。公司可以通过收集与意见相关的数据从这个庞大的平台中受益。本文的目的是提出一个模型，该模型可以对Twitter收集的真实数据进行情感分析。 Twitter中的数据是高度非结构化的，这使得很难分析。但是，我们提出的模型与该领域的先前工作不同，因为它结合了监督和无监督的机器学习算法的使用。执行情感分析的过程如下：直接从Twitter API提取，然后清洁和发现执行的数据。之后，将数据馈送到几个模型中，以进行培训。每条推文根据其情绪是正面，负面还是中性的，都根据其情感进行分类。收集了有关麦当劳和肯德基两个主题的数据，以表明哪家餐厅更受欢迎。使用了不同的机器学习算法。使用各种测试指标（如交叉验证和F得分）测试了这些模型的结果。此外，我们的模型在直接从Twitter提取的采矿文本上展示了强大的性能。

Nowadays, people from all around the world use social media sites to share information. Twitter for example is a platform in which users send, read posts known as tweets and interact with different communities. Users share their daily lives, post their opinions on everything such as brands and places. Companies can benefit from this massive platform by collecting data related to opinions on them. The aim of this paper is to present a model that can perform sentiment analysis of real data collected from Twitter. Data in Twitter is highly unstructured which makes it difficult to analyze. However, our proposed model is different from prior work in this field because it combined the use of supervised and unsupervised machine learning algorithms. The process of performing sentiment analysis as follows: Tweet extracted directly from Twitter API, then cleaning and discovery of data performed. After that the data were fed into several models for the purpose of training. Each tweet extracted classified based on its sentiment whether it is a positive, negative or neutral. Data were collected on two subjects McDonalds and KFC to show which restaurant has more popularity. Different machine learning algorithms were used. The result from these models were tested using various testing metrics like cross validation and f-score. Moreover, our model demonstrates strong performance on mining texts extracted directly from Twitter.

下载PDF全文

下载文献需遵守相关版权规定

论文标题