Paper Title

An Exploratory Study of Tweets about the SARS-CoV-2 Omicron Variant: Insights from Sentiment Analysis, Language Interpretation, Source Tracking, Type Classification, and Embedded URL Detection

Authors

Thakur, Nirmalya; Han, Chia Y.

Abstract

This paper presents the findings of an exploratory study of the Big Data continuously generated on Twitter through the sharing of information, news, views, opinions, ideas, feedback, and experiences about the COVID-19 pandemic, with a specific focus on the Omicron variant, which is the globally dominant variant of SARS-CoV-2 at this time. A total of 12,028 tweets about the Omicron variant were studied, and the characteristics analyzed were sentiment, language, source, type, and embedded URLs. The findings of this study are manifold. First, sentiment analysis showed that 50.5% of the tweets had a neutral emotion. The other emotions (bad, good, terrible, and great) were found in 15.6%, 14.0%, 12.5%, and 7.5% of the tweets, respectively. Second, language interpretation showed that 65.9% of the tweets were posted in English, followed by Spanish, French, Italian, and other languages. Third, source tracking showed that Twitter for Android was associated with 35.2% of the tweets, followed by Twitter Web App, Twitter for iPhone, Twitter for iPad, and other sources. Fourth, studying the type of tweets revealed that retweets accounted for 60.8% of the tweets, followed by original tweets and replies, which accounted for 19.8% and 19.4%, respectively. Fifth, in terms of embedded URL analysis, the most common domain embedded in the tweets was twitter.com, followed by biorxiv.org, nature.com, and other domains. Finally, to support similar research in this field, we have developed a Twitter dataset that comprises more than 500,000 tweets about the SARS-CoV-2 Omicron variant posted since the first detected case of this variant on November 24, 2021.
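The abstract does not describe how the type classification and embedded URL detection were implemented, so the following is only a minimal Python sketch of how such steps could look. It assumes each tweet is a dictionary with a `text` field and an `in_reply_to_status_id` field, that retweets can be recognized by a leading `RT @`, and that URLs appear in expanded form in the text. The function names and the sample tweets are hypothetical and are not taken from the paper or its dataset.

```python
import re
from collections import Counter
from typing import Dict, Iterable, List
from urllib.parse import urlparse

# Simple pattern for http(s) URLs embedded in tweet text (an assumption;
# real tweets often carry shortened t.co links that need expanding first).
URL_PATTERN = re.compile(r"https?://\S+")


def classify_tweet_type(tweet: dict) -> str:
    """Label a tweet as 'retweet', 'reply', or 'original' (hypothetical rule)."""
    if tweet.get("text", "").startswith("RT @"):
        return "retweet"
    if tweet.get("in_reply_to_status_id") is not None:
        return "reply"
    return "original"


def extract_domains(text: str) -> List[str]:
    """Return the domains of all URLs found in the text, without 'www.'."""
    domains = []
    for url in URL_PATTERN.findall(text):
        # Drop trailing punctuation that is part of the sentence, not the URL.
        host = urlparse(url.rstrip(".,;:!?)")).netloc.lower()
        if host.startswith("www."):
            host = host[4:]
        if host:
            domains.append(host)
    return domains


def percentage_breakdown(labels: Iterable[str]) -> Dict[str, float]:
    """Convert a sequence of labels into percentages rounded to one decimal."""
    counts = Counter(labels)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {label: round(100 * n / total, 1) for label, n in counts.items()}


# Hypothetical sample tweets, for illustration only.
sample_tweets = [
    {"text": "RT @user: New Omicron preprint https://www.biorxiv.org/content/abc",
     "in_reply_to_status_id": None},
    {"text": "Omicron update, see https://twitter.com/i/status/1.",
     "in_reply_to_status_id": None},
    {"text": "@friend thanks for sharing", "in_reply_to_status_id": 123},
    {"text": "RT @lab: https://www.nature.com/articles/x",
     "in_reply_to_status_id": None},
]

type_shares = percentage_breakdown(classify_tweet_type(t) for t in sample_tweets)
domain_counts = Counter(d for t in sample_tweets for d in extract_domains(t["text"]))
```

Applied to a full collection of tweets, `type_shares` would give a breakdown comparable in form to the retweet, original, and reply percentages reported above, and `domain_counts.most_common()` would rank the embedded domains. The values themselves depend entirely on the data and on the classification rules assumed here.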
