论文标题

量化在线研究的长期兴趣

Quantifying the Online Long-Term Interest in Research

论文作者

Shahzad, Murtuza, Alhoori, Hamed, Freedman, Reva, Rahman, Shaikh Abdul

论文摘要

在多个在线平台上的数量越来越多。尽管这些文章的学术影响得到了广泛的研究,但在线分享的在线文章确定的在线兴趣尚不清楚。认识到在线上提到的研究文章的时间对研究人员来说可能是有价值的信息。在本文中,我们分析了用户共享和/或讨论学术文章的多个社交媒体平台。我们建立了三个用于论文的集群,基于年度在线提及的发行日期,范围从1920年到2016年。使用这三个群集的每个在线社交媒体指标,我们建立了机器学习模型来预测对研究文章的长期在线兴趣。我们采用两种不同的方法来解决预测任务:回归和分类。对于回归方法,多层感知器模型表现最好,对于分类方法,基于树的模型的性能比其他模型更好。我们发现,在经济和工业的背景下(即专利),旧文章最为明显。相比之下,最近发表的文章在研究平台(即Mendeley)中最为明显,其次是社交媒体平台(即Twitter)。

Research articles are being shared in increasing numbers on multiple online platforms. Although the scholarly impact of these articles has been widely studied, the online interest determined by how long the research articles are shared online remains unclear. Being cognizant of how long a research article is mentioned online could be valuable information to the researchers. In this paper, we analyzed multiple social media platforms on which users share and/or discuss scholarly articles. We built three clusters for papers, based on the number of yearly online mentions having publication dates ranging from the year 1920 to 2016. Using the online social media metrics for each of these three clusters, we built machine learning models to predict the long-term online interest in research articles. We addressed the prediction task with two different approaches: regression and classification. For the regression approach, the Multi-Layer Perceptron model performed best, and for the classification approach, the tree-based models performed better than other models. We found that old articles are most evident in the contexts of economics and industry (i.e., patents). In contrast, recently published articles are most evident in research platforms (i.e., Mendeley) followed by social media platforms (i.e., Twitter).

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源