论文标题
基于级联回声室检测
Cascade-based Echo Chamber Detection
论文作者
论文摘要
尽管社交媒体中的Echo Chambers受到了相当大的审查,但仍缺少用于检测和分析的一般模型。在这项工作中,我们旨在通过提出一种概率生成模型来填补这一空白,该模型通过一系列潜在社区来解释社交媒体足迹(即社交网络结构和信息的传播),其特征在于一定程度的回声室行为和观点极性。具体而言,回声室被建模为可渗透到具有相似意识形态极性的信息的社区,并且不受相反倾向的信息的不渗透:这允许将回声室与缺乏明确意识形态保持一致的社区区分。 为了了解模型参数,我们提出了对广义期望最大化算法的可扩展的随机适应,该算法优化了观察社会联系和信息传播的关节可能性。合成数据的实验表明,我们的算法能够及其具有回声室行为和意见极性的程度正确地重建地面真相社区。关于两极分化的社会和政治辩论的现实数据的实验,例如英国脱欧公投或COVID-19疫苗运动,证实了我们提议在检测Echo Chambers方面的有效性。最后,我们展示了我们的模型如何提高辅助预测任务的准确性,例如立场检测和未来传播的预测。
Despite echo chambers in social media have been under considerable scrutiny, general models for their detection and analysis are missing. In this work, we aim to fill this gap by proposing a probabilistic generative model that explains social media footprints -- i.e., social network structure and propagations of information -- through a set of latent communities, characterized by a degree of echo-chamber behavior and by an opinion polarity. Specifically, echo chambers are modeled as communities that are permeable to pieces of information with similar ideological polarity, and impermeable to information of opposed leaning: this allows discriminating echo chambers from communities that lack a clear ideological alignment. To learn the model parameters we propose a scalable, stochastic adaptation of the Generalized Expectation Maximization algorithm, that optimizes the joint likelihood of observing social connections and information propagation. Experiments on synthetic data show that our algorithm is able to correctly reconstruct ground-truth latent communities with their degree of echo-chamber behavior and opinion polarity. Experiments on real-world data about polarized social and political debates, such as the Brexit referendum or the COVID-19 vaccine campaign, confirm the effectiveness of our proposal in detecting echo chambers. Finally, we show how our model can improve accuracy in auxiliary predictive tasks, such as stance detection and prediction of future propagations.