批准：评估注释的Reddit职位语料库

论文标题

批准：评估注释的Reddit职位语料库

APPReddit: a Corpus of Reddit Posts Annotated for Appraisal

论文作者

Stranisci, Marco Antonio, Frenda, Simona, Ceccaldi, Eleonora, Basile, Valerio, Damiano, Rossana, Patti, Viviana

论文摘要

尽管有大量的情感识别计算资源，但缺乏依赖评估模型的数据集。根据评估理论，情感是对事件的多维评估的结果。在本文中，我们介绍了批准的，这是根据该理论注释的第一个非实验数据的语料库。在描述了它的发展之后，我们将资源与Enisear进行了比较，Enisear是在实验环境中创建的事件和评估注释。结果表明，尽管数据和注释方案的类型不同，但可以映射这两个语料库。经过批准的培训的SVM模型可以预测四个评估维度而不会造成大量损失。在单个培训集中合并两个语料库可提高4个维度中3个的预测。此类发现为评估预测的更好性能分类模型铺平了道路。

Despite the large number of computational resources for emotion recognition, there is a lack of data sets relying on appraisal models. According to Appraisal theories, emotions are the outcome of a multi-dimensional evaluation of events. In this paper, we present APPReddit, the first corpus of non-experimental data annotated according to this theory. After describing its development, we compare our resource with enISEAR, a corpus of events created in an experimental setting and annotated for appraisal. Results show that the two corpora can be mapped notwithstanding different typologies of data and annotations schemes. A SVM model trained on APPReddit predicts four appraisal dimensions without significant loss. Merging both corpora in a single training set increases the prediction of 3 out of 4 dimensions. Such findings pave the way to a better performing classification model for appraisal prediction.

下载PDF全文

下载文献需遵守相关版权规定

论文标题