基于移动传感的情绪推论模型的概括和个性化：对八个国家 /地区的大学生的分析

论文标题

基于移动传感的情绪推论模型的概括和个性化：对八个国家 /地区的大学生的分析

Generalization and Personalization of Mobile Sensing-Based Mood Inference Models: An Analysis of College Students in Eight Countries

论文作者

Meegahapola, Lakmal, Droz, William, Kun, Peter, de Gotzen, Amalia, Nutakki, Chaitanya, Diwakar, Shyam, Correa, Salvador Ruiz, Song, Donglei, Xu, Hao, Bidoglia, Miriam, Gaskell, George, Chagnaa, Altangerel, Ganbold, Amarsanaa, Zundui, Tsolmon, Caprini, Carlo, Miorandi, Daniele, Hume, Alethia, Zarza, Jose Luis, Cernuzzi, Luca, Bison, Ivano, Britez, Marcelo Rodas, Busso, Matteo, Chenu-Abente, Ronald, Gunel, Can, Giunchiglia, Fausto, Schelenz, Laura, Gatica-Perez, Daniel

论文摘要

在过去的十年中，在Ubicomp文献中已经研究了对移动传感数据的情绪推论。该推论可以在一般移动应用程序中进行上下文感知和个性化的用户体验，以及在移动健康应用程序中的有价值的反馈和干预措施。但是，尽管在许多研究中都强调了模型的概括问题，但重点一直是使用不同的感应方式和机器学习技术改善模型的准确性，并在同质种群中收集了数据集。相比之下，对研究情绪推理模型的性能的关注较少，以评估模型是否推广到新国家。在这项研究中，我们收集了一个来自八个国家（中国，丹麦，印度，意大利，意大利，墨西哥，墨西哥，蒙古，英国巴拉圭）的678名参与者的329k自我报告的移动传感数据集，以评估地理多样性对情绪推理模型的影响。 We define and evaluate country-specific (trained and tested within a country), continent-specific (trained and tested within a continent), country-agnostic (tested on a country not seen on training data), and multi-country (trained and tested with multiple countries) approaches trained on sensor data for two mood inference tasks with population-level (non-personalized) and hybrid (partially personalized) models.我们表明，部分个性化的国家特异性模型在接收器操作特征曲线（AUROC）下执行最佳的屈服区域，范围为0.78-0.98，用于两类（负相对于正价），对于三类（负与中性与正价）的分数为0.76-0.94。总体而言，我们发现了对新国家的情绪推论模型的概括问题，以及国家的地理相似性如何影响情绪推论。

Mood inference with mobile sensing data has been studied in ubicomp literature over the last decade. This inference enables context-aware and personalized user experiences in general mobile apps and valuable feedback and interventions in mobile health apps. However, even though model generalization issues have been highlighted in many studies, the focus has always been on improving the accuracies of models using different sensing modalities and machine learning techniques, with datasets collected in homogeneous populations. In contrast, less attention has been given to studying the performance of mood inference models to assess whether models generalize to new countries. In this study, we collected a mobile sensing dataset with 329K self-reports from 678 participants in eight countries (China, Denmark, India, Italy, Mexico, Mongolia, Paraguay, UK) to assess the effect of geographical diversity on mood inference models. We define and evaluate country-specific (trained and tested within a country), continent-specific (trained and tested within a continent), country-agnostic (tested on a country not seen on training data), and multi-country (trained and tested with multiple countries) approaches trained on sensor data for two mood inference tasks with population-level (non-personalized) and hybrid (partially personalized) models. We show that partially personalized country-specific models perform the best yielding area under the receiver operating characteristic curve (AUROC) scores of the range 0.78-0.98 for two-class (negative vs. positive valence) and 0.76-0.94 for three-class (negative vs. neutral vs. positive valence) inference. Overall, we uncover generalization issues of mood inference models to new countries and how the geographical similarity of countries might impact mood inference.

下载PDF全文

下载文献需遵守相关版权规定

论文标题