论文标题
通过复发性神经网络预测心理任务中的人类决策
Predicting human decision making in psychological tasks with recurrent neural networks
论文作者
论文摘要
与传统的时间序列不同,人类决策的动作序列通常涉及许多认知过程,例如信念,欲望,意图和心理理论,即其他人在想什么。这使得预测人类决策具有挑战性,以对基本的心理机制进行不利对待。我们在这里建议使用基于长期短期记忆网络(LSTM)的经常性神经网络体系结构来预测从事游戏活动的人类受试者采取的行动的时间序列,这是该研究领域中这种方法的首次应用。在这项研究中,我们从8个发表的文献中对人类数据进行了整理,其中包括168,386个个人决策,并将其后处理分为8,257个行为轨迹,每两名球员的行为轨迹9个行为。同样,我们从爱荷华州赌博任务实验的10种不同的已发表研究中,对健康的人类受试者进行了65种动作的617个轨迹。我们在行为数据上培训了我们的预测网络,并在爱荷华州赌博任务的单一代理方案和迭代囚犯的难题的多代理情景中都在预测人类决策轨迹方面证明了明显的优势。此外,我们观察到,与表现不佳的人相比,表现最佳的LSTM网络的权重往往具有更广泛的分布,以及更大的偏见,这表明对每组采用的策略分布的分布可能解释。
Unlike traditional time series, the action sequences of human decision making usually involve many cognitive processes such as beliefs, desires, intentions, and theory of mind, i.e., what others are thinking. This makes predicting human decision-making challenging to be treated agnostically to the underlying psychological mechanisms. We propose here to use a recurrent neural network architecture based on long short-term memory networks (LSTM) to predict the time series of the actions taken by human subjects engaged in gaming activity, the first application of such methods in this research domain. In this study, we collate the human data from 8 published literature of the Iterated Prisoner's Dilemma comprising 168,386 individual decisions and post-process them into 8,257 behavioral trajectories of 9 actions each for both players. Similarly, we collate 617 trajectories of 95 actions from 10 different published studies of Iowa Gambling Task experiments with healthy human subjects. We train our prediction networks on the behavioral data and demonstrate a clear advantage over the state-of-the-art methods in predicting human decision-making trajectories in both the single-agent scenario of the Iowa Gambling Task and the multi-agent scenario of the Iterated Prisoner's Dilemma. Moreover, we observe that the weights of the LSTM networks modeling the top performers tend to have a wider distribution compared to poor performers, as well as a larger bias, which suggest possible interpretations for the distribution of strategies adopted by each group.