基于内省的可解释的强化学习在情节和非剧本场景中

论文标题

基于内省的可解释的强化学习在情节和非剧本场景中

Introspection-based Explainable Reinforcement Learning in Episodic and Non-episodic Scenarios

论文作者

Schroeter, Niclas, Cruz, Francisco, Wermter, Stefan

论文摘要

随着机器人系统和人类机器人环境在当今社会中的越来越多，了解机器人采取的行动背后的推理变得越来越重要。为了提高这种理解，向用户提供了有关为什么采取特定措施的解释。除其他效果外，这些解释改善了用户在其机器人合作伙伴中的信任。创建这些解释的一种选择是一种基于内省的方法，可以与加强学习剂一起使用，以提供成功的概率。这些反过来又可以用来以人为理解的方式推理代理商采取的行动。在这项工作中，这种基于内省的方法是根据情节和非剧本机器人模拟任务进一步开发和评估的。此外，提出了Q值的附加归一化步骤，该步骤可以使基于内省的方法在负面和相对较小的Q值中使用。获得的结果表明，内省对情节机器人技术任务的生存能力，此外，基于内省的方法也可用于生成对非疾病机器人环境中采取的动作的解释。

With the increasing presence of robotic systems and human-robot environments in today's society, understanding the reasoning behind actions taken by a robot is becoming more important. To increase this understanding, users are provided with explanations as to why a specific action was taken. Among other effects, these explanations improve the trust of users in their robotic partners. One option for creating these explanations is an introspection-based approach which can be used in conjunction with reinforcement learning agents to provide probabilities of success. These can in turn be used to reason about the actions taken by the agent in a human-understandable fashion. In this work, this introspection-based approach is developed and evaluated further on the basis of an episodic and a non-episodic robotics simulation task. Furthermore, an additional normalization step to the Q-values is proposed, which enables the usage of the introspection-based approach on negative and comparatively small Q-values. Results obtained show the viability of introspection for episodic robotics tasks and, additionally, that the introspection-based approach can be used to generate explanations for the actions taken in a non-episodic robotics environment as well.

下载PDF全文

下载文献需遵守相关版权规定

论文标题