Paper Title

Summarising and Comparing Agent Dynamics with Contrastive Spatiotemporal Abstraction

Authors

Tom Bewley, Jonathan Lawry, Arthur Richards

Abstract

We introduce a data-driven, model-agnostic technique for generating a human-interpretable summary of the salient points of contrast within an evolving dynamical system, such as the learning process of a control agent. It involves the aggregation of transition data along both spatial and temporal dimensions according to an information-theoretic divergence measure. A practical algorithm is outlined for continuous state spaces, and deployed to summarise the learning histories of deep reinforcement learning agents with the aid of graphical and textual communication methods. We expect our method to be complementary to existing techniques in the realm of agent interpretability.
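To make the idea of comparing aggregated transition data concrete, here is a minimal sketch under stated assumptions. It is not the authors' algorithm: the abstract does not name the divergence measure or the partitioning scheme, so this sketch assumes the Jensen-Shannon divergence and a fixed one-dimensional grid of cell boundaries. The function names (`abstract_state`, `transition_distribution`, `js_divergence`) and the toy data are hypothetical, chosen only for illustration.

```python
import math
from collections import Counter


def abstract_state(x, edges):
    """Map a scalar state to the index of the spatial cell that contains it.

    `edges` is a sorted list of cell boundaries (an assumed, fixed partition).
    """
    return sum(x >= e for e in edges)


def transition_distribution(transitions, edges):
    """Empirical distribution over (cell, next_cell) pairs for one time window."""
    counts = Counter(
        (abstract_state(s, edges), abstract_state(s_next, edges))
        for s, s_next in transitions
    )
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two dict-based distributions.

    The value lies in [0, 1]; 0 means the two distributions are identical.
    """
    jsd = 0.0
    for key in set(p) | set(q):
        pk, qk = p.get(key, 0.0), q.get(key, 0.0)
        mk = 0.5 * (pk + qk)  # mixture distribution
        if pk > 0:
            jsd += 0.5 * pk * math.log2(pk / mk)
        if qk > 0:
            jsd += 0.5 * qk * math.log2(qk / mk)
    return jsd


# Toy transition data (state, next_state) from two hypothetical time windows.
early_window = [(0.1, 0.2), (0.2, 0.1), (0.3, 0.2)]
late_window = [(0.1, 0.6), (0.6, 0.9), (0.9, 0.8)]
edges = [0.5]  # assumed partition: two cells, split at 0.5

p_early = transition_distribution(early_window, edges)
p_late = transition_distribution(late_window, edges)
contrast = js_divergence(p_early, p_late)
```

In this sketch, a larger `contrast` means the two windows behave more differently under the chosen partition. A method of the kind the abstract describes would presumably search over spatial and temporal boundaries to bring out such contrasts, rather than fixing them in advance as done here.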
