S2RMS：空间结构的复发模块

论文标题

S2RMS：空间结构的复发模块

S2RMs: Spatially Structured Recurrent Modules

论文作者

Rahaman, Nasim, Goyal, Anirudh, Gondal, Muhammad Waleed, Wuthrich, Manuel, Bauer, Stefan, Sharma, Yash, Bengio, Yoshua, Schölkopf, Bernhard

论文摘要

通过适当的归纳偏差捕获数据生成过程的结构，可以帮助学习良好概括并强大的输入分布变化的模型。尽管利用空间和时间结构的方法发现了广泛的应用，但最近的工作证明了使用少数相互作用的模块合奏利用稀疏和模块化结构的模型的潜力。在这项工作中，我们迈出了能够同时利用模块化和时空结构的动态模型。我们通过将建模的动态系统抽象为自主且稀疏相互作用的子系统的集合来实现这一目标。子系统根据所学的拓扑相互作用，但也由基础现实系统的空间结构所告知。这导致了一类模型，这些模型非常适合建模系统的动力学，这些系统仅提供局部视图，以及这些视图的相应空间位置。关于从挑战性的starcraft2域中的部分观察结果，从裁剪框架和多代理世界建模的视频预测任务上，我们发现我们的模型对可用视图的数量更加强大，并且可以更好地概括到没有额外培训的新任务的情况下，即使与在训练分配中表现出色或更好地相比，我们的模型也可以进行额外的培训。

Capturing the structure of a data-generating process by means of appropriate inductive biases can help in learning models that generalize well and are robust to changes in the input distribution. While methods that harness spatial and temporal structures find broad application, recent work has demonstrated the potential of models that leverage sparse and modular structure using an ensemble of sparingly interacting modules. In this work, we take a step towards dynamic models that are capable of simultaneously exploiting both modular and spatiotemporal structures. We accomplish this by abstracting the modeled dynamical system as a collection of autonomous but sparsely interacting sub-systems. The sub-systems interact according to a topology that is learned, but also informed by the spatial structure of the underlying real-world system. This results in a class of models that are well suited for modeling the dynamics of systems that only offer local views into their state, along with corresponding spatial locations of those views. On the tasks of video prediction from cropped frames and multi-agent world modeling from partial observations in the challenging Starcraft2 domain, we find our models to be more robust to the number of available views and better capable of generalization to novel tasks without additional training, even when compared against strong baselines that perform equally well or better on the training distribution.

下载PDF全文

下载文献需遵守相关版权规定

论文标题