论文标题
可扩展的图形网络用于粒子模拟
Scalable Graph Networks for Particle Simulations
论文作者
论文摘要
直接从观察结果中学习系统动态是机器学习的一个有希望的方向,因为它有可能增强我们理解物理系统的能力。但是,由于存在非线性电位和许多相互作用,许多现实世界系统的动力学要具有挑战性,这些相互作用与粒子$ n $的数量相互缩放,如n体问题而言。在这项工作中,我们引入了一种将完全连接的交互图转换为层次结构图的方法,该图将边缘数量减少到$ o(n)$。这会导致线性时间和空间的复杂性,而层次图的预计数需要$ o(n \ log(n))$时间和$ o(n)$ space。使用我们的方法,即使在单个GPU上,我们也能够在更大的粒子计数上训练模型。我们评估相空间位置的精度和能量保护如何取决于模拟粒子的数量。即使在大规模的重力N体模拟上,我们的方法仍保持高精度和效率,如果使用完全连接的图,则无法在单台计算机上运行。在模拟库仑相互作用时,还可以观察到类似的结果。此外,我们对该新的分层模型的性能进行了几个重要的观察,包括:i)它的准确性随着模拟中的颗粒数量而提高,而ii)它对未见粒子计数的概括也比使用所有$ o(n^2)$交互的模型要好得多。
Learning system dynamics directly from observations is a promising direction in machine learning due to its potential to significantly enhance our ability to understand physical systems. However, the dynamics of many real-world systems are challenging to learn due to the presence of nonlinear potentials and a number of interactions that scales quadratically with the number of particles $N$, as in the case of the N-body problem. In this work, we introduce an approach that transforms a fully-connected interaction graph into a hierarchical one which reduces the number of edges to $O(N)$. This results in linear time and space complexity while the pre-computation of the hierarchical graph requires $O(N\log (N))$ time and $O(N)$ space. Using our approach, we are able to train models on much larger particle counts, even on a single GPU. We evaluate how the phase space position accuracy and energy conservation depend on the number of simulated particles. Our approach retains high accuracy and efficiency even on large-scale gravitational N-body simulations which are impossible to run on a single machine if a fully-connected graph is used. Similar results are also observed when simulating Coulomb interactions. Furthermore, we make several important observations regarding the performance of this new hierarchical model, including: i) its accuracy tends to improve with the number of particles in the simulation and ii) its generalisation to unseen particle counts is also much better than for models that use all $O(N^2)$ interactions.