论文标题
深度学习间潜能的快速不确定性估计
Fast Uncertainty Estimates in Deep Learning Interatomic Potentials
论文作者
论文摘要
深度学习已成为一种有希望的范式,可以访问分子和材料特性的高度准确预测。然而,当前方法共享的一个常见的缩写是,神经网络仅给出其预测的点估计,而与这些估计相关的预测不确定性并不带来。现有的不确定性量化工作主要利用了独立训练的神经网络整体的预测的标准偏差。这会在培训和预测中造成大量的计算开销,通常会导致更昂贵的预测顺序。在这里,我们提出了一种基于单个神经网络的预测不确定性的方法,而无需合奏。这使我们能够获得不确定性估计值,而在标准培训和推理上几乎没有其他计算开销。我们证明,不确定性估计的质量与从深层合奏中获得的质量相匹配。我们进一步研究了我们测试系统的整个配置空间的方法和深层集合的不确定性估计,并将不确定性与势能表面进行了比较。最后,我们研究该方法在主动学习环境中的疗效,并找到结果以降低计算成本的速度降低的基于整体的策略。
Deep learning has emerged as a promising paradigm to give access to highly accurate predictions of molecular and materials properties. A common short-coming shared by current approaches, however, is that neural networks only give point estimates of their predictions and do not come with predictive uncertainties associated with these estimates. Existing uncertainty quantification efforts have primarily leveraged the standard deviation of predictions across an ensemble of independently trained neural networks. This incurs a large computational overhead in both training and prediction that often results in order-of-magnitude more expensive predictions. Here, we propose a method to estimate the predictive uncertainty based on a single neural network without the need for an ensemble. This allows us to obtain uncertainty estimates with virtually no additional computational overhead over standard training and inference. We demonstrate that the quality of the uncertainty estimates matches those obtained from deep ensembles. We further examine the uncertainty estimates of our methods and deep ensembles across the configuration space of our test system and compare the uncertainties to the potential energy surface. Finally, we study the efficacy of the method in an active learning setting and find the results to match an ensemble-based strategy at order-of-magnitude reduced computational cost.