论文标题
人工智能实时预测和物理解释纳米级金属簇中的原子结合能
Artificial intelligence real-time prediction and physical interpretation of atomic binding energies in nano-scale metal clusters
论文作者
论文摘要
单个原子位点通常决定材料的功能和性能,例如催化剂,半导体或酶。因此,计算和理解此类站点的特性是理性材料设计过程的关键组成部分。由于原子水平上的复杂电子效应,原子位点特性通常从计算昂贵的第一原则计算得出,因为需要这种理论水平才能达到相关的准确性。在这项研究中,我们提出了一种广泛适用的机器学习方法(ML),以实时计算高精度。该方法适用于复杂的非晶体原子结构,因此为纳米材料,无定形系统和材料界面的高通量筛选提供了可能性。我们的方法包括一个可靠的特征方案,可以将原子结构转换为可以由普通机器学习模型使用的特征。进行基于遗传算法(GA)的特征选择,我们展示了如何建立ML模型隐含的结构 - 培训关系的直观物理解释。通过这种方法,我们计算金属纳米颗粒的原子位点稳定性,范围从3-55个原子实时范围为0.11-0.14 eV范围。我们还建立了该位点的化学身份是确定原子位点稳定性的最重要因素,其次是结构特征,例如键距离和角度。两者都在开源python模块中发布了特征化和GA特征选择功能。通过这种方法,我们通过数据驱动的材料筛选实现了高度专业的现实世界纳米催化剂的有效合理设计。
Single atomic sites often determine the functionality and performance of materials, such as catalysts, semi-conductors or enzymes. Computing and understanding the properties of such sites is therefore a crucial component of the rational materials design process. Because of complex electronic effects at the atomic level, atomic site properties are conventionally derived from computationally expensive first-principle calculations, as this level of theory is required to achieve relevant accuracy. In this study, we present a widely applicable machine learning (ML) approach to compute atomic site properties with high accuracy in real time. The approach works well for complex non-crystalline atomic structures and therefore opens up the possibility for high-throughput screenings of nano-materials, amorphous systems and materials interfaces. Our approach includes a robust featurization scheme to transform atomic structures into features which can be used by common machine learning models. Performing a genetic algorithm (GA) based feature selection, we show how to establish an intuitive physical interpretation of the structure-property relations implied by the ML models. With this approach, we compute atomic site stabilities of metal nanoparticles ranging from 3-55 atoms with mean absolute errors in the range of 0.11-0.14 eV in real time. We also establish the chemical identity of the site as most important factor in determining atomic site stabilities, followed by structural features like bond distances and angles. Both, the featurization and GA feature selection functionality are published in open-source python modules. With this method, we enable the efficient rational design of highly specialized real-world nano-catalysts through data-driven materials screening.