论文标题
深度贝叶斯转移学习的惩罚复杂性,并应用于材料信息学
A penalized complexity prior for deep Bayesian transfer learning with application to materials informatics
论文作者
论文摘要
材料信息学领域的一个关键任务是使用机器学习来预测材料的属性和功能。快速准确的预测模型使研究人员能够更有效地识别或构建具有理想特性的材料。与许多领域一样,深度学习是最新的方法之一,但是由于限制了数据可用性,计算资源和时间,深度学习模型在材料信息学上并不总是可行的。因此,将深度学习应用于材料信息学问题以开发有效的转移学习算法存在迫切需要。贝叶斯框架是自然的转移学习,因为从源数据中训练的模型可以在先前的分布中编码为目标任务。但是,在文献中,贝叶斯关于转移学习的看法相对不明显,并且对于深度学习而言是复杂的,因为参数空间很大,并且对单个参数的解释尚不清楚。因此,我们不是基于对单个参数的主观先验分布,而是基于对源和目标任务的预测模型之间的kullback-leibler差异的惩罚复杂性提出了一种新的贝叶斯转移学习方法。我们通过模拟显示,所提出的方法在各种设置上都优于其他转移学习方法。然后将新方法应用于预测材料科学问题,在该问题中,我们显示了根据其结构特性估算材料的频带间隙的提高精度。
A key task in the emerging field of materials informatics is to use machine learning to predict a material's properties and functions. A fast and accurate predictive model allows researchers to more efficiently identify or construct a material with desirable properties. As in many fields, deep learning is one of the state-of-the art approaches, but fully training a deep learning model is not always feasible in materials informatics due to limitations on data availability, computational resources, and time. Accordingly, there is a critical need in the application of deep learning to materials informatics problems to develop efficient transfer learning algorithms. The Bayesian framework is natural for transfer learning because the model trained from the source data can be encoded in the prior distribution for the target task of interest. However, the Bayesian perspective on transfer learning is relatively unaccounted for in the literature, and is complicated for deep learning because the parameter space is large and the interpretations of individual parameters are unclear. Therefore, rather than subjective prior distributions for individual parameters, we propose a new Bayesian transfer learning approach based on the penalized complexity prior on the Kullback-Leibler divergence between the predictive models of the source and target tasks. We show via simulations that the proposed method outperforms other transfer learning methods across a variety of settings. The new method is then applied to a predictive materials science problem where we show improved precision for estimating the band gap of a material based on its structural properties.