论文标题

电池模型校准,深入增强学习

Battery Model Calibration with Deep Reinforcement Learning

论文作者

Unagar, Ajaykumar, Tian, Yuan, Arias-Chao, Manuel, Fink, Olga

论文摘要

锂离子(LI-I)电池最近变得普遍,并用于许多物理资产。为了很好地预测电池放电的结束,已经开发了详细的电化学LI-I电池模型。它们的参数通常在运行之前进行校准,并且通常在操作过程中未重新校准。但是,由于电池的性能受老化的影响,因此计算电池模型与真实物理系统之间的现实差距导致预测不准确。监督的机器学习算法将需要广泛的代表性培训数据集将观察结果映射到地面真相校准参数。对于许多实际应用,这可能是不可行的。在本文中,我们实施了一个基于增强学习的框架,以可靠,有效地推断电池模型的校准参数。该框架实现了计算模型参数的实时推断,以补偿观测值的真人差距。最重要的是,所提出的方法不需要任何标记的数据样本(观察结果和地面真相校准参数)。此外,该框架不需要有关基础物理模型的任何信息。实验结果表明,所提出的方法能够以高精度和高鲁棒性来推断模型参数。尽管所达到的结果与受监督的机器学习获得的结果相当,但它们在培训过程中并不依赖地面真相。

Lithium-Ion (Li-I) batteries have recently become pervasive and are used in many physical assets. To enable a good prediction of the end of discharge of batteries, detailed electrochemical Li-I battery models have been developed. Their parameters are typically calibrated before they are taken into operation and are typically not re-calibrated during operation. However, since battery performance is affected by aging, the reality gap between the computational battery models and the real physical systems leads to inaccurate predictions. A supervised machine learning algorithm would require an extensive representative training dataset mapping the observation to the ground truth calibration parameters. This may be infeasible for many practical applications. In this paper, we implement a Reinforcement Learning-based framework for reliably and efficiently inferring calibration parameters of battery models. The framework enables real-time inference of the computational model parameters in order to compensate the reality-gap from the observations. Most importantly, the proposed methodology does not need any labeled data samples, (samples of observations and the ground truth calibration parameters). Furthermore, the framework does not require any information on the underlying physical model. The experimental results demonstrate that the proposed methodology is capable of inferring the model parameters with high accuracy and high robustness. While the achieved results are comparable to those obtained with supervised machine learning, they do not rely on the ground truth information during training.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源