可解释性和因果关系发现机器学习模型，以预测液压压裂后CBM井的产生

论文标题

可解释性和因果关系发现机器学习模型，以预测液压压裂后CBM井的产生

Interpretability and causal discovery of the machine learning models to predict the production of CBM wells after hydraulic fracturing

论文作者

Min, Chao, Wen, Guoquan, Gou, Liangjie, Li, Xiaogang, Yang, Zhaozhong

论文摘要

机器学习方法在液压压裂后的CBM井的生产预测中得到了广泛的研究，但由于概括能力低和缺乏可解释性，仅在实践中使用。在本文中提出了一种新颖的方法，以发现观察到的数据中的潜在因果关系，该数据旨在找到一种间接的方法来解释机器学习结果。基于因果发现理论，通过显式输入，输出，处理和混杂变量得出了因果图。然后，使用SHAP来分析因素对生产能力的影响，从而间接解释了机器学习模型。所提出的方法可以捕获因子与输出之间的基本非线性关系，这可以根据因素的相关分析来弥补传统机器学习程序的限制。 CBM数据的实验表明，通过提出的方法，生产与地质/工程因素之间的检测关系与实际的物理机制一致。同时，与传统方法相比，可解释的机器学习模型在预测生产能力方面具有更好的性能，其准确性平均提高了20％。

Machine learning approaches are widely studied in the production prediction of CBM wells after hydraulic fracturing, but merely used in practice due to the low generalization ability and the lack of interpretability. A novel methodology is proposed in this article to discover the latent causality from observed data, which is aimed at finding an indirect way to interpret the machine learning results. Based on the theory of causal discovery, a causal graph is derived with explicit input, output, treatment and confounding variables. Then, SHAP is employed to analyze the influence of the factors on the production capability, which indirectly interprets the machine learning models. The proposed method can capture the underlying nonlinear relationship between the factors and the output, which remedies the limitation of the traditional machine learning routines based on the correlation analysis of factors. The experiment on the data of CBM shows that the detected relationship between the production and the geological/engineering factors by the presented method, is coincident with the actual physical mechanism. Meanwhile, compared with traditional methods, the interpretable machine learning models have better performance in forecasting production capability, averaging 20% improvement in accuracy.

下载PDF全文

下载文献需遵守相关版权规定

论文标题