一种新型的半监督数据驱动的数据驱动方法，用于使用未标记的数据诊断

论文标题

一种新型的半监督数据驱动的数据驱动方法，用于使用未标记的数据诊断

A Novel Semi-Supervised Data-Driven Method for Chiller Fault Diagnosis with Unlabeled Data

论文作者

Li, Bingxu, Cheng, Fanyong, Zhang, Xin, Cui, Can, Cai, Wenjian

论文摘要

在实用冷却器系统中，应用有效的故障诊断技术可以显着降低能耗并提高建筑物的能源效率。现有方法用于过错诊断冷水机的成功取决于可以训练足够标记的数据的条件。但是，在实践中，标签的获取既费力又昂贵。通常，标记的数据的数量有限，大多数可用数据是未标记的。现有方法无法利用未标记数据中包含的信息，这显着限制了冷却器系统中故障诊断性能的改善。为了有效利用未标记的数据来进一步改善故障诊断性能并降低对标记数据的依赖性，我们提出了一种基于半基础的对逆念网络的新型半监督数据驱动的数据驱动的错误诊断方法，该方法将未标记和标记的数据纳入学习过程中。半生成对抗网络可以从未标记的数据中学习数据分布信息，此信息可以有助于显着提高诊断性能。实验结果证明了该方法的有效性。在只有80个标记的样品和16000个未标记的样品的情况下，提出的方法可以将诊断精度提高到84％，而监督的基线方法最多只能达到65％的精度。此外，如果有足够的未标记样品，则建议使用最小的标记样品数量减少约60％。

In practical chiller systems, applying efficient fault diagnosis techniques can significantly reduce energy consumption and improve energy efficiency of buildings. The success of the existing methods for fault diagnosis of chillers relies on the condition that sufficient labeled data are available for training. However, label acquisition is laborious and costly in practice. Usually, the number of labeled data is limited and most data available are unlabeled. The existing methods cannot exploit the information contained in unlabeled data, which significantly limits the improvement of fault diagnosis performance in chiller systems. To make effective use of unlabeled data to further improve fault diagnosis performance and reduce the dependency on labeled data, we proposed a novel semi-supervised data-driven fault diagnosis method for chiller systems based on the semi-generative adversarial network, which incorporates both unlabeled and labeled data into learning process. The semi-generative adversarial network can learn the information of data distribution from unlabeled data and this information can help to significantly improve the diagnostic performance. Experimental results demonstrate the effectiveness of the proposed method. Under the scenario that there are only 80 labeled samples and 16000 unlabeled samples, the proposed method can improve the diagnostic accuracy to 84%, while the supervised baseline methods only reach the accuracy of 65% at most. Besides, the minimal required number of labeled samples can be reduced by about 60% with the proposed method when there are enough unlabeled samples.

下载PDF全文

下载文献需遵守相关版权规定

论文标题