在资源有限的设置中，使用X射线图像的Covid-19和结核病检测的深层和机器学习模型的有效混合物

论文标题

在资源有限的设置中，使用X射线图像的Covid-19和结核病检测的深层和机器学习模型的有效混合物

An Efficient Mixture of Deep and Machine Learning Models for COVID-19 and Tuberculosis Detection Using X-Ray Images in Resource Limited Settings

论文作者

Al-Timemy, Ali H., Khushaba, Rami N., Mosa, Zahraa M., Escudero, Javier

论文摘要

前线的临床医生需要快速评估症状患者是否确实患有Covid-19。在低资源设置中可能无法访问生物技术测试，这项任务的困难会加剧。此外，在几个低收入和中等收入国家中，结核病（TB）仍然是一个主要的健康问题，其常见症状包括发烧，咳嗽和疲倦，类似于Covid-19。为了帮助检测Covid-19，我们提出了从胸部X射线图像中提取深度功能（DF），这是大多数医院中可用的技术，以及它们随后使用不需要大量计算资源的机器学习方法进行分类。我们编制了一个五级X射线胸部图像的数据集，包括平衡数量的Covid-19，病毒性肺炎，细菌性肺炎，结核病和健康病例。我们比较了将14个单个最先进的预训练的深层网络与传统机器学习分类器提取的DF提取的管道的性能。由Resnet-50组成的DF计算和子空间判别分类器集合的管道是五个类别分类的最佳性能，可实现91.6 + 2.6％的检测准确性（准确性 + 95％的置信区间）。此外，在简单的三级和两级分类问题中，同一管道的精度为98.6+1.4％和99.9+0.5％，重点是区分Covid-19，TB和健康病例；和Covid-19和健康图像。该管道在计算上是高效的，仅需0.19秒即可每X射线图像提取DF，并且在CPU机器上使用超过2000张图像的传统分类器训练传统分类器。结果表明，使用我们的管道在检测Covid-19中的潜在好处，尤其是在资源有限的设置中，并且可以使用有限的计算资源运行。

Clinicians in the frontline need to assess quickly whether a patient with symptoms indeed has COVID-19 or not. The difficulty of this task is exacerbated in low resource settings that may not have access to biotechnology tests. Furthermore, Tuberculosis (TB) remains a major health problem in several low- and middle-income countries and its common symptoms include fever, cough and tiredness, similarly to COVID-19. In order to help in the detection of COVID-19, we propose the extraction of deep features (DF) from chest X-ray images, a technology available in most hospitals, and their subsequent classification using machine learning methods that do not require large computational resources. We compiled a five-class dataset of X-ray chest images including a balanced number of COVID-19, viral pneumonia, bacterial pneumonia, TB, and healthy cases. We compared the performance of pipelines combining 14 individual state-of-the-art pre-trained deep networks for DF extraction with traditional machine learning classifiers. A pipeline consisting of ResNet-50 for DF computation and ensemble of subspace discriminant classifier was the best performer in the classification of the five classes, achieving a detection accuracy of 91.6+ 2.6% (accuracy + 95% Confidence Interval). Furthermore, the same pipeline achieved accuracies of 98.6+1.4% and 99.9+0.5% in simpler three-class and two-class classification problems focused on distinguishing COVID-19, TB and healthy cases; and COVID-19 and healthy images, respectively. The pipeline was computationally efficient requiring just 0.19 second to extract DF per X-ray image and 2 minutes for training a traditional classifier with more than 2000 images on a CPU machine. The results suggest the potential benefits of using our pipeline in the detection of COVID-19, particularly in resource-limited settings and it can run with limited computational resources.

下载PDF全文

下载文献需遵守相关版权规定

论文标题