论文标题

在资源有限的设置中,使用X射线图像的Covid-19和结核病检测的深层和机器学习模型的有效混合物

An Efficient Mixture of Deep and Machine Learning Models for COVID-19 and Tuberculosis Detection Using X-Ray Images in Resource Limited Settings

论文作者

Al-Timemy, Ali H., Khushaba, Rami N., Mosa, Zahraa M., Escudero, Javier

论文摘要

前线的临床医生需要快速评估症状患者是否确实患有Covid-19。在低资源设置中可能无法访问生物技术测试,这项任务的困难会加剧。此外,在几个低收入和中等收入国家中,结核病(TB)仍然是一个主要的健康问题,其常见症状包括发烧,咳嗽和疲倦,类似于Covid-19。为了帮助检测Covid-19,我们提出了从胸部X射线图像中提取深度功能(DF),这是大多数医院中可用的技术,以及它们随后使用不需要大量计算资源的机器学习方法进行分类。我们编制了一个五级X射线胸部图像的数据集,包括平衡数量的Covid-19,病毒性肺炎,细菌性肺炎,结核病和健康病例。我们比较了将14个单个最先进的预训练的深层网络与传统机器学习分类器提取的DF提取的管道的性能。由Resnet-50组成的DF计算和子空间判别分类器集合的管道是五个类别分类的最佳性能,可实现91.6 + 2.6%的检测准确性(准确性 + 95%的置信区间)。此外,在简单的三级和两级分类问题中,同一管道的精度为98.6+1.4%和99.9+0.5%,重点是区分Covid-19,TB和健康病例;和Covid-19和健康图像。该管道在计算上是高效的,仅需0.19秒即可每X射线图像提取DF,并且在CPU机器上使用超过2000张图像的传统分类器训练传统分类器。结果表明,使用我们的管道在检测Covid-19中的潜在好处,尤其是在资源有限的设置中,并且可以使用有限的计算资源运行。

Clinicians in the frontline need to assess quickly whether a patient with symptoms indeed has COVID-19 or not. The difficulty of this task is exacerbated in low resource settings that may not have access to biotechnology tests. Furthermore, Tuberculosis (TB) remains a major health problem in several low- and middle-income countries and its common symptoms include fever, cough and tiredness, similarly to COVID-19. In order to help in the detection of COVID-19, we propose the extraction of deep features (DF) from chest X-ray images, a technology available in most hospitals, and their subsequent classification using machine learning methods that do not require large computational resources. We compiled a five-class dataset of X-ray chest images including a balanced number of COVID-19, viral pneumonia, bacterial pneumonia, TB, and healthy cases. We compared the performance of pipelines combining 14 individual state-of-the-art pre-trained deep networks for DF extraction with traditional machine learning classifiers. A pipeline consisting of ResNet-50 for DF computation and ensemble of subspace discriminant classifier was the best performer in the classification of the five classes, achieving a detection accuracy of 91.6+ 2.6% (accuracy + 95% Confidence Interval). Furthermore, the same pipeline achieved accuracies of 98.6+1.4% and 99.9+0.5% in simpler three-class and two-class classification problems focused on distinguishing COVID-19, TB and healthy cases; and COVID-19 and healthy images, respectively. The pipeline was computationally efficient requiring just 0.19 second to extract DF per X-ray image and 2 minutes for training a traditional classifier with more than 2000 images on a CPU machine. The results suggest the potential benefits of using our pipeline in the detection of COVID-19, particularly in resource-limited settings and it can run with limited computational resources.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源