论文标题
通过内存驱动的变压器生成放射学报告
Generating Radiology Reports via Memory-driven Transformer
论文作者
论文摘要
医学成像经常用于诊断和治疗的临床实践和试验中。编写成像报告很耗时,对于缺乏经验的放射科医生来说可能是错误的。因此,高度需要自动生成放射学报告,以减轻放射科医生的工作量并促进临床自动化,这是将人工智能应用于医疗领域的重要任务。在本文中,我们建议使用内存驱动的变压器生成放射学报告,其中关系内存旨在记录生成过程的关键信息,并应用了以内存驱动的条件层归一化来将存储器纳入变压器的解码器。关于两个流行的放射学报告数据集IU X射线和MIMIC-CXR的实验结果表明,我们所提出的方法在语言产生指标和临床评估方面都优于先前的模型。特别是,这是我们最了解的Mimic-CXR的第一次报告。进一步的分析还表明,我们的方法能够用必要的医学术语以及有意义的图像文本注意映射生成长期报告。
Medical imaging is frequently used in clinical practice and trials for diagnosis and treatment. Writing imaging reports is time-consuming and can be error-prone for inexperienced radiologists. Therefore, automatically generating radiology reports is highly desired to lighten the workload of radiologists and accordingly promote clinical automation, which is an essential task to apply artificial intelligence to the medical domain. In this paper, we propose to generate radiology reports with memory-driven Transformer, where a relational memory is designed to record key information of the generation process and a memory-driven conditional layer normalization is applied to incorporating the memory into the decoder of Transformer. Experimental results on two prevailing radiology report datasets, IU X-Ray and MIMIC-CXR, show that our proposed approach outperforms previous models with respect to both language generation metrics and clinical evaluations. Particularly, this is the first work reporting the generation results on MIMIC-CXR to the best of our knowledge. Further analyses also demonstrate that our approach is able to generate long reports with necessary medical terms as well as meaningful image-text attention mappings.