论文标题
差异:分子对接的扩散步骤,曲折和转弯
DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking
论文作者
论文摘要
预测小分子配体与蛋白质的结合结构(一种称为分子对接的任务)对于药物设计至关重要。与传统的基于搜索的方法相比,最近的深度学习方法将扩展坞视为回归问题,但尚未提供准确性的实质性提高。相反,我们将分子对接作为生成建模问题并发展为Diffdock,这是对配体姿势的非欧国人歧管的扩散生成模型。为此,我们将此歧管映射到涉及对接的自由度(翻译,旋转和扭转)的产品空间,并在此空间上开发了有效的扩散过程。从经验上讲,Diffdock在PDBBIND上获得了38%的TOP-1成功率(RMSD <2a),显着超过了先前最新的传统对接(23%)和深度学习方法(20%)方法。此外,虽然以前的方法无法在计算折叠结构(最大准确性10.4%)上停靠,但Diffdock的精度显着较高(21.7%)。最后,Diffdock具有快速的推理时间,并以高选择性准确性提供了置信度估计。
Predicting the binding structure of a small molecule ligand to a protein -- a task known as molecular docking -- is critical to drug design. Recent deep learning methods that treat docking as a regression problem have decreased runtime compared to traditional search-based methods but have yet to offer substantial improvements in accuracy. We instead frame molecular docking as a generative modeling problem and develop DiffDock, a diffusion generative model over the non-Euclidean manifold of ligand poses. To do so, we map this manifold to the product space of the degrees of freedom (translational, rotational, and torsional) involved in docking and develop an efficient diffusion process on this space. Empirically, DiffDock obtains a 38% top-1 success rate (RMSD<2A) on PDBBind, significantly outperforming the previous state-of-the-art of traditional docking (23%) and deep learning (20%) methods. Moreover, while previous methods are not able to dock on computationally folded structures (maximum accuracy 10.4%), DiffDock maintains significantly higher precision (21.7%). Finally, DiffDock has fast inference times and provides confidence estimates with high selective accuracy.