论文标题

最后:蛋白质轨迹的潜在空间辅助自适应采样

LAST: Latent Space Assisted Adaptive Sampling for Protein Trajectories

论文作者

Tian, Hao, Jiang, Xi, Xiao, Sian, La Force, Hunter, Larson, Eric C., Tao, Peng

论文摘要

分子动力学(MD)模拟广泛用于研究蛋白质构象和动力学。但是,传统的模拟被困在难以逃脱的某些局部能量最小值中。因此,大多数计算时间都花在已经访问的区域中进行抽样。这导致了无效的抽样过程,并进一步阻碍了负担得起的模拟时间中蛋白质运动的探索。深度学习的进步为蛋白质采样提供了新的机会。变分自动编码器是一类深度学习模型,可以学习一个可以捕获输入数据的关键特征的低维表示(称为潜在空间)。基于此特征,我们提出了一种新的自适应采样方法,潜在空间辅助自适应采样(最后),以加速蛋白质构象空间的探索。该方法包括(i)(i)变异自动编码器训练的周期,(ii)潜在空间上的种子结构选择以及(iii)通过其他MD模拟进行构象采样。通过对两个蛋白质系统的四个结构进行采样来验证所提出的方法:大肠杆菌腺苷激酶(ADK)的两个亚稳态和生动的两个本地状态(VVD)。在所有四个构象中,都证明种子结构位于构象分布的边界上。此外,与两个系统中的常规MD(CMD)模拟相比,在较短的模拟时间中观察到了巨大的构象变化。在亚稳态ADK模拟中,最后一次探索了两个向两个稳定状态的过渡路径,而CMD被困在一个能量盆地中。在VVD光态仿真中,最后一个比CMD模拟的构象模拟要快三倍。

Molecular dynamics (MD) simulation is widely used to study protein conformations and dynamics. However, conventional simulation suffers from being trapped in some local energy minima that are hard to escape. Thus, most computational time is spent sampling in the already visited regions. This leads to an inefficient sampling process and further hinders the exploration of protein movements in affordable simulation time. The advancement of deep learning provides new opportunities for protein sampling. Variational autoencoders are a class of deep learning models to learn a low-dimensional representation (referred to as the latent space) that can capture the key features of the input data. Based on this characteristic, we proposed a new adaptive sampling method, latent space assisted adaptive sampling for protein trajectories (LAST), to accelerate the exploration of protein conformational space. This method comprises cycles of (i) variational autoencoders training, (ii) seed structure selection on the latent space and (iii) conformational sampling through additional MD simulations. The proposed approach is validated through the sampling of four structures of two protein systems: two metastable states of E. Coli adenosine kinase (ADK) and two native states of Vivid (VVD). In all four conformations, seed structures were shown to lie on the boundary of conformation distributions. Moreover, large conformational changes were observed in a shorter simulation time when compared with conventional MD (cMD) simulations in both systems. In metastable ADK simulations, LAST explored two transition paths toward two stable states while cMD became trapped in an energy basin. In VVD light state simulations, LAST was three times faster than cMD simulation with a similar conformational space.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源