论文标题

用于自动化基于合规的消除和扩展的管道(PACE2):用于高通量生物分子材料仿真工作流的系统框架

Pipeline for Automating Compliance-based Elimination and Extension (PACE2): A Systematic Framework for High-throughput Biomolecular Material Simulation Workflows

论文作者

Mushnoori, Srinivas C., Zang, Ethan, Banerjee, Akash, Hooten, Mason, Merzky, Andre, Turilli, Matteo, Jha, Shantenu, Dutt, Meenakshi

论文摘要

可以使用集合分子动力学有效探测生物分子材料通过动力学界面过程(例如自组装和融合)的形成。但是,在研究较大的组成相空间时,这种方法需要大量模拟。此外,很难预测每个模拟是否产生具有所需特性或结果的生物分子材料,以及每个模拟的运行时间。这些困难可以通过基于规则的管理系统来克服,这些系统包括间歇性检查,可变采样,过早终止和单个分子动力学模拟的扩展。这种管理系统的自动化可以大大减少管理大型分子动力学模拟的开销。为此,提出了一个基于工作流程的高通量计算框架,用于自动化基于合规性的消除和扩展的管道(PACE2),提出了用于生物分子材料模拟的。 PACE2框架包括模拟 - 分析管道。每个管道都包括时间分离的仿真和分析任务。当分子动力学仿真完成时,触发分析任务,该任务评估了分子动力学轨迹的依从性。兼容的分子动力学模拟将扩展到下一个分子动力学阶段,并具有合适的采样速率,以允许进行其他详细的分析。取消了不合规的分子动力学模拟,其计算资源要么重新分配或释放。该框架旨在在本地台式计算机和高性能计算资源上运行。将来,该框架将被扩展到解决广义的工作流程并调查其他类别的材料。

The formation of biomolecular materials via dynamical interfacial processes such as self-assembly and fusion, for diverse compositions and external conditions, can be efficiently probed using ensemble Molecular Dynamics. However, this approach requires a large number of simulations when investigating a large composition phase space. In addition, there is difficulty in predicting whether each simulation is yielding biomolecular materials with the desired properties or outcomes and how long each simulation will run for. These difficulties can be overcome by rules-based management systems which include intermittent inspection, variable sampling, premature termination and extension of the individual Molecular Dynamics simulations. The automation of such a management system can significantly reduce the overhead of managing large ensembles of Molecular Dynamics simulations. To this end, a high-throughput workflows-based computational framework, Pipeline for Automating Compliance-based Elimination and Extension (PACE2), for biomolecular materials simulations is proposed. The PACE2 framework encompasses Simulation-Analysis Pipelines. Each Pipeline includes temporally separated simulation and analysis tasks. When a Molecular Dynamics simulation completes, an analysis task is triggered which evaluates the Molecular Dynamics trajectory for compliance. Compliant Molecular Dynamics simulations are extended to the next Molecular Dynamics phase with a suitable sample rate to allow additional, detailed analysis. Non-compliant Molecular Dynamics simulations are eliminated, and their computational resources are either reallocated or released. The framework is designed to run on local desktop computers and high performance computing resources. In the future, the framework will be extended to address generalized workflows and investigate other classes of materials.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源