论文标题

与PENSA的生物分子构象合奏的系统分析

Systematic Analysis of Biomolecular Conformational Ensembles with PENSA

论文作者

Vögele, Martin, Thomson, Neil J., Truong, Sang T., McAvity, Jasper, Zachariae, Ulrich, Dror, Ron O.

论文摘要

原子水平的模拟被广泛用于研究生物分子及其动力学。此类研究中的一个共同目标是比较在几种条件下(例如,与各种突变或结合配体)对分子系统的模拟,以确定在这些条件下采用的分子构象之间的差异。但是,模拟对越来越大,更复杂的系统产生的大量数据通常使很难识别与特定生化现象相关的结构特征。我们提出了一个名为PENSA的灵活软件包,该软件包可以对生物分子构象合奏进行全面而彻底的研究。它提供了特征和特征转化,可完全表示蛋白质和核酸(包括水和离子结合位点)等生物分子,从而避免了手动特征选择会带来的偏见。 Pensa实现了系统地比较分子特征在合奏中的分布的方法,以找到它们之间的显着差异并识别感兴趣的区域。它还包括一种新的方法,可以量化生物分子两个区域之间的特定状态信息,例如,允许跟踪信息流以识别变构途径。 Pensa还提供了方便的工具,用于加载数据和可视化结果,从而快速处理并易于解释。 Pensa是一个开源Python库,可在https://github.com/drorlab/pensa上维护,以及一个示例工作流和教程。我们通过展示如何有效地确定分子机制的有效性来证明其在现实世界中的有用性。

Atomic-level simulations are widely used to study biomolecules and their dynamics. A common goal in such studies is to compare simulations of a molecular system under several conditions -- for example, with various mutations or bound ligands -- in order to identify differences between the molecular conformations adopted under these conditions. However, the large amount of data produced by simulations of ever larger and more complex systems often renders it difficult to identify the structural features that are relevant for a particular biochemical phenomenon. We present a flexible software package named PENSA that enables a comprehensive and thorough investigation into biomolecular conformational ensembles. It provides featurizations and feature transformations that allow for a complete representation of biomolecules like proteins and nucleic acids, including water and ion binding sites, thus avoiding bias that would come with manual feature selection. PENSA implements methods to systematically compare the distributions of molecular features across ensembles to find the significant differences between them and identify regions of interest. It also includes a novel approach to quantify the state-specific information between two regions of a biomolecule, which allows, e.g., tracing information flow to identify allosteric pathways. PENSA also comes with convenient tools for loading data and visualizing results, making them quick to process and easy to interpret. PENSA is an open-source Python library maintained at https://github.com/drorlab/pensa along with an example workflow and a tutorial. We demonstrate its usefulness in real-world examples by showing how it helps to determine molecular mechanisms efficiently.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源