论文标题

ULSA:代表综合协议的统一语言

ULSA: Unified Language of Synthesis Actions for Representation of Synthesis Protocols

论文作者

Wang, Zheren, Cruse, Kevin, Fei, Yuxing, Chia, Ann, Zeng, Yan, Huo, Haoyan, He, Tanjin, Deng, Bowen, Kononova, Olga, Ceder, Gerbrand

论文摘要

应用AI功率来预测新型材料的合成需要高质量的大规模数据集。从科学出版物中提取综合信息仍然具有挑战性,尤其是在提取合成作用方面,因为缺乏使用坚实,健壮且成熟的本体论来描述合成程序的全面标记数据集。在这项工作中,我们提出了第一种统一的合成作用语言(ULSA)来描述陶瓷合成程序。我们根据拟议的ULSA计划创建了一个由域专家注释的3,040个合成程序的数据集。为了证明ULSA的功能,我们构建了一个基于神经网络的模型,以将任意陶瓷的合成段落映射到ULSA中,并将其用于构建合成流程图以进行合成程序。对流程图的分析表明,(a)ULSA涵盖了描述合成程序时研究人员使用的基本词汇,并且(b)可以捕获合成方案的重要特征。这项工作是建立综合本体论和自主机器人合成的坚实基础的重要一步。

Applying AI power to predict syntheses of novel materials requires high-quality, large-scale datasets. Extraction of synthesis information from scientific publications is still challenging, especially for extracting synthesis actions, because of the lack of a comprehensive labeled dataset using a solid, robust, and well-established ontology for describing synthesis procedures. In this work, we propose the first Unified Language of Synthesis Actions (ULSA) for describing ceramics synthesis procedures. We created a dataset of 3,040 synthesis procedures annotated by domain experts according to the proposed ULSA scheme. To demonstrate the capabilities of ULSA, we built a neural network-based model to map arbitrary ceramics synthesis paragraphs into ULSA and used it to construct synthesis flowcharts for synthesis procedures. Analysis for the flowcharts showed that (a) ULSA covers essential vocabulary used by researchers when describing synthesis procedures and (b) it can capture important features of synthesis protocols. This work is an important step towards creating a synthesis ontology and a solid foundation for autonomous robotic synthesis.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源