Logicsolver：通过逻辑提示增强学习解决可解释的数学单词问题解决问题

论文标题

Logicsolver：通过逻辑提示增强学习解决可解释的数学单词问题解决问题

LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced Learning

论文作者

Yang, Zhicheng, Qin, Jinghui, Chen, Jiaqi, Lin, Liang, Liang, Xiaodan

论文摘要

最近，深度学习模型在MWP解决答案准确性方面取得了长足的进步。但是，它们是无法解释的，因为他们主要依靠浅启发式方法来实现高性能，而无需理解和推理扎根的数学逻辑。为了解决这个问题并迈出了可解释的MWP解决方案的一步，我们首先构建了一个名为InterMWP的高质量MWP数据集，该数据集由11,495 MWP组成，并注释了基于代数知识的可解释的逻辑公式，作为每个解决方案方程的接地语言逻辑。与现有的MWP数据集不同，我们的InterMWP基准要求求解器不仅输出解决方案表达式，还可以预测相应的逻辑公式。我们进一步提出了一种新颖的方法，并以逻辑提示和解释产生（称为Logicsolver）提出。对于每个MWP，我们的Logicsolver首先检索一些高度相关的代数知识，然后将它们传递到主干模型，以提示提示MWPS的语义表示。借助这些改进的语义表示，我们的逻辑上可以同时根据生成的解决方案表达式生成相应的解决方案表达式和可解释的知识公式。实验结果表明，我们的Logicsolver比基线具有更强的基于逻辑公式的解释性，同时同时借助逻辑提示来实现更高的答案准确性。源代码和数据集可从https://github.com/yangzhch6/intermwp获得。

Recently, deep learning models have made great progress in MWP solving on answer accuracy. However, they are uninterpretable since they mainly rely on shallow heuristics to achieve high performance without understanding and reasoning the grounded math logic. To address this issue and make a step towards interpretable MWP solving, we first construct a high-quality MWP dataset named InterMWP which consists of 11,495 MWPs and annotates interpretable logical formulas based on algebraic knowledge as the grounded linguistic logic of each solution equation. Different from existing MWP datasets, our InterMWP benchmark asks for a solver to not only output the solution expressions but also predict the corresponding logical formulas. We further propose a novel approach with logical prompt and interpretation generation, called LogicSolver. For each MWP, our LogicSolver first retrieves some highly-correlated algebraic knowledge and then passes them to the backbone model as prompts to improve the semantic representations of MWPs. With these improved semantic representations, our LogicSolver generates corresponding solution expressions and interpretable knowledge formulas in accord with the generated solution expressions, simultaneously. Experimental results show that our LogicSolver has stronger logical formula-based interpretability than baselines while achieving higher answer accuracy with the help of logical prompts, simultaneously. The source code and dataset is available at https://github.com/yangzhch6/InterMWP.

下载PDF全文

下载文献需遵守相关版权规定

论文标题