论文标题
您不知道好奇的机器应该具有的五种特定好奇心
Five Properties of Specific Curiosity You Didn't Know Curious Machines Should Have
论文作者
论文摘要
对机器试剂的好奇心一直是生动研究活动的重点。对人类和动物的好奇心的研究,尤其是特定的好奇心,已经发掘出了几种特性,这些特性将为机器学习者带来重要的好处,但在机器智能中尚未得到充分探索。在这项工作中,我们对动物和机器好奇心领域进行了全面的多学科调查。作为这项工作的主要贡献,我们将该调查作为基础,介绍和定义我们认为是特定好奇心最重要的五个特性:1)对不可验证的指称人的定向,2)满足时停止时,3)自愿曝光,4)瞬变和5)连贯的长期学习。作为这项工作的第二个主要贡献,我们展示了如何在概念验证强化学习者中共同实现这些特性:我们演示了该特性如何在该代理的行为中表现出来在简单的非剧本网格世界中,包括好奇心诱导的位置和诱导的好奇心目标。正如我们希望的那样,我们对计算特定好奇剂的例子表现出短期的定向行为,同时更新长期偏好以适应性地寻找好奇心引起的情况。因此,这项工作提出了具有里程碑意义的综合和转化,将特定好奇心转化为机器学习和强化学习的领域,并提供了一种新颖的看法,以了解特定的好奇心如何运作以及将来可能集成到复杂环境中的目标,决策,决策的计算剂的行为中。
Curiosity for machine agents has been a focus of lively research activity. The study of human and animal curiosity, particularly specific curiosity, has unearthed several properties that would offer important benefits for machine learners, but that have not yet been well-explored in machine intelligence. In this work, we conduct a comprehensive, multidisciplinary survey of the field of animal and machine curiosity. As a principal contribution of this work, we use this survey as a foundation to introduce and define what we consider to be five of the most important properties of specific curiosity: 1) directedness towards inostensible referents, 2) cessation when satisfied, 3) voluntary exposure, 4) transience, and 5) coherent long-term learning. As a second main contribution of this work, we show how these properties may be implemented together in a proof-of-concept reinforcement learning agent: we demonstrate how the properties manifest in the behaviour of this agent in a simple non-episodic grid-world environment that includes curiosity-inducing locations and induced targets of curiosity. As we would hope, our example of a computational specific curiosity agent exhibits short-term directed behaviour while updating long-term preferences to adaptively seek out curiosity-inducing situations. This work, therefore, presents a landmark synthesis and translation of specific curiosity to the domain of machine learning and reinforcement learning and provides a novel view into how specific curiosity operates and in the future might be integrated into the behaviour of goal-seeking, decision-making computational agents in complex environments.