论文标题
即时教授无人机:情感反馈可以作为培训人造代理的学习信号吗?
Teaching Drones on the Fly: Can Emotional Feedback Serve as Learning Signal for Training Artificial Agents?
论文作者
论文摘要
我们研究是否可以直接利用自然主义的情感人类反馈作为奖励信号,以通过交互式人工增强学习来训练人造药物。为了回答这个问题,我们设计了一个受动物训练启发的实验环境,其中人类测试受试者通过对无人机的行动选择提供情感反馈来互动地教模仿的无人机代理他们所需的命令映射。我们提出了第一项经验证明研究和分析,证实可以将人的面部情绪表达直接作为互动学习环境中的奖励信号而直接利用。因此,我们为更自然和直观的增强学习形式贡献了经验发现,专门为非专家使用者设计。
We investigate whether naturalistic emotional human feedback can be directly exploited as a reward signal for training artificial agents via interactive human-in-the-loop reinforcement learning. To answer this question, we devise an experimental setting inspired by animal training, in which human test subjects interactively teach an emulated drone agent their desired command-action-mapping by providing emotional feedback on the drone's action selections. We present a first empirical proof-of-concept study and analysis confirming that human facial emotion expression can be directly exploited as reward signal in such interactive learning settings. Thereby, we contribute empirical findings towards more naturalistic and intuitive forms of reinforcement learning especially designed for non-expert users.