Taking Principles Seriously: A Hybrid Approach to Value Alignment

Paper title

Taking Principles Seriously: A Hybrid Approach to Value Alignment

Authors

Tae Wan Kim, John Hooker, Thomas Donaldson

Abstract

An important step in the development of value alignment (VA) systems in AI is understanding how VA can reflect valid ethical principles. We propose that designers of VA systems incorporate ethics by utilizing a hybrid approach in which both ethical reasoning and empirical observation play a role. This, we argue, avoids committing the "naturalistic fallacy," which is an attempt to derive "ought" from "is," and it provides a more adequate form of ethical reasoning when the fallacy is not committed. Using quantified modal logic, we precisely formulate principles derived from deontological ethics and show how they imply particular "test propositions" for any given action plan in an AI rule base. The action plan is ethical only if the test proposition is empirically true, a judgment that is made on the basis of empirical VA. This permits empirical VA to integrate seamlessly with independently justified ethical principles.
