论文标题
关于可行的解释与被遗忘的权利之间的权衡
On the Trade-Off between Actionable Explanations and the Right to be Forgotten
论文作者
论文摘要
随着机器学习(ML)模型越来越多地被部署在高风险应用程序中,决策者提出了更严格的数据保护法规(例如GDPR,CCPA)。一个关键原则是“被遗忘的权利”,它使用户有权删除其数据。另一个关键原则是实现可操作的解释的权利,也称为算法追索权,允许用户逆转不利的决策。迄今为止,尚不清楚这两个原则是否可以同时运行。因此,我们在数据删除请求的背景下介绍和研究追索权无效的问题。更具体地说,我们从理论上和经验上分析流行的最先进算法的行为,并证明如果这些算法产生的回收物可能会无效,如果少数数据删除请求(例如1或2)保证了预测模型的保证更新。为了设置可区分模型,我们建议一个框架,以确定关键训练点的最小子集,当删除时,该框架将无效的回流分数最大化。使用我们的框架,我们从经验上表明,从训练集中删除2个数据实例可以使流行的最先进算法最多无效所有回流的95%。因此,我们的工作提出了关于“被遗忘的权利”的“可行解释权”兼容性的基本问题,同时还提供了有关追索性鲁棒性决定因素的建设性见解。
As machine learning (ML) models are increasingly being deployed in high-stakes applications, policymakers have suggested tighter data protection regulations (e.g., GDPR, CCPA). One key principle is the "right to be forgotten" which gives users the right to have their data deleted. Another key principle is the right to an actionable explanation, also known as algorithmic recourse, allowing users to reverse unfavorable decisions. To date, it is unknown whether these two principles can be operationalized simultaneously. Therefore, we introduce and study the problem of recourse invalidation in the context of data deletion requests. More specifically, we theoretically and empirically analyze the behavior of popular state-of-the-art algorithms and demonstrate that the recourses generated by these algorithms are likely to be invalidated if a small number of data deletion requests (e.g., 1 or 2) warrant updates of the predictive model. For the setting of differentiable models, we suggest a framework to identify a minimal subset of critical training points which, when removed, maximize the fraction of invalidated recourses. Using our framework, we empirically show that the removal of as little as 2 data instances from the training set can invalidate up to 95 percent of all recourses output by popular state-of-the-art algorithms. Thus, our work raises fundamental questions about the compatibility of "the right to an actionable explanation" in the context of the "right to be forgotten", while also providing constructive insights on the determining factors of recourse robustness.