论文标题
ECCV 2020的Fairface挑战:分析面部识别的偏见
FairFace Challenge at ECCV 2020: Analyzing Bias in Face Recognition
论文作者
论文摘要
这项工作总结了2020年查尔恩(Chalearn)看着人们的面对面识别和分析挑战,并提供了最高的解决方案和结果分析的描述。挑战的目的是评估在存在其他混杂属性的情况下,在1:1的任务上,提交算法的性别和肤色的准确性和偏见。使用基于重新注释的IJB-C的野外数据集对参与者进行评估,并以12.5k的新图像和其他标签进一步丰富了参与者。数据集不平衡,该数据集模拟了现实世界的情况,基于AI的模型应该介绍公平的结果对数据进行培训和评估。挑战吸引了151名参与者,他们总共提交了超过1.8K的意见。挑战的最后阶段吸引了36支活跃的团队,其中10支超过了0.999 AUC-ROC,同时在拟议的偏见指标中取得了非常低的分数。参与者的常见策略是面临预处理,数据分布的均质化,偏置意识损失功能和集成模型的使用。对深色肤色的女性以及眼镜和年轻年龄增加假阳性率的潜力,对前十名团队的分析显示出较高的假阳性率(和较低的假负率)。
This work summarizes the 2020 ChaLearn Looking at People Fair Face Recognition and Analysis Challenge and provides a description of the top-winning solutions and analysis of the results. The aim of the challenge was to evaluate accuracy and bias in gender and skin colour of submitted algorithms on the task of 1:1 face verification in the presence of other confounding attributes. Participants were evaluated using an in-the-wild dataset based on reannotated IJB-C, further enriched by 12.5K new images and additional labels. The dataset is not balanced, which simulates a real world scenario where AI-based models supposed to present fair outcomes are trained and evaluated on imbalanced data. The challenge attracted 151 participants, who made more than 1.8K submissions in total. The final phase of the challenge attracted 36 active teams out of which 10 exceeded 0.999 AUC-ROC while achieving very low scores in the proposed bias metrics. Common strategies by the participants were face pre-processing, homogenization of data distributions, the use of bias aware loss functions and ensemble models. The analysis of top-10 teams shows higher false positive rates (and lower false negative rates) for females with dark skin tone as well as the potential of eyeglasses and young age to increase the false positive rates too.