论文标题
部分可观测时空混沌系统的无模型预测
A study on the distribution of social biases in self-supervised learning visual models
论文作者
论文摘要
如果对数据进行充分采样,深层神经网络在学习数据分布方面具有有效的效率。但是,它们可能会被培训数据中隐式纳入的非相关因素强烈偏见。这些包括操作偏见,例如无效或不均匀的数据采样,也包括道德问题,因为即使在培训数据中,社会偏见也隐含地存在\ textemdash,或在不公平的培训时间表中明确定义。在对人类过程产生影响的任务中,学习社会偏见可能会产生歧视性,不道德和不信任的后果。通常假定社会偏见是源于对标记数据的监督学习,因此,自我监督的学习(SSL)错误地看成是一种有效且无偏见的解决方案,因为它不需要标记的数据。但是,最近证明一种流行的SSL方法还融合了偏见。在本文中,我们研究了使用ImageNet数据训练的各种SSL视觉模型的偏见,该模型使用心理专家设计的方法和数据集来衡量社会偏见。我们表明,SSL模型的类型与所包含的偏差数量之间存在相关性。此外,结果还表明,该数字并不严格取决于模型的准确性和在整个网络中的变化。最后,我们得出的结论是,仔细的SSL模型选择过程可以减少部署模型中社交偏见的数量,同时保持高性能。
Deep neural networks are efficient at learning the data distribution if it is sufficiently sampled. However, they can be strongly biased by non-relevant factors implicitly incorporated in the training data. These include operational biases, such as ineffective or uneven data sampling, but also ethical concerns, as the social biases are implicitly present\textemdash even inadvertently, in the training data or explicitly defined in unfair training schedules. In tasks having impact on human processes, the learning of social biases may produce discriminatory, unethical and untrustworthy consequences. It is often assumed that social biases stem from supervised learning on labelled data, and thus, Self-Supervised Learning (SSL) wrongly appears as an efficient and bias-free solution, as it does not require labelled data. However, it was recently proven that a popular SSL method also incorporates biases. In this paper, we study the biases of a varied set of SSL visual models, trained using ImageNet data, using a method and dataset designed by psychological experts to measure social biases. We show that there is a correlation between the type of the SSL model and the number of biases that it incorporates. Furthermore, the results also suggest that this number does not strictly depend on the model's accuracy and changes throughout the network. Finally, we conclude that a careful SSL model selection process can reduce the number of social biases in the deployed model, whilst keeping high performance.