通过挖掘蒙面自动编码器的潜在空间来掩盖烟雾fau识别

论文标题

通过挖掘蒙面自动编码器的潜在空间来掩盖烟雾fau识别

Occlusion-Robust FAU Recognition by Mining Latent Space of Masked Autoencoders

论文作者

Jiang, Minyang, Wang, Yongwei, McKeown, Martin J., Wang, Z. Jane

论文摘要

面部作用单元（FAU）对于细粒的面部表达分析至关重要。尽管已经使用理想的高质量图像对FAU检测进行了积极研究，但在严重遮挡的条件下未对其进行彻底研究。在本文中，我们提出了第一种闭塞烟FAU识别方法，以在重闭障碍下保持FAU检测性能。我们的新方法利用了蒙面自动编码器（MAE）潜在空间的丰富信息，并将其转换为FAU功能。绕过遮挡重建步骤，我们的模型通过挖掘经过预告片的蒙版自动编码器的潜在空间有效地提取了闭塞面的FAU特征。节点和边缘级知识蒸馏都被用于指导我们的模型以找到潜在空间向量和FAU功能之间的映射。彻底研究了面部遮挡条件，包括随机的小斑块和大块。 BP4D和DISFA数据集的实验结果表明，我们的方法可以在研究的面部遮挡下实现最先进的性能，从而大大优于现有的基线方法。特别是，即使在重大阻塞下，提出的方法也可以在正常条件下达到可比的性能与最先进的方法。

Facial action units (FAUs) are critical for fine-grained facial expression analysis. Although FAU detection has been actively studied using ideally high quality images, it was not thoroughly studied under heavily occluded conditions. In this paper, we propose the first occlusion-robust FAU recognition method to maintain FAU detection performance under heavy occlusions. Our novel approach takes advantage of rich information from the latent space of masked autoencoder (MAE) and transforms it into FAU features. Bypassing the occlusion reconstruction step, our model efficiently extracts FAU features of occluded faces by mining the latent space of a pretrained masked autoencoder. Both node and edge-level knowledge distillation are also employed to guide our model to find a mapping between latent space vectors and FAU features. Facial occlusion conditions, including random small patches and large blocks, are thoroughly studied. Experimental results on BP4D and DISFA datasets show that our method can achieve state-of-the-art performances under the studied facial occlusion, significantly outperforming existing baseline methods. In particular, even under heavy occlusion, the proposed method can achieve comparable performance as state-of-the-art methods under normal conditions.

下载PDF全文

下载文献需遵守相关版权规定

论文标题