论文标题
通过在图像上揭示卷积痕迹来对抗深层
Fighting Deepfake by Exposing the Convolutional Traces on Images
论文作者
论文摘要
人工智能和图像处理的进步正在改变人们与数字图像和视频互动的方式。诸如FaceApp之类的广泛移动应用程序利用最先进的生成对抗网络(GAN)在人脸照片上产生极端的转换,例如性别交换,衰老等。结果是完全现实的,即使对非经验的用户也非常容易被利用。这种媒体对象以Deepfake的名字命名,并在多媒体取证领域提出了新的挑战:DeepFake检测挑战。的确,即使对于人眼,将深击与真实图像区分也可能是一项艰巨的任务,但是最近的作品正在尝试应用用于生成图像的相同技术,以通过初步良好的良好结果区分它们,但有许多局限性:采用的卷积神经网络不是那么强大,并不是那么强大,证明是特定于上下文并倾向于从图像中提取语义学。在本文中,提出了一种新方法,目的是从图像中提取深膜指纹。该方法基于经过训练的预期最大化算法,以检测和提取代表GAN在图像生成过程中留下的卷积痕迹(CT)的指纹。 CT证明,在DeepFake检测任务中,与最先进的歧视能力相比,具有很高的歧视能力,也证明对不同的攻击也是可靠的。考虑到来自10种不同的gan体系结构的深击不仅涉及面部图像,因此达到了超过98%的总体分类准确性,CT证明是可靠的,并且对图像语义没有任何依赖。最后,对faceApp产生的深击进行的测试在伪造的检测任务中达到了93%的精度,证明了所提出的技术在实际情况下的有效性。
Advances in Artificial Intelligence and Image Processing are changing the way people interacts with digital images and video. Widespread mobile apps like FACEAPP make use of the most advanced Generative Adversarial Networks (GAN) to produce extreme transformations on human face photos such gender swap, aging, etc. The results are utterly realistic and extremely easy to be exploited even for non-experienced users. This kind of media object took the name of Deepfake and raised a new challenge in the multimedia forensics field: the Deepfake detection challenge. Indeed, discriminating a Deepfake from a real image could be a difficult task even for human eyes but recent works are trying to apply the same technology used for generating images for discriminating them with preliminary good results but with many limitations: employed Convolutional Neural Networks are not so robust, demonstrate to be specific to the context and tend to extract semantics from images. In this paper, a new approach aimed to extract a Deepfake fingerprint from images is proposed. The method is based on the Expectation-Maximization algorithm trained to detect and extract a fingerprint that represents the Convolutional Traces (CT) left by GANs during image generation. The CT demonstrates to have high discriminative power achieving better results than state-of-the-art in the Deepfake detection task also proving to be robust to different attacks. Achieving an overall classification accuracy of over 98%, considering Deepfakes from 10 different GAN architectures not only involved in images of faces, the CT demonstrates to be reliable and without any dependence on image semantic. Finally, tests carried out on Deepfakes generated by FACEAPP achieving 93% of accuracy in the fake detection task, demonstrated the effectiveness of the proposed technique on a real-case scenario.