论文标题
洗衣仪:反洗钱的自我监督图表学习
LaundroGraph: Self-Supervised Graph Representation Learning for Anti-Money Laundering
论文作者
论文摘要
反洗钱(AML)法规要求金融机构根据一系列规则部署AML系统,这些规则在触发时,构成了可疑警报的基础,由人类分析师评估。审查这些案例是一项繁琐而复杂的任务,需要分析师浏览大型财务互动网络以验证可疑运动。此外,这些系统具有很高的假阳性率(估计超过95 \%)。标签的稀缺性阻碍了基于监督学习的替代系统的使用,从而降低了其在现实世界应用中的适用性。 在这项工作中,我们介绍了自助扫描,这是一种新颖的自我监督图表表示方法,将银行客户和财务交易编码为有意义的表示形式。这些表示形式用于提供见解以协助AML审核过程,例如确定给定客户的异常运动。洗衣仪代表了作为客户交易两分图的基础金融互动网络,并在完全自我监督的链接预测任务上训练图形神经网络。我们从经验上证明,我们的方法在自我监督的链接预测上使用现实世界数据集优于其他强大的基线,将最佳的非毛具基线提高了$ 12 $ P.P. AUC。目的是通过在审查后向分析师提供这些AI驱动的见解来提高审查过程的效率。据我们所知,这是在AML检测背景下的第一个完全自我监督的系统。
Anti-money laundering (AML) regulations mandate financial institutions to deploy AML systems based on a set of rules that, when triggered, form the basis of a suspicious alert to be assessed by human analysts. Reviewing these cases is a cumbersome and complex task that requires analysts to navigate a large network of financial interactions to validate suspicious movements. Furthermore, these systems have very high false positive rates (estimated to be over 95\%). The scarcity of labels hinders the use of alternative systems based on supervised learning, reducing their applicability in real-world applications. In this work we present LaundroGraph, a novel self-supervised graph representation learning approach to encode banking customers and financial transactions into meaningful representations. These representations are used to provide insights to assist the AML reviewing process, such as identifying anomalous movements for a given customer. LaundroGraph represents the underlying network of financial interactions as a customer-transaction bipartite graph and trains a graph neural network on a fully self-supervised link prediction task. We empirically demonstrate that our approach outperforms other strong baselines on self-supervised link prediction using a real-world dataset, improving the best non-graph baseline by $12$ p.p. of AUC. The goal is to increase the efficiency of the reviewing process by supplying these AI-powered insights to the analysts upon review. To the best of our knowledge, this is the first fully self-supervised system within the context of AML detection.