论文标题
使用自我监督的相互对比学习的异质图神经网络
Heterogeneous Graph Neural Networks using Self-supervised Reciprocally Contrastive Learning
论文作者
论文摘要
异质图神经网络(HGNN)是一种非常流行的技术,用于建模和分析异质图。大多数现有的基于HGNN的方法都是监督或半监督的学习方法,需要注释图,这是昂贵且耗时的。已经提出了自我监督的对比度学习,以通过挖掘隐藏在给定数据中的固有信息来解决需要带注释数据的问题。但是,现有的对比学习方法对于异质图而言不足,因为它们仅基于数据扰动或预定义的结构属性(例如元数据)构建对比度视图,而忽略了在节点属性和图形拓扑中可能存在的声音。我们首次开发了一种新颖而强大的异质图对比度学习方法,即HGCL,它引入了有关节点属性和图形拓扑的各个指导的两种观点,并通过相互偏差的机制集成和增强它们,以更好地模拟异质图。在这种新方法中,我们在这两种观点中采用了独特但最合适的属性和拓扑融合机制,这有利于分别在属性和拓扑中挖掘相关信息。我们进一步使用属性相似性和拓扑相关性来构建高质量的对比样本。在三个大型现实世界的异质图上进行了广泛的实验,证明了HGCL优于最先进的方法。
Heterogeneous graph neural network (HGNN) is a very popular technique for the modeling and analysis of heterogeneous graphs. Most existing HGNN-based approaches are supervised or semi-supervised learning methods requiring graphs to be annotated, which is costly and time-consuming. Self-supervised contrastive learning has been proposed to address the problem of requiring annotated data by mining intrinsic information hidden within the given data. However, the existing contrastive learning methods are inadequate for heterogeneous graphs because they construct contrastive views only based on data perturbation or pre-defined structural properties (e.g., meta-path) in graph data while ignore the noises that may exist in both node attributes and graph topologies. We develop for the first time a novel and robust heterogeneous graph contrastive learning approach, namely HGCL, which introduces two views on respective guidance of node attributes and graph topologies and integrates and enhances them by reciprocally contrastive mechanism to better model heterogeneous graphs. In this new approach, we adopt distinct but most suitable attribute and topology fusion mechanisms in the two views, which are conducive to mining relevant information in attributes and topologies separately. We further use both attribute similarity and topological correlation to construct high-quality contrastive samples. Extensive experiments on three large real-world heterogeneous graphs demonstrate the superiority and robustness of HGCL over state-of-the-art methods.