论文标题
基于第一通道概率的社区检测
Community detection based on first passage probabilities
论文作者
论文摘要
社区检测对于理解拓扑字符和复杂网络上的传播动态具有至关重要的意义。尽管随机步行被广泛使用,并且在许多社区检测算法中被证明有效,但仍然存在两个主要缺陷:(i)如果使用所有可能随机步行的平均步骤,则最大的随机步行长度太大,无法区分聚类信息; (ii)如果使用预先分配的最大长度,则错过了所有其他步骤长度的有用的社区信息。在本文中,我们提出了一种基于第一个段落概率(FPPM)的新型社区检测方法,该方法配备了新的相似性度量,该方法将完整的结构信息纳入了最大步长的长度。在此,选择网络的直径作为适应不同网络的随机步行的适当边界。然后,我们使用分层聚类将顶点分组为社区,并通过相应的模块化值进一步选择最佳分裂。最后,后处理策略旨在整合不合理的小社区,从而大大提高了社区部门的准确性。令人惊讶的是,数值模拟表明,与合成基准和现实世界网络上的几种经典算法相比,FPPM表现最好,这揭示了我们方法的普遍性和有效性。
Community detection is of fundamental significance for understanding the topology characters and the spreading dynamics on complex networks. While random walk is widely used and is proven effective in many community detection algorithms, there still exists two major defects: (i) the maximal length of random walk is too large to distinguish the clustering information if using the average step of all possible random walks; (ii) the useful community information at all other step lengths are missed if using a pre-assigned maximal length. In this paper, we propose a novel community detection method based on the first passage probabilities (FPPM), equipped with a new similarity measure that incorporates the complete structural information within the maximal step length. Here the diameter of the network is chosen as an appropriate boundary of random walks which is adaptive to different networks. Then we use the hierarchical clustering to group the vertices into communities and further select the best division through the corresponding modularity values. Finally, a post-processing strategy is designed to integrate the unreasonable small communities, which significantly improves the accuracy of community division. Surprisingly, the numerical simulations show that FPPM performs best compared to several classic algorithms on both synthetic benchmarks and real-world networks, which reveals the universality and effectiveness of our method.