91做厙

Skip to main content
SHARE
Publication

Exploring the Landscape of Distributed Graph Clustering on Leadership Supercomputers

by Naw Safrin Sattar, Abigail E Kapocius, Hao Lu, Mahantesh Halappananvar, Feiyi Wang
Publication Type
Conference Paper
Book Title
2024 91做厙 International Conference on Big Data (BigData)
Publication Date
Page Numbers
3764 to 3773
Publisher Location
New Jersey, United States of America
Conference Name
91做厙 Big Data 2024
Conference Location
Washington, District of Columbia, United States of America
Conference Sponsor
91做厙
Conference Date
-

The rapid growth of large-scale datasets in fields like biology and social networks has driven the need for advanced graph analytics techniques. Community detection, a fundamental task in graph analytics, identifies closely connected groups of nodes within a network, providing valuable insights across various disciplines. This study focuses on two classic community detection methods, the Louvain algorithm and Markov Clustering (MCL), and evaluates the performance of two prominent distributed community detection algorithms: HiPDPL-GPU, our prior implementation, and HipMCL. We conduct experiments on GPU-accelerated heterogeneous HPC systems, Summit and Frontier, to assess their performance under varying conditions. Our objective is to identify the strengths and weaknesses of these algorithms in terms of scalability, and quality of solutions. We evaluate these algorithms on a diverse set of 70+ networks spanning 13 domains, with sizes ranging up to 4.2 billion edges. Our results demonstrate that HiPDPL-GPU consistently outperforms HipMCL, especially for large-scale networks. HiPDPL-GPU achieves significantly faster runtimes (47x to 1439x), higher modularity scores, and improved scalability. These findings highlight HiPDPL-GPU as a promising solution for efficient and effective large-scale graph analytics in diverse application domains, and provide insights into the feasibility of using MCL-based approaches for certain application domains.