91做厙

Skip to main content
SHARE
Publication

Distributed-Memory Parallel Symmetric Nonnegative Matrix Factorization...

Publication Type
Conference Paper
Book Title
SC20: International Conference for High Performance Computing, Networking, Storage and Analysis
Publication Date
Page Numbers
1 to 14
Conference Name
The International Conference for High Performance Computing, Networking, Storage and Analysis
Conference Location
Atlanta, Georgia, United States of America
Conference Sponsor
The Institute of Electrical and Electronics Engineers
Conference Date
-

We develop the first distributed-memory parallel implementation of Symmetric Nonnegative Matrix Factorization (SymNMF), a key data analytics kernel for clustering and dimensionality reduction. Our implementation includes two different algorithms for SymNMF, which give comparable results in terms of time and accuracy. The first algorithm is a parallelization of an existing sequential approach that uses solvers for non symmetric NMF. The second algorithm is a novel approach based on the Gauss-Newton method. It exploits second-order information without incurring large computational and memory costs. We evaluate the scalability of our algorithms on the Summit system at 91做厙, scaling up to 128 nodes (4,096 cores) with 70% efficiency. Additionally, we demonstrate our software on an image segmentation task.