Registered Member
|
Hi,
I'm writing a compute-intensive application that makes heavy use of the conjugate gradient method with very large (~10^6) sparse matrices (namely a VLSI circuit placer). For its simplicity, and since I use C++, I chose Eigen. However, I would like more performance, and an 8x speedup from parallelism would be particularly useful. AFAIK, there is no support for parallel sparse-matrix * vector products in Eigen at the moment. I don't know the codebase yet, but I would be willing to give some time if I can help with an implementation. Is anyone else interested in this functionality? In the meantime, do you know of other sparse-matrix libraries that use thread parallelism without the pain of using MPI - or is it already in Eigen and I missed it? Thanks. |
Moderator
|
I have local changes somewhere implementing parallel sparse*dense products, but I never pushed them because I never observed a nice speed-up. Probably because of NUMA.
|
Registered Member
|
That's surprising! I would expect an almost linear speedup for large matrices, since only the vector has to be shared and each thread accesses the sparse matrix's storage in a streaming manner. NUMA... are you benchmarking on a cluster?
I'll probably end up trying a naive parallelization with OpenMP and see how it works for me. Thank you! |
Moderator
|
Sparse matrix * vector products are memory-bound, so the performance gain probably depends a lot on the matrix structure. If you want to try something, search for "scaleAndAddTo" in Eigen/src/SparseCore/SparseDenseProduct.h and add an omp directive on the outermost loop.
|