Registered Member
|
Hi,
I've got two versions of code that essentially do the power iterations for SVD of 5000x120,000 matrices, one that first computes the 5000x5000 covariance and then uses that and another that multiplies alternately by X and X^T and uses less memory but is about 6x slower even taking into account computation of the covariance. Both running with 16 openmp threads with NDEBUG. Am I right in understanding that matrix-matrix products are multi-threaded but matrix-vector products are not? Since a vector is pretty much a matrix, is there way to "hack" it so that matrix-vector products are run by the matrix-matrix code? Thanks. |
Moderator
|
Yes, for instance:
however don't expect any gain from this hack as multi-threading is not very efficient on matrix-vector problems (memory bound). The overall performance might even be slower. You might also try:
Please let us know whether you get any speedup from this. Also, make sure that you have 16 hardware cores, not 8 multi-threaded ones. In the later case, use 8 threads instead of 16. |
Registered users: Bing [Bot], claydoh, Google [Bot], rblackwell, Yahoo [Bot]