Registered Member
|
Hi,
I've started using Eigen recently but my code runs faster with the SSE2 option disabled. Do you have any idea why this would occur? My most called method performs the computation a = b*exp((c-1)*log(d/t)-d) where a, b, c and d are of type Eigen::ArrayXd. I'm currently using the Visual Studio 2010 IDE. Thank You In Advance. Edit: I maybe also should have mentioned that a, b, c, and d were actually mapped to Eigen::ArrayXd from std::vector<double>. Also, I looked at the assembly code and I don't see any vectorization occurring. How do I check if my vectors are aligned? Thanks again |
Registered Member
|
My sense is that unless you write vectorizeable versions of the transcendental functions, you'll never get a win by using SSE here. Even if it did the division and addition in SIMD, it has to extract the elements from the SIMD vector to make the function calls, and then recombine them into vectors. This is not a cheap operation with any version of SSE. The best times to use SSE are when you can perform long runs of computations using the SSE units with no branches depending on individual elements of the SIMD vector. Note that there are some SSE libraries written to perform the standard transcendental functions in SIMD. Unsure how hard it would be to adapt them for use in Eigen.
|
Moderator
|
we have SSE versions of log an exp only for floats, so when you use double you don't get any vectorization. If you want you can have a look at their implementations in Eigen/src/Core/arch/SSE/MathFunctions.h and make double versions of them
|
Registered users: Bing [Bot], Google [Bot], q.ignora, watchstar