Registered Member
|
Hi,
I wonder why the following method does not rely on vectorized kernel like __MM_TRANSPOSE4_PS Eigen::Matrix<float,4,4> mat; ... mat.transposeInPlace(); I did handcode the method using intrinsics but it is a bit frustrating since Eigen allowed me to clean away all the other assembly hacks from my code Thank you for your help, Laurent |
Moderator
|
Sure, we should definitely support that. Internally, we even have ptranspose(...) intrinsics for all architectures and supported packet types... So that should be very easy!
|
Registered Member
|
Cool ! So I Just have to wait now
|
Moderator
|
Here you go:
https://bitbucket.org/eigen/eigen/commits/32a8021225d5/ Changeset: 32a8021225d5 User: ggael Date: 2015-01-26 16:09:01+00:00 Summary: Enable vectorization of transposeInPlace for PacketSize x PacketSize matrices As an exemple, the generated code for:
is now:
|
Registered Member
|
Wow... that is reactive !
Thank you very very much. Laurent |
Registered users: Bing [Bot], Google [Bot], Yahoo [Bot]