I ran into a performance issue when using dot() (or prod().sum()) in Eigen 3.3.5. After some digging, I found that it is caused by denormal floating-point values.
I wrote a small benchmark:
My machine is a new MacBook Pro, and I build with CMake (3.12.2) in release mode with no additional flags. Built with Clang 10.0.0 and run, the console output is:
Built with GCC 8.2.0 and run, the console output is:
So the combination Eigen 3.3.5 + Clang 10.0.0 + float type + denormal (and inf) values => performance issue. But why?
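In case it helps anyone reproduce this, here is a minimal sketch, separate from the benchmark and not using Eigen, of how the input data could be checked for denormal or non-finite values before timing anything. The helper names `count_subnormals` and `count_non_finite` are hypothetical, made up for illustration; only `std::fpclassify`, `std::isfinite`, and `std::numeric_limits` are standard library calls.

```cpp
#include <cmath>
#include <cstddef>
#include <limits>
#include <vector>

// Hypothetical helper: count entries that are subnormal (denormal) floats.
std::size_t count_subnormals(const std::vector<float>& v) {
    std::size_t n = 0;
    for (float x : v) {
        if (std::fpclassify(x) == FP_SUBNORMAL) {
            ++n;
        }
    }
    return n;
}

// Hypothetical helper: count entries that are inf or NaN.
std::size_t count_non_finite(const std::vector<float>& v) {
    std::size_t n = 0;
    for (float x : v) {
        if (!std::isfinite(x)) {
            ++n;
        }
    }
    return n;
}

// Build a small sample mixing normal, subnormal, zero, and inf values.
std::vector<float> make_sample() {
    const float tiny = std::numeric_limits<float>::denorm_min();  // smallest subnormal
    const float half_min = std::numeric_limits<float>::min() / 2.0f;  // below the smallest normal
    const float inf = std::numeric_limits<float>::infinity();
    return {1.0f, tiny, half_min, 0.0f, inf};
}
```

Running both counters on the vectors passed to dot() would at least confirm whether the slow cases are exactly the ones containing such values. This assumes default floating-point settings; with flush-to-zero enabled or with fast-math style compiler flags, subnormal values may be flushed to zero and the counts would differ.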