This forum has been archived. All content is frozen. Please use KDE Discuss instead.

Non-optimal implementation of AutoDiffScalar's tanh?

emilf (Registered Member):
Hi all!

I went through AutoDiffScalar's derivative definitions and found one that seems non-optimal: tanh.
Since tanh(x.value()) is already computed, it would be good to reuse it, as in this tentative improvement:
Code:
EIGEN_AUTODIFF_DECLARE_GLOBAL_UNARY(tanh,
  using std::tanh;
  const Scalar th = tanh(x.value());
  // Reuse tanh(x.value()) for the derivative via d/dx tanh(x) = 1 - tanh(x)^2.
  return Eigen::MakeAutoDiffScalar(th, x.derivatives() * (1 - numext::abs2(th)));)
It uses the identity d/dx tanh(x) = 1 - tanh(x)^2, which removes the extra cosh evaluation and the division.

A minor thing, but it seems nice to have it as polished as the rest. :)
Unless there is something I have missed that made the choice of cosh better.

Thanks for your time!
ggael (Moderator):
The problem is that 1 - numext::abs2(th) suffers from so-called catastrophic cancellation (https://en.wikipedia.org/wiki/Loss_of_significance) once x is not too small. For instance, with x around 9 I observe relative errors of about 10^-8 with double precision (so you lose half the precision).
emilf (Registered Member):
Oh, it was that bad.
I only did a quick test in Matlab, and it seems rewriting it as (1 - th) * (1 + th) performs significantly better than the naive implementation.
However, I was unable to reproduce the relative error you got; would you mind sharing the code you used to measure it reliably? I would like to play a bit with the expression. :)

Thanks Gael!
ggael (Moderator):
You'll get the same issue with the "1 - th" factor.

octave/matlab code:

x=1:0.001:10;
max(abs((1-tanh(x).^2)-(1./cosh(x).^2)) ./ (1./cosh(x).^2))
emilf (Registered Member):
Hi Gael, thank you for the code!

I see you are using relative error rather than absolute error as the criterion.
Quite interesting that the absolute difference between the two representations always stays within epsilon (1.11e-16) for doubles, while the relative error grows as high as 1e-8. But when you divide an epsilon-sized absolute error by a value close to zero, it does indeed blow up.

Thanks for the check!
ggael (Moderator):
Comparing float to double:

Code:
#include <iostream>
#include <Eigen/Dense>
using namespace Eigen;
using namespace std;

int main()
{
  ArrayXf x = ArrayXf::LinSpaced(1000,1,10);
  // Reference: 1/cosh(x)^2 evaluated in double, then rounded back to float.
  ArrayXf ref = (1./cosh(x.cast<double>()).abs2()).cast<float>();
  // Max relative error of the "fast" 1 - tanh^2 form, then of the direct 1/cosh^2.
  std::cout << (abs( (1.f-tanh(x).abs2()) - ref ) / ref).maxCoeff() << std::endl;
  std::cout << (abs( (1.f/cosh(x).abs2()) - ref ) / ref).maxCoeff() << std::endl;
}


This gives me:
Code:
3.99167
2.07985e-07


So the "fast" version is completely off.
emilf (Registered Member):
Indeed, quite atrocious.
Thanks for giving it a test!

