Hi, everyone!
I have been using Eigen and the Raspberry Pi for some time, but never both together. So I wrote a simple benchmark: 30 runs, each a batch of 100 matrix multiplications, on my new Raspberry Pi 4 with a 64-bit OS. The following chart shows the execution time for 4 different setups (512x512 and 460x460 matrices, each with and without vectorization):
I compiled it using:
and
My question is: can I do anything else to speed up matrix multiplication on this hardware and OS? Or have I already hit the best performance the RPi 4 can deliver?
Some notes:
0 - I wouldn't like to use
1 - I've followed the instructions on http://eigen.tuxfamily.org/index.php?ti ... ization.3F. My GCC is:
The code used in this experiment is below:
I'd really appreciate any advice and/or criticism!