Speed up with multi-core?

Thu Oct 10, 2013 6:12 pm

Hi, I am trying to speed up matrix multiplication with multi-core. I enabled openMP and did some benchmark. I discovered that the default eigen multiplication speeds up well with 4 cores, but with 8 or 12 cores the speed is only marginally better than 4 cores (in rare occasions even worse). There must be some improvement that can be done about the efficiency.

I read some literature and decided to try to implement OpenMP threads myself. There is some double buffering that can be done to reduce the latency of moving data from SDRam to local memory. My question is: 1. does eigen already implement double buffering (if so why the efficiency is not so good) and 2. is it possible to have the control of low level buffering behavior and still call eigen to benefit from it?

BTW I am not an expert on optimization - I just read a few papers just now.

Thanks.

Speed up with multi-core?

Page 1 of 1 (2 posts)

Speed up with multi-core?

Re: Speed up with multi-core?

Bookmarks

Who is online