  1. Nov 30, 2016
  2. Sep 22, 2016
  3. Sep 20, 2016
      Add option to use zgemm3m/cgemm3m for complex matrix-matrix products · 747c7935
      Guillaume Sylvand authored
      This routine, available in MKL, computes a complex product in 6n^3 operations
      instead of 8n^3, but it pays off only for "large enough" matrices (threshold
      still to be tested).
      Potentially, we gain 25% in all complex computations.
      It could be worth looking for an equivalent, or implementing one, in CUDA.
      
      !!! Note that the flop counters are not updated                  !!!
      !!! In C/Z precision, most flop counters should be scaled by 0.75 !!!
      
      It is OFF by default.
      It is activated with MORSE_Enable(MORSE_GEMM3M).
      In the timing routines, it is activated with --gemm3m.
  4. Sep 11, 2016
  5. Sep 07, 2016
  6. Apr 13, 2016
  7. Dec 01, 2015
  8. Nov 03, 2015
  9. Oct 03, 2015
  10. Sep 15, 2015
  11. Sep 14, 2015
  12. May 22, 2015
  13. Feb 05, 2015
  14. Jan 30, 2015
  15. Dec 08, 2014
  16. Dec 03, 2014
  17. Nov 21, 2014
  18. Nov 19, 2014
  19. Nov 16, 2014
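
The 6n^3 versus 8n^3 operation counts quoted in commit 747c7935 (Sep 20, 2016) can be illustrated with the textbook "3M" formulation of a complex product, which replaces four real matrix products by three at the price of a few extra matrix additions. The sketch below is an assumption about the general technique, not MKL's or MORSE's actual implementation: the function names `real_gemm`, `complex_gemm_4m` and `complex_gemm_3m` are hypothetical, matrices are stored as separate real and imaginary parts in row-major order, and the inner product is a naive triple loop costing about 2n^3 flops.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Real n x n product c = a * b (row-major); costs about 2n^3 flops. */
static void real_gemm(size_t n, const double *a, const double *b, double *c)
{
    for (size_t i = 0; i < n; i++)
        for (size_t j = 0; j < n; j++) {
            double s = 0.0;
            for (size_t k = 0; k < n; k++)
                s += a[i * n + k] * b[k * n + j];
            c[i * n + j] = s;
        }
}

/* Conventional complex product: 4 real products, about 8n^3 flops.
 * Cr = Ar*Br - Ai*Bi,  Ci = Ar*Bi + Ai*Br.  Returns 0 on success. */
int complex_gemm_4m(size_t n, const double *ar, const double *ai,
                    const double *br, const double *bi,
                    double *cr, double *ci)
{
    assert(n > 0);
    size_t nn = n * n;
    double *t = malloc(2 * nn * sizeof *t);
    if (!t) return -1;
    double *t1 = t, *t2 = t + nn;

    real_gemm(n, ar, br, t1);
    real_gemm(n, ai, bi, t2);
    for (size_t i = 0; i < nn; i++) cr[i] = t1[i] - t2[i];

    real_gemm(n, ar, bi, t1);
    real_gemm(n, ai, br, t2);
    for (size_t i = 0; i < nn; i++) ci[i] = t1[i] + t2[i];

    free(t);
    return 0;
}

/* 3M complex product: 3 real products, about 6n^3 flops plus O(n^2) adds.
 * T1 = Ar*Br, T2 = Ai*Bi, T3 = (Ar+Ai)*(Br+Bi),
 * Cr = T1 - T2,  Ci = T3 - T1 - T2.  Returns 0 on success. */
int complex_gemm_3m(size_t n, const double *ar, const double *ai,
                    const double *br, const double *bi,
                    double *cr, double *ci)
{
    assert(n > 0);
    size_t nn = n * n;
    double *t = malloc(5 * nn * sizeof *t);
    if (!t) return -1;
    double *sa = t, *sb = t + nn;
    double *t1 = t + 2 * nn, *t2 = t + 3 * nn, *t3 = t + 4 * nn;

    for (size_t i = 0; i < nn; i++) {
        sa[i] = ar[i] + ai[i];   /* Ar + Ai */
        sb[i] = br[i] + bi[i];   /* Br + Bi */
    }
    real_gemm(n, ar, br, t1);
    real_gemm(n, ai, bi, t2);
    real_gemm(n, sa, sb, t3);
    for (size_t i = 0; i < nn; i++) {
        cr[i] = t1[i] - t2[i];
        ci[i] = t3[i] - t1[i] - t2[i];
    }

    free(t);
    return 0;
}
```

Three real products instead of four give the 6n^3 / 8n^3 = 0.75 ratio mentioned in the commit message. The extra O(n^2) additions are one plausible reason the approach only helps for large enough matrices, and the imaginary part is formed by subtraction, so its rounding behaviour can differ from the conventional product.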