  1. Feb 06, 2018
  2. Feb 05, 2018
  3. Jan 31, 2018
  4. Jan 30, 2018
  5. Jan 24, 2018
  6. Oct 18, 2017
  7. Oct 04, 2017
  8. Sep 13, 2017
  9. Jun 16, 2017
  10. Jun 13, 2017
  11. Jun 12, 2017
  12. Apr 05, 2017
  13. Jan 03, 2017
  14. Dec 24, 2016
  15. Dec 21, 2016
  16. Dec 16, 2016
  17. Dec 15, 2016
  18. Dec 09, 2016
  19. Dec 04, 2016
  20. Dec 01, 2016
      Re-work the cmake for simulation mode: · 7255c7c4
      PRUVOST Florent authored
          - use the (starpu_cpu_func_t) 1 trick, same as for cuda_func
          - CPU functions are no longer defined, which avoids the dependency on coreblas
          - add #if !defined(CHAMELEON_SIMULATION) where it is needed
          - remove the dependency on the coreblas library (now useless)
          - remove the useless simucblas and simulapacke libraries
          - remove the CHAMELEON_SIMULATION_MAGMA cmake variable and definition
            - keep using CHAMELEON_USE_CUDA and CHAMELEON_USE_MAGMA to consider CUDA kernels
            - this avoids introducing useless new variables
          - work on messages
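
The function-pointer trick listed above can be sketched as follows. This is a minimal illustration under stated assumptions, not the actual Chameleon code: `cpu_func_t` and `struct codelet` are stand-ins for StarPU's `starpu_cpu_func_t` and `struct starpu_codelet`, and `cl_dgemm_cpu_func`, `CL_DGEMM_CPU_FUNC`, and `codelet_can_run_on_cpu` are hypothetical names. The idea, presumably, is that in simulation mode the kernel body is never executed, so a non-NULL dummy pointer is enough to declare that a CPU implementation exists, and the real kernel (with its link-time dependency) need not be compiled at all.

```c
#include <stddef.h>

/* Stand-in for starpu_cpu_func_t (assumption: same shape, for illustration). */
typedef void (*cpu_func_t)(void *buffers[], void *cl_arg);

/* Stand-in for a codelet; the real struct starpu_codelet has many more fields. */
struct codelet {
    cpu_func_t  cpu_func;  /* NULL means "no CPU implementation" */
    const char *name;
};

#if !defined(CHAMELEON_SIMULATION)
/* Real kernel: compiled only outside simulation mode, so the numerical
 * library it would call is not needed when simulating. */
void cl_dgemm_cpu_func(void *buffers[], void *cl_arg)
{
    (void)buffers;
    (void)cl_arg;  /* a real kernel would call into the BLAS here */
}
#define CL_DGEMM_CPU_FUNC cl_dgemm_cpu_func
#else
/* Simulation mode: a non-NULL dummy pointer marks the codelet as runnable
 * on a CPU worker. It must never be dereferenced or called. */
#define CL_DGEMM_CPU_FUNC ((cpu_func_t) 1)
#endif

struct codelet cl_dgemm = { CL_DGEMM_CPU_FUNC, "dgemm" };

/* Returns 1 if the codelet advertises a CPU implementation, 0 otherwise. */
int codelet_can_run_on_cpu(const struct codelet *cl)
{
    return cl->cpu_func != NULL;
}
```

Compiling with `-DCHAMELEON_SIMULATION` drops the kernel definition entirely while `codelet_can_run_on_cpu(&cl_dgemm)` still reports a CPU implementation, which is the property the commit relies on to remove the coreblas dependency.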
  21. Nov 30, 2016
  22. Sep 22, 2016
  23. Sep 20, 2016
      Add possibility to use z/cgemm3m for complex mat-mat products · 747c7935
      Guillaume Sylvand authored
      This routine, available in MKL, computes the product in 6n^3 operations instead of 8n^3,
      but it pays off only for "large enough" matrices (to be tested...).
      Potentially, we gain up to 25% in all complex computations.
      It could be interesting to look for it or implement it in CUDA.
      
      !!! Note that the flop counters are not updated           !!!
      !!! In C/Z precision, most flop counters should be x0.75  !!!
      
      It is OFF by default.
      It is activated with MORSE_Enable(MORSE_GEMM3M).
      In the timing routines, it is activated with --gemm3m.
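
The 6n^3 versus 8n^3 count comes from replacing the four real matrix products of a complex product with three. The sketch below illustrates that "3M" idea on split real and imaginary parts; it is not MKL's implementation, and `real_gemm` and `gemm3m_sketch` are hypothetical names using a naive row-major product. With A = Ar + i Ai and B = Br + i Bi, it forms T1 = Ar Br, T2 = Ai Bi, T3 = (Ar + Ai)(Br + Bi), then Cr = T1 - T2 and Ci = T3 - T1 - T2. Each real product costs 2n^3 flops, so three products give 6n^3 plus O(n^2) additions.

```c
#include <stdlib.h>

/* Naive real n x n product, row-major: c = a * b (2n^3 flops). */
static void real_gemm(size_t n, const double *a, const double *b, double *c)
{
    for (size_t i = 0; i < n; i++) {
        for (size_t j = 0; j < n; j++) {
            double s = 0.0;
            for (size_t k = 0; k < n; k++)
                s += a[i * n + k] * b[k * n + j];
            c[i * n + j] = s;
        }
    }
}

/* Complex product (cr + i ci) = (ar + i ai)(br + i bi) using three real
 * products instead of four. Returns 0 on success, -1 if allocation fails. */
int gemm3m_sketch(size_t n,
                  const double *ar, const double *ai,
                  const double *br, const double *bi,
                  double *cr, double *ci)
{
    size_t nn = n * n;
    if (nn == 0)
        return 0;

    /* One workspace holding T1, T2, T3, (Ar + Ai) and (Br + Bi). */
    double *w = malloc(5 * nn * sizeof *w);
    if (w == NULL)
        return -1;
    double *t1 = w, *t2 = w + nn, *t3 = w + 2 * nn;
    double *sa = w + 3 * nn, *sb = w + 4 * nn;

    for (size_t i = 0; i < nn; i++) {
        sa[i] = ar[i] + ai[i];
        sb[i] = br[i] + bi[i];
    }

    real_gemm(n, ar, br, t1);  /* T1 = Ar * Br */
    real_gemm(n, ai, bi, t2);  /* T2 = Ai * Bi */
    real_gemm(n, sa, sb, t3);  /* T3 = (Ar + Ai) * (Br + Bi) */

    for (size_t i = 0; i < nn; i++) {
        cr[i] = t1[i] - t2[i];          /* real part */
        ci[i] = t3[i] - t1[i] - t2[i];  /* imaginary part */
    }

    free(w);
    return 0;
}
```

The extra O(n^2) additions and the workspace are why the gain only shows up for large enough matrices, which matches the caveat in the commit message.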