Mentions légales du service

Skip to content
Snippets Groups Projects
  1. Jul 06, 2017
  2. Apr 14, 2017
    • THIBAULT Samuel's avatar
      Add Out-of-Core option · 3e6305c6
      THIBAULT Samuel authored and Mathieu Faverge's avatar Mathieu Faverge committed
      Add MORSE_Desc_Create_OOC, which is like MORSE_Desc_Create, but does not
      actually allocate a matrix, thus letting the runtime allocate on-demand the
      tiles, possibly pushing them to the disk.
      
      Add a --ooc option to tests to enable this.
      3e6305c6
  3. Mar 14, 2017
  4. Mar 09, 2017
  5. Mar 06, 2017
  6. Feb 14, 2017
  7. Dec 24, 2016
  8. Dec 09, 2016
  9. Dec 01, 2016
    • PRUVOST Florent's avatar
      Re-work the cmake for simulation mode: · 7255c7c4
      PRUVOST Florent authored
          - use (starpu_cpu_func_t) 1 trick, same as cuda_func
          - cpu funtions are not defined anymore avoiding the dependency to coreblas
          - add #if !defined(CHAMELEON_SIMULATION) where it is needed
          - remove dependency to the coreblas library (become useless)
          - remove useless simucblas, simulapacke libraries
          - remove CHAMELEON_SIMULATION_MAGMA cmake variable and definition
            - keep using CHAMELEON_USE_CUDA and CHAMELEON_USE_MAGMA to consider CUDA kernels
            - this avoid to introduce useless new variables
          - work on messages
      
      7255c7c4
  10. Oct 12, 2016
  11. Sep 20, 2016
    • Guillaume Sylvand's avatar
      Add possibility to use z/cgemm3m for complex mat-mat products · 747c7935
      Guillaume Sylvand authored
      This routine, available in MKL, does a product in 6n^3 ops instead of 8n^3
      but is interesting only for "large enough" matrices (to be tested...)
      Potentially, we gain 25 % in all complex computations.
      It could be interesting to look for it / implement it in cuda.
      
      !!! Note that the flop counters are not updated         !!!
      !!! In C/Z accuracy, most flops counter should be x0.75 !!!
      
      IT is OFF by default
      It is activated with MORSE_Enable(MORSE_GEMM3M)
      In the timing routines, it is activated with --gemm3m
      747c7935
    • Guillaume Sylvand's avatar
      Add a 'progress indicator' feature, that displays a percentage of completion · 92a3c4a1
      Guillaume Sylvand authored
      IT is OFF by default
      It is activated with MORSE_Enable(MORSE_PROGRESS)
      In the timing routines, it is activated with --progress
      No progress is printed for tasks faster than 10 seconds
      92a3c4a1
  12. Sep 09, 2016
  13. Sep 07, 2016
  14. Oct 05, 2015
  15. Sep 29, 2015
  16. Sep 28, 2015
  17. Sep 17, 2015
  18. Sep 16, 2015
    • THIBAULT Samuel's avatar
      Introduce MORSE_Distributed_start, MORSE_Distributed_stop, · 34558c7a
      THIBAULT Samuel authored
      MORSE_Distributed_size, MORSE_Distributed_rank so that applications do not
      hardcode the use of MPI.
      
      Introduce RUNTIME_distributed_rank, RUNTIME_distributed_size,
      RUNTIME_distributed_barrier, so that MORSE does not hardcode the use of MPI
      either.
      
      This allows to use simgrid-mpi.
      34558c7a
  19. Jul 28, 2015
  20. Feb 05, 2015
  21. Nov 19, 2014
  22. Nov 16, 2014
Loading