Commits · f420ba3b573be64440f59a9adb038fe6d1b86412 · solverstack / Chameleon

Mar 17, 2017
- Adding pruning stats on top of memaccess · 2f441bb8
  THIBAULT Samuel authored 8 years ago
  
Mar 14, 2017
- Change the error option into a warning option to disable the messages · 439efbf8
  Mathieu Faverge authored 8 years ago
  
Dec 24, 2016
- Apply the max/min change to other runtimes and timings · 4b52f1eb
  Mathieu Faverge authored 8 years ago
  
Dec 09, 2016
- Merge back the twosided branch · 22e8ea8e
  Mathieu Faverge authored 8 years ago
  
Oct 12, 2016
- timing: better handle the return codes (and the failures...) · 002fea6e
  Guillaume Sylvand authored 8 years ago
  
- timing: add option --bigmat to choose whether we allocate one big 'mat' array or the runtime allocates the tiles one by one · b269b119
  Guillaume Sylvand authored 8 years ago
Sep 20, 2016

Add possibility to use z/cgemm3m for complex mat-mat products · 747c7935

This routine, available in MKL, computes a complex product in 6n^3 operations instead of 8n^3,
but pays off only for "large enough" matrices (to be tested...).
Potentially, we gain 25% in all complex computations.
It could be worth looking for it, or implementing it, in CUDA.

!!! Note that the flop counters are not updated            !!!
!!! In C/Z precision, most flop counters should be x0.75 !!!

It is OFF by default
It is activated with MORSE_Enable(MORSE_GEMM3M)
In the timing routines, it is activated with --gemm3m
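
The 6n^3 versus 8n^3 count quoted in this commit matches the classical "3M" scheme, which builds a complex product from three real products instead of four (each real n x n product costs about 2n^3 flops). The C sketch below illustrates that scheme only. It is neither MKL's z/cgemm3m implementation nor Chameleon code; the names real_gemm and gemm3m_sketch, the split real/imaginary storage, and the workspace layout are assumptions made for illustration.

```c
#include <assert.h>
#include <stddef.h>

/* Real n x n product C = A * B (row-major), about 2*n^3 flops. */
static void real_gemm(size_t n, const double *A, const double *B, double *C)
{
    for (size_t i = 0; i < n; i++) {
        for (size_t j = 0; j < n; j++) {
            double s = 0.0;
            for (size_t k = 0; k < n; k++) {
                s += A[i * n + k] * B[k * n + j];
            }
            C[i * n + j] = s;
        }
    }
}

/* Hypothetical helper (not a Chameleon or MKL routine).
 * Computes (Cr + i*Ci) = (Ar + i*Ai) * (Br + i*Bi) with three real
 * products instead of four:
 *   T1 = Ar*Br, T2 = Ai*Bi, T3 = (Ar+Ai)*(Br+Bi)
 *   Cr = T1 - T2,  Ci = T3 - T1 - T2
 * work must provide room for 5*n*n doubles. */
static void gemm3m_sketch(size_t n,
                          const double *Ar, const double *Ai,
                          const double *Br, const double *Bi,
                          double *Cr, double *Ci, double *work)
{
    assert(n > 0 && work != NULL);

    size_t nn = n * n;
    double *T1 = work;
    double *T2 = work + nn;
    double *T3 = work + 2 * nn;
    double *Sa = work + 3 * nn;   /* Ar + Ai */
    double *Sb = work + 4 * nn;   /* Br + Bi */

    for (size_t p = 0; p < nn; p++) {
        Sa[p] = Ar[p] + Ai[p];
        Sb[p] = Br[p] + Bi[p];
    }

    real_gemm(n, Ar, Br, T1);     /* first real product  */
    real_gemm(n, Ai, Bi, T2);     /* second real product */
    real_gemm(n, Sa, Sb, T3);     /* third real product  */

    for (size_t p = 0; p < nn; p++) {
        Cr[p] = T1[p] - T2[p];
        Ci[p] = T3[p] - T1[p] - T2[p];
    }
}
```

For n = 1, (1+2i)(3+4i) gives T1 = 3, T2 = 8, T3 = 21, hence -5 + 10i. The extra additions cost only O(n^2), but they and the temporaries are overhead that the saved product has to outweigh, which is consistent with the remark that the routine is interesting only for large enough matrices. The 3M form also rounds differently from the conventional four-product form.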


Add a 'progress indicator' feature, that displays a percentage of completion · 92a3c4a1

Guillaume Sylvand authored 8 years ago

It is OFF by default
It is activated with MORSE_Enable(MORSE_PROGRESS)
In the timing routines, it is activated with --progress
No progress is printed for tasks faster than 10 seconds


Sep 16, 2015

Introduce MORSE_Distributed_start, MORSE_Distributed_stop, · 34558c7a

THIBAULT Samuel authored 9 years ago

MORSE_Distributed_size, MORSE_Distributed_rank so that applications do not
hardcode the use of MPI.

Introduce RUNTIME_distributed_rank, RUNTIME_distributed_size,
RUNTIME_distributed_barrier, so that MORSE does not hardcode the use of MPI
either.

This allows the use of simgrid-mpi.


Jul 28, 2015
- distribute data needed for ibnb and block-diagonal workspaces · b7731f39
  PRUVOST Florent authored 9 years ago
  
Nov 19, 2014

change copyright - correct whitespace - place cmake module depending on chameleon in cmake_modules and no longer in cmake_modules/morse · 1bf6a900

PRUVOST Florent authored 10 years ago

Nov 16, 2014
- save number of MPI processes and print it with the main information in timings · b91a45ae
  PRUVOST Florent authored 10 years ago
  
- change name MAGMAMORSE and cousins to CHAMELEON · 043add66
  PRUVOST Florent authored 10 years ago
  
- mv new_magmamorse folder in chameleon · 5ca19ea0
  PRUVOST Florent authored 10 years ago
  
- Branching from branches/new_magmamorse to trunk/chameleon at 2005 · 74616e26
  PRUVOST Florent authored 10 years ago
  
