Commits · 657f29b36ecab58c355ce5485c562c4cdd26dc5f · solverstack / Chameleon

Jul 06, 2017
- fixed compilation bug · 657f29b3
  BOUCHERIE Raphael authored 7 years ago
  
  657f29b3
- Checking if parameters are correct · 38b89ecf
  BOUCHERIE Raphael authored 7 years ago
  
  38b89ecf
- init for opt · 2c4fcd3d
  BOUCHERIE Raphael authored 7 years ago
  
  2c4fcd3d
- Minor · 1e39f0fd
  BOUCHERIE Raphael authored 7 years ago
  
  1e39f0fd
- updated help display · 0622c5e0
  BOUCHERIE Raphael authored 7 years ago
  
  0622c5e0
- long option · 69a9c17f
  BOUCHERIE Raphael authored 7 years ago
  
  69a9c17f
- typo · 6a8b5a8a
  BOUCHERIE Raphael authored 7 years ago
  
  6a8b5a8a
- n_range option · 1bc1e05a
  BOUCHERIE Raphael authored 7 years ago
  
  1bc1e05a
- timing options · 60f9e36e
  BOUCHERIE Raphael authored 7 years ago
  
  60f9e36e
- updated show_help · 2504cfdb
  BOUCHERIE Raphael authored 7 years ago
  
  2504cfdb
- adding getopt for timing · 2d32f27b
  BOUCHERIE Raphael authored 7 years ago
  
  2d32f27b
- added getopt for timing file · 6a21f331
  BOUCHERIE Raphael authored 7 years ago
  
  6a21f331
Apr 14, 2017

THIBAULT Samuel authored 8 years ago and

Mathieu Faverge committed 7 years ago

Add MORSE_Desc_Create_OOC, which is like MORSE_Desc_Create, but does not
actually allocate a matrix, thus letting the runtime allocate on-demand the
tiles, possibly pushing them to the disk.

Add a --ooc option to tests to enable this.

3e6305c6

Mar 14, 2017
- Change the error option, into warning option to disable the messages · 439efbf8
  Mathieu Faverge authored 8 years ago
  
  439efbf8
Mar 09, 2017
- Fix comment · 89f34f2c
  Mathieu Faverge authored 8 years ago
  
  89f34f2c
Mar 06, 2017
- Fix issue 16 · 33e626d7
  Mathieu Faverge authored 8 years ago
  
  33e626d7
Feb 14, 2017
- Fix some header paths which had the wrong format · 7f577a1c
  COJEAN Terry authored 8 years ago
  
  7f577a1c
Dec 24, 2016
- Apply the max/min change to other runtimes and timings · 4b52f1eb
  Mathieu Faverge authored 8 years ago
  
  4b52f1eb
Dec 09, 2016
- Merge back the twosided branch · 22e8ea8e
  Mathieu Faverge authored 8 years ago
  
  22e8ea8e
Dec 01, 2016

Re-work the cmake for simulation mode: · 7255c7c4

PRUVOST Florent authored 8 years ago

    - use (starpu_cpu_func_t) 1 trick, same as cuda_func
    - cpu funtions are not defined anymore avoiding the dependency to coreblas
    - add #if !defined(CHAMELEON_SIMULATION) where it is needed
    - remove dependency to the coreblas library (become useless)
    - remove useless simucblas, simulapacke libraries
    - remove CHAMELEON_SIMULATION_MAGMA cmake variable and definition
      - keep using CHAMELEON_USE_CUDA and CHAMELEON_USE_MAGMA to consider CUDA kernels
      - this avoid to introduce useless new variables
    - work on messages

7255c7c4

Oct 12, 2016
- timing: better handles the return codes (and the failures...) · 002fea6e
  Guillaume Sylvand authored 8 years ago
  
  002fea6e
- timing: add option --bigmat to choose if we allocate one big 'mat' array or if... · b269b119
  Guillaume Sylvand authored 8 years ago
  
  timing: add option --bigmat to choose if we allocate one big 'mat' array or if the runtime allocates the tile one by one
  b269b119
Sep 20, 2016

Add possibility to use z/cgemm3m for complex mat-mat products · 747c7935

Guillaume Sylvand authored 8 years ago

This routine, available in MKL, does a product in 6n^3 ops instead of 8n^3
but is interesting only for "large enough" matrices (to be tested...)
Potentially, we gain 25 % in all complex computations.
It could be interesting to look for it / implement it in cuda.

!!! Note that the flop counters are not updated         !!!
!!! In C/Z accuracy, most flops counter should be x0.75 !!!

IT is OFF by default
It is activated with MORSE_Enable(MORSE_GEMM3M)
In the timing routines, it is activated with --gemm3m

747c7935

Add a 'progress indicator' feature, that displays a percentage of completion · 92a3c4a1

Guillaume Sylvand authored 8 years ago

IT is OFF by default
It is activated with MORSE_Enable(MORSE_PROGRESS)
In the timing routines, it is activated with --progress
No progress is printed for tasks faster than 10 seconds

92a3c4a1

Sep 09, 2016
- chameleon: avoid warning about implicit declaration · c7061a85
  PRUVOST Florent authored 8 years ago
  
  c7061a85
Sep 07, 2016
- Disable profiling in timing driver instead of doing it in the runtime initialization · 112920b4
  Guillaume Sylvand authored 8 years ago
  
  112920b4
Oct 05, 2015
- Move variable declaration to avoid warning · f9eb09aa
  Mathieu Faverge authored 9 years ago
  
  f9eb09aa
Sep 29, 2015
- use warmup option by default for timings · e49b76d4
  PRUVOST Florent authored 9 years ago
  
  e49b76d4
Sep 28, 2015
- Fix documentation of default warmup configuration · 6f5abd11
  THIBAULT Samuel authored 9 years ago
  
  6f5abd11
Sep 17, 2015
- Rename distributed_size/rank into comm_size/rank, and use RUNTIME_barrier · a5069068
  THIBAULT Samuel authored 9 years ago
  
  instead of introducing RUNTIME_distributed_barrier
  a5069068
Sep 16, 2015

Introduce MORSE_Distributed_start, MORSE_Distributed_stop, · 34558c7a

THIBAULT Samuel authored 9 years ago

MORSE_Distributed_size, MORSE_Distributed_rank so that applications do not
hardcode the use of MPI.

Introduce RUNTIME_distributed_rank, RUNTIME_distributed_size,
RUNTIME_distributed_barrier, so that MORSE does not hardcode the use of MPI
either.

This allows to use simgrid-mpi.

34558c7a

Jul 28, 2015
- no warmup by default, activate it in timing drivers with --warmup · ae0fd1df
  PRUVOST Florent authored 9 years ago
  
  ae0fd1df
Feb 05, 2015

remove the extra token > in ...cblas.h>> · bdddf59a
PRUVOST Florent authored 10 years ago

bdddf59a

change the way we include our own header files --> relative to the root - when... · 004c8548

PRUVOST Florent authored 10 years ago

change the way we include our own header files --> relative to the root - when plasma is in the same env, chameleon can take some headers not belonging to it (ex: #include descriptor.h, this file states in plasma install dir also) which make compilation errors

004c8548

Nov 19, 2014

change copyright - correct whitespace - place cmake module depending on... · 1bf6a900

PRUVOST Florent authored 10 years ago

change copyright - correct whitespace - place cmake module depending on chameleon in cmake_modules and no more in cmake_modules/morse

1bf6a900

Nov 16, 2014
- save number of MPI process and print it with main informations in timings · b91a45ae
  PRUVOST Florent authored 10 years ago
  
  b91a45ae
- change MORSE to CHAMELEON for some printf · f7f2eda1
  PRUVOST Florent authored 10 years ago
  
  f7f2eda1
- this chameleon first version is set to 0.9 · 9c761fcf
  PRUVOST Florent authored 10 years ago
  
  9c761fcf
- change name MAGMAMORSE and cousins to CHAMELEON · 043add66
  PRUVOST Florent authored 10 years ago
  
  043add66
- mv new_magmamorse folder in chameleon · 5ca19ea0
  PRUVOST Florent authored 10 years ago
  
  5ca19ea0