- Feb 06, 2018
-
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
- Feb 05, 2018
-
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
- Jan 31, 2018
-
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
- Jan 30, 2018
-
-
Mathieu Faverge authored
-
- Jan 24, 2018
-
-
Mathieu Faverge authored
-
- Oct 18, 2017
-
-
PRUVOST Florent authored
-
- Oct 04, 2017
-
-
PRUVOST Florent authored
-
- Sep 13, 2017
-
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
- Jun 16, 2017
-
-
-
BOUCHERIE Raphael authored
-
BOUCHERIE Raphael authored
-
- Jun 13, 2017
-
-
PRUVOST Florent authored
-
- Jun 12, 2017
-
-
PRUVOST Florent authored
-
- Apr 05, 2017
-
-
Mathieu Faverge authored
-
- Jan 03, 2017
-
-
PRUVOST Florent authored
-
- Dec 24, 2016
-
-
Mathieu Faverge authored
-
- Dec 21, 2016
-
-
Mathieu Faverge authored
-
- Dec 16, 2016
-
-
Mathieu Faverge authored
-
- Dec 15, 2016
-
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
- Dec 09, 2016
-
-
Mathieu Faverge authored
-
- Dec 04, 2016
-
-
Mathieu Faverge authored
-
- Dec 01, 2016
-
-
PRUVOST Florent authored
- use (starpu_cpu_func_t) 1 trick, same as cuda_func - cpu funtions are not defined anymore avoiding the dependency to coreblas - add #if !defined(CHAMELEON_SIMULATION) where it is needed - remove dependency to the coreblas library (become useless) - remove useless simucblas, simulapacke libraries - remove CHAMELEON_SIMULATION_MAGMA cmake variable and definition - keep using CHAMELEON_USE_CUDA and CHAMELEON_USE_MAGMA to consider CUDA kernels - this avoid to introduce useless new variables - work on messages
-
- Nov 30, 2016
-
-
PRUVOST Florent authored
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
- Sep 22, 2016
-
-
Guillaume Sylvand authored
to avoid reverse dependency libcoreblas->libchameleon set_coreblas_gemm3m_enabled()/get_coreblas_gemm3m_enabled() allow to set/get this variable.
-
- Sep 20, 2016
-
-
Guillaume Sylvand authored
This routine, available in MKL, does a product in 6n^3 ops instead of 8n^3 but is interesting only for "large enough" matrices (to be tested...) Potentially, we gain 25 % in all complex computations. It could be interesting to look for it / implement it in cuda. !!! Note that the flop counters are not updated !!! !!! In C/Z accuracy, most flops counter should be x0.75 !!! IT is OFF by default It is activated with MORSE_Enable(MORSE_GEMM3M) In the timing routines, it is activated with --gemm3m
-