- Jan 06, 2020
-
-
- Dec 20, 2019
-
-
Mathieu Faverge authored
Fix issue #87 by setting the number of threads to the default on the machine Closes #87 See merge request !183
-
- Dec 19, 2019
-
-
Mathieu Faverge authored
-
- Dec 13, 2019
-
-
Mathieu Faverge authored
add eztrace module for TCORE_ functions See merge request solverstack/chameleon!181
-
Remove the FXT starpu option and the enable tracaing. It is now done by eztrace, or directly with STARPU_USE_FXT
-
- Dec 09, 2019
-
-
Mathieu Faverge authored
Feature/ci/plafrim/nmad See merge request solverstack/chameleon!180
-
-
- Nov 29, 2019
-
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
- Nov 27, 2019
-
-
Mathieu Faverge authored
-
- Nov 26, 2019
-
-
Mathieu Faverge authored
-
Mathieu Faverge authored
Fix issues on new_testing See merge request !179
-
Mathieu Faverge authored
-
- Nov 25, 2019
-
-
Mathieu Faverge authored
Add an option CHAMELEON_RUNTIME_SYNC to debug quark and StarPU in synchronous mode See merge request !177
-
Mathieu Faverge authored
-
- Nov 23, 2019
-
-
Mathieu Faverge authored
Coverity See merge request !178
-
Mathieu Faverge authored
-
- Nov 22, 2019
-
-
Mathieu Faverge authored
Replace tile pointers by CHAM_tile data structure This PR aims at replacing the couples (A, lda) by a data structure CHAM_tile_t that can be used to hide different kind of tiles: * Full-rank matrices * Low-rank matrices in any format * A descriptor itself to do recursive algorithm ``` typedef struct chameleon_tile_s { int m, n, ld; void *mat; } CHAM_tile_t; ``` In the future, a format field will be added, to switch between formats, and the runtime handle will be stored within this structure instead of within the matrix descriptor. Former INSERT_TASK functions: ```C void INSERT_TASK_zgemm( const RUNTIME_option_t *options, cham_trans_t transA, cham_trans_t transB, int m, int n, int k, int nb, CHAMELEON_Complex64_t alpha, const CHAM_desc_t *A, int Am, int An, int lda, const CHAM_desc_t *B, int Bm, int Bn, int ldb, CHAMELEON_Complex64_t beta, const CHAM_desc_t *C, int Cm, int Cn, int ldc ); ``` is replaced by ```C void INSERT_TASK_zgemm( const RUNTIME_option_t *options, cham_trans_t transA, cham_trans_t transB, int m, int n, int k, int nb, CHAMELEON_Complex64_t alpha, const CHAM_desc_t *A, int Am, int An, const CHAM_desc_t *B, int Bm, int Bn, CHAMELEON_Complex64_t beta, const CHAM_desc_t *C, int Cm, int Cn ); ``` With this change, comes another one in the kernel. A extra layer has been added to switch between the different kind of tiles if needed in `coreblas/compute/core_ztile.c`. For now, however, only the classic full-rank interface is available, however the layer is already called to simplify the work on hierarchical descriptors and h-matrices. All codelets from Quark, StarPU and OpenMP have thus been modified to use this layer instead of the classic CORE_z layer. This change has not been done in PaRSEC, and thus advanced matrix format will not be available with PaRSEC as long as it stays like this. Regarding StarPU, to handle this new format in distributed, a cham_tile_interface has been added to handle the packing/unpacking of the data and support this new type of information. This is the major change in the runtime directory. @all I do not recommend to read the full changes, but at least please have a look at the changes in all files except `compute/*`, and `runtime/*/codelets/`. All changes outside these directories are minimal and should be proofread, and if you can test this PR, especially with GPUs and OOC. That would be great. Thanks. Edit: I have also directly added the format field in the CHAM_tile_t structure to check the format and limit the conflicts between parallel developments. All TCORE interfaces include assert that the tiles are in the full-rank format in order to avoid problems when developing new functionality. See merge request !142
-
Mathieu Faverge authored
Add CHAM_tile_t interface
-
- Nov 19, 2019
-
-
PRUVOST Florent authored
update coverity job See merge request solverstack/chameleon!176
-
PRUVOST Florent authored
-
- Nov 18, 2019
-
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
- Nov 17, 2019
-
-
Mathieu Faverge authored
Missing changes from !166 See merge request !175
-
- Nov 16, 2019
-
-
Mathieu Faverge authored
Add norms output to QR functions, A, A-fact(A), and Id-QQ' Remove debug printf
-
Mathieu Faverge authored
-
- Nov 15, 2019
-
-
Mathieu Faverge authored
New testing/timing structure See merge request !166
-
- Nov 08, 2019
-
-
- Nov 07, 2019
-
-
Mathieu Faverge authored
Fix hemm/symm summa versions * Fix hemm/symm summa versions * Fix her2k in lower case (conj(alpha) misplaced) * Simplify the code of her2k/syr2k See merge request !174
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
- Nov 05, 2019
-
-
Mathieu Faverge authored
Fix check for trans parameters See merge request !173
-
Mathieu Faverge authored
-
Mathieu Faverge authored
-
- Nov 04, 2019
-
-
Mathieu Faverge authored
Fix unmqr/unmlq family functions for corner cases See merge request !172
-
Mathieu Faverge authored
-
- Oct 29, 2019
-
-
Mathieu Faverge authored
-
- Oct 25, 2019
-
-
Mathieu Faverge authored
Change default parameters and add functionalities to the descriptor interface See merge request !171
-
- Oct 22, 2019
-
-
Mathieu Faverge authored
-
- Oct 11, 2019
-
-
Mathieu Faverge authored
-