Commits · a2f0e50dbce3e2d35c4aa419c5f04843929bb3d5 · solverstack / Chameleon

Feb 06, 2018
- Fix date format when present · a2f0e50d
  Mathieu Faverge authored 7 years ago
- Update version number · 7f98a5b1
  Mathieu Faverge authored 7 years ago
- Add @file line to all source files as the first doxygen command · 34b0b92a
  Mathieu Faverge authored 7 years ago
- Fix incorrect @file lines · 158c93b0
  Mathieu Faverge authored 7 years ago
- Copyrights in all .c and .h should be fine · eb5acc7b
  Mathieu Faverge authored 7 years ago
- Minor to compact the files · e6365e9c
  Mathieu Faverge authored 7 years ago
Feb 05, 2018
- Update Inria, INP, Univ Bordeaux, CNRS copyrights · 9e2f09ba
  Mathieu Faverge authored 7 years ago
- Update ICL copyrights · d0e8e4d7
  Mathieu Faverge authored 7 years ago
Jan 31, 2018
- Disable GPU on LQ case as it is not implemented yet · fbd087b7
  Mathieu Faverge authored 7 years ago
- Add tplqt and tpmlqt kernels to later link on MKL · 504741b3
  Mathieu Faverge authored 7 years ago
- Change the test in tpqrt · 941c433a
  Mathieu Faverge authored 7 years ago
Jan 30, 2018
- Silence warning about double identical test · 09a6085c
  Mathieu Faverge authored 7 years ago
Jan 24, 2018
- Fix issue with desc created/destroyed in mat_alloc/free · b33153a7
  Mathieu Faverge authored 7 years ago
Oct 18, 2017
- add dependencies between sources target and library to avoid parallel build failure · 8c67f24c
  PRUVOST Florent authored 7 years ago
Oct 04, 2017
- create a new target to force generation of sources, useful for generating the doc · e39bac46
  PRUVOST Florent authored 7 years ago
Sep 13, 2017
- Restore INSTALL_NAME_DIR property for MacOS · 49e064cd
  Mathieu Faverge authored 7 years ago
- Silence the new warning on gemm3m · 73a65e2f
  Mathieu Faverge authored 7 years ago
- Move the check for simulation to the higher level · 2825b6e1
  Mathieu Faverge authored 7 years ago
- Restructure headers in coreblas directory · 1036fc63
  Mathieu Faverge authored 7 years ago
- Fix includes with the move to subdir chameleon, and fix some warnings · 75c85b49
  Mathieu Faverge authored 7 years ago
Jun 16, 2017
- Fix major issue in parameters check of ttmlq · 12f41621
  Mathieu Faverge authored 7 years ago and BOUCHERIE Raphael committed 7 years ago
- resolved discussion 11 · cc40a4d2
  BOUCHERIE Raphael authored 7 years ago
- test for zgels_param pass · 540d53b0
  BOUCHERIE Raphael authored 7 years ago
Jun 13, 2017
- remove definitions from pkg-config that are useless for users · 84d5ddec
  PRUVOST Florent authored 7 years ago
Jun 12, 2017
- add definitions in the pkg-config file · 02a28fb9
  PRUVOST Florent authored 7 years ago
Apr 05, 2017
- Use pointer arithmetic as in cudablas version · ee734dd3
  Mathieu Faverge authored 7 years ago
Jan 03, 2017
- remove references to simulapacke and simucblas which do not exist anymore · bb567c44
  PRUVOST Florent authored 8 years ago
Dec 24, 2016
- Clean up warnings, especially by using a static inline function instead of a macro for min/max · 9e668381
  Mathieu Faverge authored 8 years ago
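
The commit message does not show the code, so what follows is only a sketch of the general technique it names. A function-like macro for max evaluates one of its arguments twice, whereas a static inline function evaluates each argument exactly once and gives the compiler real parameter types to check. All identifiers here (`MACRO_MAX`, `imax`, `imin`, `next_value`, `calls`) are hypothetical and are not taken from Chameleon.

```c
#include <stdio.h>

/* Macro version: whichever argument wins the comparison is evaluated twice. */
#define MACRO_MAX(a, b) ((a) > (b) ? (a) : (b))

/* Hypothetical replacements: each argument is evaluated once, and the
 * parameters are type-checked like any other function call. */
static inline int imax(int a, int b) { return (a > b) ? a : b; }
static inline int imin(int a, int b) { return (a < b) ? a : b; }

/* Helper with a side effect, used to count how often an argument runs. */
int calls = 0;
int next_value(void) { return ++calls; }

/* Prints the result and the number of evaluations for both variants. */
void compare_min_max_variants(void)
{
    calls = 0;
    int m = MACRO_MAX(next_value(), 0);
    printf("macro:  result=%d evaluations=%d\n", m, calls);

    calls = 0;
    int f = imax(next_value(), 0);
    printf("inline: result=%d evaluations=%d\n", f, calls);

    printf("imin(3, 7) = %d\n", imin(3, 7));
}
```

With the macro, `next_value()` runs once for the comparison and once more to produce the result; with `imax` it runs a single time.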
Dec 21, 2016
- Clean up and silence warnings · dfc3fae8
  Mathieu Faverge authored 8 years ago
Dec 16, 2016
- Add codelets in all three runtimes · 22869caf
  Mathieu Faverge authored 8 years ago
Dec 15, 2016
- Add compilation of the kernels · 9224a468
  Mathieu Faverge authored 8 years ago
- Add tpqrt and tpmqrt kernels · 9c2e2baf
  Mathieu Faverge authored 8 years ago
Dec 09, 2016
- Merge back the twosided branch · 22e8ea8e
  Mathieu Faverge authored 8 years ago
Dec 04, 2016
- Make unmqr/unmlq call the larfb · 31e42080
  Mathieu Faverge authored 8 years ago
Dec 01, 2016
- Re-work the cmake for simulation mode: · 7255c7c4
  PRUVOST Florent authored 8 years ago

  - use (starpu_cpu_func_t) 1 trick, same as cuda_func
  - cpu functions are not defined anymore, avoiding the dependency to coreblas
  - add #if !defined(CHAMELEON_SIMULATION) where it is needed
  - remove dependency to the coreblas library (it becomes useless)
  - remove useless simucblas, simulapacke libraries
  - remove CHAMELEON_SIMULATION_MAGMA cmake variable and definition
    - keep using CHAMELEON_USE_CUDA and CHAMELEON_USE_MAGMA to consider CUDA kernels
    - this avoids introducing useless new variables
  - work on messages
Nov 30, 2016
- save chameleon dependencies in the course of find process · fc00c516
  PRUVOST Florent authored 8 years ago
- Merge back qdwh to integrate lascal · 280e9e88
  Mathieu Faverge authored 8 years ago
- Fix a lot of QR/LQ functions (I thought it was already in the trunk ...) · 0a540293
  Mathieu Faverge authored 8 years ago
Sep 22, 2016
- Move the flag gemm3m_enabled from MORSE_context_t to coreblas/compute/global.c · 7a0889ad
  Guillaume Sylvand authored 8 years ago

  This avoids the reverse dependency libcoreblas->libchameleon.
  set_coreblas_gemm3m_enabled()/get_coreblas_gemm3m_enabled() set and get
  this variable.
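
The accessor pattern described in this commit can be sketched as follows. Only the two function names come from the commit message; their signatures, the name of the backing variable, and the `real_products_per_complex_gemm` helper are assumptions made for illustration. The point is that the flag lives inside the lower-level library, so kernels can read it without including anything from the higher-level one.

```c
/* File-scope flag, as it might live in coreblas/compute/global.c.
 * The variable name and the int type are assumptions. */
static int coreblas_gemm3m_enabled = 0;   /* off by default */

/* Accessor names are from the commit message; signatures are assumed. */
void set_coreblas_gemm3m_enabled(int v)
{
    coreblas_gemm3m_enabled = v;
}

int get_coreblas_gemm3m_enabled(void)
{
    return coreblas_gemm3m_enabled;
}

/* Hypothetical consumer inside the low-level library: it consults the
 * flag through the getter and never touches the high-level context. */
int real_products_per_complex_gemm(void)
{
    return get_coreblas_gemm3m_enabled() ? 3 : 4;
}
```

A higher-level call that turns the option on would then only need to forward the value through `set_coreblas_gemm3m_enabled()`.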

Sep 20, 2016
- Add possibility to use z/cgemm3m for complex mat-mat products · 747c7935
  Guillaume Sylvand authored 8 years ago

  This routine, available in MKL, does a product in 6n^3 ops instead of 8n^3
  but is interesting only for "large enough" matrices (to be tested...)
  Potentially, we gain 25% in all complex computations.
  It could be interesting to look for it / implement it in cuda.

  !!! Note that the flop counters are not updated          !!!
  !!! In C/Z precision, most flop counters should be x0.75 !!!

  It is OFF by default
  It is activated with MORSE_Enable(MORSE_GEMM3M)
  In the timing routines, it is activated with --gemm3m
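
The operation counts in this commit match the "3M" way of forming a complex product: three real matrix products of 2n^3 flops each (6n^3) instead of the usual four (8n^3), a ratio of 0.75 when the extra additions are ignored. The sketch below illustrates that idea on split real and imaginary parts. It is not the MKL routine and not Chameleon code; `real_gemm`, `gemm3m_sketch`, the row-major layout, and the square n x n shape are all assumptions chosen to keep the example short.

```c
#include <stdlib.h>

/* C = A * B for real n x n row-major matrices: 2n^3 flops. */
static void real_gemm(int n, const double *A, const double *B, double *C)
{
    for (int i = 0; i < n; i++) {
        for (int j = 0; j < n; j++) {
            double s = 0.0;
            for (int k = 0; k < n; k++)
                s += A[i * n + k] * B[k * n + j];
            C[i * n + j] = s;
        }
    }
}

/* Hypothetical 3M complex product:
 *   (Ar + i Ai)(Br + i Bi) = (T1 - T2) + i (T3 - T1 - T2)
 * with T1 = Ar*Br, T2 = Ai*Bi, T3 = (Ar + Ai)*(Br + Bi).
 * Returns 0 on success, -1 if the workspace cannot be allocated. */
int gemm3m_sketch(int n,
                  const double *Ar, const double *Ai,
                  const double *Br, const double *Bi,
                  double *Cr, double *Ci)
{
    if (n <= 0)
        return 0;                          /* nothing to compute */

    size_t nn = (size_t)n * (size_t)n;
    double *w = malloc(5 * nn * sizeof *w);
    if (w == NULL)
        return -1;

    double *As = w;                        /* Ar + Ai */
    double *Bs = w + nn;                   /* Br + Bi */
    double *T1 = w + 2 * nn;
    double *T2 = w + 3 * nn;
    double *T3 = w + 4 * nn;

    for (size_t i = 0; i < nn; i++) {
        As[i] = Ar[i] + Ai[i];
        Bs[i] = Br[i] + Bi[i];
    }

    real_gemm(n, Ar, Br, T1);              /* three real products ...   */
    real_gemm(n, Ai, Bi, T2);
    real_gemm(n, As, Bs, T3);              /* ... instead of four       */

    for (size_t i = 0; i < nn; i++) {
        Cr[i] = T1[i] - T2[i];
        Ci[i] = T3[i] - T1[i] - T2[i];
    }

    free(w);
    return 0;
}
```

The trade-off is extra additions and workspace, and the subtraction in the imaginary part can lose some accuracy compared with the four-product form, which fits the commit's remark that the routine is only interesting for large enough matrices.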