Mentions légales du service

Skip to content
Snippets Groups Projects

Starpu/alloc on the fly

Merged Mathieu Faverge requested to merge faverge/chameleon:starpu/alloc_on_the_fly into master
All threads resolved!

Modify the way the workspaces are allocated to be able to allocate them on the fly.

The objectives are:

  • Allocate less memory in QR like algorithms, as only the useful T tiles are allocated.
  • Be more asynchronous in some algorithm as QR again, or norms by avoiding the required sequence_wait at the end of the call before freeing the allocated workspaces. This is also used in the upcoming SUMMA algorithms.

The changes are:

  • switch geadd to axpy in norm computations as they may be optimized with mkl
  • switch workspaces from global allocation to tile allocation
  • update QR kernels that generating the T tiles to set it to 0 first. This can not be done through global memset anymore, and to avoid an complete allocation of the matrix, this is moved in the codelets to initialized only the touched tiles.
Edited by Mathieu Faverge

Merge request reports

Checking pipeline status.

Approval is optional

Merged by Mathieu FavergeMathieu Faverge 6 years ago (Feb 8, 2019 9:49am UTC)

Merge details

  • Changes merged into master with fa6d78a3.
  • Deleted the source branch.
  • Auto-merge enabled

Activity

Filter activity
  • Approvals
  • Assignees & reviewers
  • Comments (from bots)
  • Comments (from users)
  • Commits & branches
  • Edits
  • Labels
  • Lock status
  • Mentions
  • Merge request status
  • Tracking
  • Thanks a lot @faverge for noticing this!

    Indeed the dependencies length were very wrong, I thought I caught them all when rebasing my original PR... Thanks for fixing them!

  • Philippe Virouleau
  • added 1 commit

    Compare with previous version

  • Mathieu Faverge resolved all discussions

    resolved all discussions

  • Mathieu Faverge enabled an automatic merge when the pipeline for c100ef0f succeeds

    enabled an automatic merge when the pipeline for c100ef0f succeeds

  • Mathieu Faverge mentioned in commit fa6d78a3

    mentioned in commit fa6d78a3

  • Please register or sign in to reply
    Loading