Reduce data access type whenever possible
This PR reduces the data access type whenever possible, so that more cases can use on-the-fly allocation.
For example, with a gemm C = alpha * A * B + beta * C and beta = 0, C is used only as output, and not as inout. This changes all the codelets to STARPU_VARIABLE_NBUFFERS.
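
A minimal sketch of the beta = 0 rule, not the code from this PR. The enum and the function gemm_c_access are hypothetical stand-ins defined locally so the snippet compiles without StarPU; in the real codelets the corresponding modes would be STARPU_W and STARPU_RW, and since the mode then depends on a runtime value, it presumably has to be set per task rather than fixed in the codelet.

```c
/* Hypothetical stand-ins for the StarPU access modes STARPU_R, STARPU_W
 * and STARPU_RW, defined here only to keep the sketch self-contained. */
enum access_mode {
    ACCESS_R  = 1,  /* read only */
    ACCESS_W  = 2,  /* write only (output) */
    ACCESS_RW = 3   /* read and write (inout) */
};

/* Choose the access mode of C in C = alpha * A * B + beta * C.
 * When beta is exactly zero, the previous content of C is never read,
 * so C can be declared as output only and allocated on the fly. */
enum access_mode gemm_c_access(double beta)
{
    return (beta == 0.0) ? ACCESS_W : ACCESS_RW;
}
```

With this rule, gemm_c_access(0.0) yields the write-only mode, while any nonzero beta keeps C as inout.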