@@ -372,11 +328,9 @@ StarPU will <b>automatically evict</b> data from the main memory in advance, and
<h4>All in all</h4>
<p>
All that means that, with the help
of <a href="doc/html/cExtensions.html">StarPU's extensions to the C
language</a>, the following sequential source code of a tiled version of
the classical Cholesky factorization algorithm using BLAS is also valid
StarPU code, possibly running on all the CPUs and GPUs, and given a data
All that means that the following sequential source code of a tiled version of
the classical Cholesky factorization algorithm using BLAS is also
(almost) valid StarPU code, possibly running on all the CPUs and GPUs, and given a data
distribution over MPI nodes, it is even a distributed version!
</p>
...
...
@@ -397,7 +351,6 @@ for (k = 0; k < tiles; k++) {
<li>SMP/Multicore Processors (x86, PPC, ARM, ... all Debian architectures have been tested)</li>
<li>NVIDIA GPUs (e.g. heterogeneous multi-GPU), with pipelined and concurrent kernel execution support (new in v1.2) and GPU-GPU direct transfers (new in v1.1)</li>