This branch is the work from @cojean to add the support for the parallel getrf kernel to perform LU with partial pivoting.
This is just a quick (and try to be not dirty) rebase on top of the last master. This require quite some work to restore this work.