Mpi example matrix mult
The goal of this merge request is to provide a more interesting example for matrix multiplication where 2D block cyclic distribution is used initially. Blocks of C still are computed in place as in mpi/examples/matrix_mult/mm.c