ENH: Improve performance of matrix multiplication
The performance of the matrix multiplication of the FastLinearOperator
can be improved by performing it with a single matrix product instead of
one by row.
This commit modifies the initialization of the FastLinearOperator
and
its __matmul__
method to be more performant.