Fix HIP memory pinning and hipBLAS configure issues
- Fix a HIP memory pinning issue. This should improve performance on both the CUDA and ROCm backends.
- Fix a hipBLAS configure issue where the wrong hipblas.h header was picked up (only affects HIP on the CUDA backend).
Edited by Loris