Missing CUDA kernels, and fix many warnings
- Add missing herfb and tpmqt CUDA kernels
- Silent all warnings in Debug with gcc 5.4
- Replace max/min macros with static inline functions to avoid warnings about comparing unsigned/signed int and or int sizes.
@sylvand : can you tell me if you still have trouble when compiling with StarPU ? I hope, I removed every probems.