Perfmodels generalization
The goal is to make the bus calibration phase general for any driver. This is a WIP merge request so that we can comment early on what needs to be changed.
So far I've only tested the CUDA driver and made no change to the other drivers.
Tested on the CUDA and HIP drivers, with basic testing on the OpenCL driver. The MPI and TCPIP drivers have been left untouched.