Add all the architecture to manage multi-precision data descriptors.
- Add StarPU codelets for data conversion on GPU for half, and CPU/GPU for single
- Add precision reduction and restoration algorithms to have a mix of numerical precision in a single descriptor