Improve support for `tf.IndexedSlices` row-sparse gradients.
The current implementation of `TensorflowVector` supports wrapping `tf.IndexedSlices` data structures, but the wrapped computations are wrong.
### Technical context and problem root
`tf.IndexedSlices` are a specific kind of row-sparse tensor, commonly used in TensorFlow to wrap gradients resulting from look-up operations. Using a non-frozen embedding layer is a typical use case that produces such gradients. By default, Python operators fail on these structures, as TensorFlow expects only specific kernels to update them as part of optimizers' backends. TensorFlow operators such as `tf.add`, `tf.multiply`, etc., which we use under the hood in `TensorflowVector`, silently convert `tf.IndexedSlices` to full-rank (dense) tensors. This not only causes unwanted memory use (zero-valued rows are created and allocated in memory), but also alters zero-valued rows that should be left out of the gradients, which is mathematically wrong.
### Proposed solution
We should update the backend of `TensorflowVector` to better handle how operations are applied to wrapped `tf.IndexedSlices` structures. This could probably be done with a generic wrapper that overloads common TensorFlow operators. The point is to enable combining similarly-indexed slices and, most importantly, applying scalar (or broadcastable-tensor) operations to the slices' values, preserving the sparse structure and leaving zero-valued rows unaltered.
Notionally:

- `op(slices, scalar)` should result in `op(slices.values, scalar)`
- `op(slices, tensor)` should result in `op(slices.values, tensor)`, under condition that shapes enable broadcasting the tensor
- `op(tensor, slices)` should result in `op(tensor, tf.convert_to_tensor(slices))`, under condition that shapes match
- `op(slices, slices)` should result in `op(slices.values, slices.values)`, under condition that `slices.indices` and `slices.dense_shape` match
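The dispatch rules above could be sketched as follows. This is a pure-Python illustration under stated assumptions: `Slices`, `to_dense`, `sparse_aware`, and `add` are hypothetical stand-ins for `tf.IndexedSlices`, `tf.convert_to_tensor`, the generic wrapper, and a TensorFlow operator; they are not the actual `TensorflowVector` API.

```python
class Slices:
    """Stand-in for tf.IndexedSlices (values + row indices + dense shape)."""
    def __init__(self, values, indices, dense_shape):
        self.values, self.indices, self.dense_shape = values, indices, dense_shape

def to_dense(s):
    """Stand-in for tf.convert_to_tensor on IndexedSlices."""
    dense = [[0.0] * s.dense_shape[1] for _ in range(s.dense_shape[0])]
    for row, idx in zip(s.values, s.indices):
        dense[idx] = list(row)
    return dense

def sparse_aware(op):
    """Wrap a binary operator so it preserves row-sparsity per the rules above."""
    def wrapped(a, b):
        if isinstance(a, Slices) and isinstance(b, Slices):
            # op(slices, slices): indices and dense shapes must match.
            if a.indices != b.indices or a.dense_shape != b.dense_shape:
                raise ValueError("mismatched indices or dense shapes")
            return Slices(op(a.values, b.values), a.indices, a.dense_shape)
        if isinstance(a, Slices):
            # op(slices, scalar-or-broadcastable-tensor): act on values only.
            return Slices(op(a.values, b), a.indices, a.dense_shape)
        if isinstance(b, Slices):
            # op(tensor, slices): densify the slices, then apply the op.
            return op(a, to_dense(b))
        return op(a, b)
    return wrapped

def add(x, y):
    """Toy elementwise addition with scalar broadcasting over nested lists."""
    if isinstance(x, list) and isinstance(y, list):
        return [add(a, b) for a, b in zip(x, y)]
    if isinstance(x, list):
        return [add(a, y) for a in x]
    if isinstance(y, list):
        return [add(x, b) for b in y]
    return x + y

sadd = sparse_aware(add)
g = Slices([[1.0, 2.0], [3.0, 4.0]], [0, 3], (5, 2))
out = sadd(g, 1.0)    # scalar case: sparsity preserved
print(out.values)     # [[2.0, 3.0], [4.0, 5.0]]
both = sadd(g, g)     # slices + slices with matching indices
print(both.values)    # [[2.0, 4.0], [6.0, 8.0]]
```

In a real implementation, the same dispatch logic would be applied around the actual TensorFlow operators (`tf.add`, `tf.multiply`, etc.) so that zero-valued rows are never materialized when one operand is an `tf.IndexedSlices` instance.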