Backward failure when using Data Provider with drop_last set to False

Maybe related to !23 that was previously fixed

changed the description

During the forward pass, input and *op.getOutput(0) have correct dimensions.

During the backward pass, *op.getInput(0), *op.getOutput(0) and output_grad have correct dimensions, but the dimension of *op.getInput(0)->grad() has not been updated.

In the Aidge::Tensor class, it is unclear whether a tensor and its associated gradient tensor (if any) must always have the same dimensions. If so, this causes problems when calling Tensor::resize, because the tensor's dimensions are resized, but the gradient tensor's dimensions are unchanged. As a result, when the user calls scheduler.forward(), the gradient tensors are never resized (if already exist) to account for changes in the batch size.

I propose a fix that involves resizing gradient tensors in Aidge::Optimizer::resetGrad, but it might be better to modify Tensor::resize instead (to be discussed within the Aidge development team).

I think it would be better indeed to resize the gradient tensor in the resize() function (if it has been initialized!). It seems to be a more generic solution and I don't see how resizing the tensor would not require to automatically resize the gradient...

Ok, I propose this new merge request !397 that modifies resize() function for Tensor

mentioned in merge request !41 (closed)

added Bug 🐛 label

mentioned in merge request aidge_core!397 (merged)

Fixed in aidge_core!397 (merged).

closed

Backward failure when using Data Provider with drop_last set to False

What commit version of aidge do you use

Problem description

Reproducible example code

Designs

Child items ...

Activity