Issue with Conv operator on the CUDA backend: wrong results when the output is bigger than the input
Problem description
When doing inference on mobileNetV2.7 with Aidge's CUDA backend, some of the convolution layers give results that differ significantly from the output of ONNX Runtime.
I tried to isolate the problem, and it seems to occur when the output of the convolution is bigger (in size) than the input.
For example, with conv_model_1.onnx (where the output is smaller than the input), the convolution gives exactly the same output on the Aidge backend_cuda as on ONNX Runtime. However, when I invert the shapes and make the output bigger than the input (conv_model_2.onnx), I get different results from ONNX Runtime.
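The two models are referenced by filename only; for reference, something like the sketch below builds comparable single-Conv test cases. The channel counts, spatial size, and 1x1 kernel are my assumptions inferred from the input shape used in the comparison script, not the actual attached models (for conv_model_2.onnx the input shape in the script would have to be adapted accordingly):

```python
import numpy as np
import onnx
from onnx import helper, numpy_helper

def make_conv_model(in_ch, out_ch, path):
    # Single 1x1 Conv so only the channel count differs between
    # input and output (shapes are assumptions, not the actual models).
    W = numpy_helper.from_array(
        np.random.randn(out_ch, in_ch, 1, 1).astype(np.float32), name="W")
    X = helper.make_tensor_value_info("X", onnx.TensorProto.FLOAT,
                                      [1, in_ch, 112, 112])
    Y = helper.make_tensor_value_info("Y", onnx.TensorProto.FLOAT,
                                      [1, out_ch, 112, 112])
    conv = helper.make_node("Conv", ["X", "W"], ["Y"], kernel_shape=[1, 1])
    graph = helper.make_graph([conv], "conv_test", [X], [Y], initializer=[W])
    model = helper.make_model(graph)
    onnx.checker.check_model(model)
    onnx.save(model, path)

# conv_model_1: output smaller than input -> matches ONNX Runtime
make_conv_model(96, 24, "conv_model_1.onnx")
# conv_model_2: inverted shapes, output bigger than input -> mismatch
make_conv_model(24, 96, "conv_model_2.onnx")
```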
Here is the code I use to compare with ONNX Runtime:
```python
import numpy as np
import onnxruntime

import aidge_core
import aidge_onnx

DEVICE = "cuda"
if DEVICE == "cuda":
    import aidge_backend_cuda
else:
    import aidge_backend_cpu

model_path = "conv_model_1.onnx"
input_data = np.random.randn(1, 96, 112, 112).astype(np.float32)

print("************* Aidge Inference ***************")
aidge_model = aidge_onnx.load_onnx(model_path)

# Feed the input through a Producer node attached to the graph
input_tensor = aidge_core.Tensor(input_data)
input_node = aidge_core.Producer(input_tensor, "input")
input_node.get_operator().set_datatype(aidge_core.DataType.Float32)
input_node.get_operator().set_backend(DEVICE)
input_node.add_child(aidge_model)
aidge_model.add(input_node)

aidge_model.set_datatype(aidge_core.DataType.Float32)
aidge_model.set_backend(DEVICE)

scheduler = aidge_core.SequentialScheduler(aidge_model)
scheduler.forward()

aidge_out_arr = None
for outNode in aidge_model.get_output_nodes():
    if DEVICE == "cuda":
        # Copy the output back to host memory before reading it
        outNode.get_operator().get_output(0).set_backend("cpu")
    output_aidge = np.array(outNode.get_operator().get_output(0))
    aidge_out_arr = output_aidge[0]
    print("Aidge output: {}".format(output_aidge))

print("************* ONNX Inference ***************")
onnx_session = onnxruntime.InferenceSession(model_path)

# Check input shape compatibility
input_name = onnx_session.get_inputs()[0].name
if input_data.shape != tuple(onnx_session.get_inputs()[0].shape):
    raise ValueError(f"Input shape mismatch. Expected: {onnx_session.get_inputs()[0].shape}, Got: {input_data.shape}")

# Run inference
output_name = onnx_session.get_outputs()[0].name
result = onnx_session.run([output_name], {input_name: input_data})
onnx_out_arr = result[0][0]
print("ONNX output: {}".format(onnx_out_arr[:10]))

# Compare the two outputs element-wise
diff_arr = np.abs(aidge_out_arr - onnx_out_arr)
print("diff max ", np.max(diff_arr))
print("diff mean ", np.mean(diff_arr))
percentage_array = (diff_arr / np.abs(onnx_out_arr)) * 100
print("diff % max ", np.max(percentage_array))
print("diff % mean ", np.mean(percentage_array))
```
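A side note on the metric: dividing by `onnx_out_arr` blows up for outputs near zero, so the percentage figures can be misleading. A more robust check (my suggestion, not part of the original comparison) would be:

```python
# Tolerances are illustrative: GPU convolutions need not be bit-exact
# with ONNX Runtime, but float32 results should agree to roughly 1e-4
# relative error; a genuine kernel bug will exceed this by far.
np.testing.assert_allclose(aidge_out_arr, onnx_out_arr, rtol=1e-4, atol=1e-5)
```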