Support 2D Bias for Gemm operator
According to ONNX's documentation, the Bias should be broadcastable to the shape (BatchSize, OutputFeature) which means it could be given with that shape.
This is already the case for one of the ONNX tests:
Edited by Houssem ROUIS