Learning backend cuda
Context
This MR is intended to add the forward implementations for Accuracy computation operators:
- And
- ArgMax
- ReduceSum
And the forward implementations for operators used in the MSE #20 (closed):
- Mul
- ReduceMean
- Pow
Also a fix to #23 (closed) is added: update Tensor descriptor when shape changes.
Also a fix to #7 (closed) is added: force conv fwd algorithm.
We also add operator Ln forward implementation to support #24 (closed).
Modified files
-
AndImpl.hpp
,AndImpl_CUDA_kernels.hpp
,AndImpl_CUDA_kernels.cu
,AndImpl.cpp
andTests_AndImpl.cpp
, add And forward impl; -
ArgMaxImpl.hpp
,ArgMaxImpl_CUDA_kernels.hpp
,ArgMaxImpl_CUDA_kernels.cu
,ArgMaxImpl.cpp
andTests_ArgMaxImpl.cpp
, add ArgMaxforward impl; -
ReduceSumImpl.hpp
,ReduceSumImpl_CUDA_kernels.hpp
,ReduceSumImpl_CUDA_kernels.cu
,ReduceSumImpl.cpp
andTests_ReduceSumImpl.cpp
, add ReduceSum forward impl; -
LnImpl.hpp
,LnImpl_CUDA_kernels.hpp
,LnImpl_CUDA_kernels.cu
,LnImpl.cpp
andTests_LnImpl.cpp
, add Ln forward impl; -
ConvImpl.hpp
,ConvImpl.cpp
andTests_ConvImpl.cpp
, force Conv Fwd algorithm; -
TensorImpl.hpp
, update Tensor descriptor when tensor shape has changed; -
MulImpl.hpp
,MulImpl.cpp
andTest_MulImpl.cpp
, add Mul forward() and backward(); -
ReduceMeanImpl.hpp
,ReduceMeanImpl.cpp
andTest_ReduceMeanImpl.cpp
, add ReduceMean forward() and backward(); -
PowImpl.hpp
,PowImpl.cpp
andTest_PowImpl.cpp
, add Pow forward() and backward();
TODO
-
Mul -
ReduceMean -
Pow -
And -
ArgMax -
ReduceSum -
Fix TensorImpl
Edited by Houssem ROUIS