Tensor prerequisite

mentioned in issue #13 (closed)

I am not sure I understand the question: what is wrong with the current assumption? The only assumption is that TensorImpl stores a contiguous chunk of memory somewhere (it can be anywhere: CPU, GPU, SoC memory...).

Nothing is wrong, except that, AFAIK, it is not written in the specification and is thus not necessarily shared and understood by the contributors. Still, the "somewhere" bothers me. For now, TensorImpl can return an address to its storage. On a GPU, what would this address mean if the storage is not in the process memory space (i.e. in RAM)?

Should we return a handle instead of a pointer (the Windows way?), or never return anything from TensorImpl but only from its backend (the issue being that the signature would then depend on the backend)?

Besides, Tensor and TensorImpl are C++ constructs; they have no meaning outside a CPU context.

Perhaps we should step back and reduce the Tensor interface to a handle with a few "geometrical" properties and no access to the implementation. The implementation would then be defined only on a backend (this will break a lot of things).
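To make the handle idea concrete, here is a minimal C++ sketch. All names (`BackendId`, `StorageHandle`, `TensorView`, `CpuBackend`) are hypothetical and chosen for illustration; they are not the project's current API. The tensor carries only its geometry and an opaque handle, and only the backend that issued the handle can turn it into something usable.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <numeric>
#include <utility>
#include <vector>

// Hypothetical names for illustration only; none of them come from the project.
enum class BackendId { Cpu, Cuda };

// Opaque token: meaningful only to the backend that issued it.
struct StorageHandle {
    BackendId backend;
    std::uint64_t id;
};

// Tensor reduced to "geometrical" properties plus a handle; no raw pointer.
class TensorView {
public:
    TensorView(std::vector<std::size_t> dims, StorageHandle handle)
        : mDims(std::move(dims)), mHandle(handle) {}

    const std::vector<std::size_t>& dims() const { return mDims; }
    std::size_t rank() const { return mDims.size(); }
    // Number of elements: the product of all dimensions (1 for a scalar).
    std::size_t size() const {
        return std::accumulate(mDims.begin(), mDims.end(), std::size_t{1},
                               std::multiplies<std::size_t>());
    }
    StorageHandle handle() const { return mHandle; }

private:
    std::vector<std::size_t> mDims;
    StorageHandle mHandle;
};

// Assumed CPU backend: it owns the buffers and is the only place where a
// handle can be resolved to a host pointer.
class CpuBackend {
public:
    StorageHandle allocate(std::size_t count) {
        mBuffers.emplace_back(count, 0.0f);
        return StorageHandle{BackendId::Cpu,
                             static_cast<std::uint64_t>(mBuffers.size() - 1)};
    }
    float* resolve(StorageHandle h) {
        assert(h.backend == BackendId::Cpu && h.id < mBuffers.size());
        return mBuffers[static_cast<std::size_t>(h.id)].data();
    }

private:
    std::vector<std::vector<float>> mBuffers;
};
```

With this split, the signature that depends on the backend (`resolve` returning `float*` here) lives on the backend, and a GPU backend would be free to expose something other than a host pointer.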

As I understood it, TensorImpl was a way to access, from the CPU, a RAM-available copy of the data.

mentioned in merge request !9 (merged)

changed milestone to %v0.1.0

Having a pointer to data has no meaning outside the CPU (a "CUDA" pointer can be manipulated only through CUDA functions). Each backend may have its own way to represent a storage location. Getting or setting data from or to the CPU thus always implies a transfer (transmitter). Several (non-exclusive) mechanisms might exist:

Explicitly vs. implicitly creating a CPU tensor that is a synchronised copy of the desired data on the hardware target;
Sparse/seldom access vs. bulk/frequent access to the data: the best implementation may depend on the application and on the HW communication layer.
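As a rough illustration of the explicit variant, the C++ sketch below keeps a CPU-side copy that is synchronised only on request. The names (`DeviceStorage`, `FakeDeviceStorage`, `HostMirror`, `pull`, `push`) are assumptions made for this example, not existing project code, and the "device" is simulated with a private host buffer so that the sketch runs anywhere.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical interface for illustration; not the project's actual API.
class DeviceStorage {
public:
    virtual ~DeviceStorage() = default;
    virtual std::size_t size() const = 0;
    // Bulk transfers are the only way in or out of device memory.
    virtual void copyToHost(float* dst, std::size_t count) const = 0;
    virtual void copyFromHost(const float* src, std::size_t count) = 0;
};

// Stand-in for a real device: data lives in a private buffer. A CUDA backend
// would call cudaMemcpy in these two methods instead of std::copy_n.
class FakeDeviceStorage : public DeviceStorage {
public:
    explicit FakeDeviceStorage(std::size_t count) : mData(count, 0.0f) {}
    std::size_t size() const override { return mData.size(); }
    void copyToHost(float* dst, std::size_t count) const override {
        assert(count <= mData.size());
        std::copy_n(mData.begin(), count, dst);
    }
    void copyFromHost(const float* src, std::size_t count) override {
        assert(count <= mData.size());
        std::copy_n(src, count, mData.begin());
    }

private:
    std::vector<float> mData;
};

// Explicit CPU mirror: nothing is transferred until pull() or push() is called.
class HostMirror {
public:
    explicit HostMirror(DeviceStorage& device)
        : mDevice(device), mHost(device.size()) {}
    void pull() { mDevice.copyToHost(mHost.data(), mHost.size()); }    // device -> CPU
    void push() { mDevice.copyFromHost(mHost.data(), mHost.size()); }  // CPU -> device
    std::vector<float>& data() { return mHost; }

private:
    DeviceStorage& mDevice;
    std::vector<float> mHost;
};
```

An implicit variant could call `pull()` lazily on first access, and a sparse-access path could add a single-element getter at the cost of one transfer per call; which is preferable likely depends on the application and the communication layer, as noted above.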

mentioned in issue #42 (closed)

closed
