Tensor/TensorImpl efficiency

I fully agree with your view. I think that was the idea from the beginning, since, for example, the dimensions are stored in Tensor and not in TensorImpl. From my point of view, the other memory layout information (strides, etc.) should be stored in Tensor as well.

I still think that Tensor and TensorImpl (with the current naming) must be allowed to have different sizes, with a Tensor's "geometry" always contained inside that of its TensorImpl.
To achieve the efficiency discussed here, one solution would be to replace the pointer to TensorImpl with a plain object (aggregation) that contains all the stored data layout information and a pointer to the backend implementation.

(sorry for the botched mermaid syntax)
Tensor o-- TensorStorage --> TensorBackend

(Tensor contains a TensorStorage that has a unique_ptr to a TensorBackend)
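
To make the proposal concrete, here is a minimal C++ sketch of that aggregation. It is not the actual code of the project: CpuBackend, rawPtr(), contiguousStrides() and the fixed float element type are assumptions made up for illustration, and only the Tensor / TensorStorage / TensorBackend relationship comes from the description above.

```cpp
#include <cstddef>
#include <functional>
#include <memory>
#include <numeric>
#include <utility>
#include <vector>

// Hypothetical backend interface: it only owns the raw buffer.
class TensorBackend {
public:
    virtual ~TensorBackend() = default;
    virtual void* rawPtr() = 0;
};

// Hypothetical CPU backend, used here only to have something concrete.
class CpuBackend : public TensorBackend {
public:
    explicit CpuBackend(std::size_t nbBytes) : mData(nbBytes) {}
    void* rawPtr() override { return mData.data(); }
private:
    std::vector<unsigned char> mData;
};

// Plain object: layout information plus a unique_ptr to the backend.
struct TensorStorage {
    std::vector<std::size_t> dims;
    std::vector<std::size_t> strides;
    std::unique_ptr<TensorBackend> backend;
};

class Tensor {
public:
    // Assumption: elements are float and the layout is contiguous row-major.
    explicit Tensor(std::vector<std::size_t> dims) {
        mStorage.strides = contiguousStrides(dims);
        mStorage.dims = std::move(dims);
        mStorage.backend = std::make_unique<CpuBackend>(size() * sizeof(float));
    }

    // Layout queries read the aggregated object directly,
    // without going through the backend pointer.
    const std::vector<std::size_t>& dims() const { return mStorage.dims; }
    const std::vector<std::size_t>& strides() const { return mStorage.strides; }

    // Number of elements: product of the dimensions (1 for a scalar).
    std::size_t size() const {
        return std::accumulate(mStorage.dims.begin(), mStorage.dims.end(),
                               std::size_t{1}, std::multiplies<std::size_t>());
    }

    // Only data access dereferences the backend pointer.
    void* rawPtr() { return mStorage.backend->rawPtr(); }

private:
    // Row-major strides, expressed in number of elements.
    static std::vector<std::size_t> contiguousStrides(const std::vector<std::size_t>& dims) {
        std::vector<std::size_t> strides(dims.size(), 1);
        for (std::size_t i = dims.size(); i > 1; --i) {
            strides[i - 2] = strides[i - 1] * dims[i - 1];
        }
        return strides;
    }

    TensorStorage mStorage;  // aggregated by value, not behind a pointer
};
```

With this layout, calls such as dims() or strides() cost no extra indirection, and the virtual call is paid only when the data itself is accessed.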

@olivierbichler what do you think of this?

As you'll see in !13 (closed) and #42 (closed), so far I cannot separate TensorStorage from TensorBackend, as the backend part also needs the data in the TensorStorage part (at least the data type and the number of elements).
As this does not seem critical at this point, I kept data access through the TensorImpl pointer (I think the properties concerned are seldom used, and not in compute-intensive contexts).

mentioned in merge request !13 (closed)

mentioned in issue #42 (closed)

This issue now seems obsolete given the updated design of the Tensor and TensorImpl classes, and it no longer addresses a concrete problem.

closed
