
Refactor Storage with templates #469

Draft · wants to merge 9 commits into dev-master
Conversation

IvanaGyro
Collaborator

This PR refactors Storage with templates to reduce duplicated code and avoid low-level memory management.

Too many files include `utils_internal_interface.hpp` without directly using the symbols it declares. As a result, the build system recompiles almost the whole project whenever any file included by `utils_internal_interface.hpp` is modified. Following the [Include What You Use](https://google.github.io/styleguide/cppguide.html#Include_What_You_Use) rule reduces both the chance of circular includes and the build time.

- Follow the "Include What You Use" rule.
- `to_(device)` will be removed.
- Fix typo in tests/gpu/BlockUniTensor_test.cpp
@kaihsin
Member

kaihsin commented Sep 14, 2024

Make sure you stage the progress into multiple PRs instead of making one big PR with all the modifications. Other than that, LGTM so far.

@kaihsin
Member

kaihsin commented Sep 14, 2024

@IvanaGyro Let me know when the refactor reaches a ready point. I want to start moving Storage + Tensor into another repo, cytnx-core.

@IvanaGyro
Collaborator Author

IvanaGyro commented Sep 14, 2024

I suggest deciding whether to move Storage and Tensor after the refactoring process. Currently, only three files (excluding CUDA kernels) remain for Storage. Furthermore, we can combine Tensor with Tensor_Impl and merge Storage with Storage_base, provided we maintain the current API and ensure high memory efficiency. If we also consider replacing Storage with thrust::vector, there will only be two classes beneath Tensor. If there are only a few files below the Tensor level, keeping those files in the same repository could simplify maintenance.

@kaihsin
Member

kaihsin commented Sep 14, 2024 via email

@IvanaGyro
Collaborator Author

IvanaGyro commented Sep 15, 2024

It's fine to leave Tensor and Storage as abstract classes apart from their implementations.

I wonder what the reasons are for splitting the repo. What are the plans for maintaining the dependencies among the linear algebra components, the CUDA kernels, and the other high-level components?

@kaihsin
Member

kaihsin commented Sep 15, 2024

> It's fine to leave Tensor and Storage as abstract classes apart from their implementation.
>
> I wonder what are the reasons for splitting the repo? What are the plans for maintaining the dependencies in the linear algebra components, CUDA kernel, and the other high-level components?

The plan is to separate UniTensor from the underlying Tensor, so other people can use Tensor on its own and bridge torch.Tensor, xtensor, JAX, and NumPy directly.

@kaihsin
Member

kaihsin commented Sep 15, 2024

All GPU, CUDA kernels for Tensor (and below) should go to cytnx-core

@kaihsin
Member

kaihsin commented Sep 15, 2024

> It's fine to leave Tensor and Storage as abstract classes apart from their implementation.
>
> I wonder what are the reasons for splitting the repo? What are the plans for maintaining the dependencies in the linear algebra components, CUDA kernel, and the other high-level components?

It's more like a wrapper than an abstract class. Storage_base/Tensor_impl were supposed to be the abstract classes, and Tensor/Storage are the wrappers for the C++ API.

@yingjerkao
Collaborator

> It's fine to leave Tensor and Storage as abstract classes apart from their implementation.
> I wonder what are the reasons for splitting the repo? What are the plans for maintaining the dependencies in the linear algebra components, CUDA kernel, and the other high-level components?

> Its more like a wrapper than the abstract class. Storage_base/ Tensor_impl was supposed to be the abstract class, and Tensor/Storage is the wrapper for C++ API

In light of this, I wonder if there is a class diagram explaining all of this somewhere? It should be included in the developer's manual.

@kaihsin
Member

kaihsin commented Sep 15, 2024

I would love to chat with @IvanaGyro and the people involved in this. It would be nice to have a conversation about why it was designed that way, what the concerns were, and whether there are better ways to fulfill them.

A few conversations among the issues make me think we are not on the same page, and I think it is also hard for @IvanaGyro to work on things while lacking information.

@yingjerkao
Collaborator

Still, it would be nice to have some documentation. At some point I understood the design, but I don't recall all the details.

@kaihsin
Member

kaihsin commented Sep 15, 2024

I suggest we have a meeting, and maybe someone can take notes?

@ianmccul

ianmccul commented Sep 15, 2024

dtype and device would be better implemented using std::variant. That way, the enumeration of all possible types and devices only needs to happen once. That would turn many runtime errors (such as a missing function for a specific dtype or device) into compile-time errors, and make the code much easier to maintain. Single or multiple dispatch is really easy with std::variant and visitors. With the current design, adding a new dtype would be a major hassle, with no way to check that all of the required functions are supplied without runtime testing.

Also, for the C++ code, is there a reason for restricting the types of a Tensor to a specific set? For Python it obviously must be a bounded set, but C++ code could construct a Tensor of any type, e.g. if I wanted to experiment with some Tensor<float128> or Tensor<BigFloat>.

