Enable cuda #14

angry-crab · 2023-01-24T04:48:38Z

BTW, I notice that USE_LLVM is set to OFF in config.cmake.patch. Is that intended?

Signed-off-by: Xinyu Wang <xinyu.wang@tier4.jp>

ambroise-arm · 2023-01-24T09:01:58Z

BTW, I notice that USE_LLVM is set to OFF in config.cmake.patch. Is that intended?

I'm not sure why Josh removed it in ee82c84. At the same time this patch is for the ROS1 version of the package, which to my knowledge is only used in a test non-default config of a single package of Autoware.ai. And no one is complaining, so probably not used.

Resolves autowarefoundation/modelzoo#85

Who/what is going to compile this package with the CUDA enablement? The CI? The user?

angry-crab · 2023-01-24T09:23:00Z

Who/what is going to compile this package with the CUDA enablement? The CI? The user?

The user, since this package provides the runtime lib.
I made it draft, and we don't want to merge this pr until all models could be compiled for cuda backend.

ambroise-arm · 2023-01-24T13:04:25Z

The user, since this package provides the runtime lib.

Then I am confused on how it would all come together. Does your comment here autowarefoundation/modelzoo#85 (comment) about not having tvm_vendor in universe still stand? Would the user be expected to check out this repository manually?

I made it draft, and we don't want to merge this pr until all models could be compiled for cuda backend.

I don't see any problem with merging this PR now. Compiling the models for CUDA is independent of the runtime functionality provided by this package. And if the CUDA runtime doesn't work on some models, it will most likely be TVM that has to be fixed, not tvm_vendor.

angry-crab · 2023-01-24T14:58:10Z

Then I am confused on how it would all come together.

Sorry for the confusion. I think the way how it works currently is that tvm_vendor exists in the docker image anyway because it is released on rosdistro. When we build a docker image, ROS would try to install it.

Does your comment here autowarefoundation/modelzoo#85 (comment) about not having tvm_vendor in universe still stand?

Yes, my comment still stands. After merging this package, we will make a release to rosdistro. Then the package would be available after image built.

Would the user be expected to check out this repository manually?

No, user does not need to checkout this repo. ROS would do it during setup. But if anyone wants to use it before the rosdistro release, he/she has to build this repo manually.

ambroise-arm · 2023-01-24T15:53:50Z

tvm_vendor exists in the docker image anyway because it is released on rosdistro

Precisely. I am not sure how the release process works, but I assume that it is automated, and that this process uses an environment that doesn't have cuda available.
But even if it did, there is still the problem of the portability of a cuda-enabled tvm_vendor. Here autowarefoundation/modelzoo#85 (comment) I don't think that you tested the problematic case I am thinking of. I've just tried the following:

Use a cuda docker image to build tvm_runtime with cuda enabled (to replicate a release of tvm_vendor with cuda enabed)
Copy the resulting install directory to an non-cuda autoware image (to replicate the distribution of that package to a non-cuda autoware environment)
Try to run a model with non-cuda backend
It fails: error while loading shared libraries: libcudart.so.11.0: cannot open shared object file: No such file or directory

Because of the tvm runtime shared object:

$ ldd install/tvm_vendor/lib/libtvm_runtime.so 
	linux-vdso.so.1 (0x00007ffd7cf2a000)
	libcudart.so.11.0 => not found
	libcuda.so.1 => not found
	libOpenCL.so.1 => /lib/x86_64-linux-gnu/libOpenCL.so.1 (0x00007f3d0d58f000)
	libvulkan.so.1 => /lib/x86_64-linux-gnu/libvulkan.so.1 (0x00007f3d0d521000)
	libopenblas.so.0 => /lib/x86_64-linux-gnu/libopenblas.so.0 (0x00007f3d0b0d0000)
	libstdc++.so.6 => /lib/x86_64-linux-gnu/libstdc++.so.6 (0x00007f3d0aea6000)
	libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007f3d0adbf000)
	libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00007f3d0ad9f000)
	libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f3d0ab77000)
	/lib64/ld-linux-x86-64.so.2 (0x00007f3d0d845000)
	libgfortran.so.5 => /lib/x86_64-linux-gnu/libgfortran.so.5 (0x00007f3d0a89c000)
	libquadmath.so.0 => /lib/x86_64-linux-gnu/libquadmath.so.0 (0x00007f3d0a852000)

angry-crab · 2023-01-25T04:46:27Z

Copy the resulting install directory to an non-cuda autoware image (to replicate the distribution of that package to a non-cuda autoware environment)

I think the problem occurs here because the runtime was compiled with cuda at the first place. But if you check the Cmakelist of tvm_vendor, it will clone and compile tvm runtime in another environment. I believe rosdistro is not similar to debian packages such that the you may compile the source code in your local machine. But debian packages are compiled libs.

ambroise-arm · 2023-01-30T15:44:59Z

I believe rosdistro is not similar to debian packages such that the you may compile the source code in your local machine.

I am not familiar with this. But if you are confident that it handles my concerns, then great.
Regardless, this PR can be merged now, worse case it won't change anything from the current state of things.

angry-crab · 2023-01-31T05:30:23Z

@wep21 Do you have any comment/concern about this pr? Thanks.

wep21 · 2023-01-31T08:40:04Z

@angry-crab I have no concern about this pr, but will you add this package into autoware repos because ros buildfarm environment does not have cuda?

angry-crab · 2023-01-31T08:53:05Z

ros buildfarm environment does not have cuda

I see. That will be a problem. But we probably don't want to add this package to autoware repo because it takes time to compile. I guess it means the user needs to clone and rebuild this package if he/she wants to use cuda.

I thought that when we call rosdistro install, ros would clone this package and compile it locally. Does it mean the user have to clone and rebuild the package anyway if he/she wants to use cuda?

wep21 · 2023-02-01T06:30:16Z

Does it mean the user have to clone and rebuild the package anyway if he/she wants to use cuda?

Yes, I think so.

angry-crab · 2023-02-02T04:10:44Z

Yes, I think so.

I guess then we don't have to make a rosdistro release at this time.

Xinyu Wang added 2 commits January 18, 2023 13:37

add cuda patch

6f4d3ed

Signed-off-by: Xinyu Wang <xinyu.wang@tier4.jp>

fix patch

5157c76

Signed-off-by: Xinyu Wang <xinyu.wang@tier4.jp>

angry-crab requested a review from ambroise-arm January 24, 2023 04:48

angry-crab marked this pull request as draft January 24, 2023 09:25

angry-crab marked this pull request as ready for review January 31, 2023 05:29

angry-crab requested a review from wep21 January 31, 2023 05:30

ambroise-arm mentioned this pull request Mar 20, 2023

Generate Debian packages for the ROS packages in Autoware.core and Autoware.universe autowarefoundation/autoware#3222

Open

31 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable cuda #14

Enable cuda #14

angry-crab commented Jan 24, 2023

ambroise-arm commented Jan 24, 2023

angry-crab commented Jan 24, 2023 •

edited

Loading

ambroise-arm commented Jan 24, 2023

angry-crab commented Jan 24, 2023 •

edited

Loading

ambroise-arm commented Jan 24, 2023

angry-crab commented Jan 25, 2023

ambroise-arm commented Jan 30, 2023

angry-crab commented Jan 31, 2023

wep21 commented Jan 31, 2023

angry-crab commented Jan 31, 2023 •

edited

Loading

wep21 commented Feb 1, 2023 •

edited

Loading

angry-crab commented Feb 2, 2023

Enable cuda #14

Are you sure you want to change the base?

Enable cuda #14

Conversation

angry-crab commented Jan 24, 2023

ambroise-arm commented Jan 24, 2023

angry-crab commented Jan 24, 2023 • edited Loading

ambroise-arm commented Jan 24, 2023

angry-crab commented Jan 24, 2023 • edited Loading

ambroise-arm commented Jan 24, 2023

angry-crab commented Jan 25, 2023

ambroise-arm commented Jan 30, 2023

angry-crab commented Jan 31, 2023

wep21 commented Jan 31, 2023

angry-crab commented Jan 31, 2023 • edited Loading

wep21 commented Feb 1, 2023 • edited Loading

angry-crab commented Feb 2, 2023

angry-crab commented Jan 24, 2023 •

edited

Loading

angry-crab commented Jan 24, 2023 •

edited

Loading

angry-crab commented Jan 31, 2023 •

edited

Loading

wep21 commented Feb 1, 2023 •

edited

Loading