
Fix the wrong type and pass real-number value with device_type to devices #1253

Merged · 2 commits · Feb 9, 2023

Conversation

@yhmtsai (Member) commented Jan 5, 2023

I faced some type issues while developing half-precision computation in Ginkgo. I am extracting some fixes here, because the approach I am currently trying for half precision may not be workable in the end.

There are two main fixes in this PR:

  1. the tuple type mismatch between IndexType and ValueType
  2. passing remove_complex<ValueType> values through as_device_type, because on the device we will use __half from the vendor library, not gko::half (see the sketch below)
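
For context, here is a minimal sketch of the kind of host-to-device type mapping as_device_type performs. The names (host_half, device_type_impl) are illustrative stand-ins, not Ginkgo's actual implementation:

```cpp
// Minimal sketch, assuming a gko::half-like 16-bit host type; illustrative
// only, not Ginkgo's actual as_device_type implementation.
#include <cuda_fp16.h>

// Stand-in for gko::half: same 16-bit layout as the vendor's __half.
struct host_half { unsigned short data; };

// Map host types to the types device kernels expect.
template <typename T>
struct device_type_impl {
    using type = T;  // float, double, int, ... pass through unchanged
};

template <>
struct device_type_impl<host_half> {
    using type = __half;  // device kernels use the vendor type
};

template <typename T>
struct device_type_impl<T*> {
    using type = typename device_type_impl<T>::type*;  // map through pointers
};

template <typename T>
using device_type = typename device_type_impl<T>::type;

// Reinterpret a host value (scalar or pointer) as its device counterpart.
// This PR routes real-valued (remove_complex<ValueType>) scalars, such as
// thresholds, through this mapping as well.
template <typename T>
device_type<T> as_device_type(T value)
{
    return reinterpret_cast<device_type<T>&>(value);
}
```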

@yhmtsai self-assigned this Jan 5, 2023
@ginkgo-bot added labels Jan 5, 2023: mod:cuda (CUDA module), mod:hip (HIP module), type:factorization (Factorizations), type:matrix-format (Matrix formats), type:multigrid (multigrid)
@yhmtsai (Member, Author) commented Jan 7, 2023

rebase!

@thoasm (Member) left a review:

LGTM!
I would like to see the PR description updated to detail why you now need to change the type between devices for remove_complex<T> floating-point types.

```diff
-    old_row_ptrs, as_cuda_type(old_vals), num_rows, threshold,
-    new_row_ptrs, lower);
+    old_row_ptrs, as_cuda_type(old_vals), num_rows,
+    as_cuda_type(threshold), new_row_ptrs, lower);
```
A Member commented on the diff:

Do we actually need that? As far as I can see, gko::half should be the same on CPU vs. GPU.
Are you changing the type so that gko::half is a different type on CPU and GPU?
Looks like that is your plan in #1257.

@yhmtsai (Member, Author) replied:

Yes, I will use __half on the GPU but gko::half on CPU.

@yhmtsai added the 1:ST:ready-for-review label Feb 8, 2023
@yhmtsai requested review from a team and @thoasm, February 8, 2023 10:34
@upsj (Member) left a review:

LGTM!

Resolved review thread on common/cuda_hip/matrix/fbcsr_kernels.hpp.inc
@pratikvn (Member) left a review:

There are a few extraneous as_device_type calls in the following kernels:

  1. csr_kernels.cu: lines 302, 304, 315, 317, and maybe a few other places in this file
  2. cb_gmres_kernels.cu: lines 351, 414

Probably the same in the HIP kernels as well.

I think these places below are also missing an as_device_type:

  1. idr_kernels.cu: line 102

Do cuBLAS, cuRAND, and cuSPARSE have half-precision support? We don't seem to currently use as_device_type when passing pointers to those wrappers.

Otherwise LGTM!
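
To illustrate the distinction raised in this review, here is a minimal CUDA sketch (assumed names, not Ginkgo's code): integer and pointer arguments map to themselves, so wrapping them in as_device_type is a no-op, while a half-precision scalar stored on the host must be reinterpreted as __half before the kernel launch:

```cpp
// Illustrative sketch only; host_half stands in for gko::half.
#include <cuda_fp16.h>

__global__ void count_above(const float* vals, int num, __half threshold,
                            int* count)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < num && vals[i] > __half2float(threshold)) {
        atomicAdd(count, 1);
    }
}

// Host-side stand-in for gko::half with the same 16-bit layout as __half.
struct host_half { unsigned short data; };

void launch_count_above(const float* d_vals, int num, host_half threshold,
                        int* d_count)
{
    // d_vals, num, and d_count need no conversion: their device types are
    // identical to their host types (the "extraneous" case above).
    // threshold does need one: reinterpret the 16-bit host value as __half
    // (the "missing" case above).
    __half dev_threshold = *reinterpret_cast<__half*>(&threshold);
    count_above<<<(num + 255) / 256, 256>>>(d_vals, num, dev_threshold,
                                            d_count);
}
```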

Two resolved review threads (outdated) on common/unified/multigrid/pgm_kernels.cpp
@yhmtsai (Member, Author) commented Feb 8, 2023

curandGenerateNormal from CUDA does not support half. Some cuBLAS and cuSPARSE routines might have half support; those will use as_culib_type.

@thoasm (Member) left a review:

LGTM!

@yhmtsai added the 1:ST:ready-to-merge label and removed the 1:ST:ready-for-review label Feb 9, 2023
@yhmtsai (Member, Author) commented Feb 9, 2023

@pratikvn regarding the IDR cuRAND call: I do not convert it to the device type for now, because we only use float and double there, and for complex values we generate 2x real values (currently relying on the std::complex layout). A sketch of that pattern follows.
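
A minimal sketch of that pattern, under the assumption that std::complex<float> is layout-compatible with float[2]; the function name is hypothetical, not Ginkgo's IDR kernel code:

```cpp
// Fill n complex values by generating 2*n normal floats into the same
// device buffer; cuRAND has no native complex (or half) generators.
#include <cstddef>
#include <complex>
#include <curand.h>

void fill_complex_normal(curandGenerator_t gen, std::complex<float>* d_out,
                         std::size_t n)
{
    // std::complex<float> stores {real, imag} contiguously, so one call
    // fills both parts; 2 * n is even, as curandGenerateNormal requires.
    curandGenerateNormal(gen, reinterpret_cast<float*>(d_out), 2 * n,
                         /* mean = */ 0.0f, /* stddev = */ 1.0f);
}
```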

@yhmtsai changed the title from "fix some type issue" to "Fix the wrong type and pass real-number value with device_type to devices" Feb 9, 2023
Co-authored-by: Pratik Nayak <pratikvn@protonmail.com>
@sonarcloud sonarcloud bot commented Feb 10, 2023

SonarCloud Quality Gate passed!

  - Bugs: 0 (rating A)
  - Vulnerabilities: 0 (rating A)
  - Security Hotspots: 0 (rating A)
  - Code Smells: 0 (rating A)
  - Coverage: no coverage information
  - Duplication: 10.7%
