[release/2.4] Prevent static initialization of at::cuda::warp_size() (Backport #2293) #2308


Closed

Conversation

xinyazhang

Fixes SWDEV-540240, SWDEV-540309, SWDEV-539989

...

Commit 80cca70 created a static global variable whose initializer calls `at::cuda::warp_size()`, which requires a visible GPU in order to query device properties. However, GPUs are not present on CPU-only build systems, so the query fails during static initialization.

Convert the static variable into a static function, so the value is computed on first use rather than during static initialization.
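
For context, the fix follows the standard function-local static idiom: the device query moves from program-load time to first call. A minimal sketch of the before/after pattern, with `kWarpSize` and `cached_warp_size` as illustrative names rather than the identifiers from the actual diff:

```cpp
#include <ATen/cuda/CUDAContext.h>  // declares at::cuda::warp_size()

// Before (problematic): the initializer runs during static initialization,
// before main(), and queries device properties even when no GPU is visible.
//
//   static const int kWarpSize = at::cuda::warp_size();

// After: a function-local static defers the device query to the first call,
// by which point the caller is expected to have a usable GPU context.
// C++11 guarantees this initialization is thread-safe and happens exactly once.
static int cached_warp_size() {
  static const int warp_size = at::cuda::warp_size();
  return warp_size;
}
```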

http://rocm-ci.amd.com/job/pyt_whl_docker_mainline/1461/artifact/build_artifacts.txt/*view*/

Ran a microbenchmark to confirm basic functionality:
```
root@ubb4-rack-22:/var/lib/jenkins/pytorch-micro-benchmarking# python3 micro_benchmarking_pytorch.py --network resnet50
INFO: running forward and backward for warmup.
INFO: running the benchmark..
OK: finished running benchmark..
--------------------SUMMARY--------------------------
Microbenchmark for network : resnet50
Num devices: 1
Dtype: FP32
Mini batch size [img] : 64
Time per mini-batch : 0.10158218145370483
Throughput [img/sec] : 630.0317544289736
```
@xinyazhang changed the title from "[rocm7.0_internal_testing] Prevent static initialization of at::cuda::warp_size() (#2293)" to "[release/2.4] Prevent static initialization of at::cuda::warp_size() (#2293)" on Jul 2, 2025
@xinyazhang changed the title from "[release/2.4] Prevent static initialization of at::cuda::warp_size() (#2293)" to "[release/2.4] Prevent static initialization of at::cuda::warp_size() (Backport #2293)" on Jul 2, 2025
@rocm-repo-management-api

rocm-repo-management-api bot commented Jul 2, 2025

Jenkins build for commit fd2a0432ae459fdabb6d3e5651ff4b918ab947fa finished as FAILURE
Links: Blue Ocean view / Build artifacts

@xinyazhang marked this pull request as ready for review on July 2, 2025 16:13
@xinyazhang marked this pull request as draft on July 2, 2025 19:44
@xinyazhang
Author

Superseded by #2318

@xinyazhang closed this on Jul 7, 2025