Attaching pipeline layout to hal.interface.binding.subspan & co. #18098

benvanik · 2024-08-02T22:12:27Z

This allows for the whole layout to be known locally when lowering out of HAL and into target-specific binding data structures. This information was (and is still) available on the exports but annoying to get to and not present in all tests. This allowed removing the descriptor type from the subspan op and will allow for us to have non-i32 push constant types in the future. Verifiers were added for both push constant and descriptor set/binding ordinals now that the information is cheap to verify.

Progress on #17875 (this is needed for lowering non-0 ordinal descriptor sets to CPU/CUDA/ROCM targets).

This allows for the whole layout to be known locally when lowering out of HAL and into target-specific binding data structures. This information was (and is still) available on the exports but annoying to get to and not present in all tests.

benvanik · 2024-08-02T22:27:56Z

I split the codegen changes out from the actual dialect changes. ~~The current failures are WGSL-related due to push constant emulation adding a descriptor set without updating the pipeline layout - I'll see if I can fix that.~~ WGSL fixed!

This isn't fantastic as it doesn't update other attrs in the program but it was always doing magic like this so it's probably ok until we want to rework WebGPU support.

ScottTodd · 2024-08-05T16:33:20Z

runtime/src/iree/hal/cts/testdata/command_buffer_push_constants_test.mlir

-      %input_0 = hal.interface.constant.load[0] : i32
-      %input_1 = hal.interface.constant.load[1] : i32
-      %input_2 = hal.interface.constant.load[2] : i32
-      %input_3 = hal.interface.constant.load[3] : i32
+      %input_0 = hal.interface.constant.load layout(#pipeline_layout) ordinal(0) : i32
+      %input_1 = hal.interface.constant.load layout(#pipeline_layout) ordinal(1) : i32
+      %input_2 = hal.interface.constant.load layout(#pipeline_layout) ordinal(2) : i32
+      %input_3 = hal.interface.constant.load layout(#pipeline_layout) ordinal(3) : i32


Nice having these explicit. Needing to look them up by walking the IR to find an export op was annoying :P

Can we ever end up using multiple pipeline layouts at this point? Maybe we could have a verifier that enforces only one?

#pipeline_layout_0 = #hal.pipeline.layout<push_constants = 4, sets = [ #hal.descriptor_set.layout<0, bindings = [ #hal.descriptor_set.binding<0, storage_buffer> ]> ]> #pipeline_layout_1 = #hal.pipeline.layout<push_constants = 2, sets = [ #hal.descriptor_set.layout<0, bindings = [ #hal.descriptor_set.binding<0, storage_buffer> ]> ]> hal.executable.source public @executable { hal.executable.export public @write_push_constants ordinal(0) layout(#pipeline_layout) attributes {workgroup_size = [1 : index, 1 : index, 1 : index]} { ^bb0(%arg0: !hal.device): %c1 = arith.constant 1 : index hal.return %c1, %c1, %c1 : index, index, index } builtin.module { func.func @write_push_constants() { %input_0 = hal.interface.constant.load layout(#pipeline_layout_0) ordinal(0) : i32 %input_1 = hal.interface.constant.load layout(#pipeline_layout_1) ordinal(0) : i32

It's possible so long as they are compatible - this could let us have multiple exported entry points that share some subset of buffers but have differing push constant counts, etc (so two top level functions that fetch push constants/buffers, then a common function that can still reference the base push constants/buffers). We could have a verifier on the executable that maybe poked in and looked for them, but it's likely going to need to be a dedicated pass given how much of the IR tree it pulls together. We'd also then have to codify what pipeline compatibility is and I'm not yet sure we know if we even want to allow that so I'm punting on it for now :P

Yeah I figured the analysis would be complex enough that it would need to be a full pass and not a local verifier.

Following runtime changes in IREE - Attaching pipeline layout to hal.interface.binding.subspan. iree-org/iree#18098 - Adding flag placeholders to semaphores/events. iree-org/iree#18122

benvanik added the compiler/dialects Relating to the IREE compiler dialects (flow, hal, vm) label Aug 2, 2024

benvanik requested a review from ScottTodd August 2, 2024 22:12

benvanik force-pushed the users/benvanik/binding-layout branch from 28e7acf to bdb22a5 Compare August 2, 2024 22:26

benvanik added 2 commits August 2, 2024 15:31

Fixing codegen tests using hal.interface.binding.subspan & co.

6f8988b

Fixing WGSLReplacePushConstants to update pipeline layouts.

f2e87ce

This isn't fantastic as it doesn't update other attrs in the program but it was always doing magic like this so it's probably ok until we want to rework WebGPU support.

benvanik force-pushed the users/benvanik/binding-layout branch from bdb22a5 to f2e87ce Compare August 3, 2024 15:24

benvanik marked this pull request as ready for review August 3, 2024 15:24

benvanik requested review from hanhanW, MaheshRavishankar, IanWood1, antiagainst, kuhar, qedawkins and Groverkss as code owners August 3, 2024 15:24

benvanik removed request for antiagainst, MaheshRavishankar, kuhar, hanhanW, qedawkins, Groverkss and IanWood1 August 3, 2024 15:25

Adding support for non-zero descriptor sets in LLVMCPU/VMVX.

06c6642

benvanik force-pushed the users/benvanik/binding-layout branch from b78e194 to 06c6642 Compare August 4, 2024 00:57

ScottTodd approved these changes Aug 5, 2024

View reviewed changes

benvanik merged commit 2193406 into main Aug 5, 2024
45 checks passed

benvanik deleted the users/benvanik/binding-layout branch August 5, 2024 16:49

yzhang93 mentioned this pull request Aug 9, 2024

Bump iree nod-ai/iree-amd-aie#660

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Attaching pipeline layout to hal.interface.binding.subspan & co. #18098

Attaching pipeline layout to hal.interface.binding.subspan & co. #18098

benvanik commented Aug 2, 2024 •

edited

Loading

benvanik commented Aug 2, 2024 •

edited

Loading

ScottTodd Aug 5, 2024

benvanik Aug 5, 2024

ScottTodd Aug 5, 2024

Attaching pipeline layout to hal.interface.binding.subspan & co. #18098

Attaching pipeline layout to hal.interface.binding.subspan & co. #18098

Conversation

benvanik commented Aug 2, 2024 • edited Loading

benvanik commented Aug 2, 2024 • edited Loading

ScottTodd Aug 5, 2024

Choose a reason for hiding this comment

benvanik Aug 5, 2024

Choose a reason for hiding this comment

ScottTodd Aug 5, 2024

Choose a reason for hiding this comment

benvanik commented Aug 2, 2024 •

edited

Loading

benvanik commented Aug 2, 2024 •

edited

Loading