Perf. improvement for `CuStateVecCircuitSimulator::observe` #1002

1tnguyen · 2023-12-05T05:59:47Z

Description

Currently, we don't use the batched Pauli expectation API for multi-term spin_op expectation calculation.

This API is substantially faster than applying change-of-basis gates then observing <ZZZ..ZZ> for each term.

Hence, this PR modifies custatevec backend to use batched Pauli expectation API by default for deterministic cudaq::observe (no shots)

(1) Change CircuitSimulator::observe to return observe_result, which can encapsulate the final expectation value as well as individual terms' expectation values.

(2) Use custatevec batched API and populate the observe_result accordingly.

(3) Add a backward compatibility test for the term-by-term mode.

See also: #1477, #1430

This mode is only activated with env. variable CUDAQ_OBSERVE_FROM_SAMPLING=OFF/0/FALSE. Currently, we didn't use the batched Pauli APIs and skipped it entirely for fp-32. Modify it to use batched Pauli expectation APIs for perf.

github-actions · 2023-12-05T06:54:08Z

CUDA Quantum Docs Bot: A preview of the documentation can be found here.

schweitzpgi

LGTM.

runtime/nvqir/custatevec/CuStateVecCircuitSimulator.cu

bmhowe23

I suspect we need some docs for the CUDAQ_OBSERVE_FROM_SAMPLING environment variable.

amccaskey · 2024-01-26T14:10:01Z

@1tnguyen seems like we should also update CircuitSimulator::shouldObserveFromSampling() to return false by default. What do you think? Should this new approach be the default? Is it always faster?

1tnguyen · 2024-01-26T20:49:13Z

@1tnguyen seems like we should also update CircuitSimulator::shouldObserveFromSampling() to return false by default. What do you think? Should this new approach be the default? Is it always faster?

Yes, it's always faster: a lot faster for large Hamiltonians since the time to run change-of-basis gates and then uncompute them is not insignificant.

There is an issue with making it the default. Currently, we expect that cudaq::observe does return term-by-term results like in this test.
Hence, we need to make an additional change to make CircuitSimulator::observe able to return term-by-term results if it can (like in this case, we perform batch exp-val computation and do have all the individual results). Then, we can make CuStateVecCircuitSimulator::shouldObserveFromSampling return true by default.

What do you think?

amccaskey · 2024-01-27T12:48:17Z

@1tnguyen seems like we should also update CircuitSimulator::shouldObserveFromSampling() to return false by default. What do you think? Should this new approach be the default? Is it always faster?

Yes, it's always faster: a lot faster for large Hamiltonians since the time to run change-of-basis gates and then uncompute them is not insignificant.

There is an issue with making it the default. Currently, we expect that cudaq::observe does return term-by-term results like in this test. Hence, we need to make an additional change to make CircuitSimulator::observe able to return term-by-term results if it can (like in this case, we perform batch exp-val computation and do have all the individual results). Then, we can make CuStateVecCircuitSimulator::shouldObserveFromSampling return true by default.

What do you think?

I say go for it - make that change here in this PR.

github-actions · 2024-01-30T03:14:36Z

CUDA Quantum Docs Bot: A preview of the documentation can be found here.

This allows a fast batched observation while still allowing per-term data to be progagated up the stack.

- Use Shots > 0 condition to know if sampling is required. - Adjust base shouldObserveFromSampling to set default if needed. Plus, the environment variable can be used to both turn the feature on or off. e.g., for benchmarking purposes.

github-actions · 2024-03-18T03:53:15Z

CUDA Quantum Docs Bot: A preview of the documentation can be found here.

github-actions · 2024-03-21T21:15:14Z

CUDA Quantum Docs Bot: A preview of the documentation can be found here.

1tnguyen · 2024-03-21T21:28:10Z

This PR is ready for (re-)review. Ping reviewers... :-)

github-actions · 2024-03-21T22:04:08Z

CUDA Quantum Docs Bot: A preview of the documentation can be found here.

… unsigned

bmhowe23

This will be a very nice optimization. Are all the interface changes invisible to the end user?

unittests/CMakeLists.txt

unittests/backends/qpp_observe/QppObserveTester.cpp

bmhowe23

👍

github-actions · 2024-03-21T23:33:24Z

CUDA Quantum Docs Bot: A preview of the documentation can be found here.

Perf. improvement for CuStateVecCircuitSimulator::observe

eebaed2

This mode is only activated with env. variable CUDAQ_OBSERVE_FROM_SAMPLING=OFF/0/FALSE. Currently, we didn't use the batched Pauli APIs and skipped it entirely for fp-32. Modify it to use batched Pauli expectation APIs for perf.

1tnguyen requested review from amccaskey and bmhowe23 December 5, 2023 05:59

Merge branch 'main' into tnguyen/cusv-perf

06447fe

github-actions bot pushed a commit that referenced this pull request Dec 5, 2023

Docs preview for PR #1002.

c8ac7da

schweitzpgi reviewed Dec 5, 2023

View reviewed changes

runtime/nvqir/custatevec/CuStateVecCircuitSimulator.cu Show resolved Hide resolved

bmhowe23 reviewed Dec 27, 2023

View reviewed changes

Merge branch 'main' into tnguyen/cusv-perf

15786da

github-actions bot pushed a commit that referenced this pull request Jan 30, 2024

Docs preview for PR #1002.

bf60466

1tnguyen added 4 commits March 15, 2024 20:00

Merge branch 'main' into tnguyen/cusv-perf

1bf74e4

CircuitSimulator::observer to return cudaq::observe_result

debd992

This allows a fast batched observation while still allowing per-term data to be progagated up the stack.

Remove debug code

5dda027

github-actions bot pushed a commit that referenced this pull request Mar 18, 2024

Docs preview for PR #1002.

447477b

1tnguyen and others added 2 commits March 22, 2024 07:41

Merge branch 'main' into tnguyen/cusv-perf

6cc188a

Change the tests to provide coverage for the not-default mode

579d8e6

github-actions bot pushed a commit that referenced this pull request Mar 21, 2024

Docs preview for PR #1002.

ee1b2f8

Rename the test

8ea57f3

github-actions bot pushed a commit that referenced this pull request Mar 21, 2024

Docs preview for PR #1002.

098f4c9

Fixed a regression: we need to handle -1 explicitly since shots is an…

7b0e1c1

… unsigned

bmhowe23 reviewed Mar 21, 2024

View reviewed changes

unittests/CMakeLists.txt Show resolved Hide resolved

unittests/backends/qpp_observe/QppObserveTester.cpp Show resolved Hide resolved

bmhowe23 approved these changes Mar 21, 2024

View reviewed changes

github-actions bot pushed a commit that referenced this pull request Mar 21, 2024

Docs preview for PR #1002.

2ed2356

1tnguyen merged commit c1f0af9 into NVIDIA:main Mar 21, 2024
133 checks passed

github-actions bot locked and limited conversation to collaborators Mar 21, 2024

bmhowe23 added the performance label Mar 22, 2024

bettinaheim added this to the release 0.7.1 milestone Apr 17, 2024

bettinaheim added the enhancement New feature or request label Apr 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Perf. improvement for `CuStateVecCircuitSimulator::observe` #1002

Perf. improvement for `CuStateVecCircuitSimulator::observe` #1002

1tnguyen commented Dec 5, 2023 •

edited by bettinaheim

Loading

github-actions bot commented Dec 5, 2023

schweitzpgi left a comment

bmhowe23 left a comment

amccaskey commented Jan 26, 2024

1tnguyen commented Jan 26, 2024

amccaskey commented Jan 27, 2024

github-actions bot commented Jan 30, 2024

github-actions bot commented Mar 18, 2024

github-actions bot commented Mar 21, 2024

1tnguyen commented Mar 21, 2024

github-actions bot commented Mar 21, 2024

bmhowe23 left a comment

bmhowe23 left a comment

github-actions bot commented Mar 21, 2024

Perf. improvement for CuStateVecCircuitSimulator::observe #1002

Perf. improvement for CuStateVecCircuitSimulator::observe #1002

Conversation

1tnguyen commented Dec 5, 2023 • edited by bettinaheim Loading

Description

github-actions bot commented Dec 5, 2023

schweitzpgi left a comment

Choose a reason for hiding this comment

bmhowe23 left a comment

Choose a reason for hiding this comment

amccaskey commented Jan 26, 2024

1tnguyen commented Jan 26, 2024

amccaskey commented Jan 27, 2024

github-actions bot commented Jan 30, 2024

github-actions bot commented Mar 18, 2024

github-actions bot commented Mar 21, 2024

1tnguyen commented Mar 21, 2024

github-actions bot commented Mar 21, 2024

bmhowe23 left a comment

Choose a reason for hiding this comment

bmhowe23 left a comment

Choose a reason for hiding this comment

github-actions bot commented Mar 21, 2024

Perf. improvement for `CuStateVecCircuitSimulator::observe` #1002

Perf. improvement for `CuStateVecCircuitSimulator::observe` #1002

1tnguyen commented Dec 5, 2023 •

edited by bettinaheim

Loading