Batching of circuits to overcome memory issues when using statevector simulator #209

rsln-s · 2021-09-09T20:49:00Z

Summary

Currently batch_size parameter is ignored if the statevector simulator is used. This leads to unreasonable memory use due to the size of the transpiled circuits, up to 1TB of RAM for 800 by 800 kernel matrix and 20 qubits (see qiskit-terra, issue #6991). This pull request fixes this by transpiling and simulating circuits in batches, never storing the entire 800 circuits. The modification uses batch_size parameter that is already used in non-statevector case.

Details and comments

I had success by setting batch_size=50 (memory footprint down to <20 GB).

CLAassistant · 2021-09-09T20:49:04Z

All committers have signed the CLA.

adekusar-drl · 2021-09-10T07:59:00Z

@rsln-s Thanks a lot for your contribution. Could you please add a reno file with a bug fix description? Also, if possible add a test to cover this fix.

@attp could you please take a look at the changes?

rsln-s · 2021-09-10T13:58:28Z

Added reno file. The correctness of the output of the code is checked by tests already in place; testing the memory usage may be complicated within constraints of gh-actions.

releasenotes/notes/batch-circuits-statevector-522a842c6f68d954.yaml

….yaml Co-authored-by: Steve Wood <40241007+woodsp-ibm@users.noreply.github.com>

woodsp-ibm · 2021-09-14T20:42:42Z

The spell checker also fails on this transpiled word from the release note. If you add it to .pylintdict file in the root, i.e. to our custom dictionary, since its correctly spelt, then that should get it to pass CI. (You will see the words are in lowercase in alphabetic order so just add it appropriately)

…arning into kernel_batching

rsln-s · 2021-09-14T21:48:33Z

Updated the .pylintdict dictionary

adekusar-drl

Looks good to me, thanks a lot!

adekusar-drl · 2021-09-15T16:24:22Z

@woodsp-ibm Do you have any comments?

woodsp-ibm · 2021-09-15T20:52:35Z

Can I ask you to make a minor change to the docstring for batch_size in the constructor. It says

batch_size: Number of circuits to batch together for computation. Default 1000.

specifically default 1000 yet the code has

batch_size: int = 900,

So it should state the default is 900. I think it may have been 1000 at one point but was changed, if I recall correctly, to 900 to fit more with the limits around the provider.

In terms of testing you say its covered by the current unit tests. From what I can see there is nothing explicitly testing that aspect. Of course batch_size defaults to 900 and is in main path. I am not sure what we do in test is ever affected by the batch size since from what I can see the tests are pretty small. Having said that if we had a test that dropped the batch size down I am not sure how to test it given its behavior is internal to evaluate - the only way comes to mind is hooking the quantum instance execute and checking the number of circuits is as expected along the way in addition the the final result being as expected.

attp · 2021-09-15T23:36:43Z

Looks good to me.

Regarding the batch_size docstring, you are correct @woodsp-ibm the default was originally 1000, and we changed it to 900 to match the backend limits and reduce the number of jobs sent. I must have missed updating the docstring in the PR.

rsln-s · 2021-09-17T21:31:40Z

Updated the docstring as requested by @woodsp-ibm.

woodsp-ibm · 2021-09-21T20:41:24Z

Just a note. Locally I changed the QuantumInstance execute method to print the number of circuits it was passed and ran the kernel unit tests. Its quite a small number so nowhere near the 900 limit - in fact its not even in double digits. Anyway I changed default back size in the kernel to a much small number and things worked but not all numbers were limited - presumably the ones via statevector usage as I was doing this from the main branch. Cloning your fork and doing the same from there then all the counts were limited - in fact I set it to 1 as a test and it printed all 1's and passed. So it seems its working ok. It would be nice if the unit testing did somehow test out the batch size but maybe that could/should be raised as a separate issue. @adekusar-drl any thoughts here - you commented early on about a test around the fix.

adekusar-drl · 2021-09-22T09:24:43Z

@woodsp-ibm When I mentioned unit tests I did not have anything special on my mind. In general, your idea of setting batch size to 1 and then running a test on the statevector simulator make sense.

@rsln-s What do you think?

woodsp-ibm · 2021-09-22T11:49:46Z

In general, your idea of setting batch size to 1 and then running a test on the statevector simulator make sense.
I had done it so it applied to the qasm mode as well - since it appeared not tested in general. While the final outcome can be checked as currently, what is more complicated to do is to check that indeed the batch size is limiting the number of internal computations (circuits). My only thought on perhaps how to do such a check. as I mentioned in an earlier comment was about hooking the QuantumInstance execute method on the instance used with the backend such that the number of circuits given to the method could be fairly easily intercepted and checked - i.e hook it and do whatever was needed to check then call over to the original method that was hooked so that the circuit results from execute can be returned.

adekusar-drl · 2021-10-05T08:26:09Z

@woodsp-ibm I approve, merge this PR and open an issue to improve tests for QuantumKernel. Any thoughts?

woodsp-ibm

merge this PR and open an issue to improve tests for QuantumKernel. Any thoughts?

@adekusar-drl that seems fine by me.

adekusar-drl · 2021-10-11T17:21:57Z

Closed and #242 is opened.

… simulator (qiskit-community#209) Currently batch_size parameter is ignored if the statevector simulator is used. This leads to unreasonable memory use due to the size of the transpiled circuits, up to 1TB of RAM for 800 by 800 kernel matrix and 20 qubits (see qiskit-terra, issue #6991). This pull request fixes this by transpiling and simulating circuits in batches, never storing the entire 800 circuits. The modification uses batch_size parameter that is already used in non-statevector case. * initial attempt that passes the tests * further improvement * fix formatting * added reno file * Update releasenotes/notes/batch-circuits-statevector-522a842c6f68d954.yaml Co-authored-by: Steve Wood <40241007+woodsp-ibm@users.noreply.github.com> * added the word `transpiled` to the dictionary for spellchecker * updated docstring to accurately reflect default batch size Co-authored-by: Anton Dekusar <62334182+adekusar-drl@users.noreply.github.com> Co-authored-by: Steve Wood <40241007+woodsp-ibm@users.noreply.github.com> Co-authored-by: Manoel Marques <Manoel.Marques@ibm.com>

rsln-s added 2 commits September 9, 2021 13:29

initial attempt that passes the tests

4863ccc

further improvement

1ebda1b

rsln-s requested review from adekusar-drl, manoelmarques, pbark, stefan-woerner and woodsp-ibm as code owners September 9, 2021 20:49

rsln-s mentioned this pull request Sep 9, 2021

Memory usage grows quickly with circuit depth Qiskit/qiskit#6991

Closed

fix formatting

48c1930

added reno file

69bed11

Merge branch 'main' into kernel_batching

129ddce

woodsp-ibm reviewed Sep 14, 2021

View reviewed changes

releasenotes/notes/batch-circuits-statevector-522a842c6f68d954.yaml Outdated Show resolved Hide resolved

Update releasenotes/notes/batch-circuits-statevector-522a842c6f68d954…

7503a6d

….yaml Co-authored-by: Steve Wood <40241007+woodsp-ibm@users.noreply.github.com>

rsln-s added 2 commits September 14, 2021 16:46

added the word transpiled to the dictionary for spellchecker

c7021b6

Merge branch 'kernel_batching' of github.com:rsln-s/qiskit-machine-le…

16eb1f3

…arning into kernel_batching

Merge branch 'main' into kernel_batching

cd1973d

rsln-s requested a review from woodsp-ibm September 15, 2021 13:24

adekusar-drl previously approved these changes Sep 15, 2021

View reviewed changes

Merge branch 'main' into kernel_batching

6ba29ec

manoelmarques and others added 2 commits September 17, 2021 14:19

Merge branch 'main' into kernel_batching

40f8307

updated docstring to accurately reflect default batch size

710d9c5

rsln-s dismissed adekusar-drl’s stale review via 710d9c5 September 17, 2021 21:29

Merge branch 'main' into kernel_batching

b8a09ea

manoelmarques and others added 2 commits September 23, 2021 11:13

Merge branch 'main' into kernel_batching

320209c

Merge branch 'main' into kernel_batching

eb81570

Merge branch 'main' into kernel_batching

0e3f4d9

woodsp-ibm approved these changes Oct 6, 2021

View reviewed changes

manoelmarques and others added 2 commits October 6, 2021 19:04

Merge branch 'main' into kernel_batching

83aad93

Merge branch 'main' into kernel_batching

372619d

adekusar-drl mentioned this pull request Oct 11, 2021

Improve unit tests when batching is used in QuantumKernel #242

Closed

adekusar-drl merged commit d766fa3 into qiskit-community:main Oct 11, 2021

declanmillar mentioned this pull request Feb 8, 2022

Add unit tests for QuantumKernel circuit batching #306

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Batching of circuits to overcome memory issues when using statevector simulator #209

Batching of circuits to overcome memory issues when using statevector simulator #209

rsln-s commented Sep 9, 2021

CLAassistant commented Sep 9, 2021 •

edited

Loading

adekusar-drl commented Sep 10, 2021

rsln-s commented Sep 10, 2021

woodsp-ibm commented Sep 14, 2021 •

edited

Loading

rsln-s commented Sep 14, 2021

adekusar-drl left a comment

adekusar-drl commented Sep 15, 2021

woodsp-ibm commented Sep 15, 2021

attp commented Sep 15, 2021

rsln-s commented Sep 17, 2021

woodsp-ibm commented Sep 21, 2021 •

edited

Loading

adekusar-drl commented Sep 22, 2021

woodsp-ibm commented Sep 22, 2021

adekusar-drl commented Oct 5, 2021

woodsp-ibm left a comment

adekusar-drl commented Oct 11, 2021

Batching of circuits to overcome memory issues when using statevector simulator #209

Batching of circuits to overcome memory issues when using statevector simulator #209

Conversation

rsln-s commented Sep 9, 2021

Summary

Details and comments

CLAassistant commented Sep 9, 2021 • edited Loading

adekusar-drl commented Sep 10, 2021

rsln-s commented Sep 10, 2021

woodsp-ibm commented Sep 14, 2021 • edited Loading

rsln-s commented Sep 14, 2021

adekusar-drl left a comment

Choose a reason for hiding this comment

adekusar-drl commented Sep 15, 2021

woodsp-ibm commented Sep 15, 2021

attp commented Sep 15, 2021

rsln-s commented Sep 17, 2021

woodsp-ibm commented Sep 21, 2021 • edited Loading

adekusar-drl commented Sep 22, 2021

woodsp-ibm commented Sep 22, 2021

adekusar-drl commented Oct 5, 2021

woodsp-ibm left a comment

Choose a reason for hiding this comment

adekusar-drl commented Oct 11, 2021

CLAassistant commented Sep 9, 2021 •

edited

Loading

woodsp-ibm commented Sep 14, 2021 •

edited

Loading

woodsp-ibm commented Sep 21, 2021 •

edited

Loading