Assert: Improve detection of bad calls to `assert.async()` callbacks #1642

Krinkle · 2021-07-29T01:26:35Z

Background

When creating two async pauses in a test, it was possible for a test to pass by invoking one of them twice, and the other not at all.

Easy scenario (though perhaps not realistic):

Use assert.async() twice, assigned as done1 and done2 in the same QUnit.test() case, and then simulate the failure scenario such that you wrongly call done1 two times, and forget to call done2.
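As a rough illustration of why this could slip through, here is a deliberately simplified model of a single shared counter. It is not QUnit's implementation; `createCounterPauses`, `pause`, and `isResumed` are hypothetical names made up for this sketch.

```javascript
// Hypothetical counter-based pause tracker (a simplified model, not QUnit's code).
function createCounterPauses() {
  let semaphore = 0;
  return {
    pause() {
      semaphore++;
      // Every returned callback decrements the same shared counter,
      // so the counter cannot tell which pause is being released.
      return function done() {
        semaphore--;
      };
    },
    isResumed() {
      return semaphore === 0;
    }
  };
}

const counter = createCounterPauses();
const done1 = counter.pause();
const done2 = counter.pause();

done1();
done1(); // wrong: this should have been done2()

// The counter is back at zero even though done2 was never called.
console.log(counter.isResumed()); // logs true
```

In this model the second `done1()` call is indistinguishable from a `done2()` call, which is the ambiguity the description refers to.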

Complex scenario across QUnit.test() and "afterEach" hooks, since these previously shared a single semaphore:

Use assert.async() once in a simple test, and schedule the resume call in the future, but then fail with an uncaught error. The uncaught error is found and Test.run() would internally kill the pause by resetting the semaphore to zero (this makes sense since we shouldn't wait for the release once the test is known to have failed).
After this reset, we proceed to the "afterEach" hook. Suppose this hook is also async, and during its execution, the originally scheduled resume call happens. This would effectively release the afterEach's async pause even though the hook has not finished yet, and then we proceed to the next test. That test would then fail when the afterEach's own release call happens, failing as "release during a different test".

This is the scenario of #1432.
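The sequence above can be sketched with the same kind of simplified shared-counter model. This is only an illustration under that assumption; `createSharedSemaphore` and its methods are hypothetical and do not correspond to QUnit internals.

```javascript
// Simplified model: the test body and its afterEach hook share one counter.
function createSharedSemaphore() {
  let semaphore = 0;
  return {
    pause() {
      semaphore++;
      return function release() {
        semaphore--;
      };
    },
    // Models the runner giving up on pending pauses after an uncaught error.
    reset() {
      semaphore = 0;
    },
    value() {
      return semaphore;
    }
  };
}

const shared = createSharedSemaphore();

const lateTestDone = shared.pause(); // test body pauses; its resume is scheduled for later
shared.reset();                      // uncaught error: the counter is reset to zero
const hookDone = shared.pause();     // the afterEach hook starts its own pause

lateTestDone();                      // the original resume call arrives during the hook
const hookReleasedEarly = shared.value() === 0;
console.log(hookReleasedEarly);      // logs true: the hook's pause looks released

hookDone();                          // the hook's own release now has nothing left to release
console.log(shared.value());         // logs -1
```

In this model the stray release simply drives the counter negative. In the scenario described above, the equivalent late release surfaces as a failure in the next test.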

Fix this and numerous other edge cases by making the returned callbacks from assert.async() strict about which locks they release.

Each lock now adds a unique token to a map, and invoking the release function decrements/removes this token from the map.
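A minimal sketch of the token idea follows, assuming a per-test `Map` keyed by pause ID. The names `pauses`, `nextPauseId`, and `cancelled` echo commit titles in this PR, but the structure here (including `createPauseTracker`, `cancelAll`, `pendingCount`, and `problems`) is an assumption made for illustration and not the merged code.

```javascript
// Hypothetical token-based pause tracker (illustrative only).
function createPauseTracker() {
  const pauses = new Map(); // pauseId -> { remaining, cancelled }
  const problems = [];      // messages for unexpected release calls
  let nextPauseId = 1;

  return {
    problems,
    pause(requiredCalls = 1) {
      const pauseId = nextPauseId++;
      pauses.set(pauseId, { remaining: requiredCalls, cancelled: false });
      // The callback closes over its own pauseId, so it can only
      // ever release the pause that created it.
      return function release() {
        const pause = pauses.get(pauseId);
        if (pause === undefined) {
          problems.push("extra release of async pause " + pauseId);
          return;
        }
        if (pause.cancelled) {
          return; // late call after a failure: tolerated, affects nothing else
        }
        pause.remaining--;
        if (pause.remaining === 0) {
          pauses.delete(pauseId);
        }
      };
    },
    // Models killing pending pauses once the test is known to have failed.
    cancelAll() {
      pauses.forEach(function (pause) {
        pause.cancelled = true;
      });
    },
    pendingCount() {
      let count = 0;
      pauses.forEach(function (pause) {
        if (!pause.cancelled) {
          count++;
        }
      });
      return count;
    }
  };
}

// Easy scenario: done1 twice, done2 never.
const easy = createPauseTracker();
const easyDone1 = easy.pause();
const easyDone2 = easy.pause();
easyDone1();
easyDone1();
console.log(easy.pendingCount());    // logs 1: the second pause is still pending
console.log(easy.problems.length);   // logs 1: the extra call was recorded

// Complex scenario: a late release after a failure, during an async hook.
const complex = createPauseTracker();
const testDone = complex.pause();
complex.cancelAll();                 // test failed with an uncaught error
const hookDone = complex.pause();    // afterEach hook pauses
testDone();                          // late call only sees its own cancelled pause
console.log(complex.pendingCount()); // logs 1: the hook is still paused
hookDone();
console.log(complex.pendingCount()); // logs 0
```

Keeping cancelled entries in the map, rather than deleting them, is one way to let late calls be recognised and ignored instead of being reported as unknown releases.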

Notes

es6-map.js assigns the fallback in all browsers.
This is a bug, to be fixed later.
The isNaN(semaphore) logic was originally added in 2015 by ea3e350.
At the time, the internal resume function was public, and NaN could emerge through QUnit.start("bla") as a result of semaphore += "bla". This has not been possible for a while. During PR #1590 (Test: increase code coverage), I did not trace the origin of this code, and thus did not realize that it was already obsolete (the semaphore itself is not publicly supported).
The "during different test" error is now almost impossible to trigger, since we now kill pending locks during test failures and tolerate all late calls equally. This means the drooling-done.js test case now fails in a more limited way.

I added a new test case for coverage that still reproduces it, but it is a lot more obscure: it requires the original test to pass and then also have an unexpected call during a different test.
I considered using the phrase "async lock" in the public-facing error messages, but found this perhaps too internal/technical when coming from the perspective of var done = assert.async();.

In order to keep the code shared between handling of async-await, Promise, and assert.async, but remain friendly and understandable, I went for the phrase "async pause".

Fixes #1432.


smcclure15

I know my domain has some test utilities that sniff at that semaphore bit to get really picky about choosing to throw or continue in different scenarios. Some areas use QUnit.log and, upon a failure while in a pause situation, want to jump to the next test so it's not left in a bad state. I anticipate this change handles all of that much more gracefully.

That semaphore logic is purely an implementation detail, so it would not be fair to hold up this PR or trigger a major version over it. I'm just noting that dependency for us and for others who may have come to rely on it.


gibson042

Nice!

gibson042 · 2021-07-29T16:16:01Z

src/html-reporter/es6-map.js

+// FIXME: This check is broken. This file is embedded in the qunit.js closure,
+// thus the Map var is hoisted in that scope, and starts undefined (not a function).
 var Map = typeof Map === "function" ? Map : function StringMap() {


Should we just fix it now?

Suggested change

-// FIXME: This check is broken. This file is embedded in the qunit.js closure,
-// thus the Map var is hoisted in that scope, and starts undefined (not a function).
-var Map = typeof Map === "function" ? Map : function StringMap() {
+if ( typeof Map !== "function" ) {
+  var Map = function StringMap() {

That won't work, it'll still be hoisted just the same, right? It's a pretty tough situation. I set it up such that it is included by rollup as intro (within the file closure, so as to not leak and be seen by end-user source code), and thus the variable will be seen by fuzzysort.js and other code we use naturally as if it was a global.

Without pulling globalThis in here, I'm not sure how to do this, because as soon as you declare anything as Map you inherently lose the ability to see the outer scope's value for that same reference.

Oh, right. What's needed is something like the following, which can't be fixed at this level (and is out of scope for this PR).

(function(…, Map, …) { if ( typeof Map !== "function" ) { Map = function StringMap() {…}; } … })(…, typeof Map === "function" && Map, …)
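To make the hoisting problem concrete, here is a small standalone sketch in plain JavaScript. It assumes an environment where a native `Map` exists; `brokenResult` and `fixedResult` are hypothetical names, and the empty `StringMap` bodies stand in for the real fallback.

```javascript
// Broken pattern: `var Map` is hoisted to the top of the enclosing function,
// so the `typeof Map` check reads the local (still undefined) binding
// instead of the native Map from the outer scope.
const brokenResult = (function () {
  var Map = typeof Map === "function" ? Map : function StringMap() {};
  return Map.name;
})();
console.log(brokenResult); // logs "StringMap", even though a native Map exists

// Parameter pattern: the outer value is read at the call site, before any
// local binding named Map exists, and is then passed in as an argument.
const fixedResult = (function (Map) {
  if (typeof Map !== "function") {
    Map = function StringMap() {};
  }
  return Map.name;
})(typeof Map === "function" && Map);
console.log(fixedResult); // logs "Map"
```

The second form only works when the wrapping function and its call site are under your control, which is why it cannot be applied from inside a file that is merely embedded in that closure.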


Follows-up 163c9bc (#1642), which changed an internalRecover() to internalStart(), whereas internalStart will (correctly) not resume if there are other pauses still remaining. Change this back to internalRecover(). Fixes #1705. Closes #1739.

Krinkle requested a review from smcclure15 July 29, 2021 01:26

Krinkle force-pushed the semaphore-map branch from 3121a99 to e122217 Compare July 29, 2021 01:42

smcclure15 approved these changes Jul 29, 2021


Krinkle and others added 4 commits July 29, 2021 17:09

Update src/test.js

759840b

Co-authored-by: Steve McClure <smcclure15@gmail.com>

Update src/cli/run.js

3fb2b5c

Co-authored-by: Steve McClure <smcclure15@gmail.com>

Add drooling-extra-done-outside.js

5aae48b

Rename pause.killed to pause.cancelled

b97a9e3

gibson042 reviewed Jul 29, 2021


Krinkle added 5 commits July 29, 2021 17:44

Use pause IDs as map keys, move hash to formatted string

5bc42c2

Move pause housekeeping up in timeoutHandler()

0969fb2

Call test.pushFailure() directly in timeoutHandler()

6f9ffea

No need to rely on global state for this. This also makes sure it is always reported under the correct test. Either way, we still get a last-minute check in pushFailure->pushResult that it is the current test.

Move timeoutDuration inline comment back

f754296

Rename internal test.asyncNextPauseId to test.nextPauseId

fbb9e92

Krinkle merged commit 9444070 into main Aug 2, 2021

Krinkle deleted the semaphore-map branch August 2, 2021 22:18

Krinkle added a commit that referenced this pull request Aug 2, 2021

Assert: Improve detection of bad calls to assert.async() callbacks

163c9bc

Closes #1432. Closes #1642.

Krinkle mentioned this pull request Feb 8, 2024

Core: Fix hanging assert.async() after assert.timeout() #1739

Merged
