GH-119866: Spill before escaping calls in all cases. #124392

markshannon · 2024-09-23T22:39:06Z

Also fixes #123391

This PR spills the stack pointer across escaping calls, also writing the contents of the stack to memory where necessary.

How it works:

Adds a new Storage class which combines the state of the stack and the local variables within a micro-op.
Adds copying and merging functionality to that class, and the underlying stack and locals, to support tracking state across divergent flow.
Splits and merges the storage in if statements.
Tracks the state of conceptual stack and local variables, ensuring the necessary values are written to memory when needed.
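As a rough illustration of the copy-and-merge idea, here is a minimal sketch. It is not the PR's actual Storage class: the field names, the `flush` helper, and the merge rules are all assumptions made for this example. The state is copied at an `if`, each path updates its own copy, and the copies are joined afterwards.

```python
from dataclasses import dataclass, field


@dataclass
class Storage:
    """Hypothetical stand-in for the PR's Storage class (not the real API)."""

    stack: list[str] = field(default_factory=list)       # names conceptually on the stack
    in_memory: int = 0                                    # stack entries already written to memory
    live_locals: set[str] = field(default_factory=set)   # locals that still hold a reference

    def copy(self) -> "Storage":
        # Each branch of an `if` gets its own independent state.
        return Storage(list(self.stack), self.in_memory, set(self.live_locals))

    def flush(self) -> None:
        # Model writing every conceptual stack entry to memory.
        self.in_memory = len(self.stack)

    def merge(self, other: "Storage") -> None:
        # Assumption for this sketch: both paths must agree on what is on the
        # stack and on which locals are live, otherwise the join is an error.
        if self.stack != other.stack:
            raise ValueError(f"stacks differ at merge: {self.stack} vs {other.stack}")
        if self.live_locals != other.live_locals:
            raise ValueError("live locals differ at merge")
        # Only entries written on both paths can be assumed to be in memory.
        self.in_memory = min(self.in_memory, other.in_memory)


before = Storage(stack=["left", "right"], live_locals={"left", "right"})
then_branch = before.copy()
then_branch.flush()            # this path spilled, e.g. before an escaping call
else_branch = before.copy()    # this path did not
then_branch.merge(else_branch)
print(then_branch.in_memory)
```

After the merge, the sketch conservatively treats nothing as being in memory, because only one of the two paths wrote the stack contents out.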

The code generator needs to tell when inputs are dead, in order to know when the stack_pointer should be reduced when a call escapes. We can track explicit PyStackRef_CLOSE and DECREF_INPUTS, but many cases are implicit.
For these cases we add the DEAD macro to mark the variable as dead.
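To make the link between dead inputs and the stack pointer concrete, here is a simplified model. It is a sketch under stated assumptions, not the generator's algorithm: inputs are listed bottom-to-top, and only a contiguous run of dead inputs at the top of the stack can be dropped before spilling. The function name `slots_to_keep` is hypothetical.

```python
def slots_to_keep(inputs: list[str], dead: set[str]) -> int:
    """Return how many input slots must remain on the stack at an escaping call.

    `inputs` is ordered bottom-to-top. Dead inputs at the top can be dropped,
    so the spilled stack pointer can be lowered past them. A live input keeps
    every slot below it in place, even if some of those slots are dead.
    """
    depth = len(inputs)
    while depth > 0 and inputs[depth - 1] in dead:
        depth -= 1
    return depth


# Top input already closed or marked dead: one slot can be dropped.
print(slots_to_keep(["container", "sub"], {"sub"}))        # 1
# Only the bottom input is dead: nothing can be dropped yet.
print(slots_to_keep(["container", "sub"], {"container"}))  # 2
# Both inputs dead: the stack pointer can drop below both.
print(slots_to_keep(["container", "sub"], {"container", "sub"}))  # 0
```

In this model, whether a name ends up in `dead` through an explicit close or through a DEAD-style marker makes no difference; the marker only supplies information the tool cannot infer on its own.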

To simplify parsing, the code generator also enforces PEP 7 rules for braces.
A few of the changes in bytecodes.c are a result of this change.

Both interpreter performance and JIT performance show no slowdown.

Replaces #123397

Issue: Spill the stack pointer across calls in the interpreter. #119866

picnixz

Thanks for addressing my comments! I'm unfortunately not qualified enough to review the rest :')

markshannon · 2024-10-03T14:17:46Z

I've addressed all the issues.

I've increased the number of functions that we treat as non-escaping, but not every function mentioned above.

I think it is safer to do this than to try to mark everything that is non-escaping and risk making a mistake, or risk one of those functions changing in the future such that it does escape.

Since the cost of spilling is low, spilling around such maybe-non-escaping calls has no measurable performance impact.
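The conservative default described above can be sketched as an allowlist check: any call not explicitly listed is treated as escaping. This is only an illustration. The allowlist contents, the regular expression, and the function name `escaping_calls` are assumptions for this example, and the real analyzer works on tokens rather than on raw text.

```python
import re

# Illustrative allowlist only; which names the PR actually treats as
# non-escaping is not shown here.
NON_ESCAPING = frozenset({"Py_INCREF", "Py_SIZE", "PyFloat_AS_DOUBLE"})

# Keywords that can be followed by "(" but are not calls.
C_KEYWORDS = frozenset({"if", "while", "for", "switch", "return", "sizeof"})

# An identifier immediately followed by "(" is treated as a call.
# A cast such as "(int)x" does not match, since "int" is followed by ")".
CALL = re.compile(r"\b([A-Za-z_]\w*)\s*\(")


def escaping_calls(statement: str) -> list[str]:
    """Return the calls in `statement` that must be assumed to escape."""
    return [
        name
        for name in CALL.findall(statement)
        if name not in NON_ESCAPING and name not in C_KEYWORDS
    ]


print(escaping_calls("res = PyNumber_Add(left, right);"))  # ['PyNumber_Add']
print(escaping_calls("Py_INCREF(value);"))                 # []
print(escaping_calls("size = (int)Py_SIZE(seq);"))         # []
```

With this shape, forgetting to list a function costs only an unnecessary spill, while wrongly listing one could hide a real escape.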

Fidget-Spinner

I'm going to approve this on the condition that usage of DEAD is properly documented in a follow-up PR. I don't think it's so clear when to use it correctly and when it might be misused.

Fidget-Spinner · 2024-10-04T13:10:37Z

Alternatively, you can add the documentation in, since you need to fix the failing test cases anyways.

markshannon · 2024-10-04T14:26:16Z

I will leave the documentation of DEAD to another PR, as ERROR_IF and other macros are not documented either.
It will be easier to do them all together.

kumaraditya303 · 2024-10-04T14:28:08Z

Lib/test/test_asyncio/test_streams.py

-            self.assertListEqual(gc.get_referrers(exc), [])
-
-        asyncio.run(main())
+            self.assertListEqual(gc.get_referrers(exc), [main_coro])


Why this change?

Because we are now spilling the stack pointer, the GC can see the locals of the generator.
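A small way to observe this effect, written as a sketch rather than as a statement of guaranteed behaviour: the generator below asks the GC for the referrers of one of its own locals while its frame is executing. The result depends on the interpreter version, so no expected output is given; the function name is made up for this example.

```python
import gc


def running_generator():
    exc = ValueError("example")
    # Called while this generator's own frame is running. Whether the
    # generator shows up as a referrer depends on whether the interpreter
    # has made the frame's locals visible to the GC at this point.
    referrers = gc.get_referrers(exc)
    yield [type(r).__name__ for r in referrers]


gen = running_generator()
names = next(gen)
print(names)
```

An empty list means the GC could not see the local from the running frame; a non-empty list means it could, which is the change the updated test assertion reflects.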

gvanrossum

The changes to asyncio look fine to me, but maybe a comment (along the lines of what you explained to Kumar) would be helpful.

brandtbucher

I'm far from an expert on the cases generator, but I didn't see any obvious issues there. A few notes:

It should be an error if somebody uses a name after it has been killed by either DEAD(...) or INPUTS_DEAD().
It should be an error to kill a name twice in the same path.
Can you remind me why DEAD(...)/INPUTS_DEAD() needs to be explicit in the code, and can't just be inferred from the location of the last use? I vaguely remember discussing this at the sprint, but I forget why.
Can we handle scalars-under-arrays in the analyzer, so we don't have these awkward length-one arrays?
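The first two notes amount to a per-path liveness check. A minimal sketch of such a check follows; the class and method names are hypothetical and this is not how the cases generator is implemented.

```python
class LivenessError(Exception):
    """Raised when a name is used after being killed, or killed twice."""


class Liveness:
    """Tracks which names are live along a single control-flow path."""

    def __init__(self, names):
        self.live = set(names)
        self.dead = set()

    def copy(self) -> "Liveness":
        # Each branch gets its own state, so killing a name once in each
        # branch of an `if` is not a double kill.
        clone = Liveness(self.live)
        clone.dead = set(self.dead)
        return clone

    def use(self, name: str) -> None:
        if name in self.dead:
            raise LivenessError(f"'{name}' used after being killed")
        if name not in self.live:
            raise LivenessError(f"'{name}' is not a known name")

    def kill(self, name: str) -> None:
        if name in self.dead:
            raise LivenessError(f"'{name}' killed twice on the same path")
        if name not in self.live:
            raise LivenessError(f"'{name}' is not a known name")
        self.live.remove(name)
        self.dead.add(name)


state = Liveness(["left", "right"])
state.use("left")
state.kill("left")        # models DEAD(left) or a close of `left`
try:
    state.use("left")
except LivenessError as err:
    print(err)
```

Calling `state.kill("left")` a second time raises in the same way, which covers the second note.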

I'm approving this PR because I think it probably needs to happen and I trust that this is the best way of doing this. But I think in general, we might need to take a step back and consider the huge amount of complexity that has accumulated in bytecodes.c, and the thousands of lines of code in the cases generator that process it. The DSL was originally introduced to reduce the boilerplate and complexity in the interpreter loop... I think it still does a good job of this, as evidenced by this PR. But the mental load when reading and editing this file has crept up incrementally with time, and I'm worried that rate is accelerating. It's beyond the scope of this issue, but we should probably consider if the DSL should be reworked to better support the needs of the cases generator.

brandtbucher · 2024-10-05T00:14:08Z

Thanks for tackling this, by the way. It was a big project and it's really cool to see it working correctly (the newly-failing test was neat to see).

markshannon · 2024-10-07T13:41:15Z

Can you remind me why DEAD(...)/INPUTS_DEAD() needs to be explicit in the code, and can't just be inferred from the location of the last use? I vaguely remember discussing this at the sprint, but I forget why.

Because variables hold references to objects, the last use doesn't kill the variable. PyStackRef_CLOSE() closes the reference and kills the variable.
However, for calls that consume references, and for immortal objects, the code generator needs to be told that the reference is dead with DEAD.

markshannon · 2024-10-07T13:43:24Z

Can we handle scalars-under-arrays in the analyzer, so we don't have these awkward length-one arrays?

Theoretically yes, but mixing scalars and arrays is awkward. We want to be able to move scalars into registers, but having gaps in the in-memory stack makes things complicated.

markshannon · 2024-10-07T13:45:44Z

#125046 for the other issues.

markshannon added 30 commits August 7, 2024 16:13

Add copy and == support to Stack class

75bda28

Merge branch 'main' into stack-copy-and-merge

c7e9102

blacken stack.py

d331371

Cases generator: Track reachability and divergent stacks in if statements.

4673d8a

Fix type errors and rename ahead to look_ahead

132df06

Track state of output variables

b7f71d4

Handle stack and output locals togather as Storage class

b97dea9

Track locals as well as stack on differing paths

9838a05

Use 'PEP 7' in syntax error, to make it clear that this is not a C syntax error, just a tool limitation.

1f829be

Push peeks back to stack in optimizer code gen

d2e5f12

Merge branch 'main' into stack-copy-and-merge

2753014

Update test

1754fc4

Cleanup whitespace

98f9720

Add tests for PEP 7 parsing and escaping call in condition

0284b3f

Remove merge artifact

03bea71

Remove test for escaping calls in conditions

ca2f457

Spill before escaping calls. Initial attempt

57de61f

Clean up asserts

a2e430a

Update test

95408d3

Flush locals as well on error

eb3a645

Spill stack on escaping calls. Preparatory work

00f5265

Spill stack contents on escaping calls

7de1e60

Find end of statement when anlyzing escaping calls

3bfed1b

Spill stack pointer as well. Work in progress

d7e1c82

Don't allow escaping calls in ERROR_IF or DEOPT_IF

2cc4f64

Improve tracking of stack and locals in conditional flow

8e258ad

Handle ERROR_NO_POP correctly

6da3fc6

Fix up handling of liveness

60ee3e9

Allow assignments of new refs to input as well as output variables.

9f2f3bb

Merge branch 'main' into spill-before-escaping-calls-2

87b5561

Remove debugging code

6771c4a

picnixz reviewed Oct 3, 2024

markshannon added 6 commits October 3, 2024 06:07

Add more non-escaping functions

8c48eaa

Move casts to avoid extra spilling. Regenerate files

6288f80

Sort non-escaping function names

d183767

A cast to (int) is not a call

0244d77

Prohibit switch statements in bytecodes.c and remove the last one.

efb5e7d

Insert extra braces to handle 'else if' without an 'else' correctly.

de3b86c

markshannon added 2 commits October 3, 2024 07:36

Merge branch 'main' into spill-before-escaping-calls-2

2db84df

Fix test. gc.get_referrers can now see values on generator stacks

a1e55d4

Fidget-Spinner approved these changes Oct 4, 2024

bedevere-app bot added awaiting merge and removed awaiting core review labels Oct 4, 2024

markshannon requested review from 1st1, asvetlov, gvanrossum, kumaraditya303 and willingc as code owners October 4, 2024 14:22

kumaraditya303 reviewed Oct 4, 2024

gvanrossum reviewed Oct 4, 2024

brandtbucher approved these changes Oct 5, 2024

markshannon merged commit da071fa into python:main Oct 7, 2024
62 of 63 checks passed

bedevere-app bot removed the awaiting merge label Oct 7, 2024
