Enable definition for generated interpreter cases to be composed from multiple files #102021

jbower-fb · 2023-02-18T04:14:47Z

Feature or enhancement

Allow specifying multiple input files to the generate_cases.py script, making it behave mostly as if the input files were concatenated. Additionally allow existing definitions of instructions to be explicitly overridden using a new override keyword.

I attach a PR with a proposed initial implementation.

Pitch

In Cinder we add a number of new instructions to support our features like Static Python. We also currently have a few tweaks to existing instructions. When we migrate to the new upstream generated interpreter it would be preferable if we could avoid having to make changes to the core bytecodes.c and keep our own definitions/changes separate. As well as easing upstream merges, this would also avoid us having to copy/fork more than we need for Cinder features in a standalone module.

I've made an initial implementation which allows extra files to be passed to generate_cases.py by repeated use of the -i argument. E.g.:

$ generate_cases.py -i bytecodes.c -i cinder-bytecodes.c -o generated_cases.c.h

This mostly behaves as if the input files are concatenated but parsing only takes place between the BEGIN/END BYTECODES markers in each file. We also take advantage of mostly existing book-keeping features to track which input files definitions come from when producing errors.

I've also added a new override keyword which can prefix instruction definitions to explicitly express the intent to override an existing definition. E.g.:

inst(NOP, (--)) {
}

// This is the definition which ends up being used in generation.
override inst(NOP, (--)) {
  magic();
}

// Error - previous definition of NOP exists and "override" not specified.
inst(NOP, (--)) {
}

// Error - requested override but no previous definition of ZOP exists.
override inst(ZOP, (--)) {
}

The goal of explicitly calling out overrides is to quickly reveal if either: something we modify is removed from upstream, or if a new opcode we add ends up with a name clash with a new upstream opcode.

Previous discussion

The idea of having multiple input files for interpreter generation was briefly discussed around the faster-cpython project.

Linked PRs

gh-102021 : Allow multiple input files for interpreter loop generator #102022

The text was updated successfully, but these errors were encountered:

…#102022) The input files no longer use `-i`.

gvanrossum · 2023-03-04T04:59:57Z

Thanks!

* main: pythongh-102021 : Allow multiple input files for interpreter loop generator (python#102022) Add import of `unittest.mock.Mock` in documentation (python#102346) pythongh-102383: [docs] Arguments of `PyObject_CopyData` are `PyObject *` (python#102390) pythongh-101754: Document that Windows converts keys in `os.environ` to uppercase (pythonGH-101840) pythongh-102324: Improve tests of `typing.override` (python#102325)

…erator (python#102022) The input files no longer use `-i`.

jbower-fb added the type-feature A feature request or enhancement label Feb 18, 2023

bedevere-bot mentioned this issue Feb 18, 2023

gh-102021 : Allow multiple input files for interpreter loop generator #102022

Merged

terryjreedy mentioned this issue Feb 18, 2023

Enable generated interpreter cases to be composed from multiple inputs #102020

Closed

gvanrossum pushed a commit that referenced this issue Mar 4, 2023

gh-102021 : Allow multiple input files for interpreter loop generator (…

8de59c1

…#102022) The input files no longer use `-i`.

gvanrossum closed this as completed Mar 4, 2023

hugovk pushed a commit to hugovk/cpython that referenced this issue Mar 6, 2023

pythongh-102021 : Allow multiple input files for interpreter loop gen…

46488eb

…erator (python#102022) The input files no longer use `-i`.

jbower-fb mentioned this issue Nov 6, 2023

GH-111485: Allow arbitrary annotations on instructions and micro-ops. #111697

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable definition for generated interpreter cases to be composed from multiple files #102021

Enable definition for generated interpreter cases to be composed from multiple files #102021

jbower-fb commented Feb 18, 2023 •

edited by bedevere-bot

Loading

gvanrossum commented Mar 4, 2023

Enable definition for generated interpreter cases to be composed from multiple files #102021

Enable definition for generated interpreter cases to be composed from multiple files #102021

Comments

jbower-fb commented Feb 18, 2023 • edited by bedevere-bot Loading

Feature or enhancement

Pitch

Previous discussion

Linked PRs

gvanrossum commented Mar 4, 2023

jbower-fb commented Feb 18, 2023 •

edited by bedevere-bot

Loading