Add test to re-execute specified range of mainnet C-Chain blocks #4019

Open
wants to merge 98 commits into master

Conversation


@aaronbuchwald aaronbuchwald commented Jun 17, 2025

Why this should be merged

This PR adds a configurable C-Chain benchmark that can be run locally with Taskfile or via GitHub Actions. The workflow currently includes pull request and manual GH Action triggers, but once testing is done these should be changed to a cron job on master plus the manual GH Action trigger before merge.

The test clones a block database and a current-state database so that it can execute a range of blocks starting from the state at an arbitrary height N.
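
Conceptually, the re-execution loop then looks like the following sketch (hypothetical names; the actual implementation streams blocks through a channel, as shown in executeSequence later in this conversation):

// Execute blocks (start, end] on top of the cloned state at height start,
// parsing, verifying, and accepting each block just as consensus would.
for height := start + 1; height <= end; height++ {
    blk, err := vm.ParseBlock(ctx, blockBytes[height])
    if err != nil {
        return fmt.Errorf("failed to parse block %d: %w", height, err)
    }
    if err := blk.Verify(ctx); err != nil {
        return fmt.Errorf("failed to verify block %d: %w", height, err)
    }
    if err := blk.Accept(ctx); err != nil {
        return fmt.Errorf("failed to accept block %d: %w", height, err)
    }
}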

The CI workflow copies the data from an S3 bucket in the Ava Labs Experimental account in us-east-2 and requires AWS credentials to be provided. It can also be pointed at local filesystem directory paths for use on Snoopy/Linus (dedicated machines with large SSDs used for Firewood testing).

How this works

The test constructs an EVM instance that mimics the params passed in from AvalancheGo, including a metrics registry with the expected prefix and chain labels attached, so that it can be used directly with the existing Grafana dashboards used with tmpnet rather than needing to maintain an alternative set without the prefixes/labels.
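
A minimal sketch of that registry wrapping, assuming the "avalanche_snowman" prefix and chain="C" label mentioned in the review discussion below (the exact wiring in the test may differ):

// Mimic the chain manager: attach the chain label and the metric-name
// prefix so the output matches what the Grafana dashboards expect.
registry := prometheus.NewRegistry()
chainRegisterer := prometheus.WrapRegistererWith(
    prometheus.Labels{"chain": "C"},
    prometheus.WrapRegistererWithPrefix("avalanche_snowman_", registry),
)
// chainRegisterer is then handed to the EVM in place of the registerer
// AvalancheGo would normally provide.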

How this was tested

This has been tested by running it in CI and by running through the steps documented in the README. To review this PR, please walk through the steps in the benchmark README.

Need to be documented in RELEASES.md?

No.

Follow Ups to this PR

  • Remove the PR trigger in favor of a weekly cron job (the PR trigger is currently in place to exercise the workflow while testing this PR)
  • Identify the best place to run this job, e.g. ARC, a self-hosted runner, Blacksmith, etc.
  • Integrate GitHub Action Benchmark performance tracking on master by setting up a gh-pages branch or a separate repo (demo of what this will look like on my fork of Canoto here)

Potential Future Work

  • Identify a micro-benchmark that can be run on every PR and provides a reliable signal of whether a change causes a performance improvement or regression
  • Implement a job that re-executes the entire C-Chain history for full coverage, using multiple snapshots to execute multiple subranges in parallel (e.g. [0, 5m], [5m, 10m], etc.)
  • Implement a task/job to archive all blocks over configured ranges to S3 by fetching directly from the network
  • Add an optional step at the end of the re-execution job to post the resulting state to S3 (useful for the manual GH Action entry point to create new current-state snapshots)
  • Add support for running long-lived feature branches or alternative configurations as a cron job in addition to the default configuration on the master branch of AvalancheGo
  • Support multiple configurations (archive/full) and targets (EBS, SSD, etc.) to run the benchmark against
  • Support all VMs (P-Chain and X-Chain) for complete test coverage and for benchmarking the other VMs' performance
  • Provide a "How-To Guide" for custom VM testing using the same approach

@aaronbuchwald aaronbuchwald changed the title Aaronbuchwald/cchain reexecute range test test(cchain): Add test to re-execute specified range of mainnet blocks Jun 17, 2025
@aaronbuchwald aaronbuchwald changed the title test(cchain): Add test to re-execute specified range of mainnet blocks Add test to re-execute specified range of mainnet C-Chain blocks Jun 17, 2025
@aaronbuchwald
Collaborator Author

Note: running locally exports the metrics to Grafana correctly, but when using Linus/Snoopy (or in general), care must be taken to ensure that Prometheus is running correctly.

I ran into an issue where

fullCmd := "nohup " + cmdName + " " + args + " > " + logFilename + " 2>&1 & echo -n \"$!\" > " + pidPath

silently failed to start exporting the metrics because the address was already occupied by the Prometheus instance configured on those nodes.

Since StartPrometheus is not currently configurable, it may be desirable to make it configurable and to ensure that it fails loudly rather than silently; it was not obvious why metrics were being exported locally but not when running on Linus/Snoopy.
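
One way to fail loudly would be a pre-flight check along these lines (a sketch with a hypothetical helper; StartPrometheus does not currently do this):

import (
    "fmt"
    "net"
)

// ensureAddressFree fails fast if another process (e.g. an existing
// Prometheus instance) is already bound to the address.
func ensureAddressFree(addr string) error {
    ln, err := net.Listen("tcp", addr)
    if err != nil {
        return fmt.Errorf("address %s already in use: %w", addr, err)
    }
    return ln.Close()
}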

Comment on lines 63 to 73
- name: Download Previous Benchmark Result
  uses: actions/cache@v4
  with:
    path: ./cache
    key: ${{ runner.os }}-reexecute-cchain-range-benchmark.json
- name: Compare Benchmark Results
  uses: benchmark-action/github-action-benchmark@v1
  with:
    tool: 'go'
    output-file-path: $GITHUB_WORKSPACE/reexecution-data/reexecute-cchain-range.txt
    external-data-json-path: ./cache/${{ runner.os }}-reexecute-cchain-range-benchmark.json
Collaborator Author

Writing each run's result to the cache and comparing against the last entry enables detecting regressions between runs.

This should be changed so that only the master branch writes to the cache, and other triggers compare against the latest baseline set by master.

@aaronbuchwald aaronbuchwald marked this pull request as ready for review July 16, 2025 18:53
@aaronbuchwald aaronbuchwald requested a review from maru-ava as a code owner July 16, 2025 18:53
@aaronbuchwald
Collaborator Author

Trying out running on ARC and Blacksmith here:

start-block:
  description: 'The start block for the benchmark.'
  required: false
  default: '100'
Contributor

Rather than duplicating the defaults here and in the job, maybe use env vars e.g.

env: 
  START_BLOCK: '100'
  ...

on:
  pull_request:
  workflow_dispatch:
    inputs:
      start-block:
        description: 'The start block for the benchmark.'
        required: false
        default: ${{ env.START_BLOCK }}
        ...

- name: Configure AWS Credentials
  uses: aws-actions/configure-aws-credentials@v4
  with:
    role-to-assume: ${{ secrets.AWS_S3_READ_ONLY_ROLE }}
Contributor

Since this secret won't be available from fork branches, you'll want to make job execution conditional (as per the example of run-monitored-tmpnet-cmd) so that the job doesn't fail on fork-branch PRs.

- name: Set task env via GITHUB_ENV
  id: set-params
  run: |
    if [[ "${{ github.event_name }}" == "workflow_dispatch" ]]; then
Contributor

Since github.event.inputs.* will be undefined when the job is not triggered by workflow_dispatch, a simpler way of setting each variable would be using || to supply a default:

echo "START_BLOCK=${{ github.event.inputs.start-block || env.START_BLOCK }}" >> $GITHUB_ENV

Note that the echoed text needs to be appended to $GITHUB_ENV to affect the environment - the current form would only result in log output.

artifact_prefix: c-chain-reexecute-range
prometheus_username: ${{ secrets.PROMETHEUS_ID || '' }}
prometheus_password: ${{ secrets.PROMETHEUS_PASSWORD || '' }}
grafana_dashboard_id: 'c-chain-processing-custom-v8/c-chain-processing'
Contributor

Is this the correct dashboard ID now that you've incorporated the target metrics into the main C-Chain dashboard?

uses: ./.github/actions/run-monitored-tmpnet-cmd
with:
  run: ./scripts/run_task.sh reexecute-cchain-range-with-copied-data EXECUTION_DATA_DIR=${{ github.workspace }}/reexecution-data BENCHMARK_OUTPUT_FILE=${{ github.workspace }}/reexecute-cchain-range-benchmark-res.txt
  artifact_prefix: c-chain-reexecute-range
Contributor

This can be omitted since it is only relevant if inputs.runtime is set to one of process or kube. If there is an artifact to collect, the job will need to call actions/upload-artifact itself.

// This default delay is set above the default scrape interval used by StartPrometheus.
time.Sleep(tmpnet.NetworkShutdownDelay)

r.NoError(server.Stop())
Contributor

If this fails, it will prevent the removal of sdConfigFilePath. Ensure that both cleanup steps will always execute by registering them separately with tb.Cleanup.
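
A minimal sketch of that restructuring, using the names from the snippet above (cleanups run in LIFO order, so the server is stopped before its config file is removed):

tb.Cleanup(func() {
    // Registered first, runs last: remove the SD config regardless of
    // whether stopping the server succeeded.
    require.NoError(tb, os.Remove(sdConfigFilePath))
})
tb.Cleanup(func() {
    // Allow one final scrape before shutdown; this delay is set above the
    // default scrape interval used by StartPrometheus.
    time.Sleep(tmpnet.NetworkShutdownDelay)
    require.NoError(tb, server.Stop())
})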

func getCounterMetricValue(tb testing.TB, registry prometheus.Gatherer, query string) (float64, error) {
    metricFamilies, err := registry.Gather()
    r := require.New(tb)
    r.NoError(err)
Contributor

In a given function, please avoid mixing testify assertions with a returned error. In this case, maybe just return the error given that the caller is already using testify to check it?
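
A minimal sketch of the suggested shape, dropping the testify usage from the helper (the original's query semantics may be richer than the exact name match assumed here):

import "fmt"

func getCounterMetricValue(registry prometheus.Gatherer, query string) (float64, error) {
    metricFamilies, err := registry.Gather()
    if err != nil {
        return 0, fmt.Errorf("failed to gather metrics: %w", err)
    }
    for _, mf := range metricFamilies {
        if mf.GetName() != query {
            continue
        }
        for _, m := range mf.GetMetric() {
            if c := m.GetCounter(); c != nil {
                // Return the first matching counter's value.
                return c.GetValue(), nil
            }
        }
    }
    return 0, fmt.Errorf("counter metric %q not found", query)
}

The caller can then keep its existing r.NoError(err) check at the call site.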

sdConfigFilePath, err = tmpnet.WritePrometheusSDConfig(name, tmpnet.SDConfig{
    Targets: []string{server.Address()},
    Labels:  labels,
}, true)
Contributor

Please try to avoid the use of magic values like this one in favor of either a descriptively-named variable (e.g. withGithubLabels) or a comment (e.g. true /* withGithubLabels */).
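
For example, a sketch of the first suggestion applied to the snippet above:

// A descriptively-named variable in place of the bare boolean literal.
withGithubLabels := true
sdConfigFilePath, err = tmpnet.WritePrometheusSDConfig(name, tmpnet.SDConfig{
    Targets: []string{server.Address()},
    Labels:  labels,
}, withGithubLabels)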

// "avalanche_snowman" and the chain label (ex. chain="C") that would be handled
// by the [chain manager](../../../chains/manager.go).
func newConsensusMetrics(registry prometheus.Registerer) (*consensusMetrics, error) {
    errs := wrappers.Errs{}
Contributor

Why is this suggested for a single potential error?
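
For comparison, a plain-error version might look like this (a sketch assuming a single gauge; the actual fields of consensusMetrics may differ):

func newConsensusMetrics(registry prometheus.Registerer) (*consensusMetrics, error) {
    m := &consensusMetrics{
        lastAcceptedHeight: prometheus.NewGauge(prometheus.GaugeOpts{
            Name: "last_accepted_height",
            Help: "height of the last accepted block",
        }),
    }
    // A single registration has a single failure point, so a plain error
    // return suffices; wrappers.Errs pays off only when accumulating
    // errors across several registrations.
    if err := registry.Register(m.lastAcceptedHeight); err != nil {
        return nil, err
    }
    return m, nil
}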

func (e *vmExecutor) executeSequence(ctx context.Context, blkChan <-chan blockResult, executionTimeout time.Duration) error {
    blkID, err := e.vm.LastAccepted(ctx)
    if err != nil {
        return fmt.Errorf("failed to get last accepted block: %w", err)
Contributor

(No action required) Maybe differentiate the error from the subsequent one e.g. block -> block id?
