
core/state: copy the snap when copying the state #22340

Merged
merged 2 commits into ethereum:master on Feb 18, 2021

Conversation

@holiman (Contributor) commented Feb 17, 2021

This PR attempts to fix an issue that was found on YoloV3, where the sealer does not deliver on the snap/1 protocol. The reason is that the state which the miner operates on does not have access to the snapshot, and is thus forced to use the trie backend for reading and cannot write updates to the snapshot tree.

On mainnet that would be bad: whenever the miner mines a block, it would cause a gap in the snapshot tree, making the miner fall out of sync with the snapshot and essentially nuking the snapshot functionality, perhaps also sending it into regenerate mode again and again.

This PR is a hacky first attempt to fix it. It does address the issue, but perhaps not optimally so, and there's an open question about how we should handle the case where the snapDestructs are non-empty (if that can ever happen).

To repro this case, I used a tiny private clique network and synced between two nodes locally. I can provide the files to repro it if anyone wants to give it a spin.
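
For reference, the shape of the fix inside (*StateDB).Copy is roughly the following. This is a sketch assembled from the diff fragments quoted in the review below, not necessarily the exact merged code; the struct{} value type for snapDestructs is assumed from the StateDB definition of that era.

if s.snaps != nil {
	// Share the snapshot tree itself so the copy can read from it and feed
	// updates into it, instead of falling back to the trie backend.
	state.snaps = s.snaps
	// Deep-copy the per-block accumulators so the copy and the original
	// never write into the same maps (see the pre-byzantium discussion below).
	state.snapDestructs = make(map[common.Hash]struct{})
	for k, v := range s.snapDestructs {
		state.snapDestructs[k] = v
	}
	state.snapAccounts = make(map[common.Hash][]byte)
	for k, v := range s.snapAccounts {
		state.snapAccounts[k] = v
	}
	state.snapStorage = make(map[common.Hash]map[common.Hash][]byte)
	for k, v := range s.snapStorage {
		temp := make(map[common.Hash][]byte)
		for kk, vv := range v {
			temp[kk] = vv
		}
		state.snapStorage[k] = temp
	}
}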

state.snapAccounts = make(map[common.Hash][]byte)
state.snapStorage = make(map[common.Hash]map[common.Hash][]byte)
if len(s.snapAccounts)+len(s.snapDestructs)+len(s.snapStorage) != 0 {
	panic("Oy vey!")
}
karalabe (Member):
I'd expect this to blow up on pre-byzantium with intermediate root calls?

holiman (author):
So... deep copy then?

// If we copy, we need to ensure concurrency safety.
// If we don't copy, we run the risk of consensus breaking.
// In theory, as the state is copied, it's still 'fresh', and these
// should be empty.
karalabe (Member):
The miner copies the state after running the transactions, but before claiming the block reward. It does this so it can pile more txs on top. In that case, the state is fresh only if no tx populated it (i.e. post-byzantium).

}
state.snapAccounts = make(map[common.Hash][]byte)
for k, v := range s.snapAccounts {
	state.snapAccounts[k] = v
holiman (author):
Q: do we need to also copy the v byteslice?

@karalabe (Member) commented Feb 17, 2021:
I don't think so. Important to know though that these values get inserted verbatim into a diff layer and when retrieving storage at least (or account RLP too) those get returned again verbatim. So snapshot.Storage(0x)[2] = 2 would modify it. That said, I don't know of any reason why you'd do such a thing :D

The snapshot explicitly warns:

// Note the returned slot is not a copy, please don't modify it.
func (dl *diffLayer) Storage(accountHash, storageHash common.Hash) ([]byte, error) {
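
A self-contained toy example (not geth code) of the aliasing described above: a shallow map copy shares its byte slices, so a caller that mutates a returned slot also mutates what the map, and hence the diff layer, holds.

package main

import "fmt"

func main() {
	// The map stands in for the snapshot's storage data; the slice is a slot value.
	store := map[string][]byte{"slot": {0xaa, 0xbb, 0xcc}}

	got := store["slot"] // handed back "verbatim", no copy made
	got[2] = 0x02        // a caller mutating the returned slot...

	fmt.Printf("%x\n", store["slot"]) // aabb02 -- ...changes the map's contents too
}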

for k, v := range s.snapStorage {
	temp := make(map[common.Hash][]byte)
	for kk, vv := range v {
		temp[kk] = vv
holiman (author):
And here, copy byteslice or not?

karalabe (Member):
Same as account

}
}

state.snaps = s.snaps
karalabe (Member):
Any reason for doing this a second time?

holiman (author):
... twice as secure ... (doh)

holiman (author):
fixed, squashpushed

@karalabe (Member) left a review:
SGTM

@karalabe (Member):
Perhaps to note for posterity: post-byzantium it should never happen that the maps contain something, because we only ever commit once at the end of the block. Pre-byzantium, however, we do an intermediate root after every tx, which essentially flushes into these sets, thus we need the deep copy.

It's debatable long term whether we should drop support for mining pre-byzantium (or pre-your-fav-fork), but that's for another day.
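
To make the note above concrete, here is a toy illustration (again not geth code) of why sharing a non-empty map between the original state and its copy is unsafe, while a deep copy isolates the two:

package main

import "fmt"

func main() {
	original := map[string][]byte{"acct1": {0x01}}

	shallow := original // both names point at the same map
	deep := make(map[string][]byte, len(original))
	for k, v := range original {
		deep[k] = v
	}

	// A later write by the original state (e.g. an intermediate root pre-byzantium)...
	original["acct2"] = []byte{0x02}

	fmt.Println(len(shallow), len(deep)) // 2 1 -- only the deep copy stayed isolated
}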

@karalabe karalabe added this to the 1.10.0 milestone Feb 18, 2021
@karalabe karalabe merged commit 52e5c38 into ethereum:master Feb 18, 2021
tony-ricciardi pushed a commit to tony-ricciardi/go-ethereum that referenced this pull request Jan 20, 2022
Cherry-picks bug fixes from upstream for snapshots, which will enable higher transaction throughput. It also enables snapshots by default (one of the commits pulled from upstream).

Upstream commits included:

68754f3 cmd/utils: grant snapshot cache to trie if disabled (ethereum#21416)
3ee91b9 core/state/snapshot: reduce disk layer depth during generation
a15d71a core/state/snapshot: stop generator if it hits missing trie nodes (ethereum#21649)
43c278c core/state: disable snapshot iteration if it's not fully constructed (ethereum#21682)
b63e3c3 core: improve snapshot journal recovery (ethereum#21594)
e640267 core/state/snapshot: fix journal recovery from generating old journal (ethereum#21775)
7b7b327 core/state/snapshot: update generator marker in sync with flushes
167ff56 core/state/snapshot: gethring -> gathering typo (ethereum#22104)
d2e1b17 snapshot, trie: fixed typos, mostly in snapshot pkg (ethereum#22133)
c4deebb core/state/snapshot: add generation logs to storage too
5e9f5ca core/state/snapshot: write snapshot generator in batch (ethereum#22163)
18145ad core/state: maintain one more diff layer (ethereum#21730)
04a7226 snapshot: merge loops for better performance (ethereum#22160)
994cdc6 cmd/utils: enable snapshots by default
9ec3329 core/state/snapshot: ensure Cap retains a min number of layers
52e5c38 core/state: copy the snap when copying the state (ethereum#22340)
a31f6d5 core/state/snapshot: fix panic on missing parent
61ff3e8 core/state/snapshot, ethdb: track deletions more accurately (ethereum#22582)
c79fc20 core/state/snapshot: fix data race in diff layer (ethereum#22540)

Other changes
Commit f9b5530 (not from upstream) fixes an incorrect default DatabaseCache value due to an earlier bad merge.

Tested:
Automated tests
Testing on a private testnet

Backwards compatibility:
Enabling snapshots by default is a breaking change in terms of CLI flags, but it will not make the node incompatible with other nodes.

Co-authored-by: Péter Szilágyi <peterke@gmail.com>
Co-authored-by: gary rong <garyrong0905@gmail.com>
Co-authored-by: Melvin Junhee Woo <melvin.woo@groundx.xyz>
Co-authored-by: Martin Holst Swende <martin@swende.se>
Co-authored-by: Edgar Aroutiounian <edgar.factorial@gmail.com>