triedb/pathdb, eth: introduce Double-Buffer Mechanism in PathDB #30464

rjl493456442 · 2024-09-19T06:18:10Z

Previously, PathDB used a single buffer to aggregate database writes, which needed to be flushed atomically. However, flushing large amounts of data (e.g., 256MB) caused significant overhead, often blocking the system for around 3 seconds during the flush.

To mitigate this overhead and reduce performance spikes, a double-buffer mechanism is introduced. When the active buffer fills up, it is marked as frozen and a background flushing process is triggered. Meanwhile, a new buffer is allocated for incoming writes, allowing operations to continue uninterrupted.

This approach reduces system blocking times and provides flexibility in adjusting buffer parameters for improved performance.

Previously, PathDB used a single buffer to aggregate database writes, which needed to be flushed atomically. However, flushing large amounts of data (e.g., 256MB) caused significant overhead, often blocking the system for around 3 seconds during the flush. To mitigate this overhead and reduce performance spikes, a double-buffer mechanism is introduced. When the active buffer fills up, it is marked as frozen and a background flushing process is triggered. Meanwhile, a new buffer is allocated for incoming writes, allowing operations to continue uninterrupted. This approach reduces system blocking times and provides flexibility in adjusting buffer parameters for improved performance.

holiman

All in all, this looks promising, I suspect this could help quite a bit

triedb/pathdb/nodebuffer.go

holiman · 2024-09-19T06:58:09Z

triedb/pathdb/nodebuffer.go

+		nodes := writeNodes(batch, b.nodes, clean)
+		rawdb.WritePersistentStateID(batch, id)
+
+		// Flush all mutations in a single batch


Note: at this point, mutations were already applied on the clean, i.e, dl.cleans cache. That happened during writeNodes. I've tried to figure out if that is a problem, but come to the conclusion that it's fine, but just wanted to raise it so you can also give it a think.

Regarding "flush all mutations in a single batch" -- is that important only because of crash-safety, or some other more subtle reason?

How about this
in disklayer.go, function node(), we lookup a node. Order:

buffer

frozen

cleans

database

And if found, write to cleans

if dl.cleans != nil && len(blob) > 0 { dl.cleans.Set(key, blob) cleanWriteMeter.Mark(int64(len(blob))) }

I'm trying to think of a case where this write-to-cleans conflicts with the write-to-cleans in the background committer writeNodes method.

if it's found in buffer/frozen => return and no interaction with cache

if it's found in cache => return

if it's found in disk (it implicitly means the item is not in these places above, even the item is marked as deleted, it will still be caught in buffer/frozen/cache), load it from db and add it into the cache

so, no conflict should happen

But i have to say it's a really good point, i haven't thought about it

Regarding "flush all mutations in a single batch" -- is that important only because of crash-safety, or some other more subtle reason?

Only because of crash-safety

rjl493456442 requested review from karalabe and holiman as code owners September 19, 2024 06:18

rjl493456442 force-pushed the multibuffer branch from 569b961 to b48c0c9 Compare September 19, 2024 06:48

rjl493456442 force-pushed the multibuffer branch from b48c0c9 to 20b4ffd Compare September 19, 2024 07:08

holiman reviewed Sep 19, 2024

View reviewed changes

triedb/pathdb: address comments from martin

432633f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

triedb/pathdb, eth: introduce Double-Buffer Mechanism in PathDB #30464

triedb/pathdb, eth: introduce Double-Buffer Mechanism in PathDB #30464

rjl493456442 commented Sep 19, 2024

holiman left a comment

holiman Sep 19, 2024

holiman Sep 19, 2024

rjl493456442 Sep 19, 2024 •

edited

Loading

rjl493456442 Sep 19, 2024

triedb/pathdb, eth: introduce Double-Buffer Mechanism in PathDB #30464

Are you sure you want to change the base?

triedb/pathdb, eth: introduce Double-Buffer Mechanism in PathDB #30464

Conversation

rjl493456442 commented Sep 19, 2024

holiman left a comment

Choose a reason for hiding this comment

holiman Sep 19, 2024

Choose a reason for hiding this comment

holiman Sep 19, 2024

Choose a reason for hiding this comment

rjl493456442 Sep 19, 2024 • edited Loading

Choose a reason for hiding this comment

rjl493456442 Sep 19, 2024

Choose a reason for hiding this comment

rjl493456442 Sep 19, 2024 •

edited

Loading