Offline DSI simulation: Immediate pruning #33

keyboardAnt · 2024-07-16T14:50:34Z

This PR improves the accuracy of the offline DSI simulation by considering the following edge case where the first draft token of an iteration is rejected. This PR increases DSI speedups only slightly. The plots for ndim=50 in this setting do not change.

The edge case: If the current iteration's first draft token is rejected, we do not necessarily need to incur the latency of drafting. We incur the latency of drafting only if we didn't incur the latency of verifying the previous iteration. Note that the edge case holds for the online DSI simulation.

Update dsi.py

8cb79bb

keyboardAnt merged commit a9c4907 into main Jul 16, 2024
2 checks passed

keyboardAnt deleted the nadav/offline-simul-immediate-pruning branch July 16, 2024 14:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Offline DSI simulation: Immediate pruning #33

Offline DSI simulation: Immediate pruning #33

Uh oh!

keyboardAnt commented Jul 16, 2024

Uh oh!

Uh oh!

Uh oh!

Offline DSI simulation: Immediate pruning #33

Offline DSI simulation: Immediate pruning #33

Uh oh!

Conversation

keyboardAnt commented Jul 16, 2024

Uh oh!

Uh oh!

Uh oh!