
Update core.py to have 1 extra token #30


Open · wants to merge 6 commits into base: main

Conversation

mosheber (Collaborator)

  • added 1 token from fix history and all correct from the target
@mosheber mosheber requested a review from keyboardAnt July 14, 2024 13:55
mosheber and others added 2 commits July 14, 2024 17:27
* added +1 to the expected token count per iteration
keyboardAnt (Owner) left a comment


Please see the attached comments.

Comment on lines 31 to 33
total_tokens += correct + 1

sim_shared_dict["total_tokens"] = total_tokens
keyboardAnt (Owner)

Since total_tokens is not used after the following line, please consider this minor suggested change:

Suggested change
- total_tokens += correct + 1
- sim_shared_dict["total_tokens"] = total_tokens
+ sim_shared_dict["total_tokens"] = total_tokens + correct + 1
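To see that the suggestion is behavior-preserving, here is a minimal before/after sketch (the function names and sample values are made up; only the two statements come from the snippet above):

```python
def update_before(sim_shared_dict, total_tokens, correct):
    # Original: increment a local that is never read again afterwards.
    total_tokens += correct + 1
    sim_shared_dict["total_tokens"] = total_tokens
    return sim_shared_dict

def update_after(sim_shared_dict, total_tokens, correct):
    # Suggested: fold the increment into the single shared-dict write.
    sim_shared_dict["total_tokens"] = total_tokens + correct + 1
    return sim_shared_dict

# Both produce the same shared state (sample values are arbitrary):
assert update_before({}, 10, 3) == update_after({}, 10, 3) == {"total_tokens": 14}
```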

mosheber (Collaborator, Author)

done

@@ -99,7 +99,7 @@ def target_done_callback(args, res):
     else:
         # ALL CORRECT with {total_tokens + draft_tokens}

-        res_dict["total_tokens"] += res_dict["correct"]
+        res_dict["total_tokens"] += res_dict["correct"] + 1

     if res_dict["total_tokens"] > args.max_tokens:
keyboardAnt (Owner)

Does it mean we generate max_tokens+1 new tokens? Why not >=?

Suggested change
- if res_dict["total_tokens"] > args.max_tokens:
+ if res_dict["total_tokens"] >= args.max_tokens:
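The off-by-one can be checked with a toy loop (a made-up minimal simulation, not the repository's code): with max_tokens == 5 and one accepted token per iteration, `>` stops after 6 tokens while `>=` stops after 5.

```python
def count_generated(max_tokens, stop_at_or_above):
    """Made-up minimal loop: one token is 'accepted' per iteration."""
    total_tokens = 0
    while True:
        total_tokens += 1
        if stop_at_or_above:
            done = total_tokens >= max_tokens   # the suggested `>=`
        else:
            done = total_tokens > max_tokens    # the original `>`
        if done:
            return total_tokens

assert count_generated(5, stop_at_or_above=False) == 6  # `>` generates max_tokens + 1
assert count_generated(5, stop_at_or_above=True) == 5   # `>=` stops at max_tokens
```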

mosheber (Collaborator, Author)

done

@@ -99,7 +99,7 @@ def target_done_callback(args, res):
     else:
         # ALL CORRECT with {total_tokens + draft_tokens}

-        res_dict["total_tokens"] += res_dict["correct"]
+        res_dict["total_tokens"] += res_dict["correct"] + 1
keyboardAnt (Owner)

We call this line when the target accepts all the lookahead draft tokens, right? Does res_dict["total_tokens"] += res_dict["correct"] + 1 mean we accept an additional token? I'm asking because we should accept an additional token only if it is the last token (e.g., the 50th token where config.S == 50) or if the target rejects at least one draft token in the current iteration. Instead, we should terminate the iteration that speculates this additional token with probability 1 - acceptance_rate.

To conclude, there are two changes to the online simulation to boost the speedup of DSI:

  1. Accept an extra token if the target rejects a draft or if the extra token is the last.
  2. Simulate an immediate validation of the extra token by terminating the corresponding speculating iteration with probability 1 - acceptance_rate.

We can separate them into two PRs.
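The two proposed changes could be sketched roughly as follows (a hedged illustration only: simulate_iteration, lookahead, and acceptance_rate are hypothetical names, and the real simulation tracks durations and shared state rather than a single counter):

```python
import random

def simulate_iteration(total_tokens, lookahead, acceptance_rate, max_tokens, rng):
    """Illustrative only: one speculation iteration with the proposed extra-token rule."""
    correct = 0
    for _ in range(lookahead):
        if rng.random() < acceptance_rate:
            correct += 1
        else:
            # Change 1: on a rejection, the target contributes one extra (corrected) token.
            return total_tokens + correct + 1
    # All drafts accepted: keep the extra token unconditionally only if it is the last token...
    if total_tokens + correct + 1 >= max_tokens:
        return total_tokens + correct + 1
    # ...otherwise (change 2) its immediate validation fails w.p. 1 - acceptance_rate.
    if rng.random() < acceptance_rate:
        return total_tokens + correct + 1
    return total_tokens + correct

# Deterministic corner cases (no randomness when acceptance_rate is 0 or 1):
assert simulate_iteration(0, 5, 1.0, 50, random.Random(0)) == 6   # all accepted + validated extra
assert simulate_iteration(0, 5, 0.0, 50, random.Random(0)) == 1   # first draft rejected
assert simulate_iteration(44, 5, 1.0, 50, random.Random(0)) == 50 # extra token is the last
```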

mosheber (Collaborator, Author)

done

keyboardAnt (Owner)

@mosheber, please see my previous comments and let me know when the tests pass so I can do another iteration.

@keyboardAnt keyboardAnt self-requested a review July 28, 2024 20:45
keyboardAnt (Owner) left a comment

Please merge or rebase main and remove the skip marker in test_duration and test_num_of_fix_history (tests/integration/online/test_simul.py). To run these two tests serially: python ./scripts/test.py online -- -vvv
