
Allow continuation in Instruct and Interact executors; fix a minor leak #852

Merged
merged 2 commits on Jul 15, 2024

Conversation

dpmm99
Contributor

@dpmm99 dpmm99 commented Jul 12, 2024

The LLaVA path still needs to be tested; I haven't used LLaVA in any capacity yet, myself. I tested both InstructExecutor and InteractiveExecutor in my actual applications by repeatedly pausing and continuing. I found no cases where the model cut off a word or switched trains of thought when continuing, nor any cases where it clearly misunderstood the next prompt after a continuation.

@SignalRT SignalRT self-assigned this Jul 14, 2024
@SignalRT
Collaborator

@dpmm99, I checked the PR with llava, and it seems to work. My only question is that the behavior changes when I switch the image.

Comparing the behavior with ollama when there is no prompt:

  • When tested with the first image, the behavior is the same as ollama's.
  • When tested with the second image, the behavior seems to change and the last answer is repeated; in ollama, the behavior is the same as with the first image.

Would we need to make some changes in the example LLavaInteractiveModeExecute.cs when we switch the images?

@dpmm99
Contributor Author

dpmm99 commented Jul 14, 2024

Thanks for looking!

If you just interrupted generation in the InferAsync loop and then called InferAsync again with text: null, I'd expect it to continue the cut-off generation as if it hadn't been interrupted. For example, the inference loop around line 107 could be changed to this to trivially test cancelling and continuing once:

var stopAfterWords = 5;
var cts = new CancellationTokenSource();
await foreach (var text in ex.InferAsync(prompt, inferenceParams, cts.Token))
{
    Console.Write(text);
    if (--stopAfterWords <= 0) cts.Cancel();
}
Console.Write("|"); // Just something to indicate that my call to Cancel happened
await foreach (var text in ex.InferAsync(null, inferenceParams))
{
    Console.Write(text);
}

It looks like that example just resets the KV cache for each new prompt when you give it a new image. I'm not sure if there's other state data that needs to be reset, but the example should probably call ex.GetStateData before the first prompt and ex.LoadState before each prompt, and maybe even ex.Context.GetState and ex.Context.LoadState, to reset it properly. If you just leave the prompt blank and hit enter again in this example, it won't trigger the new continuation behavior, because I maintained the original behavior for empty strings.
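A minimal sketch of that reset pattern, using the state methods named above (the imageChanged flag and surrounding loop are hypothetical stand-ins for the example's input handling, not the actual LLavaInteractiveModeExecute.cs code):

```csharp
// Capture a clean baseline before the first prompt is submitted.
var executorState = ex.GetStateData();
var contextState = ex.Context.GetState();

while (true) // read prompts from the user, as the example does
{
    if (imageChanged)
    {
        // Roll the executor and its context back to the pre-prompt
        // baseline instead of only clearing the KV cache.
        ex.LoadState(executorState);
        ex.Context.LoadState(contextState);
    }

    await foreach (var text in ex.InferAsync(prompt, inferenceParams))
        Console.Write(text);
}
```

This restores both the executor's bookkeeping and the context's KV state, so a new image starts from the same point as the very first prompt.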

I just downloaded the LLaVA models and tested (5x) the above modification to that example to make it cancel and continue on its own, and it finishes the sentence after the display-only | like I'd expect.

Collaborator

@SignalRT SignalRT left a comment


Looks good to me.

@martindevans
Member

I resolved the merge conflict, once the CI has finished I'll merge this.

@martindevans martindevans merged commit 5025ad9 into SciSharp:master Jul 15, 2024
6 checks passed
@dpmm99 dpmm99 deleted the feat/continuation branch July 16, 2024 22:08