
Avoid hardcoding a space at the beginning of the prompt. #1315

Closed
wants to merge 1 commit

Conversation

@ivanstepanovftw (Collaborator) commented May 4, 2023

Added in #242 without a strong rationale.
Users can already insert a space manually at the beginning of the prompt if desired.

For example, I cannot get rid of token 15629 -> ' Manager'; I want it to be 3260 -> 'Manager':

main: prompt: ' Manager's Persona: Manager I work with in my company.
Manager: I am waiting.'
main: number of tokens in prompt = 22
     1 -> ''
 15629 -> ' Manager'
 29915 -> '''
 29879 -> 's'
  5196 -> ' Person'
 29874 -> 'a'
 29901 -> ':'
 15629 -> ' Manager'
   306 -> ' I'
   664 -> ' work'
   411 -> ' with'
   297 -> ' in'
   590 -> ' my'
  5001 -> ' company'
 29889 -> '.'
    13 -> '
'
  3260 -> 'Manager'
 29901 -> ':'
   306 -> ' I'
   626 -> ' am'
 10534 -> ' waiting'
 29889 -> '.'
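For context, the hardcoded prefix being discussed amounts to something like the following. This is a minimal sketch, not the actual llama.cpp code, and the function name is made up:

```cpp
#include <string>

// Sketch of the behavior in question: the main example unconditionally
// prepends a single space to the user prompt before tokenization, so
// "Manager" tokenizes as " Manager" (id 15629) rather than "Manager" (3260).
std::string add_leading_space(const std::string &prompt) {
    return " " + prompt;
}
```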

@DannyDaemonic (Collaborator)

I could be wrong about what's happening here, but I think with OpenLLaMA the BOS token is a lot more important: #1291

@ivanstepanovftw (Collaborator, Author)

I am trying pygmalion-7b model, which prompt looks like this example:

Assistant's Persona: Assistant is a highly intelligent language model trained to comply with user requests.
<START>
Assistant: Hello! How may I help you today?
You: What is Zork?
Assistant:

@slaren (Collaborator) commented May 4, 2023

The rationale is just duplicating what the SentencePiece tokenizer does.

@Green-Sky (Collaborator)

The rationale was that the LLaMA models were trained with a prefixed space. I agree that not every model has that requirement, but this was done to make it easier for users without that knowledge.

@ivanstepanovftw (Collaborator, Author)

Oh, I see:

% echo "I saw a girl with a telescope." | spm_encode --model=m.model
▁I ▁saw ▁a ▁girl ▁with ▁a ▁ te le s c o pe .
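That behavior is SentencePiece's `add_dummy_prefix` normalizer: a space is prepended and every space is mapped to the meta symbol ▁ (U+2581) before segmentation. A rough sketch of the idea, not SentencePiece's actual implementation:

```cpp
#include <string>

// Rough sketch of SentencePiece normalization with add_dummy_prefix=true:
// prepend one space, then replace every space with the "▁" meta symbol.
std::string sp_normalize(const std::string &text) {
    std::string with_prefix = " " + text;   // the dummy prefix
    std::string out;
    for (char c : with_prefix) {
        if (c == ' ') {
            out += "\xE2\x96\x81";          // UTF-8 encoding of U+2581 '▁'
        } else {
            out += c;
        }
    }
    return out;
}
```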

@ivanstepanovftw (Collaborator, Author)

This is called a dummy prefix: google/sentencepiece#282
Closing, as LLaMA and its derivatives use the default tokenizer settings.

@ivanstepanovftw ivanstepanovftw deleted the space branch May 4, 2023 14:39
@ggerganov (Owner)

You can put this functionality behind a cmd arg.
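A hypothetical sketch of that suggestion follows; the flag name `--no-prefix-space` and the `Params` struct are assumptions for illustration, not the option llama.cpp actually added:

```cpp
#include <cstring>
#include <string>

// Sketch: gate the leading space behind a command-line flag instead of
// hardcoding it. Defaulting prefix_space to true preserves current behavior.
struct Params {
    bool prefix_space = true;
    std::string prompt;
};

// Returns true if the argument was recognized and consumed.
bool parse_arg(Params &p, const char *arg) {
    if (std::strcmp(arg, "--no-prefix-space") == 0) {
        p.prefix_space = false;
        return true;
    }
    return false;
}

std::string build_prompt(const Params &p) {
    return p.prefix_space ? " " + p.prompt : p.prompt;
}
```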
