
New model editions (GPT4) #340

Closed
deep-diver opened this issue Apr 14, 2023 · 7 comments

Comments

@deep-diver
Contributor

deep-diver commented Apr 14, 2023

Hi @tloen

I have trained the following models on the GPT-4-generated Alpaca dataset (the one in this repo), and they are available on the Hugging Face Model Hub.

You can also find links to the training logs in each model repository.
I hope this is useful to someone, and I also hope these models can be added to the list in this repo.

@tloen tloen closed this as completed in a5815d4 Apr 14, 2023
@T-Atlas
Contributor

T-Atlas commented May 16, 2023

Hi @deep-diver
I tried training the adapter with the GPT-4 data myself, but I found that, compared with models trained on the original data, adapter models trained on the GPT-4 data emit the instruction and input during generation.
python generate.py --load_8bit --base_model 'decapoda-research/llama-7b-hf' --lora_weights 'gpt4-alpaca-lora-7b'
I would like to know whether this is normal.
[screenshot: generation output that repeats the instruction and input]
The following is an example from training with the original data:
python generate.py --load_8bit --base_model='decapoda-research/llama-7b-hf'
[screenshot: generation output]

@deep-diver
Contributor Author

I think that's expected. You need to trim the output and keep only the text after the Response marker.
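
For anyone hitting this, here is a minimal sketch of that trimming step. It assumes the standard Alpaca-style prompt template where the answer follows a "### Response:" marker; the marker string and function name are assumptions, so adjust them to match your template.

```python
# Minimal sketch: keep only the model's answer from the decoded output.
# Assumes the Alpaca-style template where the answer follows "### Response:".
def extract_response(decoded_output: str, marker: str = "### Response:") -> str:
    if marker in decoded_output:
        return decoded_output.split(marker, 1)[1].strip()
    return decoded_output.strip()
```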

@T-Atlas
Contributor

T-Atlas commented May 17, 2023

But I think the format and prompt template of the two datasets are the same. Do you have any idea why there is such a difference?

@su-park

su-park commented May 18, 2023

Hello.
I am experiencing the same issue that @T-Atlas posted above.
I prepared a benchmark set and compared the performance of Alpaca-7b on the same prompts.
The instruction and input are echoed back in the generated output.

@JianqiaoLu


It looks like the loss is applied not only to the model-generated output but also to the template tokens such as "instruction:" and "input:{input}".
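
As a hedged illustration of that point, here is a minimal, self-contained repro with a stand-in model (gpt2 and the prompt strings are placeholders, not code from this repo): when the labels are simply a copy of the input IDs, the cross-entropy loss also covers the template tokens, so the model learns to reproduce them at generation time.

```python
# Hypothetical repro: with labels == input_ids, the loss also covers the
# "### Instruction:" / "### Input:" template tokens, not just the response.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # stand-in model
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompt = "### Instruction:\nSay hello.\n\n### Response:\n"
response = "Hello!"

enc = tokenizer(prompt + response, return_tensors="pt")
labels = enc["input_ids"].clone()          # no masking of the prompt tokens
loss = model(**enc, labels=labels).loss    # loss includes template positions
print(loss)
```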

@T-Atlas
Contributor

T-Atlas commented May 19, 2023


That sounds reasonable. Have you made any attempt to correct it?

@JianqiaoLu


The only way that comes to my mind is to re-fine-tune the model and set the labels of the instruction, input, and other template tokens to -100 so they are excluded from the loss.
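
A minimal sketch of that masking, assuming the usual Hugging Face convention that label positions set to -100 are ignored by the cross-entropy loss (the tokenizer name and helper function are placeholders, not code from this repo). For what it's worth, I believe finetune.py here exposes a train_on_inputs option that does something similar when set to False, but please verify that rather than taking my word for it.

```python
# Hypothetical sketch: mask the prompt portion of the labels with -100 so the
# loss is computed only over the response tokens (-100 is ignored by the loss).
import torch
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")  # stand-in tokenizer

def build_masked_labels(prompt: str, response: str):
    prompt_ids = tokenizer(prompt, add_special_tokens=False)["input_ids"]
    full_ids = tokenizer(prompt + response, add_special_tokens=False)["input_ids"]
    labels = list(full_ids)
    # Ignore the prompt/template tokens; in practice the prompt and full text
    # can tokenize to slightly different boundaries, so treat this as approximate.
    labels[: len(prompt_ids)] = [-100] * len(prompt_ids)
    return torch.tensor([full_ids]), torch.tensor([labels])

input_ids, labels = build_masked_labels(
    "### Instruction:\nSay hello.\n\n### Response:\n", "Hello!"
)
```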
