feat: add Phi-3-mini model #119

WhiteNight123 · 2024-08-15T07:34:00Z

What's new!!

Phi-3-mini is a transformer decoder with a 4K context length; an extended variant, phi-3-mini-128K, uses LongRope to reach a 128K context. It is designed to be compatible with Llama-2, sharing the same tokenizer and package ecosystem. The model has a hidden dimension of 3072, 32 attention heads, and 32 layers, and was trained on 3.3T tokens in bfloat16. It is chat-finetuned and expects the following prompt template:
"<|user|>\n Question <|end|>\n <|assistant|>"
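As a rough illustration of the template above, here is a minimal Python sketch that wraps a single user question in the same markers. The function name `build_phi3_prompt` is hypothetical and not part of this PR; the whitespace is copied from the string quoted above and may differ from the exact template on the Hugging Face model card, so check there before relying on it.

```python
def build_phi3_prompt(question: str) -> str:
    """Wrap one user question in the chat template quoted in this PR.

    Assumption: spacing around the question follows the PR description
    verbatim; the upstream model card may use slightly different whitespace.
    """
    return f"<|user|>\n {question} <|end|>\n <|assistant|>"


if __name__ == "__main__":
    # Show the raw string (with escapes) so the markers are easy to inspect.
    print(repr(build_phi3_prompt("What is the capital of France?")))
```

The model's reply is expected to be generated after the trailing `<|assistant|>` marker.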

https://huggingface.co/microsoft/Phi-3-mini-4k-instruct

Signed-off-by: Guo Xiaoqiang

Fix BUG

Don't use llamafile_sgemm when dst is fp32 with aggregated_tensors

Signed-off-by: yirongjie

Add Phi-3-mini model

WhiteNight123 and others added 4 commits August 15, 2024 15:26

add Phi-3-mini model

00a1164

Add Phi-3-mini model

Merge branch 'main' into main

3c05ab7

fix

30a3f07

fix: matmul fp32 with aggregated_tensors

f0d743c

yirongjie self-requested a review August 15, 2024 15:32

yirongjie approved these changes Aug 15, 2024

yirongjie merged commit 59d8f4e into UbiquitousLearning:main Aug 15, 2024
1 check passed
