gemma : use more bits for the token_embd.weight tensor (#5650)
* gemma : use Q8_0 for the token_embd.weight tensor
* llama : quantize token_embd.weight using output type