You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is there a way to modify logits_warper, so that it will apply different parameters (top_p, top_k, temperature) for each item in a batch instead of applying the same parameters for all items in the batch:
It will very useful for inference because it will increase the throughput. Currently, you cannot really use #7552 for inference because the same parameters apply to the entire batch.
Your contribution
Can help with implementing this feature and reviewing the PR.
The text was updated successfully, but these errors were encountered:
Sorry for replying so late. We recently allowed to pass customized logits wrapper to generate(). Could you maybe try to build a custom wrapper for your purpose this way?
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
Please note that issues that do not follow the contributing guidelines are likely to be ignored.
🚀 Feature request
Is there a way to modify
logits_warper
, so that it will apply different parameters (top_p
,top_k
,temperature
) for each item in a batch instead of applying the same parameters for all items in the batch:transformers/src/transformers/generation_utils.py
Line 1562 in f25a933
Motivation
It will very useful for inference because it will increase the throughput. Currently, you cannot really use #7552 for inference because the same parameters apply to the entire batch.
Your contribution
Can help with implementing this feature and reviewing the PR.
The text was updated successfully, but these errors were encountered: