sync release with main @ v0.5.0.post1-99-g8720c92e #63

Merged

dtrifiro

No description provided.

xingweiqu and others added 30 commits May 30, 2024 19:24
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
…e ::ordered_metadata modifier (introduced with PTX 8.5)" (vllm-project#5149)
Co-authored-by: xuhao <xuhao@cambricon.com>
Co-authored-by: Varun Sundar Rabindranath <varun@neuralmagic.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-neuralmagic@users.noreply.github.com>
…e_sharded_state.py (vllm-project#5151)

Signed-off-by: Ye Cao <caoye.cao@alibaba-inc.com>
…#5184)

Co-authored-by: mgoin <michael@neuralmagic.com>
…llm-project#4927)

This PR enables the fused topk_softmax kernel used in moe layer for HIP
Signed-off-by: kevin <kevin@anyscale.com>
@dtrifiro dtrifiro changed the title sync with main @ 8720c92e sync with main @ v0.5.0.post1-99-g8720c92e Jun 21, 2024
@dtrifiro dtrifiro requested a review from heyselbi June 21, 2024 14:16
@dtrifiro dtrifiro changed the base branch from main to release June 21, 2024 14:43
@dtrifiro dtrifiro changed the title sync with main @ v0.5.0.post1-99-g8720c92e sync release with main @ v0.5.0.post1-99-g8720c92e Jun 21, 2024
@heyselbi

/approve
/lgtm

openshift-ci bot commented Jun 21, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dtrifiro, heyselbi

The full list of commands accepted by this bot can be found here.

The pull request process is described here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-merge-bot openshift-merge-bot bot merged commit 7cc6a9b into opendatahub-io:release Jun 21, 2024
16 checks passed
heyselbi pushed a commit to red-hat-data-services/vllm that referenced this pull request Jun 21, 2024
…main

sync release with main @ v0.5.0.post1-99-g8720c92e
@dtrifiro dtrifiro deleted the sync-release-with-main branch June 24, 2024 09:58
Xaenalt pushed a commit that referenced this pull request Sep 18, 2024
* Add more detailed event names to profiler

* Add more profiler stats

* separate prompt and decode batch utilization

* Add more metrics

* revert engine/metrics.py changes

* un-singletonify (what a funny word) habana profiler

* formatting

* add batch block utilization metric

* fix division by zero

* fix batch_block_utilization formula

* minor refactors