Popular repositories Loading
-
LookaheadDecoding
LookaheadDecoding Public[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
-
Consistency_LLM
Consistency_LLM Public[ICML 2024] CLLMs: Consistency Large Language Models
Repositories
Showing 10 of 19 repositories
- Awesome-Video-Attention Public
A curated list of recent papers on efficient video attention for video diffusion models, including sparsification, quantization, and caching, etc.
hao-ai-lab/Awesome-Video-Attention’s past year of commit activity - hao-ai-lab.github.io Public
hao-ai-lab/hao-ai-lab.github.io’s past year of commit activity - ComfyUI Public Forked from comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
hao-ai-lab/ComfyUI’s past year of commit activity - vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
hao-ai-lab/vllm’s past year of commit activity - dynamo Public Forked from ai-dynamo/dynamo
A Datacenter Scale Distributed Inference Serving Framework
hao-ai-lab/dynamo’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…