TE Gemma tutorial attempt#2 #1839


Draft: sudhakarsingh27 wants to merge 10 commits into main from te_gemma_tutorial_base
Conversation

sudhakarsingh27 (Collaborator)
Description

Adds a tutorial to showcase how to:

  1. use Transformer Engine's (TE) TransformerLayer in place of HuggingFace's GemmaDecoderLayer in Gemma models,
  2. use TE's non-paged and paged KV caches, and
  3. use CUDA Graphs and fp8_model_init to speed up generation.

Illustrative sketches of each step are shown below.
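
A minimal sketch of step 1, assuming the hyperparameters are read from the HuggingFace Gemma config; the tutorial's actual mapping (and the weight copying it performs) may differ:

```python
import transformer_engine.pytorch as te
from transformers import AutoConfig

# Pull Gemma's hyperparameters from the HuggingFace config.
config = AutoConfig.from_pretrained("google/gemma-7b")

# Build a TE TransformerLayer matching GemmaDecoderLayer's shape.
te_layer = te.TransformerLayer(
    hidden_size=config.hidden_size,
    ffn_hidden_size=config.intermediate_size,
    num_attention_heads=config.num_attention_heads,
    num_gqa_groups=config.num_key_value_heads,
    layernorm_epsilon=config.rms_norm_eps,
    hidden_dropout=0.0,
    attention_dropout=0.0,
    normalization="RMSNorm",    # Gemma normalizes with RMSNorm
    activation="geglu",         # Gemma's MLP is a gated GeLU
    self_attn_mask_type="causal",
)
# The tutorial would also copy the HF weights into te_layer and swap it
# into model.model.layers[i]; that wiring is omitted here.
```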
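For step 2, TE's generation-time KV cache is driven through InferenceParams, which is passed into the layer's forward call. The paged-cache constructor arguments vary across TE releases, so only the non-paged form is sketched; shapes and sizes below are illustrative assumptions:

```python
import torch
import transformer_engine.pytorch as te

# Pre-allocate a non-paged KV cache for up to 4 sequences of 2048 tokens.
inference_params = te.InferenceParams(
    max_batch_size=4,
    max_sequence_length=2048,
)

te_layer = te_layer.cuda()  # reuse the layer from the previous sketch
hidden = torch.randn(1, 4, 3072, device="cuda")  # one decode step, [s, b, h]

# The layer appends this step's K/V to the cache held in inference_params.
out = te_layer(hidden, inference_params=inference_params)
```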
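For step 3, fp8_model_init allocates weights directly in FP8 and make_graphed_callables captures the forward pass into a CUDA graph. Argument names follow TE's public API, but the sizes and single-layer setup are assumptions for illustration, not the tutorial's exact wiring:

```python
import torch
import transformer_engine.pytorch as te
from transformer_engine.common.recipe import DelayedScaling

fp8_recipe = DelayedScaling()

# Keep only FP8 copies of the weights instead of higher-precision masters.
with te.fp8_model_init(enabled=True):
    layer = te.TransformerLayer(3072, 24576, 16, device="cuda")  # illustrative sizes

# Capture the forward pass into a CUDA graph; later calls replay it.
sample_input = torch.randn(16, 8, 3072, device="cuda")
graphed_layer = te.make_graphed_callables(
    layer,
    (sample_input,),
    fp8_enabled=True,
    fp8_recipe=fp8_recipe,
)
out = graphed_layer(sample_input)
```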

Attempt#1 @ #829

Type of change

  • Documentation change (change only to the documentation, either a fix or new content)

@sudhakarsingh27 force-pushed the te_gemma_tutorial_base branch from 03729bc to 2a514cf on June 2, 2025 21:10
@sudhakarsingh27 force-pushed the te_gemma_tutorial_base branch from 2a514cf to 4757bfa on June 2, 2025 21:19
Signed-off-by: Sudhakar Singh <sudhakars@nvidia.com>
@sudhakarsingh27 force-pushed the te_gemma_tutorial_base branch 3 times, most recently from 5d7538e to 93960fd on June 16, 2025 22:09
Signed-off-by: Sudhakar Singh <sudhakars@nvidia.com>
@sudhakarsingh27 force-pushed the te_gemma_tutorial_base branch from 588fcd6 to 6cd3c1a on June 17, 2025 22:27