-
Notifications
You must be signed in to change notification settings - Fork 451
Pull requests: NVIDIA/TransformerEngine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[BUG fix] solve the CI bug of router fusion
bug
Something isn't working
#1944
opened Jul 11, 2025 by
Autumn1998
Loading…
13 tasks
mxfp8 (for all gemm layouts) is not supported on 120+ arch yet
#1939
opened Jul 9, 2025 by
sudhakarsingh27
Loading…
1 of 13 tasks
Run-time checks for CUDA and cuBLAS versions
bug
Something isn't working
#1938
opened Jul 9, 2025 by
timmoon10
Loading…
5 of 13 tasks
[Common] Skip cuDNN 9.10.0/9.10.1 due to bugs
2.6.0
#1937
opened Jul 8, 2025 by
cyanguwa
Loading…
8 of 13 tasks
[JAX] Support Flax sharding constraints
#1933
opened Jul 7, 2025 by
jberchtold-nvidia
Loading…
8 of 13 tasks
[BUILD] Exclude ninja from required packages
#1932
opened Jul 7, 2025 by
phu0ngng
Loading…
5 of 13 tasks
[PyTorch] Optimize the performance of permute fusion kernels
#1927
opened Jul 4, 2025 by
hxbai
Loading…
7 of 13 tasks
[PyTorch] Fuse permute+pad and unpermute+unpad ops for FP8 optimization
#1921
opened Jul 3, 2025 by
xiaoxi-wangfj
Loading…
3 of 12 tasks
[JAX] Resolve test conflict in JAX helper tests
#1916
opened Jul 1, 2025 by
emmanuel-ferdman
Loading…
6 of 13 tasks
[Common] Optimize KV cache related kernels
2.6.0
#1914
opened Jun 30, 2025 by
cyanguwa
Loading…
8 of 13 tasks
Fix import error when flash attention 3 is installed
#1913
opened Jun 30, 2025 by
HollowMan6
Loading…
7 of 13 tasks
[PyTorch debug] Improve precision debug tools performance
#1909
opened Jun 30, 2025 by
pggPL
Loading…
9 of 13 tasks
[PyTorch] Support FA3 MLA CP feature
#1907
opened Jun 28, 2025 by
zhujian19891203
Loading…
7 of 13 tasks
[PyTorch Debug] Support log fp8 tensor stats for blockwise recipe
#1905
opened Jun 27, 2025 by
lengerfulluse
Loading…
12 tasks
[Common] NVFP4 kernels
enhancement
New feature or request
#1904
opened Jun 27, 2025 by
Oleg-Goncharov
•
Draft
5 of 13 tasks
[PyTorch Debug] More advanced stats for Quantized Tensors
#1897
opened Jun 26, 2025 by
pggPL
Loading…
2 of 13 tasks
Handle dtypes more carefully in multi-tensor Adam
bug
Something isn't working
#1888
opened Jun 17, 2025 by
timmoon10
Loading…
6 of 13 tasks
[PyTorch] Add save_original_input in Linear/GroupedLinear to save memory
#1865
opened Jun 11, 2025 by
hxbai
Loading…
8 of 13 tasks
[JAX] Collective GEMM custom op + primitive + minimal supporting functions
jax
#1846
opened Jun 3, 2025 by
denera
Loading…
5 of 13 tasks
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.