V4.34 upstream rebase #95

bfineran · 2023-10-26T17:40:58Z

applies all current NM-transformers commits to a copy of upstream/v4.34-release

after approval from the team, main will be set to this branch. The ~10 commits that make these changes can also be squashed if necessary.
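For reviewers who want to reproduce the branch locally, here is a rough sketch of the workflow described above. It only builds the git commands and does not run them. The PR does not say whether the commits were replayed with `git cherry-pick` or `git rebase`, so cherry-picking is an assumption here, as are the helper name `rebase_plan` and the placeholder commit IDs. The release ref follows the V4.34 in the PR title and branch name.

```python
from typing import List


def rebase_plan(upstream_ref: str, fork_commits: List[str], branch: str) -> List[List[str]]:
    """Build the git commands that replay fork commits on an upstream ref.

    Nothing is executed here; the caller can print the plan or pass each
    entry to subprocess.run.
    """
    # "upstream/v4.34-release" -> remote "upstream"
    remote = upstream_ref.split("/", 1)[0]
    return [
        ["git", "fetch", remote],
        # Start the new branch from an untouched copy of the upstream release.
        ["git", "checkout", "-b", branch, upstream_ref],
        # Replay the fork-only commits, oldest first.
        ["git", "cherry-pick", *fork_commits],
    ]


plan = rebase_plan(
    upstream_ref="upstream/v4.34-release",
    fork_commits=["<sha-1>", "<sha-2>"],  # placeholders, not real hashes
    branch="v4.34-upstream-rebase",
)
for cmd in plan:
    print(" ".join(cmd))
```

Setting main to the result would be a separate force push (for example `git push --force origin v4.34-upstream-rebase:main`). That step is left out of the plan because it rewrites shared history and should only happen after approval.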

* Add recipe_name to default file names
* Upgrade to transformers release V4.30.2 (#62)
* Update trainer and model flows to accommodate sparseml
Disable FP16 on QAT start (#12)
* Override LRScheduler when using LRModifiers
* Disable FP16 on QAT start
* keep wrapped scaler object for training after disabling
Using QATMatMul in DistilBERT model class (#41)
Removed double quantization of output of context layer. (#45)
Fix DataParallel validation forward signatures (#47)
* Fix: DataParallel validation forward signatures
* Update: generalize forward_fn selection
Best model after epoch (#46)
fix sclaer check for non fp16 mode in trainer (#38)
Mobilebert QAT (#55)
* Remove duplicate quantization of vocabulary.
enable a QATWrapper for non-parameterized matmuls in BERT self attention (#9)
* Utils and auxillary changes
update Zoo stub loading for SparseZoo 1.1 refactor (#54)
add flag to signal NM integration is active (#32)
Add recipe_name to file names
* Fix errors introduced in manual cherry-pick upgrade
Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com>
* update build versions for NM fork pypi push (#74)
* fix nightly package name (#75)
* add make build command (#76)
* add GHA workflow files to build nightly and release packages (#77)
* add GHA workflow files to build nightly and release packages
* fix name
---------
Co-authored-by: dhuang <dhuang@MacBook-Pro-2.local>
* bump up version to 1.6.0 (#79)
Co-authored-by: dhuang <dhuang@MacBook-Pro-2.local>
---------
Co-authored-by: Konstantin <konstantin@neuralmagic.com>
Co-authored-by: Konstantin Gulin <66528950+KSGulin@users.noreply.github.com>
Co-authored-by: dhuangnm <74931910+dhuangnm@users.noreply.github.com>
Co-authored-by: dhuang <dhuang@MacBook-Pro-2.local>

bfineran · 2023-10-27T13:32:44Z

force pushed to main

bfineran and others added 10 commits October 26, 2023 13:36

minor improvements for build workflow files (#83)

1a33c78

Co-authored-by: dhuang <dhuang@MacBook-Pro-2.local>

fix minor issue (#84)

a93ceb0

Co-authored-by: dhuang <dhuang@MacBook-Pro-2.local>

OPT with quantizable MatMuls (#85)

b8ab0a1

fix a minor issue for release build (#86)

da951b8

Co-authored-by: dhuang <dhuang@MacBook-Pro-2.local>

update version in version.py

aae45de

Testmo (#91)

cbc781b

* improve GHA workflow files to build nightly and release, and report status to testmo
* clean up
* report exit code
* Assign value to exit_code
---------
Co-authored-by: dhuang <dhuang@MacBook-Pro-2.local>

Update trainer.py - fix DistributedSampler import (#93)

2e04144

DistributedSampler is used but not imported in `trainer.py`
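A missing import like this only fails with a `NameError` once the affected code path runs, which is why it can slip past review. Below is a minimal sketch of a static check that would flag it, using only the standard-library `ast` module. The helper `undefined_names` and the `SNIPPET` text are illustrative assumptions, not the real `trainer.py`. The check ignores scoping and some binding forms (for example `except ... as e`), so a proper linter such as pyflakes is the better tool in practice.

```python
import ast
import builtins


def undefined_names(source: str) -> set:
    """Return names that are read somewhere in `source` but never bound in it.

    Deliberately rough: scoping is ignored, so a binding anywhere in the
    module counts as defining the name.
    """
    tree = ast.parse(source)
    bound = set(dir(builtins))
    used = set()

    for node in ast.walk(tree):
        if isinstance(node, (ast.Import, ast.ImportFrom)):
            for alias in node.names:
                # `import a.b` binds `a`; `import a as x` binds `x`.
                bound.add((alias.asname or alias.name).split(".")[0])
        elif isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
            bound.add(node.name)
        elif isinstance(node, ast.arg):
            bound.add(node.arg)
        elif isinstance(node, ast.Name):
            if isinstance(node.ctx, ast.Load):
                used.add(node.id)
            else:  # Store or Del context binds the name
                bound.add(node.id)

    return used - bound


# Hypothetical stand-in for the affected code path, not the real trainer.py.
SNIPPET = """
from torch.utils.data import RandomSampler

def get_sampler(dataset, distributed):
    if distributed:
        return DistributedSampler(dataset)
    return RandomSampler(dataset)
"""

print(undefined_names(SNIPPET))  # {'DistributedSampler'}
```

Adding `from torch.utils.data.distributed import DistributedSampler` to the top of the snippet makes the check return an empty set. The source is only parsed, never imported, so the check runs without PyTorch installed.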

Research/llama/bmm quantization (#94)

1bccd07

* Quantize attention matmuls
* Quantize attention matmuls

bump base transformers version

ce4033d

bfineran requested review from Satrat, eldarkurtic, anmarques, dsikka, rahul-tuli and dbogunowicz October 26, 2023 17:40

bfineran self-assigned this Oct 26, 2023

bfineran closed this Oct 27, 2023

dbogunowicz deleted the v4.34-upstream-rebase branch December 5, 2023 10:29
