Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bring deepspeed_main up-to-date #746

Merged
merged 5 commits into from
Dec 17, 2022
Merged

Bring deepspeed_main up-to-date #746

merged 5 commits into from
Dec 17, 2022

Conversation

Quentin-Anthony
Copy link
Member

No description provided.

xv44586 and others added 5 commits December 9, 2022 12:12
* Script should error when lns *are* shared

* Update NeoXArgs docs automatically

Co-authored-by: github-actions <github-actions@github.com>
* Add support for Flash attention

* Fix attention type can be both sparse and flash

* Updates from running pre-commit on modified files

* Update README.md

Co-authored-by: Stella Biderman <stellabiderman@gmail.com>
Co-authored-by: Quentin Anthony <qganthony@yahoo.com>
…737)

* Add additional checkpointing arg

* Update NeoXArgs docs automatically

* Update NeoXArgs docs automatically

* Make default lists/dicts cleaner

* Update NeoXArgs docs automatically

* Revert "Make default lists/dicts cleaner"

This reverts commit 1bf3649.

* Update NeoXArgs docs automatically

* Change to checkpoint-scale and checkpoint-factor args

* Update NeoXArgs docs automatically

* fix checkpoint step calculation logic

* add step 0 checkpointing

* Update NeoXArgs docs automatically

* ensure save iters always a set() in computation

* Update NeoXArgs docs automatically

Co-authored-by: github-actions <github-actions@github.com>
Co-authored-by: Quentin Anthony <qganthony@yahoo.com>
Co-authored-by: Hailey Schoelkopf <hailey@slurm-login-0.slurm-login.tenant-stabilitytraining-704a100.svc.tenant.chi.local>
@Quentin-Anthony Quentin-Anthony requested a review from a team as a code owner December 17, 2022 16:20
@Quentin-Anthony Quentin-Anthony requested review from StellaAthena and ShivanshuPurohit and removed request for a team December 17, 2022 16:20
@Quentin-Anthony Quentin-Anthony merged commit f6a8f5d into deepspeed_main Dec 17, 2022
@StellaAthena StellaAthena added this to the Release V2 milestone Dec 20, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants