包含自定义词表,以及自己实现的tokenize,detokenize。
pretrain_pipeline.py是流式输入数据。
各个程序直接使用Python运行即可,具体配置到代码里调整。
-
Notifications
You must be signed in to change notification settings - Fork 0
couldn/t5_pretrain
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
t5 pretrain ,torch ,transformer implement
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published