Skip to content

Releases: segment-any-text/wtpsplit

Release 0.5.2

01 Nov 10:18
Compare
Choose a tag to compare
  • Split sequence data is now stored in the ONNX file instead of being hardcoded: #21
  • Added verbose argument to the split(..) method of the Python bindings to display a progress bar
  • Retrained Chinese model with properly removed punctuation
  • Retrained German model with Compound Splitting as additional split level
  • docs.rs documentation now has all features enabled
  • Added methods to get the levels of the current models:

Python: splitter.get_levels()
JS: splitter.getLevels()
Rust: splitter.logic().split_sequence().get_levels()

Release 0.5.1

20 Oct 20:43
Compare
Choose a tag to compare

Introduce model versioning: With the new model architecture, old Rust releases broke because models were always fetched from the master branch. Sorry! Now they are versioned along with the library so this won't happen again. Please upgrade to this version to use the new models.

Update German and English models.

Release 0.5.0

18 Oct 21:44
Compare
Choose a tag to compare

Add five new languages:

  • Norwegian
  • Swedish
  • Turkish
  • Chinese
  • French

Retrain all models with new downsampling trick, improves Accuracy significantly at roughly the same speed.

Release 0.4.12

22 Sep 08:04
Compare
Choose a tag to compare

Add missing sigmoid to JS.

Release 0.4.10 - Better JS docs and tested Node.js support

21 Sep 16:08
Compare
Choose a tag to compare
remove outdated release instructions (now in CI :) )

Release 0.4.9

21 Sep 09:13
ba052e7
Compare
Choose a tag to compare

Testing release CI.

Release 0.4.8

21 Sep 08:44
Compare
Choose a tag to compare

Testing release CI.