This ALEA project contains the complete source code used to collect and preprocess the training data for the KL3M embedding and generative models.
Pending arXiv submission
TODO: Table
- us/fdlp: US Federal Depository Library Program (FDLP) via GPO
- us/govinfo: US Government Publishing Office (GPO) data via GovInfo API
- us/usc: US Code releases via Office of the Law Revision Counsel (OLRC)
- us/ecfr: Electronic Code of Federal Regulations (eCFR) via NARA/GPO API
- us/fr: Federal Register data via NARA/GPO API
- us/edgar: SEC EDGAR data via SEC feed
- us/recap: RECAP raw documents via S3
- us/recap-docs: RECAP attached docs (Word, WordPerfect, PDF, MP3) via S3
- us/pacer-dockets: PACER docket sheets via archive.org
- us/patent-grants: USPTO patent grants via USPTO bulk data
- us/dotgov: filtered .gov TLD domains via direct retrieval
- uk/legislation: All enacted UK legislation via legislation.gov.uk bulk download
- eu/eurlex_oj: EU Official Journal via Cellar/EU API
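Several of the sources above are retrieved through public REST APIs. As an illustration, the sketch below builds a request URL for the GovInfo collections endpoint used for GPO data; the endpoint shape follows the public GovInfo API documentation, but the helper function, its name, and its defaults are hypothetical and not part of this project's code.

```python
from urllib.parse import urlencode

# Public GovInfo API host (see api.govinfo.gov); an API key is required.
GOVINFO_BASE = "https://api.govinfo.gov"


def govinfo_collection_url(collection: str, start_date: str,
                           api_key: str, page_size: int = 100) -> str:
    """Build a GovInfo collections query URL (hypothetical helper).

    `collection` is a GovInfo collection code such as "FR" or "CFR";
    `start_date` is an ISO-8601 timestamp bounding lastModified.
    """
    # offset/pageSize/api_key are standard query parameters on this endpoint.
    query = urlencode({"offset": 0, "pageSize": page_size, "api_key": api_key})
    return f"{GOVINFO_BASE}/collections/{collection}/{start_date}?{query}"
```

A real collection step would pair this with an HTTP client and follow the pagination fields the API returns in each response page.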
TODO
TODO
The source code for this ALEA project is released under the MIT License. See the LICENSE file for details.
Top-level dependencies are all licensed under MIT, BSD-3, or Apache 2.0. See `poetry show --tree` for details.
If you encounter any issues or have questions about using this ALEA project, please open an issue on GitHub.
To learn more about ALEA and our KL3M models and data, visit the ALEA website.