Text generating transformer

This is a decoder-only transformer with simplest character-level tokenization. Script has model training and text generation examples.

How to use:

1) Install dependences:

pip install numpy torch torchinfo matplotlib

2) Configure:

# main.py

# configure model

tgt = MyTGT(data, 
		path = 'model.pt',   # path to the model, to train new model you have to delete the previous one
		context_size = 128,  # context size, bigger context usually provides more coherent generation
		batch_size = 64,     # batch size, higher values provide faster training and lower quality and vice versa
		d_model = 512,       # model depth, higher values provide slower training and higher quality and vice versa
		n_heads = 4,         # ibid
		n_layers = 3,        # ibid
		d_ffn = 512,         # ibid
		lr = 1e-4)           # learning rate, higher values provide fast but worse generalization

# configure training

# epochs, higher value means longer training time and usually better results
tgt.train(epochs = 30, plot=True, verbose = 2, print_every = 128)

# configure generation

# place any text in seed to generate next 'size' symbols, ensure that seed is smaller than context size
text = tgt.generate(seed = 'Yes, master, tell me ', size=128, temperature=1.1)

3) Run:

python main.py

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
assets		assets
script		script
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Text generating transformer

How to use:

1) Install dependences:

2) Configure:

3) Run:

4) Training process:

5) Results:

About

Uh oh!

Releases

Packages

Languages

License

gloptim/text_generating_transformer

Folders and files

Latest commit

History

Repository files navigation

Text generating transformer

How to use:

1) Install dependences:

2) Configure:

3) Run:

4) Training process:

5) Results:

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages