create packaged models from checkpoint #94

AdrianKs · 2020-04-15T10:53:33Z

Allows to create a packaged model, which only contains the model, entity/relation ids and the config.
It is not possible to resume training on a packaged model but it can be evaluated.

example command:
kge package checkpoint_best.pt

normal checkpoints also store entity/relation ids now

kge/job/train.py

rgemulla · 2020-04-15T11:06:12Z

How are pacakged models used? Maybe we should rename KgeModel.load_from_checkpoint to KgeModel.load_from and support checkpoints and packages (including creation of required "virtual" datasets) there.

rgemulla · 2020-04-15T11:09:24Z

TrainingJob.load should offload some functionality to KgeModel.load_from. And I guess KgeModel.load_from should accept both filenames and loaded files (to facilitate this).

AdrianKs · 2020-04-15T13:27:31Z

TrainingJob.load should offload some functionality to KgeModel.load_from. And I guess KgeModel.load_from should accept both filenames and loaded files (to facilitate this).

This could be done, but I am not sure if we should do it.
KgeModel.load_from creates a new model based on the config file provided in the checkpoint.
At the time we call TrainingJob.load we already have a model created which we just update with the parameters in the checkpoint.

If we want to move this to the load_from function we would also need to provide the new config file and there we unnecessary create a new model.
We would only offload the functionality of the model loading to load_from

This creates an overhead with no big advantages.

rgemulla · 2020-04-15T16:14:03Z

I see. We still need load_from to work with packages (that's a key point of having packages). And we should try to minimize code duplication. Ideas welcome!

AdrianKs · 2020-04-16T08:16:00Z

This does not help much with the code duplication but with the designflaw having the evaluation job use the trainjob resume.
lets move most of the content from trainjob.resume to the new method job.resume.
In job.resume we check which checkpoint we need to load and load it.

trainjob.resume then looks something like this:
def resume(self, checkpoint_file) \ checkpoint = super().resume(checkpoint_file) \ self.load(checkpoint)
evaljob.resume then just looks the same
but the load functions can be different

this does not help much with the load and load_from distinction but in general would be a lot cleaner, even if there is maybe even more duplication.
but the train job types are more independent.

AdrianKs · 2020-04-16T08:31:57Z

actually also the self.load(checkpoint) call could be done in job.resume
then we only have a job specific load function.

rgemulla · 2020-04-16T11:45:04Z

Sounds good. But note that resuming from a checkpoint generally may need to use the config from the checkpoint, not the one from the folder (which may not even exist). When cleaning up the API, this should be considered. We'd need sth. like TrainingJob.load_from(checkpoint) and KgeModel.load_from(checkpoint) as static methods. One advantage is that the training job then does not need to create a model first, it simply calls KgeModel.load_from. Also, the methods should accept both a filename or a loaded checkpoint in their "checkpoint" argument. This way, we'd only need to load the checkpoint once.

AdrianKs · 2020-04-16T12:15:20Z

in that case the function resume() could maybe completly be replaced by the static method load_from, since currently we create a new model and then call resume and afterwards run() anyways. This could be repaced by job = load_from(checkpoint) and afterwards job.run().

rgemulla · 2020-04-16T14:16:32Z

Sounds like a good idea to me.

AdrianKs · 2020-04-20T07:46:17Z

I built a first version to replace the method resume with load_from. This actually resulted in more changes than expected. Please have a look, if this is worth the changes.

AdrianKs · 2020-04-20T08:07:49Z

With these changes we could also make it possible to evaluate checkpoints without the config, I think

kge/cli.py

kge/job/eval.py

kge/job/grid_search.py

kge/job/job_factory.py

kge/job/search.py

kge/job/train.py

kge/model/kge_model.py

rgemulla · 2020-04-20T10:25:36Z

I think it's worth it. Perhaps some of the more generic checkpoint-handling code can be put into a seaparate package (kge.util.io?).

In general, I think we should have KgeModel.load_from_checkpoint(cp), Job.load_from_checkpoint(cp), Dataset.load_from_checkpoint(cp), and Config.load_from_checkpoint(cp). If we also had the reverse methods (save_to_checkpoint) everywhere, we could remove code clutter and code duplication.

kge/model/embedder/lookup_embedder.py

AdrianKs · 2020-04-20T15:48:46Z

I tried to address most of your points.

kge/config.py

kge/cli.py

kge/dataset.py

kge/model/kge_model.py

kge/util/package.py

rgemulla · 2020-04-23T11:35:03Z

I think valid is a sensible default choice.

AdrianKs · 2020-04-23T14:48:35Z

It is now possible to evaluate without a config by calling EntityRankingJob.create_from(checkpoint).

If we later want to change the console api to enable evaluation without a config, we would have to find a way to still support additional commandline options without overwriting the checkpoint config with all the default values

AdrianKs · 2020-04-29T09:31:09Z

the loading of the dataset should now work as expected. In KgeModel.create_from we just always set preload_false. In that way we can create a model without the dataset. If the datasets are later on needed it will throw an IOError

with that everything should be adressed

rgemulla

Looks very good. I added a few final points, mostly ,minor.

kge/cli.py

kge/dataset.py

rgemulla · 2020-04-29T16:27:18Z

kge/dataset.py

+            return checkpoint
+        meta_checkpoint = {}
+        for key in meta_keys:
+            meta_checkpoint[key] = self._map_indexes(None, key)


map_indexes seems to be incorrect here. Do you mean map_indexes? Also, why not just store meta["key"]?

I switched to map_indexes. If we just store meta["key"], we can not make sure, that we actually loaded this data. Could be that we didn't even read the entitiy_ids file, when calling this method.

kge/dataset.py

kge/job/eval.py

kge/job/job.py

kge/model/kge_model.py

rgemulla · 2020-05-22T10:09:28Z

What's the current state here? Can you rebase this off the current master?

# Conflicts: # kge/job/train.py

AdrianKs · 2020-05-22T13:22:41Z

separated loading of checkpoint from create_from and rebased.
Should be done now.

…anKs-package

…-package

- Renamed overwrite_config to config - Update device in config when loading checkpoint

rgemulla · 2020-05-25T16:46:25Z

I made some revisions. Let me know your thoughts. If you are fine, we can merge this PR.

AdrianKs · 2020-05-26T07:07:33Z

Thanks, the revisions look good. I'd say we are ready to merge

rgemulla · 2020-05-26T09:26:40Z

Thanks!

create packaged models from checkpoint

3e11144

rgemulla reviewed Apr 15, 2020

View reviewed changes

kge/job/train.py Outdated Show resolved Hide resolved

kge/job/train.py Outdated Show resolved Hide resolved

kge/job/train.py Outdated Show resolved Hide resolved

adopt packaged models to PR notes

dfa089a

replace method resume with static load_from

2f86e7d

rgemulla reviewed Apr 20, 2020

View reviewed changes

improve resuming from checkpoints

5dcd265

AdrianKs commented Apr 20, 2020

View reviewed changes

kge/model/embedder/lookup_embedder.py Outdated Show resolved Hide resolved

AdrianKs added 2 commits April 20, 2020 17:43

remove accidentally committed remove_key function

a79927a

add load function to auto search

5b8878b

fix resume of search jobs

927c417

rgemulla reviewed Apr 20, 2020

View reviewed changes

AdrianKs added 3 commits April 22, 2020 09:09

improve reuming from checkpoint and packaging of models

cfa552e

add num_entities and num_relations to dataset.save_to

5fa535e

don't package search job checkpoints

80adebb

allow evaluation without config with EntityRankingJob.create_from

45a2883

reformat job.py

3942cbc

rgemulla reviewed Apr 29, 2020

View reviewed changes

address package-PR comments

f87ed13

AdrianKs added 3 commits May 22, 2020 12:52

Merge remote-tracking branch 'remotes/upstream/master' into package

8d807c1

# Conflicts: # kge/job/train.py

separate loading of checkpoint from job.create_from

f2a0924

separate loading of checkpoint from model.create_from

ddee973

rgemulla added 15 commits May 25, 2020 15:05

Merge branch 'package' of https://github.com/AdrianKs/kge-1 into Adri…

ebda06a

…anKs-package

Support loading of old checkpoints with current embedders

cddb2e6

Merge branch 'master' of https://github.com/uma-pi1/kge into AdrianKs…

265a674

…-package

Package revisions part 1

cacc485

- Renamed overwrite_config to config - Update device in config when loading checkpoint

Update docs

dea2d28

Support empty config files

8f4f86d

Documentation udpate

e469544

Renamed some checkpoint functions

5db17f7

SImplify Config.create_from

e90e299

Minor revision of Dataset

a22ff1f

Fix resume of auto search jobs with new API

b0bb3a8

Add some additional keys to packages

30bfeff

Add Config#load_config

277c68d

Consistent method names in Dataset

69666f3

Revised Job loading

ced8f48

rgemulla merged commit fde4565 into uma-pi1:master May 26, 2020

This was referenced May 26, 2020

Support evaluation directly on a checkpoint #85

Open

Provide "packaged" models #73

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

create packaged models from checkpoint #94

create packaged models from checkpoint #94

AdrianKs commented Apr 15, 2020

rgemulla commented Apr 15, 2020 •

edited

Loading

rgemulla commented Apr 15, 2020

AdrianKs commented Apr 15, 2020

rgemulla commented Apr 15, 2020

AdrianKs commented Apr 16, 2020 •

edited

Loading

AdrianKs commented Apr 16, 2020

rgemulla commented Apr 16, 2020

AdrianKs commented Apr 16, 2020 •

edited

Loading

rgemulla commented Apr 16, 2020 via email

AdrianKs commented Apr 20, 2020

AdrianKs commented Apr 20, 2020

rgemulla commented Apr 20, 2020

AdrianKs commented Apr 20, 2020

rgemulla commented Apr 23, 2020 via email

AdrianKs commented Apr 23, 2020

AdrianKs commented Apr 29, 2020

rgemulla left a comment

rgemulla Apr 29, 2020

AdrianKs May 4, 2020

rgemulla commented May 22, 2020

AdrianKs commented May 22, 2020

rgemulla commented May 25, 2020

AdrianKs commented May 26, 2020

rgemulla commented May 26, 2020

create packaged models from checkpoint #94

create packaged models from checkpoint #94

Conversation

AdrianKs commented Apr 15, 2020

rgemulla commented Apr 15, 2020 • edited Loading

rgemulla commented Apr 15, 2020

AdrianKs commented Apr 15, 2020

rgemulla commented Apr 15, 2020

AdrianKs commented Apr 16, 2020 • edited Loading

AdrianKs commented Apr 16, 2020

rgemulla commented Apr 16, 2020

AdrianKs commented Apr 16, 2020 • edited Loading

rgemulla commented Apr 16, 2020 via email

AdrianKs commented Apr 20, 2020

AdrianKs commented Apr 20, 2020

rgemulla commented Apr 20, 2020

AdrianKs commented Apr 20, 2020

rgemulla commented Apr 23, 2020 via email

AdrianKs commented Apr 23, 2020

AdrianKs commented Apr 29, 2020

rgemulla left a comment

Choose a reason for hiding this comment

rgemulla Apr 29, 2020

Choose a reason for hiding this comment

AdrianKs May 4, 2020

Choose a reason for hiding this comment

rgemulla commented May 22, 2020

AdrianKs commented May 22, 2020

rgemulla commented May 25, 2020

AdrianKs commented May 26, 2020

rgemulla commented May 26, 2020

rgemulla commented Apr 15, 2020 •

edited

Loading

AdrianKs commented Apr 16, 2020 •

edited

Loading

AdrianKs commented Apr 16, 2020 •

edited

Loading