
Transfer Learning fails when training conditional model based on dataset labels #98

Open
ageroul opened this issue Apr 28, 2021 · 10 comments


@ageroul

ageroul commented Apr 28, 2021

Hi,
I have prepared my dataset with dataset_tool.py. Dimensions are 256x256 and there are 5 classes (labels). The dataset.json file is also fine. Here is the problem:
When running python train.py and my Transfer Learning source network is ffhq256 the execution fails pretty soon (in the beginning of "Constructing networks") with this error:
RuntimeError: The size of tensor a (1024) must match the size of tensor b (512) at non-singleton dimension 1
When I run the same code with the option cond='False' (ignoring the dataset labels), the problem disappears and the transfer learning proceeds without error.
What is the problem here?
Thanks in advance!
PS: I also tried ffhq512 (with the option cond="True"), but then I get an error again: RuntimeError: The size of tensor a (256) must match the size of tensor b (512) at non-singleton dimension 0

@wdf19961118

You can set --gpus=1 and try again. I see training_set_iterator = iter(torch.utils.data.DataLoader(dataset=training_set, sampler=training_set_sampler, batch_size=batch_size//num_gpus, **data_loader_kwargs)) in training_loop.py; maybe that is the reason for your error. Good luck!

@ageroul

ageroul commented Apr 29, 2021

You can set --gpus=1 and try again. I see training_set_iterator = iter(torch.utils.data.DataLoader(dataset=training_set, sampler=training_set_sampler, batch_size=batch_size//num_gpus, **data_loader_kwargs)) in training_loop.py; maybe that is the reason for your error. Good luck!

Thanks for the answer,
Unfortunately, this is not the issue, as I had already set the option --gpus=1 in train.py.

@chengkeng

I also encountered the same situation.

@chengkeng

This must be re-trained, remove "--resume=xxx"

@ageroul

ageroul commented Apr 30, 2021

This must be re-trained, remove "--resume=xxx"

If it "must" be retrained then there is no transfer learning happening...

@wdf19961118

You want to train a conditional model initialized from an unconditional model (ffhq256), right? However, the structure of a conditional model is different from that of an unconditional one. You can print the two structures and compare them.
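For illustration, here is a minimal sketch of the structural difference in plain PyTorch. The helper below is hypothetical (not the repo's actual code), and the 512-wide shapes assume stylegan2-ada-pytorch's defaults (z_dim = w_dim = 512, with the label embedded to w_dim features):

```python
import torch.nn as nn

def mapping_first_layer(z_dim=512, w_dim=512, c_dim=0):
    # Hypothetical helper: a conditional model embeds the class label to
    # w_dim features and concatenates it with z, so the first fully
    # connected layer of the mapping network sees z_dim + w_dim inputs.
    in_features = z_dim + (w_dim if c_dim > 0 else 0)
    return nn.Linear(in_features, w_dim)

print(mapping_first_layer(c_dim=0).weight.shape)  # torch.Size([512, 512])
print(mapping_first_layer(c_dim=5).weight.shape)  # torch.Size([512, 1024])
```

The pretrained ffhq256 checkpoint only contains the narrower (512-input) weights, so they cannot be copied into the wider conditional layer.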

@Gass2109

Because the conditional model's mapping network takes as input the concatenation (along the feature dimension) of the label embedding (bs, 512) and the latent code (bs, 512), which gives a tensor of shape (bs, 1024). The unconditional model's mapping network takes only the latent code (bs, 512), so its pretrained weights do not fit. Hope that helps :)
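The mismatch then surfaces when resuming tries to copy the pretrained weights into the new network. A minimal reproduction with plain torch tensors standing in for the real mapping-network weights (the 512-wide shapes are assumptions based on the default z_dim/w_dim):

```python
import torch

w_dim, z_dim = 512, 512
cond_weight = torch.empty(w_dim, z_dim + w_dim)  # conditional first layer: (512, 1024)
ffhq_weight = torch.randn(w_dim, z_dim)          # pretrained unconditional: (512, 512)

try:
    cond_weight.copy_(ffhq_weight)  # roughly what resuming attempts
except RuntimeError as e:
    # Prints the same kind of size-mismatch error as reported above:
    # tensor a (1024) vs tensor b (512) at non-singleton dimension 1.
    print(e)
```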

@thusinh1969

I closed my question because this is the reason !

Steve

@wenhaoyong

I encountered a similar problem and I fixed it with the option cond="True". Thx.

@49xxy

49xxy commented Sep 6, 2022

I encountered a similar situation and fixed it with the option cond="True". Thanks.

How did you solve it? I sincerely hope to get your help
