
[Question]: Why is expand_context set to false when training (TransformerEmbeddings)? #3166

Closed
kobiche opened this issue Mar 28, 2023 · 2 comments

Comments

@kobiche

kobiche commented Mar 28, 2023

Question

While debugging the model's behaviour during training, I came across something peculiar: if I understand it correctly, expand_context is set to false while the model is training. Why? Why is the context expanded only during inference?

@kobiche kobiche added the question Further information is requested label Mar 28, 2023
@alanakbik
Collaborator

Hello @kobiche, the context is always expanded during inference. During training, we apply a "context dropout": the context is only expanded if a random number is above the dropout value. The idea is to make the model robust enough to also work without context (for instance, if someone wants to predict for only a single sentence).

@kobiche
Author

kobiche commented Mar 28, 2023

Thanks for the clarification; I think I misinterpreted the if-condition.

@kobiche kobiche closed this as completed Mar 28, 2023
alanakbik added a commit that referenced this issue Mar 28, 2023