
Updated token.py for gradient calculations #2574

Merged 1 commit into flairNLP:master on Dec 29, 2021

Conversation

vsamarths
Contributor

The TransformerWordEmbeddings forward pass was not placed inside the gradient_context, so it was not wrapped in torch.no_grad() when fine_tune=False is set. This fix improves GPU memory usage and speed.

moved fwd pass to no_grad
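
For context, a minimal sketch of the pattern (not the actual flair code; the class name, the fine_tune flag handling, and the last_hidden_state access are illustrative assumptions): the transformer forward pass runs under torch.no_grad() unless fine-tuning is requested, so no activation graph is kept while the transformer weights are frozen.

```python
import torch


class FrozenOrTunedEmbeddings(torch.nn.Module):
    """Minimal sketch: pick the gradient context from a fine_tune flag.

    The wrapped `model` and its `last_hidden_state` output assume a
    Hugging Face transformer; names here are illustrative, not flair's API.
    """

    def __init__(self, model: torch.nn.Module, fine_tune: bool = False):
        super().__init__()
        self.model = model
        self.fine_tune = fine_tune

    def forward(self, input_ids: torch.Tensor) -> torch.Tensor:
        # When not fine-tuning, run the transformer under torch.no_grad():
        # no activations are stored for backprop, which saves GPU memory
        # and speeds up the forward pass.
        gradient_context = torch.enable_grad() if self.fine_tune else torch.no_grad()
        with gradient_context:
            hidden_states = self.model(input_ids).last_hidden_state
        return hidden_states
```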
@alanakbik
Collaborator

@vsamarths thanks for improving this!

@helpmefindaname can you take a look? (Since you're refactoring the TransformerEmbeddings at the moment ;))

@helpmefindaname
Collaborator

This looks good, I've added the change to my PR.

@alanakbik
Collaborator

Great! I'll also merge this to master to recognize the contribution! Thanks a lot @vsamarths!

@alanakbik alanakbik merged commit bba5b5c into flairNLP:master Dec 29, 2021