
Bf/combine transformer embeddings #2558

Merged

Conversation

@helpmefindaname (Collaborator) commented Dec 16, 2021

This PR creates a transformer embedding that combines both TransformerWordEmbeddings and TransformerDocumentEmbeddings.

It should be able to:

  • still load all embeddings saved with previous versions
  • no longer require loading the tokenizer, so Cannot load NER model from local file #2445 should be fixed
  • allow using document embeddings and token embeddings at once (might be useful for Multitask Learning in flair #2508; see the usage sketch below)
  • allow using long sequences / document context for DocumentClassification too (if the pooling is max or mean)
  • make the code more maintainable, as there is no duplicated code

Current state:

  • first implementation
  • run tests
  • manually test training with all kinds of features
  • test loading old embeddings / test backwards compatibility
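
For context, a minimal sketch of how the two embedding types that this PR unifies are typically used on the same sentence (the model name and parameters here are illustrative, not part of this PR):

from flair.data import Sentence
from flair.embeddings import TransformerWordEmbeddings, TransformerDocumentEmbeddings

# token-level embeddings: one vector per token
word_embeddings = TransformerWordEmbeddings(model='distilbert-base-uncased', layers="-1")

# document-level embeddings: one vector for the whole sentence
document_embeddings = TransformerDocumentEmbeddings(model='distilbert-base-uncased')

sentence = Sentence("Berlin is the capital of Germany .")
word_embeddings.embed(sentence)
document_embeddings.embed(sentence)

print(sentence[0].embedding.shape)  # embedding of the first token
print(sentence.embedding.shape)     # embedding of the whole sentence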

@helpmefindaname force-pushed the bf/combine_transformer_embeddings branch 3 times, most recently from 6de0b1e to 3803b00 on December 28, 2021 09:33
@helpmefindaname marked this pull request as ready for review on December 29, 2021 11:51
@alanakbik (Collaborator) left a comment

Thanks for adding this! Still testing, but found an error that appears with the following code:

from flair.data import Sentence
from flair.embeddings import TransformerWordEmbeddings

embeddings = TransformerWordEmbeddings(model='xlm-roberta-base',
                                       layers="-1",
                                       subtoken_pooling="first",
                                       fine_tune=True,
                                       use_context=False,
                                       )
text = "."
sentence = Sentence(text)
embeddings.embed(sentence)

Suggestion to solve this (I think) added in-line.

(Review threads on flair/data.py and flair/embeddings/base.py; resolved.)
@alanakbik (Collaborator) left a comment

Thanks a lot for refactoring this!

@alanakbik merged commit 7d5746f into flairNLP:master on Dec 30, 2021
@alanakbik (Collaborator) commented:

@helpmefindaname I found another error. It seems the fix for the previous error now broke sentences that are too long (over 512 subtokens).

Reproducible with this script:

from flair.data import Sentence
from flair.embeddings import TransformerWordEmbeddings

# example transformer embeddings
embeddings = TransformerWordEmbeddings(model='distilbert-base-uncased')

# create sentence with more than 512 subtokens
long_sentence = Sentence('a ' * 513)

# embed
embeddings.embed(long_sentence)

Throws the same assertion error as previously, i.e.:

  File ".../flair/flair/embeddings/base.py", line 769, in _add_embeddings_internal
    self._add_embeddings_to_sentences(expanded_sentences)
  File ".../flair/flair/embeddings/base.py", line 728, in _add_embeddings_to_sentences
    self._extract_token_embeddings(sentence_hidden_states, sentences, all_token_subtoken_lengths)
  File ".../flair/flair/embeddings/base.py", line 656, in _extract_token_embeddings
    assert subword_start_idx < subword_end_idx <= sentence_hidden_state.size()[1]
AssertionError
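
For reference, a quick check with the Hugging Face tokenizer (illustrative only, not part of the flair code) confirms that the sentence exceeds DistilBERT's 512-subtoken limit and therefore has to be split or strided by the embedding class:

from transformers import AutoTokenizer

# tokenize the same text that the embedding passes to the model
tokenizer = AutoTokenizer.from_pretrained('distilbert-base-uncased')
input_ids = tokenizer('a ' * 513)['input_ids']
print(len(input_ids))  # > 512: 513 word pieces plus [CLS] and [SEP]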

Any ideas how to fix this?

@isaac47 commented Sep 20, 2022

Hi,
is this merged pull request complete? How can I use it?

@helpmefindaname deleted the bf/combine_transformer_embeddings branch on November 28, 2022 10:45