-
-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Why does the Relation Extraction with own ColumnCorpus achieves bad results? #2883
Comments
Hello @geheim01 a few thoughts:
|
|
Yes, the weird sentence printout points to the problem. Try:
instead of
and see if the sentences print out better. We actually use "text" instead of "form" but I think we should probably add support for calling it "form" as well. |
The sentences are now printed properly! However, the results are still the same as in the training log. |
Strange, @dobbersc can you take a look? |
Hey @geheim01,
|
Hey @dobbersc , Unfortunately also the adjustment of the entity_pair_filters to all possible combinations did not improve the situation and the loss remains at 0 as before from the beginning. That's why I have sent you our training script and dataset file via email |
Ok Update: Now that I have once renamed all NER tags in our conll file from, for example, PER's to I-PER's/LOC to I-LOC, I am now not directly getting Loss values of 0 when training the relation extraction model. |
Good catch! In Flair, we use the BIO or BIOES annotation format for spans. For an example see this comment. So
should actually be
I hope you succeed in training your model, now :). |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
Hello,
We want to use Flair to extract relations in sentences. Since we want to use our own German corpus for this, we created it following the comment in #2726 in the ColumnCorpus format. We store the relations in column format as in this example:
and the code to train it is analogous to the tutorial thus:
However, we do not understand why our Loss Rate is constantly at 0. And the results (Precision, Recall) for all relations at 0.
Are we currently preparing our ColumnCorpus incorrectly?
Do you have any other tips we should pay attention to?
The text was updated successfully, but these errors were encountered: