You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was looking at some examples of the dataset, in particular in Spanish, and I noticed some numbers were written as digits in the context text, but written as words in the answers.
For example:
Context
[...] Josh Norman [...] consiguió 4 intercepciones [...]
Question
¿Cuántos balones interceptó Josh Norman?
Answer
['cuatro']
In the whole context text, the number of interceptions by Josh Norman is never written as "cuatro", but only as "4". Therefore, the model couldn't possibly find the span with the right answer. This isn't handled by the normalize_text function either.
The text was updated successfully, but these errors were encountered:
Hello
I was looking at some examples of the dataset, in particular in Spanish, and I noticed some numbers were written as digits in the context text, but written as words in the answers.
For example:
Context
Question
Answer
In the whole context text, the number of interceptions by Josh Norman is never written as "cuatro", but only as "4". Therefore, the model couldn't possibly find the span with the right answer. This isn't handled by the
normalize_text
function either.The text was updated successfully, but these errors were encountered: