You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
an open-source dataset to evaluate LLMs' safety mechanism at a low cost. The dataset consists only of prompts to which responsible language models should not answer.
The text was updated successfully, but these errors were encountered:
Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs
https://arxiv.org/abs/2308.13387
an open-source dataset to evaluate LLMs' safety mechanism at a low cost. The dataset consists only of prompts to which responsible language models should not answer.
The text was updated successfully, but these errors were encountered: