You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -120,7 +120,6 @@ The axis labels of models are ordered by their agreement with human preferences,
120
120
121
121
Note:
122
122
*`Human` here means the human annotators who label the preference data of the HelpSteer2-Preference dataset. Note that humans typically have diverse preferences and different LLMs are aligned with different human annotators. So this heatmap is just a reference based on the HelpSteer2-Preference dataset and does not imply any particular LLM is poorly aligned with human preferences.
123
-
* Some LLMs do not follow our prompt template well ... We demonstrate the success rate below to let readers be aware that the metric computation for them is not as reliable as other LLMs
124
123
125
124
**Preference Similarity Visualization with UMAP.** To further enhance our understanding of these relationships, we employed UMAP dimensionality reduction to project the preference patterns into a more interpretable 2D space:
0 commit comments