Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples
Updated Apr 9, 2025 - Python
Ever noticed how an AI changes tone mid-dialogue? ReflexTrust decodes the hidden trust system behind LLM behavior — and shows how alignment actually works.
SIGIR 2025 "Mitigating Source Bias with LLM Alignment"
C3AI: Crafting and Evaluating Constitutions for CAI