The official PyTorch implementation of "Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence"
efficient tree-search gg prm self-consistency confidence dvts rl-training llm inference-time-compute grpo test-time-scaling guided-by-gut
Updated Jun 9, 2025 - Python