Robust Information-Gain Control Fixing belief trap failures in LLM agents with distributionally robust information gain
What Matters in RL for Diffusion Models? The dominant role of noise in RL post-training for diffusion models
Improving Activation Steering Gated Cropped Attention-Delta Steering fixes KV-cache contamination in multi-turn dialogue
CogGym A scalable framework for comparing AI models against humans using cognitive science experiments