Flipping the Dialogue: Training and Evaluating User Language Models Paper • 2510.06552 • Published Oct 8, 2025 • 2
Building Social World Models with Large Language Models Paper • 2606.11482 • Published 15 days ago • 2
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper • 2604.26752 • Published Apr 29 • 112
Beyond Mode Collapse: Distribution Matching for Diverse Reasoning Paper • 2605.19461 • Published May 19 • 2
HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation Paper • 2603.23871 • Published Mar 25 • 1
Unmasking On-Policy Distillation: Where It Helps, Where It Hurts, and Why Paper • 2605.10889 • Published May 11 • 6
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published Apr 1 • 56