DLLM-Searcher: Adapting Diffusion Large Language Model for Search Agents Paper • 2602.07035 • Published 9 days ago • 27
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published 30 days ago • 147
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning Paper • 2601.09667 • Published 29 days ago • 89
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published Nov 14, 2025 • 187
MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research Paper • 2505.19955 • Published May 26, 2025 • 14
ConfTuner: Training Large Language Models to Express Their Confidence Verbally Paper • 2508.18847 • Published Aug 26, 2025 • 2
ConfTuner: Training Large Language Models to Express Their Confidence Verbally Paper • 2508.18847 • Published Aug 26, 2025 • 2