SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published 12 days ago • 93
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 16 days ago • 139
V_{0.5}: Generalist Value Model as a Prior for Sparse RL Rollouts Paper • 2603.10848 • Published Mar 11 • 14
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper • 2603.21065 • Published 23 days ago • 77
Advancing Block Diffusion Language Models for Test-Time Scaling Paper • 2602.09555 • Published Feb 10 • 4
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience Paper • 2601.15876 • Published Jan 22 • 92