PRBench: End-to-end Paper Reproduction in Physics Research Paper • 2603.27646 • Published 5 days ago • 27
On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models Paper • 2603.27481 • Published 5 days ago • 34
VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward Paper • 2603.26599 • Published 7 days ago • 49
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published 14 days ago • 305
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published 5 days ago • 125
Emergent Social Intelligence Risks in Generative Multi-Agent Systems Paper • 2603.27771 • Published 5 days ago • 48
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills Paper • 2603.25158 • Published 8 days ago • 46
DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models Paper • 2603.23499 • Published 10 days ago • 50
Vega: Learning to Drive with Natural Language Instructions Paper • 2603.25741 • Published 8 days ago • 6
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale Paper • 2603.25040 • Published 8 days ago • 125
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation Paper • 2603.22117 • Published 11 days ago • 28
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published 11 days ago • 120
Alignment Makes Language Models Normative, Not Descriptive Paper • 2603.17218 • Published 16 days ago • 46
Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding Paper • 2603.13366 • Published 25 days ago • 94
WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigation Paper • 2603.15132 • Published 18 days ago • 35
Unified Spatio-Temporal Token Scoring for Efficient Video VLMs Paper • 2603.18004 • Published 16 days ago • 12
Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context Paper • 2603.15653 • Published 27 days ago • 12
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published 15 days ago • 65