Rethinking the Role of Efficient Attention in Hybrid Architectures Paper • 2606.15378 • Published 6 days ago • 12
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients Paper • 2606.18216 • Published 3 days ago • 48
Who Should Lead Decoding Now? Tracking Reliable Trajectories for Ensembling Masked Diffusion Language Models Paper • 2606.16281 • Published 4 days ago • 31
Direct 3D-Aware Object Insertion via Decomposed Visual Proxies Paper • 2606.06601 • Published 15 days ago • 26
Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning Paper • 2606.04923 • Published 16 days ago • 39
From Activation to Causality: Discovery of Causal Visual Representations in the Human Brain Paper • 2605.23895 • Published 28 days ago • 52
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 24 days ago • 142
Running on Zero Agents Featured 62 L2P - Z-Image 6B Pixel-Space 🎨 62 End-to-end pixel-space 6B diffusion via L2P
Toto 2.0: Time Series Forecasting Enters the Scaling Era Paper • 2605.20119 • Published about 1 month ago • 39
Enhancing Train-Free Infinite-Frame Generation for Consistent Long Videos Paper • 2605.18233 • Published May 18 • 92
GATES: Self-Distillation under Privileged Context with Consensus Gating Paper • 2602.20574 • Published Feb 24 • 1
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published May 12 • 196