Translation as a Bridging Action: Transferring Manipulation Skills from Humans to Robots Paper • 2606.28133 • Published 9 days ago • 39
UnityShots: Memory-Driven Multi-Shot Audio-Video Generation with Boundary-Aware Gating Paper • 2606.21661 • Published 16 days ago • 28
Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models Paper • 2606.25041 • Published 12 days ago • 115
RoPE-Aware Bit Allocation for KV-Cache Quantization Paper • 2606.24033 • Published 12 days ago • 8 • 2
OmniGameArena: A Unified UE5 Benchmark for VLM Game Agents with Improvement Dynamics Paper • 2606.09826 • Published 27 days ago • 19
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published May 26 • 145
On-Policy Adversarial Flow Distillation for Autoregressive Video Generation Paper • 2605.26105 • Published May 25 • 19