ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model Paper • 2603.22281 • Published 19 days ago • 17
Camouflaged Image Synthesis Is All You Need to Boost Camouflaged Detection Paper • 2308.06701 • Published Aug 13, 2023 • 1
OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising Paper • 2404.02227 • Published Apr 2, 2024
Dense Video Understanding with Gated Residual Tokenization Paper • 2509.14199 • Published Sep 17, 2025 • 3
Neural Discrete Token Representation Learning for Extreme Token Reduction in Video Large Language Models Paper • 2503.16980 • Published Mar 21, 2025 • 4