Qwen/Qwen3-VL-235B-A22B-Instruct Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 230k • 348
PAI-Bench: A Comprehensive Benchmark For Physical AI Paper • 2512.01989 • Published Dec 1, 2025 • 5
WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning Paper • 2512.02425 • Published about 1 month ago • 23
RedHatAI/Qwen2.5-VL-72B-Instruct-FP8-dynamic Image-to-Text • 73B • Updated Apr 25, 2025 • 9.01k • 15
Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising Paper • 2511.08633 • Published Nov 9, 2025 • 54
Adaptive Multi-Agent Response Refinement in Conversational Systems Paper • 2511.08319 • Published Nov 11, 2025 • 41
Latent Diffusion Model without Variational Autoencoder Paper • 2510.15301 • Published Oct 17, 2025 • 49
Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs Paper • 2510.09201 • Published Oct 10, 2025 • 49
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs Paper • 2510.07499 • Published Oct 8, 2025 • 48
ACON: Optimizing Context Compression for Long-horizon LLM Agents Paper • 2510.00615 • Published Oct 1, 2025 • 32