VISTA: View-Consistent Self-Verified Training for GUI Grounding Paper • 2606.14579 • Published 14 days ago • 8
N-GRPO: Embedding-Level Neighbor Mixing for Enhanced Policy Optimization Paper • 2606.10768 • Published 17 days ago • 24
Unified Generation and Self-Verification for Vision-Language Models via Advantage Decoupled Preference Optimization Paper • 2601.01483 • Published Jan 4 • 1