VidHalluc: Evaluating Temporal Hallucinations in Multimodal Large Language Models for Video Understanding • Paper 2412.03735 • Published Dec 4, 2024
Concrete Jungle: Towards Concreteness Paved Contrastive Negative Mining for Compositional Understanding • Paper 2604.13313 • Published 10 days ago