LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 10 days ago • 119
RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation Paper • 2504.17502 • Published Apr 24, 2025 • 55