Ovis: Structural Embedding Alignment for Multimodal Large Language Model Paper • 2405.20797 • Published May 31, 2024 • 32
FLUX.1 Collection A collection of our FLUX.1 models and LoRAs. • 13 items • Updated Jan 2 • 298
Ovis2.5 Collection Our next-generation MLLMs for native-resolution vision and advanced reasoning • 5 items • Updated Aug 19, 2025 • 57
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper • 2505.02567 • Published May 5, 2025 • 82 • 5