Dynamic-LLaVA
Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification • Paper • 2412.00876 • Published Dec 1, 2024
Osilly/Dynamic-LLaVA-7B • 7B • Updated Sep 18, 2025 • 25
Osilly/Dynamic-LLaVA-13B • 13B • Updated Sep 18, 2025 • 4
Osilly/Dynamic-LLaVA-TokenPacker-7B • Updated Sep 18, 2025
Vision-R1
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models • Paper • 2503.06749 • Published Mar 9, 2025 • 31
Osilly/Vision-R1-cold • Preview • Updated Mar 23, 2025 • 122 • 14
Osilly/Vision-R1-7B • 8B • Updated Apr 13, 2025 • 447 • 12
Osilly/Vision-R1-CI-7B • 8B • Updated Jun 24, 2025 • 117
Interleaving Reasoning Generation
Osilly/IRG-Toy-Dataset • Viewer • Updated Sep 14, 2025 • 600 • 57 • 1
Interleaving Reasoning for Better Text-to-Image Generation • Paper • 2509.06945 • Published Sep 8, 2025 • 14