Qwen/Qwen3-30B-A3B-Thinking-2507-FP8 Text Generation • 31B • Updated Jul 30, 2025 • 40.3k • 59
ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 200k • 1.1k
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 5 items • Updated 3 days ago • 165
Running on CPU Upgrade Featured 999 Model Memory Utility 🚀 999 Calculate vRAM needed for model training and inference
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Paper • 2403.03206 • Published Mar 5, 2024 • 71
Toward General Instruction-Following Alignment for Retrieval-Augmented Generation Paper • 2410.09584 • Published Oct 12, 2024 • 48