deepseek-ai/DeepSeek-V3.1-Terminus Text Generation • 685B • Updated Sep 29, 2025 • 22.6k • 358
Qwen/Qwen3-Next-80B-A3B-Instruct Text Generation • 81B • Updated Sep 17, 2025 • 2.95M • 888
Running • 215 • FineVision: Open Data is All You Need • A new open-source dataset for training VLMs
google/embeddinggemma-300m Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 703k • 1.4k
Running on Zero • Featured • 801 • Qwen Image Edit • Edit and enhance images based on descriptive instructions
ngxson/Home-Cook-Mistral-Small-Omni-24B-2507-GGUF Any-to-Any • 24B • Updated Jul 28, 2025 • 222 • 26
Running • 3.63k • The Ultra-Scale Playbook • The ultimate guide to training LLM on large GPU Clusters