unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-GGUF Text Generation β’ 121B β’ Updated 13 days ago β’ 86.5k β’ 101
Running Featured 69 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems π 69 Who needs 1T parameters? Olympiad proofs with a 4B model
Running on CPU Upgrade Featured 3.08k The Smol Training Playbook π 3.08k The secrets to building world-class LLMs
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm β’ 5 items β’ Updated May 5, 2025 β’ 42
Running on CPU Upgrade Featured 1.29k Open ASR Leaderboard π 1.29k Explore speech model benchmarks and request new evaluations
view article Article Prefill and Decode for Concurrent Requests - Optimizing LLM Performance Apr 16, 2025 β’ 67
view article Article The Transformers Library: standardizing model definitions +2 May 15, 2025 β’ 121