kakaocorp/kanana-2-30b-a3b-thinking-2601 Text Generation ⢠31B ⢠Updated 21 days ago ⢠1.21k ⢠54
naver-hyperclovax/HyperCLOVAX-SEED-Think-32B Text Generation ⢠33B ⢠Updated 30 days ago ⢠4.43k ⢠395
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 Text Generation ⢠32B ⢠Updated 17 days ago ⢠571k ⢠613
Runtime error Featured 2.95k The Smol Training Playbook š 2.95k The secrets to building world-class LLMs
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method ⢠26 items ⢠Updated 8 days ago ⢠99