naver-hyperclovax/HyperCLOVAX-SEED-Think-32B Text Generation β’ 33B β’ Updated about 7 hours ago β’ 22.1k β’ 90
view article Article How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day 25 days ago β’ 46
Flash Sparse Attention: An Alternative Efficient Implementation of Native Sparse Attention Kernel Paper β’ 2508.18224 β’ Published Aug 25, 2025 β’ 1
MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation Paper β’ 2511.09611 β’ Published Nov 12, 2025 β’ 69
moonshotai/Kimi-Linear-48B-A3B-Instruct Text Generation β’ 49B β’ Updated 17 days ago β’ 90.7k β’ 515
KORMo: Korean Open Reasoning Model for Everyone Paper β’ 2510.09426 β’ Published Oct 10, 2025 β’ 83