Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning Paper • 2512.20605 • Published 6 days ago • 54
Reinforcement Learning for Self-Improving Agent with Skill Library Paper • 2512.17102 • Published 11 days ago • 27
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 12 days ago • 85
Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability Collection A compilation of sparse auto-encoders trained on large language models. • 37 items • Updated 13 days ago • 16
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 17 items • Updated 6 days ago • 39
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published Feb 20 • 193
ProjectTest: A Project-level LLM Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms Paper • 2502.06556 • Published Feb 10 • 3
MERA Code: A Unified Framework for Evaluating Code Generation Across Tasks Paper • 2507.12284 • Published Jul 16 • 7