Ornith-1.0 Collection Ornith-1.0 is a family of open-source LLMs specialized for agentic coding. • 8 items • Updated 2 days ago • 243
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company Paper • 2604.22446 • Published Apr 24 • 124
view article Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • Apr 24 • 50
DFlash Collection Block Diffusion for Flash Speculative Decoding • 23 items • Updated about 20 hours ago • 140
view article Article AI and the Future of Cybersecurity: Why Openness Matters +1 meg, yjernite, clem • Apr 21 • 42
Nemotron-Personas Collection A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions. • 10 items • Updated 12 days ago • 56
PGC Psychiatric GWAS Summary Statistics Collection ~1 billion rows of genome-wide association study (GWAS) NOTE: We are in the process to transfer these datasets to the Psychiatric Genomics Consortiu • 12 items • Updated 8 days ago • 92
view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face dvgodoy • Feb 11, 2025 • 124
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 165
view article Article Nemotron-Personas: Improve AI Training With the First Synthetic Personas Dataset Aligned to Real-World Distributions nvidia • Jun 10, 2025 • 25