view article Article Designing the hf CLI as an agent-optimized way to work with the Hub celinah, Wauplin • 13 days ago • 57
view article Article ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM ibm-research • 20 days ago • 17
view article Article Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler +3 ariG23498, sayakpaul, sergiopaniego, ror, pcuenq • 19 days ago • 112
view article Article Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic ibm-research • 15 days ago • 86
📝 Research & Long-Form Blog Posts Collection In-depth technical articles and research pieces published by Hugging Face • 18 items • Updated 19 days ago • 33
view article Article Open Responses: What you need to know +2 evalstate, burtenshaw, merve, pcuenq • Jan 15 • 112
view article Article Liberate your OpenClaw +6 clem, burtenshaw, pcuenq, jeffboudier, merve, nielsr, victor, mishig • Mar 27 • 47
view article Article Harness, Scaffold, and the AI Agent Terms Worth Getting Right sergiopaniego, ariG23498 • 23 days ago • 113
view article Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • Apr 24 • 50
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers tomaarsen • Apr 16 • 72
view article Article LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone! medmekk, marcsun13 • Mar 7, 2025 • 98
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 908
view article Article 🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do FINAL-Bench • Mar 10 • 38