ViDoRe Benchmark V3 Collection ViDoRe V3 is our latest benchmark, engineered to set a new industry gold standard for multi-modal, enterprise document retrieval evaluation. • 8 items • Updated Nov 5, 2025 • 16
view article Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases Nov 5, 2025 • 57
view article Article How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day 24 days ago • 46
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Paper • 2511.21689 • Published Nov 26, 2025 • 111
OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment Paper • 2510.07743 • Published Oct 9, 2025 • 8
view article Article Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text Oct 20, 2025 • 34
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL +4 Jun 3, 2025 • 96
view article Article Welcome EmbeddingGemma, Google's new efficient embedding model +4 Sep 4, 2025 • 267
view article Article Reinforcement Learning for Large Language Models: Beyond the Agent Paradigm Mar 19, 2025 • 8
RaDeR training datasets Collection These are some of the retrieval training datasets used for training RaDeR models, sonsisting of different types of query combinations. • 3 items • Updated Jun 12, 2025 • 1
JinaVDR (Visual Document Retrieval) Collection max. ~1000 images and OCR text included • 42 items • Updated Jul 20, 2025 • 8
Solving math word problems with process- and outcome-based feedback Paper • 2211.14275 • Published Nov 25, 2022 • 10