DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research Paper • 2511.19399 • Published about 1 month ago • 60
faezeb/tulu3_rewritten_400k_rubrics-single-verifiable-nocode-filetered-ot2 Viewer • Updated Aug 14 • 215k • 23