DCAgent/eval-terminal-bench-2.0__alfworld-swesmith-r2__eval_ctx131k_non_it_8x_eval_ Updated about 19 hours ago • 4
DCAgent/eval-swebench-verified-random-100-folders__rl__40GPU_base_32b__ctx32k_non_it_16x_eval_ Viewer • Updated about 20 hours ago • 1.5k • 19
DCAgent/eval-terminal-bench-2.0__rl__40GPU_base_32b__ctx32k_non_it_16x_eval_ Viewer • Updated 1 day ago • 1.04k • 17
DCAgent/eval-openthoughts-tblite__rl__40GPU_base_32b__ctx32k_non_it_16x_eval_ Viewer • Updated 1 day ago • 686 • 18
DCAgent/eval-openthoughts-tblite__alfworld-swesmith-r2__eval_ctx131k_non_it_8x_eval_ Updated 1 day ago • 11
DCAgent/swegym-tasks-patched-upsampled_10k_glm_4.7_traces_jupiter Viewer • Updated 1 day ago • 20.8k • 14
DCAgent/Magicoder-Evol-Instruct-110K-sandboxes-1_10k_glm_4.7_traces_jupiter Viewer • Updated 2 days ago • 10.6k • 21
DCAgent/eval-openthoughts-tblite__syh-r2eg-askl-glm_4__ctx32k_non_it_16x_eval_ Viewer • Updated 2 days ago • 1.71k • 12
DCAgent/freelancer-projects-sandboxes_glm_4.7_traces_jupiter Viewer • Updated 2 days ago • 11.7k • 22
DCAgent/neulab-agenttuning-webshop-sandboxes_glm_4.7_traces_jupiter Viewer • Updated 2 days ago • 10.4k • 38
DCAgent/eval-terminal-bench-2.0__syh-r2eg-askl-glm_4__ctx32k_non_it_16x_eval_ Viewer • Updated 2 days ago • 944 • 15
DCAgent/exp_rpt_crosscodeeval-python-v2_10k_glm_4.7_traces_jupiter Viewer • Updated 3 days ago • 1.63k • 33
DCAgent/eval-openthoughts-tblite__sft_GLM-4-7-swesmith__ctx32k_non_it_16x_eval_ Viewer • Updated 3 days ago • 1.88k • 21
DCAgent/glaive-code-assistant-sandboxes_glm_4.7_traces_jupiter Viewer • Updated 3 days ago • 10k • 40