Add MMLU-Pro evaluation result (84.0)

#242

by burtenshaw HF Staff - opened Jan 28

←

YAML Metadata Error: Invalid content in Eval Result file .eval_results/mmlu-pro.yaml

Check out the documentation for more information.

Show details

Task ID "mmlu_pro" does not match any task in dataset "TIGER-Lab/MMLU-Pro". Available: none

Files changed (1) hide show

.eval_results/mmlu-pro.yaml ADDED Viewed

+- dataset:
+    id: TIGER-Lab/MMLU-Pro
+    task_id: mmlu_pro
+  value: 84.0
+  date: '2026-01-28'
+  source:
+    url: https://huggingface.co/deepseek-ai/DeepSeek-R1
+    name: Model Card
+    user: burtenshaw