Model Card for Model ID

Tasks Version Filter n-shot Metric Value Stderr
arc_challenge 1 none 25 acc 0.2108 ยฑ 0.0119
none 25 acc_norm 0.2423 ยฑ 0.0125
truthfulqa_mc2 2 none 0 acc 0.4356 ยฑ 0.0151
winogrande 1 none 5 acc 0.5138 ยฑ 0.014
hellaswag 1 none 10 acc 0.2938 ยฑ 0.0045
none 10 acc_norm 0.3242 ยฑ 0.0047
gsm8k 3 strict-match 5 exact_match 0.0129 ยฑ 0.0031
flexible-extract 5 exact_match 0.0197 ยฑ 0.0038

MMLU (0.2649701754385965, 0.004451753262466369)

Tasks Version Filter n-shot Metric Value Stderr
world_religions 0 none 5 acc 0.2281 ยฑ 0.0322
virology 0 none 5 acc 0.1747 ยฑ 0.0296
us_foreign_policy 0 none 5 acc 0.2600 ยฑ 0.0441
sociology 0 none 5 acc 0.2736 ยฑ 0.0315
security_studies 0 none 5 acc 0.4000 ยฑ 0.0314
public_relations 0 none 5 acc 0.2273 ยฑ 0.0401
professional_psychology 0 none 5 acc 0.2467 ยฑ 0.0174
professional_medicine 0 none 5 acc 0.4485 ยฑ 0.0302
professional_law 0 none 5 acc 0.2490 ยฑ 0.0110
professional_accounting 0 none 5 acc 0.2340 ยฑ 0.0253
prehistory 0 none 5 acc 0.2315 ยฑ 0.0235
philosophy 0 none 5 acc 0.2154 ยฑ 0.0234
nutrition 0 none 5 acc 0.2516 ยฑ 0.0248
moral_scenarios 0 none 5 acc 0.2536 ยฑ 0.0146
moral_disputes 0 none 5 acc 0.1879 ยฑ 0.0210
miscellaneous 0 none 5 acc 0.2197 ยฑ 0.0148
medical_genetics 0 none 5 acc 0.1900 ยฑ 0.0394
marketing 0 none 5 acc 0.1923 ยฑ 0.0258
management 0 none 5 acc 0.3301 ยฑ 0.0466
machine_learning 0 none 5 acc 0.1875 ยฑ 0.0370
logical_fallacies 0 none 5 acc 0.2577 ยฑ 0.0344
jurisprudence 0 none 5 acc 0.2222 ยฑ 0.0402
international_law 0 none 5 acc 0.3802 ยฑ 0.0443
human_sexuality 0 none 5 acc 0.2137 ยฑ 0.0360
human_aging 0 none 5 acc 0.1121 ยฑ 0.0212
high_school_world_history 0 none 5 acc 0.2743 ยฑ 0.0290
high_school_us_history 0 none 5 acc 0.2353 ยฑ 0.0298
high_school_statistics 0 none 5 acc 0.4722 ยฑ 0.0340
high_school_psychology 0 none 5 acc 0.3358 ยฑ 0.0202
high_school_physics 0 none 5 acc 0.3245 ยฑ 0.0382
high_school_microeconomics 0 none 5 acc 0.2605 ยฑ 0.0285
high_school_mathematics 0 none 5 acc 0.2741 ยฑ 0.0272
high_school_macroeconomics 0 none 5 acc 0.3615 ยฑ 0.0244
high_school_government_and_politics 0 none 5 acc 0.3679 ยฑ 0.0348
high_school_geography 0 none 5 acc 0.3535 ยฑ 0.0341
high_school_european_history 0 none 5 acc 0.2485 ยฑ 0.0337
high_school_computer_science 0 none 5 acc 0.1600 ยฑ 0.0368
high_school_chemistry 0 none 5 acc 0.2709 ยฑ 0.0313
high_school_biology 0 none 5 acc 0.3032 ยฑ 0.0261
global_facts 0 none 5 acc 0.2500 ยฑ 0.0435
formal_logic 0 none 5 acc 0.1587 ยฑ 0.0327
elementary_mathematics 0 none 5 acc 0.2857 ยฑ 0.0233
electrical_engineering 0 none 5 acc 0.2483 ยฑ 0.0360
econometrics 0 none 5 acc 0.2895 ยฑ 0.0427
conceptual_physics 0 none 5 acc 0.2894 ยฑ 0.0296
computer_security 0 none 5 acc 0.1900 ยฑ 0.0394
college_physics 0 none 5 acc 0.2451 ยฑ 0.0428
college_medicine 0 none 5 acc 0.2775 ยฑ 0.0341
college_mathematics 0 none 5 acc 0.2800 ยฑ 0.0451
college_computer_science 0 none 5 acc 0.2400 ยฑ 0.0429
college_chemistry 0 none 5 acc 0.3300 ยฑ 0.0473
college_biology 0 none 5 acc 0.2639 ยฑ 0.0369
clinical_knowledge 0 none 5 acc 0.3094 ยฑ 0.0285
business_ethics 0 none 5 acc 0.1900 ยฑ 0.0394
astronomy 0 none 5 acc 0.2303 ยฑ 0.0343
anatomy 0 none 5 acc 0.3259 ยฑ 0.0405
abstract_algebra 0 none 5 acc 0.2700 ยฑ 0.0446

Model Details

Model Description

This is the model card of a ๐Ÿค— transformers model that has been pushed on the Hub. This model card has been automatically generated.

  • Developed by: [More Information Needed]
  • Funded by [optional]: [More Information Needed]
  • Shared by [optional]: [More Information Needed]
  • Model type: [More Information Needed]
  • Language(s) (NLP): [More Information Needed]
  • License: [More Information Needed]
  • Finetuned from model [optional]: [More Information Needed]

Model Sources [optional]

  • Repository: [More Information Needed]
  • Paper [optional]: [More Information Needed]
  • Demo [optional]: [More Information Needed]

Uses

Direct Use

[More Information Needed]

Downstream Use [optional]

[More Information Needed]

Out-of-Scope Use

[More Information Needed]

Bias, Risks, and Limitations

[More Information Needed]

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.

How to Get Started with the Model

Use the code below to get started with the model.

[More Information Needed]

Training Details

Training Data

[More Information Needed]

Training Procedure

Preprocessing [optional]

[More Information Needed]

Training Hyperparameters

  • Training regime: [More Information Needed]

Speeds, Sizes, Times [optional]

[More Information Needed]

Evaluation

Testing Data, Factors & Metrics

Testing Data

[More Information Needed]

Factors

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

Model Examination [optional]

[More Information Needed]

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

  • Hardware Type: [More Information Needed]
  • Hours used: [More Information Needed]
  • Cloud Provider: [More Information Needed]
  • Compute Region: [More Information Needed]
  • Carbon Emitted: [More Information Needed]

Technical Specifications [optional]

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

[More Information Needed]

Hardware

[More Information Needed]

Software

[More Information Needed]

Citation [optional]

BibTeX:

[More Information Needed]

APA:

[More Information Needed]

Glossary [optional]

[More Information Needed]

More Information [optional]

[More Information Needed]

Model Card Authors [optional]

[More Information Needed]

Model Card Contact

[More Information Needed]

Downloads last month
5
Safetensors
Model size
0.3B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support