Model Card for Model ID

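| Tasks | Version | Filter | n-shot | Metric | Value | Stderr |
|---|---|---|---|---|---|---|
| arc_challenge | 1 | none | 25 | acc | 0.1741 | ± 0.0111 |
| arc_challenge | 1 | none | 25 | acc_norm | 0.2304 | ± 0.0123 |
| truthfulqa_mc2 | 2 | none | 0 | acc | 0.4616 | ± 0.0156 |
| winogrande | 1 | none | 5 | acc | 0.5107 | ± 0.0140 |
| hellaswag | 1 | none | 10 | acc | 0.2753 | ± 0.0045 |
| hellaswag | 1 | none | 10 | acc_norm | 0.2857 | ± 0.0045 |
| gsm8k | 3 | strict-match | 5 | exact_match | 0.0061 | ± 0.0021 |
| gsm8k | 3 | flexible-extract | 5 | exact_match | 0.0129 | ± 0.0031 |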

MMLU (5-shot, unweighted mean over the 57 subtasks below): acc 0.2534 ± 0.0044

| Tasks | Version | Filter | n-shot | Metric | Value | Stderr |
|---|---|---|---|---|---|---|
| world_religions | 0 | none | 5 | acc | 0.2222 | ± 0.0319 |
| virology | 0 | none | 5 | acc | 0.1988 | ± 0.0311 |
| us_foreign_policy | 0 | none | 5 | acc | 0.2300 | ± 0.0423 |
| sociology | 0 | none | 5 | acc | 0.2338 | ± 0.0299 |
| security_studies | 0 | none | 5 | acc | 0.3673 | ± 0.0309 |
| public_relations | 0 | none | 5 | acc | 0.2273 | ± 0.0401 |
| professional_psychology | 0 | none | 5 | acc | 0.2402 | ± 0.0173 |
| professional_medicine | 0 | none | 5 | acc | 0.4265 | ± 0.0300 |
| professional_law | 0 | none | 5 | acc | 0.2419 | ± 0.0109 |
| professional_accounting | 0 | none | 5 | acc | 0.2589 | ± 0.0261 |
| prehistory | 0 | none | 5 | acc | 0.2716 | ± 0.0247 |
| philosophy | 0 | none | 5 | acc | 0.2412 | ± 0.0243 |
| nutrition | 0 | none | 5 | acc | 0.2516 | ± 0.0248 |
| moral_scenarios | 0 | none | 5 | acc | 0.2514 | ± 0.0145 |
| moral_disputes | 0 | none | 5 | acc | 0.2139 | ± 0.0221 |
| miscellaneous | 0 | none | 5 | acc | 0.2644 | ± 0.0158 |
| medical_genetics | 0 | none | 5 | acc | 0.3000 | ± 0.0461 |
| marketing | 0 | none | 5 | acc | 0.1923 | ± 0.0258 |
| management | 0 | none | 5 | acc | 0.1942 | ± 0.0392 |
| machine_learning | 0 | none | 5 | acc | 0.2500 | ± 0.0411 |
| logical_fallacies | 0 | none | 5 | acc | 0.2638 | ± 0.0346 |
| jurisprudence | 0 | none | 5 | acc | 0.1759 | ± 0.0368 |
| international_law | 0 | none | 5 | acc | 0.3554 | ± 0.0437 |
| human_sexuality | 0 | none | 5 | acc | 0.2443 | ± 0.0377 |
| human_aging | 0 | none | 5 | acc | 0.1928 | ± 0.0265 |
| high_school_world_history | 0 | none | 5 | acc | 0.2700 | ± 0.0289 |
| high_school_us_history | 0 | none | 5 | acc | 0.2990 | ± 0.0321 |
| high_school_statistics | 0 | none | 5 | acc | 0.4074 | ± 0.0335 |
| high_school_psychology | 0 | none | 5 | acc | 0.2422 | ± 0.0184 |
| high_school_physics | 0 | none | 5 | acc | 0.2053 | ± 0.0330 |
| high_school_microeconomics | 0 | none | 5 | acc | 0.2479 | ± 0.0280 |
| high_school_mathematics | 0 | none | 5 | acc | 0.2815 | ± 0.0274 |
| high_school_macroeconomics | 0 | none | 5 | acc | 0.2128 | ± 0.0208 |
| high_school_government_and_politics | 0 | none | 5 | acc | 0.2435 | ± 0.0310 |
| high_school_geography | 0 | none | 5 | acc | 0.3232 | ± 0.0333 |
| high_school_european_history | 0 | none | 5 | acc | 0.2848 | ± 0.0352 |
| high_school_computer_science | 0 | none | 5 | acc | 0.2800 | ± 0.0451 |
| high_school_chemistry | 0 | none | 5 | acc | 0.2906 | ± 0.0319 |
| high_school_biology | 0 | none | 5 | acc | 0.3032 | ± 0.0261 |
| global_facts | 0 | none | 5 | acc | 0.1600 | ± 0.0368 |
| formal_logic | 0 | none | 5 | acc | 0.1429 | ± 0.0313 |
| elementary_mathematics | 0 | none | 5 | acc | 0.2434 | ± 0.0221 |
| electrical_engineering | 0 | none | 5 | acc | 0.2483 | ± 0.0360 |
| econometrics | 0 | none | 5 | acc | 0.2544 | ± 0.0410 |
| conceptual_physics | 0 | none | 5 | acc | 0.3064 | ± 0.0301 |
| computer_security | 0 | none | 5 | acc | 0.1700 | ± 0.0378 |
| college_physics | 0 | none | 5 | acc | 0.2745 | ± 0.0444 |
| college_medicine | 0 | none | 5 | acc | 0.2601 | ± 0.0335 |
| college_mathematics | 0 | none | 5 | acc | 0.2500 | ± 0.0435 |
| college_computer_science | 0 | none | 5 | acc | 0.2900 | ± 0.0456 |
| college_chemistry | 0 | none | 5 | acc | 0.2400 | ± 0.0429 |
| college_biology | 0 | none | 5 | acc | 0.2500 | ± 0.0362 |
| clinical_knowledge | 0 | none | 5 | acc | 0.2075 | ± 0.0250 |
| business_ethics | 0 | none | 5 | acc | 0.2000 | ± 0.0402 |
| astronomy | 0 | none | 5 | acc | 0.1974 | ± 0.0324 |
| anatomy | 0 | none | 5 | acc | 0.3185 | ± 0.0402 |
| abstract_algebra | 0 | none | 5 | acc | 0.2300 | ± 0.0423 |
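
The aggregate MMLU line above is consistent with an unweighted (macro) average of the 57 subtask accuracies in this table, with a standard error obtained by combining the per-subtask standard errors as if the subtasks were independent. The card does not state how the aggregate was produced, so the sketch below is an assumption that happens to reproduce the reported figures; `SUBTASKS` and `macro_average` are names introduced here for illustration only.

```python
import math

# (accuracy, stderr) pairs copied from the 57 MMLU subtask rows above, in table order.
SUBTASKS = [
    (0.2222, 0.0319), (0.1988, 0.0311), (0.2300, 0.0423), (0.2338, 0.0299),
    (0.3673, 0.0309), (0.2273, 0.0401), (0.2402, 0.0173), (0.4265, 0.0300),
    (0.2419, 0.0109), (0.2589, 0.0261), (0.2716, 0.0247), (0.2412, 0.0243),
    (0.2516, 0.0248), (0.2514, 0.0145), (0.2139, 0.0221), (0.2644, 0.0158),
    (0.3000, 0.0461), (0.1923, 0.0258), (0.1942, 0.0392), (0.2500, 0.0411),
    (0.2638, 0.0346), (0.1759, 0.0368), (0.3554, 0.0437), (0.2443, 0.0377),
    (0.1928, 0.0265), (0.2700, 0.0289), (0.2990, 0.0321), (0.4074, 0.0335),
    (0.2422, 0.0184), (0.2053, 0.0330), (0.2479, 0.0280), (0.2815, 0.0274),
    (0.2128, 0.0208), (0.2435, 0.0310), (0.3232, 0.0333), (0.2848, 0.0352),
    (0.2800, 0.0451), (0.2906, 0.0319), (0.3032, 0.0261), (0.1600, 0.0368),
    (0.1429, 0.0313), (0.2434, 0.0221), (0.2483, 0.0360), (0.2544, 0.0410),
    (0.3064, 0.0301), (0.1700, 0.0378), (0.2745, 0.0444), (0.2601, 0.0335),
    (0.2500, 0.0435), (0.2900, 0.0456), (0.2400, 0.0429), (0.2500, 0.0362),
    (0.2075, 0.0250), (0.2000, 0.0402), (0.1974, 0.0324), (0.3185, 0.0402),
    (0.2300, 0.0423),
]


def macro_average(results):
    """Return the unweighted mean accuracy and its standard error.

    Assumes subtask estimates are independent, so Var(mean) = sum(se_i^2) / n^2.
    """
    n = len(results)
    mean_acc = sum(acc for acc, _ in results) / n
    stderr = math.sqrt(sum(se ** 2 for _, se in results)) / n
    return mean_acc, stderr


mean_acc, stderr = macro_average(SUBTASKS)
print(f"MMLU macro average: {mean_acc:.4f} ± {stderr:.4f}")
# prints "MMLU macro average: 0.2534 ± 0.0044"
```

Because every subtask counts equally here, small subtasks such as abstract_algebra weigh as much as large ones such as professional_law; a sample-weighted average would give a different number.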

Model Details

Model Description

This is the model card of a 🤗 transformers model that has been pushed to the Hub. This model card has been automatically generated.

  • Developed by: [More Information Needed]
  • Funded by [optional]: [More Information Needed]
  • Shared by [optional]: [More Information Needed]
  • Model type: [More Information Needed]
  • Language(s) (NLP): [More Information Needed]
  • License: [More Information Needed]
  • Finetuned from model [optional]: [More Information Needed]

Model Sources [optional]

  • Repository: [More Information Needed]
  • Paper [optional]: [More Information Needed]
  • Demo [optional]: [More Information Needed]

Uses

Direct Use

[More Information Needed]

Downstream Use [optional]

[More Information Needed]

Out-of-Scope Use

[More Information Needed]

Bias, Risks, and Limitations

[More Information Needed]

Recommendations

Users (both direct and downstream) should be made aware of the risks, biases, and limitations of the model. More information is needed before further recommendations can be made.

How to Get Started with the Model

Use the code below to get started with the model.

[More Information Needed]
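
A minimal sketch of loading a Hub checkpoint with the transformers library is given here. It rests on two assumptions the card does not confirm: that the model is a causal language model (hence `AutoModelForCausalLM`), and that the repository ships a tokenizer. `MODEL_ID` is a hypothetical placeholder because the card does not state the repository ID, and `generate` is a helper name introduced for illustration.

```python
# Hypothetical placeholder: replace with the actual Hub repository ID.
MODEL_ID = "your-username/your-model"


def generate(prompt: str, model_id: str = MODEL_ID, max_new_tokens: int = 32) -> str:
    """Load the checkpoint from the Hub and return a greedy continuation of `prompt`."""
    # Imported lazily so this module can be imported without torch/transformers installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # Assumption: the checkpoint is a causal LM; use a different Auto class if it is not.
    model = AutoModelForCausalLM.from_pretrained(model_id)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        # Greedy decoding keeps the output deterministic for a quick smoke test.
        output_ids = model.generate(
            **inputs, max_new_tokens=max_new_tokens, do_sample=False
        )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("The capital of France is")` downloads the weights on first use, so it needs network access and a valid `MODEL_ID`.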

Training Details

Training Data

[More Information Needed]

Training Procedure

Preprocessing [optional]

[More Information Needed]

Training Hyperparameters

  • Training regime: [More Information Needed]

Speeds, Sizes, Times [optional]

[More Information Needed]

Evaluation

Testing Data, Factors & Metrics

Testing Data

[More Information Needed]

Factors

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

Model Examination [optional]

[More Information Needed]

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

  • Hardware Type: [More Information Needed]
  • Hours used: [More Information Needed]
  • Cloud Provider: [More Information Needed]
  • Compute Region: [More Information Needed]
  • Carbon Emitted: [More Information Needed]
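
Once the fields above are filled in, a first-order estimate can be sketched as average power draw times runtime times grid carbon intensity, which is roughly the form of calculation the Machine Learning Impact calculator performs. The function name, parameter names, and input numbers below are hypothetical illustrations, not measurements of this model's training run, and the calculator itself may apply further adjustments.

```python
def estimate_emissions_kg(
    power_kw: float, hours: float, intensity_kg_per_kwh: float
) -> float:
    """Approximate kgCO2eq as power (kW) x runtime (h) x carbon intensity (kgCO2eq/kWh)."""
    if min(power_kw, hours, intensity_kg_per_kwh) < 0:
        raise ValueError("inputs must be non-negative")
    return power_kw * hours * intensity_kg_per_kwh


# Purely illustrative inputs: a 0.3 kW accelerator for 10 hours on a 0.4 kgCO2eq/kWh grid.
example = estimate_emissions_kg(power_kw=0.3, hours=10.0, intensity_kg_per_kwh=0.4)
print(f"{example:.2f} kg CO2eq")
# prints "1.20 kg CO2eq"
```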

Technical Specifications [optional]

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

[More Information Needed]

Hardware

[More Information Needed]

Software

[More Information Needed]

Citation [optional]

BibTeX:

[More Information Needed]

APA:

[More Information Needed]

Glossary [optional]

[More Information Needed]

More Information [optional]

[More Information Needed]

Model Card Authors [optional]

[More Information Needed]

Model Card Contact

[More Information Needed]

Safetensors · Model size: 0.1B params · Tensor type: F32