Pull requests: open-compass/opencompass
Pull requests list
[Fix] Rename smolinstruct_pp_acc_0_shot_instruct dataset list as {}_datasets
#2340 opened Dec 2, 2025 by cnlnpjhsy
[Doc]Update evaluation configuration for Qwen3 model
#2332 opened Nov 26, 2025 by LukeLIN-web
6 tasks
feat(config): add Meta-Llama-3.1-8B-Instruct for MMLU benchmark
#2325 opened Nov 25, 2025 by 6taco
6 tasks
Add ProcessBench dataset and evaluation configuration
#2274 opened Sep 16, 2025 by sudanl
6 tasks done
[fix] Handle None value for max_out_len parameter in HuggingFace model
#2271 opened Sep 15, 2025 by Nexround
[Feature] Support pass@1 evaluation for multi predictions in MathEvaluator
#2253 opened Aug 28, 2025 by DELEnomore
feat: Add Zebra Grid dataset support with ZeroEval alignment
#2234 opened Aug 11, 2025 by max-yue
[Fix] Deprecate unused and error formated math500 gen file
#2206 opened Jul 17, 2025 by liushz
6 tasks
[Fix] livecodebench serialization and timeout errors
#2204 opened Jul 15, 2025 by f14-bertolotti
6 tasks