Run 97865fb5-e26…
completed 2c09af0e-90e1-4d79-8f05-bb867284cf1e
Run success rate
Tasks the run solved on its last attempt / tasks attempted in this run.
Formula: COUNT(distinct tasks where last attempt passed) / COUNT(distinct tasks attempted in this run)
Per-run metric for the model's "final answer" on each task. Differs from leaderboard pass_at_n: this denominator is the run's own attempted-task count, not the task set size, so partial runs are not penalised for unattempted tasks.
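The formula above can be sketched in a few lines of Python. This is a minimal illustration, not the dashboard's actual implementation: the `AttemptRow` shape, its field names, and the sample task IDs are all assumptions, since the real results schema is not shown here.

```python
from typing import Iterable, NamedTuple


class AttemptRow(NamedTuple):
    # Hypothetical row shape; the real results schema is not shown in the source.
    task_id: str
    attempt: int
    passed: bool


def run_success_rate(rows: Iterable[AttemptRow]) -> float:
    """Distinct tasks whose last attempt passed / distinct tasks attempted."""
    last: dict[str, AttemptRow] = {}
    for row in rows:
        current = last.get(row.task_id)
        # Keep only the highest-numbered attempt for each task.
        if current is None or row.attempt > current.attempt:
            last[row.task_id] = row
    if not last:
        return 0.0  # No tasks attempted; avoid dividing by zero.
    solved = sum(1 for row in last.values() if row.passed)
    return solved / len(last)


# Made-up sample data: T2 passes on its second attempt, T3 never passes.
rows = [
    AttemptRow("T1", 1, True),
    AttemptRow("T2", 1, False),
    AttemptRow("T2", 2, True),
    AttemptRow("T3", 1, False),
    AttemptRow("T3", 2, False),
]
rate = run_success_rate(rows)
print(f"{rate:.3f}")
```

Only tasks that appear in `rows` enter the denominator, which is why a partial run is not penalised for tasks it never attempted.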
Avg attempt score
Mean per-attempt score on a 0–100 point scale (partial credit). Drill-down only.
Formula: Mean of attempt scores across all results rows: SUM(score) / COUNT(*) over the results table. Each attempt earns 0–100 points based on compile + test outcomes.
Drill-down companion to pass_at_n. Rewards partial credit but not directly comparable to pass rate; use for within-model analysis.
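As a rough sketch of the SUM(score) / COUNT(*) formula, the snippet below averages every attempt row, retries included. The function name and the sample scores are illustrative assumptions and are not taken from this run's results table.

```python
from typing import Iterable


def avg_attempt_score(scores: Iterable[float]) -> float:
    """Mean of per-attempt scores (0-100 scale) over all attempt rows."""
    values = list(scores)
    if not values:
        return 0.0  # No attempts recorded; avoid dividing by zero.
    return sum(values) / len(values)


# Made-up scores for four attempt rows: two full passes, one compile
# failure, and one partial-credit attempt.
sample_scores = [100.0, 0.0, 62.5, 100.0]
avg = avg_attempt_score(sample_scores)
print(avg)
```

Every attempt counts as its own row here, so a task that fails once and then passes contributes both scores. That is one reason the figure is not directly comparable to a pass rate.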
| Task | Difficulty | Attempt | Score | Tests | Compile | Duration | |
|---|---|---|---|---|---|---|---|
| CG-AL-E001 | easy | 1 | 100.0 / 100 | 7/7 | OK | 41.3s | |
| CG-AL-E002 | easy | 1 | 100.0 / 100 | 6/6 | OK | 3m 4s | |
| CG-AL-E003 | easy | 1 | 100.0 / 100 | 5/5 | OK | 41.9s | |
| CG-AL-E004 | easy | 1 | 100.0 / 100 | 6/6 | OK | 47.9s | |
| CG-AL-E005 | easy | 1 | 100.0 / 100 | 13/13 | OK | 56.3s | |
| CG-AL-E006 | easy | 2 | 100.0 / 100 | 7/7 | OK | 3m 40s | |
| CG-AL-E007 | easy | 1 | 100.0 / 100 | 7/7 | OK | 1m 18s | |
| CG-AL-E008 | easy | 1 | 100.0 / 100 | 6/6 | OK | 1m 37s | |
| CG-AL-E009 | easy | 1 | 100.0 / 100 | 5/5 | OK | 1m 53s | |
| CG-AL-E010 | easy | 1 | 100.0 / 100 | 5/5 | OK | 2m 8s | |
| CG-AL-E031 | easy | 1 | 100.0 / 100 | 3/3 | OK | 2m 16s | |
| CG-AL-E032 | easy | 1 | 100.0 / 100 | 1/1 | OK | 2m 12s | |
| CG-AL-E045 | easy | 1 | 100.0 / 100 | 4/4 | OK | 23.0s | |
| CG-AL-E050 | easy | 2 | 0.0 / 100 | 0/0 | FAIL | 6m 23s | |
| CG-AL-E051 | easy | 1 | 100.0 / 100 | 15/15 | OK | 49.5s | |
| CG-AL-E052 | easy | 2 | 100.0 / 100 | 16/16 | OK | 19m 38s | |
| CG-AL-E053 | easy | 1 | 100.0 / 100 | 3/3 | OK | 2m 35s | |
| CG-AL-E054 | easy | 2 | 100.0 / 100 | 9/9 | OK | 7m 23s | |
| CG-AL-E055 | easy | 1 | 100.0 / 100 | 8/8 | OK | 1m 30s | |
| CG-AL-E056 | easy | 2 | 100.0 / 100 | 15/15 | OK | 38.8s | |
| CG-AL-E057 | easy | 2 | 0.0 / 100 | 0/0 | FAIL | 18m 21s | |
| CG-AL-E058 | easy | 2 | 100.0 / 100 | 1/1 | OK | 7m 7s | |
| CG-AL-H001 | hard | 1 | 100.0 / 100 | 25/25 | OK | 20m 46s | |
| CG-AL-H002 | hard | 1 | 100.0 / 100 | 4/4 | OK | 19m 8s | |
| CG-AL-H003 | hard | 1 | 100.0 / 100 | 5/5 | OK | 19m 49s | |
| CG-AL-H004 | hard | 1 | 100.0 / 100 | 14/14 | OK | 20m 8s | |
| CG-AL-H005 | hard | 1 | 100.0 / 100 | 6/6 | OK | 1m 1s | |
| CG-AL-H006 | hard | 1 | 100.0 / 100 | 6/6 | OK | 15m 45s | |
| CG-AL-H007 | hard | 1 | 100.0 / 100 | 10/10 | OK | 6m 36s | |
| CG-AL-H008 | hard | 1 | 100.0 / 100 | 10/10 | OK | 4m 31s | |
| CG-AL-H009 | hard | 1 | 100.0 / 100 | 11/11 | OK | 45.7s | |
| CG-AL-H010 | hard | 1 | 100.0 / 100 | 8/8 | OK | 1m 53s | |
| CG-AL-H011 | hard | 2 | 100.0 / 100 | 5/5 | OK | 47.5s | |
| CG-AL-H013 | hard | 1 | 100.0 / 100 | 9/9 | OK | 55.2s | |
| CG-AL-H014 | hard | 1 | 100.0 / 100 | 7/7 | OK | 51.4s | |
| CG-AL-H015 | hard | 1 | 100.0 / 100 | 4/4 | OK | 34.1s | |
| CG-AL-H016 | hard | 1 | 100.0 / 100 | 4/4 | OK | 23.6s | |
| CG-AL-H017 | hard | 2 | 100.0 / 100 | 3/3 | OK | 1m 12s | |
| CG-AL-H018 | hard | 1 | 100.0 / 100 | 6/6 | OK | 1m 29s | |
| CG-AL-H019 | hard | 1 | 100.0 / 100 | 5/5 | OK | 1m 6s | |
| CG-AL-H020 | hard | 1 | 100.0 / 100 | 10/10 | OK | 1m 33s | |
| CG-AL-H021 | hard | 2 | 100.0 / 100 | 20/20 | OK | 3m 24s | |
| CG-AL-H022 | hard | 1 | 100.0 / 100 | 21/21 | OK | 1m 2s | |
| CG-AL-H023 | hard | 1 | 100.0 / 100 | 25/25 | OK | 1m 33s | |
| CG-AL-H024 | hard | 1 | 100.0 / 100 | 9/9 | OK | 28.7s | |
| CG-AL-H025 | hard | 1 | 100.0 / 100 | 7/7 | OK | 49.6s | |
| CG-AL-H026 | hard | 1 | 100.0 / 100 | 8/8 | OK | 34.0s | |
| CG-AL-H027 | hard | 2 | 62.5 / 100 | 2/4 | OK | 1m 15s | |
| CG-AL-H028 | hard | 1 | 100.0 / 100 | 9/9 | OK | 28.1s | |
| CG-AL-H029 | hard | 2 | 100.0 / 100 | 18/18 | OK | 6m 39s | |
| CG-AL-H030 | hard | 2 | 100.0 / 100 | 10/10 | OK | 1m 59s | |
| CG-AL-H031 | hard | 1 | 100.0 / 100 | 22/22 | OK | 2m 0s | |
| CG-AL-H032 | hard | 1 | 100.0 / 100 | 19/19 | OK | 3m 29s | |
| CG-AL-H033 | hard | 1 | 100.0 / 100 | 5/5 | OK | 3m 2s | |
| CG-AL-H034 | hard | 2 | 100.0 / 100 | 3/3 | OK | 21.2s | |
| CG-AL-H035 | hard | 1 | 100.0 / 100 | 3/3 | OK | 18.8s | |
| CG-AL-H036 | hard | 2 | 100.0 / 100 | 5/5 | OK | 7m 28s | |
| CG-AL-H037 | hard | 2 | 100.0 / 100 | 3/3 | OK | 22.4s | |
| CG-AL-H038 | hard | 1 | 100.0 / 100 | 3/3 | OK | 32.7s | |
| CG-AL-H039 | hard | 1 | 100.0 / 100 | 4/4 | OK | 25.0s | |
| CG-AL-H040 | hard | 1 | 100.0 / 100 | 2/2 | OK | 29.5s | |
| CG-AL-H041 | hard | 1 | 100.0 / 100 | 3/3 | OK | 42.4s | |
| CG-AL-H042 | hard | 1 | 100.0 / 100 | 3/3 | OK | 20.4s | |
| CG-AL-H043 | hard | 1 | 100.0 / 100 | 5/5 | OK | 19.4s | |
| CG-AL-H050 | hard | 1 | 100.0 / 100 | 3/3 | OK | 55.7s | |
| CG-AL-H051 | hard | 1 | 100.0 / 100 | 4/4 | OK | 26.5s | |
| CG-AL-H052 | hard | 1 | 100.0 / 100 | 5/5 | OK | 1m 11s | |
| CG-AL-H053 | hard | 1 | 100.0 / 100 | 4/4 | OK | 20.8s | |
| CG-AL-H054 | hard | 2 | 0.0 / 100 | 0/0 | FAIL | 17.2s | |
| CG-AL-H056 | hard | 2 | 100.0 / 100 | 4/4 | OK | 47.3s | |
| CG-AL-H057 | hard | 1 | 100.0 / 100 | 4/4 | OK | 2m 32s | |
| CG-AL-H058 | hard | 1 | 100.0 / 100 | 5/5 | OK | 1m 17s | |
| CG-AL-H205 | hard | 1 | 100.0 / 100 | 6/6 | OK | 42.0s | |
| CG-AL-M001 | medium | 2 | 0.0 / 100 | 0/0 | FAIL | 17.1s | |
| CG-AL-M002 | medium | 1 | 100.0 / 100 | 22/22 | OK | 31.2s | |
| CG-AL-M003 | medium | 1 | 100.0 / 100 | 9/9 | OK | 56.3s | |
| CG-AL-M004 | medium | 1 | 100.0 / 100 | 12/12 | OK | 2m 56s | |
| CG-AL-M005 | medium | 1 | 100.0 / 100 | 22/22 | OK | 42.6s | |
| CG-AL-M006 | medium | 1 | 100.0 / 100 | 18/18 | OK | 1m 5s | |
| CG-AL-M007 | medium | 2 | 0.0 / 100 | 0/0 | FAIL | 35.5s | |
| CG-AL-M008 | medium | 2 | 62.5 / 100 | 10/12 | OK | 1m 40s | |
| CG-AL-M009 | medium | 1 | 100.0 / 100 | 11/11 | OK | 36.1s | |
| CG-AL-M010 | medium | 1 | 100.0 / 100 | 21/21 | OK | 3m 22s | |
| CG-AL-M020 | medium | 1 | 100.0 / 100 | 17/17 | OK | 39.9s | |
| CG-AL-M021 | medium | 1 | 100.0 / 100 | 11/11 | OK | 28.0s | |
| CG-AL-M022 | medium | 1 | 100.0 / 100 | 9/9 | OK | 31.6s | |
| CG-AL-M023 | medium | 2 | 62.5 / 100 | 10/11 | OK | 3m 32s | |
| CG-AL-M024 | medium | 1 | 100.0 / 100 | 10/10 | OK | 34.0s | |
| CG-AL-M025 | medium | 2 | 100.0 / 100 | 7/7 | OK | 1m 10s | |
| CG-AL-M026 | medium | 1 | 100.0 / 100 | 8/8 | OK | 6m 42s | |
| CG-AL-M027 | medium | 1 | 100.0 / 100 | 18/18 | OK | 1m 0s | |
| CG-AL-M028 | medium | 2 | 100.0 / 100 | 3/3 | OK | 2m 46s | |
| CG-AL-M029 | medium | 2 | 0.0 / 100 | 0/0 | FAIL | 31.7s | |
| CG-AL-M031 | medium | 2 | 0.0 / 100 | 0/0 | FAIL | 37.1s | |
| CG-AL-M032 | medium | 1 | 100.0 / 100 | 3/3 | OK | 37.7s | |
| CG-AL-M033 | medium | 1 | 100.0 / 100 | 2/2 | OK | 46.5s | |
| CG-AL-M034 | medium | 2 | 0.0 / 100 | 0/0 | FAIL | 1m 19s | |
| CG-AL-M035 | medium | 1 | 100.0 / 100 | 2/2 | OK | 26.7s | |
| CG-AL-M036 | medium | 2 | 0.0 / 100 | 0/0 | FAIL | 6m 26s | |
| CG-AL-M037 | medium | 2 | 100.0 / 100 | 2/2 | OK | 6m 13s | |
| CG-AL-M038 | medium | 1 | 100.0 / 100 | 2/2 | OK | 2m 47s | |
| CG-AL-M039 | medium | 1 | 100.0 / 100 | 2/2 | OK | 3m 37s | |
| CG-AL-M040 | medium | 2 | 0.0 / 100 | 0/0 | FAIL | 6m 44s | |
| CG-AL-M041 | medium | 1 | 100.0 / 100 | 3/3 | OK | 8m 44s | |
| CG-AL-M042 | medium | 1 | 100.0 / 100 | 8/8 | OK | 5m 9s | |
| CG-AL-M043 | medium | 1 | 100.0 / 100 | 5/5 | OK | 5m 0s | |
| CG-AL-M044 | medium | 1 | 100.0 / 100 | 6/6 | OK | 3m 23s | |
| CG-AL-M045 | medium | 1 | 100.0 / 100 | 6/6 | OK | 34.1s | |
| CG-AL-M088 | medium | 1 | 100.0 / 100 | 6/6 | OK | 30.1s | |
| CG-AL-M112 | medium | 1 | 100.0 / 100 | 4/4 | OK | 27.4s | |