CG-AL-H022

hard Reflection & Data Transfer content 83bb271a1ea8…

Description

Per-model results

Models that have attempted this task
Model | Attempt 1 | Attempt 2 | Avg score | Runs
Claude Fable 5 | | | 100.0 / 100 | 3
Claude Opus 4.6 | | | 100.0 / 100 | 3
Claude Opus 4.7 | | | 100.0 / 100 | 6
Claude Opus 4.8 | | | 100.0 / 100 | 6
Gemini 3.1 Pro Preview | | | 100.0 / 100 | 3
GPT-5.5 | | | 75.0 / 100 | 3
Claude Sonnet 4 6 | | | 50.0 / 100 | 6
Gemini 3.5 Flash | | | 33.3 / 100 | 6
Claude Haiku 4 5 20251001 | | | 31.3 / 100 | 3