CG-AL-H018

hard | Codeunits & Business Logic | content 1bfdb0420e43…

Description

Per-model results

Models that have attempted this task
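| Model | Attempt 1 | Attempt 2 | Avg score | Runs |
| --- | --- | --- | --- | --- |
| Claude Fable 5 | | | 100.0 / 100 | 3 |
| Claude Opus 4.6 | | | 100.0 / 100 | 3 |
| Claude Opus 4.7 | | | 100.0 / 100 | 6 |
| Claude Opus 4.8 | | | 100.0 / 100 | 6 |
| Claude Sonnet 4 6 | | | 100.0 / 100 | 6 |
| Gemini 3.1 Pro Preview | | | 100.0 / 100 | 3 |
| Gemini 3.5 Flash | | | 78.1 / 100 | 6 |
| GPT-5.5 | | | 75.0 / 100 | 3 |
| Claude Haiku 4 5 20251001 | | | 33.3 / 100 | 3 |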