Gemini

Vendor: Google · 2 models

Trajectory

[Chart: Gemini 2.5 Pro · Gemini 2.0 Flash; axis from 0.0 to 1.0]

Concept trajectory

No baseline to compare against

This is the family's first analyzed generation. Once a second member is benchmarked and analyzed, this section will show the per-concept delta (resolved / persisting / regressed / new).
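As a rough illustration of what such a per-concept delta could look like, the sketch below sorts concepts into the four buckets named above. The page does not define the buckets or the underlying data, so everything here is an assumption: the function name `concept_delta`, the input shape (a mapping from concept name to a failure count per generation), and the rule that "regressed" means the count went up while "persisting" means it stayed the same or dropped without reaching zero.

```python
def concept_delta(baseline: dict[str, int], current: dict[str, int]) -> dict[str, list[str]]:
    """Bucket concepts by how their failure count changed between two generations.

    Hypothetical rules (not taken from the dashboard):
      resolved   - failing in baseline, no failures in current
      new        - no failures in baseline, failing in current
      regressed  - failing in both, count increased
      persisting - failing in both, count equal or lower
    """
    delta = {"resolved": [], "persisting": [], "regressed": [], "new": []}
    for concept in sorted(set(baseline) | set(current)):
        before = baseline.get(concept, 0)
        after = current.get(concept, 0)
        if before > 0 and after == 0:
            delta["resolved"].append(concept)
        elif before == 0 and after > 0:
            delta["new"].append(concept)
        elif after > before:
            delta["regressed"].append(concept)
        elif after > 0:
            delta["persisting"].append(concept)
        # Concepts with zero failures in both generations are left out.
    return delta


# Made-up counts for illustration only; these are not benchmark results.
baseline = {"off-by-one": 3, "null-handling": 1, "async-await": 2}
current = {"null-handling": 1, "async-await": 5, "string-escaping": 2}
print(concept_delta(baseline, current))
```

With a single analyzed generation there is no `baseline` argument to pass, which matches the "no baseline" state shown here.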

Members

Family members
Model · Generation · Avg score · Avg cost · Runs · Last run
Gemini 2.5 Pro gemini gemini-2.5-pro
20
Gemini 2.0 Flash gemini gemini-2.0-flash
20