Gemini
Trajectory
Concept trajectory
No baseline to compare against
This is the family's first analyzed generation. Once a second member is benched and analyzed, this section will surface the per-concept delta (resolved / persisting / regressed / new).
Members
| Model | Generation | Pass@N | Avg cost / task | Runs | Last run |
|---|---|---|---|---|---|
| 2 | — | — | 0 | — | |
| 2 | — | — | 0 | — | |
| 3 | 91.8% | $0.03 | 3 | 20d ago | |
| 3 | 88.2% | $0.06 | 6 | 20d ago |