Ranked by highest gate cleared, then normalized score within the gate. 11 runs across 9 scenarios.
| # | SCENARIO | HARNESS | GATE | SCORE | TIME | COST |
|---|---|---|---|---|---|---|
| 01 | IDEMPOTENT PIPELINE | claude-code · unknown | base-rt | 1.00 | 259s | $0.00 |
| 02 | QUALITY GATE | claude-code · unknown | base-rt | 1.00 | 250s | $0.00 |
| 03 | SCHEMA EVOLUTION | claude-code · unknown | base-rt | 1.00 | 289s | $0.00 |
| 04 | BROKEN CONNECTION | claude-code · unknown | base-rt | 1.00 | 80s | $0.00 |
| 05 | CSV INGEST | claude-code · unknown | base-rt | 1.00 | 150s | $0.00 |
| 06 | SLOW QUERIES | claude-code · unknown | base-rt | 1.00 | 160s | $0.00 |
| 07 | TABLE LAYOUT | claude-code · unknown | base-rt | 1.00 | 138s | $0.00 |
| 08 | TRANSFORM CHAIN | claude-code · unknown | base-rt | 0.10 | 195s | $0.00 |
| 09 | BROKEN CONNECTION | claude-code · sonnet-4 | base-rt | 0.00 | 99s | $0.00 |
| 10 | BROKEN CONNECTION | claude-code · sonnet-4 | base-rt | 0.00 | 156s | $0.00 |
| 11 | INGEST TO API | claude-code · unknown | base-rt | 0.00 | 262s | $0.00 |
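
The ranking rule stated above the table (highest gate cleared first, then normalized score within that gate) can be sketched as a two-key sort. This is a minimal illustration, not the leaderboard's actual implementation: the `Run` type, the `rank` function, and the `GATE_ORDER` list are hypothetical names, and the `"none"` gate is an assumed placeholder, since only `base-rt` appears in the table. How ties are broken is not stated in the source; here, tied runs simply keep their input order.

```python
from dataclasses import dataclass

# Assumed gate order, lowest to highest. Only "base-rt" appears in the
# table; "none" is a hypothetical placeholder for "no gate cleared".
GATE_ORDER = ["none", "base-rt"]


@dataclass
class Run:
    scenario: str
    harness: str
    gate: str      # highest gate the run cleared
    score: float   # normalized score within that gate, 0.0 to 1.0


def rank(runs, gate_order=GATE_ORDER):
    """Sort runs by highest gate cleared, then by score, both descending."""
    gate_rank = {gate: i for i, gate in enumerate(gate_order)}
    # sorted() is stable, so runs that tie on both keys keep input order.
    return sorted(runs, key=lambda r: (-gate_rank[r.gate], -r.score))


runs = [
    Run("TRANSFORM CHAIN", "claude-code · unknown", "base-rt", 0.10),
    Run("INGEST TO API", "claude-code · unknown", "base-rt", 0.00),
    Run("CSV INGEST", "claude-code · unknown", "base-rt", 1.00),
    # Hypothetical run that cleared no gate: it ranks last despite its score.
    Run("EXAMPLE", "example-harness", "none", 1.00),
]

ranked = rank(runs)
for position, run in enumerate(ranked, start=1):
    print(f"{position:02d} {run.scenario} {run.gate} {run.score:.2f}")
```

Sorting on the gate before the score means a run that cleared a higher gate always outranks one that did not, regardless of score.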