Drop results JSON here

or use the buttons above

{{ formattedDate }} · {{ activeRun.tasks.length }} tasks · run {{ activeRun.id }}

{{ activeRun.totalScore }} / {{ activeRun.maxScore }}

{{ Math.round(activeRun.passRate * 100) }}% pass

of {{ catData(cat.key).maxScore }} points

{{ catData(cat.key).passed }}/{{ catData(cat.key).total }} tasks passed

Tasks

{{ task.score }}/{{ task.maxScore }}

{{ v.passed ? '+' + v.points : '0' }}/{{ v.points }}

LLM Judge {{ task.judge.score }}/{{ task.judge.maxScore }}

{{ c.score }}/{{ c.max }}

Leaderboard

#	Model	Score	{{ task.shortLabel }}
{{ i + 1 }}	{{ row.agentName }}	{{ row.totalScore }} {{ Math.round(row.passRate * 100) }}%