Gaelle eval harness

Image Identify Control Desk

Configure experiments, start the runner, watch job logs, and inspect report scores in one place. Open Visualizer

Datasets checking server

Jobs live log polling

Experiment Parameters runner payload

Run Identity

Analysis

Challenge

Summary & Judge

Runner output will appear here.

Report Controls none selected

Select an experiment with judge output.

Inline Report visualizer folded in