
List Runs

coval runs list [OPTIONS]
| Option | Type | Default | Description |
| --- | --- | --- | --- |
| --filter | string | | Filter expression (e.g., status="COMPLETED") |
| --page-size | number | 50 | Results per page |
| --order-by | string | | Sort order (e.g., -create_time) |
Output columns: ID, STATUS, PROGRESS, CREATED
# List all runs
coval runs list

# Filter completed runs
coval runs list --filter 'status="COMPLETED"'
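The three options can presumably be combined in one call. The sketch below wraps such a call in a hypothetical helper, `list_recent_failed`. It assumes that the flags may be used together and that the leading `-` in `-create_time` requests descending order; neither is confirmed by the option table above.

```shell
# Hypothetical helper: show the ten most recently created runs that failed.
# Assumption: a leading "-" in --order-by means descending order.
list_recent_failed() {
  coval runs list \
    --filter 'status="FAILED"' \
    --order-by -create_time \
    --page-size 10
}
```

Single quotes around the filter keep the inner double quotes intact when the shell passes the expression to the CLI.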

Get Run

coval runs get <run_id>
| Argument | Type | Required | Description |
| --- | --- | --- | --- |
| run_id | string | Yes | The run ID |
Returns full run details as JSON including status, progress, results, and metrics.
coval runs get run_abc123
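Because the output is JSON, it can be piped into a JSON processor. The following is a minimal sketch that prints only the run's status. It assumes `jq` is installed and that the status is exposed as a top-level `status` field; the exact JSON layout is not documented here, so adjust the path if it differs. `run_status` is a hypothetical helper name.

```shell
# Hypothetical helper: print only the status of a run.
# Assumptions: jq is installed, and the JSON has a top-level "status" field.
run_status() {
  coval runs get "$1" | jq -r '.status'
}
```

For example, `run_status run_abc123` would print a value such as `COMPLETED` if the field exists under that name.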

Launch Run

coval runs launch [OPTIONS]
| Option | Type | Required | Description |
| --- | --- | --- | --- |
| --agent-id | string | Yes | Agent ID to evaluate |
| --persona-id | string | Yes | Persona ID for simulated caller |
| --test-set-id | string | Yes | Test set ID containing test cases |
| --iterations | number | No | Iterations per test case (default: 1) |
| --concurrency | number | No | Parallel simulations |
| --name | string | No | Display name for the run |
| --mutation-id | string | No | Single mutation ID to test |
| --mutation-ids | string | No | Comma-separated mutation IDs |
# Basic run
coval runs launch \
  --agent-id ag_abc123 \
  --persona-id per_xyz789 \
  --test-set-id ts_123456

# Run with options
coval runs launch \
  --agent-id ag_abc123 \
  --persona-id per_xyz789 \
  --test-set-id ts_123456 \
  --iterations 3 \
  --concurrency 5 \
  --name "Regression Test"

# Run with mutations
coval runs launch \
  --agent-id ag_abc123 \
  --persona-id per_xyz789 \
  --test-set-id ts_123456 \
  --mutation-ids "mut_001,mut_002,mut_003"
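When the mutation IDs come from a script rather than a hand-typed string, the comma-separated value for `--mutation-ids` can be assembled in the shell. The sketch below uses two hypothetical helpers, `join_ids` and `launch_with_mutations`; only the `coval runs launch` flags themselves come from the table above.

```shell
# Join all arguments with commas, e.g. "a b c" -> "a,b,c".
# The function body runs in a subshell so the IFS change does not leak out.
join_ids() (
  IFS=,
  printf '%s\n' "$*"
)

# Hypothetical wrapper: the first three arguments are the required IDs,
# every remaining argument is treated as a mutation ID.
launch_with_mutations() {
  agent_id=$1
  persona_id=$2
  test_set_id=$3
  shift 3
  coval runs launch \
    --agent-id "$agent_id" \
    --persona-id "$persona_id" \
    --test-set-id "$test_set_id" \
    --mutation-ids "$(join_ids "$@")"
}
```

A call such as `launch_with_mutations ag_abc123 per_xyz789 ts_123456 mut_001 mut_002` passes `mut_001,mut_002` to `--mutation-ids`.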

Watch Run

Monitor a run’s progress in real time with a live progress bar.
coval runs watch <run_id> [OPTIONS]
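| Argument | Type | Required | Description |
| --- | --- | --- | --- |
| run_id | string | Yes | The run ID to watch |

| Option | Type | Default | Description |
| --- | --- | --- | --- |
| --interval | number | 2 | Poll interval in seconds |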
# Watch with default interval
coval runs watch run_abc123

# Watch with faster polling
coval runs watch run_abc123 --interval 1
The watch command displays a progress bar and exits when the run reaches a terminal status.
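Since `watch` returns once the run is finished, it can act as a gate in a script: wait for the run, then inspect the final status. The sketch below shows one way to do this with a hypothetical helper, `wait_for_run`. It assumes `jq` is installed and that `coval runs get` exposes a top-level `status` field. It does not rely on the exit code of `watch`, which is not documented here.

```shell
# Hypothetical helper: block until the run finishes, then succeed
# only if its final status is COMPLETED.
# Assumptions: jq is installed, and the JSON has a top-level "status" field.
wait_for_run() {
  run_id=$1
  coval runs watch "$run_id" --interval 5
  status=$(coval runs get "$run_id" | jq -r '.status')
  [ "$status" = "COMPLETED" ]
}
```

In a CI job, `wait_for_run run_abc123 || exit 1` would then fail the step for any final status other than `COMPLETED`.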

Delete Run

coval runs delete <run_id>
| Argument | Type | Required | Description |
| --- | --- | --- | --- |
| run_id | string | Yes | The run ID to delete |

Run Statuses

| Status | Description |
| --- | --- |
| PENDING | Run is created but not yet started |
| IN_QUEUE | Run is queued for execution |
| IN_PROGRESS | Simulations are actively running |
| COMPLETED | All simulations finished successfully |
| FAILED | Run encountered an error |
| CANCELLED | Run was cancelled |
| DELETED | Run was deleted |
When using --filter, use the underscore-separated enum values (e.g., status="IN_PROGRESS").
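A small guard can catch a mistyped status before it reaches the CLI. The sketch below defines a hypothetical helper, `status_filter`, that accepts only the enum values listed above and prints the matching filter expression. How the CLI itself reacts to an unknown status value is not covered here.

```shell
# Hypothetical helper: build a --filter expression for a run status,
# rejecting anything that is not one of the documented enum values.
status_filter() {
  case "$1" in
    PENDING|IN_QUEUE|IN_PROGRESS|COMPLETED|FAILED|CANCELLED|DELETED)
      printf 'status="%s"\n' "$1"
      ;;
    *)
      echo "unknown status: $1" >&2
      return 1
      ;;
  esac
}
```

It could be used as `coval runs list --filter "$(status_filter IN_PROGRESS)"`, which passes `status="IN_PROGRESS"` to the CLI.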