List Metrics
| Option | Type | Default | Description |
|---|---|---|---|
--filter | string | — | Filter expression (supports metric_type, metric_name, create_time) |
--page-size | number | 50 | Results per page (1-100) |
--order-by | string | — | Sort field, prefix with - for descending |
--include-builtin | flag | — | Include built-in metrics (e.g. Turn Count, Audio Duration) |
Get Metric
| Argument | Type | Required | Description |
|---|---|---|---|
metric_id | string | Yes | The metric ID |
Create Metric
| Option | Type | Required | Description |
|---|---|---|---|
--name | string | Yes | Metric display name |
--description | string | Yes | What this metric evaluates |
--type | string | Yes | Metric type (see below) |
--prompt | string | No | LLM evaluation prompt (required for llm-binary, categorical, numerical and their audio variants) |
--categories | string | No | Comma-separated categories (required for categorical, audio-categorical) |
--min-value | number | No | Minimum value (required for numerical, audio-numerical) |
--max-value | number | No | Maximum value (required for numerical, audio-numerical) |
--regex-pattern | string | No | Regex pattern to match (required for regex) |
--role | string | No | Transcript role to match against (optional for regex) |
--match-mode | string | No | presence (default) or absence — absence returns 1 if pattern NOT found |
--position | string | No | any (default), first, or last message of the role |
--case-insensitive | boolean | No | Enable case-insensitive matching |
--metadata-field-type | string | No | Metadata field type (required for metadata) |
--metadata-field-key | string | No | Metadata field key to extract (required for metadata) |
--min-pause-duration-seconds | number | No | Minimum pause duration threshold (required for pause) |
Metric Types
| Type | Description | Type-Specific Options |
|---|---|---|
llm-binary | Binary (yes/no) LLM judgment | --prompt |
categorical | Categorical LLM judgment with defined options | --prompt, --categories |
numerical | Numerical score from LLM judgment | --prompt, --min-value, --max-value |
audio-binary | Binary audio analysis | --prompt |
audio-categorical | Categorical audio analysis | --prompt, --categories |
audio-numerical | Numerical audio analysis | --prompt, --min-value, --max-value |
toolcall | Tool call success verification | — |
metadata | Extract metadata field value | --metadata-field-type, --metadata-field-key |
regex | Match transcript against a regex pattern | --regex-pattern, --role, --match-mode, --position, --case-insensitive |
pause | Analyze pause durations in audio | --min-pause-duration-seconds |
Examples
Update Metric
| Argument | Type | Required | Description |
|---|---|---|---|
metric_id | string | Yes | The metric ID to update |
| Option | Type | Description |
|---|---|---|
--name | string | New display name |
--description | string | New description |
--prompt | string | New evaluation prompt |
Delete Metric
| Argument | Type | Required | Description |
|---|---|---|---|
metric_id | string | Yes | The metric ID to delete |

