Create a new custom evaluation metric.
Required fields vary by metric type:
| Metric Type | Required Fields |
|---|---|
| METRIC_LLM_BINARY | prompt |
| METRIC_CATEGORICAL | prompt, categories |
| METRIC_NUMERICAL_LLM_JUDGE | prompt, min_value, max_value |
| METRIC_AUDIO_LLM_BINARY | prompt |
| METRIC_AUDIO_LLM_CATEGORICAL | prompt, categories |
| METRIC_AUDIO_LLM_NUMERICAL | prompt, min_value, max_value |
| METRIC_TOOLCALL | prompt |
| METRIC_METADATA_FIELD | metadata_field_type, metadata_field_key |
| METRIC_TRANSCRIPT_REGEX | regex_pattern |
| METRIC_PAUSE_ANALYSIS | min_pause_duration_seconds |
API key for authentication
Create metric request
Display name
1 - 200Metric description
1 - 1000Metric evaluation type.
METRIC_LLM_BINARY - Yes/no LLM evaluationMETRIC_CATEGORICAL - Multi-class classificationMETRIC_NUMERICAL_LLM_JUDGE - Numerical scoring (1-N)METRIC_AUDIO_LLM_BINARY - Audio-based yes/noMETRIC_AUDIO_LLM_CATEGORICAL - Audio-based classificationMETRIC_AUDIO_LLM_NUMERICAL - Audio-based scoringMETRIC_TOOLCALL - Tool/function call evaluationMETRIC_METADATA_FIELD - Extract metadata fieldMETRIC_TRANSCRIPT_REGEX - Regex pattern matchingMETRIC_PAUSE_ANALYSIS - Speech pause detectionMETRIC_LLM_BINARY, METRIC_CATEGORICAL, METRIC_NUMERICAL_LLM_JUDGE, METRIC_AUDIO_LLM_BINARY, METRIC_AUDIO_LLM_CATEGORICAL, METRIC_AUDIO_LLM_NUMERICAL, METRIC_TOOLCALL, METRIC_METADATA_FIELD, METRIC_TRANSCRIPT_REGEX, METRIC_PAUSE_ANALYSIS LLM evaluation prompt. Required for LLM-based metrics.
Categories for classification. Required for categorical metrics.
2 - 50 elementsMinimum score. Required for numerical metrics.
Maximum score. Required for numerical metrics.
Field type. Required for METRIC_METADATA_FIELD.
STRING, NUMBER, BOOLEAN Metadata key. Required for METRIC_METADATA_FIELD.
Regex pattern. Required for METRIC_TRANSCRIPT_REGEX.
Speaker role filter. Optional for METRIC_TRANSCRIPT_REGEX.
agent, user Min pause duration in seconds. Required for METRIC_PAUSE_ANALYSIS.
x >= 0.5Inject OTel trace context into the LLM judge prompt during evaluation.
Supported for LLM judge metric types only (METRIC_LLM_BINARY, METRIC_CATEGORICAL,
METRIC_NUMERICAL_LLM_JUDGE, METRIC_AUDIO_LLM_BINARY, METRIC_AUDIO_LLM_CATEGORICAL,
METRIC_AUDIO_LLM_NUMERICAL).
Target condition for metric evaluation
Metric created
Metric resource