Gemini Live

Overview

Gemini Live agents are voice-to-voice agents where Coval connects directly to Google’s Gemini Live API on your behalf. You provide an API key, an agent prompt, a voice, and a model — Coval handles everything else. No webhook, no SIP trunk, and no self-hosted server required. This makes Gemini Live the fastest way to evaluate a Gemini-based agent configuration: tune the prompt, change the voice, bump the temperature, and re-run the same test set to see how the change affects behavior.

Use this connection when you want to evaluate an agent built directly on Gemini Live. If your production agent runs on Pipecat, LiveKit, or a custom stack, connect it through the matching integration instead.

Configuration Requirements

The only required field for a MODEL_TYPE_GEMINI_REALTIME agent is metadata.gemini_realtime_api_key. Every other field below is optional — if omitted, Coval applies the default shown.

Google AI API Key

Field: gemini_realtime_api_key
Type: String (required)
Purpose: Authenticates Coval’s connection to Gemini Live
Format: Any valid Google AI key. Supports both AIzaSy… (AI Studio) and AQ.… (Express Mode) prefixes.
Security: Stored encrypted and handled securely

Authentication errors (invalid key, missing Gemini Live access) surface the first time you run a simulation, not at agent save time. If your first simulation fails with an auth error, double-check the key has Gemini Live enabled.

Agent System Instructions

Field: gemini_realtime_instructions
Type: String (optional)
Default: "" (model default instructions)
Purpose: System prompt sent to Gemini Live as the agent’s system_instruction
Use Cases: Role definition, behavior guidelines, response formatting
Example: "You are a helpful customer support agent. Always be polite and provide accurate information."

Model

Field: gemini_realtime_model
Type: String (optional)
Default: models/gemini-3.1-flash-live-preview (used when the field is omitted)
Purpose: Selects which Gemini Live model powers the agent

Available Models:

Model ID	Description
`models/gemini-3.1-flash-live-preview`	Gemini 3.1 Flash Live — current recommended model
`models/gemini-2.0-flash-live-001`	Gemini 2.0 Flash Live — stable baseline

Agent Voice

Field: gemini_realtime_voice
Type: String (optional)
Default: Charon
Purpose: Prebuilt voice used by the agent

Available Voices:

Voice	Description
`Puck`	Upbeat, neutral
`Charon`	Neutral, balanced
`Kore`	Warm, female
`Fenrir`	Deep, male
`Aoede`	Bright, female
`Leda`	Clear, female
`Orus`	Authoritative, male
`Zephyr`	Breezy, neutral

Temperature

Field: gemini_realtime_temperature
Type: Number (optional)
Default: 0.8
Range: 0.0 (deterministic) to 2.0 (very random)
Purpose: Controls randomness in the agent’s responses

Simulation Timeout

Field: simulation_timeout_seconds
Type: Integer (optional)
Default: 900 (15 minutes)
Range: 1 to 1800 (30 minutes)
Purpose: Maximum duration for a single simulated conversation

Setup Instructions

Get a Google AI API key

Create a key at Google AI Studio and confirm it has Gemini Live access. Both AIzaSy… (AI Studio classic) and AQ.… (Express Mode) keys are supported.

Create the agent in Coval

Navigate to app.coval.dev/agents/create and select Gemini Live under Voice-to-Voice.

Configure the agent

Paste your Google AI API key, write your system instructions, pick a model and voice, and adjust temperature if needed.

Run a simulation

Create a test set, launch a simulation, and review the transcript and metric outputs.

Creating via the API

Gemini Live agents can also be created with the v1 Agents API:

curl -X POST https://api.coval.dev/v1/agents \
  -H "x-api-key: YOUR_COVAL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "display_name": "Gemini Live Voice Agent",
    "model_type": "MODEL_TYPE_GEMINI_REALTIME",
    "metadata": {
      "gemini_realtime_api_key": "AIzaSy...",
      "gemini_realtime_instructions": "You are a helpful customer support agent...",
      "gemini_realtime_model": "models/gemini-3.1-flash-live-preview",
      "gemini_realtime_voice": "Charon",
      "gemini_realtime_temperature": 0.8
    }
  }'

How Simulations Work

When you launch a simulation against a Gemini Live agent, Coval:

Opens a connection to Gemini Live using your API key, model, voice, and system instructions
Plays the simulated user’s turns into the live session
Captures the agent’s audio responses and transcribes them
Records the full conversation transcript
Runs your configured metrics against the transcript

Troubleshooting

Invalid or missing API key

Double-check that the key is entered correctly with no extra whitespace
Confirm the key has Gemini Live access enabled in Google AI Studio

Agent is unresponsive or cuts off early

Increase Simulation Timeout if conversations are being truncated
Check that your system instructions don’t tell the agent to end calls immediately

Voice doesn’t sound right

Switch to a different voice from the allow-list — each has a distinct character
Confirm you’re using a native-audio model, which supports the full voice catalog

Temperature validation error

Temperature must be between 0.0 and 2.0. Values outside that range are rejected at save time.

Introduction

Configuration

Observability

Step-by-Step Guides

Use Cases

Connect & Collaborate

Overview

Configuration Requirements

Google AI API Key

Agent System Instructions

Model

Agent Voice

Temperature

Simulation Timeout

Setup Instructions

Creating via the API

How Simulations Work

Troubleshooting

Introduction

Configuration

Observability

Step-by-Step Guides

Use Cases

Connect & Collaborate

​Overview

​Configuration Requirements

​Google AI API Key

​Agent System Instructions

​Model

​Agent Voice

​Temperature

​Simulation Timeout

​Setup Instructions

​Creating via the API

​How Simulations Work

​Troubleshooting

Overview

Configuration Requirements

Google AI API Key

Agent System Instructions

Model

Agent Voice

Temperature

Simulation Timeout

Setup Instructions

Creating via the API

How Simulations Work

Troubleshooting