Overview
Gemini Live agents are voice-to-voice agents where Coval connects directly to Google’s Gemini Live API on your behalf. You provide an API key, an agent prompt, a voice, and a model — Coval handles everything else. No webhook, no SIP trunk, and no self-hosted server required. This makes Gemini Live the fastest way to evaluate a Gemini-based agent configuration: tune the prompt, change the voice, bump the temperature, and re-run the same test set to see how the change affects behavior.Use this connection when you want to evaluate an agent built directly on Gemini Live. If your production agent runs on Pipecat, LiveKit, or a custom stack, connect it through the matching integration instead.
Configuration Requirements
The only required field for aMODEL_TYPE_GEMINI_REALTIME agent is metadata.gemini_realtime_api_key. Every other field below is optional — if omitted, Coval applies the default shown.
Google AI API Key
- Field:
gemini_realtime_api_key - Type: String (required)
- Purpose: Authenticates Coval’s connection to Gemini Live
- Format: Any valid Google AI key. Supports both
AIzaSy…(AI Studio) andAQ.…(Express Mode) prefixes. - Security: Stored encrypted and handled securely
Authentication errors (invalid key, missing Gemini Live access) surface the first time you run a simulation, not at agent save time. If your first simulation fails with an auth error, double-check the key has Gemini Live enabled.
Agent System Instructions
- Field:
gemini_realtime_instructions - Type: String (optional)
- Default:
""(model default instructions) - Purpose: System prompt sent to Gemini Live as the agent’s
system_instruction - Use Cases: Role definition, behavior guidelines, response formatting
- Example:
"You are a helpful customer support agent. Always be polite and provide accurate information."
Model
- Field:
gemini_realtime_model - Type: String (optional)
- Default:
models/gemini-3.1-flash-live-preview(used when the field is omitted) - Purpose: Selects which Gemini Live model powers the agent
| Model ID | Description |
|---|---|
models/gemini-3.1-flash-live-preview | Gemini 3.1 Flash Live — current recommended model |
models/gemini-2.0-flash-live-001 | Gemini 2.0 Flash Live — stable baseline |
Agent Voice
- Field:
gemini_realtime_voice - Type: String (optional)
- Default:
Charon - Purpose: Prebuilt voice used by the agent
| Voice | Description |
|---|---|
Puck | Upbeat, neutral |
Charon | Neutral, balanced |
Kore | Warm, female |
Fenrir | Deep, male |
Aoede | Bright, female |
Leda | Clear, female |
Orus | Authoritative, male |
Zephyr | Breezy, neutral |
Temperature
- Field:
gemini_realtime_temperature - Type: Number (optional)
- Default:
0.8 - Range:
0.0(deterministic) to2.0(very random) - Purpose: Controls randomness in the agent’s responses
Simulation Timeout
- Field:
simulation_timeout_seconds - Type: Integer (optional)
- Default:
900(15 minutes) - Range:
1to1800(30 minutes) - Purpose: Maximum duration for a single simulated conversation
Setup Instructions
Get a Google AI API key
Create a key at Google AI Studio and confirm it has Gemini Live access. Both
AIzaSy… (AI Studio classic) and AQ.… (Express Mode) keys are supported.Create the agent in Coval
Navigate to app.coval.dev/agents/create and select Gemini Live under Voice-to-Voice.
Configure the agent
Paste your Google AI API key, write your system instructions, pick a model and voice, and adjust temperature if needed.
Creating via the API
Gemini Live agents can also be created with the v1 Agents API:How Simulations Work
When you launch a simulation against a Gemini Live agent, Coval:- Opens a connection to Gemini Live using your API key, model, voice, and system instructions
- Plays the simulated user’s turns into the live session
- Captures the agent’s audio responses and transcribes them
- Records the full conversation transcript
- Runs your configured metrics against the transcript
Troubleshooting
Invalid or missing API key- Double-check that the key is entered correctly with no extra whitespace
- Confirm the key has Gemini Live access enabled in Google AI Studio
- Increase Simulation Timeout if conversations are being truncated
- Check that your system instructions don’t tell the agent to end calls immediately
- Switch to a different voice from the allow-list — each has a distinct character
- Confirm you’re using a
native-audiomodel, which supports the full voice catalog
- Temperature must be between
0.0and2.0. Values outside that range are rejected at save time.

