
Introduction

ElevenAgents is ElevenLabs’ conversational AI framework that enables you to build interactive AI agents with real-time audio capabilities. Create voice-enabled agents that can have natural conversations, call custom tools, and provide seamless audio experiences.

Key Features

Real-Time Audio

Full-duplex audio conversations with low-latency voice input and output

Custom Tools

Register synchronous and asynchronous tools that agents can call during conversations

Event Callbacks

Rich callbacks for agent responses, user transcripts, and latency measurements

Flexible Configuration

Customize conversation config, dynamic variables, and platform settings

Quick Start

Here’s a minimal example to get started with ElevenAgents:
from elevenlabs.client import ElevenLabs
from elevenlabs.conversational_ai.conversation import Conversation
from elevenlabs.conversational_ai.default_audio_interface import DefaultAudioInterface

elevenlabs = ElevenLabs(
    api_key="YOUR_API_KEY",
)

# Create audio interface for real-time audio input/output
audio_interface = DefaultAudioInterface()

# Create conversation
conversation = Conversation(
    client=elevenlabs,
    agent_id="your-agent-id",
    requires_auth=True,
    audio_interface=audio_interface,
)

# Start the conversation
conversation.start_session()

# The session runs in a background thread; end it when you're done:
conversation.end_session()

Core Components

Conversation

The Conversation class manages the WebSocket connection to your agent and handles the conversation lifecycle. It supports both synchronous and asynchronous implementations.

Learn More

Explore conversation management and callbacks
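Callback handlers are plain Python functions passed when the Conversation is constructed. A minimal sketch of the pattern — the handlers below are real code, while the `callback_agent_response` / `callback_user_transcript` parameter names in the wiring comment are assumptions to verify against the callbacks page:

```python
# Plain-Python callback handlers; each receives text emitted by the SDK.
transcript: list[str] = []

def on_agent_response(text: str) -> None:
    # Invoked when the agent produces a response.
    transcript.append(f"Agent: {text}")

def on_user_transcript(text: str) -> None:
    # Invoked when the user's speech is transcribed.
    transcript.append(f"User: {text}")

# Wiring sketch (parameter names are assumptions; see the callbacks page):
# conversation = Conversation(
#     client=elevenlabs,
#     agent_id="your-agent-id",
#     requires_auth=True,
#     audio_interface=audio_interface,
#     callback_agent_response=on_agent_response,
#     callback_user_transcript=on_user_transcript,
# )
```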

Audio Interface

The AudioInterface provides an abstraction for handling audio input and output. The SDK includes DefaultAudioInterface which uses PyAudio for real-time audio streaming.

Learn More

Learn about audio interfaces and custom implementations
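A custom audio interface replaces PyAudio with your own capture and playback. The sketch below is plain Python showing the assumed shape of such an interface — the method names (`start`, `stop`, `output`, `interrupt`) are inferred from the role DefaultAudioInterface plays and should be checked against the audio page:

```python
from typing import Callable, Optional

class BufferedAudioInterface:
    """Sketch of a custom audio interface that buffers agent audio
    instead of playing it. Method names are an assumed shape: start
    capturing input, stop, play output, and interrupt playback."""

    def __init__(self) -> None:
        self.output_chunks: list[bytes] = []
        self._input_callback: Optional[Callable[[bytes], None]] = None

    def start(self, input_callback: Callable[[bytes], None]) -> None:
        # The SDK hands us a callback to feed user audio chunks into.
        self._input_callback = input_callback

    def stop(self) -> None:
        self._input_callback = None

    def output(self, audio: bytes) -> None:
        # Agent audio arrives here; a real implementation plays it.
        self.output_chunks.append(audio)

    def interrupt(self) -> None:
        # Drop queued audio when the user barges in.
        self.output_chunks.clear()
```

In a real implementation, `start` would open a microphone stream and pump captured chunks through `input_callback`, and `output` would write to the speaker.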

Client Tools

ClientTools allows you to register custom functions that your AI agent can call during conversations, supporting both sync and async operations.

Learn More

Discover how to register and use tools
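A tool is just a function that receives the parameters the agent supplies and returns a result. The handler below is plain Python; the registration lines assume ClientTools lives alongside Conversation and exposes a `register` method, which should be verified against the tools page:

```python
# A tool handler: receives a dict of parameters from the agent and
# returns a string sent back to the agent as the tool result.
def add_numbers(parameters: dict) -> str:
    result = parameters["a"] + parameters["b"]
    return str(result)

# Registration sketch (import path and method names are assumptions):
# from elevenlabs.conversational_ai.conversation import ClientTools
# client_tools = ClientTools()
# client_tools.register("add_numbers", add_numbers)
# conversation = Conversation(..., client_tools=client_tools)
```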

Agents API

Create, manage, and configure AI agents using the Agents API. Define conversation behavior, platform settings, and workflows.

Learn More

Read the Agents API reference

Architecture

ElevenAgents uses a WebSocket-based architecture for real-time bidirectional communication:
  1. Audio Interface captures user audio and plays agent responses
  2. Conversation manages the WebSocket connection and message handling
  3. Client Tools execute custom functions when called by the agent
  4. Agent processes audio, generates responses, and calls tools as needed
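Steps 2–4 above can be sketched conceptually in plain Python — this is illustrative only, not the real wire protocol or SDK API:

```python
# Conceptual sketch: the conversation layer receives a tool call from
# the agent and dispatches it to a locally registered handler.
tools = {
    "get_temperature": lambda params: f"21 C in {params['city']}",
}

def handle_agent_message(message: dict) -> str:
    # In the real SDK this message arrives over the WebSocket;
    # here it is just a dict with a tool name and parameters.
    handler = tools[message["tool_name"]]
    return handler(message["parameters"])

result = handle_agent_message(
    {"tool_name": "get_temperature", "parameters": {"city": "Paris"}}
)
```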

Next Steps

Create an Agent

Learn how to create and configure AI agents

Start Conversations

Manage conversation sessions and callbacks

Register Tools

Add custom functionality to your agents

Audio Setup

Configure audio input and output
