Hume

Overview

Hume provides expressive text-to-speech synthesis through its Octave models, which adapt pronunciation, pitch, speed, and emotional style to context. HumeTTSService offers real-time streaming with word-level timestamps, custom voice support, and synthesis controls such as acting instructions, speed adjustment, and trailing silence.

Hume TTS API Reference: Pipecat’s API methods for Hume TTS integration
Example Implementation: Complete example with word timestamps and interruption handling
Hume Documentation: Official Hume TTS API documentation and features
Voice Library: Browse and manage available voices

Installation

To use Hume services, install the required dependencies:

pip install "pipecat-ai[hume]"

Prerequisites

Hume Account Setup

Before using Hume TTS services, you need:

Hume Account: Sign up at Hume AI
API Key: Generate an API key from your account dashboard
Voice Selection: Choose voice IDs from the voice library or create custom voices

Required Environment Variables

HUME_API_KEY: Your Hume API key for authentication
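
A minimal sketch of checking for the key at startup, so that a missing variable fails early with a clear message instead of at the first synthesis request. The helper name `require_hume_api_key` is hypothetical; it is not part of Pipecat or the Hume SDK.

```python
import os
from typing import Mapping


def require_hume_api_key(env: Mapping[str, str] = os.environ) -> str:
    """Return the Hume API key from the environment, or raise if it is missing.

    Hypothetical helper for illustration; not a Pipecat or Hume API.
    """
    key = env.get("HUME_API_KEY", "").strip()
    if not key:
        raise RuntimeError("HUME_API_KEY is not set; export it before starting the bot.")
    return key
```

The returned value can then be passed as the api_key argument described under Configuration.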

Configuration

HumeTTSService

Parameter	Type	Default	Description
`api_key`	`str`	`None`	Hume API key. If omitted, reads the HUME_API_KEY environment variable.
`voice_id`	`str`	required	ID of the voice to use. Only voice IDs are supported; voice names are not.
`sample_rate`	`int`	`48000`	Output sample rate for PCM frames. Hume TTS streams at 48kHz.
`params`	`InputParams`	`None`	Runtime-configurable synthesis controls. See InputParams below.

InputParams

Synthesis parameters that can be set at initialization via the params constructor argument, or changed at runtime via UpdateSettingsFrame.

Parameter	Type	Default	Description
`description`	`str`	`None`	Natural-language acting directions (up to 100 characters).
`speed`	`float`	`None`	Speaking-rate multiplier (0.5-2.0).
`trailing_silence`	`float`	`None`	Seconds of silence to append at the end (0-5).
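
The documented ranges can be checked before values are handed to the service. The sketch below is a hypothetical pre-flight helper (`check_input_params` is not a Pipecat function), and it assumes the limits in the table are inclusive. Whether HumeTTSService performs its own validation is not stated here.

```python
from typing import List, Optional


def check_input_params(
    description: Optional[str] = None,
    speed: Optional[float] = None,
    trailing_silence: Optional[float] = None,
) -> List[str]:
    """Return a list of problems; an empty list means all values are in range.

    Hypothetical helper; limits are taken from the InputParams table and
    assumed to be inclusive.
    """
    problems: List[str] = []
    if description is not None and len(description) > 100:
        problems.append(f"description is {len(description)} characters; the limit is 100")
    if speed is not None and not 0.5 <= speed <= 2.0:
        problems.append(f"speed {speed} is outside 0.5-2.0")
    if trailing_silence is not None and not 0.0 <= trailing_silence <= 5.0:
        problems.append(f"trailing_silence {trailing_silence} is outside 0-5 seconds")
    return problems
```

Parameters left as None are skipped, matching their None defaults in the table.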

Usage

Basic Setup

import os

from pipecat.services.hume import HumeTTSService

tts = HumeTTSService(
    api_key=os.getenv("HUME_API_KEY"),
    voice_id="your-voice-id",
)

With Acting Directions

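import os

from pipecat.services.hume import HumeTTSService

tts = HumeTTSService(
    api_key=os.getenv("HUME_API_KEY"),
    voice_id="your-voice-id",
    # Optional synthesis controls; see the InputParams table for valid ranges.
    params=HumeTTSService.InputParams(
        description="Speak warmly and reassuringly",
        speed=1.1,
        trailing_silence=0.5,
    ),
)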

Updating Settings at Runtime

Voice and synthesis parameters can be changed mid-conversation using UpdateSettingsFrame:

from pipecat.frames.frames import UpdateSettingsFrame

# `task` is assumed to be your running PipelineTask; call this from async code.
await task.queue_frame(
    UpdateSettingsFrame(
        settings={
            "tts": {
                "speed": 1.3,
                "description": "Speak with excitement",
            }
        }
    )
)

Notes

Fixed sample rate: Hume TTS streams at 48kHz. Setting a different sample_rate will produce a warning.
Word timestamps: The service provides word-level timestamps for synchronized text display. Timestamps are tracked cumulatively across utterances within a turn.
Description versions: When description is provided, the service uses Hume API version "1". Without a description, it uses the newer version "2".
Audio buffering: Audio is buffered internally until a minimum chunk size is reached before being pushed as frames, reducing audio glitches.
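
To make the buffering and cumulative-timestamp notes concrete, here is a standalone sketch of both ideas. It is not the service's actual implementation: the class and function names, the 4-byte threshold, and the assumption that the offset equals the duration of earlier utterances in the turn are all illustrative.

```python
from typing import List, Optional, Tuple


class ChunkBuffer:
    """Accumulate audio bytes and release them once min_bytes are available."""

    def __init__(self, min_bytes: int) -> None:
        self._min_bytes = min_bytes
        self._pending = bytearray()

    def add(self, data: bytes) -> Optional[bytes]:
        """Buffer data; return a chunk once the threshold is reached, else None."""
        self._pending.extend(data)
        if len(self._pending) < self._min_bytes:
            return None  # not enough audio yet, keep buffering
        return self.flush()

    def flush(self) -> Optional[bytes]:
        """Return whatever is buffered (for example at end of stream), or None."""
        if not self._pending:
            return None
        chunk = bytes(self._pending)
        self._pending.clear()
        return chunk


def shift_timestamps(
    words: List[Tuple[str, float]], offset_secs: float
) -> List[Tuple[str, float]]:
    """Shift per-utterance word start times so they are relative to the turn."""
    return [(word, start + offset_secs) for word, start in words]


# Two utterances in one turn; the first is assumed to last 1.5 seconds,
# so the second utterance's words are offset by that amount.
first = [("Hello", 0.0), ("there", 0.5)]
second = shift_timestamps([("How", 0.0), ("are", 0.25)], offset_secs=1.5)
```

In a real pipeline the threshold would be chosen in bytes of PCM audio rather than the toy value used here.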
