Overview
Hathora is a hosting provider for several voice AI models, all of which can be used through the single HathoraSTTService.
Hathora STT API Reference
Pipecat’s API methods for Hathora-hosted STT models
Example Implementation
Complete example using Hathora-hosted models
Hathora Models Documentation
Official Hathora documentation and features
Installation
To use Hathora services, install the required dependencies:
Prerequisites
Hathora Account Setup
Before using Hathora STT services, you need:
- Hathora Account: Sign up at Hathora Models Console
- API Key: Generate an API token from your Tokens page
Hathora Model Specifier
The HathoraSTTService accepts a model: str parameter, which specifies the model you would like to use.
You can find available specifiers at models.hathora.dev.
Configuration
HathoraSTTService
- Model to use for transcription. Find available models at models.hathora.dev.
- Hathora API key for authentication. Falls back to the HATHORA_API_KEY environment variable.
- Base API URL for the Hathora STT service.
- Audio sample rate in Hz. When None, uses the pipeline’s configured sample rate.
- Optional configuration parameters. See InputParams below.
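The two fallbacks described above (the API key read from HATHORA_API_KEY, and the sample rate taken from the pipeline when None) can be illustrated with a minimal sketch. The helper names `resolve_api_key` and `resolve_sample_rate` are hypothetical and are not part of Pipecat. Raising an error when no key is found is an assumption of this sketch, not documented behavior.

```python
import os
from typing import Optional


def resolve_api_key(api_key: Optional[str] = None) -> str:
    """Return the explicit key, else the HATHORA_API_KEY environment variable."""
    key = api_key or os.environ.get("HATHORA_API_KEY")
    if not key:
        # Assumption of this sketch: fail early when no key is available.
        raise ValueError("No Hathora API key: pass one or set HATHORA_API_KEY")
    return key


def resolve_sample_rate(sample_rate: Optional[int], pipeline_sample_rate: int) -> int:
    """Return the explicit rate, else the pipeline's configured rate."""
    return pipeline_sample_rate if sample_rate is None else sample_rate
```

An explicit argument always wins. The environment variable and the pipeline rate are consulted only when nothing is passed.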
InputParams
| Parameter | Type | Default | Description |
|---|---|---|---|
| language | str | None | Language code, if supported by the selected model. |
| config | list[ConfigOption] | None | Additional model-specific configuration options. Refer to Hathora docs for supported options per model. Each ConfigOption has a name and value field. |
Usage
Basic Setup
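A minimal sketch of constructing the service with only a model specifier and an API key. The import path `pipecat.services.hathora.stt` and the `api_key` keyword are assumptions based on Pipecat's usual service layout, and the helper `build_hathora_stt` is hypothetical. Check the API reference before relying on either.

```python
import os


def build_hathora_stt(model: str):
    """Create the STT service for the given Hathora model specifier."""
    # Assumed import path, following Pipecat's services/<provider>/stt layout.
    # Imported lazily so this module loads even without the Hathora dependencies.
    from pipecat.services.hathora.stt import HathoraSTTService

    return HathoraSTTService(
        model=model,  # a specifier listed at models.hathora.dev
        api_key=os.getenv("HATHORA_API_KEY"),  # assumed keyword name
    )
```

The returned service would then be placed in a pipeline like any other Pipecat STT service.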
With Language and Config Options
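The shape of the language and config options can be sketched with stand-in dataclasses that mirror the InputParams table. These classes are illustrative stand-ins, not the real Pipecat types. The option name and value shown are hypothetical, and the `value` type is left open because the supported options vary by model.

```python
from dataclasses import dataclass
from typing import Any, List, Optional


@dataclass
class ConfigOption:
    """Stand-in for ConfigOption: a name field and a value field."""
    name: str
    value: Any


@dataclass
class InputParams:
    """Stand-in mirroring the InputParams table; both fields default to None."""
    language: Optional[str] = None
    config: Optional[List[ConfigOption]] = None


params = InputParams(
    language="en",
    # Hypothetical option; see the Hathora docs for what each model accepts.
    config=[ConfigOption(name="example_option", value="example_value")],
)
```

With the real service, an object of this shape would presumably be supplied as the optional configuration parameters described under Configuration.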
Notes
- Segmented transcription: HathoraSTTService extends SegmentedSTTService, meaning it processes complete audio segments (after VAD detects the user has stopped speaking) rather than streaming audio in real time.
- Model-specific features: Configuration options and language support vary by model. Check the Hathora documentation for details on each model’s capabilities.
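The segmented behavior in the first note can be pictured with a small buffer that collects audio while the user is speaking and transcribes the whole segment once speech stops. This is a conceptual sketch only: `SegmentBuffer` and its method names are hypothetical and do not reflect how SegmentedSTTService is implemented.

```python
from typing import Callable, List, Optional


class SegmentBuffer:
    """Collects audio between speech start and stop, then transcribes it once."""

    def __init__(self, transcribe: Callable[[bytes], str]) -> None:
        self._transcribe = transcribe
        self._chunks: List[bytes] = []
        self._speaking = False

    def on_speech_started(self) -> None:
        # VAD reports speech: begin a fresh segment.
        self._chunks.clear()
        self._speaking = True

    def on_audio(self, chunk: bytes) -> None:
        # Audio outside a speech segment is ignored.
        if self._speaking:
            self._chunks.append(chunk)

    def on_speech_stopped(self) -> Optional[str]:
        # VAD reports silence: send the complete segment in one request.
        if not self._speaking:
            return None
        self._speaking = False
        segment = b"".join(self._chunks)
        self._chunks.clear()
        return self._transcribe(segment) if segment else None


buffer = SegmentBuffer(lambda audio: f"{len(audio)} bytes")
buffer.on_speech_started()
buffer.on_audio(b"\x00\x01")
buffer.on_audio(b"\x02\x03")
result = buffer.on_speech_stopped()
```

Because nothing is sent until the segment is complete, the transcript arrives only after the speaker stops talking, unlike a streaming service that returns partial results as audio comes in.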