CHIM Text-to-Speech Services

This page documents the current text-to-speech services still actively used by HerikaServer for CHIM. The split below follows the current quickstart flow: recommended first, then the other valid services.

Other Services

MeloTTS

Local text-to-speech that maps strongly to Skyrim voice types. This is useful when you want a voice-type oriented local setup instead of the XTTS-style cloned voice flow.

Mimic3

Local HTTP text-to-speech service with direct voice, rate, and volume controls. Useful if you want simple local synthesis without using the XTTS-family flow.

Piper Text-to-Speech

Local Piper endpoint integration. Good for self-hosted offline speech if you are comfortable downloading and managing voice models manually.

Kokoro

Local Kokoro endpoint integration. It appears as a valid current text-to-speech driver in the connector system and uses a local HTTP endpoint.

KoboldCPP Text-to-Speech

Local text-to-speech path exposed through the KoboldCPP extra text-to-speech endpoint. Use it only if your local KoboldCPP stack is already set up for speech.

Zonos

Zonos Gradio endpoint integration for users who want that specific self-hosted model family. It supports a wide language list and CHIM voice cache style voice ids in the current schema.

xVASynth

Legacy-friendly local Skyrim voice workflow tied to xVASynth models. Use this only if you already know you want the xVASynth route.

Azure

Hosted Azure speech synthesis with mood/style support, voice selection, and prosody controls. This is still one of the more configurable cloud voice options in the CHIM stack.

Azure Text-to-Speech

ElevenLabs

Hosted ElevenLabs speech with model, stability, similarity, style, speed, and optional v3 audio tag controls.

ElevenLabs

OpenAI Text-to-Speech

Hosted OpenAI speech synthesis using the current audio speech endpoint. Useful if you want a direct OpenAI voice path instead of a separate cloud text-to-speech provider.

OpenAI Text-to-Speech

Deepgram

Hosted Deepgram text-to-speech. It is still a valid current CHIM service, but it is more of an alternative pick than a main quickstart choice.

Deepgram Text-to-Speech

CHIM Text-to-Speech Services

Recommended

PocketTTS

Chatterbox

CHIM XTTS

Inworld

Cartesia