Voice & Audio Settings

Waifu Companion supports multiple text-to-speech engines, real-time lip-sync, audio preloading, and various audio features.

Text-to-Speech (TTS)

Three TTS providers are available, each can be independently enabled/disabled:

TikTok TTS (Primary)

Free TTS API with the largest voice selection. Enable in Settings > Voice > Enable TikTok TTS.

Voices (35+):

Kokoro TTS (Local)

High-quality, offline text-to-speech powered by ONNX Runtime. Runs locally in the browser using WebGPU (preferred) or WASM (CPU fallback).

Features:

Voices (10+):

Browser SpeechSynthesis

Built-in browser TTS with no rate limits or dependencies. Uses the system's native speech voices.

Voice Language Overrides

The app maps unsupported language codes to a base language whose voices should be used. This covers 100+ language codes:

Real-Time Lip-Sync

During TTS playback, the Live2D model's mouth animates based on audio frequency analysis:

TTS Preloading

The TTS queue manager pre-fetches the next message's audio in the background:

Speech-to-Text (STT)

Use your microphone to speak to your companion:

  1. Click the microphone icon in the chat input
  2. Speak your message
  3. Click again to stop recording
  4. Message is automatically transcribed and sent

Powered by the Web Speech API. Requires microphone permissions.

Internet Radio

Stream music from Listen.moe:

  1. Go to Settings > Audio Settings
  2. Toggle the radio on/off
  3. Volume can be adjusted in the same section

Sound Effects

UI interactions produce subtle sound effects powered by Tone.js:

Audio Settings

TTS Queue

Multi-sentence AI responses are split into individual sentences for TTS: