Documentation | VieNeu-TTS

🎙️

Clone any voice with just 3-5 seconds of reference audio. Zero-shot — no fine-tuning needed.

⚡

Audio playback starts before the sentence finishes. Under 300ms latency on modern CPUs.

🔒

Runs entirely on-device. No cloud API, no internet required after model download.

🧠

From 0.5B (best quality) to Q4 quantized (extreme speed). Pick what fits your hardware.

🌏

Native Vietnamese with seamless code-switching to English within the same sentence.

🔧

PyTorch, GGUF, LMDeploy, Remote API, Intel XPU. One SDK, many deployment options.

Simple Python API

Three lines to generate speech. Supports voice cloning, streaming, batch processing, and more.

from vieneu import Vieneu

tts = Vieneu()
audio = tts.infer(text="Xin chào bạn!")
tts.save(audio, "output.wav")