Instant Voice Cloning
Clone any voice with just 3 to 5 seconds of reference audio. Zero-shot: no fine-tuning needed.
Real-time Streaming
Audio playback starts before the sentence finishes. Under 300 ms latency on modern CPUs.
Fully Offline
Runs entirely on-device. No cloud API, no internet required after model download.
Multiple Models
From the 0.5B-parameter model (best quality) to Q4-quantized variants (fastest). Pick what fits your hardware.
Vietnamese + English
Native Vietnamese with seamless code-switching to English within the same sentence.
Flexible Backends
PyTorch, GGUF, LMDeploy, Remote API, Intel XPU. One SDK, many deployment options.
Simple Python API
A few lines to generate speech. Supports voice cloning, streaming, batch processing, and more.
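from vieneu import Vieneu
# Create the TTS engine with default settings
tts = Vieneu()
# Synthesize a Vietnamese sentence ("Hello!")
audio = tts.infer(text="Xin chào bạn!")
# Write the generated audio to a WAV file
tts.save(audio, "output.wav")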
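The same two calls can be looped to process several sentences in one go. The sketch below is illustrative only: `synthesize_batch`, the `utt_NNN.wav` naming scheme, and the output directory are hypothetical and not part of the SDK, and the SDK may well ship its own dedicated batch method. The helper relies solely on `infer(text=...)` and `save(audio, path)` as shown above, and takes the engine as an argument so it can be tried with any object exposing those two methods.

```python
from pathlib import Path


def synthesize_batch(tts, texts, out_dir):
    """Synthesize each text with `tts` and save one WAV file per input.

    `tts` is assumed to expose infer(text=...) and save(audio, path),
    as in the quickstart. This helper is a hypothetical convenience
    wrapper, not an SDK function.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    paths = []
    for i, text in enumerate(texts):
        audio = tts.infer(text=text)
        # Zero-padded index keeps the files in input order when sorted.
        path = out_dir / f"utt_{i:03d}.wav"
        tts.save(audio, str(path))
        paths.append(path)
    return paths
```

With the `tts` object from the quickstart, calling `synthesize_batch(tts, ["Xin chào bạn!", "Hello there!"], "out")` would be expected to write `out/utt_000.wav` and `out/utt_001.wav`. Reusing a single engine instance across the loop avoids reloading the model for every sentence.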