VieNeu-TTS

Vietnamese Text-to-Speech with Instant Voice Cloning

🎙️

Instant Voice Cloning

Clone any voice from just 3-5 seconds of reference audio. Zero-shot: no fine-tuning needed.
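Before passing a clip in as reference audio, it can help to confirm that it is long enough. The sketch below uses only Python's standard-library `wave` module and assumes the clip is an uncompressed PCM WAV file. The helper names and the 3-second threshold (taken from the "3-5 seconds" figure above) are illustrative and are not part of the VieNeu SDK.

```python
import wave


def clip_duration_seconds(path: str) -> float:
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(path, "rb") as wf:
        return wf.getnframes() / wf.getframerate()


def has_enough_reference_audio(path: str, min_seconds: float = 3.0) -> bool:
    """True if the clip is at least `min_seconds` long.

    The 3.0 default mirrors the lower end of the "3-5 seconds" guidance;
    it is an assumption, not a limit enforced by the library.
    """
    return clip_duration_seconds(path) >= min_seconds
```

A clip that fails this check may still be usable, but it falls short of the length the page recommends.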

Real-time Streaming

Playback starts before the whole sentence has been synthesized, with under 300 ms latency on modern CPUs.
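One way to read the latency figure is as time to first chunk: how long the caller waits before the first piece of audio arrives, as opposed to the time for the whole sentence. The sketch below measures both for any iterator of audio chunks. `fake_stream` is a hypothetical stand-in that sleeps and yields placeholder bytes; it is not the VieNeu streaming API, whose interface is not shown on this page.

```python
import time
from typing import Iterator, Tuple


def fake_stream(text: str, chunk_delay_s: float = 0.01) -> Iterator[bytes]:
    """Hypothetical stand-in for a streaming synthesizer: one chunk per word."""
    for word in text.split():
        time.sleep(chunk_delay_s)   # pretend to synthesize this chunk
        yield word.encode("utf-8")  # placeholder bytes, not real audio


def time_to_first_chunk(chunks: Iterator[bytes]) -> Tuple[float, float, int]:
    """Consume `chunks`; return (seconds to first chunk, total seconds, chunk count)."""
    start = time.perf_counter()
    first = None
    count = 0
    for _ in chunks:
        if first is None:
            first = time.perf_counter() - start
        count += 1
    total = time.perf_counter() - start
    return (first if first is not None else total), total, count


if __name__ == "__main__":
    first, total, n = time_to_first_chunk(fake_stream("Xin chào bạn rất vui"))
    print(f"first chunk after {first * 1000:.0f} ms, {n} chunks in {total * 1000:.0f} ms")
```

With a real streaming source, playback could begin as soon as the first chunk arrives, which is why the first-chunk time matters more to perceived latency than the total.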

🔒

Fully Offline

Runs entirely on-device. No cloud API, no internet required after model download.

🧠

Multiple Models

From the 0.5B model (best quality) to Q4-quantized variants (fastest). Pick what fits your hardware.
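As a rough guide to what fits your hardware, weight storage scales with parameter count times bits per weight. The arithmetic below assumes, purely for illustration, a 16-bit baseline and a Q4 variant of the same 0.5B-parameter size at an idealized 4 bits per weight; the page does not state which model the Q4 build is derived from. Real quantized formats carry extra per-block metadata, and runtime memory also includes activations and caches, so treat these numbers as lower bounds.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in gigabytes (1 GB = 10**9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


N_PARAMS = 0.5e9  # "0.5B" from the model list; sizes of other variants are not given

fp16_gb = weight_memory_gb(N_PARAMS, 16)  # assumed 16-bit baseline
q4_gb = weight_memory_gb(N_PARAMS, 4)     # idealized 4 bits per weight

print(f"16-bit weights: ~{fp16_gb:.2f} GB")
print(f"4-bit weights:  ~{q4_gb:.2f} GB")
```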

🌏

Vietnamese + English

Native Vietnamese with seamless code-switching to English within the same sentence.

🔧

Flexible Backends

PyTorch, GGUF, LMDeploy, Remote API, Intel XPU. One SDK, many deployment options.

Simple Python API

Three lines to generate speech. Supports voice cloning, streaming, batch processing, and more.

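from vieneu import Vieneu

# Create the TTS engine with its default settings
tts = Vieneu()
# Synthesize speech for a Vietnamese sentence ("Hello!")
audio = tts.infer(text="Xin chào bạn!")
# Write the result to a WAV file
tts.save(audio, "output.wav")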
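The same two calls extend naturally to several sentences. The sketch below is a plain loop that uses only `infer(text=...)` and `save(audio, path)` as shown above; it is not the SDK's own batch-processing feature, whose interface is not shown on this page. The helper name `synthesize_all` and the `line_001.wav` naming scheme are illustrative choices.

```python
from pathlib import Path
from typing import Iterable, List


def synthesize_all(tts, sentences: Iterable[str], out_dir: str) -> List[Path]:
    """Synthesize each sentence to its own WAV file and return the paths.

    `tts` is any object exposing infer(text=...) and save(audio, path),
    such as the Vieneu instance created in the snippet above.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    paths: List[Path] = []
    for i, sentence in enumerate(sentences, start=1):
        audio = tts.infer(text=sentence)
        path = out / f"line_{i:03d}.wav"
        tts.save(audio, str(path))
        paths.append(path)
    return paths
```

For example, `synthesize_all(tts, ["Xin chào bạn!", "Hẹn gặp lại."], "out")` would write `out/line_001.wav` and `out/line_002.wav`.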