uncloseai.
Silero TTS
Self-host — Not on the public endpoint. Clone the repo to use this engine.
What It Does
148 voices across 5 languages, all running on CPU. English, Russian, German, Spanish, and French — each language with its own set of distinct speakers.
The models are small and efficient. They download automatically the first time you use them, no manual setup required. Output is high-quality 48kHz audio.
This is the go-to engine when you need multilingual support without a GPU. The Silero team actively maintains it, and we keep it integrated and tested in the raccoon dumpster.
Example
Once self-hosted and enabled, it works through the same OpenAI-compatible API:
from openai import OpenAI
client = OpenAI(
api_key="not-needed",
base_url="http://localhost:8000/v1"
)
# Silero uses native voice names like en_0, en_50, ru_0, de_0
client.audio.speech.create(
model="tts-1-silero",
voice="en_50",
input="One hundred and forty-eight voices across five languages, all running on CPU. No GPU required. The raccoons found this one and it just works."
).stream_to_file("silero.mp3")
Available Languages
- English: ~120 voices (en_0 through en_117)
- Russian: ~10 voices
- German: ~10 voices
- Spanish: ~5 voices
- French: ~5 voices
Technical Details
- Voices: 148 across 5 languages
- Sample rate: 48kHz
- Runtime: PyTorch (torch.hub)
- Hardware: CPU only, no GPU needed
- Model size: ~50-100MB per language
- Upstream: Silero TTS, actively maintained
Self-Hosting
Models auto-download on first use. Enable by adding Silero voices to your config:
git clone https://git.unturf.com/engineering/unturf/uncloseai-speech.git
cd uncloseai-speech
make deploy
make voices-silero
Add Silero voices to voice_to_speaker.default.yaml and restart.