What it does
Core capabilities at a glance
- Glow TTS
- Hifigan
- Melgan
- Multi Speaker TTS
- Pytorch
- Speaker Encoder
- Speaker Encodings
- Speech
Deep dive
The full breakdown - performance, comparisons, and setup
TTS
TTS is a speech (TTS/STT) tool - 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production.
Overview
- 📣 ⓍTTSv2 is here with 16 languages and better performance across the board. - 📣 ⓍTTS fine-tuning code is out. Check the example recipes. - 📣 ⓍTTS can now stream with
🐸TTS is a library for advanced Text-to-Speech generation.
🛠️ Tools for training new models and fine-tuning existing models in any language.
Please use our dedicated channels for questions and discussion. Help is much more valuable if it's shared publicly so that more people can benefit from it.
[github issue tracker]: https://github.com/coqui-ai/tts/issues [github discussions]: https://github.com/coqui-ai/TTS/discussions [discord]: https://discord.gg/5eXr5seRrv [Tutorials and Examples]: https://github.com/coqui-ai/TTS/wiki/TTS-Notebooks-and-Tutorials
Underlined "TTS*" and "Judy*" are internal 🐸TTS models that are not released open-source. They are here to show the potential. Models prefixed with a dot (.Jofish .Abe and .Janice) are real human voices.
TTS is open-source, written primarily in Python, with 45,509 GitHub stars under the MPL-2.0 license. The latest release is v0.22.0 (2023-12-12).
Key capabilities
From the project's documentation:
- 📣 ⓍTTSv2 is here with 16 languages and better performance across the board.
- 📣 ⓍTTS fine-tuning code is out. Check the example recipes.
- 📣 ⓍTTS can now stream with <200ms latency.
- 📣 You can use ~1100 Fairseq models with 🐸TTS.
- 📣 🐸TTS now supports 🐢Tortoise with faster inference. Docs
- High-performance Deep Learning models for Text2Speech tasks.
Install
A quick way to get started (always check the official docs for the latest):
pip install TTSHow it fits a local-AI stack
TTS runs on your own hardware, so pair it with a model and a GPU sized to your needs. Use the VRAM calculator to pick a model that fits your card, and see what you can run for hardware guidance. Related speech (TTS/STT) tools in the directory:
Sources
- Source code & docs: coqui-ai/TTS
- Official website: http://coqui.ai
Stats from GitHub, 2026-06-08.
Frequently asked
Quick answers to common questions
What is TTS?
TTS is a tts-stt tool for local AI workloads. 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Is TTS free and open source?
Yes, TTS has 45,508 GitHub stars and is licensed under MPL-2.0. You can self-host it for free on docker.
What platforms does TTS support?
TTS runs on docker.
What hardware do I need for TTS?
The hardware requirements depend on which models you run. Check our hardware directory for compatible GPUs and systems. TTS has 45,508 GitHub stars and an active community.
Does TTS support GPU acceleration?
TTS's GPU support depends on your specific setup. Check the documentation for details. For the best performance, pair it with an NVIDIA RTX 4090 or 5090.
What are the best alternatives to TTS?
Popular alternatives include other tts-stt tools in our directory. Browse our full collection at /tool for comparisons, community reviews, and benchmark data to find the right fit for your workflow.
How much does TTS cost?
TTS is free-open-source. It is completely free and open source to self-host.
Pairs well with
Complementary tools, models, and hardware
Comments coming soon
Configure NEXT_PUBLIC_GISCUS_REPO_ID and NEXT_PUBLIC_GISCUS_CATEGORY_ID at giscus.app to enable.