What it does
Core capabilities at a glance
- Azure
- ChatGPT
- Claude
- Edge TTS
- ElevenLabs
- GPT
- LLM Webui
- LLMs
Deep dive
The full breakdown - performance, comparisons, and setup
openai-edge-tts
openai-edge-tts is a speech (TTS/STT) tool: a free, high-quality text-to-speech API endpoint that can replace OpenAI, Azure, or ElevenLabs.
Overview
This project provides a local, OpenAI-compatible text-to-speech (TTS) API using 'edge-tts'. It emulates the OpenAI TTS endpoint ('/v1/audio/speech'), enabling users to generate speech from text with various voice options and playback speeds, just like the OpenAI API.
'edge-tts' uses Microsoft Edge's online text-to-speech service, so it is completely free.
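As a rough illustration of what "OpenAI-compatible" means in practice, the sketch below builds a request for '/v1/audio/speech' using only the Python standard library. The field names ('model', 'input', 'voice', 'response_format', 'speed') follow the OpenAI speech API. The base URL, port, and API key are assumptions that do not come from this page, so replace them with the values from your own deployment.

```python
import json
import urllib.request

# Assumptions (not stated on this page): where the server listens and what
# bearer token it expects. Change both to match your own setup.
BASE_URL = "http://localhost:5050"
API_KEY = "your_api_key_here"


def build_speech_request(text, voice="alloy", response_format="mp3", speed=1.0):
    """Build (but do not send) a POST request for /v1/audio/speech."""
    payload = {
        "model": "tts-1",  # OpenAI-style field; the local server may ignore it
        "input": text,
        "voice": voice,
        "response_format": response_format,
        "speed": speed,
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {API_KEY}",
        },
        method="POST",
    )


def synthesize_to_file(text, out_path="speech.mp3"):
    """Send the request and write the returned audio bytes to out_path."""
    with urllib.request.urlopen(build_speech_request(text)) as response:
        audio = response.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return out_path
```

Calling synthesize_to_file("Hello from edge-tts") with the server running should leave an mp3 file in the working directory. Because the request shape matches OpenAI's, an existing OpenAI client can usually be pointed at the local server by changing only its base URL.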
Features
- OpenAI-Compatible Endpoint: '/v1/audio/speech' with similar request structure and behavior.
- SSE Streaming Support: Real-time audio streaming via Server-Sent Events when 'stream_format: "sse"' is specified.
- Supported Voices: Maps OpenAI voices (alloy, echo, fable, onyx, nova, shimmer) to 'edge-tts' equivalents.
- Flexible Formats: Supports multiple audio formats (mp3, opus, aac, flac, wav, pcm).
- Adjustable Speed: Option to modify playback speed (0.25x to 4.0x).
- Optional Direct Edge-TTS Voice Selection: Use either OpenAI voice mappings or specify any edge-tts voice directly.
Prerequisites
- Docker (recommended): Docker and Docker Compose for containerized setup.
- Python (optional): For local development, install dependencies in 'requirements.txt'.
- ffmpeg (optional): Required for audio format conversion. Optional if sticking to mp3.
- Environment Variables: Create a '.env' file in the root directory with the following variables:
Pass '-d' to run Docker Compose in "detached mode": it runs in the background and frees up your terminal.
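The documented limits (six audio formats, speed between 0.25x and 4.0x, six OpenAI voice names) can be checked on the client before a request is sent. The helper below is a minimal sketch of such a check. Its function names are hypothetical and it is not part of the project; the server performs its own validation.

```python
# Values copied from the feature list on this page.
OPENAI_VOICES = {"alloy", "echo", "fable", "onyx", "nova", "shimmer"}
AUDIO_FORMATS = {"mp3", "opus", "aac", "flac", "wav", "pcm"}
MIN_SPEED, MAX_SPEED = 0.25, 4.0


def check_speech_params(response_format="mp3", speed=1.0):
    """Return a list of problems; an empty list means the values look valid."""
    problems = []
    if response_format not in AUDIO_FORMATS:
        problems.append(f"unsupported format: {response_format}")
    if not MIN_SPEED <= speed <= MAX_SPEED:
        problems.append(f"speed {speed} outside {MIN_SPEED}-{MAX_SPEED}")
    return problems


def is_openai_voice(voice):
    """True if the name is one of the OpenAI voices mapped to edge-tts.

    A False result is not an error: the project also accepts edge-tts voice
    names directly, and this sketch cannot verify those offline.
    """
    return voice in OPENAI_VOICES
```

Returning a list of problems rather than raising on the first one lets a caller report every invalid field at once.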
openai-edge-tts is open-source, written primarily in Python, with 1,918 GitHub stars under the GPL-3.0 license. The latest release is v2.0.0 (2024-12-28).
Key capabilities
From the project's documentation:
- OpenAI-Compatible Endpoint: /v1/audio/speech with similar request structure and behavior.
- SSE Streaming Support: Real-time audio streaming via Server-Sent Events when stream_format: "sse" is specified.
- Supported Voices: Maps OpenAI voices (alloy, echo, fable, onyx, nova, shimmer) to edge-tts equivalents.
- Flexible Formats: Supports multiple audio formats (mp3, opus, aac, flac, wav, pcm).
- Adjustable Speed: Option to modify playback speed (0.25x to 4.0x).
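For the SSE streaming mode, a client has to reassemble the audio from a sequence of events. The sketch below assumes each event arrives as a single 'data:' line holding JSON shaped like OpenAI's speech streaming events: a "speech.audio.delta" event with a base64 "audio" field, followed by "speech.audio.done". That event shape is an assumption about this server's output, so confirm it against the project's documentation before relying on it.

```python
import base64
import json


def collect_audio_from_sse(lines):
    """Join base64 audio chunks from an iterable of SSE text lines."""
    audio = bytearray()
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators, comments, and other SSE fields
        event = json.loads(line[len("data:"):].strip())
        event_type = event.get("type")
        if event_type == "speech.audio.delta":
            # Assumed field name: "audio" carries one base64-encoded chunk.
            audio.extend(base64.b64decode(event["audio"]))
        elif event_type == "speech.audio.done":
            break  # the stream is complete; ignore anything after it
    return bytes(audio)
```

The function accepts any iterable of decoded text lines, for example the lines of an HTTP response body read with 'stream_format' set to "sse". It does not handle events whose data spans several 'data:' lines.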
How it fits a local-AI stack
openai-edge-tts runs as a server on your own machine, but the speech itself is generated by Microsoft Edge's online text-to-speech service, so the TTS step should not need a GPU. GPU sizing matters for the language model you pair it with: use the VRAM calculator to pick a model that fits your card, and see what you can run for hardware guidance.
Sources
- Source code & docs: travisvn/openai-edge-tts
- Official website: https://tts.travisvn.com
Stats from GitHub, 2026-06-08.
Frequently asked
Quick answers to common questions
What is openai-edge-tts?
openai-edge-tts is a speech (TTS/STT) tool for local AI workloads. It provides a free, high-quality text-to-speech API endpoint that can replace OpenAI, Azure, or ElevenLabs.
Is openai-edge-tts free and open source?
Yes. openai-edge-tts is open source under the GPL-3.0 license and has 1,918 GitHub stars. You can self-host it for free with Docker.
What platforms does openai-edge-tts support?
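openai-edge-tts runs in Docker, which is the recommended setup. For local development it can also be run with Python by installing the dependencies in 'requirements.txt'.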
What hardware do I need for openai-edge-tts?
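Speech synthesis is handled by Microsoft Edge's online text-to-speech service, so openai-edge-tts itself mainly needs a machine that can run Docker (or Python) and reach the internet. Hardware requirements for the rest of your stack depend on which models you run. Check our hardware directory for compatible GPUs and systems.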
Does openai-edge-tts support GPU acceleration?
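openai-edge-tts relies on Microsoft Edge's online text-to-speech service rather than a locally run model, so a GPU should not be needed for the speech step itself. Check the documentation for details. A card such as an NVIDIA RTX 4090 or 5090 matters for the language models you pair it with, not for the TTS endpoint.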
What are the best alternatives to openai-edge-tts?
Alternatives include the other speech (TTS/STT) tools in our directory. Browse our full collection at /tool for comparisons, community reviews, and benchmark data to find the right fit for your workflow.
How much does openai-edge-tts cost?
Nothing. openai-edge-tts is free and open source, and it is free to self-host.