What it does
Core capabilities at a glance
- ASR
- Audio inference
- Local-first
- OpenAI-compatible API
- Self-hosted AI
- Speaker diarization
- Speech-to-text
- Text-to-speech
Deep dive
The full breakdown - performance, comparisons, and setup
izwi
izwi is a local-first voice AI runtime for speech (TTS/STT) workloads. It provides transcription, speaker diarization, TTS, and voice cloning behind an OpenAI-compatible API.
Overview
Izwi is a desktop app, web UI, CLI, and local inference server for voice AI. It runs on your machine and exposes both product workflows and OpenAI-compatible API routes without requiring cloud services or API keys.
- Real-time voice conversations with local ASR, chat, and TTS models.
- Text-to-speech, long-form Studio projects, voice cloning, voice design, and saved voices.
- Transcription, speaker diarization, forced alignment, and realtime speech-to-text.
- Local chat, model download/load/unload/delete, history, exports, and settings.
- OpenAI-compatible '/v1' APIs for models, chat completions, audio speech, audio transcriptions, and preview Responses support.
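As a rough illustration of the OpenAI-compatible audio speech route mentioned above, the sketch below builds a text-to-speech request with only the Python standard library. It assumes the route is '/v1/audio/speech' and takes the JSON fields the OpenAI API uses ('model', 'input', 'voice'), and that the server listens on 'http://localhost:8080'. The model and voice names are hypothetical placeholders, and the audio format of the response depends on the server. Check the local API reference before relying on any of this.

```python
import json
import urllib.request

# Assumption: the local server address; adjust if your setup differs.
BASE_URL = "http://localhost:8080"


def build_speech_request(text, model, voice, base_url=BASE_URL):
    """Build (but do not send) a POST request for the OpenAI-style speech route."""
    # Assumption: field names follow the OpenAI audio speech API.
    payload = {"model": model, "input": text, "voice": voice}
    return urllib.request.Request(
        url=f"{base_url}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize(text, model, voice, out_path):
    """Send the request to a running server and save the returned audio bytes."""
    request = build_speech_request(text, model, voice)
    with urllib.request.urlopen(request) as response:
        audio = response.read()
    with open(out_path, "wb") as handle:
        handle.write(audio)
    return out_path
```

With the server running, a call might look like `synthesize("Hello", "example-tts-model", "example-voice", "hello_audio")`, where both names are placeholders for a model and voice that exist in your local catalog.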
Inference data stays local. Optional anonymous desktop analytics are disabled unless a user explicitly opts in, and they do not send prompts, transcripts, audio payloads, local paths, or personal identifiers.
Installation:
- macOS: install the '.dmg', drag 'Izwi.app' to Applications, then launch it.
- Linux: install the '.deb' package with 'sudo dpkg -i izwi_*.deb'.
- Windows: run the '.exe' installer.
Hardware acceleration:
- macOS Apple Silicon release builds use Metal.
- Linux and Windows release builds are CPU-only.
- CUDA is supported through the Docker CUDA profile or source builds on compatible NVIDIA hosts.
Open the app at 'http://localhost:8080'. The local API reference is available at 'http://localhost:8080/docs', and the raw OpenAPI document is available at 'http://localhost:8080/openapi.json'.
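Since the server publishes its OpenAPI document, one way to see which routes a given build actually exposes is to read that document and list the paths. The sketch below is a minimal example that assumes the document follows the standard OpenAPI layout, with a top-level 'paths' object keyed by route. The helper names are illustrative and not part of izwi.

```python
import json
import urllib.request

OPENAPI_URL = "http://localhost:8080/openapi.json"

# HTTP verbs that can appear as keys of an OpenAPI path item.
HTTP_METHODS = {"get", "post", "put", "patch", "delete", "head", "options"}


def fetch_openapi(url=OPENAPI_URL):
    """Download and parse the OpenAPI document from a running local server."""
    with urllib.request.urlopen(url) as response:
        return json.load(response)


def list_routes(spec, prefix="/v1"):
    """Return sorted (METHOD, path) pairs for every path under the prefix."""
    routes = []
    for path, item in spec.get("paths", {}).items():
        if not path.startswith(prefix):
            continue
        for key in item:
            # Path items may also hold non-method keys such as "parameters".
            if key.lower() in HTTP_METHODS:
                routes.append((key.upper(), path))
    return sorted(routes)
```

With the server running, `list_routes(fetch_openapi())` gives a quick inventory of the '/v1' routes without opening the browser-based reference.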
Run 'izwi list' to see the enabled catalog of model families.
izwi is open-source, written primarily in Rust, with 338 GitHub stars under the MIT license. It was last updated on 2026-06-08.
Key capabilities
From the project's documentation:
- Real-time voice conversations with local ASR, chat, and TTS models.
- Text-to-speech, long-form Studio projects, voice cloning, voice design, and saved voices.
- Transcription, speaker diarization, forced alignment, and realtime speech-to-text.
- Local chat, model download/load/unload/delete, history, exports, and settings.
- OpenAI-compatible /v1 APIs for models, chat completions, audio speech, audio transcriptions, and preview Responses support.
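The models route in the list above can be queried like any OpenAI-style endpoint. The sketch below assumes the response uses the OpenAI list shape (a 'data' array of objects with an 'id' field) and that the server is at 'http://localhost:8080'. Both are assumptions to verify against the local API reference.

```python
import json
import urllib.request


def model_ids(models_response):
    """Extract model IDs from an OpenAI-style list response."""
    # Assumption: entries live under "data" and each carries an "id".
    return [entry["id"] for entry in models_response.get("data", []) if "id" in entry]


def fetch_model_ids(base_url="http://localhost:8080"):
    """Query a running local server and return the IDs it reports."""
    with urllib.request.urlopen(f"{base_url}/v1/models") as response:
        return model_ids(json.load(response))
```

Keeping the parsing in its own function makes it easy to test against a saved response without a server running.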
How it fits a local-AI stack
izwi runs on your own hardware, so pair it with a model and a GPU sized to your needs. Use the VRAM calculator to pick a model that fits your card, and see the hardware guidance for what you can run.
Sources
- Source code & docs: izwi-ai/izwi
- Official website: https://www.izwiai.com
Stats from GitHub, 2026-06-08.
Frequently asked
Quick answers to common questions
What is izwi?
izwi is a voice AI runtime (TTS/STT) for local AI workloads. It offers local-first transcription, speaker diarization, TTS, and voice cloning with an OpenAI-compatible API.
Is izwi free and open source?
Yes. izwi is licensed under MIT and has 338 GitHub stars. You can self-host it for free on macOS, Linux, Windows, Docker, and the web.
What platforms does izwi support?
izwi runs on macOS, Linux, Windows, Docker, and the web.
What hardware do I need for izwi?
Hardware requirements depend on which models you run and which build you use: release builds use Metal on Apple Silicon Macs and are CPU-only on Linux and Windows. Check our hardware directory for compatible GPUs and systems.
Does izwi support GPU acceleration?
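Yes, depending on the build. macOS Apple Silicon release builds use Metal, while Linux and Windows release builds are CPU-only. CUDA is available through the Docker CUDA profile or source builds on compatible NVIDIA hosts, where a high-end card such as an NVIDIA RTX 4090 or 5090 gives the most headroom.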
What are the best alternatives to izwi?
Popular alternatives include other tts-stt tools in our directory. Browse our full collection at /tool for comparisons, community reviews, and benchmark data to find the right fit for your workflow.
How much does izwi cost?
izwi is free and open source, and it costs nothing to self-host.