What it does

Core capabilities at a glance

ASR
Audio Inference
Local First
Openai Compatible API
Self Hosted AI
Speaker Diarization
Speech TO Text
Text TO Speech

Deep dive

The full breakdown - performance, comparisons, and setup

izwi

izwi is a speech (TTS/STT) tool - Voice AI runtime. Local first transcription, speaker diarization, TTS, and voice cloning with an OpenAI compatible API.

Overview

Izwi is a desktop app, web UI, CLI, and local inference server for voice AI. It runs on your machine and exposes both product workflows and OpenAI-compatible API routes without requiring cloud services or API keys.

Real-time voice conversations with local ASR, chat, and TTS models. - Text-to-speech, long-form Studio projects, voice cloning, voice design, and saved voices. - Transcription, speaker diarization, forced alignment, and realtime speech-to-text. - Local chat, model download/load/unload/delete, history, exports, and settings. - OpenAI-compatible '/v1' APIs for models, chat completions, audio speech, audio transcriptions, and preview Responses support.

Inference data stays local. Optional anonymous desktop analytics are disabled unless a user explicitly opts in, and they do not send prompts, transcripts, audio payloads, local paths, or personal identifiers.

macOS: install the '.dmg', drag 'Izwi.app' to Applications, then launch it. - Linux: install the '.deb' package with 'sudo dpkg -i izwi_*.deb'. - Windows: run the '.exe' installer.
macOS Apple Silicon release builds use Metal. - Linux and Windows release builds are CPU-only. - CUDA is supported through the Docker CUDA profile or source builds on compatible NVIDIA hosts.

Open the app at 'http://localhost:8080'. The local API reference is available at 'http://localhost:8080/docs', and the raw OpenAPI document is available at 'http://localhost:8080/openapi.json'.

Run 'izwi list' to see the enabled catalog. Current families include:

izwi is open-source, written primarily in Rust, with 338 GitHub stars under the MIT license. It was last updated on 2026-06-08.

Key capabilities

From the project's documentation:

Real-time voice conversations with local ASR, chat, and TTS models.
Text-to-speech, long-form Studio projects, voice cloning, voice design, and
Transcription, speaker diarization, forced alignment, and realtime speech-to-text.
Local chat, model download/load/unload/delete, history, exports, and settings.
OpenAI-compatible /v1 APIs for models, chat completions, audio speech,
macOS: install the .dmg, drag Izwi.app to Applications, then launch

How it fits a local-AI stack

izwi runs on your own hardware, so pair it with a model and a GPU sized to your needs. Use the VRAM calculator to pick a model that fits your card, and see what you can run for hardware guidance. Related speech (TTS/STT) tools in the directory:

Sources

Source code & docs: izwi-ai/izwi
Official website: https://www.izwiai.com

Stats from GitHub, 2026-06-08.

Frequently asked

Quick answers to common questions

What is izwi?

izwi is a tts-stt tool for local AI workloads. Voice AI runtime. Local first transcription, speaker diarization, TTS, and voice cloning with an OpenAI compatible API.

Is izwi free and open source?

Yes, izwi has 351 GitHub stars and is licensed under MIT. You can self-host it for free on macos, linux, windows, docker, web.

What platforms does izwi support?

izwi runs on macos, linux, windows, docker, web.

What hardware do I need for izwi?

The hardware requirements depend on which models you run. Check our hardware directory for compatible GPUs and systems. izwi has 351 GitHub stars and an active community.

Does izwi support GPU acceleration?

izwi supports GPU acceleration via CUDA, Metal, or Vulkan depending on your platform. For the best performance, pair it with an NVIDIA RTX 4090 or 5090.

What are the best alternatives to izwi?

Popular alternatives include other tts-stt tools in our directory. Browse our full collection at /tool for comparisons, community reviews, and benchmark data to find the right fit for your workflow.

How much does izwi cost?

izwi is free-open-source. It is completely free and open source to self-host.

Pairs well with

Complementary tools, models, and hardware

izwi

What it does

Deep dive

izwi

Overview

Key capabilities

How it fits a local-AI stack

Sources

Frequently asked

What is izwi?

Is izwi free and open source?

What platforms does izwi support?

What hardware do I need for izwi?

Does izwi support GPU acceleration?

What are the best alternatives to izwi?

How much does izwi cost?

Pairs well with

Tools

Models

Hardware