Miso One
tts-sttFeatured2,204Custom (Open Weights)

Miso One / MisoTTS

Open-weight 8B text-to-speech model with human-like emotion, one-shot voice cloning, and 110ms latency.

Updated Jun 7, 2026
Platforms
macos, linux, windows, docker
Pricing
free-open-source
Status
active
License
Custom (Open Weights)

What it does

Core capabilities at a glance

  • 8-billion parameter TTS model for expressive speech
  • One-shot voice cloning from 10-second audio sample
  • 110ms inference latency (faster than human conversation)
  • RVQ Transformer architecture (Sesame CSM-inspired)
  • Self-hostable, audio data never leaves your machine
  • Open weights available on HuggingFace
  • Mimi audio codec integration

Deep dive

The full breakdown - performance, comparisons, and setup

Miso One / MisoTTS

Miso One is an 8-billion parameter text-to-speech model from Miso Labs that sets a new standard for emotive speech synthesis. Built on an RVQ Transformer architecture inspired by Sesame's CSM, it delivers one-shot voice cloning from a 10-second sample at 110ms latency - faster than natural human conversation. The weights are open-source and self-hostable.

Why this matters

TTS has been dominated by cloud APIs (ElevenLabs, OpenAI TTS) that lock your audio data behind metered billing. Miso One is the first model at this quality tier that runs entirely on your own hardware. For developers building voice agents, accessibility tools, or content creation pipelines, this removes the privacy and cost barriers that have made high-quality TTS a recurring expense.

Performance you will see

  • Latency: 110ms end-to-end - faster than the 200ms human conversational threshold
  • Voice cloning: one-shot from 10 seconds of audio
  • Architecture: 8B RVQ Transformer, generating Mimi audio codes
  • Self-hosting: runs on a single consumer GPU (RTX 4090) at Q4

How it stacks up

FeatureMiso One (8B)Kokoro 82M (0.08B)Piper TTSElevenLabs (cloud)
Emotion★★★★★★★☆☆☆★☆☆☆☆★★★★☆
Voice cloningOne-shot 10sNoNoInstant
Latency110msReal-timeReal-time300-500ms
Self-hostedYesYesYesNo
PrivacyFullFullFullNone
Model size8B82MSmallN/A

Get started

git clone https://github.com/MisoLabsAI/MisoTTS.git
cd MisoTTS
pip install -e .
python run_misottss.py

What the community says

"Miso One is the first open TTS model that actually sounds like a human reading, not a robot. The emotion and pacing are incredible."

  • r/LocalLLaMA, 156 upvotes

When to use something else

If you need a lightweight, production-tested TTS that runs on CPU, Kokoro (82M params) or Piper are better choices. Miso One requires a GPU. If voice cloning is not needed, Kokoro's smaller footprint and broader language support may serve you better.

Frequently asked

Quick answers to common questions

What is Miso One / MisoTTS?

Miso One / MisoTTS is a tts-stt tool for local AI workloads. Open-weight 8B text-to-speech model with human-like emotion, one-shot voice cloning, and 110ms latency.

Is Miso One / MisoTTS free and open source?

Yes, Miso One / MisoTTS has 2,204 GitHub stars and is licensed under Custom (Open Weights). You can self-host it for free on macos, linux, windows, docker.

What platforms does Miso One / MisoTTS support?

Miso One / MisoTTS runs on macos, linux, windows, docker.

What hardware do I need for Miso One / MisoTTS?

The hardware requirements depend on which models you run. Check our hardware directory for compatible GPUs and systems. Miso One / MisoTTS has 2,204 GitHub stars and an active community.

Does Miso One / MisoTTS support GPU acceleration?

Miso One / MisoTTS supports GPU acceleration via CUDA, Metal, or Vulkan depending on your platform. For the best performance, pair it with an NVIDIA RTX 4090 or 5090.

What are the best alternatives to Miso One / MisoTTS?

Popular alternatives include other tts-stt tools in our directory. Browse our full collection at /tool for comparisons, community reviews, and benchmark data to find the right fit for your workflow.

How much does Miso One / MisoTTS cost?

Miso One / MisoTTS is free-open-source. It is completely free and open source to self-host.

Pairs well with

Complementary tools, models, and hardware

Comments coming soon

Configure NEXT_PUBLIC_GISCUS_REPO_ID and NEXT_PUBLIC_GISCUS_CATEGORY_ID at giscus.app to enable.