What it does
Core capabilities at a glance
- Android
- ARM
- Edge
- Edge AI
- Framework
- IOS
- Llamacpp
- LLM Inference
Deep dive
The full breakdown - performance, comparisons, and setup
cactus
cactus is a speech (TTS/STT) tool - Low-latency AI engine for mobile devices & wearables.
Overview
[![Docs][docs-shield]][docs-url] [![Website][website-shield]][website-url] [![GitHub][github-shield]][github-url] [![HuggingFace][hf-shield]][hf-url] [![Reddit][reddit-shield]][reddit-url] [![Blog][blog-shield]][blog-url]
- Fast & accurate: fastest inference on ARM CPU, Cactus quants at 4-bit matches f16 - Low RAM: zero-copy memory mapping ensures 10x lower RAM use than other engines - Multimodal: one engine for speech, vision, and language models - Cloud fallback: automatically route requests to cloud models if needed - Model-Agnostic: Custom PyTorch models can be exported to the Cactus runtime.
- Cactus Compute, Inc. (YC S25) 2. UCLA's BruinAI 3. Char (YC S25) 4. Yale's AI Society 5. National University of Singapore's AI Society 6. UC Irvine's AI@UCI 7. Imperial College's AI Society 8. University of Pennsylvania's AI@Penn 9. University of Michigan Ann-Arbor MSAIL 10. University of Colorado Boulder's AI Club
N/B: Scroll all the way up and click the shields link for resources!
[docs-shield]: https://img.shields.io/badge/Docs-555?style=for-the-badge&logo=readthedocs&logoColor=white [docs-url]: https://cactus-compute.github.io/cactus/
[website-shield]: https://img.shields.io/badge/Website-555?style=for-the-badge&logo=safari&logoColor=white [website-url]: https://cactuscompute.com/
[github-shield]: https://img.shields.io/badge/GitHub-555?style=for-the-badge&logo=github&logoColor=white [github-url]: https://github.com/cactus-compute/cactus
[hf-shield]: https://img.shields.io/badge/HuggingFace-555?style=for-the-badge&logo=huggingface&logoColor=white [hf-url]: https://huggingface.co/Cactus-Compute
[reddit-shield]: https://img.shields.io/badge/Reddit-555?style=for-the-badge&logo=reddit&logoColor=white [reddit-url]: https://www.reddit.com/r/cactuscompute/
[blog-shield]: https://img.shields.io/badge/Blog-555?style=for-the-badge&logo=hashnode&logoColor=white [blog-url]: https://cactuscompute.com/blog
cactus is open-source, written primarily in C++, with 5,321 GitHub stars under the Other license. The latest release is v1.14 (2026-04-18).
Key capabilities
From the project's documentation:
- Fast & accurate: fastest inference on ARM CPU, Cactus quants at 4-bit matches f16
- Low RAM: zero-copy memory mapping ensures 10x lower RAM use than other engines
- Multimodal: one engine for speech, vision, and language models
- Cloud fallback: automatically route requests to cloud models if needed
- Model-Agnostic: Custom PyTorch models can be exported to the Cactus runtime.
- Step 1: brew install cactus-compute/cactus/cactus
Install
A quick way to get started (always check the official docs for the latest):
brew install cactus-compute/cactus/cactusHow it fits a local-AI stack
cactus runs on your own hardware, so pair it with a model and a GPU sized to your needs. Use the VRAM calculator to pick a model that fits your card, and see what you can run for hardware guidance. Related speech (TTS/STT) tools in the directory:
Sources
- Source code & docs: cactus-compute/cactus
- Official website: https://cactuscompute.com
Stats from GitHub, 2026-06-08.
Frequently asked
Quick answers to common questions
What is cactus?
cactus is a tts-stt tool for local AI workloads. Low-latency AI engine for mobile devices & wearables
Is cactus free and open source?
Yes, cactus has 5,321 GitHub stars and is licensed under Other. You can self-host it for free on macos.
What platforms does cactus support?
cactus runs on macos.
What hardware do I need for cactus?
The hardware requirements depend on which models you run. Check our hardware directory for compatible GPUs and systems. cactus has 5,321 GitHub stars and an active community.
Does cactus support GPU acceleration?
cactus supports GPU acceleration via CUDA, Metal, or Vulkan depending on your platform. For the best performance, pair it with an NVIDIA RTX 4090 or 5090.
What are the best alternatives to cactus?
Popular alternatives include other tts-stt tools in our directory. Browse our full collection at /tool for comparisons, community reviews, and benchmark data to find the right fit for your workflow.
How much does cactus cost?
cactus is free-open-source. It is completely free and open source to self-host.
Pairs well with
Complementary tools, models, and hardware
Comments coming soon
Configure NEXT_PUBLIC_GISCUS_REPO_ID and NEXT_PUBLIC_GISCUS_CATEGORY_ID at giscus.app to enable.