What it does

Core capabilities at a glance

Fastapi
Generative UI
GPT 4O
Groq
Nextjs
Ollama
Openai
Perplexity

Deep dive

The full breakdown - performance, comparisons, and setup

farfalle

farfalle is a local-AI tool - 🔍 AI search engine - self-host with local or cloud LLMs.

Overview

Please feel free to contact me on Twitter or create an issue if you have any questions.

Add support for local LLMs through Ollama - [x] Docker deployment setup - [x] Add support for searxng. Eliminates the need for external dependencies. - [x] Create a pre-built Docker Image - [x] Add support for custom LLMs through LiteLLM - [x] Chat History - [x] Expert Search - [ ] Chat with local files
Frontend: Next.js - Backend: FastAPI - Search API: SearXNG, Tavily, Serper, Bing - Logging: Logfire - Rate Limiting: Redis - Components: shadcn/ui
Search with multiple search providers (Tavily, Searxng, Serper, Bing) - Answer questions with cloud models (OpenAI/gpt4-o, OpenAI/gpt3.5-turbo, Groq/Llama3) - Answer questions with local models (llama3, mistral, gemma, phi3) - Answer questions with any custom LLMs through LiteLLM - Search with an agent that plans and executes the search for better results

Modify .env with your API keys (Optional, not required if using Ollama)

For custom setup instructions, see custom-setup-instructions.md

After the backend is deployed, copy the web service URL to your clipboard. It should look something like: https://some-service-name.onrender.com.

Use the copied backend URL in the 'NEXT_PUBLIC_API_URL' environment variable when deploying with Vercel.

farfalle is open-source, written primarily in TypeScript, with 3,531 GitHub stars under the Apache 2.0 license. It was last updated on 2024-09-27.

Key capabilities

From the project's documentation:

Add support for local LLMs through Ollama
Docker deployment setup
Add support for searxng. Eliminates the need for external dependencies.
Create a pre-built Docker Image
Add support for custom LLMs through LiteLLM
Chat with local files

How it fits a local-AI stack

farfalle runs on your own hardware, so pair it with a model and a GPU sized to your needs. Use the VRAM calculator to pick a model that fits your card, and see what you can run for hardware guidance. Related local-AI tools in the directory:

Sources

Source code & docs: rashadphz/farfalle
Official website: https://www.farfalle.dev/

Stats from GitHub, 2026-06-08.

Frequently asked

Quick answers to common questions

What is farfalle?

farfalle is a other tool for local AI workloads. 🔍 AI search engine - self-host with local or cloud LLMs

Is farfalle free and open source?

Yes, farfalle has 3,532 GitHub stars and is licensed under Apache 2.0. You can self-host it for free on docker, web.

What platforms does farfalle support?

farfalle runs on docker, web.

What hardware do I need for farfalle?

The hardware requirements depend on which models you run. Check our hardware directory for compatible GPUs and systems. farfalle has 3,532 GitHub stars and an active community.

Does farfalle support GPU acceleration?

farfalle's GPU support depends on your specific setup. Check the documentation for details. For the best performance, pair it with an NVIDIA RTX 4090 or 5090.

What are the best alternatives to farfalle?

Popular alternatives include other other tools in our directory. Browse our full collection at /tool for comparisons, community reviews, and benchmark data to find the right fit for your workflow.

How much does farfalle cost?

farfalle is free-open-source. It is completely free and open source to self-host.

Pairs well with

Complementary tools, models, and hardware

Similar tools

More tools like this one

algernon - Same category: other blinko - Same category: other Continue - Same category: other exo - Same category: other SwiftLM - inference-server NextChat - chat-ui

farfalle

What it does

Deep dive

farfalle

Overview

Key capabilities

How it fits a local-AI stack

Sources

Frequently asked

What is farfalle?

Is farfalle free and open source?

What platforms does farfalle support?

What hardware do I need for farfalle?

Does farfalle support GPU acceleration?

What are the best alternatives to farfalle?

How much does farfalle cost?

Pairs well with

Tools

Models

Hardware

Similar tools