other · 3,531 stars · Apache 2.0

farfalle

🔍 AI search engine - self-host with local or cloud LLMs

Updated Jun 8, 2026
Platforms: Docker, web
Pricing: free, open source
Status: active
License: Apache 2.0

What it does

Core capabilities at a glance

  • FastAPI
  • Generative UI
  • GPT-4o
  • Groq
  • Next.js
  • Ollama
  • OpenAI
  • Perplexity

Deep dive

The full breakdown - performance, comparisons, and setup

farfalle

farfalle is a local-AI tool: an AI search engine that you can self-host with local or cloud LLMs.

Overview

Please feel free to contact me on Twitter or create an issue if you have any questions.

Roadmap:

  • [x] Add support for local LLMs through Ollama
  • [x] Docker deployment setup
  • [x] Add support for SearXNG, which eliminates the need for external dependencies
  • [x] Create a pre-built Docker image
  • [x] Add support for custom LLMs through LiteLLM
  • [x] Chat history
  • [x] Expert search
  • [ ] Chat with local files

Tech stack:

  • Frontend: Next.js
  • Backend: FastAPI
  • Search API: SearXNG, Tavily, Serper, Bing
  • Logging: Logfire
  • Rate limiting: Redis
  • Components: shadcn/ui

Features:

  • Search with multiple search providers (Tavily, SearXNG, Serper, Bing)
  • Answer questions with cloud models (OpenAI/gpt-4o, OpenAI/gpt-3.5-turbo, Groq/Llama3)
  • Answer questions with local models (llama3, mistral, gemma, phi3)
  • Answer questions with any custom LLM through LiteLLM
  • Search with an agent that plans and executes the search for better results

Modify .env with your API keys (optional; not required if you use Ollama).
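As a rough sketch of the "keys are optional" rule, the check below reports whether any cloud key is set and otherwise points to Ollama. The variable names OPENAI_API_KEY and GROQ_API_KEY are assumptions made for illustration; take the real names from the project's own .env template.

```python
import os

# Assumed variable names, for illustration only; the project's .env
# template is the authority on what the backend actually reads.
CLOUD_KEYS = ("OPENAI_API_KEY", "GROQ_API_KEY")


def missing_cloud_keys(env):
    """Return the assumed cloud-provider keys that are unset or blank in env."""
    return [name for name in CLOUD_KEYS if not env.get(name, "").strip()]


def can_use_cloud_models(env):
    """True if at least one assumed cloud key is set; otherwise rely on Ollama."""
    return len(missing_cloud_keys(env)) < len(CLOUD_KEYS)


if __name__ == "__main__":
    if can_use_cloud_models(os.environ):
        print("Cloud key found; cloud models can be used.")
    else:
        print("No cloud keys set; use local models through Ollama.")
```

Both functions take any mapping, so they can be run against os.environ or against a dictionary parsed from a .env file.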

For custom setup instructions, see custom-setup-instructions.md

After the backend is deployed, copy the web service URL to your clipboard. It should look something like: https://some-service-name.onrender.com.

Use the copied backend URL in the 'NEXT_PUBLIC_API_URL' environment variable when deploying with Vercel.
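The copy-and-paste step above can be sketched as a small helper that turns the copied backend URL into the line to set in the frontend environment. The function name frontend_env_line is hypothetical, and stripping the trailing slash is an assumption about how the frontend joins paths, not something the project documents here.

```python
from urllib.parse import urlparse


def frontend_env_line(backend_url):
    """Build the NEXT_PUBLIC_API_URL line from a copied backend URL."""
    # Assumption: a trailing slash is unwanted, so it is stripped.
    url = backend_url.strip().rstrip("/")
    parsed = urlparse(url)
    # Reject values pasted without a scheme or host.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an absolute http(s) URL: {backend_url!r}")
    return f"NEXT_PUBLIC_API_URL={url}"


print(frontend_env_line("https://some-service-name.onrender.com/"))
```

Validating the value before deploying catches the common mistake of pasting the host name without https://.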

farfalle is open source, written primarily in TypeScript, and released under the Apache 2.0 license. It has 3,531 GitHub stars. The repository was last updated on 2024-09-27.

Key capabilities

From the project's documentation:

  • Local LLMs through Ollama
  • Docker deployment setup
  • SearXNG support, which eliminates the need for external dependencies
  • Pre-built Docker image
  • Custom LLMs through LiteLLM
  • Chat with local files (planned, not yet implemented)

How it fits a local-AI stack

farfalle runs on your own hardware, so pair it with a model and a GPU sized to your needs. Use the VRAM calculator to pick a model that fits your card, and see what you can run for hardware guidance.

Sources

Stats from GitHub, 2026-06-08.

Frequently asked

Quick answers to common questions

What is farfalle?

farfalle is an AI search engine that you can self-host with local or cloud LLMs. The directory lists it under the "other" category.

Is farfalle free and open source?

Yes. farfalle is licensed under Apache 2.0 and has 3,531 GitHub stars. You can self-host it for free on the listed platforms, Docker and web.

What platforms does farfalle support?

farfalle runs on Docker and the web.

What hardware do I need for farfalle?

The hardware requirements depend on which models you run. Check our hardware directory for compatible GPUs and systems. farfalle has 3,531 GitHub stars and an active community.

Does farfalle support GPU acceleration?

farfalle's GPU support depends on your specific setup. Check the documentation for details. For the best performance, pair it with an NVIDIA RTX 4090 or 5090.

What are the best alternatives to farfalle?

Alternatives include the other tools in the "other" category of our directory. Browse our full collection at /tool for comparisons, community reviews, and benchmark data to find the right fit for your workflow.

How much does farfalle cost?

farfalle is free and open source, and it is free to self-host.
