What it does
Core capabilities at a glance
- Fastapi
- Generative UI
- GPT 4O
- Groq
- Nextjs
- Ollama
- Openai
- Perplexity
Deep dive
The full breakdown - performance, comparisons, and setup
farfalle
farfalle is a local-AI tool - ๐ AI search engine - self-host with local or cloud LLMs.
Overview
Please feel free to contact me on Twitter or create an issue if you have any questions.
-
Add support for local LLMs through Ollama - [x] Docker deployment setup - [x] Add support for searxng. Eliminates the need for external dependencies. - [x] Create a pre-built Docker Image - [x] Add support for custom LLMs through LiteLLM - [x] Chat History - [x] Expert Search - [ ] Chat with local files
-
Frontend: Next.js - Backend: FastAPI - Search API: SearXNG, Tavily, Serper, Bing - Logging: Logfire - Rate Limiting: Redis - Components: shadcn/ui
-
Search with multiple search providers (Tavily, Searxng, Serper, Bing) - Answer questions with cloud models (OpenAI/gpt4-o, OpenAI/gpt3.5-turbo, Groq/Llama3) - Answer questions with local models (llama3, mistral, gemma, phi3) - Answer questions with any custom LLMs through LiteLLM - Search with an agent that plans and executes the search for better results
Modify .env with your API keys (Optional, not required if using Ollama)
For custom setup instructions, see custom-setup-instructions.md
After the backend is deployed, copy the web service URL to your clipboard. It should look something like: https://some-service-name.onrender.com.
Use the copied backend URL in the 'NEXT_PUBLIC_API_URL' environment variable when deploying with Vercel.
farfalle is open-source, written primarily in TypeScript, with 3,531 GitHub stars under the Apache 2.0 license. It was last updated on 2024-09-27.
Key capabilities
From the project's documentation:
- Add support for local LLMs through Ollama
- Docker deployment setup
- Add support for searxng. Eliminates the need for external dependencies.
- Create a pre-built Docker Image
- Add support for custom LLMs through LiteLLM
- Chat with local files
How it fits a local-AI stack
farfalle runs on your own hardware, so pair it with a model and a GPU sized to your needs. Use the VRAM calculator to pick a model that fits your card, and see what you can run for hardware guidance. Related local-AI tools in the directory:
Sources
- Source code & docs: rashadphz/farfalle
- Official website: https://www.farfalle.dev/
Stats from GitHub, 2026-06-08.
Frequently asked
Quick answers to common questions
What is farfalle?
farfalle is a other tool for local AI workloads. ๐ AI search engine - self-host with local or cloud LLMs
Is farfalle free and open source?
Yes, farfalle has 3,531 GitHub stars and is licensed under Apache 2.0. You can self-host it for free on docker, web.
What platforms does farfalle support?
farfalle runs on docker, web.
What hardware do I need for farfalle?
The hardware requirements depend on which models you run. Check our hardware directory for compatible GPUs and systems. farfalle has 3,531 GitHub stars and an active community.
Does farfalle support GPU acceleration?
farfalle's GPU support depends on your specific setup. Check the documentation for details. For the best performance, pair it with an NVIDIA RTX 4090 or 5090.
What are the best alternatives to farfalle?
Popular alternatives include other other tools in our directory. Browse our full collection at /tool for comparisons, community reviews, and benchmark data to find the right fit for your workflow.
How much does farfalle cost?
farfalle is free-open-source. It is completely free and open source to self-host.
Pairs well with
Complementary tools, models, and hardware
Similar tools
More tools like this one
Comments coming soon
Configure NEXT_PUBLIC_GISCUS_REPO_ID and NEXT_PUBLIC_GISCUS_CATEGORY_ID at giscus.app to enable.