Short description: Locust is a load-testing tool. You describe simulated users in a small Python file, then watch how your service behaves as concurrency rises. Purpose and how people use it: People use Locust to find where a service slows down or breaks under load. For an AI stack it answers a specific question:…
Category: AI/ML & vGPU
08 Dify Quick Setup Guide
Short description: Dify is an all-in-one platform to build, run, and publish LLM apps. It covers chatbots, agents, visual workflows, and RAG knowledge bases, then lets you publish any of them as a web app or an API. Purpose and how people use it: People use Dify to go from idea to a…
07 Langfuse Quick Setup Guide
Short description: Langfuse is observability and tracing for LLM applications. It records every call, prompt, response, latency, and cost so you can see what your stack is actually doing. Purpose and how people use it: People use Langfuse to debug and improve LLM apps. When an agent gives a strange answer, the trace shows the…
06 LiteLLM Quick Setup Guide
Short description: LiteLLM is a gateway that puts every model, local or cloud, behind one OpenAI-compatible API. It adds virtual keys, usage logging, and a control plane UI. Purpose and how people use it: People use LiteLLM to stop juggling different SDKs and keys for different providers. Every app points at LiteLLM, and LiteLLM…
05 Langflow Quick Setup Guide
Short description: Langflow is a visual flow builder for LLM pipelines and agents, similar in spirit to Flowise but with its own node set and feel. Purpose and how people use it: People use Langflow to design RAG flows, agents, and prompt pipelines visually, test them in a built-in playground, and export them as…
04 Flowise Quick Setup Guide
Short description: Flowise is a visual drag-and-drop builder for LLM apps and agents. You assemble chains, agents, and RAG pipelines as boxes on a canvas. Purpose and how people use it: People use Flowise to prototype chatbots, RAG assistants, and tool-using agents quickly, then expose them as an API or an embed…
03 n8n Quick Setup Guide
Short description: n8n is a workflow automation platform with strong AI building blocks. You wire nodes together on a canvas to automate tasks and build agents. Purpose and how people use it: People use n8n to connect apps and APIs and to build AI agents and pipelines without writing much code. Typical uses include automated…
02 Open WebUI Quick Setup Guide
Short description: Open WebUI is a polished, locally hosted chat interface for your own models. It looks and feels like ChatGPT but runs entirely on your machine. Purpose and how people use it: It is the front door to a local AI stack. People use it for everyday chat with local models, for document chat…
Part 0a: Setting Up Windows (WSL, Ubuntu, and Docker)
Who this is for: This guide gets a Windows machine ready to run the rest of the stack. You will install WSL (a way to run real Linux inside Windows), the Ubuntu Linux system, and Docker Desktop. After this you can follow any tool guide in the series. You only do this once. Step 1:…
Build a Local AI Stack: Series Index
This is a set of quick setup guides for running a full local AI tooling stack on your own hardware. Every tool runs on your machine, talks to your own models, and costs nothing per request. The guides are written to be posted one at a time, so each one stands on its own. Before…