Getting Started

Get Sabre running in under a minute. One install, one command, real answers.

Install Sabre

Terminal

$ curl -fsSL https://getsabre.io/install.sh | bash

The installer automatically sets up Python, Ollama, and downloads the default model. No API keys or accounts required.

Launch Sabre with the default model:

Terminal

$ sabre

Use the --model flag to pick a specific model. See benchmarks for model recommendations.

Terminal

$ sabre --model qwen3.6:35b-a3b

You can also pass a query directly:

Terminal

$ sabre --model qwen3.6:35b-a3b "Scale my deployment to 5 replicas"

Ask Sabre about a real issue in your cluster:

Terminal

$ sabre --model qwen3.6:35b-a3b "Why is my pod crashlooping?"

Here are more things you can try:

Sabre follows the XDG Base Directory specification. All data stays on your machine:

Squeeze more performance out of Ollama with these environment variables:

Variable	Value	Description
`OLLAMA_MULTIUSER_CACHE`	`1`	Enables prompt caching across requests. Reduces latency for repeated prefixes.
`OLLAMA_MLX`	`1`	Uses the MLX runner on macOS for faster Apple Silicon inference.

Set these before starting Ollama:

Terminal

$ OLLAMA_MULTIUSER_CACHE=1 OLLAMA_MLX=1 ollama serve