Minimal Docker Compose setup for running Ollama locally — exposes the API on :11434 with a named volume so pulled models persist across restarts.
:11434
The "optionally with a web UI" phrase from the description is not yet wired. Open Web UI sidecar can be added — see Roadmap.
docker compose up -d
Ollama API → http://localhost:11434.
http://localhost:11434
Pull a model:
docker compose exec ollama ollama pull llama3.2
Chat from CLI:
docker compose exec ollama ollama run llama3.2
Smoke test via API:
curl http://localhost:11434/api/generate -d '{ "model": "llama3.2", "prompt": "Why is the sky blue?", "stream": false }'
ollama/ollama
11434:11434
ollama:/root/.ollama
unless-stopped
To enable GPU, add deploy.resources.reservations.devices for the NVIDIA runtime — see the Ollama Docker docs. CPU-only by default.
deploy.resources.reservations.devices
ghcr.io/open-webui/open-webui:main
:3000
Apache-2.0