unhosted-core
RustView on GitHub →The daemon. Wraps llama.cpp's llama-server on each node and presents a single OpenAI-compatible inference endpoint across the whole cluster.
What it does: mDNS peer discovery on the LAN, one-click pairing over the internet for trusted nodes, round-robin routing across local + peers with hot-reload, model management, VRAM-pooling via RPC-enabled llama.cpp, public-mode policy + signed-receipt verification.
Why it lives in its own crate: the daemon's release cadence is hardware-bound (Apple silicon updates, CUDA driver versions, ROCm releases). Compliance and rails work for the public mode happens on a different clock — that's unhosted-payments.