server-config/services/llama-cpp-xmrig-pause.py at e41f869843f3234b7870b30d2a56b30cb8337ac0

Archived

This repository has been archived on 2026-04-18. You can view files and clone it. You cannot open issues or pull requests or push a commit.

Files

Simon Gardling df15be01ea llama-cpp: pause xmrig during active inference requests

Add sidecar service that polls llama-cpp /slots endpoint every 3s.
When any slot is processing, stops xmrig. Restarts xmrig after 10s
grace period when all slots are idle. Handles unreachable llama-cpp
gracefully (leaves xmrig untouched).

2026-04-02 17:43:07 -04:00

2.6 KiB

Raw Blame History

View Raw

2.6 KiB Raw Blame History

2.6 KiB

Raw Blame History