server-config

Archived

Author	SHA1	Message	Date
Simon Gardling	d48f27701f	xmrig-auto-pause: add hysteresis to prevent stop/start thrashing xmrig's RandomX pollutes the L3 cache, making other processes appear ~3-8% busier. With a single 5% threshold for both stopping and resuming, the script oscillates: start xmrig -> cache pressure inflates CPU -> stop xmrig -> CPU drops -> restart -> repeat. Split into CPU_STOP_THRESHOLD (15%) and CPU_RESUME_THRESHOLD (5%). The stop threshold sits above xmrig's indirect pressure, so only genuine workloads trigger a pause. The resume threshold confirms the system is truly idle before restarting.	2026-04-07 01:09:06 -04:00
Simon Gardling	7afd1f35d2	xmrig-auto-pause: fix	2026-04-06 13:11:54 -04:00
Simon Gardling	bbcd662c28	xmrig-auto-pause: fix stuck state after external restart, add startup cooldown All checks were successful Build and Deploy / deploy (push) Successful in 8m47s Two bugs found during live verification on the server: 1. Stuck state after external restart: if something else restarted xmrig (e.g. deploy-rs activation) while paused_by_us=True, the script never detected this and became permanently stuck — unable to stop xmrig on future load because it thought xmrig was already stopped. Fix: when paused_by_us=True and busy, check if xmrig is actually running. If so, reset paused_by_us=False and re-stop it. 2. Flapping on xmrig restart: RandomX dataset init takes ~3.7s of intense non-nice CPU, which the script detected as real workload and immediately re-stopped xmrig after every restart, creating a start-stop loop. Fix: add STARTUP_COOLDOWN (default 10s) — after starting xmrig, skip CPU checks until the cooldown expires. Both bugs were present in production: the script had been stuck since Apr 3 (2+ days) with xmrig running unmanaged alongside llama-server.	2026-04-05 23:20:47 -04:00
Simon Gardling	324a9123db	better organize related monero and matrix services All checks were successful Build and Deploy / deploy (push) Successful in 2m48s	2026-04-04 14:32:26 -04:00
Simon Gardling	daf82c16ba	fix xmrig pause	2026-04-03 14:39:20 -04:00

5 Commits