This repository was archived on 2026-04-18. You can view files and clone it, but you cannot open issues or pull requests, or push commits.
server-config/services
Simon Gardling 0235617627 monitoring: fix intel-gpu-collector crash resilience
Wrap entire read_one_sample() in try/except to handle all failures
(missing binary, permission errors, malformed JSON, timeouts).
Write zero-valued metrics on failure instead of exiting non-zero.
Increase timeout from 5s to 8s for slower GPU initialization.
2026-04-02 17:43:13 -04:00
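
The commit message above describes the fix but the collector source is not shown on this page. The following is a minimal sketch of the pattern it describes, not the repository's actual code. Only the name read_one_sample() and the 8s timeout come from the commit message. The metric names, the `command` parameter, the `zero_metrics()` and `format_metrics()` helpers, the plain "name value" output format, and the assumption that the sampled command prints a single JSON object and exits are all hypothetical.

```python
import json
import subprocess
import sys

# Hypothetical metric names; the real collector's names are not shown in the source.
METRIC_NAMES = ("intel_gpu_render_busy_percent", "intel_gpu_video_busy_percent")

# The commit raises the timeout from 5s to 8s for slower GPU initialization.
TIMEOUT_SECONDS = 8


def zero_metrics():
    """Return every metric set to zero (the fallback used on failure)."""
    return {name: 0.0 for name in METRIC_NAMES}


def read_one_sample(command, timeout=TIMEOUT_SECONDS):
    """Run `command`, parse its stdout as one JSON object, and return metrics.

    Assumption: the command prints a flat JSON object keyed by metric name.
    The whole body sits inside one try/except, so a missing binary,
    a permission error, malformed JSON, a non-zero exit, or a timeout
    all fall back to zero-valued metrics instead of crashing.
    """
    try:
        result = subprocess.run(
            command,
            capture_output=True,
            text=True,
            timeout=timeout,
            check=True,
        )
        data = json.loads(result.stdout)
        return {name: float(data[name]) for name in METRIC_NAMES}
    except Exception as exc:
        # Log to stderr and keep going; the caller still gets a full metric set.
        print(f"sample failed: {exc!r}", file=sys.stderr)
        return zero_metrics()


def format_metrics(metrics):
    """Render metrics as simple 'name value' lines (hypothetical format)."""
    return "".join(f"{name} {value}\n" for name, value in metrics.items())


if __name__ == "__main__":
    # Placeholder command; substitute whatever the real collector invokes.
    sample = read_one_sample(["/path/to/gpu-sampler"])
    sys.stdout.write(format_metrics(sample))
    sys.exit(0)  # always exit zero, even when the sample failed
```

Catching the broad Exception is deliberate in this sketch: the point of the fix is that no single failure mode should take the collector down, and emitting zeros keeps the metric series present for whatever scrapes the output.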