Files
nixos/services
Simon Gardling 9baeaa5c23 llama-cpp: add grafana annotations for inference requests
Poll /slots endpoint, create annotations when slots start processing,
close with token count when complete. Includes NixOS VM test with
mock llama-cpp and grafana servers. Dashboard annotation entry added.
2026-04-02 17:43:49 -04:00
..
2026-04-01 20:37:18 -04:00
2026-03-21 12:13:53 -04:00
2026-03-21 13:28:18 -04:00
2026-03-21 12:13:53 -04:00
2026-03-21 12:13:53 -04:00
2026-03-31 17:25:06 -04:00
2026-03-31 17:25:06 -04:00
2026-03-21 12:13:53 -04:00
2026-03-21 12:13:53 -04:00
2026-04-02 16:09:17 -04:00
2026-03-30 13:05:22 -04:00
2026-03-21 12:13:53 -04:00
2026-03-03 14:31:36 -05:00
2026-03-21 12:13:53 -04:00
2026-04-01 15:25:40 -04:00
2026-03-21 12:13:53 -04:00
2026-03-21 12:13:53 -04:00
2026-03-03 14:29:12 -05:00
2026-03-03 19:39:10 -05:00
2026-03-21 12:13:53 -04:00