nixos

Author	SHA1	Message	Date
Simon Gardling	479ec43b8f	llama-cpp: integrate native prometheus /metrics endpoint llama.cpp server has a built-in /metrics endpoint exposing prompt_tokens_seconds, predicted_tokens_seconds, tokens_predicted_total, n_decode_total, and n_busy_slots_per_decode. Enable it with --metrics and add a Prometheus scrape target, replacing the need for any external metric collection for LLM inference monitoring.	2026-04-03 15:19:11 -04:00
Simon Gardling	37ac88fc0f	lib: replace deprecated overrideDerivation with overrideAttrs overrideDerivation has been deprecated since 2019. The new overrideAttrs properly handles the env attribute set used by modern derivations to avoid the NIX_CFLAGS_COMPILE overlap error between env and top-level derivation arguments.	2026-04-03 15:18:22 -04:00
Simon Gardling	47aeb58f7a	llama-cpp: do logging	2026-04-03 14:39:46 -04:00
Simon Gardling	daf82c16ba	fix xmrig pause	2026-04-03 14:39:20 -04:00
Simon Gardling	d4d01d63f1	llama-cpp: update + re-enable + gemma 4 E4B	2026-04-03 14:06:35 -04:00
Simon Gardling	e765a98487	recyclarr: reset back to default basically	2026-04-03 13:45:26 -04:00
Simon Gardling	124d33963e	organize	2026-04-03 00:47:12 -04:00
Simon Gardling	1451f902ad	grafana: re-organize	2026-04-03 00:39:42 -04:00
Simon Gardling	8e6619097d	update	2026-04-03 00:20:13 -04:00
Simon Gardling	c2ff07b329	llama-cpp: disable	2026-04-03 00:17:38 -04:00
Simon Gardling	9e235abf48	monitoring: fix disk-usage-collector timer calendar spec	2026-04-03 00:17:21 -04:00
Simon Gardling	096ffeb943	llama-cpp: xmrig + grafana hooks	2026-04-03 00:17:17 -04:00
Simon Gardling	ab9c12cb97	llama-cpp: general changes	2026-04-03 00:17:14 -04:00
Simon Gardling	5e9e6bcd40	pi: fix llama.cpp provider discovery with auth Add api, authHeader, and discovery.type fields so omp can discover models via GET /v1/models with the Bearer token.	2026-04-02 18:14:09 -04:00
Simon Gardling	0aeb6c5523	llama-cpp: add API key auth via --api-key-file Generate and encrypt a Bearer token for llama-cpp's built-in auth. Remove caddy_auth from the vhost since basic auth blocks Bearer-only clients. Internal sidecars (xmrig-pause, annotations) connect directly to localhost and are unaffected (/slots is public).	2026-04-02 18:02:23 -04:00
Simon Gardling	bfe7a65db2	monitoring: add zpool and boot partition usage metrics Add textfile collector for ZFS pool utilization (tank, hdds) and boot drive partitions (/boot, /persistent, /nix). Runs every 60s. Add two Grafana dashboard panels: ZFS Pool Utilization and Boot Drive Partitions as Row 5.	2026-04-02 18:02:23 -04:00
Simon Gardling	3e35fea183	pi: fix openrouter apiKey, add llama.cpp provider openrouter was broken: !cat + nix store path is not valid omp config. Use builtins.readFile to inline the key at eval time. Add self-hosted llama.cpp provider at llm.sigkill.computer with Bearer token auth.	2026-04-02 17:57:51 -04:00
Simon Gardling	e41f869843	trilium: add self-hosted note-taking service Add trilium-server on port 8787 behind Caddy reverse proxy at notes.sigkill.computer. Data stored on ZFS tank pool with serviceMountWithZpool for mount ordering.	2026-04-02 17:44:04 -04:00
Simon Gardling	9baeaa5c23	llama-cpp: add grafana annotations for inference requests Poll /slots endpoint, create annotations when slots start processing, close with token count when complete. Includes NixOS VM test with mock llama-cpp and grafana servers. Dashboard annotation entry added.	2026-04-02 17:43:49 -04:00
Simon Gardling	0235617627	monitoring: fix intel-gpu-collector crash resilience Wrap entire read_one_sample() in try/except to handle all failures (missing binary, permission errors, malformed JSON, timeouts). Write zero-valued metrics on failure instead of exiting non-zero. Increase timeout from 5s to 8s for slower GPU initialization.	2026-04-02 17:43:13 -04:00
Simon Gardling	df15be01ea	llama-cpp: pause xmrig during active inference requests Add sidecar service that polls llama-cpp /slots endpoint every 3s. When any slot is processing, stops xmrig. Restarts xmrig after 10s grace period when all slots are idle. Handles unreachable llama-cpp gracefully (leaves xmrig untouched).	2026-04-02 17:43:07 -04:00
Simon Gardling	50453cf0b5	llama-cpp: adjust args	2026-04-02 16:09:17 -04:00
Simon Gardling	bb6ea2f1d5	llama-cpp: cpu only	2026-04-02 15:32:39 -04:00
Simon Gardling	f342521d46	llama-cpp: re-add w/ turboquant	2026-04-02 13:42:39 -04:00
Simon Gardling	7e779ca0f7	power optimizations	2026-04-02 13:13:38 -04:00
Simon Gardling	9a3ac53c50	mreow: power stuff	2026-04-02 13:06:59 -04:00
Simon Gardling	84bb728633	update	2026-04-02 12:53:25 -04:00
Simon Gardling	3768e032ba	update	2026-04-02 00:07:24 -04:00
Simon Gardling	06b2016bd6	recyclarr: things	2026-04-01 20:37:18 -04:00
Simon Gardling	f9694ae033	qbt: fix categories	2026-04-01 15:25:40 -04:00
Simon Gardling	f775f22dbf	recyclarr: restart service after config change	2026-04-01 15:25:31 -04:00
Simon Gardling	07a808271d	Move from opencode to oh-my-pi	2026-04-01 14:33:44 -04:00
Simon Gardling	302bb599db	update	2026-04-01 13:25:53 -04:00
Simon Gardling	1bb0844649	update	2026-04-01 13:12:14 -04:00
Simon Gardling	297264a34a	tests: extract shared jellyfin test helpers and use real jellyfin in annotations test	2026-04-01 11:24:44 -04:00
Simon Gardling	a5206b9ec6	monitoring: add grafana annotations for zfs scrub events	2026-04-01 11:24:43 -04:00
Simon Gardling	3196b38db7	tests: extract shared mock grafana server from jellyfin test	2026-04-01 11:24:43 -04:00
Simon Gardling	59d33cea3d	grafana: power improvement	2026-04-01 11:24:40 -04:00
Simon Gardling	fdf57873d7	prowlarr: fix perms	2026-03-31 23:31:31 -04:00
Simon Gardling	f1b7679196	grafana: remove unused stuff	2026-03-31 18:55:07 -04:00
Simon Gardling	5856d835ba	grafana: qbt smoothing	2026-03-31 18:54:57 -04:00
Simon Gardling	f77f596222	opencode: move android stuff to android-ui skill	2026-03-31 18:44:08 -04:00
Simon Gardling	a288e18e6d	grafana: qbt stats	2026-03-31 17:47:53 -04:00
Simon Gardling	c6b889cea3	grafana: more things 1. Smoothed out power draw - UPS only reports on 9 watt intervals, so smoothing it out gives more relative detail on trends 2. Add jellyfin integration - Good for seeing correlations between statistics and jellyfin streams 3. intel gpu stats - Provides info on utilization of the gpu	2026-03-31 17:25:06 -04:00
Simon Gardling	0027489052	grafana: smooth power draw	2026-03-31 13:20:07 -04:00
Simon Gardling	ebc4c66fc3	update	2026-03-31 12:52:21 -04:00
Simon Gardling	bc227a89c1	remove old secrets	2026-03-31 12:47:09 -04:00
Simon Gardling	e3be112b82	grafana: init Shows powerdraw, temps, uptime, and jellyfin streams	2026-03-31 12:38:43 -04:00
Simon Gardling	5375f8ee34	gitea: add actions runner and CI/CD deploy workflow This will avoid me having to run "deploy" myself on my laptop. All I will need to do is push a commit and it will self-deploy.	2026-03-31 12:38:43 -04:00
Simon Gardling	e4feaa35ad	secrets: migrate build-time secrets to agenix runtime - coturn: switch static-auth-secret to static-auth-secret-file - matrix: switch registration_token and turn_secret to file-based - murmur: switch password to environmentFile with agenix - p2pool: move public wallet address to service-configs.nix	2026-03-31 12:38:43 -04:00


1577 Commits
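
Commit df15be01ea describes the xmrig pause sidecar only in prose: poll /slots every 3s, stop xmrig while any slot is processing, restart it after a 10s idle grace period, and leave xmrig untouched when llama-cpp is unreachable. The sketch below is one way that decision logic could look. It is not the code from the repository. The names `PauseState` and `step` are hypothetical, and the `is_processing` field on each slot is an assumption about the /slots response shape.

```python
from dataclasses import dataclass
from typing import Optional

POLL_INTERVAL_S = 3.0   # polling interval stated in the commit message
GRACE_PERIOD_S = 10.0   # idle grace period stated in the commit message


@dataclass
class PauseState:
    paused: bool = False                 # True once the sidecar has stopped xmrig
    idle_since: Optional[float] = None   # time all slots were first seen idle


def step(state, slots, now):
    """Decide what to do after one poll.

    slots: list of slot dicts from /slots, or None if llama-cpp was unreachable.
    now:   current time in seconds (any monotonic clock).
    Returns (new_state, action) where action is "stop", "start", or None.
    """
    if slots is None:
        # Unreachable server: leave xmrig and the state untouched.
        return state, None

    # Assumption: each slot dict carries a boolean "is_processing" field.
    busy = any(slot.get("is_processing", False) for slot in slots)

    if busy:
        # Stop xmrig once, and cancel any grace period in progress.
        action = None if state.paused else "stop"
        return PauseState(paused=True, idle_since=None), action

    if not state.paused:
        # Idle and xmrig was never stopped by us: nothing to do.
        return state, None

    # Idle while paused: start or continue the grace period.
    idle_since = state.idle_since if state.idle_since is not None else now
    if now - idle_since >= GRACE_PERIOD_S:
        return PauseState(paused=False, idle_since=None), "start"
    return PauseState(paused=True, idle_since=idle_since), None
```

A caller would invoke `step` once per `POLL_INTERVAL_S` and map "stop" and "start" onto whatever controls the xmrig service. Keeping the decision in a pure function makes the grace-period behaviour easy to test without a running server.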