llama-cpp: set batch size to 4096

2026-04-06 02:29:37 -04:00
parent 9addb1569a
commit 6d47f02a0f

@@ -40,6 +40,10 @@ in
"--metrics"
"--alias"
modelAlias
+"-b"
+"4096"
+"-ub"
+"4096"
];
};
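
Note: in llama.cpp, `-b` is the short form of `--batch-size` (logical maximum batch size) and `-ub` is the short form of `--ubatch-size` (physical maximum batch size). Since this commit sets both to the same value, one way to keep them in sync is to bind the number once. The following is only a minimal sketch: the attribute name `extraArgs`, the `batchSize` binding, and the placeholder value of `modelAlias` are assumptions for illustration and do not appear in this diff.

```nix
# Hypothetical sketch, not the module from this commit.
let
  # Single source of truth for both batch flags (value taken from the commit).
  batchSize = 4096;
  # Placeholder: the real modelAlias is defined elsewhere in the module.
  modelAlias = "example-model";
in
{
  # `extraArgs` is an assumed attribute name; the diff does not show the real one.
  extraArgs = [
    "--metrics"
    "--alias"
    modelAlias
    "-b" # --batch-size: logical maximum batch size
    (toString batchSize)
    "-ub" # --ubatch-size: physical maximum batch size
    (toString batchSize)
  ];
}
```

Evaluating this expression should yield the same argument list as the diff above, with `"4096"` appearing after both `-b` and `-ub`.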