LocalAI/backend/cpp
Ettore Di Giacinto b27d38a53d fix(buun-llama-cpp): backport logit_bias_eog field to grpc-server copy
LocalAI's shared grpc-server.cpp reaches
ctx_server.get_meta().logit_bias_eog twice (the twin params_from_json_cmpl
callsites). That accessor was added to server_context_meta upstream after
buun's 2026-04-05 fork-point, so compiling against buun errors with
  'struct server_context_meta' has no member named 'logit_bias_eog'.

Rewrite the call sites — only in the buun grpc-server.cpp copy — to source
the vector from params_base.sampling.logit_bias_eog instead. That vector is
the underlying data the upstream meta accessor eventually returns (buun
still carries common_params_sampling::logit_bias_eog at common.h:280), so
the substitution yields identical behavior on both trees.

The sed is guarded by a grep for the call site, so this patch is
self-disabling once buun rebases past the upstream refactor.

Assisted-by: Claude:Opus-4.7 [Read] [Edit] [Bash] [WebFetch]
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2026-04-24 12:52:53 +00:00
..
buun-llama-cpp fix(buun-llama-cpp): backport logit_bias_eog field to grpc-server copy 2026-04-24 12:52:53 +00:00
grpc fix: speedup git submodule update with --single-branch (#2847) 2024-07-13 22:32:25 +02:00
ik-llama-cpp fix(ik-llama-cpp): patch clip.cpp for new ggml_quantize_chunk signature (#9531) 2026-04-24 13:07:26 +02:00
llama-cpp chore: ⬆️ Update ggml-org/llama.cpp to 187a45637054881ecacf17f8e2f6f8f2ba7df1c7 (#9520) 2026-04-24 09:17:06 +02:00
turboquant fix(turboquant): drop ignore-eos patch, bump fork to b8967-627ebbc (#9423) 2026-04-19 21:05:21 +02:00