gen_inference_defaults
feat: inferencing default, automatic tool parsing fallback and wire min_p (#9092)
2026-03-22 00:57:15 +01:00
meta
feat(ui): Interactive model config editor with autocomplete (#9149)
2026-04-07 14:42:23 +02:00
application_config.go
fix(traces): cap captured body size to keep admin Traces UI responsive (#9946)
2026-05-22 15:29:24 +02:00
application_config_test.go
feat: backend versioning, upgrade detection and auto-upgrade (#9315)
2026-04-11 22:31:15 +02:00
backend_capabilities.go
feat(realtime): Add Liquid Audio s2s model and assistant mode on talk page (#9801)
2026-05-13 21:57:27 +02:00
backend_capabilities_test.go
feat(gallery): Speed up load times and clean gallery entries (#9211)
2026-05-06 14:51:38 +02:00
backend_hooks.go
feat(vllm): parity with llama.cpp backend (#9328)
2026-04-13 11:00:29 +02:00
config_suite_test.go
dependencies(grpcio): bump to fix CI issues (#2362)
2024-05-21 14:33:47 +02:00
distributed_config.go
fix(distributed): cascade-clean stale node_models rows + filter routing by healthy status (#9754)
2026-05-13 21:57:50 +02:00
gallery.go
feat(gallery): verify backend OCI images with keyless cosign (#9823)
2026-05-18 08:02:20 +02:00
gguf.go
feat(llama-cpp): bump to MTP-merge SHA and automatically set MTP defaults (#9852)
2026-05-16 22:42:48 +02:00
gguf_reasoning_test.go
Respect explicit reasoning config during GGUF thinking probe (#9463)
2026-04-21 21:53:10 +02:00
hooks_llamacpp.go
feat(vllm): parity with llama.cpp backend (#9328)
2026-04-13 11:00:29 +02:00
hooks_test.go
feat(config): default prompt_cache_all to true (#9951)
2026-05-22 22:06:22 +02:00
hooks_vllm.go
feat(vllm): expose AsyncEngineArgs via generic engine_args YAML map (#9563)
2026-04-29 00:49:28 +02:00
inference_defaults.go
feat: inferencing default, automatic tool parsing fallback and wire min_p (#9092)
2026-03-22 00:57:15 +01:00
inference_defaults.json
chore: bump inference defaults from unsloth (#9396)
2026-04-17 09:05:55 +02:00
inference_defaults_test.go
feat: inferencing default, automatic tool parsing fallback and wire min_p (#9092)
2026-03-22 00:57:15 +01:00
model_config.go
feat(config): default prompt_cache_all to true (#9951)
2026-05-22 22:06:22 +02:00
model_config_filter.go
feat: add distributed mode (#9124)
2026-03-30 00:47:27 +02:00
model_config_loader.go
feat(concurrency-groups): per-model exclusive groups for backend loading (#9662)
2026-05-05 08:42:50 +02:00
model_config_loader_test.go
feat(concurrency-groups): per-model exclusive groups for backend loading (#9662)
2026-05-05 08:42:50 +02:00
model_config_test.go
feat(concurrency-groups): per-model exclusive groups for backend loading (#9662)
2026-05-05 08:42:50 +02:00
model_test.go
fix(tests): inline model_test fixtures after tests/models_fixtures removal
2026-04-28 12:58:49 +00:00
mtp.go
feat(llama-cpp): bump to MTP-merge SHA and automatically set MTP defaults (#9852)
2026-05-16 22:42:48 +02:00
mtp_test.go
feat(llama-cpp): bump to MTP-merge SHA and automatically set MTP defaults (#9852)
2026-05-16 22:42:48 +02:00
parser_defaults.json
feat(vllm): parity with llama.cpp backend (#9328)
2026-04-13 11:00:29 +02:00
runtime_settings.go
fix(traces): cap captured body size to keep admin Traces UI responsive (#9946)
2026-05-22 15:29:24 +02:00
runtime_settings_persist.go
feat(branding): admin-configurable instance name, tagline, and assets (#9635)
2026-05-02 15:51:36 +02:00
runtime_settings_persist_test.go
feat(branding): admin-configurable instance name, tagline, and assets (#9635)
2026-05-02 15:51:36 +02:00