mirror of
https://github.com/open-metadata/OpenMetadata
synced 2026-05-24 09:39:11 +00:00
* Fix k8s operator exit handler pod loop and TTL cleanup, add tolerations support (#26772)
Fix two bugs in the OMJob operator:
- Exit handler pods were recreated indefinitely because findExitHandlerPod()
lacked the name-based fallback that findMainPod() already had, causing
label propagation delays to trigger repeated pod creation events
- Terminal phase handler never rescheduled for TTL-based cleanup, so pods
were never cleaned up after ttlSecondsAfterFinished expired
Add tolerations support for ingestion pod scheduling across the full stack:
- Operator: OMJobPodSpec field, PodManager.buildPod(), CRD schema
- Server: OMJob model, K8sPipelineClientConfig parsing, K8sPipelineClient
builder, K8sJobUtils serialization
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Add K8S_TOLERATIONS env var mapping in openmetadata.yaml
Adds the tolerations config binding so the server picks up the
K8S_TOLERATIONS env var set by the Helm chart secret.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* Add tolerations to k8s test values for local validation
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* fix cleanup
* Address PR review: remove redundant pod lookup and guard null items
- Remove redundant server-created pod selector fallback in findMainPod()
since buildPodSelector() now matches all pods by omjob-name alone
- Add null guard for getItems() in deletePods() to prevent NPE
- Update local test values for namespace and image config
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
(cherry picked from commit
|
||
|---|---|---|
| .. | ||
| distributed-test | ||
| helm | ||
| mock-oidc-provider | ||
| .env.sso-test | ||
| docker-compose-fuseki.yml | ||
| docker-compose-gcp.yml | ||
| docker-compose-postgres-fuseki.yml | ||
| docker-compose-postgres.yml | ||
| docker-compose.yml | ||
| Dockerfile | ||