OpenMetadata

mirror of https://github.com/open-metadata/OpenMetadata synced 2026-05-24 09:39:11 +00:00

Author	SHA1	Message	Date
Sid	ca2d0122db	test(playwright): add nightly SAML session renewal coverage (#27619 ) * test(playwright): add nightly SAML session renewal spec Covers OM's JWT refresh behavior for SAML sessions end-to-end against the local Keycloak fixture: silent refresh after expiry, concurrent 401s queuing behind a single refresh call, and forced re-login when the server-side SAML HttpSession is gone. Reuses the snapshot/restore mechanism and keycloak-azure-saml provider helper introduced in #27164; shortens samlConfiguration.security.token Validity to 10s so the suite observes multiple expiry cycles in <60s. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * Update openmetadata-ui/src/main/resources/ui/playwright/utils/sessionRenewal.ts Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * test(playwright): drop expiry wait from refresh-on-reload SSO specs The reactive 401 refresh path races with the AuthProvider useEffect that wires tokenService.renewToken from authenticatorRef — if the 401 from /users/loggedInUser lands before that effect commits the populated ref, refreshToken() returns null and the user is logged out instead of refreshed. With tokenValidity=10s (< EXPIRY_THRESHOLD_MILLES=60s), the UI's proactive timer in startTokenExpiryTimer fires immediately on every mount, so /auth/refresh is exercised on each reload regardless of expiry state. Assertions on token rotation and session continuity still cover "silent refresh works end-to-end". The SAML-session-gone case still waits for expiry — it needs to. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * test(playwright): trigger refresh via SPA nav in SSO renewal specs page.reload() remounts React and re-races the axios interceptor setup in AuthProvider — the useEffect that wires authenticatorRef.renewIdToken onto TokenService has a ref-typed dependency that doesn't reliably re-run, so the first 401 after reload sometimes finds renewToken=null and the interceptor silently logs the user out instead of refreshing. Click the Explore sidebar link instead. The click triggers authenticated API calls while staying inside the already-mounted React tree, so the interceptor always reaches the wired TokenService. Spec now passes 10/10 locally. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Siddhant <siddhant@MacBook-Pro-621.local> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-05-04 11:48:45 +05:30
Chirag Madlani	d095413ed1	fix(ci): nightly workflow running stale project getting failed [skip-ci] (#27849 ) Some checks are pending Integration Tests - MySQL + Elasticsearch / Detect Changes (push) Waiting to run Details Integration Tests - MySQL + Elasticsearch / integration-tests-mysql-elasticsearch (push) Blocked by required conditions Details Integration Tests - PostgreSQL + OpenSearch / Detect Changes (push) Waiting to run Details Integration Tests - PostgreSQL + OpenSearch / integration-tests-postgres-opensearch (push) Blocked by required conditions Details Java Checkstyle / java-checkstyle (push) Waiting to run Details Maven Collate Tests / maven-collate-ci (push) Waiting to run Details OpenMetadata Service Unit Tests / Detect Changes (push) Waiting to run Details OpenMetadata Service Unit Tests / openmetadata-service-unit-tests (mysql) (push) Blocked by required conditions Details OpenMetadata Service Unit Tests / openmetadata-service-unit-tests (postgresql) (push) Blocked by required conditions Details OpenMetadata Service Unit Tests / k8s_operator-unit-tests (push) Blocked by required conditions Details OpenMetadata Service Unit Tests / openmetadata-service-unit-tests-status (push) Blocked by required conditions Details Publish Package to Maven Central Repository / publish-maven-packages (push) Waiting to run Details	2026-05-04 10:53:16 +05:30
miriann-uu	7b01731754	GEN-5164: Add cherry pick matrix (#27674 )	2026-04-29 10:39:31 +05:30
IceS2	84ed278720	chore(ingestion): enable basedpyright across the codebase via baseline (#27755 ) * chore(ingestion): enable basedpyright across the codebase via baseline Removes the ~25 paths from `[tool.basedpyright] ignore` (which excluded roughly 90% of the codebase from type checking) and grandfathers the existing violations into a baseline file. New violations in any previously-ignored file now fail CI. Changes: - ingestion/pyproject.toml: drop the entire `ignore = [...]` block - ingestion/setup.py: bump `basedpyright~=1.14` to `~=1.39.0` - ingestion/.basedpyright/baseline.json (new, ~13MB): captures the starting violation set (~18.8K errors + ~37.4K warnings) so the migration is behavior-preserving. Regenerate with `cd ingestion && basedpyright -p pyproject.toml --baselinefile .basedpyright/baseline.json --writebaseline`. basedpyright analysis has minor non-determinism (similar to ruff's), so re-running --writebaseline a few times converges the baseline. - ingestion/noxfile.py: pass `--baselinefile .basedpyright/baseline.json` to the basedpyright invocation in the `static-checks` session so CI honors the grandfathering. CI already runs the session via `cd ingestion && nox --no-venv -s static-checks` (py-tests.yml). - ingestion/Makefile: `make static-checks` now delegates to `nox -s static-checks` so local invocations match CI exactly. Also drops the dead Python 3.9 / OM_SKIP_SDK_PY39 branch (we require Python >=3.10 since the previous modernization PR). - .gitignore: add `.serena/` (local language-server cache) * chore(ingestion): add nox to the dev dependency set The static-checks Makefile target and the py-tests CI job both delegate to `nox -s static-checks`, but nox was being installed as a separate side step (`pip install nox` in `install_dev_env`, `uv pip install nox` in the test-environment composite action). Listing it in dev extras means a plain `pip install ingestion[dev]` brings it in. * chore(ingestion): pin basedpyright analysis to py3.10; CI runs once Following the basedpyright + multi-Python-version research: - ingestion/pyproject.toml: add `pythonVersion = "3.10"` to [tool.basedpyright] so type-checking always analyzes for the lowest supported Python version. Forward-incompatible code (tomllib usage, PEP 695 generics, etc.) is caught at type-check time regardless of which Python interpreter runs the checker. - .github/workflows/py-tests.yml: gate the "Run Static Checks" step on `matrix.py-version == '3.10'`. With pythonVersion pinned, results are identical across the matrix; running once avoids redundant work and keeps the baseline file deterministic. Unit tests still run on the full 3.10/3.11/3.12 matrix to verify runtime compatibility. - ingestion/.basedpyright/baseline.json: regenerated cleanly with the new pythonVersion config (~18.8K errors / ~37.3K warnings, similar scale to the previous baseline). Aligns with the canonical type-check-on-floor / test-on-matrix pattern used by Pydantic, CPython, and other major Python projects. * chore(ingestion): pin basedpyright pythonPlatform to Linux + regen baseline CI's previous run still surfaced ~9 issues (2 errors + 7 warnings) that weren't in the baseline. Root cause: my local environment differs from CI's in three ways that affect type inference — Python interpreter (3.11 vs 3.10), platform (Darwin vs Linux), and pip-resolved package versions (couchbase, avro, trino, sqlalchemy stubs all differ slightly). This commit closes the platform gap and regenerates the baseline from a fresh CI-equivalent environment: - ingestion/pyproject.toml: add `pythonPlatform = "Linux"` to [tool.basedpyright] so type-checking uses the Linux subset of stdlib / third-party stubs regardless of where the analyzer runs. - ingestion/.basedpyright/baseline.json: regenerated against a fresh Python 3.10 venv installed via `uv pip install ingestion[test]` (the same install path CI's setup-openmetadata-test-environment composite action uses). New scale: ~18.7K errors / ~37.5K warnings — same ballpark as the previous baseline, with column positions now matching CI's environment. Local-developer note: when running `make static-checks` from a venv that doesn't mirror CI exactly (e.g. macOS, Python 3.11, different package versions), you may see drift errors. The supported workflow for regenerating the baseline is to mirror CI: python3.10 -m venv /tmp/ci-mirror source /tmp/ci-mirror/bin/activate uv pip install --upgrade pip "setuptools<81" uv pip install --no-build-isolation "cx_Oracle>=8.3.0,<9" uv pip install -e "ingestion[test]" uv pip install "basedpyright~=1.39.0" nox cd ingestion && basedpyright -p pyproject.toml \ --baselinefile .basedpyright/baseline.json --writebaseline * chore(ingestion): drop pythonPlatform pin and regen baseline from CI-mirror The previous attempt added `pythonPlatform = "Linux"` thinking it would make the local-generated baseline match CI. It did the opposite — Linux platform stubs activate additional conditional code paths that weren't analyzed before, so CI saw 101 errors instead of the prior 2 errors. Reverting: - Drop `pythonPlatform = "Linux"` from [tool.basedpyright]. Without it, basedpyright analyzes for the host platform; on CI's ubuntu-latest runner that's Linux automatically, but type-stub coverage stays the same as before (matching the `d9196dff6b` baseline). - Regenerate ingestion/.basedpyright/baseline.json against a fresh Python 3.10 venv installed via `uv pip install ingestion[test]` (mirroring CI's setup-openmetadata-test-environment composite action). ~18.8K errors / 37.7K warnings captured — same scale as the working `d9196dff6b` version. Local-developer note: any baseline regeneration done on macOS will drift from CI's Linux env (different transitive package versions, different stubs). The supported local mirror procedure: python3.10 -m venv /tmp/ci-mirror source /tmp/ci-mirror/bin/activate uv pip install --upgrade pip "setuptools<81" uv pip install --no-build-isolation "cx_Oracle>=8.3.0,<9" uv pip install -e "ingestion[test]" uv pip install "basedpyright~=1.39.0" nox cd ingestion && basedpyright -p pyproject.toml \\ --baselinefile .basedpyright/baseline.json --writebaseline * chore(ingestion): regen baseline from full CI install (mac arm64 mirror) Prior CI-mirror only installed [test], skipping [all] and the four --no-deps SA pins (sqlalchemy-redshift/databricks/ibmi, pydoris-custom). That left ~75 connector packages out of the analysis env, so basedpyright couldn't resolve types from databricks.sqlalchemy, GE 0.18 Batch, sklearn BaseEstimator, airflow SQLAlchemy models, pandas/numpy stubs, etc. CI saw 129 errors absent from the baseline. Regenerated against a fresh py3.10 venv that mirrors .github/actions/setup-openmetadata-test-environment exactly: uv pip install ./ingestion[dev] make generate uv pip install "setuptools<81" uv pip install --no-build-isolation "cx_Oracle>=8.3.0,<9" uv pip install --no-deps sqlalchemy-redshift==0.8.14 \ sqlalchemy-databricks==0.2.0 \ sqlalchemy-ibmi==0.9.3 \ pydoris-custom==1.1.0 uv pip install ./ingestion[all] uv pip install ./ingestion[test] uv pip install nox First run: 128 errors, 272 warnings — within 1 error of CI's 129/272. Wrote baseline with 56,100 entries across 1,035 files. Verify run with the new baseline reports 0/0/0. macOS arm64 vs Linux x86_64 wheel resolution may leave a small residual (~3-7 errors per the `d9196dff6b` precedent). Re-run --writebaseline 2-3x if any show up in CI. * chore(ingestion): silence avro.py:95 basedpyright residual CI's Linux fastavro stub returns Schema as `str \| List[Any]`, while the macOS arm64 wheel narrows to `str` — the only error not absorbed by the regenerated baseline. Add a targeted pyright: ignore on the parse_avro_schema call instead of broadening behavior. * chore(ingestion): tolerate cross-platform pyright ignore drift CI's `--baselinemode=lock` (default) requires the baseline to match exactly — neither up nor down. Two related issues: 1. The avro.py noqa silenced not just the surfaced error but 10 cascading entries at line 95 (sub-errors propagating from the unresolved `schema` arg type). Baseline went `down by 10` → lock violated → exit 3 even with `0 errors` reported. Regenerate baseline so the 10 stale entries are dropped. 2. The macOS arm64 fastavro stub doesn't surface that error in the first place, so basedpyright treats the noqa as `reportUnnecessaryTypeIgnoreComment` locally — causing the opposite lock mismatch on CI (a warning entry that doesn't exist there). Disable the rule so platform-specific residuals can land without flapping between local and CI. * chore(ingestion): use --baselinemode=discard for cross-platform tolerance CI's implicit default is `lock`, which fails on any baseline change in either direction (errors going up or down) via console.error → exit 3. That cannot accommodate macOS arm64 vs Linux x86_64 stub drift: a baseline regenerated locally always carries some entries that don't fire on CI (and vice versa). `auto` would tolerate the drift but silently overwrites the baseline file — unacceptable in CI, where unreviewed changes never get committed back. `discard` is the right balance: - New errors not in the baseline still fail the run (early-return path in BaselineHandler.write before the lock/discard branch). - Stale baseline entries (errors that no longer fire on the current platform) print an info message and exit 0. - The baseline file is never modified.	2026-04-27 17:15:44 +02:00
IceS2	1fa0c79d27	chore(github): migrate issue templates to structured forms (#27710 ) * chore(github): migrate issue templates to structured forms - Convert bug_report, feature_request, doc_update to GitHub issue forms (YAML) - Add connector_bug form with free-text Connector field - Drop epic and feature_task templates (stale since 2022, no usage evidence) - Add auto-label workflow that maps the Connector field to a namespaced connector:<name> label, falling back to connector:other on 0 or 2+ matches - Labels are applied exclusively and auto-created with a grey "Connector" description when missing * chore(github): drop redundant pipeline type field from connector_bug form Feature area already covers metadata/lineage/profiler/usage distinction. * fix(github): address PR review feedback - bug_report.yml: add labels: ["bug"] for pattern consistency - label-connector.yml: add contents: read permission (needed by checkout) - label_connector.py: raise on unexpected HTTP status; accept 404 for idempotent GET-label and DELETE-label-from-issue; stop echoing the raw Connector field value into workflow logs	2026-04-24 14:08:20 +02:00
Mayur Singal	878421a644	fix: enable subprocess coverage tracking for CLI E2E tests (#27329 ) * fix: enable subprocess coverage tracking for CLI E2E tests CLI E2E tests run connectors via `subprocess.Popen("metadata ingest")` but the subprocess coverage data was silently lost. Two issues: 1. Missing `parallel = true` in coverage config — parent pytest process and child subprocess both wrote to the same `.coverage` file, causing data collision. With parallel mode, each process writes to its own `.coverage.<pid>` file that `coverage combine` can merge. 2. `COVERAGE_PROCESS_START` used a relative path (`ingestion/pyproject.toml`) in sitecustomize.py. Resolved to absolute using `GITHUB_WORKSPACE`. Evidence: Metabase (zero unit tests, only E2E) shows 53.6% on SonarCloud with client.py at 17.2% — inspection of .coverage.metabase confirms only import-time + in-process setup lines are present, with zero method body coverage from the subprocess execution. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: remove -a (append) flags incompatible with parallel coverage mode `coverage run -a` and `coverage combine -a` conflict with `parallel = true` in the coverage config. In parallel mode each process writes to its own `.coverage.<pid>` file, and `coverage combine` merges them — no append needed. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * MINOR: Fix snowflake e2e (#26677) * MINOR: Fix snowflake e2e * fix pyformat * improve snowflake test * fix count * mark flaky auto classification test * improve test address comment --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-23 06:57:30 +02:00
Sid	0a98f5bf32	test(playwright): add nightly SSO login spec starting (#27164 ) * test(playwright): add nightly SSO login spec starting with Okta Extends Playwright coverage end-to-end for SSO login flows. Today's SSO coverage (Features/SSOConfiguration.spec.ts) only asserts the config form UI. This adds a new suite that configures OpenMetadata to an external identity provider, drives a real login through the provider's hosted UI, and validates the resulting session against the OM API. Phase 1 ships Okta only (integrator-9351624.okta.com). Additional providers (Auth0, Azure, Cognito, SAML, Google) plug into the same dispatcher by adding a ProviderHelper implementation. ## What's new - playwright/e2e/Auth/SSOLogin.spec.ts — two-test suite tagged @sso 1. Asserts the SSO sign-in button renders on /signin with the correct brand label and that the basic-auth form is not shown. 2. Clicks the button, drives the provider's login widget, follows the OAuth callback, completes first-run self-signup when needed, lands on /my-data, then verifies the JWT by calling GET /api/v1/users/loggedInUser and asserting the returned email matches SSO_USERNAME. - playwright/utils/ssoAuth.ts — provider-agnostic orchestration: applyProviderConfig (PUT /api/v1/system/security/config), restoreBasicAuth, buildAuthContextFromJwt, verifyLoggedInUserMatches. Composes existing getApiContext/getAuthContext/getToken helpers — no token extraction or HTTP plumbing is reimplemented. - playwright/utils/sso-providers/{index,okta}.ts — ProviderHelper interface plus the Okta Identity Engine widget driver. Defaults the dev tenant values from the committed openmetadata.yaml snippet so the spec only needs SSO_USERNAME/SSO_PASSWORD to run locally. - playwright/constant/ssoAuth.ts — env var key constants, PROVIDER_BUTTON_TEXT map, and the BASIC_AUTH_CONFIG payload used for cleanup. - playwright.config.ts — new 'sso-auth' project matching playwright/e2e/Auth/*/.spec.ts with its own serial workers, and '/Auth/' added to the chromium project's testIgnore so these tests never run in the default suite. ## How provider switching works beforeAll logs in as admin via basic auth, captures the admin JWT via getToken(page) BEFORE the swap, then PUTs the Okta config. The admin JWT survives the provider swap because OM's internal JWKS stays in publicKeyUrls and the admin user's isAdmin flag is persisted in the DB. afterAll rebuilds an API context from that JWT and restores basic auth, making the spec fully idempotent — the same OM instance can run the suite repeatedly without any manual cleanup. ## Running locally export SSO_PROVIDER_TYPE=okta export SSO_USERNAME='<okta-test-user>' export SSO_PASSWORD='<okta-test-password>' npx playwright test playwright/e2e/Auth/SSOLogin.spec.ts \ --project=sso-auth --workers=1 Verified end-to-end against integrator-9351624.okta.com — both tests pass in ~12s on an already-provisioned user, ~14s on first-run self-signup. Cleanup leaves the server in basic-auth mode. ## Notes for reviewers - The existing .github/workflows/playwright-sso-tests.yml already wires up the CI matrix and secret names; this change intentionally does NOT enable the cron schedule. That lands in a follow-up once one provider is stable for a few nightly runs. - OKTA_SSO_CLIENT_ID / OKTA_SSO_DOMAIN / OKTA_SSO_PRINCIPAL_DOMAIN env vars can override the baked-in dev tenant defaults if a different Okta tenant is used in CI. * ci: add dedicated SSO Login Nightly workflow Adds .github/workflows/playwright-sso-login-nightly.yml, a standalone workflow that runs the new SSOLogin spec nightly at 03:00 UTC instead of piggy-backing on playwright-sso-tests.yml. The existing playwright-sso-tests.yml is left untouched — it still covers the SSO configuration form UI via SSOConfiguration.spec.ts and its matrix/secrets wiring is unchanged. The new workflow complements it with a real end-to-end login round-trip: - Schedule: cron '0 3 * * ' - Provider matrix: okta only for Phase 1 (extended as helpers ship) - Invokes playwright/e2e/Auth/SSOLogin.spec.ts under the new sso-auth Playwright project with workers=1 - Wires provider credentials via secrets with the existing {PROVIDER}_SSO_USERNAME / {PROVIDER}_SSO_PASSWORD convention plus optional OKTA_SSO_CLIENT_ID / OKTA_SSO_DOMAIN / OKTA_SSO_PRINCIPAL_DOMAIN overrides - Uses the shared setup-openmetadata-test-environment composite action, PostgreSQL, ingestion disabled — matching the existing SSO tests workflow - Uploads the HTML report as an artifact on every run and cleans up the docker stack in a final always-run step refactor(playwright): simplify ssoAuth helpers - verifyLoggedInUserMatches now asserts directly on the lowercased email field instead of building a candidate array and feeding it a long stringified failure message. The assertion failure already shows expected vs received, so the wrapper string was just noise. - Drop buildAuthContextFromJwt — it was a one-line wrapper around getAuthContext. The spec calls getAuthContext directly now. * refactor(playwright): address SSO suite review feedback - Extract OM_BASE_URL from PLAYWRIGHT_TEST_BASE_URL (with the same http://localhost:8585 default as playwright.config.ts) and export it from constant/ssoAuth.ts. okta.ts and BASIC_AUTH_CONFIG both consume it, so callbackUrl, the OM JWKS entry in publicKeyUrls, and the basic-auth restore payload all match the test target — including CI runs against non-default hosts. - Drop PROVIDER_BUTTON_TEXT. It was exported but never imported; the ProviderHelper.expectedButtonText field is the only source of truth for the SSO sign-in button label and the spec already reads from it. - Restore the OM convention adminPrincipals: ['admin'] in the Okta config (matches conf/openmetadata.yaml's AUTHORIZER_ADMIN_PRINCIPALS default). The previous code was granting admin to whichever IdP user ran the suite — verifyLoggedInUserMatches only needs an authenticated session, not admin, so the elevation was unnecessary. This also drops the now-unused requireEnv on SSO_USERNAME inside okta.ts; the spec itself still gates on the env var via test.skip. - Set workers: 1 on the sso-auth Playwright project. fullyParallel: false alone wasn't enough — the global workers: 3 on CI could still fan out across multiple Auth/*/.spec.ts files in the future. The explicit limit enforces full isolation as more provider specs land. * ci: avoid CodeQL "Excessive Secrets Exposure" in SSO Login Nightly Replaces the dynamic secret lookup secrets[format('{0}_SSO_USERNAME', upper(matrix.provider))] with a static reference secrets.OKTA_SSO_USERNAME CodeQL flagged the dynamic indexing because GitHub Actions can only mask & scope secrets that are referenced statically. With a computed key, the runner has no way to know which single secret is needed and conservatively materializes EVERY org and repo secret into the step's environment — even though the test only reads OKTA_SSO_. Static references let GitHub expose only the two credentials this step actually uses. Phase 1's matrix is okta-only so the change is two lines. The added inline comment documents the convention for future providers: add a sibling step gated by `if: matrix.provider == '<provider>'` with that provider's static secret references — do not bring back the secrets[format(...)] pattern. refactor(playwright): capture/restore real security config in SSO suite - Snapshot /system/security/config in beforeAll, restore exact payload in afterAll instead of PUTting a hand-rolled basic-auth baseline (preserves allowedDomains, forceSecureSessionCookie, adminPrincipals, etc.) - Strip ldap/saml subtrees from the snapshot: GET returns empty-string placeholders the PUT validator rejects - Require OKTA_SSO_{CLIENT_ID,DOMAIN,PRINCIPAL_DOMAIN} via getRequiredEnv; no more hardcoded tenant defaults - Fail fast in beforeAll if admin JWT capture returns empty string so the server is never left stuck in SSO mode - Shrink Okta provider override to just the fields Okta needs; sibling authorizer fields come from the captured snapshot Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * ci(sso-login): extract per-provider composite action Restructures the nightly workflow so provider credentials stay statically referenced for CodeQL while making it trivial to add new providers: - New composite action .github/actions/sso-login-run bundles all shared setup + test-run logic; pulls non-secret provider config from the caller's vars context dynamically (${PROVIDER_UPPER}_SSO_) - playwright-sso-login-nightly.yml becomes a thin dispatcher with one real job per provider. Each job declares environment: test so it can resolve its password via a static secrets.<PROVIDER>_SSO_PASSWORD reference (no secrets[format(...)] dynamic lookup, CodeQL clean) - Adding a provider = copy the okta job stanza, swap the secret name, add the provider to the dispatch input choices, register the helper in sso-providers/index.ts Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> refactor(playwright): move Okta tenant config to a repo constant The Okta tenant identifiers (clientId, domain, principalDomain) are non-secret OAuth public values — visible on the hosted login page during any sign-in. Keeping them in GitHub environment variables cost setup friction (5 env vars to configure locally, each a potential typo) without any security benefit. Move them back to a committed OKTA_TENANT constant in okta.ts where a reviewer can see exactly which tenant the suite is exercising. Net effect: - Local runs only need SSO_PROVIDER_TYPE, SSO_USERNAME, SSO_PASSWORD. - The test environment in GH Actions keeps OKTA_SSO_USERNAME (variable) and OKTA_SSO_PASSWORD (secret); the three tenant variables are no longer consumed. - Composite action drops the jq-based dynamic var extraction; the caller passes sso_username directly. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * ci(sso-login): move timeout-minutes from composite step to job level Composite actions don't support timeout-minutes on individual steps — that's a runner job field only. Move the 30-minute test timeout up to the dispatcher job and bump to 45 minutes to cover docker + maven setup. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * ci(sso-login): consolidate dispatcher + composite action into one file Collapse the dispatcher workflow + composite action split into a single ~115-line workflow using a strategy matrix and dynamic vars[format(...)] / secrets[format(...)] credential resolution keyed on the matrix provider name. Trade-off: - CodeQL "Excessive Secrets Exposure" (low severity) will re-flag the dynamic secret lookup. Accepted in exchange for a single source of truth and true zero-workflow-churn multi-provider support. Onboarding a new provider is now: 1. Add its name to the matrix array + dispatch options list. 2. Add <PROVIDER>_SSO_USERNAME (variable) + <PROVIDER>_SSO_PASSWORD (secret) in the test environment. 3. Register the helper in sso-providers/index.ts. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * ci(sso-login): drop provider-prefix bash step; use case-insensitive lookup GitHub secret and variable names are case-insensitive, so format('{0}_SSO_PASSWORD', matrix.provider) with the lowercase matrix value resolves correctly against the uppercase conventional names like OKTA_SSO_PASSWORD. That removes the need for a separate "Compute provider prefix" step and its cross-step env-context plumbing. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * ci(sso-login): drop redundant case-insensitivity comment Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * ci(sso-login): pin playwright install to 1.57.0 to match package.json The previous 1.51.1 pin was stale vs. the @playwright/test version in package.json. The mismatch caused browser cache path divergence — the install step wrote browsers under 1.51.1's cache and the test run looked for them under 1.57.0's cache and failed with "browsers not installed." Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * refactor(playwright): address SSO suite review comments [skip ci] - Drive Okta tenant (clientId, domain, principalDomain) from env vars, falling back to the existing nightly tenant values as defaults - Use redirectToHomePage as the final assertion in the SSO login step - Document why the /signup vs /my-data branch is conditional Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * saml * test(playwright): add SAML providers to SSO login nightly Extend the nightly SSO login matrix with Azure AD SAML and a self-contained Keycloak SAML fixture (Azure-profile + Google-profile realms), so the suite exercises the full SAML flow end-to-end without relying on a hosted IdP. - docker/local-sso/keycloak-saml: Keycloak 26.3.3 compose + pre-imported realms bound to OM at localhost:8585, port-overridable via KEYCLOAK_SAML_PORT. - playwright sso-providers: azure-saml helper (hosted tenant, non-secret federation metadata committed) and keycloak-saml factory that fetches the realm's IdP X509 at runtime. - SSO assertion matches OM's actual SAML sign-in label ("Sign in with SAML SSO"), since providerName isn't propagated into the store for the SAML provider branch of getAuthConfig. - Workflow starts/stops the Keycloak stack only for keycloak-* matrix rows and injects the fixture credentials inline. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * refactor(playwright): fetch Azure SAML IdP cert at runtime Drop the committed Azure Federated SSO X509 certificate and the AZURE_SAML_IDP_CERTIFICATE env fallback from the azure-saml provider. The cert now comes from Azure's federation metadata XML endpoint at test start, mirroring how the Keycloak provider resolves its realm cert, so the suite stays aligned with Azure's ~3-year cert rotations automatically. - New saml-metadata.ts exporting fetchIdpX509Certificate(descriptorUrl, label), reused by azure-saml and keycloak-saml. - azure-saml.buildConfigPayload is now async and pulls the cert from https://login.microsoftonline.com/<tenantId>/federationmetadata/2007-06/federationmetadata.xml before building the SAML payload. - keycloak-saml drops its inline cert-fetching helpers and delegates to the shared util. - Trim narration comments across the SSO suite to keep only the non-obvious rationale. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * refactor(playwright): drop hosted Azure SAML provider The nightly Keycloak SAML fixture with Azure-profile attribute claims exercises the same OM SAML code path as the hosted Azure AD tenant. The hosted provider added external tenant/cert coupling without unique coverage, so this removes it. Drops the azure-saml helper, its env keys (AZURE_SAML_TENANT_ID / AZURE_SAML_PRINCIPAL_DOMAIN), the dispatcher registration, and the workflow dispatch option. Keycloak Azure/Google realms remain. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test(playwright): cover SSO session lifecycle end-to-end Extends the SSO login spec beyond "can you log in" to the full session round-trip: reload survives, same-context tabs inherit auth, sidebar logout (with modal confirm) lands on /signin, and post-logout refresh stays signed out. Adds a describe-scoped userContext/userPage created in beforeAll so tests 2-6 inherit the IdP-backed session; test 1 keeps its fresh fixture for the unauthenticated assertion. Cleanup closes the user context before restoring the server security config. Verified locally against keycloak-azure-saml and keycloak-google-saml realms: 6 passed each (was 2). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * remove slow from individual spec * remove slow from beforeAll * style(playwright): fix SSOLogin spec prettier issues Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test(playwright): tighten SSO sign-in locator and await logout response Address Copilot review comments on PR #27164: - Use button.signin-button to match the pattern in SSOAuthentication.spec.ts. - Await /api/v1/users/logout POST alongside the /signin navigation in the logout test to remove the race against the server response. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix * Update openmetadata-ui/src/main/resources/ui/playwright/e2e/Auth/SSOLogin.spec.ts Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * fix * test(playwright): resolve SSO creds via env vars, drop keycloak-google-saml Route Keycloak credentials through the same `vars[format(...)]` / `secrets[format(...)]` indirection as Okta via an `env_prefix` matrix column, removing the hardcoded fixture literals from the workflow. Password lookup falls back `vars \|\| secrets` so fixture passwords can live as vars while real provider secrets stay in secrets. Also drop the keycloak-google-saml variant — same IdP and realm shape as the Azure variant, so it adds CI cost without meaningful coverage. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test(playwright): post SSO login nightly results to Slack Adds a per-provider Slack notification step mirroring the pattern used by the postgresql/mysql nightly workflows — reuses the existing `slack-cli.config.json` and `playwright-slack-report` CLI against the `results.json` that the global JSON reporter already emits. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(playwright): drop logout response wait in SSO spec OktaAuthenticator.logout clears tokens locally with no backend call, and GenericAuthenticator (SAML) hits `GET /auth/logout` — neither triggers the `POST /api/v1/users/logout` the test was waiting on. The listener never matched, so `Promise.all` hung past the 180s test timeout even though the page had already navigated to /signin. Rely on `waitForURL('**/signin')` + the signin button assertion, which are the actual cross-provider success signals. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Siddhant <siddhant@MacBook-Pro-457.local> Co-authored-by: Siddhant <siddhant@MacBook-Pro-529.local> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Siddhant <siddhant@MacBook-Pro-621.local> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-04-17 13:09:54 +05:30
Aniket Katkar	12ce3b614d	Chore(UI): consolidated UI checkstyle fix commands and modify workflow comment (#27402 ) * feat: add consolidated UI checkstyle commands for all and changed files * update prt to pr * test commit to fail ui-checkstyle * update the comment * Revert "test commit to fail ui-checkstyle" This reverts commit `ed056f0629`. * Revert "update prt to pr" This reverts commit `0666fa51a3`. * Worked on comments * pull request target remove * Revert "pull request target remove" This reverts commit `b61e98c16b`. * Worked on comments	2026-04-16 17:18:22 +05:30
Teddy	50c17502cf	MINOR - Enable merge group GH event (#27371 ) * chore: added merge_group for github merge queue * chore: remove unnecessary merger group on team labeler * fix: added gates for merge queue and pull request events	2026-04-15 07:42:08 -07:00
Pere Miquel Brull	1dedc0cf15	Add k8s-operator unit tests to PR CI (#27387 ) * Add k8s-operator unit tests to PR CI pipeline The k8s operator tests only ran during manual release builds. Add a path-filtered job so they run on PRs touching openmetadata-k8s-operator/*, following the same Detect Changes pattern used by the service unit tests. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Remove -DfailIfNoTests=false — we want to catch missing tests Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Fix k8s-operator tests: add surefire includes and remove unnecessary stub Parent POM surefire includes only match org.openmetadata.service., so operator tests under org.openmetadata.operator. were silently skipped. Override with */Test.java in the operator pom.xml. Also remove unused KubernetesClient mock stub from CronOMJobReconcilerTest.setUp — no test reaches the code path that calls context.getClient(), causing UnnecessaryStubbingException. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Rename k8s-operator to k8s_operator in workflow outputs Hyphens in output names are parsed as subtraction in GitHub Actions expressions dot notation, so the job condition would never trigger. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Fix filesystem paths — underscore rename only applies to output keys The replace_all incorrectly changed directory names from openmetadata-k8s-operator to openmetadata-k8s_operator. Only the GitHub Actions output key needs the underscore; all file paths must use the actual hyphenated directory name. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Drop -am flag from k8s-operator test command openmetadata-service is a provided-scope dependency, so -am tries to compile it including shaded ES/OS jars that aren't available in a clean CI environment. The operator module compiles fine on its own. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Fix invalid YAML in conf/openmetadata.yaml The CSP policy line has unescaped colons inside the value which the YAML parser interprets as mapping indicators. Use a folded block scalar (>-) so the value is parsed as a plain string. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Build k8s-operator deps before running tests The operator depends on openmetadata-service (provided scope) which won't be in the Maven cache on a cold CI runner. Build with -am -DskipTests first, then run operator tests separately — same pattern as docker-k8s-operator.yml. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Reintroduce lenient client mock to prevent flaky NPE The reconcile flow is time-dependent — tests using "0 * * * " can reach context.getClient() near the top of the hour. Stub the full client.resources().inNamespace().resource().create() chain as lenient so early-return tests aren't penalized but happy-path tests won't NPE. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Revert conf/openmetadata.yaml — fix belongs in a separate PR Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-15 15:48:18 +02:00
Harsh Vador	f4c939869d	ci(security): add Retire.js workflow to detect bundled JS vulnerabilities (#27315 ) * ci(security): add Retire.js workflow to detect bundled JS vulnerabilities * address gitar * add om existing security scan workflow * address gitar * add slack support & remove PR check * address gitar * change job name * address comment * address comment	2026-04-15 19:12:53 +05:30
Sriharsha Chintalapani	bb0daa180e	RDF, cleanup relations and remove unnecessary bindings, add distributed mode for RDF reindex (#26902 ) * RDF, cleanup relations and remove unnecessary bindings, add distributed mode for RDF reindex * Update generated TypeScript types * Address comments from copilot * Update generated TypeScript types * fix test issues * Fix minor UI bugs * Add the missing filters * Fix RDF export API error * Add export functionality * Fix ui-checkstyle * Fix java checkstyle * Fix unit tests * Fix and increase the coverage for KnowledgeGraph.spec.ts * Fix tests * Remove rdf as default in playwright and local docker * fix ui-checkstyle * Address comments * Potential fix for pull request finding 'CodeQL / Artifact poisoning' Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> * Address copilot comments * Address copilot comments * FIx tests * FIx docker * Update openmetadata-service/src/main/java/org/openmetadata/service/apps/bundles/rdf/distributed/DistributedRdfIndexCoordinator.java Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Address copilot review comments: license headers, JSON escaping, type safety, border-color, stop semantics Agent-Logs-Url: https://github.com/open-metadata/OpenMetadata/sessions/c026e52e-162b-4c9a-9874-43791d4aaac1 Co-authored-by: harshach <38649+harshach@users.noreply.github.com> * Show error toast for unsupported export format in KnowledgeGraph Agent-Logs-Url: https://github.com/open-metadata/OpenMetadata/sessions/c026e52e-162b-4c9a-9874-43791d4aaac1 Co-authored-by: harshach <38649+harshach@users.noreply.github.com> * Fix docker * Fix docker for playwright * Fix docker for playwright * Fix tests * Fix tests * Fix docker * Fix docker * Fix glossary and pagination spec flakiness * update the missing translations * Fix docker * Fix docker * Fix integration test * Fix fuseki not starting * Fixed the run local docker script * worked on comments * Fix flakiness in knowledge graph tests * Fix checkstyle --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Aniket Katkar <aniketkatkar97@gmail.com> Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: harshach <38649+harshach@users.noreply.github.com>	2026-04-14 13:24:41 -07:00
Chirag Madlani	4f7be5f014	fix(ci): filter blob pattern causing failure to sonarcloud (#27357 ) * fix(ci): filter blob pattern causing failure to sonarcloud * fix(ci): add missing backslash continuation in sonar-scanner command Agent-Logs-Url: https://github.com/open-metadata/OpenMetadata/sessions/88d229f2-81dd-4662-8295-a3bb0df03815 Co-authored-by: chirag-madlani <12962843+chirag-madlani@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>	2026-04-14 20:06:21 +05:30
Aniket Katkar	3428dfbd6a	Chore(UI): Fix rbac tests not running on PR checks (#26994 ) * Fix rbac tests not running on PR checks * update the dependency * Update the SearchRBAC dependency	2026-04-14 17:53:59 +05:30
Pere Miquel Brull	f6258819e7	ci: reduce checkout history footprint in PR workflows (#27221 ) * ci: reduce checkout history footprint in PR workflows Optimize actions/checkout usage to avoid downloading the full repo blob history on every PR run. The repo is large, so cloning everything just to run tests wastes minutes of CI time per job. - py-operator-build-test.yml: drop fetch-depth: 0 (no history needed) - openmetadata-service-unit-tests.yml: drop fetch-depth: 0 (Sonar is explicitly skipped via -Dsonar.skip=true); shallow-fetch PR base ref - airflow-apis-tests.yml, py-tests.yml, yarn-coverage.yml: add filter: blob:none to Sonar jobs so commits/trees remain available for blame while blobs are fetched lazily on demand - ui-checkstyle.yml: add filter: blob:none to all jobs that rely on tj-actions/changed-files (needs commit/tree metadata, not blobs) * ci: drop fetch-depth: 0 from jobs that don't walk history Follow-up audit after the initial pass. Four jobs were still declaring fetch-depth: 0 (plus filter: blob:none in two cases) without actually needing any history beyond HEAD. ui-checkstyle.yml - i18n-sync: runs 'yarn i18n' then 'git status --porcelain'. git status compares the working tree to HEAD; no history walk. Default depth 1 is sufficient. - app-docs: same pattern with 'yarn generate:app-docs'. py-sonarcloud-nightly.yml - py-unit-tests: only uploads a coverage artifact, no Sonar invocation. - py-integration-tests: same. - py-combine-coverage: does run SonarSource/sonarqube-scan-action, so it genuinely needs the commit graph — added filter: blob:none for parity with the PR Sonar jobs. * ci: remove unused 'Fetch PR base branch' step from service unit tests Copilot review flagged that the step was using --depth=1 while the main checkout is also shallow, which would break any merge-base operation. On investigation, nothing downstream actually uses the base ref: the only command that runs after the checkout is 'mvn ... -Dsonar.skip=true', which has no git dependency. The step was preserved defensively in the previous commit, but it's dead code — cleanest fix is to delete it.	2026-04-13 10:46:17 -07:00
Chirag Madlani	917a36c6a4	Potential fix for code scanning alert no. 1842: Artifact poisoning (#27220 ) * Potential fix for code scanning alert no. 1842: Artifact poisoning Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> * Pin Yarn version to 1.22.18 to fix artifact poisoning alert Agent-Logs-Url: https://github.com/open-metadata/OpenMetadata/sessions/29aebdb5-eef0-4a2a-be01-489deef48d2b Co-authored-by: chirag-madlani <12962843+chirag-madlani@users.noreply.github.com> * Fix artifact poisoning in update-playwright-e2e-docs.yml: replace npm install -g yarn with pinned corepack Agent-Logs-Url: https://github.com/open-metadata/OpenMetadata/sessions/550fba5a-bb13-45da-a144-b67599c9eaa4 Co-authored-by: chirag-madlani <12962843+chirag-madlani@users.noreply.github.com> * Remove corepack prepare to eliminate artifact poisoning: use only corepack enable (bundled yarn) Agent-Logs-Url: https://github.com/open-metadata/OpenMetadata/sessions/90f6ed8d-3f2b-4c3d-9a34-cd1f57c4d89c Co-authored-by: chirag-madlani <12962843+chirag-madlani@users.noreply.github.com> --------- Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>	2026-04-10 16:12:28 +05:30
Sriharsha Chintalapani	b2b49db75e	MSAL Token Renewal Fix — Safari Session Loss (#27214 ) * MSAL Token Renewal Fix — Safari Session Loss * MSAL Token Renewal Fix — Safari Session Loss * MSAL Token Renewal Fix — Safari Session Loss * apply lint * MSAL Token Renewal Fix — OIDC fix * wait for token update * fix unit tests * Add SSO playwright tests * Add tests --------- Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>	2026-04-09 17:45:00 -07:00
Mohit Yadav	3ec31e3e68	Make OpeNMetadata Service Unit Test Required (#27099 )	2026-04-09 15:58:50 -07:00
Suman Maharana	7f1fd1dae2	fix python e2e coverage (#27125 ) * fix e2e coverage * check dbt * fix dbt e2e * fix dbt warnings * Apply suggestion from @Copilot Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-04-08 12:28:26 +05:30
Karan Hotchandani	906d7c4c09	disable trivy scans from PR checks (#27017 ) * disable trivy scans from workflow * update workflows	2026-04-06 15:00:30 +05:30
harshsoni2024	f6599b285d	Fix: Playwright remove all browsers binaries vulnerability (#26959 )	2026-04-06 11:00:29 +05:30
Pere Miquel Brull	ba0b68c9e3	Add missing MCP entity types to EntityLink grammar (#26968 ) * Add missing MCP entity types to EntityLink grammar Add mcpServer and mcpService to ENTITY_TYPE rule in EntityLink.g4, and add mcpExecution to ENTITIES_EXCLUDED_FROM_GRAMMAR (time-series entity, not independently linkable). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Remove unnecessary safe-to-test label check from unit tests workflow The safe-to-test label is only needed for pull_request_target workflows (which run with base branch context and secrets access). This workflow uses plain pull_request, so the label check was causing spurious failures. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-02 14:33:39 +02:00
Aniket Katkar	9d6ecb218a	Chore(UI): Fix the ui-checkstyle workflow for the json changes (#26937 ) * Fix the ui-checkstyle workflow for the json changes [skip ci] * Worked on comments	2026-04-01 19:20:58 +05:30
Aniket Katkar	1f7d030567	Add scripts path to skip file (#26934 )	2026-04-01 18:31:15 +05:30
Aniket Katkar	a106572214	Ignore scripts folder for playwright and sonar check (#26929 )	2026-04-01 17:57:21 +05:30
Aniket Katkar	ff6a8c5104	Chore(UI): Update the ui-checkstyle PR comments (#26872 ) * Update the ui-checkstyle PR comments * Address comments * update PR comments content * worked on comments	2026-03-31 17:50:45 +05:30
harshsoni2024	821b2aa30f	Feat: Add python3.12 ingestion support (#26632 ) * add python3.12 support * time utils fix * pyformat fix * version changes, tests add	2026-03-30 21:57:29 +00:00
miriann-uu	7efbeb555c	GEN-4896: Fix/ghsa head ref (#26861 ) * Fix github.head_ref injection with github.event.pull_request.number * Fix github.head_ref injection with github.event.pull_request.number * Fix github.head_ref injection with github.event.pull_request.number	2026-03-30 10:48:12 -04:00
Suresh Srinivas	3ed06f3a78	Code cleanup based on IDE flagged warnings (#26808 ) * Import cleanup * Remove redundant throw clauses * Unused imports * Remove redundant overrides of method * Fix performance related warnings * Automated code cleanup from IDE * Format code to follow google formatter convention * Simplify checking for empty list * Fix failing tests * Fix broken interface * Address gitar comments * remove unit test coverage report * remove unit test coverage report * fix build --------- Co-authored-by: Sriharsha Chintalapani <harsha@getcollate.io>	2026-03-27 06:17:01 -07:00
Aniket Katkar	d1fb0445fd	Chore(UI): Add pretty and eslint for openmetadata-ui-core-components for code quality (#26686 ) * Add pretty and eslint for openmetadata-ui-core-components for code quality * Fix eslint errors * Address comments * Address comments * update checkstyle workflow to check for core-components * work on comments * change the workflow trigger for testing * Add console log for testing * fix checkstyle yml * Fix checkstyle * Revert the console.log * Revert the trigger changes * Worked on comments * Revert all progress-indicators.tsx changes * Work on comments * Update the checkstyle yml for testing * add console log * update workflow * test ui-checkstyle * update workflow * Add missing id * Remove the console log * Add and use core component nvmrc in checkstyle * move the nvmrc to ui folder * test failure * remove console log * Fix the checkstyle * Add prettier fix * Fix the format * Update workflows and files * Fix playwright checkstyle * Fix playwright changes * Fix scripts * failing commit * Revert "failing commit" This reverts commit `c7ab426142`. * Fix workflow * Work on comments --------- Co-authored-by: Chirag Madlani <12962843+chirag-madlani@users.noreply.github.com>	2026-03-26 12:53:59 +05:30
Ram Narayan Balaji	5da884e6b8	ci: fix Detect Changes job failing on push to main (#26717 ) Add checkout step before dorny/paths-filter@v3 in the changes job. For push events, paths-filter runs git branch --show-current locally which fails without a checkout; pull_request events use the GitHub API and are unaffected. Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-25 10:22:48 +05:30
Copilot	f6c183ed0d	ci: Remove dedicated ingestion shard — unify PostgreSQL E2E sharding to 5-way chromium (#26703 ) * Initial plan * Apply workflow changes: remove ingestion shard, unify args, 5-way chromium sharding Co-authored-by: ShaileshParmar11 <71748675+ShaileshParmar11@users.noreply.github.com> Agent-Logs-Url: https://github.com/open-metadata/OpenMetadata/sessions/8f0cdfed-62e8-4726-9e8b-3feeb7ab0b9f --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: ShaileshParmar11 <71748675+ShaileshParmar11@users.noreply.github.com>	2026-03-24 11:56:04 +05:30
Sriharsha Chintalapani	860c82fab2	Add Unit Tests coverage (#26360 ) * Enable more service unit tests and fix uncovered regressions * Fix remaining broadened unit-suite regressions * Add meaningful Handlebars helper coverage * Add formatter decorator unit coverage * Improve formatter decorator coverage * Improve utility, validator, and formatter coverage * Expand OIDC validator coverage * Tighten shared OIDC validator coverage * Improve user and connection utility coverage * Cover subscription utility workflows * Cover entity field utility workflows * Expand lineage and helper utility coverage * Improve auth code flow handler coverage * Expand auth code flow handler coverage * Cover entity csv parsing flows * Deepen entity csv parser coverage * Fix search builder aggregation null handling * Expand entity utility core coverage * Cover search index utility workflows * Expand search utility coverage * Expand formatter message coverage * Harden notification markdown rendering coverage * Add notification card assembler coverage * Expand EntityCsv coverage and dry-run fixes * Expand K8s pipeline client coverage * Expand saml validator coverage * Expand rdf property mapper coverage * Expand subscription utility coverage * Fix schema field extractor coverage gaps * Expand auth refresh flow coverage * Add service unit test workflow * Enforce new-code coverage on service PRs * Add Unit Test Coverage * Expand k8s pipeline and auth flow coverage * Expand entity csv batch import coverage * Expand entity csv entity creation coverage * Expand entity csv user and flush coverage * Expand entity csv typed import coverage * Cover entity csv dependency validation paths * Expand airflow and csv utility coverage * Replace placeholder authorizer tests with real coverage * Cover PII masking security flows * Tighten async service retry and shutdown coverage * Expand security util claim coverage * Fix checkstyle * Strengthen user bootstrap utility coverage * Expand user activity tracker coverage * Expand ODCS converter coverage * Expand S3 log storage coverage * Expand search repository and lineage coverage * Expand search filter and index factory coverage * Expand reindex handler coverage * Expand inherited field search coverage * Expand search cluster metrics coverage * Expand search repository lifecycle coverage * Expand slack client coverage and stabilize tests * Expand search index executor control flow coverage * Cover search index utility helpers * Cover distributed indexing strategy flows * Strengthen distributed search executor coverage * Cover search reindex pipeline flows * Cover search index logging flows * Cover search index stats tracking * Cover quartz search index progress flows * Cover search index app coordination * Cover slack progress listener behavior * Cover polling job notifier behavior * Cover redis job notifier behavior * Expand Slack notifier coverage * Cover partition worker processing flows * Expand distributed participant coverage * Cover orphan job monitor behavior * Expand distributed stats aggregator coverage * Expand distributed partition coverage * Strengthen distributed coordinator coverage * Expand search index and repository coverage * Expand search executor control flow coverage * Expand search repository delegation coverage * Expand search index executor coverage * Expand search repository helper coverage * Expand search utility coverage * Expand search index executor coverage * Expand search repository coverage * Strengthen search index manager coverage * Strengthen distributed recovery and worker coverage * Strengthen distributed executor coverage * Fix index sink batching and stats coverage * Expand elastic bulk sink behavior coverage * Expand open search bulk sink behavior coverage * Fix dropped bulk processor failure accounting * Cover migration workflow discovery paths * fix java checkstyle * Fix permission debug effect normalization * Cover migration FQN repair workflows * Fix glossary workflow migration idempotency * Cover v1100 migration utility flows * Cover v1104 migration extension flows * Fix and cover v160 migration policy flows * fix java checkstyle * Address PR review comments on vector search and csv docs * fix java checkstyle * Harden service unit test PR workflow * Cover migration utility repair flows * fix java checkstyle * Fix service unit test regressions * Split service new-code coverage check * fix java checkstyle * Fix service diff coverage regressions * fix java checkstyle * Clarify missing JaCoCo artifact failures * fix java checkstyle * Fix bulk sink lifecycle tests * simplify CI * Address PR review feedback after main merge * Fix merged service unit test expectations * Fix search repository bulk update tests * Apply spotless formatting * Use standard exception logging in search repository * Stabilize multi-domain search integration test * Apply spotless formatting * Isolate web analytic event integration timestamps --------- Co-authored-by: Pere Miquel Brull <peremiquelbrull@gmail.com>	2026-03-23 16:17:15 +01:00
IceS2	5d44a496b4	Clean up dead/legacy GitHub Actions workflows (#26634 ) * Remove disabled maven-build and maven-build-skip workflows These workflows have been fully replaced by the integration-tests-* workflows. maven-build.yml was gated by `if: false` and maven-build-skip.yml only existed to satisfy required checks for the disabled workflow. * Remove disabled Maven Postgres test workflows maven-postgres-rdf-tests-build.yml and maven-postgres-tests-build.yml were disabled (if: false / workflow_dispatch-only) and replaced by the integration-tests-* workflows. maven-postgres-tests-build-skip.yml was their required-check placeholder. * Remove placeholder ui-core-components-tests workflow The workflow only echoed "Nothing to test" with no actual test steps. Can be re-added when tests are implemented for the core components library. * Remove inactive claude-code-review workflow PR trigger was commented out, making it dispatch-only and unused. The active claude.yml workflow (triggered by @claude mentions) remains. * Remove legacy Selenium E2E test workflow All E2E tests have migrated to Playwright. This Selenium workflow also had hardcoded sleep instead of health checks and no Docker cleanup step. * Update monitor-slack-link from Python 3.9 (EOL) to 3.11 * Remove experimental py-nox-ci workflow Manual-only experimental workflow for testing Nox as a Python CI replacement. No longer in use — existing py-tests workflows handle Python CI. * Revert "Update monitor-slack-link from Python 3.9 (EOL) to 3.11" This reverts commit `ea9fa04e9d`. * Remove phylum and issues-notion-sync workflows Phylum dependency analysis and Notion issue sync are no longer in use.	2026-03-23 08:30:01 +01:00
Suman Maharana	108cfe7897	chore(ci): enhance Python E2E and SonarCloud workflows with unit and and integration tests (#26481 ) * chore(ci): enhance Python E2E and SonarCloud workflows with unit and integration tests * seperate the unit and integration test * address commensts * Potential fix for pull request finding Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * Potential fix for pull request finding Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * Potential fix for pull request finding Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * Potential fix for pull request finding Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * Potential fix for pull request finding Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * Potential fix for pull request finding Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * Potential fix for pull request finding Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * address comments --------- Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> Co-authored-by: ulixius9 <mayursingal9@gmail.com>	2026-03-23 10:47:16 +05:30
Teddy	d03373c5e2	Stale pr workflow (#26667 ) * stale pr automation * chore: auto close stale pr * chore: auto close stale pr * Update .github/workflows/stale.yml Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update .github/workflows/stale.yml Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update .github/workflows/stale.yml Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update .github/workflows/stale.yml Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-03-20 14:18:33 -07:00
Chirag Madlani	0996e716bb	ci: ui checkstyle workflow in favour to remove pre-commit (#26445 ) * ci: ui checkstyle workflow in favour to remove pre-commit * update workflow * update workflow * testing workflow with manual run * test wokflow * update * bump yarn.lock * add install antlr * fix organize-imports script * fix lint issues * revert authorize changes * Potential fix for pull request finding Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * ci: fix UI checkstyle workflow - workflow_dispatch support, step isolation, and pre-commit cleanup (#26474) * Initial plan * fix: address review comments - workflow dispatch, step isolation, pre-commit, ja-jp locale Co-authored-by: chirag-madlani <12962843+chirag-madlani@users.noreply.github.com> * remove pre-commit as not needed --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: chirag-madlani <12962843+chirag-madlani@users.noreply.github.com> * update workflow * test workflow * checkout pr * fix workflow * update * add debug logs * apply prettier * update workflow * update script * fix workflow * Fix: Remove all `any` types from UI core component theme and utility files (#26534) * Initial plan * Fix: Remove all any types from UI core component theme and utility files Co-authored-by: karanh37 <33024356+karanh37@users.noreply.github.com> * Changes before error encountered Co-authored-by: karanh37 <33024356+karanh37@users.noreply.github.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: karanh37 <33024356+karanh37@users.noreply.github.com> * seprate playwright lint * apply prettier to playwright * fix tsc step * update * skip tsc check for now * update lint:playwright * remove import-cli from playwright check * address comments * fix lint issues * fix lint * fix lint * update pull_request to pull_request target * remove pre-commit completly --------- Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com> Co-authored-by: karanh37 <33024356+karanh37@users.noreply.github.com>	2026-03-20 19:02:12 +05:30
IceS2	3c89f26422	Move path filtering from trigger level to job level for integration test workflows (#26631 ) Replace `paths:` on `pull_request_target` with a `changes` detection job using dorny/paths-filter. This prevents required checks from getting stuck as pending when PRs modify files outside the monitored paths. A job skipped by its `if` condition reports as "Success", so branch protection still works.	2026-03-20 09:18:51 +01:00
Vishnu Jain	3aed159069	Fix MCP tools entity status (#26567 ) * fix MCP tools entity status * add unit tests for MCP tools and CI path trigger	2026-03-18 11:04:00 +01:00
IceS2	e6bfa1f567	fix(ci): replace py-tests skip workflow with job-level path filtering and gate jobs (#26556 ) * fix(ci): replace py-tests skip workflow with job-level path filtering and gate jobs Replace the dual-workflow (real + skip) pattern with a single-workflow approach using dorny/paths-filter for change detection and job-level `if` conditions. A job skipped by `if` reports as "Success" for required checks, eliminating the need for companion skip workflows. Add inverse-gate status jobs (`py-tests-status`, `py-tests-postgres-status`) that only run on failure/cancellation. These are the only jobs that need to be set as required checks in branch protection — one per workflow instead of one per matrix expansion. How the gate works: - All tests pass or skipped → gate is skipped → reports "Success" - Any test fails → gate runs → exits 1 → blocks merge Changes: - py-tests.yml: remove `paths:` filter, add `changes` detection job, gate test jobs on its output, add `py-tests-status` gate job - py-tests-postgres.yml: same approach, add `py-tests-postgres-status` - Delete py-tests-skip.yml (no longer needed) * fix(ci): rename postgres gate job to py-tests-status for consistency The workflow name already provides the context (py-tests-postgres), so the gate job should just be py-tests-status like in the mysql workflow.	2026-03-17 17:33:15 +01:00
IceS2	d259d032f8	MINOR: Port py-tests improvements to py-tests-postgres workflow (#26517 ) Apply the same structural improvements from the py-tests workflow: - Add integration test sharding (shard-1/shard-2) for parallelism - Replace `make run_python_tests` with nox integration-tests session - Add explicit timeout-minutes (180) and descriptive job name - Remove unnecessary fetch-depth: 0 from checkout - Normalize indentation to 2-space with proper YAML style	2026-03-16 14:50:37 +01:00
Sriharsha Chintalapani	ba3a66038c	Add eslint-plugin-playwright enforcement with CI check (#26494 ) * Add eslint-plugin-playwright enforcement with CI check Add eslint-plugin-playwright to catch common Playwright anti-patterns automatically. 13 rules configured in two tiers: - Error (blocks CI): no-networkidle, no-page-pause, no-focused-test - Warn (tracks debt): missing-playwright-await, no-wait-for-timeout, no-force-option, no-element-handle, no-eval, no-skipped-test, prefer-web-first-assertions, no-useless-await, no-wait-for-selector, valid-expect Changes: - Install eslint-plugin-playwright, configure rules in eslint.config.mjs - Add yarn lint:playwright script - Repurpose ui-checkstyle.yml workflow to run Playwright lint on PRs - Fix last networkidle usage in ClassificationConditionalRendering - Remove stale eslint-disable comments for undefined rules - Update PLAYWRIGHT_DEVELOPER_HANDBOOK with ESLint Enforcement section - Update playwright, writing-playwright-tests, and playwright-validation skills to reference lint check Current: 0 errors, 1657 warnings (CI passes cleanly) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix CodeQL cache poisoning alert: use pull_request instead of pull_request_target Switch from pull_request_target to pull_request since this workflow only runs yarn lint:playwright — no secrets or write permissions needed. This eliminates the untrusted code execution risk flagged by CodeQL. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix all 1657 Playwright ESLint warnings across 190 files Resolve every playwright lint rule violation to enforce test quality: - no-wait-for-selector (1290): replace page.waitForSelector() with locator.waitFor() - no-wait-for-timeout (130): replace hardcoded waits with event-driven alternatives - prefer-web-first-assertions (61): use toHaveText/toBeVisible/toHaveValue - no-force-option (51): remove { force: true } bypassing actionability checks - missing-playwright-await (40): add await to fire-and-forget assertions - no-skipped-test (28): acknowledge skipped tests with eslint-disable reasons - no-eval (11): replace page.$eval with locator APIs - no-element-handle (10): replace page.$() with page.locator() - no-useless-await (8): remove await from synchronous operations - valid-expect (4): add matchers to bare expect() calls - no-networkidle (2): replace networkidle with domcontentloaded Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Remove unnecessary ANTLR install and add --frozen-lockfile to CI - Remove Install Antlr4 CLI step from playwright-lint job (not needed for ESLint, saves 10-30s and avoids external network dependency) - Add --frozen-lockfile to yarn install for reproducible CI builds Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Restore ANTLR CLI install — required by yarn postinstall script yarn install triggers build-check → js-antlr which needs antlr4 CLI. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix trivially-true assertion in special character search test Remove `(await tableRows.count()) >= 0` which is always true since count() never returns negative. The assertion now properly validates that either the table or empty state is visible after searching. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix strict mode violations: add .first() to multi-element waitFor calls The no-wait-for-selector conversion (waitForSelector → locator.waitFor) introduced strict mode violations. Playwright's Locator API throws when waitFor() matches multiple elements, unlike the old waitForSelector API. Key patterns fixed: - getByTestId('loader').waitFor() — multiple loaders on page - getByTestId('select-owner-tabs').getByTestId('loader') — tab loaders - locator('.ant-skeleton-active/content').waitFor() — multiple skeletons - locator('table/thead th').waitFor() — multiple tables - getByTestId('side-panel-classification') — multiple panels - locator('.ant-select-dropdown:visible') — multiple dropdowns - locator('.ant-popover').waitFor() — multiple popovers Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix strict mode violation in incidentManager assignee/owner-link The getByTestId('assignee').getByTestId('owner-link') locator resolves to 15 elements in the incident manager table (one per row). Adding .first() matches the original waitForSelector semantics. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix test failures from lint conversions: proper waits and retry logic - GlossaryPermissions: add waitForAllLoadersToDisappear after page nav - GlossaryP3Tests: restore waits for special char search + error state - GlossaryStatusFilterLargeDataset: restore waitForTimeout for filter state settling (no reliable DOM element to wait for) - AutoPilot: use waitForAllLoadersToDisappear instead of .first() to ensure ALL loaders are gone, increase banner timeout to 60s - importUtils: use expect().toHaveCount(0) for scoped multi-element loader/skeleton waits instead of waitForSelector or .first() - ColumnBulkOperations: increase response timeout, add retry polling for empty state verification - DomainDataProductsWidgets: add poll-based wait for DOM update after asset removal instead of removed waitForTimeout - customizeLandingPage: scope widget click to dialog to avoid matching stale elements on the page behind the modal Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Restore force:true where genuinely needed with eslint-disable comments The no-force-option ESLint rule removal broke tests where force was required for Ant Select comboboxes (selected item overlay covers input), popover triggers (partially obstructed by animation), data grid buttons (covered by overlay), and drag-and-drop (row hover overlays). Each restoration includes an eslint-disable-line comment explaining why force is necessary. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Stabilize 4 flaky tests with retry wrappers and proper waits - DataContractsSemanticRules: Wrap reload + status assertions in expect().toPass() to handle async backend contract validation - DataContracts: Increase timeout on dynamically rendered row filter and column text assertions that contain UUID-suffixed names - Customproperties-part2: Wrap user search-and-select loop in expect().toPass() retry to handle search dropdown rendering races - UserProfileOnlineStatus: Add profile render wait, navigation completion guard, and increased badge visibility timeout Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Replace .first().waitFor loader pattern with waitForAllLoadersToDisappear The pattern `page.getByTestId('loader').first().waitFor({ state: 'detached' })` only waits for the FIRST loader element to detach. When multiple loaders exist (nested components, tabs, popovers), this causes tests to proceed before the page is fully loaded — the root cause of ~18 flaky tests. Replaced ~500 occurrences across 103 files with the correct `waitForAllLoadersToDisappear(page)` which uses `expect(loaders).toHaveCount(0)` to wait for ALL loaders to disappear. Scoped loader waits (e.g., within select-owner-tabs, test-case-container, tags-container) are intentionally preserved since they correctly target a specific container's single loader. Also fixes UserProfileOnlineStatus.spec.ts: - Added createOrFetchUser helper for idempotent user creation on retries - Added afterAll cleanup for test users - Replaced waitForURL with redirectToHomePage for navigation Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Karan Hotchandani <33024356+karanh37@users.noreply.github.com>	2026-03-16 18:12:12 +05:30
Sriharsha Chintalapani	12b364313c	Fix Metrics collection; reduce no.of metrics; improve slow request lo… (#25751 ) * Fix Metrics collection; reduce no.of metrics; improve slow request logging * Move sync calls to search & rdf to async * Improve slow request tracking * Improve slow request tracking * Add clear breakdown in slow request * Batch TestCaseRepository calls * Batch API calls * Initial Implementation of ReadEngine * Improvements with ReadEngine/WriteEngine * Improvements with ReadEngine/WriteEngine * Improvements with ReadEngine/WriteEngine * Improve by removing unnecessary ser/de * Additional improvements with PatchFieldsPlanner * Further performance improvements * Further performance improvements * Address comments * Merge from main * Address comments * Address comments * Address latest feedback - 2/21 * fix merge conflict * Address Slow Request review * Address the comments * Address comments; Fix tests * Fixes to the failing tests * Fix bugs in tests * Fix checkstyle * Address playwright tests * Fix tests * Fix bugs * Fix tests * address comments * Fix issues from playwright * Fix playwright tests * Fix tests for playwright * Address comments * Fix glossary test * fix checkstyle * Fix playwright issues * Fix playwright issues - incrementalChagneDesc * Restore ApprovalTaskWorkflow in GlossaryTerm and TestCase repositories The slow_request branch accidentally removed entity-specific ApprovalTaskWorkflow overrides, causing the generic parent to use checkUpdatedByTaskAssignee instead of checkUpdatedByReviewer. This broke Glossary approval and TestCase approval Playwright tests. - GlossaryTermRepository: restore ApprovalTaskWorkflow with checkUpdatedByReviewer - TestCaseRepository: restore ApprovalTaskWorkflow, preDelete guard, updateReviewers Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix base ApprovalTaskWorkflow to use reviewer check instead of task assignee The centralized ApprovalTaskWorkflow in EntityRepository was using checkUpdatedByTaskAssignee instead of checkUpdatedByReviewer, breaking approval workflows for all entity types. Added verifyReviewer() as a top-level static method on EntityRepository and restored missing updateReviewers() and preDelete IN_REVIEW guards in DataContract, DataProduct, Metric, and Tag repositories. Removed now-redundant entity-specific ApprovalTaskWorkflow overrides from GlossaryTerm and TestCase repositories. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix regression introduced in backend tests; make the playwright tests stable * Stabilize the playwright tests * Stabilize the playwright tests * Improve playwright tests * Improve playwright tests * Fix team playwrights * Fix merge from main * Fix playwrigt tests * Fix playwright tests * Batch domain/data product asset counts into single ES aggregation queries Replace N individual ES count queries with single aggregation query per entity type. Domain counts roll up child counts to parent domains. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Improve Playwright test reliability and expand CI shards Add polling waits for async ES indexing, fix lineage edge selectors, use API-based setup for domain/data product widget tests, and expand CI from 6 to 8 shards with dedicated graph/landing projects. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Playwright: Improve test reliability with response checks and guards - Add API response status checks in create() for Domain, DataProduct, Glossary, TableClass, and UserClass — silent API failures now throw immediately with status code and response body - Add guards in selectDataProduct() and addAssetsToDataProduct() for undefined name/fqn — clear error messages instead of cryptic "locator.fill: value: expected string, got undefined" - Fix GlossaryPermissions double navigation — remove redundant redirectToHomePage + sidebarClick before glossary.visitEntityPage() - Increase OnlineUsers timeout from 5s to 15s for CI resource pressure - Increase Tour badge timeout from 10s to 20s - Fix visitGlossaryPage: wait for loader before clicking menuitem - Remove chromium testIgnore for graph/landing/stateful test files (these must run in chromium project for 6-shard CI workflow) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Playwright: Remove all networkidle waits and improve CI reliability - Remove ~780 networkidle waits across 144 test/utility files — these hang or resolve prematurely under CI load causing false negatives - Add polling.ts with waitForSearchIndexed and waitForPageLoaded helpers - Convert checkAssetsCount and search functions to expect.poll() for async ES indexing tolerance - Increase expect timeout to 15s for CI environments - Split CI into 8 shards with dedicated projects (stateful/graph/landing) to reduce thread contention - Fix GITHUB_STEP_SUMMARY size overflow (base64 screenshots → table) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Playwright: Fix genuine test failures from networkidle removal - GlossaryPagination: Fix waitForResponse race conditions - register listener BEFORE the triggering action, add */ URL prefix - LanguageOverride: Fix selector from getByText('EN') to getByText('English - EN') matching actual dropdown text - NestedColumnsExpandCollapse: Fix URL glob pattern, use dispatchEvent to avoid inner Link navigation, add waitForResponse for filtered search - lineage.ts: Revert dragConnection hover approach that broke React Flow connection mode, keep direct dispatchEvent - customizeLandingPage.ts: Remove waitForURL that hangs after page.goto - Teams.spec.ts: Add isJoinable: false for private team creation - UserDetails.spec.ts: Revert Escape/clickOutside save flow that dismissed edit mode before saving roles - Users.spec.ts: Revert Data Consumer permissions test to original simple approach using fixtures Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Playwright: Relax OnlineUsers activity time assertion The "Online now" exact match fails under CI load because the activity timestamp may show as "X seconds ago" or "X minutes ago" by the time the page renders. Changed to accept any recent activity format. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Playwright: Fix 4 genuine test failures from CI run 1. saveCustomizeLayoutPage: Use response predicate matching both POST (create) and PUT (update) patterns instead of glob that only matched updates. Fixes 180s timeout in drag-and-drop test when layout doesn't exist yet (fullyParallel=true). 2. GlossaryMiscOperations: Add test.slow(true) — test does 9 sequential page navigations that exceed the 60s timeout. 3. DomainDataProductsWidgets "Assign Widgets": Add test.slow(true) — calls addAndVerifyWidget twice, each with multiple navigations. 4. DomainFilterQueryFilter: Add waitForAllLoadersToDisappear before clicking domain-dropdown after search operations that trigger page re-renders. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Playwright: Fix AutoPilot test — reload page after API status poll The AutoPilot status banner never appeared because: 1. checkAutoPilotStatus polls the workflow API directly via apiContext (outside the browser), not through page network requests 2. The UI uses WebSocket for live updates, but the socket connection is only established when the page loads with status=RUNNING 3. Since the page loaded before the workflow started, the socket was never connected, so the UI never received the completion event Fix: reload the page after checkAutoPilotStatus confirms the workflow finished, so the UI renders with the current state. Also increase the banner visibility timeout to 30s for CI environments. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Playwright: Fix flaky tests — entity collisions, missing cleanup, expect timeout - Replace Date.now() with uuid() for entity names in CustomProperties tests to prevent collisions when parallel workers execute within the same millisecond - Fix FollowingWidget: move shared adminUser create/delete to top-level base.beforeAll/afterAll to prevent duplicate user creation across 11 parallel test.describe blocks - Add missing afterAll cleanup to OnlineUsers, Metric, CustomPropertyAdvanceSearch, and CustomProperties tests to prevent entity/user leaks between runs - Replace hardcoded metric name in MetricSearch with uuid-based name - Add global expect timeout of 15s (up from 5s default) for CI resilience Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix Playwright CI: include UI in build-once Maven build The build-once optimization (#26423) used -DonlyBackend -pl !openmetadata-ui which produces a tar.gz without the compiled React app. The Docker container starts but cannot serve the login page, causing auth.setup.ts to timeout on all 6 shards waiting for input[id="email"] to appear. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix CodeQL security warnings - Replace Math.random() with crypto.randomUUID() for test data generation - Escape backslash characters in CSS selectors for glossary FQN values - Use page.getByTestId() instead of raw CSS selectors in entity utils - Increase RSA key size from 512 to 2048 bits in JwtFilterTest - Skip archive entries containing '..' in JsonUtils.getResourcesFromJarFile Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Playwright: Fix user cleanup to prevent 'Email Already Exists' failures - Glossary.spec.ts: Fix typo user3.create→delete in afterAll, add missing adminUser.delete - Teams.spec.ts: Add afterAll cleanup hooks for 3 nested describe blocks that were missing them (EditUser, DataConsumer, Owner) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Playwright: Add afterAll cleanup hooks and fix test reliability - InputOutputPorts.spec.ts: Add afterAll for domain/tables/topics/dashboards - Users.spec.ts: Add top-level afterAll for all shared entities - Entity.spec.ts: Add afterAll for shared + per-entity-type cleanup - Pagination.spec.ts: Add afterAll for 13 describe blocks (services, DBs, etc.) - DataProductRename.spec.ts: Add afterAll cleanup - TestCaseIncidentPermissions.spec.ts: Add afterAll for users/roles/policies/table - ImpactAnalysis.spec.ts: Add afterAll for all 7 entity types - NestedColumnsExpandCollapse.spec.ts: Add afterAll for 4 describe blocks - DataProductPermissions.spec.ts: Add afterAll cleanup - ServiceEntityPermissions.spec.ts: Add afterAll for testUser + per-entity - ServiceForm.spec.ts: Add afterAll for adminUser - domain.ts: Replace waitForTimeout(2000) with proper loader/tab waits Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Trigger Playwright CI * Playwright: Fix 2 failures and 26 flaky tests with proper waits Fix remaining 2 genuine failures: - DomainDataProductsWidgets: add test.slow(true) for ES indexing lag - Users.spec.ts: add test.slow(true) and loader waits for owner search Fix 26 flaky tests by addressing 5 root cause patterns: - Response listener after trigger: MetricCustomUnitFlow, DomainUIInteractions - Missing loader wait after navigation: 16 tests across CustomizeDetailPage, DataProductPersonaCustomization, DataContracts, ExploreTree, and others - Element not rendered after API response: EntityVersionPages, ODCSImportExport - DOM not settled after loader: Domains nested rename - Permission cache propagation: GlossaryPermissions Shared utility improvements: - waitForPatchResponse uses entity-specific URL pattern - openColumnDetailPanel accepts entityEndpoint param with API response wait - Entity.spec.ts uses dynamic entity.endpoint instead of hardcoded tables Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Playwright: Fix addOwner retry to wait for search API response The owner search retry loop was refilling the search input but not waiting for the API response before checking item visibility. This caused the poll to repeatedly check stale/empty results. Fix: await search response and loader detach in each retry iteration. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Playwright: Fix owner listitem selector — remove exact match The owner selection list items include avatar initials (e.g., "G") in their accessible name, making exact: true fail since the accessible name is "G UserName" not just "UserName". Switching to substring matching fixes the Users.spec.ts persistent failure. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Playwright: Fix 10 remaining flaky tests with proper waits - ColumnLevelTests: loader wait after visiting test case panel - DataQualityPermissions: loader wait after visiting test suite page - IncidentManagerDateFilter: loader wait after page reload - InputOutputPorts: wait for warning alert before asserting - Lineage: replace 5 hardcoded waitForTimeout(500) with loader waits - CustomizeDetailPage: dialog close waits, fix missing await on expect - DataProductPersonaCustomization: loader wait + modal visibility check - GlossaryPermissions: increase permission propagation wait, loader wait - GlossaryHierarchy: loader waits after modal close and glossary select - ExploreTree: loader waits after API response before UI interaction Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix CodeQL security alerts: incomplete escaping and Zip Slip 1. entity.ts: Use JSON.stringify().slice(1,-1) for proper escaping of both backslashes and double quotes in filter values, replacing the incomplete .replace(/"/g, '\\"') approach. 2. JsonUtils.java: Strengthen Zip Slip protection by normalizing paths via Paths.get().normalize() and rejecting entries starting with "/" or resolving to parent traversal after normalization. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix tests * Fix tests * Fix recordChange field name mismatches and CodeQL alert - ServiceEntityRepository: recordChange("ingestionAgent") → "ingestionRunner" to match the JSON property name. The shouldCompare() gate in PATCH flow was silently dropping ingestionRunner changes because the field name didn't match patchedFields. - DataContractRepository: compareAndUpdate("status") → "entityStatus" to match the JSON property name, same root cause. - JsonUtils: Simplify Zip Slip check to string-based validation to satisfy CodeQL taint analysis. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Remove serial mode from Users.spec.ts to prevent cascade failures A single flaky test failure was causing ~19 tests across 5 unrelated describe blocks to be skipped. Matches main branch behavior (parallel). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Playwright: Fix flaky tests — missing awaits, hardcoded waits, silent catches - DataProductPersonaCustomization: add missing await on expect() calls - TestCaseIncidentPermissions: poll for incident creation instead of one-shot query - TestCaseResultPermissions: add loader wait after Data Quality tab click - GlossaryPermissions: replace waitForTimeout(3000) with toPass() retry - BulkImport: remove 4 unnecessary waitForTimeout calls - importUtils/testCases: replace waitForTimeout(500) with grid visibility assert - GlossaryAssets: add loader wait, remove silent .catch(() => false) pattern Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix CodeQL Zip Slip alert with Path.normalize() sanitization CodeQL doesn't recognize String.contains("..") as proper Zip Slip mitigation. Use Path.normalize() + isAbsolute/startsWith checks which CodeQL's taint analysis model understands. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix Playwright flaky tests: modal visibility, toast race, query card assertion - DataProductPersonaCustomization: wait for dialog close before clicking add-widget-button - entity.ts restoreEntity: dismiss stale toast before restore to avoid race condition - QueryEntity: replace page.$$() with auto-retrying expect().toBeVisible() Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Fix flaky TableResourceIT by preventing parallel multi-domain rule mutation Both test_multipleDomainInheritance (TableResourceIT) and test_csvImportEntityRuleValidation (DatabaseServiceResourceIT) toggle the global "Multiple Domains are not allowed" rule. When running concurrently, one overwrites the other's setting causing spurious failures. Add @ResourceLock("MULTI_DOMAIN_RULE") to serialize only these two tests while keeping all others concurrent. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Mohit Yadav <105265192+mohityadav766@users.noreply.github.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-13 13:38:31 -07:00
Aniket Katkar	d4a6bdb26d	Update the typescript type check workflow to work on forked PRs (#26455 ) * Update the typescript type check workflow to work on forked PRs * update trigger workflow settings to run on PRs * Worked on comments * Fix workflow stuck in pending after bot commit * Update workflow to comment on PRs even if its triggered from dispatch * Work on comments	2026-03-13 11:47:32 +05:30
Aniket Katkar	9921b2678b	Update the typescript type generation workflow inputs to run on demand on a PR (#26456 )	2026-03-13 08:46:34 +05:30
Sriharsha Chintalapani	1ef0463b70	Fix Playwright CI build: install antlr4 CLI before Maven build (#26446 ) The UI module's yarn preinstall runs js-antlr which requires the antlr4 CLI. This matches the pattern used in docker-openmetadata-server.yml. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 08:42:50 -07:00
Sriharsha Chintalapani	debbe52a6b	Fix Playwright CI: include UI module in build-once Maven build (#26444 ) The build-once optimization (#26423) used `-DonlyBackend -pl !openmetadata-ui` which produces a tar.gz without the compiled React app. The Docker container starts but cannot serve the login page, causing auth.setup.ts to timeout on all 6 shards (input[id="email"] never appears). The fix removes the backend-only flags so the full distribution including the UI is built once and shared across shards. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 08:29:47 -07:00
Sriharsha Chintalapani	d1e40cd3c3	Build Maven once and share across Playwright shards (#26423 ) Previously each of the 6 Playwright shards independently ran the full Maven build (~10-15 min each), wasting ~50 min of compute per CI run. Extract Maven build into a dedicated `build` job that runs once: - Builds with `mvn -DskipTests -DonlyBackend clean package` - Uploads openmetadata-*.tar.gz as a GitHub Actions artifact - Moves label check here to gate all downstream work Shard jobs now: - Download the pre-built artifact - Pass `-s true` to run_local_docker.sh to skip Maven - No longer need Maven cache Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 07:26:14 -07:00
Sriharsha Chintalapani	08142790f3	Revert "Improve Playwright CI: reduce shard contention and fix STEP_SUMMARY size (#26417 )" (#26421 ) Reverts the 8-shard configuration back to 6 shards. The dedicated stateful/graph/landing shards increased infrastructure failure surface without reducing test flakiness. This restores the simpler 6-shard layout where shards 3-6 split chromium tests evenly. Keeps the consolidated PR comment summary from #26418. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 22:32:30 -07:00
Sriharsha Chintalapani	b6ffccd48a	Consolidate Playwright results into a single PR comment (#26418 ) Replace per-shard PR comments with a single consolidated comment posted by a new playwright-summary job that runs after all shards complete. Shows per-shard breakdown table, aggregated failures, and flaky tests in one place. Cleans up old per-shard comments from previous runs. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 19:24:20 -07:00

1 2 3 4 5 ...

679 commits