Flaky Test: Pod groups when Single CQ should allow to preempt the lower priority group #4434

tenzen-y · 2025-02-27T17:34:55Z

What happened:
The below E2E test failed.

End To End Suite: kindest/node:v1.30.0: [It] Pod groups when Single CQ should allow to preempt the lower priority group

{Timed out after 45.001s.
The function passed to Eventually failed at /home/prow/go/src/kubernetes-sigs/kueue/test/e2e/singlecluster/pod_test.go:483 with:
Expected
    <v1.PodPhase>: Succeeded
to equal
    <v1.PodPhase>: Failed failed [FAILED] Timed out after 45.001s.
The function passed to Eventually failed at /home/prow/go/src/kubernetes-sigs/kueue/test/e2e/singlecluster/pod_test.go:483 with:
Expected
    <v1.PodPhase>: Succeeded
to equal
    <v1.PodPhase>: Failed
In [It] at: /home/prow/go/src/kubernetes-sigs/kueue/test/e2e/singlecluster/pod_test.go:485 @ 02/27/25 01:48:44.842
}

What you expected to happen:
No errors.

How to reproduce it (as minimally and precisely as possible):
https://prow.k8s.io/view/gs/kubernetes-ci-logs/logs/periodic-kueue-test-e2e-release-0-10-1-30/1894925220897099776

Anything else we need to know?:

Environment:

Kubernetes version (use kubectl version):
Kueue version (use git describe --tags --dirty --always):
Cloud provider or hardware configuration:
OS (e.g: cat /etc/os-release):
Kernel (e.g. uname -a):
Install tools:
Others:

The text was updated successfully, but these errors were encountered:

tenzen-y · 2025-02-27T17:35:06Z

/kind flake

mimowo · 2025-02-27T17:43:06Z

/assign @mszadkow
I believe this is after the recent changes, as we use BehaviorExitFast. The Pod succeeds if it has enough time to complete, it fails if the Delete request is faster. I think we should use WaitForDeletion, and just let the pod to be deleted and failed. We may just need to use Pod's spec.terminationgraceperiodseconds=1 to make it fast. ~~Alternatively trigger /exit 1 instead of exit 1 to let it fail.~~ - this will not work becuase the Pod is deleted due to preemption.

mszadkow · 2025-02-28T09:49:27Z

Got it, will try with suggested solution.

tenzen-y added the kind/bug Categorizes issue or PR as related to a bug. label Feb 27, 2025

k8s-ci-robot added the kind/flake Categorizes issue or PR as related to a flaky test. label Feb 27, 2025

k8s-ci-robot assigned mszadkow Feb 27, 2025

mszadkow linked a pull request Feb 28, 2025 that will close this issue

[Flake] Change image behavior of high-priority-group pod #4438

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flaky Test: Pod groups when Single CQ should allow to preempt the lower priority group #4434

Flaky Test: Pod groups when Single CQ should allow to preempt the lower priority group #4434

tenzen-y commented Feb 27, 2025

tenzen-y commented Feb 27, 2025

mimowo commented Feb 27, 2025 •

edited

Loading

mszadkow commented Feb 28, 2025

Flaky Test: Pod groups when Single CQ should allow to preempt the lower priority group #4434

Flaky Test: Pod groups when Single CQ should allow to preempt the lower priority group #4434

Comments

tenzen-y commented Feb 27, 2025

tenzen-y commented Feb 27, 2025

mimowo commented Feb 27, 2025 • edited Loading

mszadkow commented Feb 28, 2025

mimowo commented Feb 27, 2025 •

edited

Loading