Skip to content

Releases: kubernetes-sigs/jobset

v0.8.0

28 Feb 20:48
v0.8.0
ce55237
Compare
Choose a tag to compare

Release v0.8.0

Highlights

Deprecations

  • #721 - Drop kube-rbac-proxy from deployment
  • #752 - Remove namespace option from component config

Changelog

  • Doc updates for 0.7 [main branch] by @danielvegamyhre in #692
  • Bump the kubernetes group with 7 updates by @dependabot in #693
  • Image support multi-arch by @phuhung273 in #694
  • Use meta api native condition status by @tenzen-y in #695
  • Introduce Go std slices and maps lib by @tenzen-y in #696
  • Remove duplicated condition judgement by @tenzen-y in #697
  • Bump github.com/onsi/gomega from 1.34.2 to 1.35.1 by @dependabot in #700
  • Bump github.com/onsi/ginkgo/v2 from 2.20.2 to 2.21.0 by @dependabot in #699
  • Add Coordinator concept by @avrittrohwer in #702
  • Bump github.com/open-policy-agent/cert-controller from 0.11.0 to 0.12.0 by @dependabot in #704
  • Propagate schedulingGates set on PodTemplate when resuming JobSet by @mimowo in #705
  • Update docs for release v0.7.1 by @ahg-g in #712
  • Bump sigs.k8s.io/structured-merge-diff/v4 from 4.4.1 to 4.4.3 in the kubernetes group across 1 directory by @dependabot in #710
  • Bump the kubernetes group with 7 updates by @dependabot in #714
  • Bump github.com/onsi/ginkgo/v2 from 2.21.0 to 2.22.0 by @dependabot in #716
  • fix wget failure to resolve relative paths by @kannon92 in #718
  • Bump github.com/stretchr/testify from 1.9.0 to 1.10.0 by @dependabot in #715
  • Bump github.com/onsi/gomega from 1.35.1 to 1.36.0 by @dependabot in #717
  • Allow for one to install jobset in a different namespace by @kannon92 in #719
  • Bump sigs.k8s.io/controller-runtime from 0.19.2 to 0.19.3 in the kubernetes group by @dependabot in #725
  • Remove kube-rbac-proxy by @kannon92 in #722
  • use component config for default installation by @kannon92 in #724
  • Update documents to point to v0.7.2 by @ahg-g in #732
  • fix docker warnings by @kannon92 in #727
  • upgrade python dependencies for sdk by @kannon92 in #728
  • KEP-672: Serial Job Execution with DependsOn API by @andreyvelich in #680
  • Bump the kubernetes group with 7 updates by @dependabot in #733
  • Bump github.com/onsi/gomega from 1.36.0 to 1.36.1 by @dependabot in #734
  • update golang net to 0.33 to satisfy security alerts by @kannon92 in #735
  • Bump github.com/onsi/ginkgo/v2 from 2.22.0 to 2.22.1 by @dependabot in #739
  • Bump github.com/onsi/gomega from 1.36.1 to 1.36.2 by @dependabot in #741
  • Bump github.com/onsi/ginkgo/v2 from 2.22.1 to 2.22.2 by @dependabot in #742
  • Bump braces from 3.0.2 to 3.0.3 in /site by @kannon92 in #744
  • update k8s to 0.32 apis by @kannon92 in #738
  • Propagate job labels and annotations by @imreddy13 in #737
  • disable http/2 for metrics server by @kannon92 in #745
  • Minimize the number of unnecessary logs that get emitted by @imreddy13 in #746
  • inject namespace in case we want to test against non standard deployment by @kannon92 in #749
  • fix security warnings in client go code by @kannon92 in #743
  • Bump sigs.k8s.io/controller-runtime from 0.19.3 to 0.19.4 in the kubernetes group by @dependabot in #750
  • Remove Namespace from the JobSet Config by @andreyvelich in #752
  • Turn off internal cert management via config by @ardaguclu in #755
  • update gen-sdk.sh to generate sdk using docker container by @epicseven-cup in #681
  • Use config metrics binding address if flag is not set by @ardaguclu in #756
  • Bump the kubernetes group with 7 updates by @dependabot in #759
  • Self nominate Kevin Hannon for approval rights by @kannon92 in #758
  • copy all files in project rather than piece by piece by @kannon92 in #765
  • add go mod download to Dockerfile by @kannon92 in #769
  • update python sdk files with latest changes by @kannon92 in #770
  • Bump sigs.k8s.io/controller-runtime from 0.20.0 to 0.20.1 in the kubernetes group by @dependabot in #772
  • KEP-672: Implement the DependsOn API by @andreyvelich in #740
  • Set user agent for requests coming from the jobset controller to "jobset" by @imreddy13 in #775
  • Updating the documentation for Pod DNS and underlying headless service. by @raushan2016 in #779
  • Remove the Configuration API as a CRD by @ahg-g in #781
  • Set image tag and commit version at build time by @ahg-g in #780
  • Increase memory limit and remove cpu limit for the default deployment by @priyanshikhetwani in #783
  • Add missing external types to apply configurations by @astefanutti in #782
  • Bump the kubernetes group with 7 updates by @dependabot in #784
  • feature: add Helm chart for jobset by @ChenYi015 in #785
  • helm: disable Promethues metrics exporting by default by @ChenYi015 in #789
  • Bump github.com/google/go-cmp from 0.6.0 to 0.7.0 by @dependabot in #794
  • add make file targets for helm by @kannon92 in #792
  • Bump github.com/prometheus/client_golang from 1.20.5 to 1.21.0 by @dependabot in #795
  • Fix helm chart push but disable it from cloud build for testing by @kannon92 in #798
  • enable helm chart push for cloudbuild for postsubmit by @kannon92 in #799
  • add gotoolchain and update cloudbuild name by @kannon92 in #800

New Contributors

Full Changelog: v0.7.0...v0.8.0

v0.7.3

24 Jan 18:48
v0.7.3
152f624
Compare
Choose a tag to compare

What's Changed

  • [Release 0.7]: cherry-pick allow for one to install jobset in a different namespace by @kannon92 in #751
  • [release-0.7]: cherry-pick turn off internal cert management via config by @kannon92 in #757

Full Changelog: v0.7.2...v0.7.3

Release v0.7.2

10 Dec 21:36
v0.7.2
9cb030b
Compare
Choose a tag to compare

What's Changed

  • Update docs for v0.7.0 (release branch) by @danielvegamyhre in #691
  • Automated cherry pick of #705: Propagate schedulingGates set on PodTemplate when resuming by @mimowo in #706

Full Changelog: v0.7.0...v0.7.2

Release v0.7.1

18 Nov 23:35
v0.7.1
Compare
Choose a tag to compare

What's Changed

  • Update docs for v0.7.0 (release branch) by @danielvegamyhre in #691
  • Automated cherry pick of #705: Propagate schedulingGates set on PodTemplate when resuming by @mimowo in #706

Full Changelog: v0.7.0...v0.7.1

v0.7.0

26 Oct 18:20
f76f2a7
Compare
Choose a tag to compare

Highlights

What's Changed

  • fix: delete active jobs right away when job finishes even when TTLSecondsAfterFinished is set by @CecileRobertMichon in #667
  • Bump github.com/onsi/ginkgo/v2 from 2.20.0 to 2.20.1 by @dependabot in #663
  • Bump github.com/prometheus/client_golang from 1.20.0 to 1.20.2 by @dependabot in #664
  • Bump kubernetes dependencies to v0.31.x. by @mbobrovskyi in #670
  • Bump github.com/onsi/ginkgo/v2 from 2.20.1 to 2.20.2 by @dependabot in #668
  • Bump github.com/onsi/gomega from 1.34.1 to 1.34.2 by @dependabot in #669
  • chore: update README.md e2e test version for v1.31.0 by @googs1025 in #671
  • Add test-python-sdk on Makefile test. by @mbobrovskyi in #673
  • Bump github.com/prometheus/client_golang from 1.20.2 to 1.20.3 by @dependabot in #674
  • feat: add component config by @rainfd in #609
  • Bump the kubernetes group with 6 updates by @dependabot in #675
  • Add global-job-replicas label/annotation by @GiuseppeTT in #677
  • Add examples for three existing failure policy actions. by @jedwins1998 in #601
  • Bump github.com/prometheus/client_golang from 1.20.3 to 1.20.4 by @dependabot in #679
  • chore: use symbolic link instead of directory by @googs1025 in #630
  • Priority-based exclusive placement by @ahg-g in #687
  • Bump github.com/prometheus/client_golang from 1.20.4 to 1.20.5 by @dependabot in #688
  • Add restart strategy by @nstogner in #686

New Contributors

Full Changelog: v0.7.0-devel...v0.7.0

v0.6.0

20 Aug 16:20
d66f1d5
Compare
Choose a tag to compare

Highlights

  • New JobSet Failure Policy API - allows users to configure different behavior for different types of errors, enabling them to use compute resources more efficiently and improve ML training goodput.
  • Add Coordinator field to JobSet spec, enabling user to define a global coordinator pod for distributed ML/HPC workloads. The stable network endpoint for this pod will be added as a label and annotation to every Job and Pod in the JobSet for easy use in application code. A common use case for this is TPU Multislice training with multiple different Job templates. See linked issue for details.
  • Add global Job index label/annotation to every Job and Pod, which is needed to support TPU Multislice training with multiple different Job templates. See linked issue for details.
  • Added new metrics
  • Improved test coverage
  • Bug fixes
  • New examples and documentation

What's Changed

New Contributors

Full Changelog: v0.6.0-devel...v0.6.0

JobSet v0.5.2

04 Jun 17:42
8637f29
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.5.1...v0.5.2

v0.5.1

09 May 17:50
43f8137
Compare
Choose a tag to compare

Highlights

  • Fixed bug causing foreground cascading deletion policy to not work properly on JobSets #562
  • Fixed field path in error message in validation for ManagedBy field #527
  • Test coverage improvements, refactoring, additional documentation

What's Changed

Full Changelog: v0.6.0-devel...v0.5.1

v0.5.0

15 Apr 20:12
cb941fc
Compare
Choose a tag to compare

What's Changed

Highlights

  • JobSet TTL support added in #443
  • Docsite is live at https://jobset.sigs.k8s.io/ with updated documentation and examples.
  • Include first failed job name in event emitted when JobSet fails, to speed up the debugging process for large complex workloads #477
  • Lower default resource request for JobSet controller manager so it fits on default cloud CPU VMs, but keep high limit to support maximum performance #480
  • Perform only 1 JobSet status update per reconcile attempt to reduce pressure on k8s apiserver #494
  • Introduced MangedBy field to the JobSet spec to enable Multi-Kueue support

Detailed release notes

New Contributors

Full Changelog: v0.5.0-devel...v0.5.0

v0.4.0

28 Feb 21:12
9f2cb14
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.4.0-devel...v0.4.0