@mollyheamazon
Collaborator

What's changing and why?

The init experience PyTorch job integ test has been failing since the HPTO launch. The root cause is that the test uses a command flag with a previously set command, which is no longer supported now that the image has HyperpodElasticAgent installed. This PR removes the flag so that the job runs successfully.
For how to use a container command with a HyperpodElasticAgent-installed image, see this doc: https://docs.aws.amazon.com/sagemaker/latest/dg/sagemaker-eks-operator-install.html#sagemaker-eks-operator-elastic-agent

Before/After UX

Before:

After:

How was this change tested?

Are unit tests added?

Are integration tests added?

Reviewer Guidelines

‼️ Merge Requirements: PRs with failing integration tests cannot be merged without justification.

One of the following must be true:

  • All automated PR checks pass
  • Failed tests include local run results/screenshots proving they work
  • Changes are documentation-only

@mollyheamazon mollyheamazon requested a review from a team as a code owner December 12, 2025 22:54
Collaborator

@aviruthen aviruthen left a comment

Approved, pending unit/integ tests passing!

@mollyheamazon mollyheamazon merged commit a824151 into aws:main Dec 13, 2025
6 checks passed
2 participants