@hehe7318
Contributor

Description of changes

  • Cherry-pick changes from release-3.14 branch to develop branch

Checklist

  • Make sure you are pointing to the right branch.
  • If you're creating a patch for a branch other than develop add the branch name as prefix in the PR title (e.g. [release-3.6]).
  • Check all commits' messages are clear, describing what and why vs how.
  • Make sure to have added unit tests or integration tests to cover the new/modified code.
  • Check if documentation is impacted by this change.

Please review the guidelines for contributing and Pull Request Instructions.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

Himani Anil Deshpande and others added 29 commits December 24, 2025 15:12
* Using HN instance type for x86 for queue_without_imex as it is mainly run in isolated regions. For the commercial partition we focus on ARM
* Skipping test for ALinux 2
…role (aws#7087)

* Add GetFunction and GetPolicy permissions to the build image cleanup role

* Increase pcluster build image cleanup role revision number

* Update CHANGELOG

* Add unit test for actions in CleanupRole policy

* Fix linter errors
…se AZs "usie1-az1", "usie1-az2"

FSx Lustre Scratch_2 is not supported in other AZs
…DevSettings is null, DevSettings/InstanceTypesData is not a valid JSON or DevSettings/InstanceTypesData is missing required fields.

With this change we replaced the validation error message "'NoneType' object has no attribute 'get'", with a more actionable "Could not determine network cards for instance type ${instanceType}"
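
A minimal Python sketch of the kind of guard this describes. The function name `get_network_cards`, the shape of the instance-type data, and the `NetworkInfo`/`NetworkCards` field names are hypothetical illustrations, not the actual ParallelCluster code; only the error message text comes from the commit message. The sketch also assumes the data is read only from DevSettings.

```python
import json


def get_network_cards(instance_type, dev_settings):
    """Return the network cards for instance_type, or raise an actionable error.

    Hypothetical helper: dev_settings is assumed to be a dict (or None) whose
    optional "InstanceTypesData" entry is a JSON string keyed by instance type.
    """
    error = ValueError(
        f"Could not determine network cards for instance type {instance_type}"
    )
    # DevSettings may be null, so never call .get() on it unguarded.
    raw = (dev_settings or {}).get("InstanceTypesData")
    if raw is None:
        raise error
    try:
        data = json.loads(raw)
        # Assumed field names; a missing key is reported with the same message.
        return data[instance_type]["NetworkInfo"]["NetworkCards"]
    except (json.JSONDecodeError, KeyError, TypeError) as exc:
        raise error from exc
```

All three failure modes listed above (null DevSettings, invalid JSON, missing fields) surface as the same actionable message instead of an attribute error on `None`.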
* Using self for Owner Filter for patched AL2 DLAMI
…nd compute nodes

Prior to this commit, there were intermittent failures caused by insufficient capacity for the head node instance type.

Although the head node is not in the placement group, which is used by the integration tests framework to make the capacity reservation, this commit "hopes" that the AZ has adequate capacity of the instance type. In other words, if the integration test framework can make a capacity reservation of 35 instances in a placement group in the AZ, this commit hopes there will be an additional 1 instance anywhere in the AZ.
…nd compute nodes when possible

This commit uses the same instance type for both the compute nodes and the head node when the instance type is available in the open capacity pool. When the instance type is p4, p5, p6, or hpc, which require capacity reservations or capacity blocks, the head node still uses a different instance type, as in the logic before this commit.

Prior to this commit, there were intermittent failures caused by insufficient capacity for the head node instance type.

Although the head node is not in the placement group, which is used by the integration tests framework to make the capacity reservation, this commit "hopes" that the AZ has adequate capacity of the instance type. In other words, if the integration test framework can make a capacity reservation of 35 instances in a placement group in the AZ, this commit hopes there will be an additional 1 instance anywhere in the AZ.
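
The selection rule described in this commit message could be sketched roughly as follows. The function name, the constant, and the prefix-matching approach are assumptions made for illustration; the actual test framework code may decide this differently.

```python
# Families named in the commit message as requiring capacity reservations
# or capacity blocks (hypothetical constant name).
RESERVED_CAPACITY_PREFIXES = ("p4", "p5", "p6", "hpc")


def choose_head_node_instance_type(compute_instance_type, default_head_node_instance_type):
    """Reuse the compute instance type for the head node when it is expected
    to be available in the open capacity pool; otherwise keep the default."""
    family = compute_instance_type.split(".")[0]  # e.g. "p4d" from "p4d.24xlarge"
    if family.startswith(RESERVED_CAPACITY_PREFIXES):
        return default_head_node_instance_type
    return compute_instance_type
```

For example, under these assumptions `choose_head_node_instance_type("c5.large", "m5.xlarge")` returns the compute type, while a `p5` or `hpc` compute type falls back to the default head node type.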
…d_scheduling

The changed line runs on c5.large, which has 4GB of physical memory. When the test program used 3GB, it failed in an isolated region on RHEL8. Slightly reduce the memory usage to 2.8GB to make the test pass. ParallelCluster tests shouldn't check memory usage too strictly because memory is managed by the operating system.
…ntegration tests as FSX take 20+ mins if throttled
* the package does not exist in Amazon Linux and Rocky/RHEL 8 AMIs; we avoid errors like Error: Unable to find a match: kernel6.12-devel-matched-6.12.58-82.121.amzn2023.aarch64
to prevent background updates of SSM agent that could lead to reboot failures.
…fails (aws#7171)

When build image fails, the image_list is empty but the code directly
accesses image_list[0], causing IndexError that masks the real error.
Add safety check to only access list elements when list is not empty.
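
A minimal sketch of the guard described above, in Python. The function names and the `build_error` parameter are hypothetical; the point is only that an empty `image_list` is checked before indexing, so the real build failure is reported instead of an `IndexError`.

```python
def first_image_or_none(image_list):
    """Return the first image if the list is non-empty, otherwise None."""
    return image_list[0] if image_list else None


def require_built_image(image_list, build_error):
    """Return the built image, or raise an error that carries the real cause.

    Hypothetical helper: build_error stands in for whatever failure detail
    the caller collected from the failed build.
    """
    image = first_image_or_none(image_list)
    if image is None:
        # Without the check above, image_list[0] would raise IndexError here
        # and hide build_error from the test output.
        raise RuntimeError(f"Build image failed: {build_error}")
    return image
```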
…e on Ubuntu24 (aws#7164)

* Skip dcv installation when test_build_image on ubuntu24.04
@hehe7318 hehe7318 requested a review from a team as a code owner December 24, 2025 20:22
@hehe7318 hehe7318 added the 3.x and skip-changelog-update (disables the check that enforces changelog updates in PRs) labels Dec 24, 2025
@hehe7318 hehe7318 changed the title from "[Develop] Cherry-pick changes from release-3.14 branch to develop branch" to "[Develop][CLI] Cherry-pick changes from release-3.14 branch to develop branch" Dec 24, 2025
@hehe7318 hehe7318 merged commit 042c7aa into aws:develop Dec 25, 2025
27 of 30 checks passed
5 participants