* VAULT-31402: Add verification for all container images
Add verification for all container images that are generated as part of
the build. Before this change we only ever tested a limited subset of
"default" containers based on Alpine Linux that we publish via the
Docker Hub and AWS ECR.
Now we support testing all Alpine and UBI based container images. We
also verify the repository and tag information embedded in each image by
deploying it and checking that the repo and tag metadata match our
expectations.
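The real checks run inside the Enos k8s scenario, but the same idea
expressed with plain Docker commands looks roughly like this (the step
name, archive, and expected values are hypothetical, not the actual
scenario code):

```yaml
- name: Verify embedded repository and tag metadata
  env:
    IMAGE_ARCHIVE: vault-ubi.tar    # hypothetical archive from the build
    EXPECTED_REPO: hashicorp/vault  # hypothetical expected values
    EXPECTED_TAG: 1.17.0-ubi
  run: |
    docker load --input "$IMAGE_ARCHIVE"
    # Fails if the loaded archive does not carry the expected repo:tag.
    docker image inspect --format '{{ join .RepoTags ", " }}' \
      "${EXPECTED_REPO}:${EXPECTED_TAG}"
```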
This does change the k8s scenario interface quite a bit. We now take in
an archive image and set image/repo/tag information based on the
scenario variants.
To enable this I also needed to add `tar` to the UBI base image. It was
already available in the Alpine image and is used to copy utilities to
the image when deploying and configuring the cluster via Enos.
Since some images contain multiple tags we also add samples for each
image and randomly select which variant to test on a given PR.
Signed-off-by: Ryan Cragun <me@ryan.ec>
Update hashicorp/actions-packaging-linux to our rewritten version
that no longer requires building a Docker container or relying on code
hosted in a non-HashiCorp repo for packaging.
Because internal actions are not managed via the tsccr trusted
components db in the same manner as external actions, the tsccr helper
is unable to easily re-pin hashicorp/* actions. As such, we unpin some
pinned hashicorp/* actions so that they automatically pull in compatible
updates.
Signed-off-by: Ryan Cragun <me@ryan.ec>
Update the Github Actions pins to use the next generation of actions
that are supported by CRT.
In some cases these are simply to resolve Node 16 deprecations. In
others, we can now use `action/upload-artifact@v4` and
`actions/download-artifact@v4` since the next generation of actions like
`hashicorp/actions-docker-build@v2` and
`hashicorp/actions-persist-metadata@v2` use the `v4` versions of these.
Signed-off-by: Ryan Cragun <me@ryan.ec>
Pin to the latest actions in preparation for the migration to
`actions/upload-artifact@v4`, `actions/download-artifact@v4`, and
`hashicorp/actions-docker-build@v2` on May 6 or 7.
Signed-off-by: Ryan Cragun <me@ryan.ec>
* Fix the build notification. It appears that during a rebase the JSON
payload was slightly corrupted.
* Don't create a successful CI step summary if the CI workflow is
cancelled.
* Don't create a successful CI comment if the workflow was cancelled.
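For example, guarding the "success" summary and comment steps against
cancelled runs looks roughly like this (the `needs.test` reference is
hypothetical):

```yaml
- name: Create CI summary
  # Skip the success summary/comment when the run was cancelled.
  if: ${{ !cancelled() && needs.test.result == 'success' }}  # `needs.test` is hypothetical
  run: echo "All checks passed" >> "$GITHUB_STEP_SUMMARY"
```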
Signed-off-by: Ryan Cragun <me@ryan.ec>
* The scheduled workflow event name is `schedule`, not `scheduled`.
* We should skip our completed workflow entirely for draft pull requests
(see the sketch below).
* Clean up and flesh out the build workflow comment a bit more.
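A minimal sketch of that gating (the job name and the exact shape of the
condition are hypothetical):

```yaml
jobs:
  completed-successfully:  # hypothetical job name
    # Cron-triggered runs report `schedule` (not `scheduled`), and draft
    # PRs skip the job entirely.
    if: >-
      github.event_name == 'schedule' ||
      github.event.pull_request.draft == false
    runs-on: ubuntu-latest
    steps:
      - run: echo "CI completed successfully"
```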
Signed-off-by: Ryan Cragun <me@ryan.ec>
Context
-------
Building and testing Vault artifacts on pull requests and merges is
responsible for about a third of our overall spend on Vault CI. Of the
artifacts that we ship as part of a release, we run Enos test scenarios
against the `linux/amd64` and `linux/arm64` binaries and their derivative
artifacts. The extended build artifacts for non-Linux platforms or less
common machine architectures are not tested at this time. They are built,
notarized, and signed as part of every pull request update and merge. As
we don't actually test these artifacts, the only gain we get from this
rather expensive behavior is that we won't merge a change that would
prevent Vault from building on one of the extended targets. Extended
platform or architecture changes are quite rare, so performing this work
as frequently as we do is costly in both money and developer time for
relatively little safety benefit.
Goals
-----
Rethink how and when we build Vault binaries and artifacts so that we can
spend less money on repetitive work while also reducing the time it takes
for the build and test pipelines to complete.
Solution
--------
Instead of building all release artifacts on every push, we'll opt to build
only our testable (core) artifacts. With this change we are introducing a
bit of risk. We could merge a change that breaks an extended platform and
only find out after the fact when we trigger a complete build for a release.
We'll hedge against that risk by building all of the release targets on a
scheduled cadence to ensure that they are still buildable.
We'll make building all of the targets optional on any pull request by
applying a `build/all` label to the pull request.
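A minimal sketch of those two triggers in standard GitHub Actions syntax
(the cron cadence, job name, and make target are hypothetical):

```yaml
on:
  pull_request:
  schedule:
    - cron: '0 5 * * 1-5'  # hypothetical cadence for the full build

jobs:
  build-extended:
    # Build extended targets on the schedule, or when a PR carries the label.
    if: >-
      github.event_name == 'schedule' ||
      contains(github.event.pull_request.labels.*.name, 'build/all')
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - run: make ci-build-extended  # hypothetical target
```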
Further considerations
----------------------
* We want to reduce the total number of workflows and runners for all of our
pipelines if possible. As each workflow runner has infrastructure cost and
runner time penalties, using a single runner over many is often preferred.
* Many of our job runners have been optimized for cost and performance. We
should simplify the choice of which runners to use.
* CRT requires us to use the same build workflow in both CE and Ent.
Historically that meant that modifying `build.yml` in CE would result in a
merge conflict with `build.yml` in Ent, and break our merge workflows.
* Workflow flow control in both `build.yml` and `ci.yml` can be quite
complicated, as each needs to maintain compatibility whether executed as CE
or Ent, and when triggered by various GitHub events like pull_request,
push, and workflow_call, each with its own requirements.
* Many jobs utilize similar patterns of flow control and metadata but are not
reusable.
* Workflow call depth has a maximum of four, so we need to be quite
considerate when calling other workflows.
* Called workflows can only have 10 inputs.
Implementation
--------------
* Refactor the `build.yml` workflow to be agnostic to whether or not it is
executing in CE or Ent. That makes future updates to the build much easier
as we won't have to worry about merge conflicts when the change is merged
downstream.
* Extract common steps in workflows into composite actions that we can reuse.
* Fix bugs where some but not all workflows would use different Git
references when building and testing a pull request.
* We rewrite the application, docs, and UI change helpers as a composite
action. This allows us to reuse this logic to make consistent behavior
choices across build and CI (see the sketch after this list).
* We combine several `build.yml` and `ci.yml` jobs into our final job.
This reduces the number of workflows required for the same behavior while
saving time overall.
* Update most of our action pins.
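As a rough illustration of the composite-action approach for the change
helpers (the file path, outputs, and classification rules here are all
hypothetical, not the actual action):

```yaml
# .github/actions/changed-files/action.yml (hypothetical path)
name: changed-files
description: Classify the files changed by a pull request
outputs:
  app-changed:
    value: ${{ steps.classify.outputs.app-changed }}
  docs-changed:
    value: ${{ steps.classify.outputs.docs-changed }}
  ui-changed:
    value: ${{ steps.classify.outputs.ui-changed }}
runs:
  using: composite
  steps:
    - id: classify
      shell: bash
      run: |
        # Hypothetical classification; assumes a pull_request event and a
        # checkout with enough history to diff against the base.
        files="$(git diff --name-only "${{ github.event.pull_request.base.sha }}...HEAD")"
        {
          echo "docs-changed=$(grep -qE  '^website/'      <<< "$files" && echo true || echo false)"
          echo "ui-changed=$(grep -qE    '^ui/'           <<< "$files" && echo true || echo false)"
          echo "app-changed=$(grep -qvE  '^(website|ui)/' <<< "$files" && echo true || echo false)"
        } >> "$GITHUB_OUTPUT"
```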
Results
-------
| Metric | Before | After | Diff |
|-------------------|----------|---------|-------|
| Duration: | ~14-18m | ~15-18m | ~ = |
| Workflows: | 43 | 18 | - 58% |
| Billable time: | ~1h15m | 16m | - 79% |
| Saved artifacts: | 34 | 12 | - 65% |
Infra costs should map closely to billable time.
Network I/O costs should map closely to the workflow count.
Storage costs should map directly with saved artifacts.
We could probably get parity with duration by getting more clever with
our UBI container build, as that's where we're seeing the increase. I'm
not yet concerned as it takes roughly the same time for this job to
complete as it did before.
While the CI workflow was not the focus of this PR, some shared
refactoring does show marginal improvements there.
| Metric | Before | After | Diff |
|-------------------|----------|----------|--------|
| Duration: | ~24m | ~12.75m | - 15% |
| Workflows: | 55 | 47 | - 8% |
| Billable time: | ~4h20m | ~3h36m | - 7% |
Further focus on streamlining the CI workflows would likely yield a few
more marginal improvements, but nothing on the order of what we've seen
with the build workflow.
Signed-off-by: Ryan Cragun <me@ryan.ec>
* VAULT-20487 update build failure slack output
* VAULT-20487 add new needs
* VAULT-20487 make it run on my branch
* VAULT-20487 make it run
* VAULT-20487 finalize?
Update our `proxy` and `agent` scenarios to support new variants and
perform baseline verification along with their scenario-specific
verification.
We integrate these updated scenarios into the pipeline by adding them
to artifact samples.
We've also improved the reliability of the `autopilot` and `replication`
scenarios by refactoring our IP address gathering. Previously, we'd ask
vault for the primary IP address and use some Terraform logic to determine
followers. The leader IP address gathering script was also implicitly
responsible for ensuring that a found leader was within a given group of
hosts (and thus for waiting for a given cluster to elect a leader), and
for doing some arithmetic and emitting `replication`-specific output data.
We've broken these responsibilities into individual modules, improved their
error messages, and fixed various races and bugs, including:
* Fix a race between creating the file audit device and installing and starting
vault in the `replication` scenario.
* Fix how we determine our leader and follower IP addresses. We now query
vault directly instead of relying on a prior implementation that inferred
the followers and sometimes did not allow all nodes to be considered as
the expected leader.
* Fix a bug where we'd always fail on the first wrong condition
in the `vault_verify_performance_replication` module.
We also performed some maintenance tasks on Enos scenarios by updating our
references from `oss` to `ce` to handle the naming and license changes,
and we enabled `shellcheck` linting for enos module scripts.
* Rename `oss` to `ce` for license and naming changes.
* Convert template enos scripts to scripts that take environment
variables.
* Add `shellcheck` linting for enos module scripts.
* Add additional `backend` and `seal` support to `proxy` and `agent`
scenarios.
* Update scenarios to include all baseline verification.
* Add `proxy` and `agent` scenarios to artifact samples.
* Remove IP address verification from the `vault_get_cluster_ips`
modules and implement a new `vault_wait_for_leader` module.
* Determine follower IP addresses by querying vault in the
`vault_get_cluster_ips` module.
* Move replication-specific behavior out of the `vault_get_cluster_ips`
module and into its own `replication_data` module.
* Extend initial version support for the `upgrade` and `autopilot`
scenarios.
We also discovered an issue with undo_logs that is described in
VAULT-20259. As such, we've disabled the undo_logs check until it has
been fixed.
Signed-off-by: Ryan Cragun <me@ryan.ec>
Replace our prior implementation of Enos test groups with the new Enos
sampling feature. With this feature we're able to describe which
scenarios and variant combinations are valid for a given artifact and
allow enos to create a valid sample field (a matrix of all compatible
scenarios) and take an observation (select some to run) for us. This
ensures that every valid scenario and variant combination will
now be a candidate for testing in the pipeline. See QT-504[0] for further
details on the Enos sampling capabilities.
Our prior implementation only tested the amd64 and arm64 zip artifacts,
as well as the Docker container. We now include the following new artifacts
in the test matrix:
* CE Amd64 Debian package
* CE Amd64 RPM package
* CE Arm64 Debian package
* CE Arm64 RPM package
Each artifact includes a sample definition for both pre-merge/post-merge
(build) and release testing.
Changes:
* Remove the hand-crafted `enos-run-matrices` CI matrix targets and replace
them with per-artifact samples.
* Use enos sampling to generate different sample groups on all pull
requests.
* Update the enos scenario matrices to handle HSM and FIPS packages.
* Simplify enos scenarios by using shared globals instead of
cargo-culted locals.
Note: This will require coordination with vault-enterprise to ensure a
smooth migration to the new system. Integrating new scenarios or
modifying existing scenarios/variants should be much smoother after this
initial migration.
[0] https://github.com/hashicorp/enos/pull/102
Signed-off-by: Ryan Cragun <me@ryan.ec>
* adding new version bump refactoring
* address comments
* remove changes used for testing
* add the version bump event!
* fix local enos scenarios
* remove unnecessary local get_local_metadata steps from scenarios
* add version base, pre, and meta to the get_local_metadata module
* use the get_local_metadata module in the local builder for version
metadata
* update the version verifier to always require a build date
Signed-off-by: Ryan Cragun <me@ryan.ec>
* Update to embed the base version from the VERSION file directly into version.go.
This ensures that any Go tests can use the same (valid) version as CI, and so can local builds and local enos runs.
We still want to be able to set a default metadata value in version_base.go, as this is not something that we set in the VERSION file; we pass it in as an ldflag in CI (this matters more for ENT, but we want to keep these files in sync across repos).
* update comment
* fixing bad merge
* removing actions-go-build as it won't work with the latest go caching changes
* fix logic for getting version in enos-lint.yml
* fix version number
* removing unneeded module
---------
Signed-off-by: Ryan Cragun <me@ryan.ec>
Co-authored-by: Claire <claire@hashicorp.com>
Co-authored-by: Ryan Cragun <me@ryan.ec>
We further optimize the CI workflow for better cost and speed.
We tested the Go CI workflows across several instance classes
and updated our compute choices. We achieve an average execution
speed improvement of 2-2.5 minutes per test workflow while
reducing the infrastructure cost by about 20%. We also save
another ~2 minutes by installing `gotestsum` from its GitHub release
instead of downloading the Go modules and compiling it every time.
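For reference, installing `gotestsum` from a release archive rather than
compiling it looks something like the step below (the version and asset
name are illustrative and would need to match an actual release):

```yaml
- name: Install gotestsum from its GitHub release
  env:
    GOTESTSUM_VERSION: 1.11.0  # illustrative; pin to whatever CI actually uses
  run: |
    curl -sSfL -o /tmp/gotestsum.tar.gz \
      "https://github.com/gotestyourself/gotestsum/releases/download/v${GOTESTSUM_VERSION}/gotestsum_${GOTESTSUM_VERSION}_linux_amd64.tar.gz"
    tar -xzf /tmp/gotestsum.tar.gz -C /tmp
    sudo install -m 0755 /tmp/gotestsum /usr/local/bin/gotestsum
```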
In addition to the speed improvements, we also further reduced our cache
usage by updating the `security-scan` workflow to not cache Go modules.
We also use the `cache/save` and `cache/restore` actions for timing
caches. This results in saving half as many cache entries for timing
data.
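The save/restore split looks roughly like this (the path and key naming
are hypothetical):

```yaml
- name: Restore test timing data
  uses: actions/cache/restore@v4
  with:
    path: test-results/go-test-timing  # hypothetical path
    key: go-test-timing-${{ github.run_id }}
    restore-keys: go-test-timing-

# ... run the tests, updating the timing data ...

- name: Save test timing data
  if: github.ref_name == 'main'  # only refresh the cache from the default branch
  uses: actions/cache/save@v4
  with:
    path: test-results/go-test-timing
    key: go-test-timing-${{ github.run_id }}
```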
*UI test results*
Results for two runs of each instance type:
* c6a.2xlarge (12m54s, 11m55s)
* c6a.4xlarge (10m47s, 11m6s)
* c6a.8xlarge (11m32s, 10m51s)
* m5.2xlarge (15m23s, 14m16s)
* m5.4xlarge (14m48s, 12m54s)
* m5.8xlarge (12m27s, 12m24s)
* m6a.2xlarge (11m55s, 12m20s)
* m6a.4xlarge (10m54s, 10m43s)
* m6a.8xlarge (10m33s, 10m51s)
Current runner:
m5.2xlarge (15m23s, 14m16s, avg 14m50s) @ 0.448/hr = $0.11
Faster candidates
* c6a.2xlarge (12m54s, 11m55s, avg 12m24s) @ 0.3816/hr = $0.078
* m6a.2xlarge (11m55s, 12m20s, avg 12m8s) @ 0.4032/hr = $0.081
* c6a.4xlarge (10m47s, 11m6s, avg 10m56s) @ 0.7632/hr = $0.139
* m6a.4xlarge (10m54s, 10m43s, avg 10m48s) @ 0.8064/hr = $0.140
Best bang for the buck for test-ui:
m6a.2xlarge, > 25% cost savings from current and we save ~2.5 minutes.
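For reference, the dollar figures above are simply the hourly rate
multiplied by the average duration, e.g.:
* m5.2xlarge (current): 14m50s = ~0.247hr x $0.448/hr = ~$0.11
* m6a.2xlarge: 12m8s = ~0.202hr x $0.4032/hr = ~$0.081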
*Go test results*
During testing the external replication tests, when not broken up, will
always take the longest. Our original analysis focuses on this job.
Most other tests groups will finish ~3m faster so we'll use subtract
that time when estimating the cost for the whole job.
external replication job results:
* c6a.2xlarge (20m49s, 19m20s, avg 20m5s)
* c6a.4xlarge (19m1s, 19m38s, avg 19m20s)
* c6a.8xlarge (19m51s, 18m54s, avg 19m23s)
* m5.2xlarge (22m12s, 20m29s, avg 21m20s)
* m5.4xlarge (20m7s, 19m3s, avg 19m35s)
* m5.8xlarge (20m24s, 19m42s, avg 20m3s)
* m6a.2xlarge (21m10s, 19m37s, avg 20m23s)
* m6a.4xlarge (18m58s, 19m51s, avg 19m24s)
* m6a.8xlarge (19m27s, 18m47s, avg 19m7s)
There is little separation in time when we increase class size. In the
best case a class size increase yields about a ~5% performance increase
and doubles the cost. For test-go our best bang for the buck is
certainly going to be in the 2xlarge class.
Current runner:
m5.2xlarge (22m12s, 20m29s, avg 21m20s) @ 0.448/hr (16@avg-3m + 1@avg) = $2.35
Candidates in the same class
* c6a.2xlarge (20m49s, 19m20s, avg 20m5s) @ 0.3816/hr (16@avg-3m + 1@avg) = $1.86
* m6a.2xlarge (21m10s, 19m37s, avg 20m23s) @ 0.4032/hr (16@avg-3m + 1@avg) = $2.00
Best bang for the buck for test-go:
c6a.2xlarge: 20% cost savings and save about ~2.25 minutes.
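To unpack the `(16@avg-3m + 1@avg)` notation above: we cost 16 test
groups at roughly 3 minutes under the replication average, plus 1 group
(external replication) at the average. For the current m5.2xlarge runner
that works out to:
16 x (21m20s - 3m) + 1 x 21m20s = ~314.7 runner-minutes = ~5.24hr x $0.448/hr = ~$2.35
The same arithmetic yields ~$1.86 for c6a.2xlarge and ~$2.00 for
m6a.2xlarge.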
We ran the tests on similar instances and saw execution times similar to
test-go, so we can use the same recommended instance sizes.
After breaking up test-go's external replication tests, the longest group
was shorter on average. I chose to look at group 3 as it was usually the
longest grouping:
* c6a.2xlarge: (14m51s, 14m48s)
* c6a.4xlarge: (14m14s, 14m15s)
* c6a.8xlarge: (14m0s, 13m54s)
* m5.2xlarge: (15m36s, 15m35s)
* m5.4xlarge: (14m46s, 14m49s)
* m5.8xlarge: (14m25s, 14m25s)
* m6a.2xlarge: (14m51s, 14m53s)
* m6a.4xlarge: (14m16s, 14m16s)
* m6a.8xlarge: (14m2s, 13m57s)
Again, we see ~5% performance gains between the 2x and 8x instance classes
at quadruple the cost. The c6a and m6a families are almost identical, with
the c6a class being cheaper.
*Notes*
* UI and Go Test timing results: https://github.com/hashicorp/vault-enterprise/actions/runs/5556957460/jobs/10150759959
* Go Test with data race detection timing results: https://github.com/hashicorp/vault-enterprise/actions/runs/5558013192
* Go Test with replication broken up: https://github.com/hashicorp/vault-enterprise/actions/runs/5558490899
Signed-off-by: Ryan Cragun <me@ryan.ec>
* fix multiline
* shellcheck, and success message for builds
* add full path
* cat the summary
* fix and faster
* fix if condition
* base64 in a separate step
* echo
* check against empty string
* add echo
* only use matrix ids
* only id
* echo matrix
* remove wrapping array
* tojson
* try echo again
* use jq to get packages
* don't quote
* only run binary tests once
* only run binary tests once
* test what's wrong with the binary
* separate file
* use matrix file
* failed test
* update comment on success
* correct variable name
* base64 fix
* output to file
* use multiline
* fix
* fix formatting
* fix newline
* fix whitespace
* correct body, remove comma
* small fixes
* shellcheck
* another shellcheck fix
* fix deprecation checker
* only run comments for prs
* Update .github/workflows/test-go.yml
Co-authored-by: Mike Palmiotto <mike.palmiotto@hashicorp.com>
* Update .github/workflows/test-go.yml
Co-authored-by: Mike Palmiotto <mike.palmiotto@hashicorp.com>
* fixes
---------
Co-authored-by: Mike Palmiotto <mike.palmiotto@hashicorp.com>
* report build failures in a PR comment
* address action linter
* linter
* add an id
* change permission
* report failure from build yaml
* linter fix
* report workflow url
* reorder jobs
* complete boolean eval
* single quote
* experiment getting failed jobs
* linter
* pass failed jobs one by one
* failed jobs are reported cancelled
* use * instead of @
* some polishing
* find comment ID, create or update it
* some clean up
* missing }
In order to reliably store Go test times in the GitHub Actions cache we
need to reduce our cache thrashing by not using more than 10GB across all
of our caches. This change reduces our cache usage significantly by
sharing the Go module cache between our Go CI workflows and our build
workflows. We lose our per-builder cache, which will result in a bit of a
performance hit, but we'll enable better automatic rebalancing of our CI
workflows. Overall we should see a per-branch reduction in cache sizes
from ~17GB to ~850MB.
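Roughly, the shared cache boils down to keying the Go module cache on
`go.sum` alone, so every workflow on a branch restores the same entry; a
minimal sketch (the key prefix is illustrative):

```yaml
- uses: actions/cache@v4
  with:
    path: ~/go/pkg/mod
    # Keyed on go.sum only, so build and Go test workflows on the same
    # branch all restore (and re-save) one shared module cache.
    key: go-modules-${{ hashFiles('**/go.sum') }}
```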
Some preliminary investigation into this new strategy:
Prior build workflow strategy on a cache miss:
Download modules: ~20s
Build Vault: ~40s
Upload cache: ~30s
Total: ~1m30s
Prior build workflow strategy on a cache hit:
Download and decompress modules and build cache: ~12s
Build Vault: ~15s
Total: ~28s
New build workflow strategy on a cache miss:
Download modules: ~20s
Build Vault: ~40s
Upload cache: ~6s
Total: ~1m6s
New build workflow strategy on a cache hit:
Download and decompress modules: ~3s
Build Vault: ~40s
Total: ~43s
Expected time if we used no Go caching:
Download modules: ~20s
Build Vault: ~40s
Total: ~1m
Signed-off-by: Ryan Cragun <me@ryan.ec>
* use verify changes for docs to skip tests
* add verify-changes to the needed jobs
* skip go tests for doc/ui only changes
* fix a job ref
* change names, remove script
* remove ui conditions
* separate flags
* feedback
Improve our build workflow execution time by using custom runners,
improved caching and conditional Web UI builds.
Runners
-------
We improve our build times[0] by using larger custom runners[1] when
building the UI and Vault.
Caching
-------
We improve Vault caching by keeping a cache for each build job. This
strategy has the following properties, which should result in faster
build times when `go.sum` hasn't changed since prior builds, or when a
pull request is retried or updated after a prior successful build (a
sketch of the resulting cache key follows the list below):
* Builds will restore cached Go modules and Go build cache according to
the Go version, platform, architecture, go tags, and hash of `go.sum`
that relates to each individual build workflow. This reduces the
amount of time it will take to download the cache on hits and upload
the cache on misses.
* Parallel build workflows won't clobber each other's build cache. This
results in much faster compile times after cache hits because the Go
compiler can reuse the platform, architecture, and tag specific build
cache that it created on prior runs.
* Older modules and build cache will not be uploaded when creating a new
cache. This should result in lean cache sizes on an ongoing basis.
* On cache misses we will have to upload our compressed module and build
cache. This will slightly extend the build time for pull requests that
modify `go.sum`.
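A rough shape of that per-build cache key (the matrix and env variable
names here are illustrative, not the actual workflow's):

```yaml
- uses: actions/cache@v4
  with:
    path: |
      ~/go/pkg/mod
      ~/.cache/go-build
    # One cache per build job: Go version, platform, architecture, build
    # tags, and the go.sum hash all participate in the key.
    key: build-${{ env.GO_VERSION }}-${{ matrix.goos }}-${{ matrix.goarch }}-${{ matrix.go-tags }}-${{ hashFiles('**/go.sum') }}
```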
Web UI
------
We no longer build the web UI in every build workflow. Instead we separate
the UI build into its own workflow and cache the resulting assets.
The same UI assets are restored from cache during build workflows. This
strategy has the following properties:
* If the `ui` directory has not changed from prior builds we'll restore
`http/web_ui` from cache and skip rebuilding the UI unnecessarily
(sketched below).
* We continue to use the built-in `yarn` caching functionality in
`actions/setup-node`. The default mode saves the `yarn` global cache
to improve UI build times if the cache has not been modified.
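A rough sketch of the UI cache gate described above (the cache key and
build target are illustrative):

```yaml
- name: Restore pre-built UI assets
  id: ui-cache
  uses: actions/cache@v4
  with:
    path: http/web_ui
    key: ui-assets-${{ hashFiles('ui/**') }}

- name: Build the UI
  # Only rebuild when nothing was restored, i.e. the ui/ tree changed.
  if: steps.ui-cache.outputs.cache-hit != 'true'
  run: make static-dist  # hypothetical build target
```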
Changes
-------
* Add per-platform/architecture Go module and build caching
* Move UI building into a separate job and cache the result
* Restore UI cache during build
* Pin workflows
Notes
-----
[0] https://hashicorp.atlassian.net/browse/QT-578
[1] https://github.com/hashicorp/vault/actions/runs/5415830307/jobs/9844829929
Signed-off-by: Ryan Cragun <me@ryan.ec>
* Update verify-changes to support external docs branches
Signed-off-by: Jaymala Sinha <jaymala@hashicorp.com>
* Revert QT-545 as the Enos workflow is not a workflow_run event
Signed-off-by: Jaymala Sinha <jaymala@hashicorp.com>
---------
Signed-off-by: Jaymala Sinha <jaymala@hashicorp.com>
* address lint reports
* add diff-oss-ci and test-ui jobs to ci GHA workflow
* Add actions linter workflow
* Fix actions linter errors
* pin 3rd party components with SHA hash and limit actionlint workflow to pull requests touching paths under .github directory
* Fix actionlint runner
* pin SHA hash of 3rd party components
use .go-version file to provide go version to setup-go action
remove unnecessary ref parameter in checkout action
---------
Co-authored-by: Brian Shore <bshore@hashicorp.com>