actions-runner-controller

Commit Graph

Author	SHA1	Message	Date
Bassem Dghaidi	6da1cde09c	Update runner version to 2.301.1 (#2182 ) Co-authored-by: TingluoHuang <TingluoHuang@github.com>	2023-01-19 05:36:05 -05:00
Yusuke Kuoka	d32319be50	fix(e2e): Make runner graceful shutdown checker cancellable (#2145 ) So that the whole test run can be stopped immediately with a failure, without failing until the verify timeout.	2023-01-13 07:15:37 +09:00
Yusuke Kuoka	057b04763f	fix(e2e): Use the correct full chart name in test (#2146 ) The whole E2E test breaks due to the invalid chart name without this fix.	2023-01-13 07:15:05 +09:00
Bassem Dghaidi	e71c64683b	Update runner version to 2.300.2 (#2141 ) * Update runner version to 2.300.2 * Bump up runner and container hooks versions * Bump up runner version * Bump up runner and container hooks versions * Update actions-runner-dind-rootless.ubuntu-22.04.dockerfile * Update actions-runner-dind.ubuntu-20.04.dockerfile * Update actions-runner-dind.ubuntu-22.04.dockerfile * Update actions-runner.ubuntu-20.04.dockerfile * Update actions-runner.ubuntu-22.04.dockerfile * Bump up runner versions * Bump up container hooks versions	2023-01-11 08:29:32 -05:00
Nikola Jokic	aa6dab5a9a	Changes to folder structure to allow multigroups and changed go mod name (#2105 ) * Changed folder structure to allow multi group registration * included actions.github.com directory for resources and controllers * updated go module to actions/actions-runner-controller * publish arc packages under actions-runner-controller * Update charts/actions-runner-controller/docs/UPGRADING.md Co-authored-by: Yusuke Kuoka <ykuoka@gmail.com>	2022-12-28 09:38:34 +09:00
Callum Tait	a8417ec67e	feat: dind-rootless 22.04 runner (#2033 ) * feat: dind-rootless 22.04 runner * runner: Bring back packages needed by rootlesskit * e2e: Update E2E buildvars with ubuntu 22.04 dockerfiles * feat: use new uid for runner user * e2e: Make it possible to inject ubuntu version via envvar for actiosn-runner-dind image * doc: Use fsGroup=1001 for IRSA on Ubuntu 22.04 runner Co-authored-by: toast-gear <toast-gear@users.noreply.github.com> Co-authored-by: Yusuke Kuoka <ykuoka@gmail.com>	2022-12-07 19:02:35 +09:00
malachiobadeyi	fbdfe0df8c	1770 update log format and add additional fields to webhook server logs (#1771 ) * 1770 update log format and add runID and Id to worflow logs update tests, change log format for controllers.HorizontalRunnerAutoscalerGitHubWebhook use logging package remove unused modules add setup name to setuplog add flag to change log format change flag name to enableProdLogConfig move log opts to logger package remove empty else and reset timeEncoder update flag description use get function to handle nil rename flag and update logger function Update main.go Co-authored-by: Yusuke Kuoka <ykuoka@gmail.com> Update controllers/horizontal_runner_autoscaler_webhook.go Co-authored-by: Yusuke Kuoka <ykuoka@gmail.com> Update logging/logger.go Co-authored-by: Yusuke Kuoka <ykuoka@gmail.com> copy log opt per each NewLogger call revert to use autoscaler.log update flag descript and remove unused imports add logFormat to readme rename setupLog to logger make fmt * Fix E2E along the way Co-authored-by: Yusuke Kuoka <ykuoka@gmail.com>	2022-11-04 10:46:58 +09:00
Yusuke Kuoka	9bb416084b	e2e: Fix continuous rolling updater to do stop on test completion (#1979 )	2022-11-03 11:55:36 +00:00
Yusuke Kuoka	fdb049ba1e	e2e: Bump runner version to 2.299.1 (#1980 )	2022-11-03 10:02:54 +00:00
Yusuke Kuoka	c74ad6195f	Fix runners to do their best to gracefully stop on pod eviction (#1759 ) Ref #1535 Ref #1581 Signed-off-by: Yusuke Kuoka <ykuoka@gmail.com>	2022-11-01 20:30:10 +09:00
Yusuke Kuoka	666bba784c	e2e: Bump runner version to 2.297.0 (#1850 ) * e2e: Bump runner version to 2.296.2 * Update e2e_test.go	2022-10-04 20:25:13 +09:00
Cory Miller	c91e76f169	Add golangci-lilnt to CI (#1794 ) This introduces a linter to PRs to help with code reviews and code hygiene. I've also gone ahead and fixed (or ignored) the existing lints. I've only setup the default linters right now. There are many more options that are documented at https://golangci-lint.run/. The GitHub Action should add appropriate annotations to the lint job for the PR. Contributors can also lint locally using `make lint`.	2022-09-21 09:08:22 +09:00
Yusuke Kuoka	f73713859c	e2e: Fix workflow for rootless-dind test to actually pass	2022-08-27 07:12:06 +00:00
Yusuke Kuoka	e0a7be253e	e2e: Change the default runner rolling-update interval from 10s to 60s to let the runners actually get jobs assigned by GitHub Actions	2022-08-27 07:11:17 +00:00
Yusuke Kuoka	915739b972	e2e: Fix broken token expiration checks	2022-08-27 07:10:10 +00:00
Yusuke Kuoka	4925880e5e	e2e: Install workflow before starting continuous rolling-updates of runners	2022-08-27 07:08:56 +00:00
Yusuke Kuoka	c143fd50b5	e2e: Use newer version of actions/runner(0.296.0)	2022-08-27 07:07:56 +00:00
Yusuke Kuoka	dbd668ae2d	e2e: Set ARC_E2E_SKIP_RUNNERDEPLOYMENT to skip RunnerDeployment test	2022-08-26 01:48:54 +00:00
Yusuke Kuoka	5c1be3265b	e2e: Fix the token check to actually fail on expiration	2022-08-26 01:48:36 +00:00
Yusuke Kuoka	ebcd838501	e2e: Continuous rolling-update of runners while workflow jobs are running This should help revealing issues like https://github.com/actions-runner-controller/actions-runner-controller/issues/1535 if any.	2022-08-26 01:28:08 +00:00
Yusuke Kuoka	6ef276b239	e2e: Custom RBAC resources for make test success reporting work when k8s container mode or runner update hook is enabled	2022-08-26 01:28:08 +00:00
Yusuke Kuoka	f70f325f48	e2e: Set ARC_E2E_DO_DOCKER_BUILD to verify docker-build	2022-08-26 01:28:08 +00:00
Yusuke Kuoka	f7c336f9dd	e2e: Mention maintained versions of cert-manager for reference	2022-08-26 01:28:08 +00:00
Yusuke Kuoka	4bf1c12a98	e2e: Fix inability to install the stable version of ARC before the edge / Validate GH tokenn on start (#1748 ) Let me improve two things I had found while I was E2E-testing ARC for the upcoming 0.26.0 release. Signed-off-by: Yusuke Kuoka <ykuoka@gmail.com>	2022-08-25 10:25:06 +09:00
Yusuke Kuoka	ea94b3cc5b	e2e: Add new option to test rootless docker (#1742 ) Related to #1644 Signed-off-by: Yusuke Kuoka <ykuoka@gmail.com> Signed-off-by: Yusuke Kuoka <ykuoka@gmail.com>	2022-08-24 10:42:45 +09:00
Yusuke Kuoka	544d620bc3	e2e: Ensure ARC is roll-updated on deployment even if the container image tag name does not change	2022-07-10 16:16:32 +09:00
Yusuke Kuoka	7e4b6ebd6d	chart: Add rbac.allowGrantingKubernetesContainerModePermissions	2022-07-10 16:16:32 +09:00
Yusuke Kuoka	473295e3fc	Enhance the E2E test to be runnable against remote clusters on e.g. AWS EKS (#1610 ) This contains apparently enough changes to the current E2E test code to make it runnable against remote Kubernetes clusters. I was actually able to make the test passing against my AWS EKS based test clusters with these changes. You still need to trigger it manually from a local checkout of the ARC repo today. But this might be the foundation for automated E2E tests against major cloud providers.	2022-07-07 20:48:07 +09:00
Yusuke Kuoka	9f6f962fc7	Add toubleshooting for cert-manager ca error (#1598 ) I encountered this once while E2E testing ARC with K8s 1.22 and cert-manager 1.1.1. The K8s version is too high / The cert-manager is too low so you generally need to fix either. In a standard scenario, it should be more feasible and meaningful to upgrade cert-manager to a recent enough version that supports the new Kubernetes version.	2022-07-07 11:27:49 +09:00
Yusuke Kuoka	2a475f25c7	Use Argo Tunnel for exposing the autoscaler's webhook server (#1595 ) I've been manually setting up Argo Tunnel to expose the webhook server while running E2E tests so that I can cover the webhook-based autoscaling. This automates the setup process so that we can automatiaclly bring up and down cloudflared before/after the test run, so that it can be a part of our upcoming automated E2E test.	2022-07-07 11:27:27 +09:00
Yusuke Kuoka	b8e4eee904	Make it easier to E2E test on various K8s versions (#1599 )	2022-07-06 08:57:21 +09:00
Yusuke Kuoka	4446ba57e1	Cover ARC upgrade in E2E test (#1592 ) * Cover ARC upgrade in E2E test so that we can make it extra sure that you can upgrade the existing installation of ARC to the next and also (hopefully) it is backward-compatible, or at least it does not break immediately after upgrading. * Consolidate E2E tests for RS and RD * Fix E2E for RD to pass * Add some comment in E2E for how to release disk consumed after dozens of test runs	2022-07-01 21:32:05 +09:00
Yusuke Kuoka	dc4f116bda	Reflect manual test scenario for containerMode=kubernetes to E2E (#1588 ) With this my semi-automatic E2E manual testing becomes even easier :)	2022-06-30 09:09:58 +09:00
Yusuke Kuoka	6f3e23973d	Bump E2E runner version to 2.294.0 (#1586 ) so that every runner does not result in auto-updating itself on startup in E2E, which makes E2E take longer to complete.	2022-06-29 22:05:50 +09:00
Yusuke Kuoka	9974b1a2b7	e2e: Enable buildx in more images (#1530 )	2022-06-14 09:29:30 +01:00
Yusuke Kuoka	18bfb28c0b	e2e: ARC_E2E_NO_CLEANUP to prevent cleanup (#1470 ) A small improvement to our E2E test suite which allows you to set `ARC_E2E_NO_CLEANUP=whatever` to let it prevent the kind cluster cleanup on successful test run, so that you can rerun it without waiting for the new kind cluster to come up.	2022-05-26 10:59:50 +09:00
Yusuke Kuoka	63be0223ad	fix: Avoid duplicate volume and mount name error for generic ephemeral volume as "work" (#1471 ) * fix: Avoid duplicate volume and mount name error for generic ephemeral volume as "work" While manually testing configurations being documented in #1464, I discovered that the use of dynamic ephemeral volume for "work" directory was not working correctly due to the valiadation error. This fixes the runner pod generation logic to not add the default volume and volume mount for "work" dir, so that the error disappears. Ref #1464 * e2e: Ensure work generic ephemeral volume to work as expected	2022-05-22 10:25:50 +09:00
Yusuke Kuoka	84210f3d2b	Bump Go to 1.18.2 (#1462 ) As a part of #1298, I'm going to use Go fuzzing which is availabls since Go 1.18. Co-authored-by: Callum Tait <15716903+toast-gear@users.noreply.github.com>	2022-05-19 10:33:31 +01:00
Yusuke Kuoka	b5194fd75a	Enhance RunnerSet to optionally retain PVs accross restarts (#1340 ) * Enhance RunnerSet to optionally retain PVs accross restarts This is our initial attempt to bring back the ability to retain PVs across runner pod restarts when using RunnerSet. The implementation is composed of two new controllers, `runnerpersistentvolumeclaim-controller` and `runnerpersistentvolume-controller`. It all starts from our existing `runnerset-controller`. The controller now tries to mark any PVCs created by StatefulSets created for the RunnerSet. Once the controller terminated statefulsets, their corresponding PVCs are clean up by `runnerpersistentvolumeclaim-controller`, then PVs are unbound from their corresponding PVCs by `runnerpersistentvolume-controller` so that they can be reused by future PVCs createf for future StatefulSets that shares the same same StorageClass. Ref #1286 * Update E2E test suite to cover runner, docker, and go caching with RunnerSet + PVs Ref #1286	2022-05-16 09:26:48 +09:00
Yusuke Kuoka	dabbc99c78	refactor(controller): stop auto-setting RUNNER_FEATURE_FLAG_EPHEMERAL (#1385 ) This feature flag was provided from ARC to runner container automatically to let it use `--ephemeral` instead of `--once` by default. As the support for `--once` is being dropped from the runner image via #1384, we no longer need that. Ref #1196	2022-05-11 11:42:55 +01:00
Yusuke Kuoka	c1e5829b03	refactor(runner): ability to opt-out of using --ephemeral / opt-in to legacy --once for GHES older than 3.3 (#1384 ) * runner: Remove the ability to use the deprecated `--once` flag Ref #1196 * runner: Ability to opt-out of using --ephemeral Although we are going to eventually remove the ability to use the legacy --once flag as proposed in #1196, there might be folks still using legacy GHES versions 3.2 or earlier. This commit removes the existing feature flag to opt-in for --ephemeral, while adding another feature flag RUNNER_FEATURE_FLAG_ONCE to opt-in for --once so that folks stuck in legacy GHES versions can still use ARC. Since this change every user starts using --ephemeral by default. If they see any issues on legacy GHES instance, RUNNER_FEATURE_FLAG_ONCE=true can be set to opt-in to keep using --once, which gives one more ARC release until they upgrade their GHES instance. But beware, we won't support legacy GHES instances forever as it's going to be a maintenance nightmare. Please upgrade! Ref #1196	2022-05-11 09:55:33 +01:00
Yusuke Kuoka	631a70a35f	Fix runner pod to be cleaned up earlier regardless of the sync period (#1299 ) Ref #1291	2022-04-03 11:12:44 +09:00
Callum Tait	2cb04ddde7	* feat: move to new run.sh container friendly file (#1244 ) * fix: unit tests were very broken Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-03-22 19:02:51 +00:00
Yusuke Kuoka	1cc06e7408	e2e: Make enterprise runners optional for testing GitHub App As GitHub App does not allow ARC to access enterprise runner related API endpoints, like the create-registration-token API.	2022-03-13 13:11:26 +00:00
Yusuke Kuoka	c4b24f8366	Prevent static runners from terminating due to unregister timeout The unregister timeout of 1 minute (no matter how long it is) can negatively impact availability of static runner constantly running workflow jobs, and ephemeral runner that runs a long-running job. We deal with that by completely removing the unregistaration timeout, so that regarldess of the type of runner(static or ephemeral) it waits forever until it successfully to get unregistered before being terminated.	2022-03-13 07:26:36 +00:00
Yusuke Kuoka	da2adc0cc5	e2e: Omit RUNNER_FEATURE_FLAG_EPHEMERAL when TEST_FEATURE_FLAG_EPHEMERAL is not set	2022-03-12 14:08:23 +00:00
Yusuke Kuoka	c3dd1c5c05	e2e: Make TEST_FEATURE_FLAG_EPHEMERAL optional	2022-03-12 13:32:42 +00:00
Yusuke Kuoka	22ef7b3a71	acceptance,e2e: Fix deploy.sh and e2e_test.go for testing with GitHub App	2022-03-12 12:10:04 +00:00
Yusuke Kuoka	14a878bfae	refactor: Make RunnerReplicaSet and Runner backed by the same logic that backs RunnerSet	2022-03-06 05:53:26 +00:00
Yusuke Kuoka	5030e075a9	dockerfile,e2e: Use buildx and cache mounts for faster rebuilds in E2E	2022-03-02 19:03:20 +09:00
Yusuke Kuoka	3115d71471	acceptance,e2e: Enhance deploy.sh to support more types of runnersets	2022-03-02 19:03:20 +09:00
Yusuke Kuoka	d4a9750e20	acceptance,e2e: Enhance E2E test and deploy.sh to support scaleDownDelaySeconds~ and minReplicas for HRA	2022-02-20 13:45:42 +00:00
Yusuke Kuoka	4e6bfd8114	e2e: Add ability to toggle dockerdWithinRunnerContainer	2022-02-20 04:37:15 +00:00
Yusuke Kuoka	f3ceccd904	acceptance: Improve deploy.sh to recreate ARC (not runner) pods on new test id So that one does not need to manually recreate ARC pods frequently.	2022-02-19 12:22:53 +00:00
Yusuke Kuoka	ba4bd7c0db	e2e,acceptance: Cover enterprise runners (#1124 ) Adds various code and changes I have used while testing #1062	2022-02-17 09:16:28 +09:00
Chris Bui	1b911749a6	feat: disable automatic runner updates (#1088 ) * Add env variable to configure `disablupdate` flag * Write test for entrypoint disable update * Rename flag, update docs for DISABLE_RUNNER_UPDATE * chore: bump runner version in makefile Co-authored-by: Callum Tait <15716903+toast-gear@users.noreply.github.com>	2022-02-03 21:03:38 +00:00
Yusuke Kuoka	b305e38b17	Add webhook-based autoscale for Enterprise runners (#906 ) Fixes #892	2021-11-09 09:04:19 +09:00
Yusuke Kuoka	1a75b4558b	Fix E2E test to actualy pass I have a dedicated GitHub organization and a private repository to run this E2E test. After a few fixes included in this change, it has successfully passed.	2021-09-15 09:34:48 +09:00
Sebastien Le Digabel	bf35c51440	Adding unit test for ephemeral feature flag This was something that was missing in #707. Adding a new test to make sure the ephemeral feature flag from upstream is set up correctly by the script.	2021-09-14 16:37:25 +09:00
Sebastien Le Digabel	ec0915ce7c	Adding some unit testing for entrypoint.sh The unit tests are simulating a run for entrypoint. It creates some dummy config.sh and runsvc.sh and makes sure the logic behind entrypoint.sh is correct. Unfortunately the entrypoint.sh contains some sections that are not mockable so I had to put some logic in there too. Testing includes for now: - the normal scenario - the normal non-ephemeral scenario - the configuration failure scenario Also tested the entrypoint.sh on a real runner, still works as expected.	2021-09-06 08:51:28 +09:00
Yusuke Kuoka	c78116b0f9	e2e: Cover RunnerDeployment (#668 ) Previously the E2E test suite covered only RunnerSet. This refactors the existing E2E test code to extract the common test structure into a `env` struct and its methods, and use it to write two very similar tests, one for RunnerSet and another for RunnerDeployment.	2021-06-29 17:52:43 +09:00
Yusuke Kuoka	7722730dc0	e2e: Concurrent workflow jobs (#667 ) Enhances out existing E2E test suite to additionally support triggering two or more concurrent workflow jobs and verifying all the results, so that you can ensure the runners managed by the controller are able to handle jobs reliably when loaded.	2021-06-29 14:34:27 +09:00
Yusuke Kuoka	7a305d2892	e2e: Install and run workflow and verify the result (#661 ) This enhances the E2E test suite introduced in #658 to also include the following steps: - Install GitHub Actions workflow - Trigger a workflow run via a git commit - Verify the workflow run result In the workflow, we use `kubectl create cm --from-literal` to create a configmap that contains an unique test ID. In the last step we obtain the configmap from within the E2E test and check the test ID to match the expected one. To install a GitHub Actions workflow, we clone a GitHub repository denoted by the TEST_REPO envvar, progmatically generate a few files with some Go code, run `git-add`, `git-commit`, and then `git-push` to actually push the files to the repository. A single commit containing an updated workflow definition and an updated file seems to run a workflow derived to the definition introduced in the commit, which was a bit surpirising and useful behaviour. At this point, the E2E test fully covers all the steps for a GitHub token based installation. We need to add scenarios for more deployment options, like GitHub App, RunnerDeployment, HRA, and so on. But each of them would worth another pull request.	2021-06-28 08:30:32 +09:00
Yusuke Kuoka	2703fa75d6	Add e2e test (#658 ) This is the initial version of our E2E test suite which is currently a subset of the acceptance test suite reimplemented in Go. To run it, pass `-run ^TestE2E$` to `go test`, without `-short`, like `go test -timeout 600s -run ^TestE2E$ github.com/actions-runner-controller/actions-runner-controller/test/e2e -v`. `make test` is modified to pass `-short` to `go test` by default to skip E2E tests. The biggest benefit of rewriting the acceptance test in Go turned out to be the fact that you can easily rerun each step- a go-test "subtest"- individually from your IDE, for faster turnaround. Both VS Code and IntelliJ IDEA/GoLand are known to work. In the near future, we will add more steps to the suite, like actually git-comminting some Actions workflow and pushing some commit to trigger a workflow run, and verify the workflow and job run results, and finally run it on our `test` workflow to fully automated E2E testing. But that s another story.	2021-06-27 16:28:07 +09:00

1 2 3

114 Commits