actions-runner-controller

Commit Graph

Author	SHA1	Message	Date
Callum Tait	212b9daec3	feat: 22.04 default runner image (#2050 ) * feat: 22.04 default runner image * docs: update bundled software * chore: remove test in Dockerfile * ci: add 22.04 runner build * chore: remove build-essential * chore: remove python path entry Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-12-02 07:29:59 +09:00
Callum Tait	c1fb793773	feat: bump docker and hooks in 20.04 (#2063 ) Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-12-02 06:40:12 +09:00
Callum Tait	63d2cbfdaa	ci: multiple ubuntu version (#2036 ) * ci: prepare ci for multiple runners * chore: rename dockerfiles * chore: sup multiple os in makefile * chore: changes to support multiple versions * chore: remove test for TARGETPLATFORM * chore: fixes and add individual targets * ci: add latest tag back in * ci: remove latest suffix tag Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-12-01 00:00:16 +09:00
Igor Sarkisov	95c324b550	Add rootless runner to the Makefile and improve target platform handling. (#2005 ) * Add rootless runner to the Makefile and improve target platform handling * Add rootless image to docker-push-ubuntu target * Update runner/Makefile * Update runner/actions-runner-dind-rootless.dockerfile * Update runner/actions-runner-dind.dockerfile * Update runner/actions-runner.dockerfile Co-authored-by: Yusuke Kuoka <ykuoka@gmail.com>	2022-11-26 18:10:26 +09:00
Callum Tait	87f566e1e6	feat: add docker-compose and clean up the default runner (#1924 ) * feat: clean and add docker-compose * feat: make docker compose download arch aware * fix: use new ARG name * fix: correct case in url * ci: add some debug output to workflow * ci: add ARG for docker * fix: various fixes * chore: more alignment changes * chore: use /usr/bin over /usr/local/bin * chore: more logical order * fix: add recursive flag * chore: actions/runner stuff with actions/runner * ci: bump checkout to latest * fix: rootless build Co-authored-by: toast-gear <toast-gear@users.noreply.github.com> Co-authored-by: Yusuke Kuoka <ykuoka@gmail.com>	2022-11-25 10:31:13 +09:00
Callum Tait	666ce8f917	feat: add docker-compose and clean up the dind runner (#1925 ) * feat: align runner and add docker compose * feat: make docker compose download arch aware * fix: use new ARG name * chore: alignment stuff * chore: use /usr/bin over /usr/local/bin * chore: replicate default runner order * feat: set-up actions container hooks * chore: small flags * fix: install all docker components Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-11-22 12:10:38 +09:00
Callum Tait	9ba4b6b96a	chore: clean up the dind rootless dockerfile so it aligns with the other runners (#1926 ) * chore: align dockerfile with other runners * chore: superfluous comments * feat: make docker compose download arch aware * chore: stuff * chore: align runner tool cache set-up * fix: copy and paste error * feat: add container hooks * feat: add rootless into makefile * feat: support all architectures and fix compose * fix: export SKIP_IPTABLES correctly Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-11-22 12:10:28 +09:00
Yusuke Kuoka	154fcde7d0	runner: Make WAIT_FOR_DOCKER_SECONDS configurable and working (#1999 ) * runner: Make WAIT_FOR_DOCKER_SECONDS configurable and working Ref #1830 Ref #1804 * Update acceptance/testdata/runnerdeploy.envsubst.yaml Co-authored-by: Callum Tait <15716903+toast-gear@users.noreply.github.com> * Update docs/detailed-docs.md Co-authored-by: Callum Tait <15716903+toast-gear@users.noreply.github.com> Co-authored-by: Callum Tait <15716903+toast-gear@users.noreply.github.com>	2022-11-22 12:08:54 +09:00
Richard Fussenegger	61d1235d2a	Added `DEBIAN_FRONTEND=noninteractive` to `sudo` (#1859 ) By default `sudo` drops all environment variables and executes its commands with a clean environment. This is by design, but for the `DEBIAN_FRONTEND` environment variable it is not what we want, since it results in installers being interactive. This adds the `env_keep` instruction to `/etc/sudoers` to keep `DEBIAN_FRONTEND` with its `noninteractive` value, and thus pass it on to commands that care about it. Note that this makes no difference in our builds, because we are running them directly as `root`. However, for users of our image this is going to make a difference, since they start out as `runner` and have to use `sudo`. Co-authored-by: Fleshgrinder <fleshgrinder@users.noreply.github.com>	2022-11-05 17:20:53 +09:00
Claudio Vellage	3b36a81db6	Allow to set docker default address pool (#1971 ) * Allow to set docker default address pool * fixup! Allow to set docker default address pool Signed-off-by: Yusuke Kuoka <ykuoka@gmail.com> * Revert unnecessary chart ver bump * Update docs for DOCKER_DEFAULT_ADDRESS_POOL_* * Fix the dockerd default address pool scripts to actually work as probably intended * Update the E2E testdata runnerdeployment to accomodate the new docker default addr pool options * Correct default dockerd addr pool doc Signed-off-by: Yusuke Kuoka <ykuoka@gmail.com> Co-authored-by: Claudio Vellage <claudio.vellage@pm.me> Co-authored-by: Yusuke Kuoka <ykuoka@gmail.com>	2022-11-05 14:46:32 +09:00
Yusuke Kuoka	63e8f32281	Fix permission issue when you use PV for rootless dind cache (#1977 ) * Fix permission issue when you use PV for rootless dind cache This fixes the said issue I have found while testing #1759. Signed-off-by: Yusuke Kuoka <ykuoka@gmail.com>	2022-11-04 06:46:21 +09:00
Yusuke Kuoka	8505c95719	runner: Fix rootless dind to respect specified MTU (#1976 ) While testing #1759, I found an issue in the rootless dind entrypoint that it was not respecting the configured MTU for dind docker due to a permission issue. This fixes that.	2022-11-04 06:29:03 +09:00
Yusuke Kuoka	3de8085b87	Fix rootless dind to do write logs (#1978 ) It turned out too hard to debug configuration issues on the rootless dind daemon as it was not writing any logs to stdout/stderr of the container. This fixes that, so that any rootless dind configuration or startup errors are visible in e.g. the kubectl-logs output.	2022-11-04 06:28:47 +09:00
renovate[bot]	6234c568bd	chore(deps): update dependency actions/runner to v2.299.1 (#1973 ) Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>	2022-11-03 14:40:06 +09:00
Yusuke Kuoka	c74ad6195f	Fix runners to do their best to gracefully stop on pod eviction (#1759 ) Ref #1535 Ref #1581 Signed-off-by: Yusuke Kuoka <ykuoka@gmail.com>	2022-11-01 20:30:10 +09:00
Yusuke Kuoka	e1762ba746	Fix inability to configure MTU for rootless dind runner (#1856 ) Follow-up for https://github.com/actions-runner-controller/actions-runner-controller/pull/1644	2022-10-13 09:04:56 +09:00
renovate[bot]	437d0173b0	chore(deps): update dependency actions/runner to v2.298.2 (#1891 ) Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>	2022-10-05 08:16:38 +09:00
Yusuke Kuoka	2dd13b4a19	runner: Address all shellcheck findings (#1854 ) I am about to revisit #1517, #1454, #1561, and #1560 as a part of our on-going effort for a major enhancement to the runner entrypoints being made in #1759. This change updates and reintroduces #1517 contributed by @CASABECI in a way it becomes applicable to today's code-base.	2022-10-04 20:30:27 +09:00
renovate[bot]	5fd6ec4bc8	chore(deps): update dependency actions/runner to v2.297.0 (#1860 ) Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>	2022-09-27 09:11:53 +09:00
Yusuke Kuoka	f3fcb428ae	rootless-dind-dockerfile: Add comment about installation path	2022-09-25 07:50:12 +09:00
Yusuke Kuoka	41bae32a9f	runner: Dump supervisor log on dockerd timeout	2022-09-25 07:50:12 +09:00
Yusuke Kuoka	e5bb130fda	Add MTU propagation docker-shim also to rootless dind runner images Related to #1201	2022-09-25 07:50:12 +09:00
Tiago Melo	e7a21cfc53	feat: Add container to propagate host network MTU (#1201 ) * feat: Add container to propagate host network MTU Some network environments use non-standard MTU values. In these situations, the `DockerMTU` setting might be used to specify the MTU setting for the `bridge` network created by Docker. However, when the Github Actions workflow creates networks, it doesn't propagate the `bridge` network MTU which can lead to `connection reset by peer` messages. To overcome this, I've created a new docker image called `summerwind/actions-runner-mtu` that shims the docker binary in order to propagate the MTU setting to networks created by Github workflows. This is a follow-up on the discussion in (#1046)[https://github.com/actions-runner-controller/actions-runner-controller/issues/1046] and uses a separate image since there might be some unintended side-effects with this approach. * fixup! feat: Add container to propagate host network MTU Co-authored-by: Yusuke Kuoka <ykuoka@gmail.com>	2022-09-23 17:08:28 +09:00
Frederic MARTIN	e32a8054d0	🍱 add git-lfs package as standard tool (#1821 )	2022-09-21 11:04:43 +09:00
David Girón	e4fd4bc99c	Update dependency docker/cli to v20.10.18 (#1803 )	2022-09-16 10:25:12 +09:00
renovate[bot]	0615c2adb1	chore(deps): update dependency actions/runner to v2.296.2 (#1791 ) Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>	2022-09-09 18:43:00 +09:00
renovate[bot]	e233f7ad6a	chore(deps): update dependency actions/runner to v2.296.1	2022-09-01 12:31:39 +00:00
renovate[bot]	55ca7bfdf5	chore(deps): update dependency actions/runner to v2.296.0	2022-08-23 19:47:18 +00:00
Callum Tait	3724b46033	chore(deps): update dependency actions/runner to v2.295.0 (#1723 )	2022-08-16 20:11:46 +09:00
renovate[bot]	784019f3d7	chore(deps): update dependency actions/runner to v2.295.0	2022-08-11 11:36:27 +00:00
Natalie Somersall	fc55477c1c	remove fuse-overlayfs (#1690 )	2022-08-04 13:25:55 +09:00
Natalie Somersall	37aa1a0b8c	Add rootless DinD runner (#1644 ) * add rootless dind images * add small blurb on rootless dind * Add ToC entry for README section	2022-08-03 11:45:02 +09:00
k.bigwheel (kazufumi nishida)	98b17dc0a5	Fix the dind image to work with the latest entrypoint.sh (#1624 ) Fixes #1621	2022-07-12 09:11:04 +09:00
Giovanni Barillari	c658dcfa6d	fix #1621 : add missing COPY statements to dind docker image	2022-07-11 20:44:35 +09:00
Felipe Galindo Sanchez	11cb9b7882	feat: allow to discover runner statuses (#1268 ) * feat: allow to discover runner statuses * fix manifests * Bump runner version to 2.289.1 which includes the hooks support * Add feedback from review * Update reference to newRunnerPod * Fix TestNewRunnerPodFromRunnerController and make hooks file names job specific * Fix additional TestNewRunnerPod test * Cover additional feedback from review * fix rbac manager role * Add permissions to service account for container mode if not provided * Rename flag to runner.statusUpdateHook.enabled and fix needsServiceAccount Co-authored-by: Yusuke Kuoka <ykuoka@gmail.com>	2022-07-10 15:11:29 +09:00
Callum Tait	e3deb0d752	chore: move runner docker check (#1548 )	2022-06-30 11:31:50 +09:00
Callum Tait	82641e5036	chore: move HOME to more logical place (#1460 ) * chore: move HOME to more logical place * chore: don't break the PATH * chore: don't break the PATH Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-06-30 11:21:05 +09:00
Vladyslav Miletskyi	2fe6adf5b7	Runner Entrypoint: fix daemon.json (#1409 ) * Runner Entrypoint: fix daemon.json Do not owerwrite daemon.json if it already exists. Usage: custom images, which are using public image as source. * Update runner/startup.sh Co-authored-by: Callum Tait <15716903+toast-gear@users.noreply.github.com> Co-authored-by: Callum Tait <15716903+toast-gear@users.noreply.github.com>	2022-06-30 11:03:12 +09:00
Yusuke Kuoka	9b28e633c1	Drop support for --once (#1580 ) Ref #1196	2022-06-29 21:49:52 +09:00
Thomas Boop	0386c0734c	`containerMode` option to allow running jobs in k8's instead of docker (#1546 ) * added containerMode=kubernetes env variables to the runner * removed unused logging * restored configs and charts * restored makefile cert version and acceptance/run * added workVolumeClaimTemplate in pod definition, including logic * added claim template name based on the runner * Apply suggestions from code review update errors * added concurrent cleanup before runner pod is deleted * update manifests * added retry after 30s if pod cleanup contains err * added admission webhook check, made workVolumeClaimTemplate mandatory for k8s * style changes and added comments * added izZero timestamp check for deleting runner-linked pods * changed order of local variable to avoid copy if p is deleted * removed docker from container mode k8s * restored charts, config, makefile * restored forked files back and not the ARC ones * created PersistentVolume on containerMode k8s * create pv only if storage class name is local-storage * removed actions if storage class name is local-storage * added service account validation if container mode kubernetes * changed the coding style to match rest of the ARC * added validation to the runnerdeployment webhook * specified fields more precisely, added webhook validation to the replicaset as well * remake manifests * wraped delete runner-linked-pods in kube mode * fixed empty line * fixed import * makefile changes for hooks * added cleanup secrets * create manifests * docs * update access modes * update dockerfile * nit changes * fixed dockerfile * rewrite allowing reuse for runners and runnersets * deepcopy forgot to stage * changed privileged * make manifests * partly moved to finalizer, still need to apply finalizer first * finalizer added if env variable used in container mode exists * bump runner version * error message moved from Error to Info on cleanup pods/secrets * removed useless dereferencing, added transformation tests of workVolumeClaimTemplate * Apply suggestions from code review * Update controllers/utils_test.go Co-authored-by: Thomas Boop <52323235+thboop@users.noreply.github.com> * Update controllers/utils_test.go Co-authored-by: Thomas Boop <52323235+thboop@users.noreply.github.com> * add hook version to cli, update to 0.1.2 * Apply suggestions from code review * Update controllers/utils_test.go * Update runner/Makefile * Fix missing secret permission and the error handling * Fix a runnerpod reconciler finalizer to not trigger unnecessary retry Co-authored-by: Nikola Jokic <nikola-jokic@github.com> Co-authored-by: Nikola Jokic <97525037+nikola-jokic@users.noreply.github.com> Co-authored-by: Yusuke Kuoka <ykuoka@gmail.com>	2022-06-28 14:12:40 +09:00
Callum Tait	84d16c1c12	revert: "Overhauled `startup.sh` Script (#1454 )" (#1561 ) This reverts commit `071898c96b`.	2022-06-23 12:39:32 +01:00
Richard Fussenegger	071898c96b	Overhauled `startup.sh` Script (#1454 ) This overhaul turns it into a shellcheck valid script with explicit error handling for all possible situations I could think of. This change takes https://github.com/actions-runner-controller/actions-runner-controller/pull/1409 into account and things can be merged in any order. There are a few important changes here to the logic: - The wait logic for checking if docker comes up was fundamentally flawed because it checks for the PID. Docker will always come up and thus become visible in the process list, just to immediately die when it encounters an issue, after which supervisor starts it again. This means that our check so far is flaky due to the `sleep 1` it might encounter a PID, or it might not, and the existence of the PID does not mean anything. The `docker ps` check we have in the `entrypoint.sh` script does not suffer from this as it checks for a feature of docker and not a PID. I thus entirely removed the PID check, and instead I am handing things over to our `entrypoint.sh` script by setting the environment variables correctly. - This change has an influence on the `docker0` interface MTU configuration, because the interface might or might not exist after we started docker. Hence, I changed this to a time boxed loop that tries for one minute to set up the interface's MTU. In case the command fails we log an error and continue with the run. - I changed the entire MTU handling by validating its value before configuring it, logging an error and continuing without if it is set incorrectly. This ensures that we are not going to send our users on a bug hunt. - The way we started supervisord did not make much sense to me. It sends itself into the background automatically, there is no need for us to do so with Bash. The decision to not fail on errors but continue is a deliberate choice, because I believe that running a build is more important than having a perfectly configured system. However, this strategy might also hide issues for all users who are not properly checking their logs. It also makes testing harder. Hence, we could change all error conditions from graceful to panicking. We should then align the exit codes across `startup.sh` and `entrypoint.sh` to ensure that every possible error condition has its own unique error code for easy debugging.	2022-06-23 09:37:01 +09:00
renovate[bot]	f24e2fa44e	chore(deps): update dependency actions/runner to v2.294.0	2022-06-22 21:45:32 +00:00
Renovate Bot	933b0c7888	chore(deps): update dependency actions/runner to v2.293.0	2022-06-13 17:09:29 +00:00
renovate[bot]	ac27df8301	chore(deps): update dependency actions/runner to v2.292.0 (#1475 ) Co-authored-by: Renovate Bot <bot@renovateapp.com>	2022-05-27 09:49:46 +09:00
Bernardo Meurer	bf45aa9f6b	refactor(runner/entrypoint): don't mv externalstmp if it's not there (#1315 )	2022-05-16 18:37:37 +09:00
Richard Fussenegger	cdc9d20e7a	Renamed Runner Dockerfiles (#1248 ) Renamed the runner dockerfiles so that we have proper syntax highlighting for them, as well as a consistent way to map from the image name to the dockerfile. Added a `.dockerignore` file to avoid uploading things to the daemon that we never use.	2022-05-16 11:41:28 +09:00
Yusuke Kuoka	c1e5829b03	refactor(runner): ability to opt-out of using --ephemeral / opt-in to legacy --once for GHES older than 3.3 (#1384 ) * runner: Remove the ability to use the deprecated `--once` flag Ref #1196 * runner: Ability to opt-out of using --ephemeral Although we are going to eventually remove the ability to use the legacy --once flag as proposed in #1196, there might be folks still using legacy GHES versions 3.2 or earlier. This commit removes the existing feature flag to opt-in for --ephemeral, while adding another feature flag RUNNER_FEATURE_FLAG_ONCE to opt-in for --once so that folks stuck in legacy GHES versions can still use ARC. Since this change every user starts using --ephemeral by default. If they see any issues on legacy GHES instance, RUNNER_FEATURE_FLAG_ONCE=true can be set to opt-in to keep using --once, which gives one more ARC release until they upgrade their GHES instance. But beware, we won't support legacy GHES instances forever as it's going to be a maintenance nightmare. Please upgrade! Ref #1196	2022-05-11 09:55:33 +01:00
Renovate Bot	800d6bd586	chore(deps): update dependency actions/runner to v2.291.1	2022-04-29 19:05:31 +00:00
Callum Tait	059481b610	refactor: remove legacy controller Docker build (#1360 ) [skip ci] * refactor: remove legacy build and use buildkit * refactor: add runner version to root makefie * refactor: enable buildkit for runner make build * refactor: ignore runner makefile in ci Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-04-27 08:21:02 +01:00
Renovate Bot	81951780b1	chore(deps): update dependency actions/runner to v2.290.1	2022-04-14 18:36:24 +00:00
Callum Tait	352e206148	refactor: use apt-get instead of apt (#1342 ) Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-04-13 09:40:15 +01:00
Richard Fussenegger	6288036ed4	Removed `modprobe` Script (#1247 ) [skip ci] * Removed `modprobe` Script I was able to find out that this script originates from https://github.com/docker-library/docker/blob/master/modprobe.sh but our image does not have `lsmod` nor `modprobe` installed. Hence, if it were in use, it would fail every time. 🤔 * fix: correct command order Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-04-13 09:39:55 +01:00
Callum Tait	4a3b7bc8d5	refactor: location of some runner cmds (#1337 ) Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-04-12 22:18:34 +01:00
Richard Fussenegger	8db071c4ba	Improved Bash Logger (#1246 ) * Improved Bash Logger This is a first step towards having robust Bash scripts in the runner images. The changes _could_ be considered breaking, depending on our backwards compatibility definition. * Fixed Log Formatting Issues Co-authored-by: Callum Tait <15716903+toast-gear@users.noreply.github.com>	2022-04-12 22:02:06 +01:00
Renovate Bot	7b8057e417	chore(deps): update dependency actions/runner to v2.290.0	2022-04-12 20:46:19 +00:00
Rolf Ahrenberg	7124451cea	chore: fix typo (#1316 ) [skip ci]	2022-04-08 17:32:01 +01:00
Bernardo Meurer	e46df413a1	refactor(runner/entrypoint): check for externalstmp (#1277 ) * refactor(runner/entrypoint): check for externalstmp [skip ci] Co-authored-by: Callum Tait <15716903+toast-gear@users.noreply.github.com>	2022-03-30 12:18:18 +01:00
Milan Aleks	13e7b440a8	chore: typo fix in runner Dockerfile [skip ci] (#1270 )	2022-03-29 11:05:24 +01:00
Yusuke Kuoka	debf53c640	Fix missing pip bin path (/home/runner/.local/bin) (#1263 ) Fixes #1261	2022-03-23 10:28:12 +09:00
Callum Tait	2cb04ddde7	* feat: move to new run.sh container friendly file (#1244 ) * fix: unit tests were very broken Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-03-22 19:02:51 +00:00
Richard Fussenegger	532a2bb2a9	feat: remove registration-only runner logic from entrypoint (#1249 ) Closes #1207	2022-03-22 18:33:14 +00:00
Richard Fussenegger	a68eede616	feat: copy dotfiles from asset to service dir (#1136 ) * feat: copy dotfiles from asset to service dir * Fixed `UNITTEST` Condition * Load `/etc/environment` See https://github.com/actions/runner/issues/1703 for context on this change.	2022-03-18 07:40:52 +00:00
toast-gear	c4c6e833a7	chore: add deprecation warning	2022-03-14 12:35:07 +00:00
Callum Tait	6f591ee774	chore: bump docker version (#1094 ) * chore: bump docker version Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-02-07 20:10:02 +00:00
Callum Tait	cc25dd7926	chore: change to trigger build (#1093 ) * chore: change to trigger build * ci: actually use variable Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-02-03 21:23:42 +00:00
Chris Bui	1b911749a6	feat: disable automatic runner updates (#1088 ) * Add env variable to configure `disablupdate` flag * Write test for entrypoint disable update * Rename flag, update docs for DISABLE_RUNNER_UPDATE * chore: bump runner version in makefile Co-authored-by: Callum Tait <15716903+toast-gear@users.noreply.github.com>	2022-02-03 21:03:38 +00:00
Callum Tait	f09a974ac2	chore: change to trigger build (#1079 ) * chore: change to trigger build Co-authored-by: toast-gear <toast-gear@users.noreply.github.com>	2022-01-28 21:57:53 +00:00
cspargo	9d5a562407	fix: use copy instead of move (#1066 ) * fix: use copy instead of move Co-authored-by: Colin Spargo <cspargo@users.noreply.github.com>	2022-01-28 21:24:52 +00:00
Callum Tait	ad48851dc9	feat: expose if docker is enabled and wait for docker to be ready (#962 ) Resolves #897 Resolves #915	2021-12-29 10:23:35 +09:00
Callum Tait	031b1848e0	ci: separate ubuntu versions out in ci (#969 ) * ci: separate ubuntu versions out in ci	2021-11-30 14:09:33 +00:00
Rolf Ahrenberg	e5b5ee6f1d	Make target platform configurable for runner builds	2021-09-14 16:37:04 +09:00
Sebastien Le Digabel	a98729b08b	Adding github action for entrypoint unit test ... and adding safety mechanism in UNITTEST handling.	2021-09-06 08:51:28 +09:00
Sebastien Le Digabel	ec0915ce7c	Adding some unit testing for entrypoint.sh The unit tests are simulating a run for entrypoint. It creates some dummy config.sh and runsvc.sh and makes sure the logic behind entrypoint.sh is correct. Unfortunately the entrypoint.sh contains some sections that are not mockable so I had to put some logic in there too. Testing includes for now: - the normal scenario - the normal non-ephemeral scenario - the configuration failure scenario Also tested the entrypoint.sh on a real runner, still works as expected.	2021-09-06 08:51:28 +09:00
Sebastien Le Digabel	d355f05ac0	Adding retry after config and formatted logging Adding a basic retry loop during configuration. If configuration fails, the runner will just straight into a retry loop and will continuously fail until it dies after a while. This change will retry 10 times and will exit if the configuration wasn't successful. Also, changed the logging format, adding a bit of color in the event of success or failure.	2021-09-06 08:51:28 +09:00
toast-gear	5b4b65664c	chore: bump actions runner version (#736 )	2021-08-19 14:47:17 +01:00
toast-gear	b6465c5d09	chore: bump docker and runner version and add imageos env var (#730 ) * chore: bump runner version * chore: bump docker version * feat: add in ImageOS env var * chore: adding missing fail switches	2021-08-18 15:50:17 +01:00
Hiroki Matsumoto	dc9f9b0bfb	fix: arch type with downloading dumb-init. (#723 ) * fix: arch type with downloading dumb-init. * fix: arch type with downloading dumb-init in Dockerfile.dindrunner * fix: add -f option with curl	2021-08-11 16:43:25 +01:00
callum-tait-pbx	a9421edd46	chore: bump dumb-init (#710 ) * chore: bump dumb-init and align files * ci: align make file with root make file	2021-08-11 09:55:09 +09:00
Rob Bos	fb66b28569	Change `move` command to `copy` to prevent issues (#716 ) Prevents issues when /runner and /runnertmp are in different devices Fixes #686	2021-08-11 09:53:42 +09:00
Yusuke Kuoka	fabead8c8e	feat: Workflow job based ephemeral runner scaling (#721 ) This add support for two upcoming enhancements on the GitHub side of self-hosted runners, ephemeral runners, and `workflow_jow` events. You can't use these yet. These features are not yet generally available to all GitHub users. Please take this pull request as a preparation to make it available to actions-runner-controller users as soon as possible after GitHub released the necessary features on their end. Ephemeral runners: The former, ephemeral runners, is basically the reliable alternative to `--once`, which we've been using when you enabled `ephemeral: true` (default in actions-runner-controller). `--once` has been suffering from a race issue #466. `--ephemeral` fixes that. To enable ephemeral runners with `actions/runner`, you give `--ephemeral` to `config.sh`. This updated version of `actions-runner-controller` does it for you, by using `--ephemeral` instead of `--once` when you set `RUNNER_FEATURE_FLAG_EPHEMERAL=true`. Please read the section `Ephemeral Runners` in the updated version of our README for more information. Note that ephemeral runners is not released on GitHub yet. And `RUNNER_FEATURE_FLAG_EPHEMERAL=true` won't work at all until the feature gets released on GitHub. Stay tuned for an announcement from GitHub! `workflow_job` events: `workflow_job` is the additional webhook event that corresponds to each GitHub Actions workflow job run. It provides `actions-runner-controller` a solid foundation to improve our webhook-based autoscale. Formerly, we've been exploiting webhook events like `check_run` for autoscaling. However, as none of our supported events has included `labels`, you had to configure an HRA to only match relevant `check_run` events. It wasn't trivial. In contrast, a `workflow_job` event payload contains `labels` of runners requested. `actions-runner-controller` is able to automatically decide which HRA to scale by filtering the corresponding RunnerDeployment by `labels` included in the webhook payload. So all you need to use webhook-based autoscale will be to enable `workflow_job` on GitHub and expose actions-runner-controller's webhook server to the internet. Note that the current implementation of `workflow_job` support works in two ways, increment, and decrement. An increment happens when the webhook server receives` workflow_job` of `queued` status. A decrement happens when it receives `workflow_job` of `completed` status. The latter is used to make scaling-down faster so that you waste money less than before. You still don't suffer from flapping, as a scale-down is still subject to `scaleDownDelaySecondsAfterScaleOut `. Please read the section `Example 3: Scale on each `workflow_job` event` in the updated version of our README for more information on its usage.	2021-08-11 09:52:04 +09:00
toast-gear	743e6d6202	feat: bump runner version (#705 ) * feat: bump runner version * feat: remove deprecated env var * docs: updating the docs Co-authored-by: Callum James Tait <callum.tait@photobox.com>	2021-07-30 19:58:04 +09:00
toast-gear	82d1be7791	chore: deprecate STARTUP_DELAY (#678 ) * chore: deprecate STARTUP_DELAY * chore: adding better comments * chore: whitespace correction	2021-07-03 11:51:07 +01:00
toast-gear	044f4ad4ea	chore: updating to use non-deprecated env var (#660 ) Fixes #659 Co-authored-by: Callum James Tait <callum.tait@photobox.com>	2021-06-29 08:54:59 +09:00
toast-gear	605ec158f4	fix: make AGENT_TOOLSDIRECTORY an env var (#657 ) Co-authored-by: Callum James Tait <callum.tait@photobox.com>	2021-06-26 20:51:10 +09:00
Yusuke Kuoka	8b90b0f0e3	Clean up import list (#645 ) Resolves #644	2021-06-22 17:55:06 +09:00
Shubham Gopale	1084a37174	We are exiting if its a registration-only runner (#641 )	2021-06-22 17:26:03 +09:00
Yusuke Kuoka	9e4dbf497c	feat: RunnerSet backed by StatefulSet (#629 ) * feat: RunnerSet backed by StatefulSet Unlike a runner deployment, a runner set can manage a set of stateful runners by combining a statefulset and an admission webhook that mutates statefulset-managed pods with required envvars and registration tokens. Resolves #613 Ref #612 * Upgrade controller-runtime to 0.9.0 * Bump Go to 1.16.x following controller-runtime 0.9.0 * Upgrade kubebuilder to 2.3.2 for updated etcd and apiserver following local setup * Fix startup failure due to missing LeaderElectionID * Fix the issue that any pods become unable to start once actions-runner-controller got failed after the mutating webhook has been registered * Allow force-updating statefulset * Fix runner container missing work and certs-client volume mounts and DOCKER_HOST and DOCKER_TLS_VERIFY envvars when dockerdWithinRunner=false * Fix runnerset-controller not applying statefulset.spec.template.spec changes when there were no changes in runnerset spec * Enable running acceptance tests against arbitrary kind cluster * RunnerSet supports non-ephemeral runners only today * fix: docker-build from root Makefile on intel mac * fix: arch check fixes for mac and ARM * ci: aligning test data format and patching checks * fix: removing namespace in test data * chore: adding more ignores * chore: removing leading space in shebang * Re-add metrics to org hra testdata * Bump cert-manager to v1.1.1 and fix deploy.sh Co-authored-by: toast-gear <15716903+toast-gear@users.noreply.github.com> Co-authored-by: Callum James Tait <callum.tait@photobox.com>	2021-06-22 17:10:09 +09:00
Tim Birkett	a93fd21f21	feat: add STARTUP_DELAY to entrypoint.sh (#592 ) Ref #591 Co-authored-by: Yusuke Kuoka <ykuoka@gmail.com>	2021-06-04 08:57:59 +09:00
Vladyslav Miletskyi	30ab0c0b71	Fix actions-runner-dind not to fail setting up MTU (#589 ) Fixes #588	2021-06-04 08:54:46 +09:00
toast-gear	2e083bca28	fix: fixing mising pip PATH (#585 ) * fix: fixing mising pip PATH * chore: removing User Site Directory Co-authored-by: Callum James Tait <callum.tait@photobox.com>	2021-06-01 09:21:14 +09:00
Callum James Tait	859e04a680	chore: moving python to alphabetical order	2021-05-26 09:32:01 +09:00
Callum James Tait	c0821d4ede	chore: correcting lists removal path	2021-05-26 09:32:01 +09:00
Callum James Tait	c3a6e45920	chore: aligning package order	2021-05-26 09:32:01 +09:00
Callum James Tait	818dfd6515	chore: whitespace alignment	2021-05-26 09:32:01 +09:00
Callum James Tait	726b39aedd	feat: adding pip to base image	2021-05-26 09:32:01 +09:00
Thejas N	588872a316	feat: allow ephemeral runner to be optional (#498 ) - Adds `ephemeral` option to `runner.spec` ``` .... template: spec: ephemeral: false repository: mumoshu/actions-runner-controller-ci .... ``` - `ephemeral` defaults to `true` - `entrypoint.sh` in runner/Dockerfile modified to read `RUNNER_EPHEMERAL` flag - Runner images are backward-compatible. `--once` is omitted only when the new envvar `RUNNER_EPHEMERAL` is explicitly set to `false`. Resolves #457	2021-05-02 19:04:14 +09:00
Yusuke Kuoka	dbd7b486d2	feat: Support for scaling from/to zero (#465 ) This is an attempt to support scaling from/to zero. The basic idea is that we create a one-off "registration-only" runner pod on RunnerReplicaSet being scaled to zero, so that there is one "offline" runner, which enables GitHub Actions to queue jobs instead of discarding those. GitHub Actions seems to immediately throw away the new job when there are no runners at all. Generally, having runners of any status, `busy`, `idle`, or `offline` would prevent GitHub actions from failing jobs. But retaining `busy` or `idle` runners means that we need to keep runner pods running, which conflicts with our desired to scale to/from zero, hence we retain `offline` runners. In this change, I enhanced the runnerreplicaset controller to create a registration-only runner on very beginning of its reconciliation logic, only when a runnerreplicaset is scaled to zero. The runner controller creates the registration-only runner pod, waits for it to become "offline", and then removes the runner pod. The runner on GitHub stays `offline`, until the runner resource on K8s is deleted. As we remove the registration-only runner pod as soon as it registers, this doesn't block cluster-autoscaler. Related to #447	2021-05-02 16:11:36 +09:00
ToMe25	ba175148c8	Locally build runner image instead of pulling it (#473 ) * Fix acceptance helm test not using newly built controller image * Locally build runner image instead of pulling it * Revert runner controller image pull policy to always and add a line to the test deployment to use IfNotPresent * Change runner repository from summerwind/action-runner to the owner of actions-runner-controller. Also fix some Makefile formatting. * Undo renaming acceptance/pull to docker-pull * Some env var cleanup Rename USERNAME to DOCKER_USER(is still used for github too tho) Add RUNNER_NAME var(defaults to $DOCKER_USER/actions-runner) Add TEST_REPO(defaults to $DOCKER_USER/actions-runner-controller)	2021-05-01 15:10:57 +09:00
callum-tait-pbx	db45a375d0	chore: bump runner (#486 ) * chore: bump runner * chore: bumper runner in ci	2021-04-27 08:38:40 +09:00

1 2 3 4 5

201 Commits