postgres-operator

Commit Graph

Author	SHA1	Message	Date
Christian Rohmann	ece341d516	Allow pod environment variables to also be sourced from a secret (#946 ) * Extend operator configuration to allow for a pod_environment_secret just like pod_environment_configmap * Add all keys from PodEnvironmentSecrets as ENV vars (using SecretKeyRef to protect the value) * Apply envVars from pod_environment_configmap and pod_environment_secrets before doing the global settings from the operator config. This allows them to be overriden by the user (via configmap / secret) * Add ability use a Secret for custom pod envVars (via pod_environment_secret) to admin documentation * Add pod_environment_secret to Helm chart values.yaml * Add unit tests for PodEnvironmentConfigMap and PodEnvironmentSecret - highly inspired by @kupson and his very similar PR #481 * Added new parameter pod_environment_secret to operatorconfig CRD and configmap examples * Add pod_environment_secret to the operationconfiguration CRD Co-authored-by: Christian Rohmann <christian.rohmann@inovex.de>	2020-07-30 10:48:16 +02:00
Igor Yanchenko	88735a798a	Resize volume by changing pvc size if enabled in config. (#958 ) * Try to resize pvc if resizing pv has failed * added config option to switch between storage resize strategies * changes according to requests * Update pkg/controller/operator_config.go Co-authored-by: Felix Kunde <felix-kunde@gmx.de> * enable_storage_resize documented added examples to the default configuration and helm value files * enable_storage_resize renamed to volume_resize_mode, off by default * volume_resize_mode renamed to storage_resize_mode * Update pkg/apis/acid.zalan.do/v1/crds.go * pkg/cluster/volumes.go updated * Update docs/reference/operator_parameters.md * Update manifests/postgresql-operator-default-configuration.yaml * Update pkg/controller/operator_config.go * Update pkg/util/config/config.go * Update charts/postgres-operator/values-crd.yaml * Update charts/postgres-operator/values.yaml * Update docs/reference/operator_parameters.md * added logging if no changes required Co-authored-by: Felix Kunde <felix-kunde@gmx.de>	2020-07-03 10:53:37 +02:00
Felix Kunde	0c6655a22d	skip creation later to improve visibility of errors (#1013 ) * try to emit error for missing team name in cluster name * skip creation after new cluster object * move SetStatus to k8sclient and emit event when skipping creation and rename to SetPostgresCRDStatus Co-authored-by: Felix Kunde <felix.kunde@zalando.de>	2020-06-17 13:32:16 +02:00
Felix Kunde	3c352fb460	bump pooler image and more coalescing for CRD config (#1004 ) Co-authored-by: Felix Kunde <felix.kunde@zalando.de>	2020-06-05 11:14:17 +02:00
alfredw33	2b0def5bc8	Support for GCS WAL-E backups (#620 ) * Support for WAL_GS_BUCKET and GOOGLE_APPLICATION_CREDENTIALS environtment variables * Fixed merge issue but also removed all changes to support macos. * Updated test to new format * Missed macos specific changes * Added documentation and addressed comments * Update docs/administrator.md * Update docs/administrator.md * Update e2e/run.sh Co-authored-by: Felix Kunde <felix-kunde@gmx.de>	2020-06-03 17:33:48 +02:00
Felix Kunde	bb3d2fa678	Bump v1.5.0 (#954 ) * bump to v1.5.0 * update helm charts and docs * update helm charts and packages * update images for spilo, logical-backup and pooler	2020-05-05 12:52:54 +02:00
Felix Kunde	76d43525f7	define more default values for opConfig CRD (#955 )	2020-05-04 16:23:21 +02:00
Rafia Sabih	d52296c323	Propagate annotations to the StatefulSet (#932 ) * Initial commit * Corrections - set the type of the new configuration parameter to be array of strings - propagate the annotations to statefulset at sync * Enable regular expression matching * Improvements -handle rollingUpdate flag -modularize code -rename config parameter name * fix merge error * Pass annotations to connection pooler deployment * update code-gen * Add documentation and update manifests * add e2e test and introduce option in configmap * fix service annotations test * Add unit test * fix e2e tests * better key lookup of annotations tests * add debug message for annotation tests * Fix typos * minor fix for looping * Handle update path and renaming - handle the update path to update sts and connection pooler deployment. This way no need to wait for sync - rename the parameter to downscaler_annotations - handle other review comments * another try to fix python loops * Avoid unneccessary update events * Update manifests * some final polishing * fix cluster_test after polishing Co-authored-by: Rafia Sabih <rafia.sabih@zalando.de> Co-authored-by: Felix Kunde <felix-kunde@gmx.de>	2020-05-04 14:46:56 +02:00
Felix Kunde	865d5b41a7	set event broadcasting to Infof and update rbac (#952 )	2020-04-29 17:26:46 +02:00
Felix Kunde	d76203b3f9	Bootstrapped databases with best practice role setup (#843 ) * PreparedDatabases with default role setup * merge changes from master * include preparedDatabases spec check when syncing databases * create a default preparedDB if not specified * add more default privileges for schemas * use empty brackets block for undefined objects * cover more default privilege scenarios and always define admin role * add DefaultUsers flag * support extensions and defaultUsers for preparedDatabases * remove exact version in deployment manifest * enable CRD validation for new field * update generated code * reflect code review * fix typo in SQL command * add documentation for preparedDatabases feature + minor changes * some datname should stay * add unit tests * reflect some feedback * init users for preparedDatabases also on update * only change DB default privileges on creation * add one more section in user docs * one more sentence	2020-04-29 10:56:06 +02:00
Sergey Dudoladov	cc635a02e3	Lazy upgrade of the Spilo image (#859 ) * initial implementation * describe forcing the rolling upgrade * make parameter name more descriptive * add missing pieces * address review * address review * fix bug in e2e tests * fix cluster name label in e2e test * raise test timeout * load spilo test image * use available spilo image * delete replica pod for lazy update test * fix e2e * fix e2e with a vengeance * lets wait for another 30m * print pod name in error msg * print pod name in error msg 2 * raise timeout, comment other tests * subsequent updates of config * add comma * fix e2e test * run unit tests before e2e * remove conflicting dependency * Revert "remove conflicting dependency" This reverts commit `65fc09054b`. * improve cdp build * dont run unit before e2e tests * Revert "improve cdp build" This reverts commit `e2a8fa12aa`. Co-authored-by: Sergey Dudoladov <sergey.dudoladov@zalando.de> Co-authored-by: Felix Kunde <felix-kunde@gmx.de>	2020-04-29 10:07:14 +02:00
Björn Fischer	168abfe37b	Fully speced global sidecars (#890 ) * implement fully speced global sidecars * fix issue #924	2020-04-27 17:40:22 +02:00
Christian Rohmann	21b9b6fcbe	Emit K8S events to the postgresql CR as feedback to the requestor / user (#896 ) * Add EventsGetter to KubeClient to enable to sending K8S events * Add eventRecorder to the controller, initialize it and hand it down to cluster via its constructor to enable it to emit events this way * Add first set of events which then go to the postgresql custom resource the user interacts with to provide some feedback * Add right to "create" events to operator cluster role * Adapt cluster tests to new function sigurature with eventRecord (via NewFakeRecorder) * Get a proper reference before sending events to a resource Co-authored-by: Christian Rohmann <christian.rohmann@inovex.de>	2020-04-27 08:22:07 +02:00
Dmitry Dolgov	a1f2bd05b9	Prevent superuser from being a connection pool user (#906 ) * Protected and system users can't be a connection pool user It's not supported, neither it's a best practice. Also fix potential null pointer access. For protected users it makes sense by intent of protecting this users (e.g. from being overriden or used as something else than supposed). For system users the reason is the same as for superuser, it's about replicastion user and it's under patroni control. This is implemented on both levels, operator config and postgresql manifest. For the latter we just use default name in this case, assuming that operator config is always correct. For the former, since it's a serious misconfiguration, operator panics.	2020-04-09 09:21:45 +02:00
ReSearchITEng	1249626a60	kubernetes_use_configmap (#887 ) * kubernetes_use_configmap * Update manifests/postgresql-operator-default-configuration.yaml Co-Authored-By: Felix Kunde <felix-kunde@gmx.de> * Update manifests/configmap.yaml Co-Authored-By: Felix Kunde <felix-kunde@gmx.de> * Update charts/postgres-operator/values.yaml Co-Authored-By: Felix Kunde <felix-kunde@gmx.de> * go.fmt Co-authored-by: Felix Kunde <felix-kunde@gmx.de>	2020-04-02 13:20:45 +02:00
Felix Kunde	b43b22dfcc	Call me pooler, not pool (#883 ) * rename pooler parts and add example to manifest * update codegen * fix manifest and add more details to docs * reflect renaming also in e2e tests	2020-04-01 10:34:03 +02:00
Felix Kunde	66f2cda87f	Move operator to go 1.14 (#882 ) * update go modules march 2020 * update to GO 1.14 * reflect k8s client API changes	2020-03-30 15:50:17 +02:00
Dmitry Dolgov	9dfa433363	Connection pooler (#799 ) Connection pooler support Add support for a connection pooler. The idea is to make it generic enough to be able to switch between different implementations (e.g. pgbouncer or odyssey). Operator needs to create a deployment with pooler and a service for it to access. For connection pool to work properly, a database needs to be prepared by operator, namely a separate user have to be created with an access to an installed lookup function (to fetch credential for other users). This setups is supposed to be used only by robot/application users. Usually a connection pool implementation is more CPU bounded, so it makes sense to create several pods for connection pool with more emphasize on cpu resources. At the moment there are no special affinity or tolerations assigned to bring those pods closer to the database. For availability purposes minimal number of connection pool pods is 2, ideally they have to be distributed between different nodes/AZ, but it's not enforced in the operator itself. Available configuration supposed to be ergonomic and in the normal case require minimum changes to a manifest to enable connection pool. To have more control over the configuration and functionality on the pool side one can customize the corresponding docker image. Co-authored-by: Felix Kunde <felix-kunde@gmx.de>	2020-03-25 12:57:26 +01:00
Felix Kunde	cf829df1a4	define ownership between operator and clusters via annotation (#802 ) * define ownership between operator and postgres clusters * add documentation * add unit test	2020-03-17 16:34:31 +01:00
Dmitry Dolgov	d666c52172	ClusterDomain default (#863 ) * add json:omitempty option to ClusterDomain * Add default value for ClusterDomain Unfortunately, omitempty in operator configuration CRD doesn't mean that defauls from operator config object will be picked up automatically. Make sure that ClusterDomain default is specified, so that even when someone will set cluster_domain = "", it will be overwritted with a default value. Co-authored-by: mlu42 <mlu42pro@gmail.com>	2020-03-13 11:51:39 +01:00
Felix Kunde	ae2a38d62a	add e2e test for node readiness label (#846 ) * add e2e test for node readiness label * refactoring and order tests alphabetically * always wait for replica after failover	2020-03-06 12:55:34 +01:00
Felix Kunde	b24da3201c	bump version to 1.4.0 + some polishing (#839 ) * bump version to 1.4.0 + some polishing * align version for UI chart * update user docs to warn for standby replicas * minor log message changes for RBAC resources	2020-02-25 09:50:54 +01:00
Felix Kunde	aea9e9bd33	postgres-pod clusterrole (#832 ) * define postgres-pod clusterrole and align rbac in chart * align UI chart rbac with operator and update doc * operator RBAC needs podsecuritypolicy to grant it to postgres-pod	2020-02-19 12:32:54 +01:00
Felix Kunde	702a194c41	switch to rbac/v1 (#829 ) * switch to rbac/v1	2020-02-17 11:25:07 +01:00
Fredrik Østrem	00f00af2e8	Fix MasterPodMoveTimeout field that cannot be unmarshalled (#816 ) * Update operator_configuration_type.go * Update operator_config.go	2020-02-11 17:16:38 +01:00
Vito Botta	a660d758a5	Add region setting for logical backups to non-AWS storage (#813 ) * Add region setting for logical backups to non-AWS storage	2020-02-10 11:48:24 +01:00
Felix Kunde	1f0312a014	make minimum limits boundaries configurable (#808 ) * make minimum limits boundaries configurable * add e2e test	2020-02-03 11:43:18 +01:00
Felix Kunde	cd110aabf4	Enforce minimum cpu and memory limits (#731 ) * add validation for PG resources and volume size * check resource requests also on UPDATE and SYNC + update docs * if cluster was running don't error on sync	2019-12-12 16:43:55 +01:00
Felix Kunde	107334fe71	Add global option to enable/disable init containers and sidecars (#478 ) * Add global option to enable/disable init containers and sidecars * update dependencies	2019-12-10 15:45:54 +01:00
Felix Kunde	a3b34f146f	Add CRD validation (#599 ) * add CRD manifests with validation * update documentation * patroni slots is not an array but a nested hash map * make deps call tools * cover validation in docs and export it in crds.go * add toggle to disable creation of CRD validation and document it * use templated service account also for CRD-configured helm deployment	2019-11-28 12:02:05 +01:00
Armin Nesiren	5f87384d7f	Passing endpoint, access and secret key to logical-backup container (#628 ) * Added possibility to add custom annotations to LoadBalancer service. * Added parameters for custom endpoint, access and secret key for logical backup. * Modified dump.sh so it knows how to handle new features. Configurable S3 SSE	2019-11-26 10:40:49 +01:00
Thomas Runyon	535517cd1b	Custom annotations 329 (#657 ) * Add ability for custom annotations to database pods	2019-11-11 10:45:35 +01:00
Dmitry Dolgov	647a4d3023	Remove service accounts cache (#685 ) For optimization purposes operator was creating a cache map to remember if service accounts and role binding was deployed to a namespace. This could lead to a problem, when a namespace was deleted, since this cache was not synchronized. For the sake of correctness remove the cache, and check every time if required service account and rbac is present. In the normal case this introduces an overhead of two API calls per an event (one to get a service accounts, one to get a role binding), which should not be a problem, unless proven otherwise.	2019-10-11 11:06:14 +02:00
Dmitry Dolgov	baae1887b3	Replace glide with Go modules (#544 ) * And attempt to build with modules and remove glide * new tools.go file to get code-generator dependency + updated codegen + remove Glide files and update docs	2019-10-02 16:18:55 +02:00
Felix Kunde	7c19cf50db	align config map, operator config, helm chart values and templates (#595 ) * align config map, operator config, helm chart values and templates * follow helm chart conventions also in CRD templates * split up values files and add comments * avoid yaml confusion in postgres manifests * bump spilo version and use example for logical_backup_s3_bucket * add ConfigTarget switch to values	2019-07-08 17:49:25 +02:00
Felix Kunde	36003b8264	enable shmVolume setting in OperatorConfiguration (#605 ) * enable shmVolume setting in OperatorConfiguration	2019-07-05 16:48:37 +02:00
Markus	93bfed3e75	Add secret mount to operator (#535 ) * add secret mount to operator	2019-06-19 12:40:49 +02:00
Felix Kunde	6918394562	Add PDB configuration toggle (#583 ) * Don't create an impossible disruption budget for smaller clusters. * sync PDB also on update	2019-06-18 10:48:21 +02:00
Maxim Ivanov	44acd7e4db	Not being able to register CRD is not a fatal error (#444 ) Operator proceeds to checking if CRD is present and ready, and if not, only then it is a fatal error.	2019-06-14 16:08:29 +02:00
Erik Inge Bolsø	6fbfee3903	decouple clusterrole name and serviceaccount name (#581 ) Decouple clusterrole name and service account name.	2019-06-14 14:24:23 +02:00
Aaron Miller	ec5b1d4d58	StatefulSet fsGroup config option to allow non-root spilo (#531 ) * StatefulSet fsGroup config option to allow non-root spilo * Allow Postgres CRD to overide SpiloFSGroup of the Operator. * Document FSGroup of a Pod cannot be changed after creation.	2019-06-04 16:38:26 +02:00
Erik Inge Bolsø	ebda39368e	database.go: remove hardcoded .svc.cluster.local dns suffix (#561 ) * database.go: substitute hardcoded .svc.cluster.local dns suffix with config parameter Use the pod's configured dns search path, for clusters where .svc.cluster.local is not correct.	2019-05-31 16:32:00 +02:00
Sergey Dudoladov	f3e1e80aaf	Add logical backup (#442 ) * Add k8s cron job to spawn logical backups * Minor doc updates	2019-05-16 15:52:01 +02:00
Felix Kunde	ad0b250b5b	patch CRD on operator update (#558 ) * patch existing CRD each time there is an operator update	2019-05-09 12:35:15 +02:00
Felix Kunde	0fbfbb23bb	Use /status subresource instead of plain manifest field (#534 ) * turns PostgresStatus type into a struct with field PostgresClusterStatus * setStatus patch target is now /status subresource * unmarshalling PostgresStatus takes care of previous status field convention * new simple bool functions status.Running(), status.Creating()	2019-05-07 12:01:45 +02:00
Sergey Dudoladov	c1d108a832	Fix CRD-based operator configuration (#541 ) * Fix CRD-based operator configuration * add inherited labels, update docker image	2019-04-15 13:52:38 +02:00
Aaron Miller	15ec6a920d	Config option to allow Spilo container to run non-privileged. (#525 ) * Config option to allow Spilo container to run non-privileged. Runs non-privileged by default. Fixes #395 * add spilo_privileged to manifests/configmap.yaml * add spilo_privileged to helm chart's values.yaml	2019-04-03 17:13:39 +02:00
Felix Kunde	313db7d10b	set default name also for RoleBinding and roleRef (#529 )	2019-04-02 17:16:47 +02:00
Sergey Dudoladov	0b53dbe5dc	Set statefulset update and management policy explicitly (#515 ) * fix logging in retry * explicitly set the stateful set update strategy to onDelete * add podManagementPolicy	2019-03-13 11:49:18 +01:00
Vineeth Reddy	db72d82f14	gofmt and golint fixes (#506 ) * fix gofmt and golint issues	2019-03-04 13:13:55 +01:00

1 2 3 4 5

203 Commits