kubespray

Commit Graph

Author	SHA1	Message	Date
Max Gautier	0f0e24be0f	etcd: throttle restart for availability (#11677 ) * etcd: throttle restart for availability During upgrade, etcd member are restarted all at once. This can impact the availability of the etcd cluster and subsequently of the Kubernetes cluster. Limit the concurrent restart so that the etcd cluster can keep quorum. * Simplify etcd handlers	2024-11-05 06:11:29 +00:00
Bogdan Sass	4b324cb0f0	Rename master to control plane - non-breaking changes only (#11394 ) K8s is moving away from the "master" terminology, so kubespray should follow the same naming conventions. See `65d886bb30/sig-architecture/naming/recommendations/001-master-control-plane.md`	2024-09-06 07:56:19 +01:00
Vlad Korolev	9a7b021eb8	Do not use ‘yes/no’ for boolean values (#11472 ) Consistent boolean values in ansible playbooks	2024-08-28 06:30:56 +01:00
Lihai Tu	8208a3f04f	Rename systemd module to systemd_service (#11396 ) Signed-off-by: tu1h <lihai.tu@daocloud.io>	2024-07-26 01:11:39 -07:00
Bas	8f5f75211f	Improving yamllint configuration (#11389 ) Signed-off-by: Bas Meijer <bas.meijer@enexis.nl>	2024-07-25 18:42:20 -07:00
Max Gautier	8ebeb88e57	Refactor "multi" handlers to use listen (#10542 ) * containerd: refactor handlers to use 'listen' * cri-dockerd: refactor handlers to use 'listen' * cri-o: refactor handlers to use 'listen' * docker: refactor handlers to use 'listen' * etcd: refactor handlers to use 'listen' * control-plane: refactor handlers to use 'listen' * kubeadm: refactor handlers to use 'listen' * node: refactor handlers to use 'listen' * preinstall: refactor handlers to use 'listen' * calico: refactor handlers to use 'listen' * kube-router: refactor handlers to use 'listen' * macvlan: refactor handlers to use 'listen'	2023-11-08 12:28:30 +01:00
Max Gautier	8f0e553e11	etcd/backup: native ansible modules instead of shell (#10540 ) This make native ansible features (dry-run, changed state) easier to have, and should have a minimal performance impact, since it only runs on the etcd members.	2023-10-30 20:05:28 +01:00
Arthur Outhenin-Chalandre	36e5d742dc	Resolve ansible-lint name errors (#10253 ) * project: fix ansible-lint name Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * project: ignore jinja template error in names Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * project: capitalize ansible name Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * project: update notify after name capitalization Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> --------- Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch>	2023-07-26 07:36:22 -07:00
Arthur Outhenin-Chalandre	25cb90bc2d	Upgrade ansible (#10190 ) * project: update all dependencies including ansible Upgrade to ansible 7.x and ansible-core 2.14.x. There seems to be issue with ansible 8/ansible-core 2.15 so we remain on those versions for now. It's quite a big bump already anyway. Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * tests: install aws galaxy collection Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * ansible-lint: disable various rules after ansible upgrade Temporarily disable a bunch of linting action following ansible upgrade. Those should be taken care of separately. Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * project: resolve deprecated-module ansible-lint error Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * project: resolve no-free-form ansible-lint error Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * project: resolve schema[meta] ansible-lint error Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * project: resolve schema[playbook] ansible-lint error Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * project: resolve schema[tasks] ansible-lint error Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * project: resolve risky-file-permissions ansible-lint error Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * project: resolve risky-shell-pipe ansible-lint error Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * project: remove deprecated warn args Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * project: use fqcn for non builtin tasks Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * project: resolve syntax-check[missing-file] for contrib playbook Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> * project: use arithmetic inside jinja to fix ansible 6 upgrade Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch> --------- Signed-off-by: Arthur Outhenin-Chalandre <arthur.outhenin-chalandre@proton.ch>	2023-06-26 03:15:45 -07:00
emiran-orange	2b97b661d8	Move old etcd backup removal after etcd restart (#9147 )	2022-08-05 08:09:59 -07:00
zhengtianbao	a16d427536	Set etcd-events listen port to 2383 (#8232 )	2021-12-07 00:28:01 -08:00
Etienne Champetier	de1d9df787	Only use stat get_checksum: yes when needed (#7270 ) By default Ansible stat module compute checksum, list extended attributes and find mime type To find all stat invocations that really use one of those: git grep -F stat. \| grep -vE 'stat.(islnk\|exists\|lnk_source\|writeable)' Signed-off-by: Etienne Champetier <e.champetier@ateme.com>	2021-02-10 05:36:59 -08:00
axelgobletbdr	097bec473c	fixed bug in etcd retention where backups are not sorted by date (#6860 ) * fixed bug in etcd retention where backups are not sorted by date * added directory filter to find command	2020-10-28 09:09:57 -07:00
axelgobletbdr	4b858b6466	Fixes 6621 etcd backup directory is consuming much rootfs disk space (#6836 ) * added an ansible var to manage retention of etcd backups * refactord ls/grep into find in etcd backup removal command	2020-10-23 07:09:57 -07:00
Florent Monbillard	bf8c8976dd	Upgrade etcd to 3.4.3 (#5998 )	2020-07-20 07:26:51 -07:00
Maxime Guyot	00fe3d5094	Explicitly set ETCDCTL_API and use ETCDCTL_ENDPOINTS (#6327 )	2020-07-01 04:56:16 -07:00
Matthew Mosesohn	fc072300ea	Purge legacy cleanup tasks from older than 1 year (#4450 ) We don't need to support upgrades from 2 year old installs, just from the last major version. Also changed most retried tasks to 1s delay instead of longer.	2019-04-24 00:08:05 -07:00
MarkusTeufelberger	424e59805f	ansible-lint: Fix commands that are also available as module (#4619 )	2019-04-23 22:18:00 -07:00
Qasim Sarfraz	0a3cf1a087	Fix CA cert environment variable for ectd v3 (#4381 )	2019-03-28 00:18:43 -07:00
ankitcharolia	9c83551a0e	add certificate authority file (#3433 )	2018-11-02 08:27:53 -07:00
Mathieu Herbert	59d89a37cc	add until option for etcd backup commands	2018-08-17 11:05:57 +02:00
woopstar	86e3506ae6	Etcd cluster setup makeover The current way to setup the etc cluster is messy and buggy. - It checks for cluster is healthy before the cluster is even created. - The unit files are started on handlers, not in the task, so you mess with "flush handlers". - The join_member.yml is not used. - etcd events cluster is not configured for kubeadm - remove duplicate runs between running the role on etcd nodes and k8s nodes	2018-04-01 21:38:33 +02:00
Andreas Krüger	b9b028a735	Update etcd deployment to use correct cert and key (#2572 ) * Update etcd deployment to use correct cert and key * Update to use admin cert for etcdctl commands * Update handler to use admin cert too	2018-03-31 14:06:09 -04:00
RongZhang	67ffd8e923	Add etcd-events cluster for kube-apiserver (#2385 ) Add etcd-events cluster for kube-apiserver	2018-03-01 11:39:14 +03:00
Matthew Mosesohn	dc6a17e092	Use include/import tasks (#2192 ) import_tasks will consume far less memory, so it should be used whenever it is compatible.	2018-01-29 14:37:48 +03:00
Steve Mitchell	e45b30d033	Add etcd key and cert environment variables for use with client auth	2018-01-02 13:52:17 -05:00
Matthew Mosesohn	10dd049912	Revert "Security fixes for etcd (#1778 )" (#1786 ) This reverts commit `4209f1cbfd`.	2017-10-12 14:02:51 +01:00
Matthew Mosesohn	4209f1cbfd	Security fixes for etcd (#1778 ) * Security fixes for etcd * Use certs when querying etcd	2017-10-12 13:32:54 +01:00
sgmitchell	783924e671	Change backup handler to only run v2 data backup if snap directory exists (#1594 )	2017-08-31 18:23:24 +03:00
Brad Beam	8b151d12b9	Adding yamllinter to ci steps (#1556 ) * Adding yaml linter to ci check * Minor linting fixes from yamllint * Changing CI to install python pkgs from requirements.txt - adding in a secondary requirements.txt for tests - moving yamllint to tests requirements	2017-08-24 12:09:52 +03:00
gdmelloatpoints	4ba237c5d8	Make etcd_backup_prefix configurable. Ensures that backups can be stored on a different location other than ${HOST}/var/backups, say an EBS volume on AWS.	2017-06-26 09:42:30 -04:00
Sergii Golovatiuk	f144fd1ed3	Refactor etcd role - Run docker run from script rather than directly from systemd target - Refactoring styling/templates Signed-off-by: Sergii Golovatiuk <sgolovatiuk@mirantis.com>	2017-03-24 12:34:15 +01:00
Sergii Golovatiuk	c04a6254b9	Backup etcd data before restarting etcd etcd is crucial part of kubernetes cluster. Ansible restarts etcd on reconfiguration. Backup helps operator to restore cluster manually in case of any issues. Signed-off-by: Sergii Golovatiuk <sgolovatiuk@mirantis.com>	2017-03-20 14:50:52 +01:00
Andrew Greenwood	ca9ea097df	Cleanup legacy syntax, spacing, files all to yml Migrate older inline= syntax to pure yml syntax for module args as to be consistant with most of the rest of the tasks Cleanup some spacing in various files Rename some files named yaml to yml for consistancy	2017-02-17 16:22:34 -05:00
Bogdan Dobrelya	58062be2a3	Drop non systemd OS types support Signed-off-by: Bogdan Dobrelya <bogdando@mail.ru>	2017-01-02 12:14:03 +01:00
Dan Bode	ff675d40f9	Ensure that etcd health checks always pass in the etcd handler, the reload etcd action was called after ansible waits for etcd to be up, this means that the health checks which are called immediately after fail (resulting in the etcd role always failing and never finishing) This patch changes the order to move the 'wait for etcd up' resource after the 'reload etcd resource', ensuring that the service is up before the health check is called.	2016-11-18 14:15:00 -08:00
Matthew Mosesohn	a32cd85eb7	Add etcd TLS support	2016-11-09 18:38:28 +03:00
Matthew Mosesohn	95b460ae94	Remove etcd-proxy from all nodes and use etcd multiaccess	2016-11-09 13:31:12 +03:00
Matthew Mosesohn	5668e5f767	Fix etcd restart and handler systemd tasks Changed Wants=docker.service to docker.socket Renamed handlers for reloading systemd to contain role in task name.	2016-07-29 16:32:35 +03:00
Matthew Mosesohn	1b1f5f22d4	Fix etcd standalone deployment etcd facts are generated in kubernetes/preinstall, so etcd nodes need to be evaluated first before the rest of the deployment. Moved several directory facts from kubernetes/node to kubernetes/preinstall because they are not backward dependent.	2016-07-26 18:15:06 +03:00
Matthew Mosesohn	7a86b6c73e	Set default etcd deployment to docker Improved docker reload command to wait for etcd to be up before proceeding. Switched reload to run restart because it can't reload if it is not guaranteed to be in running state.	2016-07-20 18:26:16 +03:00
Bogdan Dobrelya	32cd6e99b2	Add etcd proxy support * Enforce a etcd-proxy role to a k8s-cluster group members. This provides an HA layout for all of the k8s cluster internal clients. * Proxies to be run on each node in the group as a separate etcd instances with a readwrite proxy mode and listen the given endpoint, which is either the access_ip:2379 or the localhost:2379. * A notion for the 'kube_etcd_multiaccess' is: ignore endpoints and loadbalancers and use the etcd members IPs as a comma-separated list. Otherwise, clients shall use the local endpoint provided by a etcd-proxy instances on each etcd node. A Netwroking plugins always use that access mode. * Fix apiserver's etcd servers args to use the etcd_access_endpoint. * Fix networking plugins flannel/calico to use the etcd_endpoint. * Fix name env var for non masters to be set as well. * Fix etcd_client_url was not used anywhere and other etcd_* facts evaluation was duplicated in a few places. * Define proxy modes only in the env file, if not a master. Del an automatic proxy mode decisions for etcd nodes in init/unit scripts. * Use Wants= instead of Requires= as "This is the recommended way to hook start-up of one unit to the start-up of another unit" * Make apiserver/calico Wants= etcd-proxy to keep it always up Signed-off-by: Bogdan Dobrelya <bdobrelia@mirantis.com> Co-authored-by: Matthew Mosesohn <mmosesohn@mirantis.com>	2016-07-19 14:09:40 +02:00
Matthew Mosesohn	b3282cd0bb	Add optional deployment mode for Docker etcd_deployment_type Running etcd in Docker reduces the number of individual file downloads and services running on the host. Note: etcd container v3.0.1 moves bindir to /usr/local/bin Fixes: #298	2016-07-07 19:31:28 +03:00
Smana	a649aa8b7e	use ansible_service_mgr to detect init system	2016-02-13 11:46:53 +01:00
ant31	56b92812fa	Fix systemd reload and calico unit	2016-01-25 10:54:07 +01:00
Smaine Kahlouch	9715962356	etcd directly in host fix etcd configuration for nodes fix wrong calico checksums using a var name etcd_bin_dir fix etcd handlers for sysvinit using a var name etcd_bin_dir sysvinit script review etcd configuration	2016-01-21 11:36:11 +01:00
Antoine Legrand	5c15d14f12	Run etcd as pod	2015-12-28 22:04:39 +01:00
Smaine Kahlouch	e2984b4fdb	ha etcd with calico	2015-12-15 11:49:11 +01:00
Smaine Kahlouch	00c562828f	Initial commit	2015-10-03 22:19:50 +02:00

49 Commits (81a66cc73d0983c6899f935b2dbb7afdd8696b64)