ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	3e262e072b	containers: use --cpus instead --cpu-quota When using docker 1.13.1, the current condition: ``` {% if (container_binary == 'docker' and ceph_docker_version.split('.')[0] is version_compare('13', '>=')) or container_binary == 'podman' -%} ``` is wrong because it compares the first digit (1) whereas it should compare the second one. It means we always use `--cpu-quota` although documentation recommend using `--cpus` when docker version is 1.13.1 or higher. From the doc: > --cpu-quota=<value> Impose a CPU CFS quota on the container. The number of > microseconds per --cpu-period that the container is limited to before > throttled. As such acting as the effective ceiling. > If you use Docker 1.13 or higher, use --cpus instead. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-16 13:51:43 -05:00
Guillaume Abrioux	8dcbcecd71	remove container_exec_cmd_mgr fact Iterating over all monitors in order to delegate a ` {{ container_binary }}` fails when collocating mgrs with mons, because ceph-facts reset `container_exec_cmd` to point to the first member of the monitor group. The idea is to force `container_exec_cmd` to be reset in ceph-mgr. This commit also removes the `container_exec_cmd_mgr` fact. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1791282 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-15 14:03:49 -05:00
Dimitri Savineau	3cc7d5651c	tox: use vagrant_up.sh instead of vagrant up We should use the same vagrant wrapper everywhere instead of the vagrant command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-15 17:52:35 +01:00
Dimitri Savineau	a5385e1048	vagrant: temp workaround for CentOS 8 cloud image The CentOS cloud infrastructure storing the vagrant CentOS 8 image changed the directory path and remove the old 8.0 image so the vagrant box add centos/8 fails returning a 404 http error. As a workaround we can pull the image from CentOS instead of letting vagrant doing the resolution. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-15 17:52:35 +01:00
Dimitri Savineau	4e7fb5d45a	drop use_fqdn variables This has been deprecated in the previous releases. Let's drop it. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-15 11:32:39 +01:00
Dimitri Savineau	c61db12c09	travis: drop python2 support Since python2 is EOL we can drop it from travis CI matrix. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-15 10:42:51 +01:00
Guillaume Abrioux	3d0898aa5d	shrink-mds: fix condition on fs deletion the new ceph status registered in `ceph_status` will report `fsmap.up` = 0 when it's the last mds given that it's done after we shrink the mds, it means the condition is wrong. Also adding a condition so we don't try to delete the fs if a standby node is going to rejoin the cluster. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1787543 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-15 10:40:14 +01:00
Dimitri Savineau	bd87d69183	ceph-iscsi: don't use bracket with trusted_ip_list The trusted_ip_list parameter for the rbd-target-api service doesn't support ipv6 address with bracket. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1787531 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-14 11:32:36 -05:00
Guillaume Abrioux	5558664f37	osd: use _devices fact in lvm batch scenario since `fd1718f379`, we must use `_devices` when deploying with lvm batch scenario. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-14 09:12:03 -05:00
Guillaume Abrioux	d853da2a68	update: remove legacy This task is a code duplicate, probably a legacy, let's remove it. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-13 15:18:45 -05:00
Guillaume Abrioux	2592a1e1e8	facts: fix osp/ceph external use case `d6da508a9b` broke the osp/ceph external use case. We must skip these tasks when no monitor is present in the inventory. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1790508 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-13 12:06:06 -05:00
Dimitri Savineau	f940e695ab	ceph-facts: move grafana fact to dedicated file We don't need to execute the grafana fact every time but only during the dashboard deployment. Especially for ceph-grafana, ceph-prometheus and ceph-dashboard roles. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1790303 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-13 12:05:57 -05:00
Guillaume Abrioux	58e6bfed2d	osd: ensure osd ids collected are well restarted This commit refact the condition in the loop of that task so all potential osd ids found are well started. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1790212 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-13 12:05:48 -05:00
Guillaume Abrioux	af6875706a	osd: do not run openstack_config during upgrade There is no need to run this part of the playbook when upgrading the cluster. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-13 09:59:08 -05:00
Guillaume Abrioux	fef1cd4c4b	tests: use main playbook for add_osds job This commit replaces the playbook used for add_osds job given accordingly to the add-osd.yml playbook removal Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-13 09:59:08 -05:00
Guillaume Abrioux	3496a0efa2	osd: support scaling up using --limit This commit lets add-osd.yml in place but mark the deprecation of the playbook. Scaling up OSDs is now possible using --limit Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-13 09:59:08 -05:00
Dimitri Savineau	3900527e16	tests/setup: update mount options on EL 8 The nobarrier mount flag doesn't exist anymore on XFS in the EL 8 kernel. That's why the task wasn't working on those systems. We can still use the other options instead of skipping the task. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-11 05:33:01 +01:00
Dimitri Savineau	e4ddcb812b	ceph-validate: fail on CentOS 7 The Ceph Octopus release is only supported on CentOS 8 Closes: #4918 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-10 14:06:02 -05:00
Guillaume Abrioux	dc672e86ec	tests: add a docker2podman scenario This commit adds a new scenario in order to test docker-to-podman.yml migration playbook. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-10 10:21:29 -05:00
Guillaume Abrioux	b0c491800a	docker2podman: use set_fact to override variables play vars have lower precedence than role vars and `set_fact`. We must use a `set_fact` to reset these variables. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-10 10:21:29 -05:00
Guillaume Abrioux	1c2ec9fb40	docker2podman: force systemd to reload config This is needed after a change is made in systemd unit files. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-10 10:21:29 -05:00
Guillaume Abrioux	d746575fd0	docker2podman: install podman This commit adds a package installation task in order to install podman during the docker-to-podman.yml migration playbook. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-10 10:21:29 -05:00
Dimitri Savineau	a09d1c38bf	purge-iscsi-gateways: don't run all ceph-facts We only need to have the container_binary fact. Because we're not gathering the facts from all nodes then the purge fails trying to get one of the grafana fact. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1786686 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-10 15:46:15 +01:00
Guillaume Abrioux	fd1718f379	config: exclude ceph-disk prepared osds in lvm batch report We must exclude the devices already used and prepared by ceph-disk when doing the lvm batch report. Otherwise it fails because ceph-volume complains about GPT header. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1786682 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-10 00:04:22 +01:00
Dimitri Savineau	3f344fdefe	rolling_update: run registry auth before upgrading There's some tasks using the new container image during the rolling upgrade playbook that needs to execute the registry login first otherwise the nodes won't be able to pull the container image. Unable to find image 'xxx.io/foo/bar:latest' locally Trying to pull repository xxx.io/foo/bar ... /usr/bin/docker-current: Get https://xxx.io/v2/foo/bar/manifests/latest: unauthorized Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-09 16:14:33 -05:00
Dimitri Savineau	747555dfa6	shrink-rgw: refact global workflow Instead of running the ceph roles against localhost we should do it on the first mon. The ansible and inventory hostname of the rgw nodes could be different. Ensure that the rgw instance to remove is present in the cluster. Fix rgw service and directory path. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1677431 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-09 19:02:17 +01:00
Guillaume Abrioux	86f3eeb717	mon: support replacing a mon We must pick up a mon which actually exists in ceph-facts in order to detect if a cluster is running. Otherwise, it will state no cluster is already running which will end up deploying a new monitor isolated in a new quorum. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1622688 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-09 12:59:12 -05:00
Guillaume Abrioux	30200802d9	handler: fix bug `411bd07d54` introduced a bug in handlers: using `handler_*_status` instead of `hostvars[item]['handler_*_status']` causes handlers to be triggered in any case, even though `handler_*_status` was set to `False` on a specific node. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1622688 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-08 17:11:42 -05:00
Benoît Knecht	3c31b19ab3	ceph-rgw: Fix custom pool size setting RadosGW pools can be created by setting ```yaml rgw_create_pools: .rgw.root: pg_num: 512 size: 2 ``` for instance. However, doing so would create pools of size `osd_pool_default_size` regardless of the `size` value. This was due to the fact that the Ansible task used ``` {{ item.size | default(osd_pool_default_size) }} ``` as the pool size value, but `item.size` is always undefined; the correct variable is `item.value.size`. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch>	2020-01-08 16:16:38 -05:00
Dimitri Savineau	70eba66182	ceph-iscsi: manage ipv6 in trusted_ip_list Only the ipv4 addresses from the nodes running the dashboard mgr module were added to the trusted_ip_list configuration file on the iscsigws nodes. This also add the iscsi gateways with ipv6 configuration to the ceph dashboard. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1787531 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-08 13:54:04 -05:00
Guillaume Abrioux	0ae0a9ce28	shrink-mds: do not play ceph-facts entirely We only need to set `container_binary`. Let's use `tasks_from` option. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-08 10:39:27 -05:00
Guillaume Abrioux	77b39d235b	shrink-mds: use fact from delegated node The command is delegated on the first monitor so we must use the fact `container_binary` from this node. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-08 10:06:43 -05:00
Guillaume Abrioux	5adb735c78	facts: use correct python interpreter that task is delegated on the first mon so we should always use the `discovered_interpreter_python` from that node. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-08 10:06:43 -05:00
Guillaume Abrioux	38278a6bb5	shrink-mds: fix filesystem removal task This commit deletes the filesystem when no more MDS is present after shrinking operation. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1787543 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-08 10:06:43 -05:00
Guillaume Abrioux	2cfe5a04bf	shrink-mds: ensure max_mds is always honored This commit prevent from shrinking an mds node when max_mds wouldn't be honored after that operation. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-08 10:06:43 -05:00
Guillaume Abrioux	498bc45859	dashboard: use fqdn in external url Force fqdn to be used in external url for prometheus and alertmanager. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1765485 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-08 09:06:49 -05:00
Guillaume Abrioux	fca6f788a0	Revert "nfs: do not run privileged nfs container" This reverts commit `d06158e9d9`. Otherwise ganesha consumers can't dynamically update exports using dbus. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1784562 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-08 14:18:21 +01:00
Dimitri Savineau	931a842f21	purge-iscsi-gateways: remove node from dashboard When using the ceph dashboard with iscsi gateways nodes we also need to remove the nodes from the ceph dashboard list. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1786686 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-08 14:17:56 +01:00
Guillaume Abrioux	aabba3baab	ceph_volume: support filestore to bluestore migration This commit adds the filestore to bluestore migration support in ceph_volume module. We must append to the executed command only the relevant options according to what is passed in `osd_objectostore` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-08 11:48:21 +01:00
Dimitri Savineau	42366f0a6c	purge-container-cluster: prune exited containers Remove all stopped/exited containers. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-08 11:13:46 +01:00
Dimitri Savineau	254ab54f80	ceph-iscsi: remove python rtslib shaman repository The rtslib python library is now available in the distribution so we shouldn't have to use the shaman repository Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-08 11:13:46 +01:00
Guillaume Abrioux	4f2baaab8c	tests: disable nfs testing nfs-ganesha makes the CI failing because of issue related to SELinux. See: - https://bugzilla.redhat.com/show_bug.cgi?id=1788563 - https://github.com/nfs-ganesha/nfs-ganesha/issues/527 Until we can get this fixed, let's disable nfs-ganesha testing temporarily. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-08 11:13:46 +01:00
Guillaume Abrioux	e665d8e239	tests: upgrade from octopus to octopus on master we can't test upgrade from stable-4.0/CentOS 7 to master/CentOS 8. This commit refact the upgrade so we test upgrade from master/CentOS 8 to master/CentOS 8 (octopus to octopus) Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-08 11:13:46 +01:00
Dimitri Savineau	7b3e6b932c	tests/functional: change docker to podman Some docker commands were hardcoded in tests playbooks and some conditions were not taking care of the containerized_deployment variable but only the atomic fact. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-08 11:13:46 +01:00
Dimitri Savineau	d758125290	ceph-nfs: add ganesha_t type to selinux Since RHEL 8.1 we need to add the ganesha_t type to the permissive SELinux list. Otherwise the nfs-ganesha service won't start. This was done on RHEL 7 previously and part of the nfs-ganesha-selinux package on RHEL 8. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1786110 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-08 11:13:46 +01:00
Dimitri Savineau	de8f2a9f83	container: move lvm2 package installation Before this patch, the lvm2 package installation was done during the ceph-osd role. However we were running ceph-volume command in the ceph-config role before ceph-osd. If lvm2 wasn't installed then the ceph-volume command fails: error checking path "/run/lock/lvm": stat /run/lock/lvm: no such file or directory This wasn't visible before because lvm2 was automatically installed as docker dependency but it's not the same for podman on CentOS 8. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-08 11:13:46 +01:00
Dimitri Savineau	d4fd38c967	ceph-nfs: change ganesha CentOS repository Since we don't have nfs-ganesha builds available on CentOS 8 at the moment on shaman then we can use the alternative repository at [1] [1] https://download.nfs-ganesha.org/3/LATEST/CentOS Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-08 11:13:46 +01:00
Guillaume Abrioux	217d95abb2	common: add centos8 support Ceph octopus only supports CentOS 8. This commit adds CentOS 8 support: - update vagrant image in tox configurations. - add CentOS 8 repository for el8 dependencies. - CentOS 8 container engine is podman (same than RHEL 8). - don't use the epel mirror on sepia because it's epel7 only. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> Co-authored-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-08 11:13:46 +01:00
Stanley Lam	2ca3364109	ceph-rgw-loadbalancer: Modify keepalived master selection Currently the keepalived template only works when system hostnames exactly match the Ansible inventory name. If these are different, all generated templates become BACKUP without a MASTER assigned. Using the inventory_hostname in the template file resolves this issue. Signed-off-by: Stanley Lam stanleylam_604@hotmail.com	2020-01-06 09:25:04 -05:00
Guillaume Abrioux	8056514134	filestore-to-bluestore: umount partitions before zapping them When an OSD is stopped, it leaves partitions mounted. We must umount them before zapping them, otherwise error like "Device is busy" will show up. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1729267 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-12-19 09:22:25 +01:00
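
The version check described in commit 3e262e072b can be illustrated with a small Python sketch. This is not the actual Jinja2 change from that commit; the function names `buggy_supports_cpus` and `supports_cpus` are hypothetical, and the sketch assumes the version string always starts with at least two numeric dot-separated components. It only shows why comparing the first component of `1.13.1` against `13` never matches, and how comparing the major and minor components together avoids that.

```python
def buggy_supports_cpus(container_binary: str, docker_version: str) -> bool:
    """Mimics the condition quoted in the commit: only the first
    dot-separated component is compared against 13."""
    if container_binary == "podman":
        return True
    return container_binary == "docker" and int(docker_version.split(".")[0]) >= 13


def supports_cpus(container_binary: str, docker_version: str) -> bool:
    """Hypothetical corrected check: compare (major, minor) against (1, 13)."""
    if container_binary == "podman":
        return True
    if container_binary != "docker":
        return False
    major, minor = (int(part) for part in docker_version.split(".")[:2])
    # Tuple comparison is lexicographic, so (1, 13) and (18, 9) both pass,
    # while (1, 12) does not.
    return (major, minor) >= (1, 13)


if __name__ == "__main__":
    for version in ("1.12.6", "1.13.1", "18.09.1"):
        print(version, buggy_supports_cpus("docker", version), supports_cpus("docker", version))
```

With docker 1.13.1 the first function returns `False`, which matches the commit's description of `--cpu-quota` always being used, while the second returns `True`.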


5750 Commits (6ed09197962bdbd83f5da68a06bb90c6f9841185) All Branches Search
