ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Dimitri Savineau	f49090df7e	podman: Add systemd dependency on network.target When using podman, the systemd unit scripts don't have a dependency on the network. So we're not sure that the network is up and running when the containers are starting. With docker this behaviour is already handled because the systemd unit scripts depend on docker service which is started after the network. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-06-07 09:28:58 +02:00
L3D	ab54fe20ec	ansible: use 'bool' filter on boolean conditionals By running ceph-ansible there are a lot ``[DEPRECATION WARNING]`` like these: ``` [DEPRECATION WARNING]: evaluating containerized_deployment as a bare variable, this behaviour will go away and you might need to add \|bool to the expression in the future. Also see CONDITIONAL_BARE_VARS configuration toggle.. This feature will be removed in version 2.12. Deprecation warnings can be disabled by setting deprecation_warnings=False in ansible.cfg. ``` Now appended ``\| bool`` on a lot of the affected variables. Sometimes the coding style from ``variable\|bool`` changed to ``variable \| bool`` (with spaces at the pipe). Closes: #4022 Signed-off-by: L3D <l3d@c3woc.de>	2019-06-06 10:21:17 +02:00
Guillaume Abrioux	80875adba7	ceph-osd: do not relabel /run/udev in containerized context Otherwise content in /run/udev is mislabeled and prevent some services like NetworkManager from starting. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-06-04 11:32:41 -04:00
Guillaume Abrioux	e74d80e72f	rename docker_exec_cmd variable This commit renames the `docker_exec_cmd` variable to `container_exec_cmd` so it's more generic. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-05-16 16:39:13 +02:00
Rishabh Dave	89748d579a	don't access other node's docker_exec_cmd variable Except for some corner case, it's not correct to access some other node's copy of variable docker_exec_cmd. Therefore replace "hostvars[groups[mon_group_name][0]]['docker_exec_cmd']" by "docker_exec_cmd". Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-05-07 12:37:48 +02:00
Dimitri Savineau	ae266c6f2b	ansible: remove private and static attribute This will be removed in ansible 2.8 and breaks the playbook execution with this release. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-05-02 14:25:17 -04:00
Dimitri Savineau	c17106874c	ceph-osd: Increase cpu limit to 4 In containerized deployment the default osd cpu quota is too low for production environment using NVMe devices. This is causing performance degradation compared to bare-metal. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1695880 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-04-24 17:59:42 +02:00
Rishabh Dave	739a662c80	improve coding style Keywords requiring only one item shouldn't express it by creating a list with single item. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-04-23 15:37:07 +02:00
Andrew Schoen	5e3dfe5021	ceph-osd: do not run lvm batch tasks during update When performing a rolling update do not try to create any new osds with `ceph-volume lvm batch`. This is troublesome because when upgrading to nautilus the devices list might contain devices that are currently being used by ceph-disk and have GPT headers on them, which will cause ceph-volume to fail when trying to use such a device. Any devices originally created by ceph-disk will need to be removed from the devices list before any new osds can be created. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2019-04-18 10:55:11 +02:00
Guillaume Abrioux	f899da3172	osd: remove legacy file this file is not used anymore, let's remove it. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-11 11:57:02 -04:00
Guillaume Abrioux	4f68462009	osd: remove ceph-disk scenarios files these files aren't needed anymore since we only use lvm scenario. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-11 11:57:02 -04:00
Guillaume Abrioux	f0416c8892	osd: remove dedicated_devices variable This variable was related to ceph-disk scenarios. Since we are entirely dropping ceph-disk support as of stable-4.0, let's remove this variable. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-11 11:57:02 -04:00
Guillaume Abrioux	4d35e9eeed	osd: remove variable osd_scenario As of stable-4.0, the only valid scenario is `lvm`. Thus, this makes this variable useless. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-11 11:57:02 -04:00
Guillaume Abrioux	4d5637fd8a	osd: remove legacy file ceph_disk_cli_options_facts.yml is not used anymore, let's remove it. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-11 11:57:02 -04:00
Sébastien Han	52df15895b	osd: default osd_scenario to lvm osd_scenario has become obsolete and defaults to lvm. With lvm there is no such things has collocated and non-collocated. Signed-off-by: Sébastien Han <seb@redhat.com>	2019-04-11 11:57:02 -04:00
Sébastien Han	e2a5aa062e	osd: remove ceph-disk support We don't support the preparation of OSD with ceph-disk. ceph-volume is only supported. However, the start operation of OSD is still supported. So let's say you change a config option, the handlers will be able to restart all the OSDs via their respective systemd unit files. Signed-off-by: Sébastien Han <seb@redhat.com> Co-authored-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-11 11:57:02 -04:00
Dimitri Savineau	7e5e4229b7	ceph-volume: Add PYTHONIOENCODING env variable Since https://github.com/ceph/ceph/commit/77912c0 ceph-volume uses stdout encoding based on LC_CTYPE and PYTHONIOENCODING environment variables. Thoses variables aren't set when using ansible. Currently this commit breaks non containerized deployment on Ubuntu. TASK [use ceph-volume to create bluestore osds] ******************** cmd: - ceph-volume - --cluster - ceph - lvm - create - --bluestore - --data - /dev/sdb rc: 1 stderr: \|- Traceback (most recent call last): (...) UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 132: ordinal not in range(128) Note that the task is failing on ansible side due to the stdout decoding but the osd creation is successful. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-04-02 12:41:55 +02:00
Rishabh Dave	e0beaf123a	"when" keyword should precede "block" keyword Otherwise the reader is forced to search for "when" when blocks are too long. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-03-29 16:16:04 +00:00
Guillaume Abrioux	82764afe8d	update: mask systemd service units during upgrade This prevents the packaging from restarting services before we do need to restart them in the rolling update sequence. We want to handle services restart at rolling_update playbook. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Dimitri Savineau	179fdfbc19	ceph-osd: Ensure lvm2 is installed When using osd_scenario lvm, we never check if the lvm2 package is present on the host. When using containerized deployment and docker on CentOS/RedHat this package will be automatically installed as a dependency but not for Ubuntu distribution. OSD deployed via ceph-volume require the lvmetad.socket to be active and running. Resolves: #3728 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-20 22:26:45 +00:00
Guillaume Abrioux	987bdac963	osd: backward compatibility with old disk_list.sh location Since all files in container image have moved to `/opt/ceph-container` this check must look for new AND the old path so it's backward compatible. Otherwise it could end up by templating an inconsistent `ceph-osd-run.sh`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-18 17:25:51 +00:00
Dimitri Savineau	b7f4e3e7c7	ceph-osd: Install numactl package when needed With `3e32dce` we can run OSD containers with numactl support. When using numactl command in a containerized deployment we need to be sure that the corresponding package is installed on the host. The package installation is only executed when the ceph_osd_numactl_opts variable isn't empty. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-12 07:43:06 +00:00
Guillaume Abrioux	b3eb9206fa	osd: support numactl options on OSD activate This commit adds OSD containers activate with numactl support. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1684146 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-11 10:14:50 +01:00
Dimitri Savineau	a089e1ec23	systemd/service: Set docker.service conditionally We don't need to set After=docker.service when the container_binary variable isn't set to docker. It doesn't break anything currently but it could be confusing when using podman. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-07 20:56:11 +00:00
Dimitri Savineau	4d32ecc980	Force osd pool min_size value to integer After `b8d580b` and `e9e5d5a` we could have either item.min_size or osd_pool_default_min_size using string instead of int causing the condition to be true when it's false. As a result, the task could try to set the pool min_size value to 0 which leads to: Error EINVAL: pool min_size must be between 1 and 1 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-05 19:48:09 +00:00
Dimitri Savineau	cb381b41fe	Add CONTAINER_IMAGE env var to ceph daemons Ceph daemons will set the CONTAINER_IMAGE environment variable value in the daemon metadata. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-05 15:07:05 +00:00
Guillaume Abrioux	e9e5d5a39a	fix pool min_size customization `b8d580b3f4` introduced a bug when `min_size` isn't set (default to 0). Typical error: ``` Error EINVAL: pool min_size must be between 1 and 1 ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-05 13:29:34 +00:00
Radu Toader	b8d580b3f4	Customize pools min_size Signed-off-by: Radu Toader <radu.m.toader@gmail.com>	2019-03-05 10:57:15 +00:00
Kevin Coakley	b11dc13476	Updated 7 ansible-lint issues in the ceph-mon, ceph-osd, and ceph-rgw roles The following lint issues have been resolved: [301] Commands should not change things if nothing needs doing /home/travis/build/ceph/ceph-ansible/roles/ceph-mon/tasks/ceph_keys.yml:2 [305] Use shell only when shell functionality is required /home/travis/build/ceph/ceph-ansible/roles/ceph-osd/tasks/start_osds.yml:47 [301] Commands should not change things if nothing needs doing /home/travis/build/ceph/ceph-ansible/roles/ceph-rgw/tasks/multisite/destroy.yml:2 [301] Commands should not change things if nothing needs doing /home/travis/build/ceph/ceph-ansible/roles/ceph-rgw/tasks/multisite/destroy.yml:7 [301] Commands should not change things if nothing needs doing /home/travis/build/ceph/ceph-ansible/roles/ceph-rgw/tasks/multisite/destroy.yml:14 [301] Commands should not change things if nothing needs doing /home/travis/build/ceph/ceph-ansible/roles/ceph-rgw/tasks/multisite/destroy.yml:19 [301] Commands should not change things if nothing needs doing /home/travis/build/ceph/ceph-ansible/roles/ceph-rgw/tasks/multisite/destroy.yml:24 Signed-off-by: Kevin Coakley <kcoakley@sdsc.edu>	2019-03-04 22:25:35 +00:00
Dimitri Savineau	45a7082712	lint: Fix spaces before and after variables ansible-lint reports: [206] Variables should have spaces after {{ and before }} Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-01 17:22:24 +00:00
Kevin Coakley	038401fef2	Add changed_when: false to the "get osd ids" statement The "get osd ids" statement only registers the osd_ids_non_container variable. Running "ls /var/lib/ceph/osd/ \| sed 's/.*-//'" should never produce a change on the system. Adding changed_when: false prevents irrelevant change messages from Ansible. Signed-off-by: Kevin Coakley <kcoakley@sdsc.edu>	2019-02-28 22:46:19 +00:00
Guillaume Abrioux	d5be83e504	osd: add ipc=host in systemd template for containers in addition to `15812970f0` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-02-28 13:14:09 +00:00
Dimitri Savineau	dc1c0dcee2	ceph-osd: Drop memory flag with bluestore Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-02-26 07:27:06 +00:00
Guillaume Abrioux	21e5db8982	osd: make the 'wait for all osd to be up' task configurable introduce two new variables to make the check that 'wait for all osd to be up' configurable. It's possible that for some deployments, OSDs can take longer to be seen as UP and IN. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1676763 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-02-20 16:06:04 +00:00
David Waiting	3930791cb7	ensure at least one osd is up The existing task checks that the number of OSDs is equal to the number of up OSDs before continuing. The problem is that if none of the OSDs have been discovered yet, the task will exit immediately and subsequent pool creation will fail (num_osds = 0, num_up_osds = 0). This is related to Bugzilla 1578086. In this change, we also check that at least one OSD is present. In our testing, this results in the task correctly waiting for all OSDs to come up before continuing. Signed-off-by: David Waiting <david_waiting@comcast.com>	2019-02-19 18:31:05 +00:00
Guillaume Abrioux	d4e31b90a6	Revert "osd: container remove --pid=host" This reverts commit `bb2bbeb941`. Looks like when not passing `--pid=host` we are facing some issues when deploying more than 2 OSDs in containerized environment. At the moment, we are still troubleshooting this issue but we prefer to revert this commit so it doesn't block any PR in the CI. As soon as we have a fix; we will push a new PR to remove `--pid=host` (a revert of revert...) Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-02-14 10:34:37 +00:00
Guillaume Abrioux	8c8ec63633	container: use tmpfiles.d to creates /run/ceph instead of using `RuntimeDirectory` parameter in systemd unit files, let's use a systemd `tmpfiles.d` to ensure `/run/ceph`. Explanation: `podman` doesn't create the `/var/run/ceph` if it doesn't exist the time where the container is run while `docker` used to create it. In case of `switch_to_containers` scenario, `/run/ceph` gets created by a tmpfiles.d systemd file; when switching to containers, the systemd unit file complains because `/run/ceph` already exists The better fix would be to ensure `/usr/lib/tmpfiles.d/ceph-common.conf` is removed and only rely on `RuntimeDirectory` from systemd unit file parameter but we come from a non-containerized environment which is already running, it means `/run/ceph` is already created and when starting the unit to start the container, systemd will still complain and we can't simply remove the directory if daemons are collocated. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-02-13 09:42:27 +01:00
Sébastien Han	bb2bbeb941	osd: container remove --pid=host Let's try again with the Nautilus release. Closes: https://github.com/ceph/ceph-ansible/issues/1297 Signed-off-by: Sébastien Han <seb@redhat.com>	2019-02-07 12:13:51 +00:00
John Fulton	cc0bf197e1	Fix CNI error when net=host is not used on OSD calls Follow up fix that `410abd7` missed. Related: ceph#3561 Signed-off-by: John Fulton <fulton@redhat.com>	2019-02-05 22:49:01 +00:00
Guillaume Abrioux	914d94cae8	set RuntimeDirectory in all systemd unit templates /var/run/ceph resides in a non persistent filesystem (tmpfs) After a reboot, all daemons won't start because this directory will be missing. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-02-05 18:14:28 +01:00
Guillaume Abrioux	7ade032807	osd: bind mount /var/run/udev/ without this, the command `ceph-volume lvm list --format json` hangs and takes a very long time to complete. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-02-05 18:14:28 +01:00
Guillaume Abrioux	16efdbc59b	podman: support podman installation on rhel8 Add required changes to support podman on rhel8 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1667101 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-02-05 18:14:28 +01:00
Andrew Schoen	88eda479a9	ceph-facts: generate devices when osd_auto_discovery is true This task used to live in ceph-osd, but we need it defined here to that ceph-config can use it when trying to determine the number of osds. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2019-02-01 12:28:12 +01:00
Kai Wembacher	1dd26f76bf	document missing support for non-containerized deployment Signed-off-by: Kai Wembacher <kai@ktwe.de>	2018-12-21 15:37:55 +00:00
Kai Wembacher	a273ed7f60	add support for rocksdb and wal on the same partition in non-collocated Signed-off-by: Kai Wembacher <kai@ktwe.de>	2018-12-20 14:19:46 +01:00
Guillaume Abrioux	d7e77012ef	retry on packages and repositories failures add register/until on all packaging related tasks to avoid non valid CI failure. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-12-19 14:48:27 +00:00
Guillaume Abrioux	1b8b5e0aac	meta: set the right minimum ansible version required for galaxy ceph-ansible@master requires the latest stable ansible version. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-12-11 09:59:25 +01:00
Noah Watkins	3cf5fd2c3e	start_osds: use list instead of keys (re-introduce) the python3 fix merged by: https://github.com/ceph/ceph-ansible/pull/3346 was reintroduced a few days later by: `82a6b5adec` and this patch fixes it again :) Signed-off-by: Noah Watkins <nwatkins@redhat.com>	2018-12-05 23:25:35 +00:00
Sébastien Han	82a6b5adec	osd: manage legacy ceph-disk non-container startup The code is now able (again) to start osds that where configured with ceph-disk on a non-container scenario. Closes: https://github.com/ceph/ceph-ansible/issues/3388 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `452069cb3a`)	2018-12-03 16:01:57 +01:00
Sébastien Han	ec2d1f502d	osd: re-introduce disk_list check This commit `4cc1506303 (diff-51bbe3572e46e3b219ad726da44b64ebL13)` accidentally removed this check. This is a must have for ceph-disk based containerized OSDs. Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `9b5a93e3a5`)	2018-12-03 16:01:57 +01:00
Sébastien Han	4c51130198	osd: discover osd_objectstore on the fly Applying and passing the OSD_BLUESTORE/FILESTORE on the fly is wrong for existing clusters as their config will be changed. Typically, if an OSD was prepared with ceph-disk on filestore and we change the default objectstore to bluestore, the activation will fail. The flag osd_objectstore should only be used for the preparation, not activation. The activate in this case detects the osd objecstore which prevents failures like the one described above. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-12-03 14:11:47 +00:00
Sébastien Han	bef522627e	ceph-osd: change jinja condition If an existing cluster runs this config, and has ceph-disk OSD, the `expose_partitions` won't be expected by jinja since it's inside the 'old' if. We need it as part of the osd_scenario != 'lvm' condition. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1640273 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-12-03 14:11:47 +00:00
Sébastien Han	14fc5bad12	mon: do not serialized container bootstrap This commit unifies the container and non-container code, which in the meantime gives use the ability to deploy N mon container at the same time without having to serialized the deployment. This will drastically reduces the time needed to bootstrap the cluster. Note, this is only possible since Nautilus because the monitors are bootstrap the initial keys on their own once they reach quorum. In the Nautilus version of the ceph-container mon, we stopped generating the keys 'manually' from inside the container, for more detail see: https://github.com/ceph/ceph-container/pull/1238 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-12-03 14:39:43 +01:00
Guillaume Abrioux	ccc0c9c24c	osd: remove a leftover this file is never included in ceph-osd, looks like a leftover let's remove it. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-12-03 09:12:02 +01:00
Guillaume Abrioux	0187166926	osd: remove an incorrect information This is false, `./defaults/main.yml` is not supposed to be modified directly. groups_vars a/o host_vars should always be preferred. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-12-03 08:11:35 +00:00
Guillaume Abrioux	fead0813b4	remove kv store support the next stable release will drop this feature. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-11-30 13:45:12 +00:00
Sébastien Han	bc2daaeb71	ceph-osd fix batch with container binary Signed-off-by: Sébastien Han <seb@redhat.com>	2018-11-27 16:47:40 +00:00
Sébastien Han	80ba45793d	fix template generation Position the right condition on ceph_docker_version, activate it when the container_binary is 'docker'. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-11-27 16:47:40 +00:00
Sébastien Han	f192bc92a2	ceph_key: use the right container runtime binary Rework all the ceph_key invocation to use either docker or podman binary. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-11-27 16:47:40 +00:00
Sébastien Han	a96e910114	Add new container scenario Test with podman instead of docker and also support for python 3 only. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-11-27 16:47:40 +00:00
Sébastien Han	997667a873	osd: expose udev into the container In order to be able to retrieve udev information, we must expose its socket. As per, https://github.com/ceph/ceph/pull/25201 ceph-volume will start consuming udev output. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-11-26 18:57:12 +00:00
Guillaume Abrioux	7774069d45	refact osd pool size customization Add real default value for osd pool size customization. Ceph itself has an `osd_pool_default_size` default value to `3`. If users don't specify a pool size in various pools definition within ceph-ansible, we should default to `3`. By the way, this kind of condition isn't really clear: ``` when: - rbd_pool_size \| default ("") ``` we should try to get the customized value then default to what is in `osd_pool_default_size` (which has its default value pointing to `ceph_osd_pool_default_size` (`3`) as well) and compare it to `ceph_osd_pool_default_size`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-11-21 15:42:50 +00:00
Guillaume Abrioux	d4c0960f04	mon: move `osd_pool_default_pg_num` in `ceph-defaults` `osd_pool_default_pg_num` parameter is set in `ceph-mon`. When using ceph-ansible with `--limit` on a specifc group of nodes, it will fail when trying to access this variables since it wouldn't be defined. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1518696 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-11-21 15:42:50 +00:00
Boris Ranto	c2b0cbd699	start_osds: Use list instead of keys If you use python3 based ansible then keys() returns a dict_keys object, not a list of keys. This breaks the installation on such a system. Using the list filter provides a more robust solution that should work on both python2 and python3 based ansible. You can find some more information about the issue, here: https://github.com/ansible/ansible/issues/19514 Signed-off-by: Boris Ranto <branto@redhat.com>	2018-11-20 18:48:22 +01:00
Noah Watkins	64dee9be0c	Remove outdated documentation Fixes BZ https://bugzilla.redhat.com/show_bug.cgi?id=1640525 Signed-off-by: Noah Watkins <nwatkins@redhat.com>	2018-11-15 22:26:19 +00:00
Guillaume Abrioux	f7fcc012e9	osd: commonize start_osd code since `ceph-volume` introduction, there is no need to split those tasks. Let's refact this part of the code so it's clearer. By the way, this was breaking rolling_update.yml when `openstack_config: true` playbook because nothing ensured OSDs were started in ceph-osd role (In `openstack_config.yml` there is a check ensuring all OSD are UP which was obviously failing) and resulted with OSDs on the last OSD node not started anyway. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-11-12 10:51:48 +01:00
Sébastien Han	72cae542da	lint: Don't compare to empty string description = 'Use `when: var` rather than `when: var != ""` (or ' \ 'conversely `when: not var` rather than `when: var == ""`)' Signed-off-by: Sébastien Han <seb@redhat.com>	2018-11-08 10:22:02 +00:00
Sébastien Han	f9ddc27cd5	lint: meta add company info Signed-off-by: Sébastien Han <seb@redhat.com>	2018-11-08 10:22:02 +00:00
Sébastien Han	094ae8baf1	lint: do not use local_action Use delegate_to: localhost instead. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-11-08 10:22:02 +00:00
Sébastien Han	037bab2922	lint: line length should not exceed 160 chars Line was too long Signed-off-by: Sébastien Han <seb@redhat.com>	2018-11-08 10:22:02 +00:00
Sébastien Han	ca7ed7dd81	galaxy roles: polish metadata Update the meta with the relavant support such as: * ansible version: min 2.4 * distro supported (tested on) centos 7 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-31 17:48:58 +01:00
Sébastien Han	a882ad7ade	lint: use command instead of shell Use command when the tasks does not have any pipes or wilcards. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-31 14:18:36 +01:00
Sébastien Han	53972ee672	lint: add changed_when to command Calling command should have changed_when false otherwise each time it runs it will show as 'changed' and this is irrelevant. Commands should not change things if nothing needs doing Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-31 14:18:36 +01:00
Rishabh Dave	8edbda96df	use blocks directives to group tasks Using block directives simplifies the playbooks and makes them more readable. Fixes: https://github.com/ceph/ceph-ansible/issues/2835 Signed-off-by: Rishabh Dave <ridave@redhat.com>	2018-10-31 09:37:43 +01:00
Sébastien Han	d209fc9d02	lint yaml Fix [error] too many blank lines (1 > 0) (empty-lines) Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-30 14:41:36 +01:00
Guillaume Abrioux	748342f5b6	roles: fix _docker_memory_limit default value append 'm' suffix to specify the unit size used in all `_docker_memory_limit`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-29 14:59:09 +01:00
Neha Ojha	b7e4d4eb84	roles: do not limit docker_memory_limit for various daemons Since we do not have enough data to put valid upper bounds for the memory usage of these daemons, do not put artificial limits by default. This will help us avoid failures like OOM kills due to low default values. Whenever required, these limits can be manually enforced by the user. More details in https://bugzilla.redhat.com/show_bug.cgi?id=1638148 Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1638148 Signed-off-by: Neha Ojha <nojha@redhat.com>	2018-10-29 14:59:09 +01:00
Rishabh Dave	ee2d52d33d	allow custom pool size Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1596339 Signed-off-by: Rishabh Dave <ridave@redhat.com>	2018-10-22 16:00:21 +02:00
Sébastien Han	fbd878c8d5	infra: rename osd-configure to add-osd and improve it The playbook has various improvements: * run ceph-validate role before doing anything * run ceph-fetch-keys only on the first monitor of the inventory list * set noup flag so PGs get distributed once all the new OSDs have been added to the cluster and unset it when they are up and running Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1624962 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-17 11:26:11 +00:00
Guillaume Abrioux	40b7747af7	remove jewel support As of now, we should no longer support Jewel in ceph-ansible. The latest ceph-ansible release supporting Jewel is `stable-3.1`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-12 23:38:17 +00:00
Sébastien Han	31a0438cb2	ceph_volume: refactor This commit does a couple of things: * Avoid code duplication * Clarify the code * add more unit tests * add myself to the author of the module Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Sébastien Han	bfe689094e	osd: do not run when lvm scenario This task was created for ceph-disk based deployments so it's not needed when osd are prepared with ceph-volume. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Sébastien Han	a948677de1	osd: ceph-volume activate, just pass the OSD_ID We don't need to pass the device and discover the OSD ID. We have a task that gathers all the OSD ID present on that machine, so we simply re-use them and activate them. This also handles the situation when you have multiple OSDs running on the same device. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Sébastien Han	5f35910ee1	osd: change unit template for ceph-volume container We don't need to pass the hostname on the container name but we can keep it simple and just call it ceph-osd-$id. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Sébastien Han	ece9e9812e	osd: do not use expose_partitions on lvm expose_partitions is only needed on ceph-disk OSDs so we don't need to activate this code when running lvm prepared OSDs. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Sébastien Han	e39fc4f6ce	ceph_volume: add container support for batch command The batch option got recently added, while rebasing this patch it was necessary to implement it. So now, the batch option can work on containerized environments. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1630977 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Sébastien Han	3ddcc9af16	ceph_volume: try to get ride of the dummy container If we run on a containerized deployment we pass an env variable which contains the container image. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Sébastien Han	aa2c1b27e3	ceph-osd: ceph-volume container support Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Noah Watkins	306e308f13	Avoid using tests as filter Fixes the deprecation warning: [DEPRECATION WARNING]: Using tests as filters is deprecated. Instead of using `result\|search` use `result is search`. Signed-off-by: Noah Watkins <nwatkins@redhat.com>	2018-10-10 04:26:33 +00:00
Andrew Schoen	c453ea25c0	ceph-osd: use journal_size and block_db_size for lvm batch Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-10-09 10:09:50 -04:00
Rishabh Dave	b5d2ea269f	don't use "static" field while including tasks Instead used "import_tasks" and "include_tasks" to tell whether tasks must be included statically or dynamically. Fixes: https://github.com/ceph/ceph-ansible/issues/2998 Signed-off-by: Rishabh Dave <ridave@redhat.com>	2018-10-04 07:44:28 +00:00
Rishabh Dave	380168dadc	don't use "include" to include tasks Use "import_tasks" or "include_tasks" instead. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2018-09-27 17:53:40 +02:00
Andrew Schoen	b36f3e06b5	ceph_volume: adds the osds_per_device parameter If this is set to anything other than the default value of 1 then the --osds-per-device flag will be used by the batch command to define how many osds will be created per device. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-09-12 20:27:14 +00:00
Sébastien Han	9ba670567e	remove warning for unsupported variables As promised, these will go unsupported for 3.1 so let's actually remove them :). Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1622729 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-08-28 13:31:57 -07:00
Sébastien Han	8c70a5b197	osd: fix ceph_release We need ceph_release in the condition, not ceph_stable_release Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1619255 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-08-20 20:14:56 +02:00
Sébastien Han	3149b2564f	Revert "osd: generate device list for osd_auto_discovery on rolling_update" This reverts commit `e84f11e99e`. This commit was giving a new failure later during the rolling_update process. Basically, this was modifying the list of devices and started impacting the ceph-osd itself. The modification to accomodate the osd_auto_discovery parameter should happen outside of the ceph-osd. Also we are trying to not play ceph-osd role during the rolling_update process so we can speed up the upgrade. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-08-16 11:13:12 +02:00
Andrew Schoen	6423ab4ad3	lvm: fix condition when selecting which scenario to run devices and lvm_volumes will always be defined, so we need to instead check it's length before deciding to run the scenario. This fixes the failure here: https://2.jenkins.ceph.com/job/ceph-ansible-prs-luminous-bluestore_lvm_osds/86/consoleFull#1667273050b5dd38fa-a56e-4233-a5ca-584604e56e3a Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-08-10 11:46:12 +02:00
Sébastien Han	e84f11e99e	osd: generate device list for osd_auto_discovery on rolling_update rolling_update relies on the list of devices when performing the restart of the OSDs. The task that is builind the devices list out of the ansible_devices dict only runs when there are no partitions on the drives. However during an upgrade the OSD are already configured, they have been prepared and have partitions so this task won't run and thus the devices list will be empty, skipping the restart during rolling_update. We now run the same task under different requirements when rolling_update is true and build a list when: * osd_auto_discovery is true * rolling_update is true * ansible_devices exists * no dm/lv are part of the discovery * the device is not removable * the device has more than 1 sector Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1613626 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-08-10 09:19:40 +02:00
Andrew Schoen	3592c68cca	ceph-osd: adds crush_device_class config option This is used with the lvm osd scenario. When using devices you need the option to set the crush device class for all of the OSDs that are created from those devices. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-08-09 09:41:58 -04:00
Andrew Schoen	6d431ec22d	ceph-volume: implement the 'lvm batch' subcommand This adds the action 'batch' to the ceph-volume module so that we can run the new 'ceph-volume lvm batch' subcommand. A functional test is also included. If devices is defind and osd_scenario is lvm then the 'ceph-volume lvm batch' command will be used to create the OSDs. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-08-09 09:41:58 -04:00

1 2 3 4 5 ...

563 Commits (8aed824f71e7f4ed53cd9e8e30964eff298c95cf)