ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	b723ef3fa2	purge: fix lvm-batch purge osd `lvm_volumes` and/or `devices` variable(s) can be undefined depending on the scenario chosen. These tasks should be run only if these variable are defined, otherwise it ends up with undefined variable errors. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1653307 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `0180738313`)	2019-04-04 03:38:52 +02:00
Guillaume Abrioux	3fd4354aaa	tests: switch rhel-container-podman to nautilus in stable-4.0 this should be set to nautilus. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-03 11:27:46 +02:00
Dimitri Savineau	0274b880f1	ceph-volume: Add PYTHONIOENCODING env variable Since https://github.com/ceph/ceph/commit/77912c0 ceph-volume uses stdout encoding based on LC_CTYPE and PYTHONIOENCODING environment variables. Those variables aren't set when using ansible. Currently this commit breaks non containerized deployment on Ubuntu. TASK [use ceph-volume to create bluestore osds] ******************** cmd: - ceph-volume - --cluster - ceph - lvm - create - --bluestore - --data - /dev/sdb rc: 1 stderr: |- Traceback (most recent call last): (...) UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 132: ordinal not in range(128) Note that the task is failing on ansible side due to the stdout decoding but the osd creation is successful. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `7e5e4229b7`)	2019-04-03 11:27:46 +02:00
Guillaume Abrioux	655ac5eb93	tests: test idempotency only on all_daemons job there's no need to test this on all scenarios. testing idempotency on all_daemons should be enough and allow us to save precious resources for the CI. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `136bfe096c`)	2019-04-03 11:27:46 +02:00
Dimitri Savineau	47d6e505a0	tox: Set nautilus as default release On stable-4.0 branch we don't want to use dev setup but stable release (nautilus). Also update the container image tag to reflect this change. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-04-01 17:00:36 +02:00
Guillaume Abrioux	f55e2b08be	remove all NBSPs on master branch Similar to #3658 Since there's too many changes between master and stable branches let's commit directly in each branches instead of trying to backport this commit. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-28 11:57:55 +00:00
Dimitri Savineau	40a8e1160c	container: Add python3-docker on Ubuntu bionic When installing python-minimal on Ubuntu bionic, this will add the /usr/bin/python symlink to the default python interpreter. On bionic, this isn't python2 but python3. $ /usr/bin/python --version Python 3.6.7 The python docker library is only installed for python2 which causes issues when running the purge-docker-cluster playbook. This playbook uses the ansible docker modules and requires to have python bindings installed on the remote host. Without the bindings we can see python error reported by the docker module. msg: Failed to import docker or docker-py - No module named 'docker'. Try `pip install docker` or `pip install docker-py` (Python 2.6) Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-28 08:03:58 +00:00
Dimitri Savineau	7b7f79171a	tests/functional: Use the ansible reboot module Ansible 2.7 introduces the reboot module so we don't need to use the shell/reboot + wait_for tasks. https://docs.ansible.com/ansible/latest/modules/reboot_module.html Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-27 08:30:50 +00:00
Dimitri Savineau	bd0869cd01	tox: Fix container purge jobs On containerized CI jobs the playbook executed is purge-cluster.yml but it should be set to purge-docker-cluster.yml Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-26 21:36:56 +00:00
Dimitri Savineau	c8442f3705	rolling_update: Update systemd unit regex for nvme The systemd unit regex doesn't handle nvme devices (/dev/nvmeXn1). Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1687828 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-26 12:01:00 +00:00
Dimitri Savineau	4cca366102	travis: Remove galaxy lint rules repository The galaxy-lint-rules github repository isn't used anymore and has been archived. All the rules are now part of the ansible-lint project. https://github.com/ansible/galaxy-lint-rules https://github.com/ansible/ansible-lint Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-26 11:08:38 +00:00
Dimitri Savineau	94505a3af2	Add uca to ceph_repository choices validation Ubuntu cloud archive is configurable via ceph_repository variable but the uca choice isn't accepted. This commit fixes this issue and also validates the associated uca repository variables. Resolves: #3739 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-26 09:03:44 +00:00
Guillaume Abrioux	6f47c20c3a	rgw: fix a typo `ee2d52d33d` introduced a typo. This commit fixes it. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	3c4f464c54	rgw: cleanup legacy task this task was here for backward compatibility. It's time to remove it in the next release. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	9134624578	rgw: add a retry on pool related tasks sometimes those tasks might fail because of a timeout. I've been facing this several times in the CI, adding this retry might help and won't hurt in any case. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	78aac3e96a	update: followup on `edfdc49` all rgw instances should be stopped according to the multiple rgw instances support added in rolling_update.yml Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	f6e0185146	update: add containerized deployment upgrade support (L->N) Add a couple of fixes to allow containerized deployments upgrade support to upgrade from luminous/mimic to nautilus. - pass CEPH_CONTAINER_IMAGE and CEPH_CONTAINER_BINARY environment variable to the ceph_key module, - fix the docker exec command in 'waiting for the containerized monitor to join the quorum' task according to the `delegate_to` parameter, - override `docker_exec_cmd` in `ceph-facts` with `mon_host` when rolling_update is `True`, - do not run unnecessarily `create_mds_filesystems.yml` when performing an upgrade. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	1816b876ee	update: add missing hosts in facts gathering iscsigws were missing. The 'complete upgrade' couldn't complete because rolling_update was set to False for iscsigw nodes. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	45ba90c169	update: remove rbdmirror legacy task This task is no longer needed for next release. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	0ea0adf039	update: show all daemons version at the end Let's display all daemons version at the end of the playbook. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	7386249c71	facts: retrieve fsid during rolling_update playbook otherwise it generates a new cluster fsid and makes the upgrade failing Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	5c3ce4ca77	mon: fetch initial keyring even when running rolling_update otherwise, the task to copy mgr keyring fails during the rolling_update. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	f0e616962d	tests: split tox configuration into multiple pieces This file is becoming too big, let's isolate the update related code in a dedicated tox configuration file. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	f31d6d9485	update: enable new nautilus-only functionality once the cluster is upgraded to nautilus, we can complete the process by disallowing pre-nautilus OSDs and enabling all new nautilus-only functionality Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	afdaa70a63	update: enable msgr2 protocol This commit enable the msgr2 protocol when the cluster is fully upgraded to nautilus Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	ef096dd021	update: ensure mgrs are upgraded after ALL monitors As of `1c760904b0`, ceph-ansible implicitly bootstraps managers on monitors. mgrs must be upgraded only after all monitors; therefore, this commit refactors the way mgrs are upgraded to be sure we don't upgrade a mgr during the monitors upgrade. This commit also ensures we handle the case where we split managers on dedicated nodes. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	7fa2434f0f	update: ensure /var/lib/ceph/bootstrap-rbd-mirror is present This directory is created by ceph-config node by node. In the upgrade context we need it to be created on ALL monitors as soon as the first iteration because of the task right after which creates and sends the keyrings on all monitors. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	82764afe8d	update: mask systemd service units during upgrade This prevents the packaging from restarting services before we do need to restart them in the rolling update sequence. We want to handle services restart at rolling_update playbook. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	8add55451c	update: set osd flags only once There is no need to set osd flags (noout, norebalance) each time we upgrade a mon. This commit moves up those tasks (before stopping the mon) so we don't need to delegate them. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	f7c6f4e0b6	update: fix tasks waiting for the node to join the quorum We actually want to ensure the node being upgraded is joining the quorum instead of the monitor picked up earlier. Indeed, the `mon_host` is used only in `delegate_to:` so we can still run ceph commands while the monitor being upgraded is stopped. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	32569b79e2	update: remove an old parameter in ceph_key module call the `containerized` parameter in ceph_key module doesn't exist anymore. This was making the module failing but was hidden because of the `ignore_errors: True`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	b4f14aba8e	ceph_key: `lookup_ceph_initial_entities` shouldn't fail on update As of nautilus, the initial keyrings list has changed, it means when upgrading from Luminous or Mimic, it is expected there's a mismatch between what is found on the cluster and the expected initial keyring list hardcoded in ceph_key module. We shouldn't fail when upgrading to nautilus. str_to_bool() took from ceph-volume. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> Co-Authored-by: Alfredo Deza <adeza@redhat.com>	2019-03-25 16:02:56 -04:00
Guillaume Abrioux	e99305c684	handlers: do not trigger handlers on rolling_update rolling_update playbook already takes care of stopping/starting services during the sequence. There's no need to trigger potential unwanted services restart. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-25 16:02:56 -04:00
Dimitri Savineau	179fdfbc19	ceph-osd: Ensure lvm2 is installed When using osd_scenario lvm, we never check if the lvm2 package is present on the host. When using containerized deployment and docker on CentOS/RedHat this package will be automatically installed as a dependency but not for Ubuntu distribution. OSD deployed via ceph-volume require the lvmetad.socket to be active and running. Resolves: #3728 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-20 22:26:45 +00:00
Bruceforce	6d506dba1a	ceph_crush: fix rstrip for python 3 Removing bytes literals since rstrip only supports type String or None. Please backport to stable-3.2 Signed-off-by: Bruceforce <markus.greis@gmx.de>	2019-03-19 23:57:22 +00:00
Phuong Nguyen	3305309e87	Remove trailing forward slash in ceph_docker_registry variable from group_vars/rhcs.yml.sample file. Also fixed rhcs_edits.txt for variable ceph_docker_registry. Moved namespace to ceph_docker_image variable. Signed-off-by: Phuong Nguyen <pnguyen@redhat.com>	2019-03-19 13:15:59 +00:00
Guillaume Abrioux	987bdac963	osd: backward compatibility with old disk_list.sh location Since all files in container image have moved to `/opt/ceph-container` this check must look for new AND the old path so it's backward compatible. Otherwise it could end up by templating an inconsistent `ceph-osd-run.sh`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-18 17:25:51 +00:00
Dimitri Savineau	5c39735be5	ceph-validate: fail if there's no ipaddr available in monitor_address_block subnet When using monitor_address_block to determine the ip address of the monitor node, we need an ip address available in that cidr to be present in the ansible facts (ansible_all_ipv[46]_addresses). Currently we don't check if there's an ip address available during the ceph-validate role. As a result, the ceph-config role fails due to an empty list during ceph.conf template creation but the error isn't explicit. TASK [ceph-config : generate ceph.conf configuration file] ***** fatal: [0]: FAILED! => {"msg": "No first item, sequence was empty."} With this patch we will fail before the ceph deployment with an explicit failure message. Resolves: rhbz#1673687 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-18 16:35:36 +00:00
Dimitri Savineau	a7b1e35a16	ceph-common: Install yum plugin priorities When using community repository we need to set the priority on the ceph repositories because we could have some conflict with EPEL packages. In order to set the priority on the ceph repositories, we need to install the yum-plugin-priorities package. http://docs.ceph.com/docs/master/install/get-packages/#rpm-packages Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-16 06:24:55 +00:00
Guillaume Abrioux	655bdb189c	Revert "site.yml: run ceph-validate before facts/defaults roles" This commit wasn't making any sense and should have never got merged. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-15 16:17:15 +00:00
Rishabh Dave	f7b20dbb48	don't append path components while calling os.path.join() This creates a confusion whether directory/file names are being formed by appending strings or path components are being appended. Since latter should never be done manually, get rid of the statements creating confusion. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-03-14 22:35:12 +00:00
Rishabh Dave	ba949acab7	don't use os.path.join() on a single path component Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-03-14 22:35:12 +00:00
Rishabh Dave	62abe7068a	use os.path.join() correctly os.path.join adds the separator (i.e. '/') between the provided path components only if needed. Providing a single path component doesn't lead to any checks. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-03-14 22:35:12 +00:00
wumingqiao	31617afca9	ceph-mgr: run mgr_modules.yml only on the first mgr host the task will be delegated to mons[0] for all mgr hosts, so we can just run it on the first host and have the same effect. Signed-off-by: wumingqiao <wumingqiao@beyondcent.com>	2019-03-14 20:16:33 +00:00
Dimitri Savineau	d8538ad4e1	Set the default crush rule in ceph.conf Currently the default crush rule value is added to the ceph config on the mon nodes as an extra configuration applied after the template generation via the ansible ini module. This implies two behaviors: 1/ On each ceph-ansible run, the ceph.conf will be regenerated via ceph-config+template and then ceph-mon+ini_file. This leads to a non necessary daemons restart. 2/ When other ceph daemons are collocated on the monitor nodes (like mgr or rgw), the default crush rule value will be erased by the ceph.conf template (mon -> mgr -> rgw). This patch adds the osd_pool_default_crush_rule config to the ceph template and only for the monitor nodes (like crush_rules.yml). The default crush rule id is read (if exist) from the current ceph configuration. The default configuration is -1 (ceph default). Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1638092 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-14 08:56:52 +00:00
Dimitri Savineau	b7f4e3e7c7	ceph-osd: Install numactl package when needed With `3e32dce` we can run OSD containers with numactl support. When using numactl command in a containerized deployment we need to be sure that the corresponding package is installed on the host. The package installation is only executed when the ceph_osd_numactl_opts variable isn't empty. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-12 07:43:06 +00:00
Guillaume Abrioux	b89a1d5c84	samples: resync sample files I suspect `./generate_group_vars_sample.sh` wasn't used in `b8d580b3f4` because it introduced a typo in `group_vars/all.yml.sample` and `group_vars/clients.yml.sample`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-11 10:14:50 +01:00
Guillaume Abrioux	b3eb9206fa	osd: support numactl options on OSD activate This commit adds OSD containers activate with numactl support. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1684146 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-03-11 10:14:50 +01:00
Dimitri Savineau	b23c05ae52	add-osd.yml: Add become flag for ceph-validate The check_devices task fails if the ceph-validate role isn't executed as a privileged user (Permission denied). failed: [osd0] (item=/dev/sdb) => {"changed": false, "err": "Error: Error opening /dev/sdb: Permission denied\n", "item": "/dev/sdb", "msg": "Error while getting device information with parted script: '/sbin/parted -s -m /dev/sdb -- unit 'MiB' print'", "out": "", "rc": 1} Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-09 05:54:46 +00:00
Dimitri Savineau	a089e1ec23	systemd/service: Set docker.service conditionally We don't need to set After=docker.service when the container_binary variable isn't set to docker. It doesn't break anything currently but it could be confusing when using podman. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-03-07 20:56:11 +00:00
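
The three `os.path.join()` commits above (`62abe7068a`, `ba949acab7`, `f7b20dbb48`) rely on standard-library behaviour that is easy to check. The sketch below is an illustration only, not code from the repository: the paths and the `cluster` value are hypothetical, and the results shown assume a POSIX system where the separator is `/`.

```python
import os.path

# A single component comes back unchanged, so wrapping it in join() adds nothing.
single = os.path.join("/etc/ceph")

# The separator is inserted between components only when it is missing.
joined = os.path.join("/etc/ceph", "ceph.conf")
with_trailing_sep = os.path.join("/etc/ceph/", "ceph.conf")

# Build a file name by string concatenation first, then pass it to join()
# as one component, so it stays clear which parts are path components.
cluster = "ceph"  # hypothetical cluster name
conf_name = cluster + ".conf"
conf_path = os.path.join("/etc/ceph", conf_name)

print(single)             # /etc/ceph
print(joined)             # /etc/ceph/ceph.conf
print(with_trailing_sep)  # /etc/ceph/ceph.conf
print(conf_path)          # /etc/ceph/ceph.conf
```

Keeping name construction separate from path construction is the distinction the commit messages draw: strings may be concatenated to form a file name, but path components should be combined only through `os.path.join()`.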

4477 Commits (b723ef3fa2d1c418c65808b2fb558fe9b92e5541)
