ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	a6dac8c93d	crash: refact caps definition there is no need to use `{{ }}` syntax here. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `a8bd947c7d`)	2020-10-20 09:09:14 +02:00
Guillaume Abrioux	e6b3186420	ceph-volume: refresh lvm metadata cache When running rhel8 containers on a rhel7 host, after zapping an OSD there's a discrepancy with the lvmetad cache that needs to be refreshed. Otherwise, the host still sees the LV, which can confuse the user. If the user tries to redeploy an OSD, it will fail because the LV isn't present and needs to be recreated. i.e.: ``` stderr: lsblk: ceph-block-8/block-8: not a block device stderr: blkid: error: ceph-block-8/block-8: No such file or directory stderr: Unknown device, --name=, --path=, or absolute path in /dev/ or /sys expected. usage: ceph-volume lvm prepare [-h] --data DATA [--data-size DATA_SIZE] [--data-slots DATA_SLOTS] [--filestore] [--journal JOURNAL] [--journal-size JOURNAL_SIZE] [--bluestore] [--block.db BLOCK_DB] [--block.db-size BLOCK_DB_SIZE] [--block.db-slots BLOCK_DB_SLOTS] [--block.wal BLOCK_WAL] [--block.wal-size BLOCK_WAL_SIZE] [--block.wal-slots BLOCK_WAL_SLOTS] [--osd-id OSD_ID] [--osd-fsid OSD_FSID] [--cluster-fsid CLUSTER_FSID] [--crush-device-class CRUSH_DEVICE_CLASS] [--dmcrypt] [--no-systemd] ceph-volume lvm prepare: error: Unable to proceed with non-existing device: ceph-block-8/block-8 ``` Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1886534 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `0bb106045e`)	2020-10-19 17:56:39 -04:00
Benoît Knecht	5e67492ef4	ceph-osd: Fix check mode for start osds tasks Correctly set `osd_ids_non_container.stdout_lines` to an empty list if it's undefined (i.e. in check mode). Signed-off-by: Benoît Knecht <bknecht@protonmail.ch> (cherry picked from commit `8b0023cb77`)	2020-10-19 22:53:20 +02:00
Benoît Knecht	9f5ec22d34	ceph-mon: Fix check mode for deploy monitor tasks Skip the `get initial keyring when it already exists` task when both commands whose `stdout` output it requires have been skipped (e.g. when running in check mode). Signed-off-by: Benoît Knecht <bknecht@protonmail.ch> (cherry picked from commit `8f436ab5d8`)	2020-10-19 22:53:20 +02:00
Gaudenz Steinlin	f8a64ce452	ceph-crash: Only deploy key to targeted hosts The current task installs the ceph-crash key to "most" hosts via "delegate_to". This key is only used by the ceph-crash daemon and should just be installed on all hosts targeted by this role. There is no need for using a delegated task. Signed-off-by: Gaudenz Steinlin <gaudenz.steinlin@cloudscale.ch> (cherry picked from commit `68cc93fb18`)	2020-10-19 20:20:25 +02:00
Dimitri Savineau	79661bda7e	flake8: run the workflow conditionally We don't need to run flake8 on ansible modules and their tests if we don't have any modifications. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `00b7ee27df`)	2020-10-19 13:33:30 -04:00
Guillaume Abrioux	0c66f90968	ceph-osd: start osd after systemd overrides The service should be started after the ceph-osd systemd overrides have been added; otherwise, the overrides aren't considered. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1860739 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `59d0f01992`)	2020-10-15 13:52:35 +02:00
Dimitri Savineau	3f610811fe	ceph-osd: don't start the OSD services twice Using the + operation on two lists doesn't filter out the duplicate keys. Currently each OSD is started (via systemd) twice. Instead we could use the union filter. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `4eaa65c362`)	2020-10-14 09:57:52 -04:00
Guillaume Abrioux	d258bf4d2d	handler: refact check_socket_non_container the `stat --printf=%n` returns something like the following: ``` ok: [osd0] => changed=false cmd: \|- stat --printf=%n /var/run/ceph/ceph-osd*.asok delta: '0:00:00.009388' end: '2020-10-06 06:18:28.109500' failed_when_result: false rc: 0 start: '2020-10-06 06:18:28.100112' stderr: '' stderr_lines: <omitted> stdout: /var/run/ceph/ceph-osd.2.asok/var/run/ceph/ceph-osd.5.asok stdout_lines: <omitted> ``` it makes the next task "check if the ceph osd socket is in-use" grep like this: ``` ok: [osd0] => changed=false cmd: - grep - -q - /var/run/ceph/ceph-osd.2.asok/var/run/ceph/ceph-osd.5.asok - /proc/net/unix ``` which will obviously fail because this path never exists. This breaks the OSD handler. Let's use the `find` module instead. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `46d4d97da9`)	2020-10-09 13:55:28 +02:00
Benoît Knecht	c733af9d43	Fix Ansible check mode for site.yml.sample playbook Make sure the `site.yml.sample` playbook can be run in check mode by skipping tasks that try to read the output of commands that have been skipped. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch> (cherry picked from commit `54ba38e35e`)	2020-10-07 07:06:19 +02:00
Guillaume Abrioux	3fa84cf44a	tests: change cephfs pool size `all_daemons` scenario can't handle pools with `size: 3` because we have 1 osd node in root=HDD and two nodes in root=default. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `e5713ea5d5`)	2020-10-06 17:13:58 +02:00
Dimitri Savineau	2185a2201d	library: add radosgw_zone module This adds radosgw_zone ansible module for replacing the command module usage with the radosgw-admin zone command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `1281e8bcc8`)	2020-10-06 15:00:17 +02:00
Dimitri Savineau	5a371b3607	library: add radosgw_zonegroup module This adds radosgw_zonegroup ansible module for replacing the command module usage with the radosgw-admin zonegroup command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `65dbe0782e`)	2020-10-06 15:00:17 +02:00
Dimitri Savineau	1643210ca6	library: add radosgw_realm module This adds radosgw_realm ansible module for replacing the command module usage with the radosgw-admin realm command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `d171f4068d`)	2020-10-06 15:00:17 +02:00
Dimitri Savineau	858169a27d	library: add radosgw_user module This adds radosgw_user ansible module for replacing the command module usage with the radosgw-admin user command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `235c7e27cc`)	2020-10-06 15:00:17 +02:00
Dimitri Savineau	4fc2d788b4	library: add ceph_fs module This adds the ceph_fs ansible module for replacing the command module usage with the ceph fs command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `bd611a785b`)	2020-10-06 14:59:49 +02:00
Guillaume Abrioux	968dd3830a	ceph_key: support using different keyring Currently the `ceph_key` module doesn't support using a different keyring than `client.admin`. This commit adds the possibility to use a different keyring. Usage: ``` ceph_key: name: "client.rgw.myrgw-node.rgw123" cluster: "ceph" user: "client.bootstrap-rgw" user_key: /var/lib/ceph/bootstrap-rgw/ceph.keyring dest: "/var/lib/ceph/radosgw/ceph-rgw.myrgw-node.rgw123/keyring" caps: osd: 'allow rwx' mon: 'allow rw' import_key: False owner: "ceph" group: "ceph" mode: "0400" ``` Where: `user` corresponds to `-n (--name)` `user_key` corresponds to `-k (--keyring)` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `12e6260266`)	2020-10-06 10:31:34 +02:00
Guillaume Abrioux	2a3b563c7e	rgw: fix multi instances scaleout in baremetal When rgw and osd are collocated, the current workflow prevents from scaling out the radosgw_num_instances parameter when rerunning the playbook in baremetal deployments. When ceph-osd notifies handlers, it means rgw handlers are triggered too. The issue with this is that they are triggered before the role ceph-rgw is run. In the case a scaleout operation is expected on `radosgw_num_instances` it causes an issue because keyrings haven't been created yet so the new instances won't start. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1881313 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `a802fa2810`)	2020-10-06 10:31:34 +02:00
Guillaume Abrioux	5db74194b2	tests: reboot and test idempotency on collocation test reboot and idempotency on collocation scenario. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f83f798206`)	2020-10-06 10:31:34 +02:00
Dimitri Savineau	a5f19b7864	ceph_key: remove backward compatibility It's time to remove this backward compatibility. Users had enough time to convert their openstack_keys and key values. We now fail in ceph-validate if the caps key isn't set. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `c960362639`)	2020-10-06 10:09:16 +02:00
Guillaume Abrioux	a40ea7e712	infrastructure-playbooks: drop add-osd playbook This playbook isn't needed anymore, we can achieve this operation by running main playbook with `--limit` option. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `20718582da`)	2020-10-06 10:03:40 +02:00
Guillaume Abrioux	32be163360	ceph-osd: refact `docker_exec_start_osd` This commit drops nested jinja construction in this set_fact task. It also renames it to `container_exec_start_osd` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `ff95fa9c32`)	2020-10-06 09:54:50 +02:00
Guillaume Abrioux	88a4d39978	flake8: fix pep8 syntax on tests/functional/tests/ tests/conftest.py and tests present in tests/functional/tests/ were missed in the previous commit Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `8596f1d52c`)	2020-10-06 08:54:43 +02:00
Guillaume Abrioux	df54883fdf	flake8: fix all tests/library/.py files This commit modifies all .py files in ./tests/library/ so flake8 passes. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `e49a5241f0`) (cherry picked from commit fb98f436848189e26480697b23f45b28f51a6ccd)	2020-10-02 09:00:56 -04:00
Guillaume Abrioux	e5df63f34f	tests: refact flake8 workflow drop ricardochaves/python-lint action and use `run` steps instead. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f2d3432cad`) (cherry picked from commit 7378909c7b8d6a6285f14ea6c7c8987fae73939d)	2020-10-02 09:00:56 -04:00
Guillaume Abrioux	80879df44d	defaults: change defaults value this commit changes defaults value in default pool definitions. there's no need to define `pg_num`, `pgp_num`, `size` and `min_size`, `ceph_pool` module will use the current default if needed. This also drops the 3 following `set_fact` in `ceph-facts`: - osd_pool_default_pg_num, - osd_pool_default_pgp_num, - osd_pool_default_size_num Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `c101cb3931`)	2020-10-02 09:32:53 +02:00
Guillaume Abrioux	b01f1dc5c9	ceph_pool: update tests update test_ceph_pool.py due to recent refact Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `8f5db079ae`)	2020-10-02 09:32:53 +02:00
Guillaume Abrioux	28b475cd72	ceph_pool: improve pg_autoscaler support This commit modifies how the `pg_autoscaler` feature is handled by the ceph_pool module. 1/ If a pool has the pg_autoscaler feature enabled, we shouldn't try to update pg/pgp. 2/ Make it more readable Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `740df379b7`)	2020-10-02 09:32:53 +02:00
Guillaume Abrioux	9fe7ab6bc6	ceph_pool: pep8 Adopt pep8 syntax in ceph_pool module Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `787878f0c3`)	2020-10-02 09:32:53 +02:00
Guillaume Abrioux	cb44f655fc	ceph_pool: refact module remove complexity about current defaults in running cluster Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `29fc115f4a`)	2020-10-02 09:32:53 +02:00
Guillaume Abrioux	7fc78939b1	library: remove legacy file This file is a leftover and should have been removed when we dropped the validate module. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `8603cba9ab`)	2020-10-01 23:31:11 +02:00
Guillaume Abrioux	ed6ae6815d	tests: add github workflows Add github workflow. Especially for flake8 for now. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `1ee626a1b3`)	2020-10-01 09:26:13 -04:00
Wong Hoi Sing Edison	32a2f04cbc	library: flake8 ceph-ansible modules This commit ensures all ceph-ansible modules pass flake8 properly. Signed-off-by: Wong Hoi Sing Edison <hswong3i@gmail.com> Co-authored-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `268a39ca0e`)	2020-10-01 09:26:13 -04:00
Seena Fallah	10fc2d1d92	ceph-facts: add get default crush rule from running monitor In case of deploying new monitor node to an existing cluster, osd_pool_default_crush_rule should be taken from running monitor because ceph-osd role won't be run and the new monitor will have different osd_pool_default_crush_role from other monitors. Signed-off-by: Seena Fallah <seenafallah@gmail.com> (cherry picked from commit `ff9f4d138f`)	2020-09-29 12:15:09 -04:00
Guillaume Abrioux	72c73ac2bc	fs2bs: support `osd_auto_discovery` scenario This commit adds the `osd_auto_discovery` scenario support in the filestore-to-bluestore playbook. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1881523 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> Co-authored-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `8b1eeef18a`)	2020-09-29 16:28:43 +02:00
Ali Maredia	9f58d4a3d1	rgw multisite: check connection for realm endpoint This commit adds connection checks before realm pulls Curls are performed on the endpoint being pulled from the mons and the rgws Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1731158 Signed-off-by: Ali Maredia <amaredia@redhat.com> (cherry picked from commit `902575369c`)	2020-09-29 09:24:59 -04:00
Tyler Bishop	e3284b20ac	facts: support device aliases for (dedicated\|bluestore_wal)_devices Just like `devices`, this commit adds the support for linux device aliases for `dedicated_devices` and `bluestore_wal_devices`. Signed-off-by: Tyler Bishop <tbishop@liquidweb.com> (cherry picked from commit `ee4b8804ae`)	2020-09-29 09:24:17 -04:00
Dimitri Savineau	6294244c4f	ceph-handler: set handler on xxx_stat result In non containerized deployment we check if the service is running via the socket file presence. This is done via the xxx_socket_stat variable that checks the file socket in the /var/run/ceph/ directory. In some scenarios, we could have the socket file still present in that directory but not used by any process. That's why we have the xxx_stat variable which cleans those leftovers. The problem here is that we set the variable for the handlers status (like handler_mon_status) based on xxx_socket_stat instead of xxx_stat. That means we will trigger the handlers if there's an old socket file present on the system without any process associated. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1866834 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `733596582d`)	2020-09-29 09:23:45 -04:00
Dimitri Savineau	77c5115ad2	ceph-iscsi: create pool once from monitor `af9f6684` introduced a regression on the ceph iscsi pool creation because it was delegated to the first monitor node before that change. This patch restores the initial workflow. When the iscsi node doesn't have the admin keyring then the pool creation fails. This commit also ensures that the pool creation is only executed once when having multiple iscsi nodes. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `501b8e0fd3`)	2020-09-29 09:23:14 -04:00
Benoît Knecht	ab458a9592	library: Fix new-style modules check mode Running the `ceph_crush.py`, `ceph_key.py` or `ceph_volume.py` modules in check mode resulted in the following error: ``` New-style module did not handle its own exit ``` This was due to the fact that they simply returned a `dict` in that case, instead of calling `module.exit_json()`. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch> (cherry picked from commit `85dd405814`)	2020-09-28 20:41:51 -04:00
Dimitri Savineau	babc8a05fd	ceph-mds: remove unused block condition Since `af9f6684` the cephfs pool(s) creation doesn't use the fs_pools_created variable anymore because the ceph_pool module is idempotent. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `1db4dc807c`)	2020-09-28 20:39:25 -04:00
Seena Fallah	9b0f45431d	ceph-facts: check for mon socket in its own host delegate to its own host after checking mon socket to find out if mon socket is in-use or not. Signed-off-by: Seena Fallah <seenafallah@gmail.com> (cherry picked from commit `69f7e35382`)	2020-09-28 20:38:52 -04:00
Guillaume Abrioux	5538dd8b3b	Revert "ceph-rgw: remove ceph_pool state and default value" This reverts commit `ba3512a8fc`. (cherry picked from commit `bf7b044c9a`) Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-09-28 16:47:45 -04:00
Raffael	a8f6324b48	doc: Update methods.rst Based on the discussion in issue #5392, I have now added this paragraph to this page. Signed-off-by: Raffael Luthiger <r.luthiger@huanga.com> (cherry picked from commit `9af86d250b`)	2020-09-25 15:19:47 -04:00
Benoît Knecht	cfeb6b8403	README-MULTISITE: Fix syntax issues from markdownlint This commit makes the following changes: - Remove trailing whitespace; - Use consistent header levels; - Fix code blocks; - Remove hard tabs; - Fix ordered lists; - Fix bare URLs; - Use markdown list of sections. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch> (cherry picked from commit `2c244425ec`)	2020-09-25 14:38:58 -04:00
Dimitri Savineau	7e2e11320d	ceph-rgw: remove ceph_pool state and default value Since the state is now optional and default values are handled in the ceph_pool module itself. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `ba3512a8fc`)	2020-09-25 14:05:34 -04:00
Dimitri Savineau	1fa1b5b751	ceph_pool: add idempotency to absent state When using the "absent" state on a non existing pool then the ceph_pool module will fail and return a python traceback. Instead we should check if the pool exists or not and execute the pool deletion according to the result. The state changed is now set when the pool is actually deleted. This also disables add_file_common_args because we don't manipulate files with this module. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `047a3e2653`)	2020-09-25 13:55:20 -04:00
Dimitri Savineau	8d49d97582	ceph-config: remove ceph_release from ceph.conf.j2 We don't use ceph_release variable in the ceph.conf jinja template. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `62bd41f0d4`)	2020-09-25 13:37:36 -04:00
Guillaume Abrioux	9d04b8ca8b	ansible.cfg: remove cfg file in infrastructure-playbooks There's no need to have a copy of this file in infrastructure-playbooks directory. playbooks in that directory can be run from the root dir of ceph-ansible. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f906caa6da`)	2020-09-25 11:12:39 -04:00
Guillaume Abrioux	113eadad72	ansible.cfg: set force_valid_group_names param As of 2.10, group names containing a dash are invalid. However, setting this option makes it still possible to use a dash in group names and prevents this warning from showing up. It might need to be definitely addressed in a future ansible release. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1880476 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `6938ed1302`)	2020-09-25 11:12:39 -04:00
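
Note on commit 3f610811fe (ceph-osd: don't start the OSD services twice): the difference between concatenating two lists with `+` and taking their union can be sketched in plain Python. This is only an illustration under assumptions: the list contents and the `union` helper below are hypothetical, not taken from the ceph-ansible tasks, and the helper merely approximates the deduplicating behaviour the commit message attributes to the union filter.

```python
# Hypothetical OSD id lists; the real task variables are not shown in the log.
ids_from_first_source = ["0", "1", "2"]
ids_from_second_source = ["2", "3"]


def union(first, second):
    """Return the items of both lists, in order, without duplicates."""
    merged = []
    for item in first + second:
        if item not in merged:
            merged.append(item)
    return merged


# Plain concatenation keeps the duplicate id "2", so a loop over this list
# would act on that OSD twice.
concatenated = ids_from_first_source + ids_from_second_source

# The union keeps each id once.
deduplicated = union(ids_from_first_source, ids_from_second_source)

print(concatenated)
print(deduplicated)
```

With concatenation, any id present in both lists is processed twice; with the union, each id is processed once, which is the behaviour the commit aims for.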

5418 Commits (a6dac8c93d35a564a6710365112c4079a2efe1c8)