ceph-ansible

Commit Graph

Author	SHA1	Message	Date
VasishtaShastry	af6abb7125	Peer addition won't be skipped if remote is not in peer rbd-mirroring is not configured as adding peer is getting skipped. Peer addition should not get skipped if it's not added already Closes - https://bugzilla.redhat.com/show_bug.cgi?id=1942444 Signed-off-by: VasishtaShastry <vipin.indiasmg@gmail.com> (cherry picked from commit `006998e804`)	2021-03-26 19:14:49 +01:00
Guillaume Abrioux	f42ee9f940	cephadm_adopt: fetch and write ceph minimal config This commit makes the playbook fetch the minimal current ceph configuration and write it later on monitoring nodes so `cephadm` can proceed with the adoption. When a monitoring stack was deployed on a dedicated node, it means no `ceph.conf` file was written, `cephadm` requires a `ceph.conf` in order to adopt the daemon present on the node. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1939887 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `b445df0479`)	2021-03-26 15:20:50 +01:00
Ali Maredia	80bf7030f7	docs: rgw multisite docs with new rgw_instances config Docs reflect that each instance of `rgw_instances` can now take rgw_zonemaster, rgw_zonesecondary, rgw_zonegroupmaster, rgw_multisite_proto. Signed-off-by: Ali Maredia <amaredia@redhat.com> (cherry picked from commit `a59bc2da3b`)	2021-03-26 07:42:50 +01:00
Guillaume Abrioux	dac1a284f6	library: drop ceph_facts This is never called in the playbook and seems unmaintained. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `b01f16e835`)	2021-03-26 00:07:29 +01:00
Ken Dreyer	173b5599f9	README-MULTISITE: fix typos This commit fixes some typos in MULTISITE documentation. Signed-off-by: Ken Dreyer <ktdreyer@redhat.com> (cherry picked from commit `63a246db41`)	2021-03-26 00:06:39 +01:00
Guillaume Abrioux	50b95baa32	convert some missed `ansible_*` calls to `ansible_facts['']` This converts some `ansible_*` calls that were missed in initial PR #6312 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `0163ecc924`)	2021-03-26 00:05:33 +01:00
Guillaume Abrioux	d6fcd78e72	clients: build filtered clients group early when the group `_filtered_clients` is built, the order can change from the original `clients` group which can cause issues since we run `ceph-container-engine` on the first client only. It means later in the playbook we can make call to the container CLI on a node where the container engine wasn't installed. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `a112572734`)	2021-03-26 00:05:33 +01:00
Alex Schultz	7c6783acb1	Disable facts by default in ansible.cfg As a continuation of `a7f2fa73e6`, this change switches fact injection to off by default in the provided ansible.cfg. Signed-off-by: Alex Schultz <aschultz@redhat.com> (cherry picked from commit `db031a4993`) (cherry picked from commit `5fa4ff5ed3`)	2021-03-26 00:05:33 +01:00
Alex Schultz	815ea7765f	Use ansible_facts It has come to our attention that using ansible_* vars that are populated with INJECT_FACTS_AS_VARS=True is not very performant. In order to be able to support setting that to off, we need to update the references to use ansible_facts[<thing>] instead of ansible_<thing>. Related: ansible#73654 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1935406 Signed-off-by: Alex Schultz <aschultz@redhat.com> (cherry picked from commit `a7f2fa73e6`)	2021-03-26 00:05:33 +01:00
Guillaume Abrioux	a72ce4c04b	tests: switch to quay.ceph.io for dashboard images For some reason, `quay.io/app-sre/grafana` no longer exists. As a workaround, all dashboard related images have been mirrored on quay.ceph.io. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `c90b0985e5`)	2021-03-24 09:20:24 +01:00
Guillaume Abrioux	1fe44154de	iscsi: fetch right repo from shaman due to recent changes in shaman, we must fetch the right repo by filtering on the desired architecture. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `5801171b37`)	2021-03-24 09:20:24 +01:00
Guillaume Abrioux	48c2db97dc	tests: fix `test_rgw_is_up` test The data structure seems to have been modified in ceph@master (quincy). This commit updates the test accordingly. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `b8080bac41`)	2021-03-24 09:20:24 +01:00
Guillaume Abrioux	84a3f807e1	tests: fix `test_nfs_is_up` test The data structure seems to have been modified in ceph@master (quincy). This commit updates the test accordingly. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `7e1db0b599`)	2021-03-24 09:20:24 +01:00
Guillaume Abrioux	3c5f5f1503	ceph_volume: fix bug in `is_lv()` This function makes the `ceph_volume` module non-idempotent in a containerized context because it tries to run a container and bind-mount directories that no longer exist. In that case, the `lvs` command being executed returns something other than `0`, so we can't call `json.loads(out)['report'][0]['lv']` since it might throw a Python error. The idea is to return `True` only if `rc` is equal to `0` and `len(result)` is greater than `0`, which means the command matched an LV. Fixes: #6284 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `ed79bc7a4e`)	2021-03-24 09:20:24 +01:00
Guillaume Abrioux	be7cfb9ccc	fix 'command -v' tasks `command -v` is a shell builtin, which needs a shell to run. Fixes: #6325 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `14c472707c`)	2021-03-22 13:52:59 +01:00
Guillaume Abrioux	3fd6457c1d	cephadm_adopt: fetch and write ceph minimal config This commit makes the playbook fetch the minimal current ceph configuration and write it later on monitoring nodes so `cephadm` can proceed with the adoption. When a monitoring stack was deployed on a dedicated node, it means no `ceph.conf` file was written, `cephadm` requires a `ceph.conf` in order to adopt the daemon present on the node. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1939887 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `b445df0479`)	2021-03-18 15:33:02 +01:00
Guillaume Abrioux	dbd53a2ef2	tests: remove sleep commands from tox ini files Since we use the rerun plugin in tox, we shouldn't need to add these `sleep` commands. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `e835c77a0e`)	2021-03-18 09:25:20 +01:00
Guillaume Abrioux	802705ff9b	facts: fix nfs/external cluster scenario These tasks shouldn't be run when at least 1 monitor isn't present in the inventory. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1937997 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `ccd1cbb732`)	2021-03-18 06:40:44 +01:00
Guillaume Abrioux	8e30a3c9f8	config: reset num_osds When collocating OSDs with other daemons, `num_osds` is incorrectly calculated because `ceph-config` is called multiple times. Indeed, the following code: ``` num_osds: "{{ lvm_list.stdout | default('{}') | from_json | length | int + num_osds | default(0) | int }}" ``` makes `num_osds` be incremented each time `ceph-config` is called. We have to reset it in order to get the correct number of expected OSDs. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `31a0f2653d`)	2021-03-17 17:35:37 +01:00
Matthew Vernon	6ace9bd9e5	docs: Document the prepare_osd tag There are times where being able to skip OSD creation is useful to the admin (see #1777 for example), and skipping the prepare_osd tag is a way to achieve this. Document this fact. Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk> (cherry picked from commit `e66b7b7449`)	2021-03-12 15:44:32 +01:00
Matthew Vernon	d449d15d4d	ceph-osd: add prepare_osd tag to lvm-batch scenario Sometimes it's useful to be able to skip the OSD creation step when running ceph-ansible (cf #1777). The lvm scenario has a prepare_osd tag on the relevant play. This commit adds the same tag to the lvm-batch scenario. Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk> (cherry picked from commit `88d119e95a`)	2021-03-12 15:44:32 +01:00
Guillaume Abrioux	5816a6ebe8	tests: increase nb of rerun in pytest In order to avoid false positive in the CI that I've been unable to reproduce. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f7fd1c2298`)	2021-03-12 09:38:46 +01:00
Guillaume Abrioux	8b69451652	dashboard: add missing parameter in `ceph_cmd` the `ceph_cmd` fact is missing the `--net=host` parameter. Some tasks consuming this fact can fail like following: ``` Error: error configuring network namespace for container b8ec913db1fb694ae683faf202680de7a59c714a004e533aba87e8503d29261f: Missing CNI default network ``` Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1931365 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f143b1a647`)	2021-03-12 09:38:46 +01:00
Matthew Vernon	54cf6b4c77	Docs: fix some typos While working on the previous PR, I found a couple of typos in the docs. This fixes those. Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk> (cherry picked from commit `8b1474ab75`)	2021-03-12 09:35:54 +01:00
Guillaume Abrioux	32ad0f6fe7	common: ensure shaman returns right repo Due to recent changes in shaman, there's a chance it returns the wrong repository from architecture point of view. We can query shaman and ask for the correct architecture to get around this. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `39649f0ce8`)	2021-03-10 17:17:33 -05:00
Dimitri Savineau	09d6706697	debian/uca: remove the handler notification The "update apt cache" in the ceph-handler role was never called and the handler trigger after adding the uca repository doesn't exist at all. Instead of using a handler for that we can just set the update_cache parameter to true like the other apt_repository tasks. Resolve merge conflict from cherry-picking this commit. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2021-03-03 14:50:45 +01:00
Matthew Vernon	42b571b11f	Fix typo and broken link for documenting RGW frontends http://docs.ceph.com/docs/nautilus/radosgw/frontends/ 404s so replace it with a working "latest" docs link, and correct the spelling of "additional" while I'm at it. Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk> (cherry picked from commit `847611048e`)	2021-03-03 14:19:52 +01:00
Florian Haas	21e2675adb	requirements.txt: Move the six dependency into the general requirements config_template.py depends on six, which isn't listed in the default requirements.txt. This previously frequently wasn't a problem, because six used to be a standard package being installed into a venv, and lots of other projects depended on it. It also does get installed for unit and integration tests via tests/requirements.txt, so any broken dependency on six wouldn't be detected by tox runs. However, as other projects and distributions have phased out Python 2.7 support the dependency on six becomes less common. Thus, as long as ceph-ansible does require it for config_template.py, add it to the base requirements. Signed-off-by: Florian Haas <florian@citynetwork.eu> (cherry picked from commit `d49ea9818b`)	2021-03-01 15:17:10 +01:00
Guillaume Abrioux	6c61240637	defaults: update rhcs dashboard images versions The current dashboard images deployed have a bad health index. Updating to a newer version fixes this issue. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1925350 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `a16ae693d8`)	2021-02-18 18:22:15 +01:00
Guillaume Abrioux	9359ee913f	library: do not always add --yes in batch mode When asking `ceph-volume` to report only in `lvm batch` context, there's a bug described in bz1896803 [1] when `--yes` is passed (which by the way isn't necessary with `--report`). This commit ensures `--yes` isn't passed to `ceph-volume` when `--report` is used. [1] https://bugzilla.redhat.com/show_bug.cgi?id=1896803 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1896803 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `fe6d6ba622`)	2021-02-14 06:29:30 +01:00
Guillaume Abrioux	2bf1b6b64d	purge: rm service-cid files This commit makes sure purge playbooks remove those files if for any reason they have been left. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1920900 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `b9dd253a4f`)	2021-02-12 18:33:37 +01:00
Guillaume Abrioux	5bedb04585	switch2container: do not serialize the ceph-crash migration There's no need to slow down the playbook execution time by migrating all the `ceph-crash` instances in a serial way. Let's remove the `serial: 1` so the migration is achieved in a parallel way. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `980a5a7df4`)	2021-02-12 14:06:30 +01:00
Guillaume Abrioux	149d76da9e	tests: use V3.3-stable branch for nfs-ganesha This is the latest stable release available for octopus. Let's use it instead of using master builds. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-02-12 09:51:27 +01:00
Guillaume Abrioux	3e58f1ea6e	doc: add a note about "latest" tags See the change for details. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `4e95180c80`)	2021-02-11 16:48:05 +01:00
Dimitri Savineau	ffedb78aa7	doc: update containerized deployment This adds more documentation to the configuration and usage of containerized deployment. Closes: #6198 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `d42d584085`)	2021-02-11 16:48:05 +01:00
Dimitri Savineau	a1ca7f3daa	rolling_update: enforce ceph-container-engine When running the rolling_update.yml playbook and adding the dashboard component at the same time, the requirements (like container packages) aren't installed. This could lead to a failure in case of using authentication on the container registry because the playbook will try to login on the registry but podman/docker aren't yet installed. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1903504 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1918650 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `48a456dc8c`)	2021-02-10 09:58:03 +01:00
Dimitri Savineau	5572c907ee	ceph-common: enable rhcs tools repo for monitoring The monitoring node running grafana needs the rhcs tools repository enabled in non containerized deployment to be able to install the ceph-grafana-dashboards rpm package. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1918650 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `e4dd0067c6`)	2021-02-10 09:58:03 +01:00
Guillaume Abrioux	32a84c9080	tests: pin ansible-lint version This commit pins the ansible-lint version to 4.3.7 as ceph-ansible isn't compatible with recent changes in 5.0.0 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `2f1d287b1c`)	2021-02-10 08:21:41 +01:00
Guillaume Abrioux	0e9b93db69	tests: set `mon_max_pg_per_osd` in rgw_multisite Otherwise, the job fails when it tries to create a bucket with `s3cmd mb` command because we have too many PGs per OSD. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `54bae480d2`)	2021-02-10 08:21:41 +01:00
Guillaume Abrioux	dd204d9e2f	rgw: fix a typo in multisite if `rgw_zonegroupmaster` is not defined at the rgw instance level in `rgw_instances` it will fallback to a wrong variable (`rgw_zonemaster`). Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1925247 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `931b87e830`)	2021-02-10 08:21:41 +01:00
Dimitri Savineau	2e9bf2f0fb	rolling_update: exclude clients from node-exporter Since `b105549` we don't install node-exporter on client nodes so we should also exclude the client node from the node-exporter upgrade. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `94af3c87d1`)	2021-02-10 08:14:01 +01:00
Dimitri Savineau	cb4e1a77a3	vagrant: remove centos/8 workaround The CentOS 8 vagrant box has finally been updated [1] with a recent version (the latest one 2011 which means CentOS 8.3). We don't need to download the vagrant libvirt box with a direct url anymore from the CentOS infrastructure. [1] https://app.vagrantup.com/centos/boxes/8 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `ed094ea07a`)	2021-02-09 14:40:44 +01:00
Guillaume Abrioux	a1cefe886b	purge: zap and destroy db and wal devices for lvm batch Those devices (db/wal) are never zapped in lvm batch deployment. Iterating over `dedicated_devices` and `bluestore_wal_devices` fixes this issue. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1922926 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `984191ac7f`)	2021-02-01 14:09:19 -05:00
Dimitri Savineau	fa9177d2ce	ceph-mon: add ExecStartPre docker stop to systemd We already do that in the other systemd templates (mgr, mds, etc..) and it would prevent having to add a workaround in other orchestration tools. This change is for containerized deployment only. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1882724 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `3749d297c7`)	2021-01-29 11:41:16 -05:00
Guillaume Abrioux	78d9d9df11	rgw: avoid useless call to ceph-rgw since `ceph-rgw` may be called from `ceph-handler` in some contexts we should avoid rerunning it unnecessarily. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `8617081664`)	2021-01-28 16:37:32 -05:00
Guillaume Abrioux	df98746378	rgw: multisite refact Add the possibility to deploy rgw multisite configuration with a mix of secondary and primary zones on a same rgw node. Before that, on a same node, all instances were either primary zones OR secondary. Now you can define a rgw instance like following: ``` rgw_instances: - instance_name: 'rgw0' rgw_zonemaster: false rgw_zonesecondary: true rgw_zonegroupmaster: false rgw_realm: 'france' rgw_zonegroup: 'zonegroup-france' rgw_zone: paris-00 radosgw_address: "{{ _radosgw_address }}" radosgw_frontend_port: 8080 rgw_zone_user: jacques.chirac rgw_zone_user_display_name: "Jacques Chirac" system_access_key: P9Eb6S8XNyo4dtZZUUMy system_secret_key: qqHCUtfdNnpHq3PZRHW5un9l0bEBM812Uhow0XfB endpoint: http://192.168.101.12:8080 ``` Basically it's now possible to define `rgw_zonemaster`, `rgw_zonesecondary` and `rgw_zonegroupmaster` at the instance level instead of the whole node level. Also, this commit adds an option `deploy_secondary_zones` (default True) which can be set to `False` in order to explicitly ask the playbook to not deploy secondary zones in case where the corresponding endpoints are not deployed yet. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1915478 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `71a5e666e3`)	2021-01-28 16:37:32 -05:00
Guillaume Abrioux	32ac435962	library: fix bug in radosgw_zone.py If for some reason `get_zonegroup()` returns a failure, we must handle and make the module exit properly instead of failing with the following python trace: ``` Traceback (most recent call last): File "./AnsiballZ_radosgw_zone.py", line 247, in <module> _ansiballz_main() File "./AnsiballZ_radosgw_zone.py", line 234, in _ansiballz_main exitcode = debug(sys.argv[1], zipped_mod, ANSIBALLZ_PARAMS) File "./AnsiballZ_radosgw_zone.py", line 202, in debug runpy.run_module(mod_name='ansible.modules.radosgw_zone', init_globals=None, run_name='__main__', alter_sys=True) File "/usr/lib64/python3.6/runpy.py", line 205, in run_module return _run_module_code(code, init_globals, run_name, mod_spec) File "/usr/lib64/python3.6/runpy.py", line 96, in _run_module_code mod_name, mod_spec, pkg_name, script_name) File "/usr/lib64/python3.6/runpy.py", line 85, in _run_code exec(code, run_globals) File "/home/vagrant/.ansible/tmp/ansible-tmp-1610728441.41-685133-218973990589597/debug_dir/ansible/modules/radosgw_zone.py", line 467, in <module> main() File "/home/vagrant/.ansible/tmp/ansible-tmp-1610728441.41-685133-218973990589597/debug_dir/ansible/modules/radosgw_zone.py", line 463, in main run_module() File "/home/vagrant/.ansible/tmp/ansible-tmp-1610728441.41-685133-218973990589597/debug_dir/ansible/modules/radosgw_zone.py", line 425, in run_module zonegroup = json.loads(_out) File "/usr/lib64/python3.6/json/__init__.py", line 354, in loads return _default_decoder.decode(s) File "/usr/lib64/python3.6/json/decoder.py", line 339, in decode obj, end = self.raw_decode(s, idx=_w(s, 0).end()) File "/usr/lib64/python3.6/json/decoder.py", line 357, in raw_decode raise JSONDecodeError("Expecting value", s, err.value) from None json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0) ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `fedb36688d`)	2021-01-28 16:37:32 -05:00
Dimitri Savineau	c7d204ce37	ceph-defaults: change default ceph container tag The "latest" ceph container tag references the latest stable release (octopus at the moment). "latest" is an alias on "latest-octopus". On the devel branch we should use "latest-master" tag instead. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `7d56771975`)	2021-01-22 17:39:39 -05:00
Dimitri Savineau	c4f11809c4	module_utils: don't add newline to the data When executing a command via the run_command method and passing some data with stdin then the default behavior is to append a newline. This breaks the value of password used by our modules. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `6616908577`)	2021-01-18 14:46:53 -05:00
Dimitri Savineau	fdda54eeb4	dashboard: manage password backward compatibility The ceph dashboard changed the way passwords are provided via the CLI. This breaks the backward compatibility when using a recent ceph-ansible version with a ceph release without that feature. This patch adds tasks for the legacy workflow (ceph release without that feature) in both the ceph-dashboard role and the ceph_dashboard_user module. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2021-01-18 14:46:53 -05:00
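
The check described in commit 3c5f5f1503 (`ceph_volume: fix bug in is_lv()`) can be illustrated with a minimal Python sketch. It is not the module's actual code: the `is_lv(rc, out)` signature is a hypothetical stand-in for the exit code and stdout of the `lvs` call, and only the `json.loads(out)['report'][0]['lv']` expression and the `rc`/`len(result)` condition come from the commit message.

```python
import json


def is_lv(rc, out):
    """Return True only when `lvs` succeeded and matched at least one LV.

    Hypothetical signature for illustration: `rc` is the exit code and
    `out` the stdout of the `lvs` command mentioned in the commit message.
    """
    if rc != 0:
        # The command failed, so `out` may not be valid JSON:
        # skip parsing instead of risking a Python error.
        return False
    result = json.loads(out)['report'][0]['lv']
    # An empty list means the command ran but matched no LV.
    return len(result) > 0
```

Checking `rc` before parsing is what keeps a failed `lvs` run from surfacing as a JSON decoding traceback.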


5742 Commits (5d1a7d60b3ed78e084687f4b8d2caf8b63db5574)
