ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	5210bd16df	rgw: supports pg_autoscale_mode option for pool creation Support enabling/disabling the pg autoscaler for rgw pools. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `9f03a527ba`)	2021-04-01 15:32:40 +02:00
Guillaume Abrioux	66235feca3	dashboard: support prometheus storage.tsdb.retention.time parameter This commit adds the parameter `--storage.tsdb.retention.time` to the prometheus systemd unit template. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1928000 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `b60c61ce45`)	2021-04-01 14:52:37 +02:00
Guillaume Abrioux	efddbdf909	nfs: set idmap config for Ceph-NFS Currently NFS Ganesha (ceph-nfs) consumes /etc/idmapd.conf, which controls mapping of user/owner identities under NFSv4+. With containerized service deployment, this file is an immutable part of the container image and cannot be modified. Here we provide group variables, and a taskk and templates for the ceph-nfs role, to set the path of the idmap configuration file and to make the most common adjustment to the contents of that file -- namely to set the 'Domain'. We default the path to /etc/ganesha/idmap.conf so that we will not conflict with /etc/idmapd.conf on the controller nodes where ganesha runs. NFSv4 clients, as used for example by the Cinder NFS driver, consume /etc/idmapd.conf and may require different settings than what is wanted for NFS Ganesha. Additionally, because we already bind /etc/ganesha from the host into the ceph-nfs container, the file NFS Ganesha consumes will no longer be an immutable part of the container. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1925646 Signed-off-by: Tom Barron tpb@dyncloud.net Co-Authored-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `2db2208e40`)	2021-04-01 14:52:12 +02:00
Guillaume Abrioux	94d227149d	defaults: add a comment about `igw_network` This add a quick documentation in ceph-defaults about `igw_network` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `c5728bdc63`)	2021-03-29 11:23:49 +02:00
Guillaume Abrioux	000b203ebf	update: followup on `07029e1` Playbook must fail anyway, the `rescue` block has been introduced for unmasking the unit after the playbook has failed. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `e9ddb972fe`)	2021-03-29 10:54:44 +02:00
VasishtaShastry	4a10f6ee72	Peer addition won't be skipped if remote is not in peer rbd-mirroring is not configured as adding peer is getting skipped. Peer addition should not get skipped if its not added already Closes - https://bugzilla.redhat.com/show_bug.cgi?id=1942444 Signed-off-by: VasishtaShastry <vipin.indiasmg@gmail.com> (cherry picked from commit `006998e804`)	2021-03-26 21:24:59 +01:00
Guillaume Abrioux	2b19dfdaae	dashboard: support igw nodes with dedicated subnet This adds the possibility to deploy the dashboard with igw nodes using a dedicated subnet. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1926170 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `c33de174f1`)	2021-03-26 19:38:37 +01:00
Guillaume Abrioux	1fd0661d3e	rolling_update: unmask monitor service after a failure if for some reason the playbook fails after the service was stopped, disabled and masked and before it got restarted, enabled and unmasked, the playbook leaves the service masked and which can make users confused and forces them to unmask the unit manually. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1917680 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `07029e1bf1`)	2021-03-26 15:20:35 +01:00
Ali Maredia	fcd9544048	docs: rgw multisite docs with new rgw_instances config Docs reflect that each instance of `rgw_instances` can now take rgw_zonemaster, rgw_zonesecondary, rgw_zonegroupmaster, rgw_multisite_proto. Signed-off-by: Ali Maredia <amaredia@redhat.com> (cherry picked from commit `a59bc2da3b`)	2021-03-26 07:42:35 +01:00
Guillaume Abrioux	8aa0dc2868	library: drop ceph_facts This is never called in the playbook and seems unmaintained. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `b01f16e835`)	2021-03-26 00:07:18 +01:00
Ken Dreyer	37088d8c4f	README-MULTISITE: fix typos This commit fixes some typos in MULTISITE documentation. Signed-off-by: Ken Dreyer <ktdreyer@redhat.com> (cherry picked from commit `63a246db41`)	2021-03-26 00:06:21 +01:00
Guillaume Abrioux	8a4fd99db7	convert some missed `ansible_`` calls to `ansible_facts['']` This converts some missed calls to `ansible_*` that were missed in initial PR #6312 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `0163ecc924`)	2021-03-26 00:04:49 +01:00
Alex Schultz	181924db7b	Disable facts by default in ansible.cfg As a continuation of `a7f2fa73e6`, this change switches fact injection to off by default in the provided ansible.cfg. Signed-off-by: Alex Schultz <aschultz@redhat.com> (cherry picked from commit `db031a4993`)	2021-03-26 00:04:49 +01:00
Alex Schultz	56aac327dd	Use ansible_facts It has come to our attention that using ansible_* vars that are populated with INJECT_FACTS_AS_VARS=True is not very performant. In order to be able to support setting that to off, we need to update the references to use ansible_facts[<thing>] instead of ansible_<thing>. Related: ansible#73654 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1935406 Signed-off-by: Alex Schultz <aschultz@redhat.com> (cherry picked from commit `a7f2fa73e6`)	2021-03-26 00:04:49 +01:00
Guillaume Abrioux	ab857d8b54	tests: use master build for iscsigws pacific builds for iscsi pkgs aren't available, as a workaround we can use builds from master. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-03-24 21:36:24 +01:00
Guillaume Abrioux	723efc8576	tests: switch to quay.ceph.io for dashboard images for some reason, `quay.io/app-sre/grafana` no longer exist. as a workaround, all dashboard related images have been mirrored on quay.ceph.io. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `c90b0985e5`)	2021-03-24 21:36:24 +01:00
Guillaume Abrioux	65d1cfd634	iscsi: fetch right repo from shaman due to recent changes in shaman, we must fetch the right repo by filtering on the desired architecture. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `5801171b37`)	2021-03-24 21:36:24 +01:00
Guillaume Abrioux	439cb79e3e	tests: fix `test_rgw_is_up` test The data structure seems to have been modified in ceph@master (quincy). This commit update the test accordingly. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `b8080bac41`)	2021-03-24 21:36:24 +01:00
Guillaume Abrioux	fb75fce4fa	tests: fix `test_nfs_is_up` test the data structure seems to have been modified in ceph@master (quincy). This commit update the test accordingly. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `7e1db0b599`)	2021-03-24 21:36:24 +01:00
Guillaume Abrioux	43c7c20fa9	ceph_volume: fix bug in `is_lv()` This function makes the `ceph_volume` module be not idempotent in containerized context because it tries to run a container and bindmount directories that no longer exist. In that case, the `lvs` command being executed returns something different than `0` so we can't call `json.loads(out)['report'][0]['lv']` since it might throw an python error. The idea is to return `True` only if `rc` is equal to `0` and `len(result)` is greater than `0`, which means the command matched an LV. Fixes: #6284 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `ed79bc7a4e`)	2021-03-24 21:36:24 +01:00
Guillaume Abrioux	a4d4f53080	fix 'command -v' tasks `command -v` is a bash script which needs a shell to run. Fixes: #6325 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `14c472707c`)	2021-03-22 13:52:39 +01:00
Guillaume Abrioux	8d25b4305e	adopt: convert legacy grafana-server groupname early This is a follow up on PR #6332 cephadm-adopt.yml playbook is affected by the same bug Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1938658 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `af95595c82`)	2021-03-18 08:56:44 +01:00
Guillaume Abrioux	5893a17886	tests: remove 1 client VM in external_clients job We only use 2 client in this scenario, there's no need to fire up a third VM. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `fb1a5f071a`)	2021-03-18 08:54:33 +01:00
Guillaume Abrioux	05ab3a7d50	validate: update `ceph_repository_community` check this updates the `ceph_repository_community` check in `ceph-validate` with the right ceph release expected. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `47b9b75ace`)	2021-03-18 08:54:33 +01:00
Guillaume Abrioux	01939808b0	nfs: bump nfs-ganesha version This commit updates the default version of nfs-ganesha to V3.5 which is the latest version available upstream. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `c78388e580`)	2021-03-18 08:54:33 +01:00
Guillaume Abrioux	c296824ae0	cephadm_adopt: fetch and write ceph minimal config This commit makes the playbook fetch the minimal current ceph configuration and write it later on monitoring nodes so `cephadm` can proceed with the adoption. When a monitoring stack was deployed on a dedicated node, it means no `ceph.conf` file was written, `cephadm` requires a `ceph.conf` in order to adopt the daemon present on the node. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1939887 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `b445df0479`)	2021-03-18 08:51:59 +01:00
Guillaume Abrioux	688e432c32	facts: fix nfs/external cluster scenario These tasks shouldn't be run when at least 1 monitor isn't present in the inventory. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1937997 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `ccd1cbb732`)	2021-03-18 06:40:33 +01:00
Guillaume Abrioux	d65c7b4035	config: reset num_osds When collocating OSDs with other daemon, `num_osds` is incorrectly calculated because `ceph-config` is called multiple times. Indeed, the following code: ``` num_osds: "{{ lvm_list.stdout \| default('{}') \| from_json \| length \| int + num_osds \| default(0) \| int }}" ``` makes `num_osds` be incremented each time `ceph-config` is called. We have to reset it in order to get the correct number of expected OSDs. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `31a0f2653d`)	2021-03-17 17:35:19 +01:00
Guillaume Abrioux	732e5b10b8	update: convert legacy grafana-server groupname early If the legacy name `grafana-server` is still being used when upgrading from Nautilus to Pacific, the task that sets the fact `rolling_update` to `true` doesn't run on the node(s) included in that group. Indeed the play where we set this fact (`rolling_update`) only runs on the group `monitoring_group_name \| default('monitoring')`. As a workaround, we can run earlier the task which converts the `grafana-server` group name to `monitoring`. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1935554 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `6ccc8b4722`)	2021-03-16 14:33:40 +01:00
Matthew Vernon	3c8191194d	docs: Document the prepare_osd tag There are times where being able to skip OSD creation is useful to the admin (see #1777 for example), and skipping the prepare_osd tag is a way to achieve this. Document this fact. Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk> (cherry picked from commit `e66b7b7449`)	2021-03-12 09:19:55 +01:00
Matthew Vernon	6deb88d8fb	ceph-osd: add prepare_osd tag to lvm-batch scenario Sometimes it's useful to be able to skip the OSD creation step when running ceph-ansible (cf #1777). The lvm scenario has a prepare_osd tag on the relevant play. This commit adds the same tag to the lvm-batch scenario. Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk> (cherry picked from commit `88d119e95a`)	2021-03-12 09:19:55 +01:00
Matthew Vernon	6a23be19f4	Docs: fix some typos While working on the previous PR, I found a couple of typos in the docs. This fixes those. Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk> (cherry picked from commit `8b1474ab75`)	2021-03-11 22:04:53 +01:00
Matthew Vernon	1a67f59789	Fix typo and broken link for documenting RGW frontends http://docs.ceph.com/docs/nautilus/radosgw/frontends/ 404s so replace it with a working "pacific" docs link, and correct the spelling of "additional" while I'm at it. Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk> (cherry picked from commit `847611048e`)	2021-03-03 14:17:31 +01:00
Guillaume Abrioux	6832c8d7a5	tests: increase nb of rerun in pytest In order to avoid false positive in the CI that I've been unable to reproduce. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f7fd1c2298`)	2021-03-03 14:12:46 +01:00
Guillaume Abrioux	f42ed8e1e0	dashboard: add missing parameter in `ceph_cmd` the `ceph_cmd` fact is missing the `--net=host` parameter. Some tasks consuming this fact can fail like following: ``` Error: error configuring network namespace for container b8ec913db1fb694ae683faf202680de7a59c714a004e533aba87e8503d29261f: Missing CNI default network ``` Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1931365 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f143b1a647`)	2021-03-03 14:12:46 +01:00
Florian Haas	95949ec787	requirements.txt: Move the six dependency into the general requirements config_template.py depends on six, which isn't listed in the default requirements.txt. This previously frequently wasn't a problem, because six used to be a standard package being installed into a venv, and lots of other projects depended on it. It also does get installed for unit and integration tests via tests/requirements.txt, so any broken dependency on six wouldn't be detected by tox runs. However, as other projects and distributions have phased out Python 2.7 support the dependency on six becomes less common. Thus, as long as ceph-ansible does require it for config_template.py, add it to the base requirements. Signed-off-by: Florian Haas <florian@citynetwork.eu> (cherry picked from commit `d49ea9818b`)	2021-03-01 15:16:55 +01:00
Guillaume Abrioux	accdcf78e6	defaults: update rhcs dashboard images versions The current dashboard images deployed have a bad health index. Updating to a newer version fixes this issue. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1925350 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `a16ae693d8`)	2021-02-18 18:21:53 +01:00
Guillaume Abrioux	bb9bba685f	library: do not always add --yes in batch mode When asking `ceph-volume` to report only in `lvm batch` context, there's a bug described in bz1896803 [1] when `--yes` is passed (which by the way isn't necessary with `--report`). This commit ensure `--yes` isn't passed to `ceph-volume` when `--report` is used. [1] https://bugzilla.redhat.com/show_bug.cgi?id=1896803 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1896803 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `fe6d6ba622`)	2021-02-14 06:29:16 +01:00
Guillaume Abrioux	3326b6d54f	purge: rm service-cid files This commit makes sure purge playbooks remove those file if for any reason they have been left. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1920900 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `b9dd253a4f`)	2021-02-12 18:33:19 +01:00
Guillaume Abrioux	5803619a5d	switch2container: do not serialize the ceph-crash migration There's no need to slow down the playbook execution time by migrating all the `ceph-crash` instances in a serial way. Let's remove the `serial: 1` so the migration is achieved in a parallel way. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `980a5a7df4`)	2021-02-12 14:06:15 +01:00
Guillaume Abrioux	2feefdc861	tests: increase `mon_max_pg_per_osd` we aren't deploying enough OSD daemon, so it fails like following: ``` stderr: 'Error ERANGE: pool id 10 pg_num 256 size 2 would mean 1536 total pgs, which exceeds max 1500 (mon_max_pg_per_osd 250 * num_in_osds 6)' ``` Let's increase the value of `mon_max_pg_per_osd` in order to get around this issue in the CI. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `682116023d`)	2021-02-12 09:15:24 +01:00
Guillaume Abrioux	980a0dd00e	rolling_update: update specific pacific task update the 'require-osd-release' task. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-02-12 09:15:24 +01:00
Guillaume Abrioux	7dd4a8a059	tests: use shaman to test against ceph pacific Given there's no pacific packages available at https://download.ceph.com, let's use shaman in order to test against Ceph Pacific Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-02-12 09:15:24 +01:00
Guillaume Abrioux	9102d6c090	doc: add a note about "latest" tags See the change for details. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `4e95180c80`)	2021-02-11 16:41:50 +01:00
Dimitri Savineau	950a6ae406	cephadm-adopt: remove prometheus workaround This was fixed by [1][2] [1] https://tracker.ceph.com/issues/45120 [2] https://github.com/ceph/ceph/commit/252d4b30 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2021-02-10 13:51:41 +01:00
Dimitri Savineau	d42d584085	doc: update containerized deployment This adds more documentation to the configuration and usage of containerizerd deployment. Closes: #6198 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2021-02-10 13:50:53 +01:00
Guillaume Abrioux	7e5071856c	doc: update the documentation - mention `stable-6.0` requirements. - update some patterns. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-02-10 13:50:10 +01:00
Dimitri Savineau	48a456dc8c	rolling_update: enforce ceph-container-engine When running the rolling_update.yml playbook and adding the dashboard component in the same time then the requirement (like container packages) aren't installed. This could lead to a failure in case of using authentication on the container registry because the playbook will try to login on the registry but podman/docker aren't yet installed. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1903504 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1918650 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2021-02-10 08:17:11 +01:00
Dimitri Savineau	e4dd0067c6	ceph-common: enable rhcs tools repo for monitoring The monitoring node running grafana needs the rhcs tools repostory enabled in non containerized deployment to be able to install the ceph-grafana-dashboards rpm package. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1918650 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2021-02-10 08:17:11 +01:00
Guillaume Abrioux	2f1d287b1c	tests: pin ansible-lint version This commit pins the ansible-lint version to 4.3.7 as ceph-ansible isn't compatible with recent changes in 5.0.0 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-02-10 07:48:24 +01:00

1 2 3 4 5 ...

5747 Commits (2377da8f9b7cdb67c992a1536bd54ad2b8b30ccc) All Branches Search

5747 Commits (2377da8f9b7cdb67c992a1536bd54ad2b8b30ccc)

All Branches