ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	8fada83589	tests: set `mon_max_pg_per_osd` in rgw_multisite Otherwise, the job fails when it tries to create a bucket with `s3cmd mb` command because we have too many PGs per OSD. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `54bae480d2`)	2021-02-10 08:32:24 +01:00
Guillaume Abrioux	14267fe0c4	rgw: multisite refact Add the possibility to deploy rgw multisite configuration with a mix of secondary and primary zones on a same rgw node. Before that, on a same node, all instances were either primary zones OR secondary. Now you can define a rgw instance like following: ``` rgw_instances: - instance_name: 'rgw0' rgw_zonemaster: false rgw_zonesecondary: true rgw_zonegroupmaster: false rgw_realm: 'france' rgw_zonegroup: 'zonegroup-france' rgw_zone: paris-00 radosgw_address: "{{ _radosgw_address }}" radosgw_frontend_port: 8080 rgw_zone_user: jacques.chirac rgw_zone_user_display_name: "Jacques Chirac" system_access_key: P9Eb6S8XNyo4dtZZUUMy system_secret_key: qqHCUtfdNnpHq3PZRHW5un9l0bEBM812Uhow0XfB endpoint: http://192.168.101.12:8080 ``` Basically it's now possible to define `rgw_zonemaster`, `rgw_zonesecondary` and `rgw_zonegroupmaster` at the intsance level instead of the whole node level. Also, this commit adds an option `deploy_secondary_zones` (default True) which can be set to `False` in order to explicitly ask the playbook to not deploy secondary zones in case where the corresponding endpoint are not deployed yet. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1915478 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `71a5e666e3`)	2021-01-28 16:37:50 -05:00
Dimitri Savineau	3f16132e44	library: add ceph_osd_flag module This adds ceph_osd_flag ansible module for replacing the command module usage with the ceph osd set/unset commands. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `5da593604a`)	2020-12-15 17:36:28 +01:00
Guillaume Abrioux	8106dcff44	tests: rgw_multisite playbook test refactor Currently we create an object from the primary sites but we try to read that object still from the master which doesn't make sense, we should try to read it from a secondary site. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `e2ea403d5e`)	2020-12-15 17:30:04 +01:00
Guillaume Abrioux	d14723d5b4	mon: refact initial keyring generation adding monitor is no longer possible because we generate a new mon keyring each time the playbook is run. Fixes: #5864 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1902281 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `970c6a4ee6`)	2020-12-01 09:53:26 -05:00
Guillaume Abrioux	18b34a5bef	ceph_key: support using different keyring Currently the `ceph_key` module doesn't support using a different keyring than `client.admin`. This commit adds the possibility to use a different keyring. Usage: ``` ceph_key: name: "client.rgw.myrgw-node.rgw123" cluster: "ceph" user: "client.bootstrap-rgw" user_key: /var/lib/ceph/bootstrap-rgw/ceph.keyring dest: "/var/lib/ceph/radosgw/ceph-rgw.myrgw-node.rgw123/keyring" caps: osd: 'allow rwx' mon: 'allow rw' import_key: False owner: "ceph" group: "ceph" mode: "0400" ``` Where: `user` corresponds to `-n (--name)` `user_key` corresponds to `-k (--keyring)` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `12e6260266`)	2020-12-01 09:53:26 -05:00
Guillaume Abrioux	41c7c77817	Revert "ceph_key: support using different keyring" This reverts commit `74eb7cbecb`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-12-01 09:53:26 -05:00
Dimitri Savineau	fcf260b65b	tests: use github workflow for pytest Move the pytest testing from TravisCI to Github workflow. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `3e79f0322a`)	2020-11-18 10:49:30 -05:00
Guillaume Abrioux	04484f5c52	tests: enforce pytest-rerunfailures version This commit enforces the pytest-rerunfailures installed so it's <9.0 This is to avoid the following error: ``` ERROR: pytest-rerunfailures 9.0 has requirement pytest>=5.0, but you'll have pytest 4.6.11 which is incompatible. ``` latest version of pytest-rerunfailures isn't compatible with the version of pytest we are using. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `19097026fb`)	2020-11-18 10:49:30 -05:00
Guillaume Abrioux	7ffc8534ef	tests: change cephfs pool size `all_daemons` scenario can't handle pools with `size: 3` because we have 1 osd node in root=HDD and two nodes in root=default. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `e5713ea5d5`)	2020-10-06 17:54:51 -04:00
Guillaume Abrioux	74eb7cbecb	ceph_key: support using different keyring Currently the `ceph_key` module doesn't support using a different keyring than `client.admin`. This commit adds the possibility to use a different keyring. Usage: ``` ceph_key: name: "client.rgw.myrgw-node.rgw123" cluster: "ceph" user: "client.bootstrap-rgw" user_key: /var/lib/ceph/bootstrap-rgw/ceph.keyring dest: "/var/lib/ceph/radosgw/ceph-rgw.myrgw-node.rgw123/keyring" caps: osd: 'allow rwx' mon: 'allow rw' import_key: False owner: "ceph" group: "ceph" mode: "0400" ``` Where: `user` corresponds to `-n (--name)` `user_key` corresponds to `-k (--keyring)` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `12e6260266`)	2020-10-06 09:21:58 -04:00
Guillaume Abrioux	b13f0d12e7	tests: reboot and test idempotency on collocation test reboot and idempotency on collocation scenario. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f83f798206`)	2020-10-06 09:21:58 -04:00
Guillaume Abrioux	765db7ceec	flake8: fix pep8 syntax on tests/functional/tests/ tests/conftest.py and tests present in tests/functional/tests/ has been missed from previous commit Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `8596f1d52c`) # Conflicts: # .github/workflows/flake8.yml	2020-10-06 10:04:01 +02:00
Guillaume Abrioux	d0f29e08d8	flake8: fix all tests/library/.py files This commit modifies all .py files in ./tests/library/ so flake8 passes. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `e49a5241f0`)	2020-10-06 08:56:45 +02:00
Dimitri Savineau	fd0b9491b6	ansible: bump to ansible 2.9 Prior this commit we were supporting both ansible 2.8 and 2.9. Let's drop 2.8 now. Closes: #5459 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1879178 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-15 13:13:09 -04:00
Guillaume Abrioux	f31258d604	tests: do not run node_exporter test on clients We need to skip these tests on client nodes since we don't deploy node_exporter on them anymore Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `5650a6d7d0`)	2020-09-14 16:13:25 -04:00
Dimitri Savineau	47f24ec047	Add CentOS 8 support for rpm deployment We were only supporting CentOS 8 for containerized deployment. Since Nautilus 14.2.10 we now have el8 rpm packages so we should be able to deploy a nautilus ceph cluster with el8. Note that the nfs-ganesha isn't supported because there's no el8 rpm packages for nfs-ganesha V2.8. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-10 20:38:34 -04:00
Dimitri Savineau	0f7da8b9d1	pytest: register ceph_crash mark Otherwise we see some pytest warning. PytestUnknownMarkWarning: Unknown pytest.mark.ceph_crash - is this a typo? You can register custom marks to avoid this warning - for details, see https://docs.pytest.org/en/latest/mark.html Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `03d4620269`)	2020-09-10 20:35:04 -04:00
Guillaume Abrioux	66dde0034b	ceph-crash: introduce new role ceph-crash This commit introduces a new role `ceph-crash` in order to deploy everything needed for the ceph-crash daemon. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `9d2f2108e1`)	2020-09-10 20:35:04 -04:00
Dimitri Savineau	d461631c86	tests: use grafana from quay.io This changes the grafana container image regitry from docker.io to quay.io to avoid rate limit. This also adds the missing container image values for docker2podman and podman scenarios. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `dd05d8ba90`)	2020-09-10 21:37:06 +02:00
Guillaume Abrioux	2001039c0e	tests: migrate to quay.ceph.io registry in order to avoid docker.io rate limiting Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `218aedaab6`)	2020-09-10 21:37:06 +02:00
Guillaume Abrioux	2754895b89	tests: move erasure pool testing in lvm_osds This commit moves the erasure pool creation testing from `all_daemons` to `lvm_osds` so we can decrease the number of osd nodes we spawn so the OVH Jenkins slaves aren't less overwhelmed when a `all_daemons` based scenario is being tested. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `8476beb5b1`)	2020-08-20 14:16:57 +02:00
Guillaume Abrioux	bd5cde631b	tests: refact shrink_osd scenario This adds more coverage on the shrink_osd scenario. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `7efea219d6`)	2020-08-06 13:10:42 +02:00
Guillaume Abrioux	9e40062570	tests: lvm_setup.yml, add carriage return This commit adds crlf between each task. It makes the playbook more readable. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `8ef9fb68bc`)	2020-07-22 18:47:27 -04:00
Guillaume Abrioux	53793b352e	tests: (lvm_setup.yml), don't shrink lvol when rerunning lvm_setup.yml on existing cluster with OSDs already deployed, it fails like following: ``` fatal: [osd0]: FAILED! => changed=false msg: Sorry, no shrinking of data-lv2 to 0 permitted. ``` because we are asking `lvol` module to create a volume on an empty VG with size extents = `100%FREE`. The default behavior of `lvol` is to shrink the volume if the LV's current size is greater than the requested size. Given the requested size is calculated like this: `size_requested = size_percent * this_vg['free'] / 100` in our case, it is similar to: `size_requested = 100 * 0 / 100` which basically means `0` So the current LV size is well greater than the requested size which leads the module to attempt to shrink it to 0 which isn't obviously now allowed. Adding `shrink: false` to the module calls fixes this issue. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `218f4ae361`)	2020-07-22 18:47:27 -04:00
Dimitri Savineau	056a4fe866	ceph-dashboard: update create/get rgw user tasks Since [1] if a rgw user already exists then the radosgw-admin user create command will return an error instead of modifying the current user. We were already doing separated tasks for create and get operation but only for multisite configuration but it's not enough. Instead we should do the get task first and depending on the result execute the create. This commit also adds missing run_once and delegate_to statement. [1] https://github.com/ceph/ceph/commit/269e9b9 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `ac0f68ccf0`)	2020-07-20 21:21:57 +02:00
Jan Fajerski	14e9672f00	lvm_setup: lookup device from inventory, default to /dev/sd* names This fixes a long standing fail in ceph-volumes lvm test suite. Otherwise the default behaviour should not change. Signed-off-by: Jan Fajerski <jfajerski@suse.com> (cherry picked from commit `1fe8e819f9`)	2020-06-29 10:25:58 +02:00
Dimitri Savineau	a99c94ea11	ceph-osd: remove ceph-osd-run.sh script Since we only have one scenario since nautilus then we can just move the container start command from ceph-osd-run.sh to the systemd unit service. As a result, the ceph-osd-run.sh.j2 template and the ceph_osd_docker_run_script_path variable are removed. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `829990e60d`)	2020-06-23 17:35:01 +02:00
Guillaume Abrioux	8ef3fee41b	ceph_volume: make zap function idempotent This commit makes the zap function idempotent, especially when using lvm_volumes variable. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1845668 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `3f47236470`)	2020-06-23 10:49:07 +02:00
Dimitri Savineau	a97e24fee9	docker2podman: manage dashboard nodes The dashboard nodes (alertmanager, grafana, node-exporter, and prometheus) were not manage during the docker to podman migration. This adds the systemd container template of those services to a dedicated file (systemd.yml) in order to include it in the docker2podman playbook. This also adds the dashboard container images pull from docker to podman. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1829389 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `252e78b4e4`)	2020-06-03 13:20:24 -04:00
Dimitri Savineau	e34c95d28f	tests: update mgr dashboard socket listening test Since `15ed9ee` the ceph-mgr daemon binds on the IP address on the public network instead of binding on all addresses. This commit updates the testinfra code to reflect that change. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `0f0a14772c`)	2020-04-07 15:25:45 +02:00
Dimitri Savineau	47be2a2719	tests: register mark in pytest configuration Unregister marks generates warnings like: PytestUnknownMarkWarning: Unknown pytest.mark.docker - is this a typo? You can register custom marks to avoid this warning https://docs.pytest.org/en/latest/mark.html Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `ac4f8763aa`)	2020-04-07 15:25:45 +02:00
Dimitri Savineau	1cfb84ae94	tests: add dashboard testinfra configuration This commit adds basic tests for grafana, prometheus, node-exporter and ceph mgr dashboard services. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `f2c6281207`)	2020-04-07 15:25:45 +02:00
Guillaume Abrioux	825aed5ec1	ceph_key: remove 'update' state With this change, the state `present` is enough to update a keyring. If the keyring already exist, it will be updated if caps or secret passed to the module are different. If the keyring doen't exist, it will be created. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1808367 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `553584cbd0`)	2020-04-01 18:08:51 -04:00
Guillaume Abrioux	03355aec8c	tests: add more coverage in external_clients scenario Run create_users_keys.yml in external_clients scenario Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `8c1c34b201`)	2020-03-31 19:42:40 -04:00
Dimitri Savineau	dcd02e6494	ceph_volume: fix multiple db/wal devices When using the lvm batch ceph-volume subcommand with dedicated devices for bluestore (db/wal) then the list of devices is convert to a string instead of being extended via an iterable. This was working with only one dedicated device but starting with more then the ceph_volume module fails. TASK [ceph-osd : use ceph-volume lvm batch to create bluestore osds] ** fatal: [xxxxxx]: FAILED! => changed=true cmd: - ceph-volume - --cluster - ceph - lvm - batch - --bluestore - --yes - --prepare - --osds-per-device - '4' - /dev/nvme2n1 - /dev/nvme3n1 - /dev/nvme4n1 - /dev/nvme5n1 - /dev/nvme6n1 - --db-devices - /dev/nvme0n1 /dev/nvme1n1 - --report - --format=json msg: non-zero return code rc: 2 stderr: \|2- stderr: lsblk: /dev/nvme0n1 /dev/nvme1n1: not a block device stderr: error: /dev/nvme0n1 /dev/nvme1n1: No such file or directory stderr: Unknown device, --name=, --path=, or absolute path in /dev/ or /sys expected. usage: ceph-volume lvm batch [-h] [--db-devices [DB_DEVICES [DB_DEVICES ...]]] [--wal-devices [WAL_DEVICES [WAL_DEVICES ...]]] [--journal-devices [JOURNAL_DEVICES [JOURNAL_DEVICES ...]]] [--no-auto] [--bluestore] [--filestore] [--report] [--yes] [--format {json,pretty}] [--dmcrypt] [--crush-device-class CRUSH_DEVICE_CLASS] [--no-systemd] [--osds-per-device OSDS_PER_DEVICE] [--block-db-size BLOCK_DB_SIZE] [--block-wal-size BLOCK_WAL_SIZE] [--journal-size JOURNAL_SIZE] [--prepare] [--osd-ids [OSD_IDS [OSD_IDS ...]]] [DEVICES [DEVICES ...]] ceph-volume lvm batch: error: Unable to proceed with non-existing device: /dev/nvme0n1 /dev/nvme1n1 So the dedicated device list is considered as a single string. This commit also adds the block_db_devices and wal_devices documentation to the ceph_volume module. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1816713 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `760b6cd7b0`)	2020-03-30 10:04:26 -04:00
Guillaume Abrioux	d682cf6de5	tests: add inventory host for 5.0 upgrade job This inventory is intended to be used in the upgrade scenario in stable-5.0 branch. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-03-26 11:23:23 +01:00
Dimitri Savineau	55c222d088	dashboard: allow to set read-only admin user This commit allows one to set the role for the admin user as read-only. This can be controlled via the dashboard_admin_user_ro variable but the default value is false for backward compatibility. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1810176 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `fb69f6990c`)	2020-03-19 13:24:05 -04:00
Guillaume Abrioux	c26e80fdbf	rgw: add multi-instances support when deploying multisite This commit adds the multi-instances when deploying rgw multisite Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> Co-authored-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `60a2e28189`)	2020-03-12 19:04:26 -04:00
Dimitri Savineau	3fc4cc9f62	tests/requirements: bump testinfra 3.4 is the latest testinfra release available but python2 is dropped starting 4.0. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `ccec67aa6a`)	2020-03-09 13:33:44 +01:00
Guillaume Abrioux	b9e397ebaf	tests: add more osd nodes in all_daemons scenario This commit adds more osd nodes in all_daemons scenario in order to test erasure pool creation. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `9f0c6df94f`)	2020-03-06 16:10:03 +01:00
Guillaume Abrioux	0800c60721	tests: update ooo job This commit changes the value passed for the attribute 'rule_name' in openstack_pools definition. It doesn't make sense to have emptry string as passed value here. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `248978596a`)	2020-03-06 16:10:03 +01:00
Guillaume Abrioux	6cc9c28c5d	tests: add erasure pool creation test in CI This commit makes the CI testing an OSD pool erasure creation due to the recent refact of the OSD pool creation tasks in the playbook. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `8cacba1f54`)	2020-03-06 16:10:03 +01:00
Guillaume Abrioux	01559892d1	tests: enable pg autoscaler on 1 pool This commit enables the pg autoscaler on 1 pool. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `a3b797e059`)	2020-03-06 16:10:03 +01:00
Ali Maredia	2c440d4427	rgw multisite: enable more than 1 realm per cluster Make it so that more than one realm, zonegroup, or zone can be created during a run of the rgw multisite ansible playbooks. The rgw hosts now need to be grouped into zones and realms in the inventory. .yml files need to be created in group_vars for the realms and zones. Sample yaml files are available. Also remove multsite destroy playbook and add --cluster before radosgw-admin commands remove manually added rgw_zone_endpoints var and have ceph-ansible automatically add the correct endpoints of all the rgws in a rgw_zone from the information provided in that rgws hostvars. Signed-off-by: Ali Maredia <amaredia@redhat.com> (cherry picked from commit `71f55bd54d`)	2020-03-04 14:39:23 -05:00
Guillaume Abrioux	96b7857347	requirements: enforce ansible version requirement See https://github.com/advisories/GHSA-3m93-m4q6-mc6v Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `a2d2e70ac2`)	2020-02-27 09:56:55 -05:00
Ali Maredia	7d2a217270	rgw: extend automatic rgw pool creation capability Add support for erasure code pools. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1731148 Signed-off-by: Ali Maredia <amaredia@redhat.com> Co-authored-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `1834c1e48d`)	2020-02-17 17:44:53 -05:00
Guillaume Abrioux	e3cd719ebe	tests: add external_clients scenario This commit adds a new 'external ceph clients' scenario. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `641729357e`)	2020-01-31 13:37:10 +01:00
Guillaume Abrioux	2e7d7b70ed	tests: set dashboard\|grafana_admin_password Set these 2 variables in all test scenarios where `dashboard_enabled` is `True` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `c040199c8f`)	2020-01-29 14:15:41 +01:00
Guillaume Abrioux	d5dca5087a	tests: add 'all_in_one' scenario Add new scenario 'all_in_one' in order to catch more collocated related issues. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `3e7dbb4b16`)	2020-01-27 17:54:39 -05:00

1 2 3 4 5 ...

552 Commits (858048560e974a78db7d46a9afaebd6ae1a83c55)