ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	218aedaab6	tests: migrate to quay.ceph.io registry in order to avoid docker.io rate limiting Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `2cbb7de3b2`)	2020-09-10 17:30:37 +02:00
Guillaume Abrioux	f22078b16d	tests: add more coverage for test_ceph_key This commit adds more coverage regarding the testing of ceph_key module Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `27ca884d99`)	2020-08-21 13:56:11 +02:00
Dimitri Savineau	dcf915f597	tests: reenable nfs-ganesha testing This re-adds the nfs-ganesha testing in non containerized deployment. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `6c11695fbe`)	2020-08-20 12:22:32 -04:00
Guillaume Abrioux	4685b411de	tests: move erasure pool testing in lvm_osds This commit moves the erasure pool creation testing from `all_daemons` to `lvm_osds` so we can decrease the number of osd nodes we spawn so the OVH Jenkins slaves aren't less overwhelmed when a `all_daemons` based scenario is being tested. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `8476beb5b1`)	2020-08-20 11:55:40 +02:00
Guillaume Abrioux	4369833008	tests: test iscsigw against stable build This commit makes the ci using stable build for testing iscsigw in stable-5.0 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-08-12 12:19:18 -04:00
Guillaume Abrioux	8632db7cb8	tests: refact shrink_osd scenario This adds more coverage on the shrink_osd scenario. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `7efea219d6`)	2020-08-06 13:09:38 +02:00
Dimitri Savineau	92a2a2cf32	pytest: register ceph_crash mark Otherwise we see some pytest warning. PytestUnknownMarkWarning: Unknown pytest.mark.ceph_crash - is this a typo? You can register custom marks to avoid this warning - for details, see https://docs.pytest.org/en/latest/mark.html Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `03d4620269`)	2020-08-06 09:41:54 +02:00
Guillaume Abrioux	e6059fdcd3	ceph-crash: introduce new role ceph-crash This commit introduces a new role `ceph-crash` in order to deploy everything needed for the ceph-crash daemon. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `9d2f2108e1`)	2020-07-22 18:47:01 -04:00
Guillaume Abrioux	0b5a2648e3	tests: lvm_setup.yml, add carriage return This commit adds crlf between each task. It makes the playbook more readable. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `8ef9fb68bc`)	2020-07-22 18:46:49 -04:00
Guillaume Abrioux	a4efb521f5	tests: (lvm_setup.yml), don't shrink lvol when rerunning lvm_setup.yml on existing cluster with OSDs already deployed, it fails like following: ``` fatal: [osd0]: FAILED! => changed=false msg: Sorry, no shrinking of data-lv2 to 0 permitted. ``` because we are asking `lvol` module to create a volume on an empty VG with size extents = `100%FREE`. The default behavior of `lvol` is to shrink the volume if the LV's current size is greater than the requested size. Given the requested size is calculated like this: `size_requested = size_percent * this_vg['free'] / 100` in our case, it is similar to: `size_requested = 100 * 0 / 100` which basically means `0` So the current LV size is well greater than the requested size which leads the module to attempt to shrink it to 0 which isn't obviously now allowed. Adding `shrink: false` to the module calls fixes this issue. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `218f4ae361`)	2020-07-22 18:46:49 -04:00
Dimitri Savineau	b7fd3bc844	cephadm: add playbook This adds a new playbook for deploying ceph via cephadm. This also adds a new dedicated tox file for CI purpose. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `957903d561`)	2020-07-16 12:00:14 -04:00
Dimitri Savineau	3a2bda2b11	tests: remove nfs_ganesha_stable_branch variable We don't need to override this variable in the group_vars but use the default value instead. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `fc599ed9f5`)	2020-07-06 11:00:44 -04:00
Guillaume Abrioux	07f1ec287b	tests: update nfs-ganesha to V3.3-stable not really needed in master, commit intended to be backported in octopus branch. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `5b6f5486f7`)	2020-07-06 10:37:35 -04:00
Dimitri Savineau	101412de78	vagrant: update centos image to 8.2 CentOS 8.2 (2004) has been relesed so we should switch to this image when using vagrant. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `72293b6614`)	2020-07-03 06:37:58 +02:00
Jan Fajerski	5505b713f2	lvm_setup: lookup device from inventory, default to /dev/sd* names This fixes a long standing fail in ceph-volumes lvm test suite. Otherwise the default behaviour should not change. Signed-off-by: Jan Fajerski <jfajerski@suse.com> (cherry picked from commit `1fe8e819f9`)	2020-06-27 09:31:47 -04:00
Dimitri Savineau	51cfb89501	ceph-osd: remove ceph-osd-run.sh script Since we only have one scenario since nautilus then we can just move the container start command from ceph-osd-run.sh to the systemd unit service. As a result, the ceph-osd-run.sh.j2 template and the ceph_osd_docker_run_script_path variable are removed. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `829990e60d`)	2020-06-23 17:35:24 +02:00
Guillaume Abrioux	e4f972004b	ceph_volume: make zap function idempotent This commit makes the zap function idempotent, especially when using lvm_volumes variable. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1845668 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `3f47236470`)	2020-06-23 09:50:01 +02:00
Guillaume Abrioux	06cfbb10d4	requirements: exclude ansible 2.9.10 ansible 2.9.10 seems to have introduced a bug. See https://github.com/ansible/ansible/issues/70168 This commit excludes this version from ceph-ansible requirements. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `1525990f39`)	2020-06-22 13:05:33 -04:00
Guillaume Abrioux	33897f9d92	ceph_pool: add tests Add unit tests for ceph_pool module Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `886b5256fd`)	2020-05-22 17:05:22 +02:00
Guillaume Abrioux	09a8d5d71e	tests: update pools definitions setting attributes with empty string is a bad user input. Also, removing `rule_name` attribute when creating a code erasure pool. (this rule isnt intended for code erasure pool type). Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `83faf94351`)	2020-05-22 17:05:22 +02:00
Dimitri Savineau	c27d3d9150	tests/library: parametrize ceph_volume objecstore This adds the objectstore testing for both filestore and bluestore on the ceph_volume module. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `5407e898a6`)	2020-05-15 10:20:39 -04:00
Dimitri Savineau	2d396a2311	tests/library: define container cmd once In containerized deployment, the ceph_volume module will always uses the same container command prefix for all actions. Instead of duplicate this code in all container tests we can define it once. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `a8e458c452`)	2020-05-15 10:20:39 -04:00
Dimitri Savineau	9a7af0ce6a	docker2podman: manage dashboard nodes The dashboard nodes (alertmanager, grafana, node-exporter, and prometheus) were not manage during the docker to podman migration. This adds the systemd container template of those services to a dedicated file (systemd.yml) in order to include it in the docker2podman playbook. This also adds the dashboard container images pull from docker to podman. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1829389 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `252e78b4e4`)	2020-05-13 16:41:11 -04:00
Dimitri Savineau	8a10918e49	Readd CentOS 7 with conditions The CentOS 7 distribution could still be used be deploying ceph if - it's a containerized deployment - it's a non containerized deployment without the dashboard (due to missing python3 libraries). The ceph_stable_redhat_distro variable has been remove because we can rely on the ansible_distribution_major_version fact instead. The copr el8 repository configuration is only applied for CentOS 8. The ceph-mgr-dashboard package is only installed when the dashboard_enabled variable is set to true. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `2547ab601a`)	2020-04-23 16:07:14 -04:00
Guillaume Abrioux	587b153fc3	tests: add back nfs testing on master This commit adds back nfs testing on master branch (containerized scenario only). Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `86959abf9b`)	2020-04-23 10:08:51 -04:00
Dimitri Savineau	3ce5a71a9c	tests: update mgr dashboard socket listening test Since `15ed9ee` the ceph-mgr daemon binds on the IP address on the public network instead of binding on all addresses. This commit updates the testinfra code to reflect that change. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `0f0a14772c`)	2020-04-06 18:02:22 -04:00
Dimitri Savineau	4f663412ff	tests: register mark in pytest configuration Unregister marks generates warnings like: PytestUnknownMarkWarning: Unknown pytest.mark.docker - is this a typo? You can register custom marks to avoid this warning https://docs.pytest.org/en/latest/mark.html Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `ac4f8763aa`)	2020-04-06 18:02:22 -04:00
Dimitri Savineau	1f4a08982d	tests: add dashboard testinfra configuration This commit adds basic tests for grafana, prometheus, node-exporter and ceph mgr dashboard services. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `f2c6281207`)	2020-04-06 18:02:22 -04:00
Dimitri Savineau	c094143523	vagrant: force centos 8.1 libvirt image The current centos/8 vagrant image (libvirt) is still using the CentOS 8.0 release (1905) while the 8.1 release (1911) is already available since few months. Using an update CentOS 8 release fixes slow ceph-volume/lvm commands. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `6264f6979e`)	2020-04-02 14:18:55 -04:00
Guillaume Abrioux	f02d9e1a9f	ceph_key: remove 'update' state With this change, the state `present` is enough to update a keyring. If the keyring already exist, it will be updated if caps or secret passed to the module are different. If the keyring doen't exist, it will be created. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1808367 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `553584cbd0`)	2020-04-01 18:08:40 -04:00
Guillaume Abrioux	5b89635a50	tests: add more coverage in external_clients scenario Run create_users_keys.yml in external_clients scenario Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `8c1c34b201`)	2020-03-31 19:42:14 -04:00
Guillaume Abrioux	8d056a75be	tests: bump nfs-ganesha version This commit change the nfs-ganesha version for all_daemons scenario (V3.2) Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-03-31 10:02:47 -04:00
Guillaume Abrioux	ddeb603e3e	tests: update testing This commit updates the testing so we test stable-5.0 against ceph@octopus. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-03-31 10:02:47 -04:00
Dimitri Savineau	bb8d11dbd4	ceph_volume: fix multiple db/wal/journal devices When using the lvm batch ceph-volume subcommand with dedicated devices for filestore (journal) or bluestore (db/wal) then the list of devices is convert to a string instead of being extended via an iterable. This was working with only one dedicated device but starting with more then the ceph_volume module fails. TASK [ceph-osd : use ceph-volume lvm batch to create bluestore osds] ** fatal: [xxxxxx]: FAILED! => changed=true cmd: - ceph-volume - --cluster - ceph - lvm - batch - --bluestore - --yes - --prepare - --osds-per-device - '4' - /dev/nvme2n1 - /dev/nvme3n1 - /dev/nvme4n1 - /dev/nvme5n1 - /dev/nvme6n1 - --db-devices - /dev/nvme0n1 /dev/nvme1n1 - --report - --format=json msg: non-zero return code rc: 2 stderr: \|2- stderr: lsblk: /dev/nvme0n1 /dev/nvme1n1: not a block device stderr: error: /dev/nvme0n1 /dev/nvme1n1: No such file or directory stderr: Unknown device, --name=, --path=, or absolute path in /dev/ or /sys expected. usage: ceph-volume lvm batch [-h] [--db-devices [DB_DEVICES [DB_DEVICES ...]]] [--wal-devices [WAL_DEVICES [WAL_DEVICES ...]]] [--journal-devices [JOURNAL_DEVICES [JOURNAL_DEVICES ...]]] [--no-auto] [--bluestore] [--filestore] [--report] [--yes] [--format {json,pretty}] [--dmcrypt] [--crush-device-class CRUSH_DEVICE_CLASS] [--no-systemd] [--osds-per-device OSDS_PER_DEVICE] [--block-db-size BLOCK_DB_SIZE] [--block-wal-size BLOCK_WAL_SIZE] [--journal-size JOURNAL_SIZE] [--prepare] [--osd-ids [OSD_IDS [OSD_IDS ...]]] [DEVICES [DEVICES ...]] ceph-volume lvm batch: error: Unable to proceed with non-existing device: /dev/nvme0n1 /dev/nvme1n1 So the dedicated device list is considered as a single string. This commit also adds the journal_devices, block_db_devices and wal_devices documentation to the ceph_volume module. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1816713 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `760b6cd7b0`)	2020-03-30 15:25:12 +02:00
Dimitri Savineau	1b094acf24	container: remove ulimit nofile parameter Since Ceph Octopus is python3 only we don't need to specify the max open files anymore with the container engine. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `64701437de`)	2020-03-30 09:22:28 -04:00
Dimitri Savineau	df8f853c85	Add pacific release Add the 16th ceph release: pacific. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-03-24 09:47:12 +01:00
Dimitri Savineau	fb69f6990c	dashboard: allow to set read-only admin user This commit allows one to set the role for the admin user as read-only. This can be controlled via the dashboard_admin_user_ro variable but the default value is false for backward compatibility. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1810176 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-03-19 15:34:41 +01:00
Guillaume Abrioux	60a2e28189	rgw: add multi-instances support when deploying multisite This commit adds the multi-instances when deploying rgw multisite Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> Co-authored-by: Dimitri Savineau <dsavinea@redhat.com>	2020-03-12 16:44:48 -04:00
Dimitri Savineau	e62532de46	update osd pool set size command Since [1] we can't use osd pool without replicas (size: 1) by default. We now need to set the mon_allow_pool_size_one flag to true in the ceph configuration and add the --yes-i-really-mean-it flag to the osd pool set size cli. [1] https://github.com/ceph/ceph/commit/21508bd Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-03-11 11:25:42 +01:00
Dimitri Savineau	ccec67aa6a	tests/requirements: bump testinfra 3.4 is the latest testinfra release available but python2 is dropped starting 4.0. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-03-09 09:46:11 +01:00
Ali Maredia	71f55bd54d	rgw multisite: enable more than 1 realm per cluster Make it so that more than one realm, zonegroup, or zone can be created during a run of the rgw multisite ansible playbooks. The rgw hosts now need to be grouped into zones and realms in the inventory. .yml files need to be created in group_vars for the realms and zones. Sample yaml files are available. Also remove multsite destroy playbook and add --cluster before radosgw-admin commands remove manually added rgw_zone_endpoints var and have ceph-ansible automatically add the correct endpoints of all the rgws in a rgw_zone from the information provided in that rgws hostvars. Signed-off-by: Ali Maredia <amaredia@redhat.com>	2020-03-04 12:58:13 -05:00
Guillaume Abrioux	9f0c6df94f	tests: add more osd nodes in all_daemons scenario This commit adds more osd nodes in all_daemons scenario in order to test erasure pool creation. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-03-04 09:29:01 -05:00
Guillaume Abrioux	248978596a	tests: update ooo job This commit changes the value passed for the attribute 'rule_name' in openstack_pools definition. It doesn't make sense to have emptry string as passed value here. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-03-04 09:29:01 -05:00
Guillaume Abrioux	8cacba1f54	tests: add erasure pool creation test in CI This commit makes the CI testing an OSD pool erasure creation due to the recent refact of the OSD pool creation tasks in the playbook. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-03-04 09:29:01 -05:00
Guillaume Abrioux	a3b797e059	tests: enable pg autoscaler on 1 pool This commit enables the pg autoscaler on 1 pool. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-03-04 09:29:01 -05:00
Guillaume Abrioux	896d00b50e	tests: add lvm batch filestore testing This commit adds an OSD node in lvm-batch scenario in order to test filestore backend. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-03-03 13:50:19 -05:00
Guillaume Abrioux	0fc99bb6fa	tests: increase journal_size value Looks like we are still seeing issue [1]. Let's increase this value to unlock the CI (however, it still needs to be investigated). Typical error (see [1] for further details) : ``` [root@osd2 ~]# ceph-volume --cluster ceph lvm batch --filestore --yes --journal-size '2048' /dev/sda /dev/sdb --journal-devices /dev/sdc Running command: /sbin/vgcreate --force --yes ceph-journals-817ef90b-77ac-4f52-b8a9-30893849fb78 /dev/sdc stdout: Physical volume "/dev/sdc" successfully created. stdout: Volume group "ceph-journals-817ef90b-77ac-4f52-b8a9-30893849fb78" successfully created --> Refusing to continue with configured size for journal --> RuntimeError: journal sizes must be larger than 2GB, detected: 1024.00 MB ``` [1] https://tracker.ceph.com/issues/41374 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-03-03 13:23:57 -05:00
Guillaume Abrioux	0326d992c2	osd: add journal option in ceph_volume call (batch) This commit adds the journal option to the ceph_volume call when scenario is lvm batch Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-02-28 17:29:59 -05:00
Guillaume Abrioux	a2d2e70ac2	requirements: enforce ansible version requirement See https://github.com/advisories/GHSA-3m93-m4q6-mc6v Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-02-27 09:28:17 -05:00
Dimitri Savineau	ac0f68ccf0	ceph-dashboard: update create/get rgw user tasks Since [1] if a rgw user already exists then the radosgw-admin user create command will return an error instead of modifying the current user. We were already doing separated tasks for create and get operation but only for multisite configuration but it's not enough. Instead we should do the get task first and depending on the result execute the create. This commit also adds missing run_once and delegate_to statement. [1] https://github.com/ceph/ceph/commit/269e9b9 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-02-18 10:22:21 +01:00
Ali Maredia	1834c1e48d	rgw: extend automatic rgw pool creation capability Add support for erasure code pools. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1731148 Signed-off-by: Ali Maredia <amaredia@redhat.com> Co-authored-by: Dimitri Savineau <dsavinea@redhat.com>	2020-02-17 16:07:43 +01:00
Dimitri Savineau	85d7102a95	Revert "vagrant: temp workaround for CentOS 8 cloud image" The CentOS 8 vagrant image download is now fixed. This reverts commit `a5385e1048`. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-02-17 11:30:39 +01:00
Dimitri Savineau	779a4a6d71	tests: don't install s3cmd on containerized setup The s3cmd package should only be installed on non containerized deployment. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-02-17 11:27:52 +01:00
Guillaume Abrioux	910fc61fdc	tests: remove legacy `osd_scenario` variable As of stable-4.0 most of these references aren't needed anymore. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-02-04 10:05:33 +01:00
Guillaume Abrioux	641729357e	tests: add external_clients scenario This commit adds a new 'external ceph clients' scenario. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-31 12:02:15 +01:00
Guillaume Abrioux	c040199c8f	tests: set dashboard\|grafana_admin_password Set these 2 variables in all test scenarios where `dashboard_enabled` is `True` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-29 08:45:34 +01:00
Guillaume Abrioux	3e7dbb4b16	tests: add 'all_in_one' scenario Add new scenario 'all_in_one' in order to catch more collocated related issues. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-27 15:30:45 -05:00
Dimitri Savineau	bb3eae0c80	filestore-to-bluestore: fix osd_auto_discovery When osd_auto_discovery is set then we need to refresh the ansible_devices fact between after the filestore OSD purge otherwise the devices fact won't be populated. Also remove the gpt header on ceph_disk_osds_devices because the devices is empty at this point for osd_auto_discovery. Adding the bool filter when needed. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1729267 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-22 09:36:09 +01:00
Dimitri Savineau	f995b079a6	filestore-to-bluestore: --destroy with raw devices We still need --destroy when using a raw device otherwise we won't be able to recreate the lvm stack on that device with bluestore. Running command: /usr/sbin/vgcreate -s 1G --force --yes ceph-bdc67a84-894a-4687-b43f-bcd76317580a /dev/sdd stderr: Physical volume '/dev/sdd' is already in volume group 'ceph-b7801d50-e827-4857-95ec-3291ad6f0151' Unable to add physical volume '/dev/sdd' to volume group 'ceph-b7801d50-e827-4857-95ec-3291ad6f0151' /dev/sdd: physical volume not initialized. --> Was unable to complete a new OSD, will rollback changes Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1792227 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-21 11:37:39 -05:00
Dimitri Savineau	a5385e1048	vagrant: temp workaround for CentOS 8 cloud image The CentOS cloud infrastructure storing the vagrant CentOS 8 image changed the directory path and remove the old 8.0 image so the vagrant box add centos/8 fails returning a 404 http error. As a workaround we can pull the image from CentOS instead of letting vagrant doing the resolution. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-15 17:52:35 +01:00
Dimitri Savineau	3900527e16	tests/setup: update mount options on EL 8 The nobarrier mount flag doesn't exist anymoer on XFS in the EL 8 kernel. That's why the task wasn't working on those systems. We can still use the other options instead of skipping the task. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-11 05:33:01 +01:00
Guillaume Abrioux	dc672e86ec	tests: add a docker2podman scenario This commit adds a new scenario in order to test docker-to-podman.yml migration playbook. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-10 10:21:29 -05:00
Guillaume Abrioux	4f2baaab8c	tests: disable nfs testing nfs-ganesha makes the CI failing because of issue related to SELinux. See: - https://bugzilla.redhat.com/show_bug.cgi?id=1788563 - https://github.com/nfs-ganesha/nfs-ganesha/issues/527 Until we can get this fixed, let's disable nfs-ganesha testing temporarily. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-08 11:13:46 +01:00
Dimitri Savineau	7b3e6b932c	tests/functional: change docker to podman Some docker commands were hardcoded in tests playbooks and some conditions were not taking care of the containerized_deployment variable but only the atomic fact. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-08 11:13:46 +01:00
Guillaume Abrioux	217d95abb2	common: add centos8 support Ceph octopus only supports CentOS 8. This commit adds CentOS 8 support: - update vagrant image in tox configurations. - add CentOS 8 repository for el8 dependencies. - CentOS 8 container engine is podman (same than RHEL 8). - don't use the epel mirror on sepia because it's epel7 only. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> Co-authored-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-08 11:13:46 +01:00
Guillaume Abrioux	40de34fb5e	tests: add filestore_to_bluestore job This commit adds a new job in order to test the filestore-to-bluestore.yml infrastructure playbook. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-12-11 09:04:41 -05:00
Dimitri Savineau	4a6d19dae2	tests: reduce max_mds from 3 to 2 Having max_mds value equals to the number of mds nodes generates a warning in the ceph cluster status: cluster: id: 6d3e49a4-ab4d-4e03-a7d6-58913b8ec00a' health: HEALTH_WARN' insufficient standby MDS daemons available' (...) services: mds: cephfs:3 {0=mds1=up:active,1=mds0=up:active,2=mds2=up:active}' Let's use 2 active and 1 standby mds. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-12-04 14:07:29 -05:00
Dimitri Savineau	3f29b243ea	tests: fix cluster health status The current ceph cluster health is in warning state: health: HEALTH_WARN 13 pool(s) have no replicas configured 2 pool(s) have non-power-of-two pg_num Because we're using only 1 replica then we need to disable the redundancy check. The pool pg num should be a power of two number (like 16). Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-11-27 16:20:17 +01:00
Dimitri Savineau	ef2cb99f73	ceph-osd: add device class to crush rules This adds device class support to crush rules when using the class key in the rule dict via the create-replicated sub command. If the class key isn't specified then we use the create-simple sub command for backward compatibility. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1636508 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-11-14 16:25:46 +01:00
Guillaume Abrioux	16bcef4f28	tests: add time command in vagrant_up.sh monitor how long it takes to get all VMs up and running Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-11-08 15:47:46 +01:00
Guillaume Abrioux	db77fbda15	tests: add coverage on purge playbook This commit adds a playbook to be played before we run purge playbook, it first creates an rbd image then map an rbd device on client0 so the purge playbook will try to unmap it. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-11-08 09:06:11 -05:00
Dimitri Savineau	02df2ab5ea	tests/requirements: bump testinfra and pytest The ansible ssh connections are now using the ssh backend instead of paramiko starting testinfra 3.1 and persistent connections too. pytest 4.6 is the latest release to be supported by python 2. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-11-04 09:09:49 -05:00
Dimitri Savineau	6ce4fde820	move library/plugins tests files under tests dir To avoid unnecessary ansible warnings during playbook execution we can move the library and plugins test files under a different directory. [WARNING]: Skipping plugin (plugins/filter/test_ipaddrs_in_ranges.py) as it seems to be invalid: cannot import name 'ipaddrs_in_ranges' Closes: #4656 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-10-28 09:23:17 +01:00
Guillaume Abrioux	b5a61fe2e3	tests: use osd ids instead of device name in ooo_collocation on master, it doesn't make sense anymore to use device name, we should use osd id instead. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-10-22 13:45:19 +02:00
Guillaume Abrioux	384161edcd	tests: fix keyring creation in ooo_collocation This commit removes the backslash in allow command parameter, this was needed before the ceph_key module integration. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-10-22 13:45:19 +02:00
Dimitri Savineau	3c2840da03	tests: update container tag for ooo_collocation It doesn't make sense to test the old 3.0.x container images with nautilus+ ceph releases. Also disable the dashboard deployment and switch to bluestore backend. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-10-22 13:45:19 +02:00
Guillaume Abrioux	25b98b2ce3	tests: add multimds coverage This commit makes the all_daemons scenario deploying 3 mds in order to cover the multimds case. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-10-18 13:43:13 -04:00
Dimitri Savineau	2c03c6fcd3	tests: fix the size on the second data LV The commit replaces the pv/vg/lv commands used with the ansible command module by the lvg and lvol modules. This also fixes the size of the second data LV because we were only using 50% of the remaining space instead of 100%. With a 50G device, the result was: - data-lv1 was 25G - data-lv2 was 12.5G Instead of: - data-lv1 was 25G - data-lv2 was 25G Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-10-17 15:49:15 -04:00
Dimitri Savineau	0f978d969b	Remove validate action and notario dependency The current ceph-validate role is using both validate action and fail module tasks to validate the ceph configuration. The validate action is based on the notario python library. When one of the notario validation fails then a python stack trace is reported to the ansible task. This output isn't understandable by users. This patch removes the validate action and the notario depencendy. The validation is now done with only fail ansible module. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1654790 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-10-15 11:34:49 +02:00
Dimitri Savineau	04ec1ad3cc	tests: reduce handler mon and osd delay We don't need to have high handler delay in the CI so reducing to 10 seconds. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-10-09 09:08:20 +02:00
Dimitri Savineau	010158ff84	tests: fix rgw multisite vagrant variables The secondary vagrant variables didn't have the grafana vm variable set which create an vagrant error. There was an error loading a Vagrantfile. The file being loaded and the error message are shown below. This is usually caused by an invalid or undefined variable. This patch also changes the ssh-extra-args parameter to ssh-common-args to get the same values for ssh/sftp/scp. Otherwise we can see warnings from ansible and some tasks are failing. [WARNING]: sftp transfer mechanism failed on [mon0]. Use ANSIBLE_DEBUG=1 to see detailed information It also updates the ssh-common-args value for the rgw-multisite scenario to reflect the ANSIBLE_SSH_ARGS environment variable value. Finally changing the IP addresses due to the Vagrant refact done in the commit `778c51a` Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-10-04 15:12:50 -04:00
Guillaume Abrioux	01f6dd52b3	tests: remove debug log verbosity This was added for debugging purpose. It's generating very large log output, let's remove this now. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-09-28 11:20:49 +02:00
Guillaume Abrioux	006df148d0	tests: pin jinja2 version ensure we get the latest jinja2 version. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-09-26 11:35:24 +02:00
Guillaume Abrioux	5bb6a4da42	tests: set copy_admin_key at group_vars level setting it at extra vars level prevent from setting it per node. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-09-26 11:35:24 +02:00
Guillaume Abrioux	da094ac5ee	tests: do not rely on pg_num to validate rgw_tuning_pools Since the pg_autoscaler has been enabled recently in ceph, this check should stick to validate the requested pools are well created only. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-09-18 14:05:23 +02:00
Dimitri Savineau	825045f6b4	tests: use a single grafana node on podman We don't use multiple grafana nodes for the moment on the others scenarios and I don't think this is supposed to be working. We can often see failure on grafana on that scenario. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-08-28 11:42:48 -04:00
Guillaume Abrioux	05686509f3	tests: update test_mgr_is_up() the data structure has changed in octopus: ``` "mgrmap": { "available": true, "modules": [ "dashboard", "prometheus" ], "num_standbys": 0, "services": { "prometheus": "http://mgr0:9283/" } }, ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-08-14 16:42:02 +02:00
Dimitri Savineau	31bd5e08a6	Revert "tests: disable nfs-ganesha deployment" This reverts commit `83940e624b`. Because nfs-ganesha@master (2.9-dev) build has been fixed by [1] then we can test nfs-ganesha in the CI for master/octopus. [1] https://github.com/ceph/ceph-build/pull/1346 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-08-07 10:40:43 +02:00
Dimitri Savineau	867583d5dd	tests/shrink_rgw: Disable dashboard The shrink_rgw scenario has been merge just after the PR about enable ceph dashboard by default. So right now the shrink_rgw scenrio doesn't have nodes in the grafana group and fails. We just need to set dashboard_enabled to false. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-07-31 14:53:05 -04:00
Guillaume Abrioux	0f620b2584	tests: add more memory in podman job Typical error : ``` fatal: [mon1 -> mon0]: FAILED! => changed=true cmd: - podman - exec - ceph-mon-mon0 - ceph - config - set - mgr - mgr/dashboard/ssl - 'false' delta: '0:00:00.644870' end: '2019-07-30 10:17:32.715639' msg: non-zero return code rc: 1 start: '2019-07-30 10:17:32.070769' stderr: \|- Traceback (most recent call last): File "/usr/bin/ceph", line 140, in <module> import rados ImportError: libceph-common.so.0: cannot map zero-fill pages: Cannot allocate memory Error: exit status 1 stderr_lines: <omitted> stdout: '' stdout_lines: <omitted> ``` Let's add more memory to get around this issue. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-07-30 13:52:44 +02:00
Guillaume Abrioux	d649e00893	tests: deploy dashboard on mons there's no dedicated nodes for mgr, let's use monitor nodes. The mgr0 instance spawned isn't used, so if this node is part of the inventory for this scenario, testinfra will complain because there's no ceph.conf on this node. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-07-30 13:52:44 +02:00
Rishabh Dave	236b081a3a	tests/functional: add a test for shrink-rgw.yml Add a new functional test that deploys a Ceph cluster with three nodes for MON, OSD and RGW and then runs shrink-rgw.yml to test it. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-07-30 08:45:57 +02:00
Guillaume Abrioux	3c2fd337d9	tests: test dashboard deployment with podman scenario This commit adds a grafana-server section in order to test dashboard deployment with podman. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-07-29 14:42:45 +02:00
Guillaume Abrioux	fb1b5b3251	dashboard: enable dashboard by default This commit enables dashboard deployment by default. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1726739 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-07-29 14:42:45 +02:00
Dimitri Savineau	07c6695d16	Remove NBSP characters Some NBSP are still present in the yaml files. Adding a test in travis CI. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-07-26 16:09:23 -04:00
Guillaume Abrioux	83940e624b	tests: disable nfs-ganesha deployment nfs-ganesha repositories @ dev are broken, this commit disables the nfs-ganesha deployment so the CI isn't stuck. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-07-24 14:13:06 +02:00
Dimitri Savineau	a9a1f633a9	tests/dashboard: use the dedicated grafana node The Vagrant dashboard scenario creates a dedicated grafana node but was not use in the ansible inventory. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-07-18 07:22:13 +02:00
Rishabh Dave	f80521f773	tests/functional: add a test for shrink-rbdmirror.yml Add a new functional test that deploys Ceph cluster with three nodes for MON, OSD and RBD Mirror and, then, runs shrink-rbdmirror.yml to test it. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-07-15 11:22:17 +02:00
Rishabh Dave	5c95c34d4b	tests/functional: add a test for shrink-mgr.yml Add a new functional test that deploys a Ceph cluster with three nodes for MON, OSD and MGR and then runs shrink-mgr.yml to test it. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-07-09 14:37:02 +02:00
Rishabh Dave	324b3b4a6c	tests/functional: add a test for shrink-mds.yml Add a new functional test that deploys a Ceph cluster with three nodes for MON, OSD and MDS and then runs shrink-mds.yml to test it. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-07-08 11:05:28 +02:00

1 2 3 4 5 ...

621 Commits (25e078f685cb613bdb9dd6acc70fefe94aad8809)