ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	c90b0985e5	tests: switch to quay.ceph.io for dashboard images for some reason, `quay.io/app-sre/grafana` no longer exist. as a workaround, all dashboard related images have been mirrored on quay.ceph.io. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-03-23 19:58:27 +01:00
Guillaume Abrioux	b8080bac41	tests: fix `test_rgw_is_up` test The data structure seems to have been modified in ceph@master (quincy). This commit update the test accordingly. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-03-23 19:58:27 +01:00
Guillaume Abrioux	7e1db0b599	tests: fix `test_nfs_is_up` test the data structure seems to have been modified in ceph@master (quincy). This commit update the test accordingly. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-03-23 19:58:27 +01:00
Guillaume Abrioux	ee1f0ce444	Revert "tests: disable nfs testing on master" This reverts commit `8372b6792f`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-03-17 13:42:20 +01:00
Guillaume Abrioux	49668378fb	tests: remove 1 client VM in external_clients job We only use 2 client in this scenario, there's no need to fire up a third VM. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-03-16 19:38:04 +01:00
Guillaume Abrioux	8372b6792f	tests: disable nfs testing on master nfs-ganesha builds in shaman are broken. This commit disables nfs-ganesha testing in order to unlock the CI. This is a temporary commit intented to be reverted. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-03-16 19:38:04 +01:00
Alex Schultz	a7f2fa73e6	Use ansible_facts It has come to our attention that using ansible_* vars that are populated with INJECT_FACTS_AS_VARS=True is not very performant. In order to be able to support setting that to off, we need to update the references to use ansible_facts[<thing>] instead of ansible_<thing>. Related: ansible#73654 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1935406 Signed-off-by: Alex Schultz <aschultz@redhat.com>	2021-03-08 20:54:02 +01:00
Guillaume Abrioux	682116023d	tests: increase `mon_max_pg_per_osd` we aren't deploying enough OSD daemon, so it fails like following: ``` stderr: 'Error ERANGE: pool id 10 pg_num 256 size 2 would mean 1536 total pgs, which exceeds max 1500 (mon_max_pg_per_osd 250 * num_in_osds 6)' ``` Let's increase the value of `mon_max_pg_per_osd` in order to get around this issue in the CI. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-02-11 16:35:55 +01:00
Guillaume Abrioux	54bae480d2	tests: set `mon_max_pg_per_osd` in rgw_multisite Otherwise, the job fails when it tries to create a bucket with `s3cmd mb` command because we have too many PGs per OSD. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-02-10 07:01:21 +01:00
Guillaume Abrioux	7c9063b1d2	tests: use lvm batch on osd2 (all_daemons) in order to test lvm batch in purge scenario. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-02-02 17:24:17 +01:00
Guillaume Abrioux	71a5e666e3	rgw: multisite refact Add the possibility to deploy rgw multisite configuration with a mix of secondary and primary zones on a same rgw node. Before that, on a same node, all instances were either primary zones OR secondary. Now you can define a rgw instance like following: ``` rgw_instances: - instance_name: 'rgw0' rgw_zonemaster: false rgw_zonesecondary: true rgw_zonegroupmaster: false rgw_realm: 'france' rgw_zonegroup: 'zonegroup-france' rgw_zone: paris-00 radosgw_address: "{{ _radosgw_address }}" radosgw_frontend_port: 8080 rgw_zone_user: jacques.chirac rgw_zone_user_display_name: "Jacques Chirac" system_access_key: P9Eb6S8XNyo4dtZZUUMy system_secret_key: qqHCUtfdNnpHq3PZRHW5un9l0bEBM812Uhow0XfB endpoint: http://192.168.101.12:8080 ``` Basically it's now possible to define `rgw_zonemaster`, `rgw_zonesecondary` and `rgw_zonegroupmaster` at the intsance level instead of the whole node level. Also, this commit adds an option `deploy_secondary_zones` (default True) which can be set to `False` in order to explicitly ask the playbook to not deploy secondary zones in case where the corresponding endpoint are not deployed yet. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1915478 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-01-27 15:46:43 +01:00
Dimitri Savineau	bbcad9609c	grafana: update container tag to 6.7.4 This update the grafana container tag to 6.7.4. The RHCS version is now based on the RHCS 5 container image which is also based on 6.7.4. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2021-01-27 15:08:31 +01:00
Guillaume Abrioux	41314f49bf	Revert "tests: temporarily use david's flavor" This reverts commit `ed9f0641ee`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-01-11 15:50:55 +01:00
Guillaume Abrioux	ed9f0641ee	tests: temporarily use david's flavor master nfs ganesha builds are broken, let's use this flavor instead for now. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-01-07 17:09:46 -05:00
Guillaume Abrioux	e2ea403d5e	tests: rgw_multisite playbook test refactor Currently we create an object from the primary sites but we try to read that object still from the master which doesn't make sense, we should try to read it from a secondary site. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-12-14 17:53:21 +01:00
Dimitri Savineau	3b9cdc8502	tests: remove pyyaml workaround on OSD nodes Since [1] has been resolved then we don't need to apply this workaround anymore. [1] https://tracker.ceph.com/issues/46759 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-12-10 09:15:03 +01:00
Dimitri Savineau	c3ed124d31	library: add cephadm_bootstrap module This adds cephadm_bootstrap ansible module for replacing the command module usage with the cephadm bootstrap command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-12-01 10:30:05 +01:00
Guillaume Abrioux	e5713ea5d5	tests: change cephfs pool size `all_daemons` scenario can't handle pools with `size: 3` because we have 1 osd node in root=HDD and two nodes in root=default. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-06 09:23:52 -04:00
Guillaume Abrioux	8596f1d52c	flake8: fix pep8 syntax on tests/functional/tests/ tests/conftest.py and tests present in tests/functional/tests/ has been missed from previous commit Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-06 08:00:06 +02:00
Guillaume Abrioux	f83f798206	tests: reboot and test idempotency on collocation test reboot and idempotency on collocation scenario. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-06 07:38:44 +02:00
Guillaume Abrioux	876b4ad248	tests: remove ooo_collocation job This job is redundant with 'collocation' job. The only difference is osd/rgw collocation so let's add this usecase in 'collocation'. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit 19d683d7acfb5344b38ac1ba4c123dcdd4d80f35)	2020-10-04 11:19:15 +02:00
Dimitri Savineau	246e31c0d3	Revert "tests: disable nfs-ganesha testing" This reverts commit `7348e9a253`. Since the nfs-ganesha rpm build for CentOS 8 has been fixed, and the nfs-ganesha segfault caused by an issue in librgw has also been fixed. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-10-02 07:43:07 +02:00
Guillaume Abrioux	c101cb3931	defaults: change defaults value this commit changes defaults value in default pool definitions. there's no need to define `pg_num`, `pgp_num`, `size` and `min_size`, `ceph_pool` module will use the current default if needed. This also drops the 3 following `set_fact` in `ceph-facts`: - osd_pool_default_pg_num, - osd_pool_default_pgp_num, - osd_pool_default_size_num Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-02 07:42:40 +02:00
Guillaume Abrioux	eefe11d90c	defaults: change default grafana-server name This change default value of grafana-server group name. Adding some tasks in ceph-defaults in order to keep backward compatibility. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-09-29 07:42:26 +02:00
Dimitri Savineau	e11453c6f5	Remove unused centos docker tasks The `enable extras on centos` task just doesn't work when using the variable ceph_docker_enable_centos_extra_repo to true. fatal: [xxx]; FAILED! => {"changed": false, "msg": "Parameter 'baseurl', 'metalink' or 'mirrorlist' is required."} The CentOS extras repository is enabled by default so it's pretty safe to remove this task and the associated variable. This also removes the ceph_docker_on_openstack variable as it's a leftover and it is unused. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-29 07:35:10 +02:00
Dimitri Savineau	7dfa205610	tests: disable container nfs testing Looks like nfs-ganesha 3.3 and 4.-dev doesn't work with recent changes in librgw 16.0.0. The nfs-ganesha daemon is segfaulting and restart in a loop. See https://tracker.ceph.com/issues/47520 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-17 16:51:33 -04:00
Dimitri Savineau	78cb9f44bd	tests: add quay registry for collocation baremetal Even if the non containerized collocation scenario deploys ceph with RPMs then we also deploy the dashboard/monitoring but with containers. This requires to set the registry variable to ceph's quay. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-10 14:23:21 -04:00
Dimitri Savineau	98c9afceb9	tests: use grafana from quay.io This changes the grafana container image regitry from docker.io to quay.io to avoid rate limit. This also adds the missing container image values for docker2podman and podman scenarios. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-09 10:35:02 -04:00
Guillaume Abrioux	657e6c8c3b	tests: clean legacy clean some legacies since quay.ceph.io migration Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-09-09 14:42:41 +02:00
Guillaume Abrioux	7348e9a253	tests: disable nfs-ganesha testing This commit diables nfs-ganesha testing on master for non-containerized deployment because the dev repos are broken at the moment. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-09-07 12:54:29 +02:00
Guillaume Abrioux	2cbb7de3b2	tests: migrate to quay.ceph.io registry in order to avoid docker.io rate limiting Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-09-07 12:54:29 +02:00
Dimitri Savineau	4f308dcf4a	tests: reenable ceph-iscsi testing This re-adds the ceph-iscsi testing for both non containerized and containerized deployment since the rados connection error on ceph dev has been fixed [1]. [1] https://tracker.ceph.com/issues/47002 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-08-27 11:13:36 -04:00
Dimitri Savineau	6c11695fbe	tests: reenable nfs-ganesha testing This re-adds the nfs-ganesha testing in non containerized deployment. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-08-20 16:58:54 +02:00
Guillaume Abrioux	8476beb5b1	tests: move erasure pool testing in lvm_osds This commit moves the erasure pool creation testing from `all_daemons` to `lvm_osds` so we can decrease the number of osd nodes we spawn so the OVH Jenkins slaves aren't less overwhelmed when a `all_daemons` based scenario is being tested. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-08-20 11:50:28 +02:00
Guillaume Abrioux	093e1dcb21	tests: remove hosts-ubuntu inventories Since we've dropped ubuntu testing, we don't need these inventories anymore. Let's remove this leftover. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-08-18 11:20:48 +02:00
Guillaume Abrioux	bd9e126357	tests: disable iscsigw testing (container) Temporarily disable iscsigw testing for containerized deployments because it's broken upstream on ceph@master. non-containerized deployments use stable build for iscsigw to get around this issue. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-08-18 11:20:48 +02:00
Guillaume Abrioux	e256d8e948	tests: test iscsigw against stable Since it is broken at the moment with dev repos, let's test against stable builds so the CI is unlocked. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-08-13 09:49:00 +02:00
Guillaume Abrioux	5df6225ede	tests: change subnet in lvm_osds container scenario This commit changes the subnets in container-lvm_osds scenario. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-08-04 14:00:05 +02:00
Guillaume Abrioux	7efea219d6	tests: refact shrink_osd scenario This adds more coverage on the shrink_osd scenario. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-08-03 14:46:56 +02:00
Dimitri Savineau	891234668e	tests: install pyyaml on osd nodes Due to [1], ceph-volume has now a dependency on pyyaml but it's not installed by default via the package dependency. This patch only add the required package on non containerized deployment and as temporary workaround for the CI. [1] https://tracker.ceph.com/issues/46759 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-07-29 12:49:15 -04:00
Guillaume Abrioux	8ef9fb68bc	tests: lvm_setup.yml, add carriage return This commit adds crlf between each task. It makes the playbook more readable. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-07-22 14:25:45 +02:00
Guillaume Abrioux	218f4ae361	tests: (lvm_setup.yml), don't shrink lvol when rerunning lvm_setup.yml on existing cluster with OSDs already deployed, it fails like following: ``` fatal: [osd0]: FAILED! => changed=false msg: Sorry, no shrinking of data-lv2 to 0 permitted. ``` because we are asking `lvol` module to create a volume on an empty VG with size extents = `100%FREE`. The default behavior of `lvol` is to shrink the volume if the LV's current size is greater than the requested size. Given the requested size is calculated like this: `size_requested = size_percent * this_vg['free'] / 100` in our case, it is similar to: `size_requested = 100 * 0 / 100` which basically means `0` So the current LV size is well greater than the requested size which leads the module to attempt to shrink it to 0 which isn't obviously now allowed. Adding `shrink: false` to the module calls fixes this issue. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-07-22 14:25:45 +02:00
Guillaume Abrioux	9d2f2108e1	ceph-crash: introduce new role ceph-crash This commit introduces a new role `ceph-crash` in order to deploy everything needed for the ceph-crash daemon. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-07-21 20:22:12 +02:00
Dimitri Savineau	957903d561	cephadm: add playbook This adds a new playbook for deploying ceph via cephadm. This also adds a new dedicated tox file for CI purpose. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-07-16 11:40:45 -04:00
Dimitri Savineau	fc599ed9f5	tests: remove nfs_ganesha_stable_branch variable We don't need to override this variable in the group_vars but use the default value instead. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-07-06 16:58:59 +02:00
Guillaume Abrioux	5b6f5486f7	tests: update nfs-ganesha to V3.3-stable not really needed in master, commit intended to be backported in octopus branch. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-07-05 17:10:40 +02:00
Dimitri Savineau	829990e60d	ceph-osd: remove ceph-osd-run.sh script Since we only have one scenario since nautilus then we can just move the container start command from ceph-osd-run.sh to the systemd unit service. As a result, the ceph-osd-run.sh.j2 template and the ceph_osd_docker_run_script_path variable are removed. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-06-18 17:51:13 +02:00
Jan Fajerski	1fe8e819f9	lvm_setup: lookup device from inventory, default to /dev/sd* names This fixes a long standing fail in ceph-volumes lvm test suite. Otherwise the default behaviour should not change. Signed-off-by: Jan Fajerski <jfajerski@suse.com>	2020-06-16 18:17:34 +02:00
Guillaume Abrioux	83faf94351	tests: update pools definitions setting attributes with empty string is a bad user input. Also, removing `rule_name` attribute when creating a code erasure pool. (this rule isnt intended for code erasure pool type). Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-16 07:31:57 +02:00
Dimitri Savineau	252e78b4e4	docker2podman: manage dashboard nodes The dashboard nodes (alertmanager, grafana, node-exporter, and prometheus) were not manage during the docker to podman migration. This adds the systemd container template of those services to a dedicated file (systemd.yml) in order to include it in the docker2podman playbook. This also adds the dashboard container images pull from docker to podman. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1829389 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-05-13 12:02:00 +02:00

1 2 3 4 5 ...

517 Commits (c90b0985e50cd9a4a0160d0707af544f319a7fe8)