ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	4f2baaab8c	tests: disable nfs testing nfs-ganesha makes the CI failing because of issue related to SELinux. See: - https://bugzilla.redhat.com/show_bug.cgi?id=1788563 - https://github.com/nfs-ganesha/nfs-ganesha/issues/527 Until we can get this fixed, let's disable nfs-ganesha testing temporarily. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-08 11:13:46 +01:00
Dimitri Savineau	7b3e6b932c	tests/functional: change docker to podman Some docker commands were hardcoded in tests playbooks and some conditions were not taking care of the containerized_deployment variable but only the atomic fact. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-08 11:13:46 +01:00
Guillaume Abrioux	217d95abb2	common: add centos8 support Ceph octopus only supports CentOS 8. This commit adds CentOS 8 support: - update vagrant image in tox configurations. - add CentOS 8 repository for el8 dependencies. - CentOS 8 container engine is podman (same than RHEL 8). - don't use the epel mirror on sepia because it's epel7 only. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> Co-authored-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-08 11:13:46 +01:00
Guillaume Abrioux	40de34fb5e	tests: add filestore_to_bluestore job This commit adds a new job in order to test the filestore-to-bluestore.yml infrastructure playbook. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-12-11 09:04:41 -05:00
Dimitri Savineau	4a6d19dae2	tests: reduce max_mds from 3 to 2 Having max_mds value equals to the number of mds nodes generates a warning in the ceph cluster status: cluster: id: 6d3e49a4-ab4d-4e03-a7d6-58913b8ec00a' health: HEALTH_WARN' insufficient standby MDS daemons available' (...) services: mds: cephfs:3 {0=mds1=up:active,1=mds0=up:active,2=mds2=up:active}' Let's use 2 active and 1 standby mds. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-12-04 14:07:29 -05:00
Dimitri Savineau	3f29b243ea	tests: fix cluster health status The current ceph cluster health is in warning state: health: HEALTH_WARN 13 pool(s) have no replicas configured 2 pool(s) have non-power-of-two pg_num Because we're using only 1 replica then we need to disable the redundancy check. The pool pg num should be a power of two number (like 16). Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-11-27 16:20:17 +01:00
Dimitri Savineau	ef2cb99f73	ceph-osd: add device class to crush rules This adds device class support to crush rules when using the class key in the rule dict via the create-replicated sub command. If the class key isn't specified then we use the create-simple sub command for backward compatibility. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1636508 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-11-14 16:25:46 +01:00
Guillaume Abrioux	db77fbda15	tests: add coverage on purge playbook This commit adds a playbook to be played before we run purge playbook, it first creates an rbd image then map an rbd device on client0 so the purge playbook will try to unmap it. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-11-08 09:06:11 -05:00
Guillaume Abrioux	384161edcd	tests: fix keyring creation in ooo_collocation This commit removes the backslash in allow command parameter, this was needed before the ceph_key module integration. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-10-22 13:45:19 +02:00
Dimitri Savineau	3c2840da03	tests: update container tag for ooo_collocation It doesn't make sense to test the old 3.0.x container images with nautilus+ ceph releases. Also disable the dashboard deployment and switch to bluestore backend. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-10-22 13:45:19 +02:00
Guillaume Abrioux	25b98b2ce3	tests: add multimds coverage This commit makes the all_daemons scenario deploying 3 mds in order to cover the multimds case. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-10-18 13:43:13 -04:00
Dimitri Savineau	2c03c6fcd3	tests: fix the size on the second data LV The commit replaces the pv/vg/lv commands used with the ansible command module by the lvg and lvol modules. This also fixes the size of the second data LV because we were only using 50% of the remaining space instead of 100%. With a 50G device, the result was: - data-lv1 was 25G - data-lv2 was 12.5G Instead of: - data-lv1 was 25G - data-lv2 was 25G Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-10-17 15:49:15 -04:00
Dimitri Savineau	04ec1ad3cc	tests: reduce handler mon and osd delay We don't need to have high handler delay in the CI so reducing to 10 seconds. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-10-09 09:08:20 +02:00
Dimitri Savineau	010158ff84	tests: fix rgw multisite vagrant variables The secondary vagrant variables didn't have the grafana vm variable set which create an vagrant error. There was an error loading a Vagrantfile. The file being loaded and the error message are shown below. This is usually caused by an invalid or undefined variable. This patch also changes the ssh-extra-args parameter to ssh-common-args to get the same values for ssh/sftp/scp. Otherwise we can see warnings from ansible and some tasks are failing. [WARNING]: sftp transfer mechanism failed on [mon0]. Use ANSIBLE_DEBUG=1 to see detailed information It also updates the ssh-common-args value for the rgw-multisite scenario to reflect the ANSIBLE_SSH_ARGS environment variable value. Finally changing the IP addresses due to the Vagrant refact done in the commit `778c51a` Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-10-04 15:12:50 -04:00
Guillaume Abrioux	01f6dd52b3	tests: remove debug log verbosity This was added for debugging purpose. It's generating very large log output, let's remove this now. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-09-28 11:20:49 +02:00
Guillaume Abrioux	5bb6a4da42	tests: set copy_admin_key at group_vars level setting it at extra vars level prevent from setting it per node. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-09-26 11:35:24 +02:00
Guillaume Abrioux	da094ac5ee	tests: do not rely on pg_num to validate rgw_tuning_pools Since the pg_autoscaler has been enabled recently in ceph, this check should stick to validate the requested pools are well created only. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-09-18 14:05:23 +02:00
Dimitri Savineau	825045f6b4	tests: use a single grafana node on podman We don't use multiple grafana nodes for the moment on the others scenarios and I don't think this is supposed to be working. We can often see failure on grafana on that scenario. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-08-28 11:42:48 -04:00
Guillaume Abrioux	05686509f3	tests: update test_mgr_is_up() the data structure has changed in octopus: ``` "mgrmap": { "available": true, "modules": [ "dashboard", "prometheus" ], "num_standbys": 0, "services": { "prometheus": "http://mgr0:9283/" } }, ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-08-14 16:42:02 +02:00
Dimitri Savineau	31bd5e08a6	Revert "tests: disable nfs-ganesha deployment" This reverts commit `83940e624b`. Because nfs-ganesha@master (2.9-dev) build has been fixed by [1] then we can test nfs-ganesha in the CI for master/octopus. [1] https://github.com/ceph/ceph-build/pull/1346 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-08-07 10:40:43 +02:00
Dimitri Savineau	867583d5dd	tests/shrink_rgw: Disable dashboard The shrink_rgw scenario has been merge just after the PR about enable ceph dashboard by default. So right now the shrink_rgw scenrio doesn't have nodes in the grafana group and fails. We just need to set dashboard_enabled to false. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-07-31 14:53:05 -04:00
Guillaume Abrioux	0f620b2584	tests: add more memory in podman job Typical error : ``` fatal: [mon1 -> mon0]: FAILED! => changed=true cmd: - podman - exec - ceph-mon-mon0 - ceph - config - set - mgr - mgr/dashboard/ssl - 'false' delta: '0:00:00.644870' end: '2019-07-30 10:17:32.715639' msg: non-zero return code rc: 1 start: '2019-07-30 10:17:32.070769' stderr: \|- Traceback (most recent call last): File "/usr/bin/ceph", line 140, in <module> import rados ImportError: libceph-common.so.0: cannot map zero-fill pages: Cannot allocate memory Error: exit status 1 stderr_lines: <omitted> stdout: '' stdout_lines: <omitted> ``` Let's add more memory to get around this issue. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-07-30 13:52:44 +02:00
Guillaume Abrioux	d649e00893	tests: deploy dashboard on mons there's no dedicated nodes for mgr, let's use monitor nodes. The mgr0 instance spawned isn't used, so if this node is part of the inventory for this scenario, testinfra will complain because there's no ceph.conf on this node. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-07-30 13:52:44 +02:00
Rishabh Dave	236b081a3a	tests/functional: add a test for shrink-rgw.yml Add a new functional test that deploys a Ceph cluster with three nodes for MON, OSD and RGW and then runs shrink-rgw.yml to test it. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-07-30 08:45:57 +02:00
Guillaume Abrioux	3c2fd337d9	tests: test dashboard deployment with podman scenario This commit adds a grafana-server section in order to test dashboard deployment with podman. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-07-29 14:42:45 +02:00
Guillaume Abrioux	fb1b5b3251	dashboard: enable dashboard by default This commit enables dashboard deployment by default. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1726739 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-07-29 14:42:45 +02:00
Dimitri Savineau	07c6695d16	Remove NBSP characters Some NBSP are still present in the yaml files. Adding a test in travis CI. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-07-26 16:09:23 -04:00
Guillaume Abrioux	83940e624b	tests: disable nfs-ganesha deployment nfs-ganesha repositories @ dev are broken, this commit disables the nfs-ganesha deployment so the CI isn't stuck. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-07-24 14:13:06 +02:00
Dimitri Savineau	a9a1f633a9	tests/dashboard: use the dedicated grafana node The Vagrant dashboard scenario creates a dedicated grafana node but was not use in the ansible inventory. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-07-18 07:22:13 +02:00
Rishabh Dave	f80521f773	tests/functional: add a test for shrink-rbdmirror.yml Add a new functional test that deploys Ceph cluster with three nodes for MON, OSD and RBD Mirror and, then, runs shrink-rbdmirror.yml to test it. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-07-15 11:22:17 +02:00
Rishabh Dave	5c95c34d4b	tests/functional: add a test for shrink-mgr.yml Add a new functional test that deploys a Ceph cluster with three nodes for MON, OSD and MGR and then runs shrink-mgr.yml to test it. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-07-09 14:37:02 +02:00
Rishabh Dave	324b3b4a6c	tests/functional: add a test for shrink-mds.yml Add a new functional test that deploys a Ceph cluster with three nodes for MON, OSD and MDS and then runs shrink-mds.yml to test it. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-07-08 11:05:28 +02:00
Mike Christie	1e64efc2f0	igw: Update tests to use ceph-iscsi package gateway_ip_list is depreciated and is only used when using the old ceph-iscsi-config/cli packages that are no longer being developed (GH repos are archived). Because ceph-iscsi-config/cli is no longer being worked on, this modifies the tests to stress the ceph-iscsi based installs. Signed-off-by: Mike Christie <mchristi@redhat.com>	2019-07-03 22:13:19 +02:00
Mike Christie	b7b2213be1	igw: drop gateway_ip_list for container setups The gateway_ip_list is not used in container setups, so drop it for that case. Signed-off-by: Mike Christie <mchristi@redhat.com>	2019-07-03 22:13:19 +02:00
Guillaume Abrioux	45041f52fd	tests: clean nfs_ganesha variables - clean some leftover. - move nfs_ganesha_[stable\|dev] in group_vars so dev_setup.yml can modify them. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-06-26 08:58:51 +02:00
Guillaume Abrioux	013ae62177	tests: test nfs-ganesha deployment Add back the nfs-ganesha deployment testing which was removed because of broken dependencies. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-06-26 08:58:51 +02:00
Guillaume Abrioux	9201674b5b	tests: deploy nfs-ganesha in container-all_daemons this commit bring back the nfs-ganesha testing in containerized deployment. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-06-24 10:05:11 +02:00
Dimitri Savineau	da8b7ab7fb	remove ceph restapi references The ceph restapi configuration was only available until Luminous release so we don't need those leftovers for nautilus+. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-06-18 09:13:19 +02:00
Guillaume Abrioux	1019e3b3dc	tests: increase docker pull timeout CI is facing issues where docker pull reach the timeout, let's increase this to avoid CI failures. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-06-14 16:23:24 +02:00
Rishabh Dave	67071c3169	align cephfs pool creation The definitions of cephfs pools should match openstack pools. Signed-off-by: Rishabh Dave <ridave@redhat.com> Co-Authored-by: Simone Caronni <simone.caronni@teralytics.net>	2019-06-13 09:44:05 +02:00
Guillaume Abrioux	4cf17a6fdd	iscsi: assign application (rbd) to pool 'rbd' if we don't assign the rbd application tag on this pool, the cluster will get `HEALTH_WARN` state like following: ``` HEALTH_WARN application not enabled on 1 pool(s) POOL_APP_NOT_ENABLED application not enabled on 1 pool(s) application not enabled on pool 'rbd' ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-06-13 07:35:39 +02:00
Guillaume Abrioux	9e4e692c61	tests: remove unused variable `e MGR_DASHBOARD=0` isn't needed anymore here, let's remove this legacy. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-06-12 10:41:01 -04:00
Guillaume Abrioux	8dd774a99b	tests: update docker image tag used in ooo job ceph-ansible@master isn't intended to deploy luminous. Let's use latest-master on ceph-ansible@master branch Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-06-12 10:41:01 -04:00
fmount	069076bbfd	Fix units and add ability to have a dedicated instance Few fixes on systemd unit templates for node_exporter and alertmanager container parameters. Added the ability to use a dedicated instance to deploy the dashboard components (prometheus and grafana). This commit also introduces the grafana_group_name variable to refer grafana group and keep consistency with the other groups. During the integration with TripleO some grafana/prometheus template variables resulted undefined. This commit adds the ability to check if the group exist and create, accordingly, different job groups in prometheus template. Signed-off-by: fmount <fpantano@redhat.com>	2019-06-10 18:18:46 +02:00
L3D	ab54fe20ec	ansible: use 'bool' filter on boolean conditionals By running ceph-ansible there are a lot ``[DEPRECATION WARNING]`` like these: ``` [DEPRECATION WARNING]: evaluating containerized_deployment as a bare variable, this behaviour will go away and you might need to add \|bool to the expression in the future. Also see CONDITIONAL_BARE_VARS configuration toggle.. This feature will be removed in version 2.12. Deprecation warnings can be disabled by setting deprecation_warnings=False in ansible.cfg. ``` Now appended ``\| bool`` on a lot of the affected variables. Sometimes the coding style from ``variable\|bool`` changed to ``variable \| bool`` (with spaces at the pipe). Closes: #4022 Signed-off-by: L3D <l3d@c3woc.de>	2019-06-06 10:21:17 +02:00
Guillaume Abrioux	a78fb209b1	tests: test podman against atomic os instead rhel8 the rhel8 image used is an outdated beta version, it is not worth it to maintain this image upstream, since it's possible to test podman with a newer version of centos/atomic-host image. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-06-04 11:32:41 -04:00
Dimitri Savineau	de147469d7	tests: update testinfra release In order to support ansible 2.8 with testinfra we need to use the latest release (3.0.x). Adding ssh-config option to py.test. Also bumping the pytest and xdist version. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-05-20 13:04:58 +02:00
Guillaume Abrioux	17634fc3df	tests: add dashboard scenario testing This commit add a new scenario to test the dashboard deployment via ceph-ansible. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-05-16 16:39:13 +02:00
Guillaume Abrioux	2798774e96	tests: fix a typo in dev_setup.yml `c907ec41ae` introduced a typo. This commit fixes it. ``` [WARNING]: While constructing a mapping from /home/guits/ceph-ansible/tests/functional/dev_setup.yml, line 21, column 9, found a duplicate dict key (replace). ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-05-15 11:33:26 +02:00
Dimitri Savineau	52b9f3fb28	tox: Refact lvm_osds scenario The current lvm_osds only tests filestore on one OSD node. We also have bs_lvm_osds to test bluestore and encryption. Let's use only one scenario to test filestore/bluestore and with or without dmcrypt on four OSD nodes. Also use validate_dmcrypt_bool_value instead of types.boolean on dmcrypt validation via notario. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-05-09 09:38:20 +02:00

1 2 3 4 5 ...

441 Commits (a09d1c38bf80e412265f58d732c554262ef23cc7)