ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Andrew Schoen	9f68dad2ff	validate: first pass at validating the install options Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-05-18 17:58:24 +02:00
Guillaume Abrioux	a68091c923	tests: update the type for the rule used in pools As of ceph 12.2.5 the type of the parameter `type` is not a name anymore but an id, therefore an `int` is expected otherwise it will fail with the following error Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-04-30 08:15:18 +02:00
Sébastien Han	71efa2eaf4	ci: bump client nodes to 2 In order to test the key distribution is correct we must have 2 client nodes. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-23 18:34:58 +02:00
Sébastien Han	203c9af0ac	ci: test ansible 2.5 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-23 10:17:24 +02:00
Guillaume Abrioux	77831ccb7a	tests: update tests for mds to cover multimds case in case of multimds we must check for the number of mds up instead of just checking if the hostname of the node is in the fsmap. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-04-12 18:20:58 +02:00
Sébastien Han	82589021e0	ci: fix tripleO scenario Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-11 12:18:34 +02:00
Sébastien Han	2011ec3bcd	ci: client copy admin key If we don't copy the admin key we can't add the key into ceph. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-11 12:18:34 +02:00
Sébastien Han	cf73647e7a	ci: remove useless tests These are already handled by ceph-client/defaults/main.yml so the keys will be created once user_config is set to True. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-11 12:18:34 +02:00
Andrew Schoen	98e237d234	tests: no need to remove partitions in lvm_setup.yml Now that we are using ceph_volume_zap the partitions are kept around and should be able to be reused. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-04-10 14:19:21 +02:00
Sébastien Han	f3caee8460	ceph-iscsi: fix certificates generation and distribution Prior to this patch, the certificates where being generated on a single node only (because of the run_once: true). Thus certificates were not distributed on all the gateway nodes. This would require a second ansible run to work. This patches fix the creation and keys's distribution on all the nodes. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1540845 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-04 09:27:39 +02:00
John Fulton	e6e6bd078a	Refer to expected-num-ojects as expected_num_objects, not size Follow up patch to PR 2432 [1] which replaces "size" (sorry if the original bug used that term, which can be confusing) with expected_num_objects as is used in the Ceph documentation [2]. [1] https://github.com/ceph/ceph-ansible/pull/2432/files [2] http://docs.ceph.com/docs/jewel/rados/operations/pools	2018-03-26 15:41:51 +02:00
Sébastien Han	3ab89ab48c	ci: re-arrange group_vars files We should stop putting everything in 'all'. This is too easy and this is error prone as well for those who are separating variables into host type, things that you should do. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Sébastien Han	d5f8cac820	ci: remove left over iscsi_gws file Wrong file that is not used, only iscsi-ggw that is present is correct. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Sébastien Han	8000ae342e	remove unsed ceph_rgw_civetweb_port variable Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Sébastien Han	f119b25bbe	client: implement proper pools creation Just like we did for the monitor and openstack_config we now have the ability to precisely create pools. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Sébastien Han	e302c1baae	mon: add support for erasure code pool You can now specify type: erasure and erasure_profile to use when declaring the pool dictionnary. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Sébastien Han	4806ff4ff8	ci: test pool creation on container On containerized scenario we also want to test pool creation. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Sébastien Han	fc0fa48e0d	test: add tests for creating crush tree We now run tests on the newly created ceph_crush module. Now the CI will create a specific hierarchy for the OSD. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-06 15:24:31 +00:00
Sébastien Han	fd94840a6e	ci: add copy_admin_key test to container scenario Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-02 20:59:10 +00:00
Sébastien Han	165d9dec10	remove kernel.pid_max This is now managed by Ceph packages. See: https://github.com/ceph/ceph/pull/18544/files http://tracker.ceph.com/issues/21929 Closes: https://github.com/ceph/ceph-ansible/issues/2410 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-02-23 13:57:57 +01:00
Guillaume Abrioux	4a8986459f	tests: change ceph_docker_image_tag for 2nd run The ceph-ansible upstream CI runs severals tests, including a 'idempotency/handlers' test. It means the playbook is run a first time and then a second time with an other container image version to ensure the handlers run properly and the containers are well restarted. This can cause issues. For instance, in that specific case which drove me to submit this commit, I've hit the case where `latest` image ships ceph 12.2.3 while the `stable-3.0` (which is the image used for the second run) ships ceph 12.2.2. The goal of this test is not to verify we can upgrade from a specific version to another but to ensure handlers are working even if it's a valid failure here. It should be caught by a test dedicated to that usecase. We just need to have a container image which has a different id for the upstream CI, we need the same content in container imagebut a different image id in the registry since the test relies on image id to decide whether the container should be restarted. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-02-23 13:54:32 +01:00
Guillaume Abrioux	707458c979	ci: add tripleo scenario testing This should help to see earlier any failure in a tripleo deployment scenario. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-02-23 13:54:32 +01:00
Sébastien Han	7d690878df	test: add test for containers resources changes We change the ceph_mon_docker_memory_limit on the second run, this should trigger a restart of services. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-02-14 02:01:29 +01:00
Sébastien Han	79864a8936	test: add test for restart on new container image Since we have a task to test the handlers we can test a new container to validate the service restart on a new container image. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-02-14 02:01:29 +01:00
Guillaume Abrioux	deaf273b25	syntax: change local_action syntax Use a nicer syntax for `local_action` tasks. We used to have oneliner like this: ``` local_action: wait_for port=22 host={{ hostvars[inventory_hostname]['ansible_default_ipv4']['address'] }} state=started delay=10 timeout=500 }} ``` The usual syntax: ``` local_action: module: wait_for port: 22 host: "{{ hostvars[inventory_hostname]['ansible_default_ipv4']['address'] }}" state: started delay: 10 timeout: 500 ``` is nicer and kind of way to keep consistency regarding the whole playbook. This also fix a potential issue about missing quotation : ``` Traceback (most recent call last): File "/tmp/ansible_wQtWsi/ansible_module_command.py", line 213, in <module> main() File "/tmp/ansible_wQtWsi/ansible_module_command.py", line 185, in main rc, out, err = module.run_command(args, executable=executable, use_unsafe_shell=shell, encoding=None, data=stdin) File "/tmp/ansible_wQtWsi/ansible_modlib.zip/ansible/module_utils/basic.py", line 2710, in run_command File "/usr/lib64/python2.7/shlex.py", line 279, in split return list(lex) File "/usr/lib64/python2.7/shlex.py", line 269, in next token = self.get_token() File "/usr/lib64/python2.7/shlex.py", line 96, in get_token raw = self.read_token() File "/usr/lib64/python2.7/shlex.py", line 172, in read_token raise ValueError, "No closing quotation" ValueError: No closing quotation ``` writing `local_action: shell echo {{ fsid }} \| tee {{ fetch_directory }}/ceph_cluster_uuid.conf` can cause trouble because it's complaining with missing quotes, this fix solves this issue. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1510555 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-31 10:45:34 +01:00
Andrew Schoen	cfb75b8e29	tests: remove crush_device_class from lvm tests The --crush-device-class flag for ceph-volume is not available in luminous so lets remove this testing option for now until it's more widely available. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-01-18 15:03:38 +01:00
Andrew Schoen	64f5772140	tests: adds crush_device_class to lvm tests Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-01-17 13:49:29 +01:00
Sébastien Han	39f2bfd5d5	fix jewel scenarios on container When deploying Jewel from master we still need to enable this code since the container image has such check. This check still exists because ceph-disk is not able to create a GPT label on a drive that does not have one. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-12-20 13:43:19 +01:00
Guillaume Abrioux	ab1dd3027a	client: don't try to generate keys the entrypoint to generate users keyring is `ceph-authtool`, therefore, it can expand the `$(ceph-authtool --gen-print-key)` inside the container. Users must generate a keyring themselves. This commit also adds a check to ensure keyring are properly filled when `user_config: true`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-12-14 17:22:07 +01:00
Guillaume Abrioux	aa0b1ed118	tests: remove OSD_FORCE_ZAP variable from tests according to ceph/ceph-container#840, this variable is no longer needed. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-14 17:55:01 +01:00
Sébastien Han	d05206236c	Merge pull request #2124 from ceph/lvm-setup-fix test: when creating the /dev/sdc2 partition specify label as gpt	2017-10-31 16:51:16 +01:00
Andrew Schoen	37a48209cc	test: when creating the /dev/sdc2 partition specify label as gpt ansible==2.4 requires that label be set to gpt, or it will be defaulted to msdos. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-10-31 09:38:47 -05:00
Guillaume Abrioux	c28882c1cd	tests: add missing test for rbd Add a missing test `test_rbd_mirror_service_is_running_from_luminous()`. Also using bash -c "<cmd>" to make testinfra aware that later in the upgrade process we are now running `luminous` ceph release so we must skip the rbd tests related to `jewel` ceph release. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-30 19:44:56 +01:00
Guillaume Abrioux	97b1cb0258	tests: followup on testing against ansible2.4 ceph-ansible is now being testing against ansible2.2 and ansible2.4. We need to update tox.ini so we use the right version of testinfra regarding which ansible version we are using. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-30 16:40:39 +01:00
Sébastien Han	b3c9de90f4	Merge pull request #2090 from ceph/ansible-2.4 [skip ci] Test ansible 2.4.1	2017-10-27 17:39:05 +02:00
Sébastien Han	faccd0acf0	Merge pull request #2100 from ceph/lvm-bluestore ceph-volume lvm bluestore support	2017-10-27 17:36:16 +02:00
Sébastien Han	c4ad247718	Test ansible 2.4.1 We now test with Ansible 2.4. We had to change testinfra's version since only recent versions work with 2.4. See: https://github.com/philpep/testinfra/issues/249 Closes: https://github.com/ceph/ceph-ansible/issues/2087 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-27 15:20:13 +02:00
Major Hayden	f73232caa4	Use check_mode instead of always_run This patch changes the `always_run: yes` task option to `check_mode: no` to avoid Ansible warnings.	2017-10-25 09:53:34 -05:00
Major Hayden	c2b5118c1b	Revert "Avoid deprecated always_run" This reverts commit `620fb37dd4`.	2017-10-25 09:48:09 -05:00
Alfredo Deza	027d57dd29	tests create a bluestore osd scenario Signed-off-by: Alfredo Deza <adeza@redhat.com>	2017-10-25 06:46:39 -04:00
Sébastien Han	a53aa9e8b4	ci: new osd scenarios This commit add new osd scenarios, it aims to simplify the CI setup and brings a better coverage on the OSD scenarios. We decided to differentiate between filestore and bluestore, thinking ahead when filestore won't be supported anymore. So we now have two classes of tests: * Filestore * Bluestore In each of those classes we have container and non-container. Then for each we test the following: * collocated * collocated dmcrypt * non-collocated * non-collocated dmcrypt * auto discovery collocated * auto discovery collocated dmcrypt This gives us a nice coverage and also reduces the footprint on the CI. We are now up to 4 scenarios, each containing 6 OSD VMs. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-18 09:26:06 +02:00
Guillaume Abrioux	7ee9aa94b5	Merge pull request #1963 from ceph/pull-in-para site-docker.yml try to fetch images in //	2017-10-13 19:35:11 +02:00
Sébastien Han	71d819620c	mds: fix fs pool creation 1. add the variables to docker_collocation 2. trigger the check when a MDS is part of the inventory file, not when we run on an MDS... Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-13 16:03:04 +02:00
Sébastien Han	90ce4276ca	ci: use a container client VM The client won't run on centos7 anymore but on Atomic host just like the rest of the daemons. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-13 15:26:03 +02:00
Sébastien Han	3e058bff06	ci: reboot with ansible instead of vagrant reload vagrant is serialized and takes a lot of time compare to simple reboot. See the benchmarks below for 3 VMs: [leseb@rick docker]$ time ANSIBLE_SSH_ARGS="-F /home/leseb/reproduce-ci/tmp.zgGC7d5mIC/build/workspace/ceph-ansible/tests/functional/centos/7/docker/vagrant_ssh_config" ansible-playbook -i /home/leseb/reproduce-ci/tmp.zgGC7d5mIC/build/workspace/ceph-ansible/tests/functional/centos/7/docker/hosts reboot.yml PLAY [mons] ************************************************************************************************************************************************************************************************** TASK [Gathering Facts] ************************************************************************************************************************************************************************************* ok: [mon1] ok: [mon2] ok: [mon0] TASK [restart machine] ************************************************************************************************************************************************************************************* changed: [mon2] changed: [mon1] changed: [mon0] TASK [wait for server to boot] ***************************************************************************************************************************************************************************** ok: [mon2 -> localhost] ok: [mon0 -> localhost] ok: [mon1 -> localhost] TASK [uptime] ********************************************************************************************************************************************************************************************** changed: [mon2] changed: [mon0] changed: [mon1] PLAY RECAP *************************************************************************************************************************************************************************************************** mon0 : ok=4 changed=2 unreachable=0 failed=0 mon1 : ok=4 changed=2 unreachable=0 failed=0 mon2 : ok=4 changed=2 unreachable=0 failed=0 real 0m35.112s user 0m5.737s sys 0m1.849s [leseb@rick docker]$ time vagrant reload ==> mon0: Halting domain... ==> mon0: Starting domain. ==> mon0: Waiting for domain to get an IP address... ==> mon0: Waiting for SSH to become available... ==> mon0: Creating shared folders metadata... ==> mon0: Rsyncing folder: /home/leseb/reproduce-ci/tmp.zgGC7d5mIC/build/workspace/ceph-ansible/tests/functional/centos/7/docker/ => /home/vagrant/sync ==> mon0: Machine already provisioned. Run `vagrant provision` or use the `--provision` ==> mon0: flag to force provisioning. Provisioners marked to run always will still run. ==> mon1: Halting domain... ==> mon1: Starting domain. ==> mon1: Waiting for domain to get an IP address... ==> mon1: Waiting for SSH to become available... ==> mon1: Creating shared folders metadata... ==> mon1: Rsyncing folder: /home/leseb/reproduce-ci/tmp.zgGC7d5mIC/build/workspace/ceph-ansible/tests/functional/centos/7/docker/ => /home/vagrant/sync ==> mon1: Machine already provisioned. Run `vagrant provision` or use the `--provision` ==> mon1: flag to force provisioning. Provisioners marked to run always will still run. ==> mon2: Halting domain... ==> mon2: Starting domain. ==> mon2: Waiting for domain to get an IP address... ==> mon2: Waiting for SSH to become available... ==> mon2: Creating shared folders metadata... ==> mon2: Rsyncing folder: /home/leseb/reproduce-ci/tmp.zgGC7d5mIC/build/workspace/ceph-ansible/tests/functional/centos/7/docker/ => /home/vagrant/sync ==> mon2: Machine already provisioned. Run `vagrant provision` or use the `--provision` ==> mon2: flag to force provisioning. Provisioners marked to run always will still run. real 1m31.850s user 0m7.387s sys 0m0.796s Reboot via Ansible: 0m35.112s Reboot via vagrant: 1m31.850s We save 1/3 time. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-13 09:04:26 +02:00
Guillaume Abrioux	17623a2157	Merge pull request #2036 from ceph/cephfs-pool mds: precisely define cephfs pool	2017-10-12 17:47:10 +02:00
Sébastien Han	6bd152d555	Merge pull request #2037 from major/remove-always-run Avoid deprecated always_run	2017-10-12 17:15:28 +02:00
Sébastien Han	b49f9bda21	mds: precisely define cephfs pool We now have a variable called ceph_pools that is mandatory when deploying a MDS. It's a dictionnary that contains a pool name and a PG count. PG count is mandatory and must be set, the playbook will fail otherwise. Closes: https://github.com/ceph/ceph-ansible/issues/2017 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-12 15:56:04 +02:00
Major Hayden	620fb37dd4	Avoid deprecated always_run The `always_run` key is deprecated and being removed in Ansible 2.4. Using it causes a warning to be displayed: [DEPRECATION WARNING]: always_run is deprecated. This patch changes all instances of `always_run` to use the `always` tag, which causes the task to run each time the playbook runs.	2017-10-12 08:29:44 -05:00
Guillaume Abrioux	a179e312fd	tests: add missing override for collocation scenario Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-12 14:43:25 +02:00

1 2 3 4 5 ...

298 Commits (4baa8389e03d9014a44f53137207b9560546511e)