ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	741ef74629	update: fix a typo `hostvars[groups[mon_host]]['ansible_hostname']` seems to be a typo. That should be `hostvars[mon_host]['ansible_hostname']` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `7c99b6df6d`)	2018-11-26 19:36:30 +00:00
Guillaume Abrioux	9022f83450	rolling_update: refact set_fact `mon_host` each monitor node should select another monitor which isn't itself. Otherwise, one node in the monitor group won't set this fact and causes failure. Typical error: ``` TASK [create potentially missing keys (rbd and rbd-mirror) when mon is containerized] * task path: /home/jenkins-build/build/workspace/ceph-ansible-prs-dev-update_docker_cluster/rolling_update.yml:200 Thursday 22 November 2018 14:02:30 +0000 (0:00:07.493) 0:02:50.005 *** fatal: [mon1]: FAILED! => {} MSG: The task includes an option with an undefined variable. The error was: 'dict object' has no attribute u'mon2' ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `af78173584`)	2018-11-26 19:36:30 +00:00
Sébastien Han	5c9aa5ed66	rolling_update: create rbd and rbd-mirror keyrings During an upgrade, Ceph won't create keys that did not exist in the previous version. So after an upgrade from, say, Jewel to Luminous, once all the monitors have the new version they should get or create the keys. It's OK for the task to fail, especially for the rbd-mirror key, which only appears in Nautilus. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1650572 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `4e267bee4f`)	2018-11-26 19:36:30 +00:00
Sébastien Han	46a2701b5e	ceph_key: add a get_key function When checking if a key exists we also have to ensure that the key exists on the filesystem, the key can change on Ceph but still have an outdated version on the filesystem. This solves this issue. Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `691f373543`)	2018-11-26 19:36:30 +00:00
Jairo Llopis	a5aca6ebbc	Fix problem with ceph_key in python3 Pretty basic problem of iteritems removal. Signed-off-by: Jairo Llopis <yajo.sk8@gmail.com> (cherry picked from commit `fc20973c2b`)	2018-10-26 16:23:34 +02:00
Guillaume Abrioux	10403b76e3	tox: fix a typo the line setting `ANSIBLE_CONFIG` obviously contains a typo introduced by `1e283bf69b` `ANSIBLE_CONFIG` has to point to a path only (path to an ansible.cfg) Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `a0cceb3e44`)	2018-10-26 16:22:46 +02:00
Sébastien Han	d814644c4a	rolling_update: fix upgrade when using fqdn Clusters that were deployed using 'mon_use_fqdn' have a different unit name, so during the upgrade this must be used; otherwise the upgrade will fail, looking for a unit that does not exist. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1597516 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `44d0da0dd4`)	2018-10-24 12:42:14 +00:00
Guillaume Abrioux	7c9699ad51	tests: do not install lvm2 on atomic host we need to detect whether we are running on atomic host to not try to install lvm2 package. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `d2ca24eca8`)	2018-10-16 14:35:08 +02:00
Alfredo Deza	f4a5551bfd	tests: install lvm2 before setting up ceph-volume/LVM tests Signed-off-by: Alfredo Deza <adeza@redhat.com> (cherry picked from commit `3e488e8298`)	2018-10-16 14:35:08 +02:00
Noah Watkins	e089f46607	Stringify ceph_docker_image_tag This could be a numeric input, but is treated like a string leading to runtime errors. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1635823 Signed-off-by: Noah Watkins <nwatkins@redhat.com> (cherry picked from commit `8dcc8d1434`)	2018-10-16 14:35:08 +02:00
Noah Watkins	75c9130865	Avoid using tests as filter Fixes the deprecation warning: [DEPRECATION WARNING]: Using tests as filters is deprecated. Instead of using `result|search` use `result is search`. Signed-off-by: Noah Watkins <nwatkins@redhat.com> (cherry picked from commit `306e308f13`)	2018-10-16 14:35:08 +02:00
Andy McCrae	ee1b6dd83c	Sync config_template with upstream for Ansible 2.6 The original_basename option in the copy module changed to be _original_basename in Ansible 2.6+, this PR resyncs the config_template module to allow this to work with both Ansible 2.6+ and before. Additionally, this PR removes the _v1_config_template.py file, since ceph-ansible no longer supports versions of Ansible before version 2, and so we shouldn't continue to carry that code. Closes: #2843 Signed-off-by: Andy McCrae <andy.mccrae@gmail.com> (cherry picked from commit `a1b3d5b7c3`)	2018-10-15 22:00:35 +00:00
Sébastien Han	d0b03f6faa	switch: copy initial mon keyring We need to copy this key into /etc/ceph so when ceph-docker-common runs it can fetch it to the ansible server. Previously the task wasn't failing because `fail_on_missing` was False before 2.5; it is now True, hence the failure. Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `bae0f41705`)	2018-10-15 13:59:21 +02:00
Guillaume Abrioux	da05c1fd31	switch: support migration when cluster is scrubbing Similar to `c13a3c3` we must allow scrubbing when running this playbook. In a cluster with a large number of PGs, some of them can be expected to be scrubbing; it's a normal operation. Preventing scrubbing would force setting the noscrub flag. This commit allows switching from a non-containerized to a containerized environment even while PGs are scrubbing. Closes: #3182 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `54b02fe187`)	2018-10-15 13:59:21 +02:00
Guillaume Abrioux	75c2b83e43	defaults: fix osd containers handler `ceph_osd_container_stat` might not be set on other osd node. We must ensure we are on the last node before trying to evaluate `ceph_osd_container_stat`. This should have been backported but it's part of a too important refact in master that can't be backported. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-15 10:33:56 +02:00
Sébastien Han	513608cebe	switch: allow switch big clusters (more than 99 osds) The current regex had a limitation of 99 OSDs; this limit has now been removed, and regardless of the number of OSDs they will all be collected. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1630430 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `9fccffa1ca`) (cherry picked from commit `d5e57af23d`)	2018-10-15 10:33:56 +02:00
Guillaume Abrioux	4e4184e579	defaults: fix osd handlers that are never triggered `run_once: true` + `inventory_hostname == groups.get(osd_group_name) | last` is a bad combination since if the only node being run isn't the last, the task will definitely be skipped. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-03 14:09:39 +00:00
Guillaume Abrioux	ba6c3a8e6b	config: look up for monitor_address_block in hostvars `monitor_address_block` should be read from hostvars[host] instead of the current node being played. E.g., let's assume we have: ``` [mons] ceph-mon0 monitor_address=192.168.1.10 ceph-mon1 monitor_interface=eth1 ceph-mon2 monitor_address_block=192.168.1.0/24 ``` The ceph.conf generation task will end up with: ``` fatal: [ceph-mon0]: FAILED! => {} MSG: 'ansible.vars.hostvars.HostVarsVars object' has no attribute u'ansible_interface' ``` The reason is that it will assume `monitor_address_block` isn't defined even on ceph-mon2, because it looks for `monitor_address_block` instead of `hostvars[host]['monitor_address_block']`; it therefore falls through to the default branch of the condition: ``` {%- else -%} {% set interface = 'ansible_' + (monitor_interface | replace('-', '_')) %} {% if ip_version == 'ipv4' -%} {{ hostvars[host][interface][ip_version]['address'] }} {%- elif ip_version == 'ipv6' -%} [{{ hostvars[host][interface][ip_version][0]['address'] }}] {%- endif %} {%- endif %} ``` `monitor_interface` is set with the default value `'interface'`, so the `interface` variable is built with 'ansible_' + 'interface'. This makes Ansible throw a confusing message about `'ansible_interface'`. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1635303 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `6130bc841d`)	2018-10-02 21:54:09 +00:00
Guillaume Abrioux	79a5725cf6	purge: actually remove of /var/lib/ceph/* `38dc20e74b` introduced a bug in the purge playbooks because using `` in `command` module doesn't work. `/var/lib/ceph/` files are not purged it means there is a leftover. When trying to redeploy a cluster, it failed because monitor daemon was detecting existing keyring, therefore, it assumed a cluster already existed. Typical error (from container output): ``` Sep 26 13:18:16 mon0 docker[31316]: 2018-09-26 13:18:16 /entrypoint.sh: Existing mon, trying to rejoin cluster... Sep 26 13:18:16 mon0 docker[31316]: 2018-09-26 13:18:16.9323937f15b0d74700 -1 auth: unable to find a keyring on /etc/ceph/test.client.admin.keyring,/etc/ceph/test.keyring,/etc/ceph/keyring,/etc/ceph/keyring.bin,:(2) No such file or directory Sep 26 13:18:23 mon0 docker[31316]: 2018-09-26 13:18:23 /entrypoint.sh: SUCCESS ``` Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1633563 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `144c92b21f`)	2018-09-27 21:42:43 +02:00
Matthew Vernon	0bb13cff08	restart_osd_daemon.sh.j2 - use `+` rather than `{1,}` in regex `+` is more idiomatic for "one or more" in a regex than `{1,}`; the latter was introduced in a previous fix for an incorrect `{1,2}` restriction. Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk> (cherry picked from commit `806461ac6e`)	2018-09-26 21:38:36 +00:00
Matthew Vernon	d701c192e0	restart_osd_daemon.sh.j2 - consider active+clean+* pgs as OK After restarting each OSD, restart_osd_daemon.sh checks that the cluster is in a good state before moving on to the next one. One of the checks it does is that the number of pgs in the state "active+clean" is equal to the total number of pgs in the cluster. On large clusters (e.g. we have 173,696 pgs), it is likely that at least one pg will be scrubbing and/or deep-scrubbing at any one time. These pgs are in state "active+clean+scrubbing" or "active+clean+scrubbing+deep", so the script was erroneously not including them in the "good" count. Similar concerns apply to "active+clean+snaptrim" and "active+clean+snaptrim_wait". Fix this by considering as good any pg whose state contains active+clean. Do this as an integer comparison to num_pgs in pgmap. (could this be backported to at least stable-3.0 please?) Closes: #2008 Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk> (cherry picked from commit `04f4991648`)	2018-09-26 21:38:36 +00:00
Guillaume Abrioux	fdc2d7681d	rolling_update: ensure pgs_by_state has at least 1 entry Previous commit `c13a3c3` has removed a condition. This commit brings back this condition which is essential to ensure we won't hit a false positive result in the `when` condition for the check PGs task. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `179c4d00d7`)	2018-09-26 10:58:51 +00:00
Guillaume Abrioux	f008f40628	upgrade: consider all 'active+clean' states as valid pgs In a cluster with a large number of PGs, some of them can be expected to be scrubbing; it's a normal operation. Preventing scrubbing forces setting the noscrub flag before a rolling update, which is a problem because it pauses an important data integrity operation until the end of the rolling upgrade. This commit allows an upgrade even while PGs are scrubbing. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1616066 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `c13a3c3492`)	2018-09-25 14:13:16 +00:00
Giulio Fidente	7d2a13f8c7	Fix version check in ceph.conf template We need to look for ceph_release when comparing with release names, not ceph_version. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1631789 Signed-off-by: Giulio Fidente <gfidente@redhat.com> (cherry picked from commit `6126210e0e`)	2018-09-24 12:32:32 +00:00
Matthew Vernon	93bc69e81e	restart_osd_daemon.sh.j2 - Reset RETRIES between calls of check_pgs Previously RETRIES was set (by default to 40) once at the start of the script; this meant that it would only ever wait for up to 40 lots of 30s across all the OSDs on a host before bombing out. In fact, we want to be prepared to wait for the same amount of time after each OSD restart for the clusters' pgs to be happy again before continuing. Closes: #3154 Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk> (cherry picked from commit `aa97ecf048`)	2018-09-24 11:13:21 +00:00
Guillaume Abrioux	4ce11a8493	config: set default _rgw_hostname value to respective host The default value for _rgw_hostname was taken from the current node being played, while it should be taken from the respective node in the loop. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1622505 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `6d6fd514e0`)	2018-09-18 19:27:50 +00:00
Guillaume Abrioux	4e06db845e	tests: followup on `b89cc1746f` Update network subnets in group_vars/all Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `0a88bccf87`)	2018-09-14 18:22:12 +00:00
Guillaume Abrioux	2975387373	shrink-osd: fix purge osd on containerized deployment `ce1dd8d` introduced the purge osd on containers but it was incorrect. `resolve parent device` and `zap ceph osd disks` tasks must be delegated to their respective OSD nodes. Indeed, they were run on the ansible node, it means it was trying to resolve parent devices from this node where it should be done on OSD nodes. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1612095 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `4159326a18`)	2018-09-14 18:22:12 +00:00
Guillaume Abrioux	5fca5dda8f	tests: fix monitor_address for shrink_osd scenario `b89cc1746` introduced a typo. This commit fixes it Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `3382c5226c`)	2018-09-14 14:52:40 +02:00
Guillaume Abrioux	0e86587197	nfs: ignore error on semanage command for ganesha_t As of RHEL 7.6, it has been decided it doesn't make sense to confine `ganesha_t` anymore. It means this domain won't exist anymore. Let's add a `failed_when: false` so that the deployment does not fail when trying to run this command. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1626070 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `a6f77340fd`)	2018-09-13 13:28:47 +00:00
Guillaume Abrioux	44fa0863fc	tests: pin sphinx version to 1.7.9 using sphinx 1.8.0 breaks our doc test CI job. Typical error: ``` Exception occurred: File "/home/jenkins-build/build/workspace/ceph-ansible-docs-pull-requests/docs/.tox/docs/lib/python2.7/site-packages/sphinx/highlighting.py", line 26, in <module> from sphinx.ext import doctest SyntaxError: unqualified exec is not allowed in function 'run' it contains a nested function with free variables (doctest.py, line 97) ``` See: https://github.com/sphinx-doc/sphinx/issues/5417 Pinning to 1.7.9 to fix our CI. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `8f2c660d25`)	2018-09-13 13:24:40 +02:00
Guillaume Abrioux	8d6ba6f15c	defaults: add a default value to rgw_hostname let's add ansible_hostname as a default value for rgw_hostname if no hostname in servicemap matches ansible_fqdn. Fixes: #3063 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1622505 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `9ff26e80f2`)	2018-09-10 12:19:31 +00:00
Guillaume Abrioux	c4dfbf8880	tests: do not upgrade ceph release for switch_to_containers scenario Using `UPDATE_*` environment variables here will make an upgrade of the ceph release when running switch_to_containers scenario which is not correct. Eg: If ceph luminous was first deployed, then we should switch to ceph luminous containers, not to mimic. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-09-09 14:20:10 +02:00
Guillaume Abrioux	92e01ae027	Revert "client: add quotes to the dict values" This commit is adding quotes that make the keyring unusable, e.g.: ``` client.john key: AQAN0RdbAAAAABAAH5D3WgMN9Rxw3M8jkpMIfg== caps: [mds] '' caps: [mgr] 'allow *' caps: [mon] 'allow rw' caps: [osd] 'allow rw' ``` Trying to import such a keyring and use it will result in: ``` Error EACCES: access denied ``` Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1623417 This reverts commit `424815501a`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `ecbd3e4558`)	2018-09-07 18:34:56 +00:00
Sébastien Han	6db4fceba4	purge: only purge /var/lib/ceph content Sometime /var/lib/ceph is mounted on a device so we won't be able to remove it (device busy) so let's remove its content only. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1615872 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `38dc20e74b`)	2018-09-04 14:06:49 +02:00
Tom Barron	724c39b9a0	run rados cmd in container if containerized deployment When ceph-nfs is deployed containerized and ceph-common is not installed on the host the start_nfs task fails because the rados command is missing on the host. Run rados commands from a ceph container instead so that they will succeed. Signed-off-by: Tom Barron <tpb@dyncloud.net> (cherry picked from commit `bf8f589958`)	2018-09-04 09:40:51 +00:00
Markos Chandras	fea0491249	roles: ceph-rgw: Enable the ceph-radosgw target If the ceph-radosgw target is not enabled, then enabling the ceph-radosgw@ service has no effect since nothing will pull it on the next reboot. As such, we need to ensure that the target is enabled. Signed-off-by: Markos Chandras <mchandras@suse.de> (cherry picked from commit `217f35dbdb`)	2018-09-03 15:09:40 +00:00
Andy McCrae	d0947f0fcf	Dont run client dummy container on non-x86_64 hosts The dummy client container currently wont work on non-x86_64 hosts. This PR creates a filtered client group that contains only hosts that are x86_64 - which can then be the group to run the dummy container against. This is for the specific case of a containerized_deployment where there is a mixture of non-x86_64 hosts and x86_64 hosts. As such the filtered group will contain all hosts when running with containerized_deployment: false. Currently ppc64le is not supported for Ceph server components. Signed-off-by: Andy McCrae <andy.mccrae@gmail.com> (cherry picked from commit `772e6b9be2`)	2018-08-31 12:51:14 +00:00
Sébastien Han	bc50f1038c	doc: remove old statement We have been supporting multiple devices for journal in containerized deployments for a while now and forgot about this. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1622393 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `124fc727f4`)	2018-08-28 22:47:50 +00:00
Sébastien Han	65f135b057	remove warning for unsupported variables As promised, these will go unsupported for 3.1 so let's actually remove them :). Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1622729 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `9ba670567e`)	2018-08-28 22:47:50 +00:00
Sébastien Han	290ee8ddc0	sites: fix conditional Same problem again... ceph_release_num[ceph_release] is only set in ceph-docker-common/common roles so putting the condition on that role will never work. Removing the condition. The downside of this is we will be installing packages and then skip the role on the node. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1622210 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `ae5ebeeb00`)	2018-08-27 21:29:32 +00:00
Sébastien Han	0ae7bd4415	site-docker.yml: remove useless condition If we play site-docker.yml, we are already in a containerized_deployment. So the condition is not needed. Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `30cfeb5427`)	2018-08-23 15:31:28 +00:00
Sébastien Han	0725590bb4	ci: stop using different images on the same run There is no point in using hosts running on atomic AND centos hosts. So let's run containerized scenarios on Atomic only. This solves this error here: ``` fatal: [client2]: FAILED! => { "failed": true } MSG: The conditional check 'ceph_current_status.rc == 0' failed. The error was: error while evaluating conditional (ceph_current_status.rc == 0): 'dict object' has no attribute 'rc' The error appears to have been in '/home/jenkins-build/build/workspace/ceph-ansible-nightly-luminous-stable-3.1-ooo_collocation/roles/ceph-defaults/tasks/facts.yml': line 74, column 3, but may be elsewhere in the file depending on the exact syntax problem. The offending line appears to be: - name: set_fact ceph_current_status (convert to json) ^ here ``` From https://2.jenkins.ceph.com/view/ceph-ansible-stable3.1/job/ceph-ansible-nightly-luminous-stable-3.1-ooo_collocation/37/consoleFull#1765217701b5dd38fa-a56e-4233-a5ca-584604e56e3a What's happening here is all the hosts except the clients are running atomic, so here: https://github.com/ceph/ceph-ansible/blob/master/site-docker.yml.sample#L62 The condition will skip all the nodes except the clients, thus when running ceph-default, the task "is ceph running already?" is skipped but the task above needs the rc of the skipped task. This is not an error from the playbook, it's a CI setup issue. Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `7012835d2b`)	2018-08-23 15:31:28 +00:00
Sébastien Han	0fec9f6417	release-note: stable-3.1 stable-3.1 is approaching, so let's write our first release note. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-08-23 13:16:29 +02:00
Sébastien Han	8f9d97d3a1	defaults: fix rgw_hostname A couple of things were wrong in the initial commit: * ceph_release_num[ceph_release] >= ceph_release_num['luminous'] will never work since the ceph_release fact is set in the roles after. So either ceph-common or ceph-docker-common set it * we can easily re-use the initial command to check if a cluster is running, it's more elegant than running it twice. * set the fact rgw_hostname on rgw nodes only Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1618678 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `6d7fa99ff7`)	2018-08-22 19:57:59 +02:00
Sébastien Han	b187c508e7	rolling_upgrade: set sortbitwise properly Running 'osd set sortbitwise' when we detect a version 12 of Ceph is wrong. When OSD are getting updated, even though the package is updated they won't send their updated version (12) and will stick with 10 if the command is not applied. So we have to check if OSD are sending a version 10 and then run the command to unlock the OSDs. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1600943 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `2e6e885bb7`)	2018-08-21 14:21:29 +00:00
Sébastien Han	4ef9d42e86	iscsi group name preserve backward compatibility Recently we renamed the group_name for iscsi to iscsigws where previously it was named iscsi-gws. Existing deployments with a host file section with iscsi-gws must continue to work. This commit adds the old group name for backward compatibility; no error from Ansible should be expected: if the hostgroup is not found, nothing is played. Close: https://bugzilla.redhat.com/show_bug.cgi?id=1619167 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `77a3a682f3`)	2018-08-21 00:04:37 +02:00
Sébastien Han	aeff1dbfd8	osd: fix ceph_release We need ceph_release in the condition, not ceph_stable_release Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1619255 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `8c70a5b197`)	2018-08-20 23:13:42 +02:00
Sébastien Han	988b5a81d3	take-over-existing-cluster: do not call var_files We were using var_files long ago when default variables were not in ceph-defaults; now that the role exists this is not needed. Moreover, adding these two var files: - roles/ceph-defaults/defaults/main.yml - group_vars/all.yml will create collisions and override necessary variables. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1555305 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `b738706810`)	2018-08-20 14:47:32 +02:00
Markos Chandras	b2de642c8e	roles: ceph-defaults: Delegate cluster information task to monitor node Since commit `f422efb1d6` ("config: ensure rgw section has the correct name") we observe the following failures in new Ceph deployment with OpenStack-Ansible fatal: [aio1_ceph-rgw_container-fc588f0a]: FAILED! => {"changed": false, "cmd": "ceph --cluster ceph -s -f json", "msg": "[Errno 2] No such file or directory" This is because the task executes 'ceph' but at this point no package installation has happened. Packages are normally installed in the 'ceph-common' role which runs after the 'ceph-defaults' one. Since we are looking to obtain cluster information, the task should be delegated to a monitor node similar to other tasks in that role Signed-off-by: Markos Chandras <mchandras@suse.de> (cherry picked from commit `37e50114de`)	2018-08-20 14:18:07 +02:00
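
Several commits above (`d701c192e0`, `f008f40628`, `fdc2d7681d`) describe the same health check: count every PG whose state contains `active+clean` (including scrubbing and snaptrim variants), compare that count as an integer to `num_pgs`, and require `pgs_by_state` to have at least one entry. The following is a minimal Python sketch of that logic, not the code from `restart_osd_daemon.sh.j2` or `rolling_update.yml`. The function name `all_pgs_active_clean` is hypothetical, and the `state_name` / `count` field names are an assumption about the shape of the `pgmap` section in the cluster status JSON.

```python
def all_pgs_active_clean(pgmap):
    """Return True when every PG is in a state containing 'active+clean'.

    `pgmap` is assumed (not confirmed by the commit log) to look like:
    {"num_pgs": N, "pgs_by_state": [{"state_name": "...", "count": n}, ...]}
    """
    states = pgmap.get("pgs_by_state", [])
    # Guard against an empty list: without it, 0 == 0 would be a false positive
    # on a cluster that reports no PG states at all.
    if not states:
        return False
    # Substring match, so "active+clean+scrubbing+deep" counts as good too.
    good = sum(
        entry["count"] for entry in states if "active+clean" in entry["state_name"]
    )
    # Integer comparison against the total number of PGs.
    return good == pgmap["num_pgs"]


if __name__ == "__main__":
    sample = {
        "num_pgs": 10,
        "pgs_by_state": [
            {"state_name": "active+clean", "count": 8},
            {"state_name": "active+clean+scrubbing+deep", "count": 2},
        ],
    }
    print(all_pgs_active_clean(sample))
```

With this approach a cluster that is only scrubbing is treated as healthy, so the `noscrub` flag does not need to be set before restarting OSDs.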

3819 Commits (741ef7462986b14b68e76a4c58c8f24b8ce49582)