ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	f463d1838e	mgr: wait for all mgr to be available before managing mgr modules, we must ensure all mgr are available otherwise we can hit failure like following: ``` stdout:Error ENOENT: all mgr daemons do not support module 'restful', pass --force to force enablement ``` It happens because all mgr are not yet available when trying to manage with mgr modules. This should have been cherry-picked from `41f7518c1b` but there's too much changes. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-07-11 10:02:25 +02:00
Guillaume Abrioux	64bee9cb86	osd: backward compatibility with old disk_list.sh location Since all files in container image have moved to `/opt/ceph-container` this check must look for new AND the old path so it's backward compatible. Otherwise it could end up by templating an inconsistent `ceph-osd-run.sh`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `987bdac963`)	2019-04-02 11:09:46 +02:00
Guillaume Abrioux	69cda84a21	iscsi-gws: remove a leftover remove leftover introduced by `9d590f4` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `d4b3c1d409`)	2019-03-28 15:36:26 +00:00
Guillaume Abrioux	ff243781c5	iscsi: fix permission denied error Typical error: ``` fatal: [iscsi-gw0]: FAILED! => msg: 'an error occurred while trying to read the file ''/home/guits/ceph-ansible/tests/functional/all_daemons/fetch/e5f4ab94-c099-4781-b592-dbd440a9d6f3/iscsi-gateway.key'': [Errno 13] Permission denied: b''/home/guits/ceph-ansible/tests/functional/all_daemons/fetch/e5f4ab94-c099-4781-b592-dbd440a9d6f3/iscsi-gateway.key''' ``` `become: True` is not needed on the following task: `copy crt file(s) to gateway nodes`. Since it's already set in the main playbook (site.yml/site-container.yml) The thing is that the files get generated in the 'fetch_directory' with root user because there is a 'delegate_to' + we run the playbook with `become: True` (from main playbook). The idea here is to create files under ansible user so we can open them later to copy them on the remote machine. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `9d590f4339`)	2019-03-28 15:36:26 +00:00
Rishabh Dave	b39345751f	ceph-common: disable unrequired NTP services When one of the currently supported NTP services has been set up, disable rest of the NTP services on Ceph nodes. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1651875 Signed-off-by: Rishabh Dave <ridave@redhat.com> (cherry picked from commit `6fa757d343`)	2019-01-14 16:37:35 +01:00
Rishabh Dave	ada7a400c2	ceph-common: merge ntp_debian.yml and ntp_rpm.yml Merge ntp_debian.yml and ntp_rpm.yml into one (the new file is called setup_ntp.yml) since they are almost identical. Since this is as a "as it is" backport for the original commit, it also adds the feature of supporting multiple NTP daemons (namely, chronyd & timesyncd). This is to maintain consistency across all branches since the backport for stable-3.2 was auto-merged by mergify despite of conflicts. Signed-off-by: Rishabh Dave <ridave@redhat.com> (cherry picked from commit `b03ab60742`)	2019-01-14 16:37:35 +01:00
Benjamin Cherian	bb41a7da20	Add support for different NTP daemons Allow user to choose between timesyncd, chronyd and ntpd Installation will default to timesyncd since it is distributed as part of the systemd installation for most distros. Added note indicating NTP daemon type is not used for containerized deployments. Fixes issue #3086 on Github Signed-off-by: Benjamin Cherian <benjamin_cherian@amat.com> (cherry picked from commit `85071e6e53`)	2019-01-14 16:37:35 +01:00
Sébastien Han	c34027c3ba	rolling_update: do not fail on missing keys We don't want to fail on key that are not present since they will get created after the mons are updated. They will be created by the task "create potentially missing keys (rbd and rbd-mirror)". Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1650572 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-11-29 15:50:07 +01:00
Noah Watkins	e089f46607	Stringify ceph_docker_image_tag This could be a numeric input, but is treated like a string leading to runtime errors. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1635823 Signed-off-by: Noah Watkins <nwatkins@redhat.com> (cherry picked from commit `8dcc8d1434`)	2018-10-16 14:35:08 +02:00
Noah Watkins	75c9130865	Avoid using tests as filter Fixes the deprecation warning: [DEPRECATION WARNING]: Using tests as filters is deprecated. Instead of using `result\|search` use `result is search`. Signed-off-by: Noah Watkins <nwatkins@redhat.com> (cherry picked from commit `306e308f13`)	2018-10-16 14:35:08 +02:00
Guillaume Abrioux	75c2b83e43	defaults: fix osd containers handler `ceph_osd_container_stat` might not be set on other osd node. We must ensure we are on the last node before trying to evaluate `ceph_osd_container_stat`. This should have been backported but it's part of a too important refact in master that can't be backported. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-15 10:33:56 +02:00
Guillaume Abrioux	4e4184e579	defaults: fix osd handlers that are never triggered `run_once: true` + `inventory_hostname == groups.get(osd_group_name) \| last` is a bad combination since if the only node being run isn't the last, the task will be definitly skipped. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-03 14:09:39 +00:00
Guillaume Abrioux	ba6c3a8e6b	config: look up for monitor_address_block in hostvars `monitor_address_block` should be read from hostvars[host] instead of current node being played. eg: Let's assume we have: ``` [mons] ceph-mon0 monitor_address=192.168.1.10 ceph-mon1 monitor_interface=eth1 ceph-mon2 monitor_address_block=192.168.1.0/24 ``` the ceph.conf generation task will end up with: ``` fatal: [ceph-mon0]: FAILED! => {} MSG: 'ansible.vars.hostvars.HostVarsVars object' has no attribute u'ansible_interface' ``` the reason is that it will assume `monitor_address_block` isn't defined even on ceph-mon2 because looking for `monitor_address_block` instead of `hostvars[host]['monitor_address_block']`, therefore it enters in the condition as default value: ``` {%- else -%} {% set interface = 'ansible_' + (monitor_interface \| replace('-', '_')) %} {% if ip_version == 'ipv4' -%} {{ hostvars[host][interface][ip_version]['address'] }} {%- elif ip_version == 'ipv6' -%} [{{ hostvars[host][interface][ip_version][0]['address'] }}] {%- endif %} {%- endif %} ``` `monitor_interface` is set with default value `'interface'` so the `interface` variable is built with 'ansible_' + 'interface'. It makes ansible throwing a confusing message about `'ansible_interface'`. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1635303 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `6130bc841d`)	2018-10-02 21:54:09 +00:00
Matthew Vernon	0bb13cff08	restart_osd_daemon.sh.j2 - use `+` rather than `{1,}` in regex `+` is more idiomatic for "one or more" in a regex than `{1,}`; the latter was introduced in a previous fix for an incorrect `{1,2}` restriction. Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk> (cherry picked from commit `806461ac6e`)	2018-09-26 21:38:36 +00:00
Matthew Vernon	d701c192e0	restart_osd_daemon.sh.j2 - consider active+clean+* pgs as OK After restarting each OSD, restart_osd_daemon.sh checks that the cluster is in a good state before moving on to the next one. One of the checks it does is that the number of pgs in the state "active+clean" is equal to the total number of pgs in the cluster. On large clusters (e.g. we have 173,696 pgs), it is likely that at least one pg will be scrubbing and/or deep-scrubbing at any one time. These pgs are in state "active+clean+scrubbing" or "active+clean+scrubbing+deep", so the script was erroneously not including them in the "good" count. Similar concerns apply to "active+clean+snaptrim" and "active+clean+snaptrim_wait". Fix this by considering as good any pg whose state contains active+clean. Do this as an integer comparison to num_pgs in pgmap. (could this be backported to at least stable-3.0 please?) Closes: #2008 Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk> (cherry picked from commit `04f4991648`)	2018-09-26 21:38:36 +00:00
Giulio Fidente	7d2a13f8c7	Fix version check in ceph.conf template We need to look for ceph_release when comparing with release names, not ceph_version. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1631789 Signed-off-by: Giulio Fidente <gfidente@redhat.com> (cherry picked from commit `6126210e0e`)	2018-09-24 12:32:32 +00:00
Matthew Vernon	93bc69e81e	restart_osd_daemon.sh.j2 - Reset RETRIES between calls of check_pgs Previously RETRIES was set (by default to 40) once at the start of the script; this meant that it would only ever wait for up to 40 lots of 30s across all the OSDs on a host before bombing out. In fact, we want to be prepared to wait for the same amount of time after each OSD restart for the clusters' pgs to be happy again before continuing. Closes: #3154 Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk> (cherry picked from commit `aa97ecf048`)	2018-09-24 11:13:21 +00:00
Guillaume Abrioux	4ce11a8493	config: set default _rgw_hostname value to respective host the default value for _rgw_hostname was took from the current node being played while it should be took from the respective node in the loop. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1622505 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `6d6fd514e0`)	2018-09-18 19:27:50 +00:00
Guillaume Abrioux	0e86587197	nfs: ignore error on semanage command for ganesha_t As of rhel 7.6, it has been decided it doesn't make sense to confine `ganesha_t` anymore. It means this domain won't exist anymore. Let's add a `failed_when: false` in order to make the deployment not failing when trying to run this command. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1626070 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `a6f77340fd`)	2018-09-13 13:28:47 +00:00
Guillaume Abrioux	8d6ba6f15c	defaults: add a default value to rgw_hostname let's add ansible_hostname as a default value for rgw_hostname if no hostname in servicemap matches ansible_fqdn. Fixes: #3063 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1622505 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `9ff26e80f2`)	2018-09-10 12:19:31 +00:00
Guillaume Abrioux	92e01ae027	Revert "client: add quotes to the dict values" This commit is adding quotes that make keyring unusuable eg: ``` client.john key: AQAN0RdbAAAAABAAH5D3WgMN9Rxw3M8jkpMIfg== caps: [mds] '' caps: [mgr] 'allow *' caps: [mon] 'allow rw' caps: [osd] 'allow rw' ``` Trying to import such a keyring and use it will result: ``` Error EACCES: access denied ``` Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1623417 This reverts commit `424815501a`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `ecbd3e4558`)	2018-09-07 18:34:56 +00:00
Tom Barron	724c39b9a0	run rados cmd in container if containerized deployment When ceph-nfs is deployed containerized and ceph-common is not installed on the host the start_nfs task fails because the rados command is missing on the host. Run rados commands from a ceph container instead so that they will succeed. Signed-off-by: Tom Barron <tpb@dyncloud.net> (cherry picked from commit `bf8f589958`)	2018-09-04 09:40:51 +00:00
Markos Chandras	fea0491249	roles: ceph-rgw: Enable the ceph-radosgw target If the ceph-radosgw target is not enabled, then enabling the ceph-radosgw@ service has no effect since nothing will pull it on the next reboot. As such, we need to ensure that the target is enabled. Signed-off-by: Markos Chandras <mchandras@suse.de> (cherry picked from commit `217f35dbdb`)	2018-09-03 15:09:40 +00:00
Andy McCrae	d0947f0fcf	Dont run client dummy container on non-x86_64 hosts The dummy client container currently wont work on non-x86_64 hosts. This PR creates a filtered client group that contains only hosts that are x86_64 - which can then be the group to run the dummy container against. This is for the specific case of a containerized_deployment where there is a mixture of non-x86_64 hosts and x86_64 hosts. As such the filtered group will contain all hosts when running with containerized_deployment: false. Currently ppc64le is not supported for Ceph server components. Signed-off-by: Andy McCrae <andy.mccrae@gmail.com> (cherry picked from commit `772e6b9be2`)	2018-08-31 12:51:14 +00:00
Sébastien Han	65f135b057	remove warning for unsupported variables As promised, these will go unsupported for 3.1 so let's actually remove them :). Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1622729 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `9ba670567e`)	2018-08-28 22:47:50 +00:00
Sébastien Han	8f9d97d3a1	defaults: fix rgw_hostname A couple if things were wrong in the initial commit: * ceph_release_num[ceph_release] >= ceph_release_num['luminous'] will never work since the ceph_release fact is set in the roles after. So either ceph-common or ceph-docker-common set it * we can easily re-use the initial command to check if a cluster is running, it's more elegant than running it twice. * set the fact rgw_hostname on rgw nodes only Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1618678 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `6d7fa99ff7`)	2018-08-22 19:57:59 +02:00
Sébastien Han	aeff1dbfd8	osd: fix ceph_release We need ceph_release in the condition, not ceph_stable_release Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1619255 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `8c70a5b197`)	2018-08-20 23:13:42 +02:00
Markos Chandras	b2de642c8e	roles: ceph-defaults: Delegate cluster information task to monitor node Since commit `f422efb1d6` ("config: ensure rgw section has the correct name") we observe the following failures in new Ceph deployment with OpenStack-Ansible fatal: [aio1_ceph-rgw_container-fc588f0a]: FAILED! => {"changed": false, "cmd": "ceph --cluster ceph -s -f json", "msg": "[Errno 2] No such file or directory" This is because the task executes 'ceph' but at this point no package installation has happened. Packages are normally installed in the 'ceph-common' role which runs after the 'ceph-defaults' one. Since we are looking to obtain cluster information, the task should be delegated to a monitor node similar to other tasks in that role Signed-off-by: Markos Chandras <mchandras@suse.de> (cherry picked from commit `37e50114de`)	2018-08-20 14:18:07 +02:00
Markos Chandras	e9433afd6c	roles: ceph-defaults: Check if 'rgw' attribute exists for rgw_hostname If there are no services on the cluster, then the 'rgw' could be missing and the task is failing with the following problem: msg": "The task includes an option with an undefined variable. The error was: 'dict object' has no attribute 'rgw' We fix this by checking the existence of the 'rgw' attribute. If it's missing, we skip the task since the role already contains code to set a good default rgw_hostname. Signed-off-by: Markos Chandras <mchandras@suse.de> (cherry picked from commit `126e2e3f92`)	2018-08-20 14:18:07 +02:00
Dardo D Kleiner	2c77e1ac4e	mgr: improve/fix disabled modules check Follow up on `36942af698` "disabled_modules" is always a list, it's the items in the list that can be dicts in mimic. Many ways to fix this, here's one. Signed-off-by: Dardo D Kleiner <dardokleiner@gmail.com> (cherry picked from commit `f6519e4003`)	2018-08-20 11:49:30 +00:00
Sébastien Han	28fc45e346	Revert "osd: generate device list for osd_auto_discovery on rolling_update" This reverts commit `e84f11e99e`. This commit was giving a new failure later during the rolling_update process. Basically, this was modifying the list of devices and started impacting the ceph-osd itself. The modification to accomodate the osd_auto_discovery parameter should happen outside of the ceph-osd. Also we are trying to not play ceph-osd role during the rolling_update process so we can speed up the upgrade. Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `3149b2564f`)	2018-08-16 13:35:23 +00:00
Mike Christie	c44638ae7e	stable 3.1 igw: add api setting support Port the parts of this upstream commit: commit `91bf53ee93` Author: Sébastien Han <seb@redhat.com> Date: Fri Mar 23 11:24:56 2018 +0800 ceph-iscsi: support for containerize deployment that allows configuration of API settings in roles/ceph-iscsi-gw/templates/iscsi-gateway.cfg.j2 using the iscsi-gws.yml. This fixes Red Hat BZ: https://bugzilla.redhat.com/show_bug.cgi?id=1613963 Signed-off-by: Mike Christie <mchristi@redhat.com>	2018-08-14 10:23:12 +02:00
Mike Christie	2b76e3771d	stable 3.1 igw: enable and start rbd-target-api Backport https://github.com/ceph/ceph-ansible/pull/2984 to stable 3.1. From upstream commit: commit `1164cdc002` Author: Guillaume Abrioux <gabrioux@redhat.com> Date: Thu Aug 2 11:58:47 2018 +0200 iscsigw: install ceph-iscsi-cli package installs the cli package but does not start and enable the rbd-target-api daemon needed for gwcli to communicate with the igw nodes. This just enables and starts it. This fixes Red Hat BZ https://bugzilla.redhat.com/show_bug.cgi?id=1613963. Signed-off-by: Mike Christie <mchristi@redhat.com>	2018-08-14 10:23:12 +02:00
Guillaume Abrioux	904a0a4017	fail if fqdn deployment attempted fqdn configuration possibility caused a lot of trouble, it's adding a lot of complexity because of multiple cases and the relation between ceph-ansible and ceph-container. Moreover, there is no benefit for such a feature. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1613155 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-08-13 18:55:06 +02:00
Guillaume Abrioux	97cf08e897	config: ensure rgw section has the correct name the ceph.conf.j2 always assumes the hostname used to register the radosgw in the servicemap is equivalent to `{{ ansible_hostname }}` which returns the shortname form. We need to detect which form of the hostname was used in case of already deployed cluster and update the ceph.conf accordingly. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1580408 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f422efb1d6`)	2018-08-13 18:55:06 +02:00
Guillaume Abrioux	95c28e78d1	mgr: backward compatibility for module management Follow up on `3abc253fec` The structure had even changed within `luminous` release. It was first: ``` { "enabled_modules": [ "balancer", "dashboard", "restful", "status" ], "disabled_modules": [ "influx", "localpool", "prometheus", "selftest", "zabbix" ] } ``` Then it changed for: ``` { "enabled_modules": [ "status" ], "disabled_modules": [ "balancer", "dashboard", "influx", "localpool", "prometheus", "restful", "selftest", "zabbix" ] } ``` and finally: ``` { "enabled_modules": [ "status" ], "disabled_modules": [ { "name": "balancer", "can_run": true, "error_string": "" }, { "name": "dashboard", "can_run": true, "error_string": "" } ] } ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `36942af698`)	2018-08-13 16:05:21 +00:00
Guillaume Abrioux	9a013ab333	tests: resync iscsigw group name with master let's align the name of that group in stable-3.1 with master branch. Not having the same group name on different branches is confusing and make some nightlies job failing in the CI. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-08-13 12:24:59 +02:00
Sébastien Han	8ea9d14050	osd: generate device list for osd_auto_discovery on rolling_update rolling_update relies on the list of devices when performing the restart of the OSDs. The task that is builind the devices list out of the ansible_devices dict only runs when there are no partitions on the drives. However during an upgrade the OSD are already configured, they have been prepared and have partitions so this task won't run and thus the devices list will be empty, skipping the restart during rolling_update. We now run the same task under different requirements when rolling_update is true and build a list when: * osd_auto_discovery is true * rolling_update is true * ansible_devices exists * no dm/lv are part of the discovery * the device is not removable * the device has more than 1 sector Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1613626 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `e84f11e99e`)	2018-08-10 16:30:40 +02:00
Sébastien Han	12083bdab4	mon: fix calamari initialisation If calamari is already installed and ceph has been upgraded to a higher version the initialisation will fail later. So if we detect the calamari-server is too old compare to ceph_rhcs_version we try to update it. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1601755 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `4c9e24a90f`)	2018-08-10 14:15:16 +02:00
Sébastien Han	651058bd1b	rgw: remove useless condition The include does not need a condition on containerized_deployment since we are already in an include than has the same condition. Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `5a89479abe`)	2018-08-09 15:38:17 +02:00
Sébastien Han	eba9547a6e	rgw: remove unused file copy_configs.yml was not including and is a leftover so let's remove it. Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `3bce117de2`)	2018-08-09 15:38:17 +02:00
Sébastien Han	a16dc0e1de	rgw: ability to use ceph-ansible vars into containers Since the container now simply reads the ceph.conf, we remove all the unnecessary options. Also this PR is the foundation to support multiple backend, such as the new 'beast' from Ceph Mimic. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1582411 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `4d64dd4686`) # Conflicts: # roles/ceph-rgw/tasks/docker/main.yml	2018-08-09 15:38:17 +02:00
Ken Dreyer	1a2c6a3572	common: upgrade/install ceph-test deb first When we deploy a Jewel cluster on Ubuntu with ceph_test: True, we're unable to upgrade that cluster to Luminous. "apt-get install ceph-common" fails to upgrade to luminous if a jewel ceph-test package is installed: Some packages could not be installed. This may mean that you have requested an impossible situation or if you are using the unstable distribution that some required packages have not yet been created or been moved out of Incoming. The following information may help to resolve the situation: The following packages have unmet dependencies: ceph-base : Breaks: ceph-test (< 12.2.2-14) but 10.2.11-1xenial is to be installed ceph-mon : Breaks: ceph-test (< 12.2.2-14) but 10.2.11-1xenial is to be installed In ceph-ansible master, we resolve this whole class of problem by installing all the packages in one operation (see `b338fafd90`). For the stable-3.1 branch, take a less-invasive approach, and upgrade ceph-test prior to any other package. This matches the approach I took for RPMs in `3752cc6f38`, before we had the better solution in `b338fafd90`. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1610997 Signed-off-by: Ken Dreyer <kdreyer@redhat.com>	2018-08-09 14:39:33 +02:00
Graeme Gillies	19958f5c27	Allow mgr bootstrap keyring to be defined In environments where we wish to have manual/greater control over how the bootstrap keyrings are used, we need to able to externally define what the mgr keyring secret will be and have ceph-ansible use it, instead of it being autogenerated Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1610213 Signed-off-by: Graeme Gillies <ggillies@akamai.com> (cherry picked from commit `a46025820d`)	2018-08-09 08:25:27 +00:00
Guillaume Abrioux	9403a3df09	iscsigw: install ceph-iscsi-cli package Install ceph-iscsi-cli in order to provide the `gwcli` command tool. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1602785 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `1164cdc002`)	2018-08-07 09:46:25 +02:00
Artur Fijalkowski	290035171f	Fix in regular expression matching OSD ID on non-contenerized deployment. restart_osd_daemon.sh is used to discover and restart all OSDs on a host. To do it the scripts loops the list of ceph-osd@ services in the system. This commit fixes bug in the regular expression responsile for extraction of OSDs - prior version uses `[0-9]{1,2}` expression which is ignoring all OSDS which numbers are greater than 99 (thus longer than 2 digits). Fix removed upper limit of digits in the number. This problem existed in two places in the script. Closes: #2964 Signed-off-by: Artur Fijalkowski <artur.fijalkowski@ing.com> (cherry picked from commit `52d9d406b1`)	2018-08-06 18:50:39 +00:00
Guillaume Abrioux	706d0b8289	defaults: backward compatibility with fqdn deployments This commit ensures we are backward compatible with fqdn deployments. Since ceph-container enforces deployment to be done with shortname, we must keep backward compatibility with clusters already deployed with fqdn configuration Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `0a6ff6bbf8`)	2018-08-06 14:09:35 +00:00
Sébastien Han	2d5ed5ef8e	config: enforce socket name This was introduced by `59ee2e8d3b` and made our socket checks impossible to run. The PID could be found, but the cctid cannot. This happens during upgrade to mimic and on cluster running on mimic. So let's force the admin socket the way it was so we can properly check for existing instances also the line $cluster-$name.$pid.$cctid.asok is only needed when running multiple instances of the same daemon, thing ceph-ansible cannot do at the time of writing Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1610220 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `ea9e60d48d`)	2018-08-02 12:34:48 +00:00
Mike Christie	99f84f88af	igw: fix image removal during purge We were not passing in the ceph conf info into the rbd image removal command, so if the clustername was not the default igw purge would fail due to the rbd rm command failing. This just fixes the bug by passing in the ceph conf info which has the clustername to use. This fixes Red Hat bugzilla: https://bugzilla.redhat.com/show_bug.cgi?id=1601949 Signed-off-by: Mike Christie <mchristi@redhat.com> (cherry picked from commit `d572a9a602`)	2018-07-31 10:09:08 +02:00
Mike Christie	f3f734f8f3	igw: do not fail purge on rbd removal errors Instead of failing the entire purge operation when the rbd command fails just log an error. This will allow the higher level target and config cleanup to complete, and the user only has to manually delete the rbd images. Signed-off-by: Mike Christie <mchristi@redhat.com> (cherry picked from commit `6f72f96dad`)	2018-07-31 10:09:08 +02:00

1 2 3 4 5 ...

1879 Commits (88c6c3fccaa85ba6ea34b288fd80060d3d05dd88)