From 48394597c9b57897d4570e73c4e0409df58ca035 Mon Sep 17 00:00:00 2001 From: Christian Zunker Date: Wed, 20 Jun 2018 07:01:06 +0200 Subject: [PATCH] reset failed count of ceph-mgr Depending on your setup, ceph-mgr might get restarted multiple times. When this is done to fast, systemd will prevent further restarts because of configured limits in the ceph-mgr systemd unit file. Resetting the failure count will prevent this problem. The reset is done before the restart so in case of a real problem during the restart it still fails. Fixes: #2768 Signed-off-by: Christian Zunker --- roles/ceph-defaults/templates/restart_mgr_daemon.sh.j2 | 1 + 1 file changed, 1 insertion(+) diff --git a/roles/ceph-defaults/templates/restart_mgr_daemon.sh.j2 b/roles/ceph-defaults/templates/restart_mgr_daemon.sh.j2 index 472cdf459..891b4e52b 100644 --- a/roles/ceph-defaults/templates/restart_mgr_daemon.sh.j2 +++ b/roles/ceph-defaults/templates/restart_mgr_daemon.sh.j2 @@ -8,6 +8,7 @@ SOCKET=/var/run/ceph/{{ cluster }}-mgr.${MGR_NAME}.asok DOCKER_EXEC="docker exec ceph-mgr-{{ ansible_hostname }}" {% endif %} +systemctl reset-failed ceph-mgr@${MGR_NAME} # First, restart the daemon systemctl restart ceph-mgr@${MGR_NAME}