From 48394597c9b57897d4570e73c4e0409df58ca035 Mon Sep 17 00:00:00 2001
From: Christian Zunker <christian.zunker@codecentric.cloud>
Date: Wed, 20 Jun 2018 07:01:06 +0200
Subject: [PATCH] reset failed count of ceph-mgr

Depending on your setup, ceph-mgr might get restarted multiple times.
When this happens too fast, systemd refuses further restarts because of the
start limits configured in the ceph-mgr systemd unit file.

Resetting the failure count prevents this problem. The reset is done before
the restart, so a real problem during the restart still causes it to fail.
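
The ordering can be sketched roughly as follows. This is an illustration only:
the helper name restart_with_reset and the SYSTEMCTL override are hypothetical
and not part of this patch, which simply adds one line to the template.

```shell
#!/bin/sh
# Hypothetical override so the sequence can be dry-run (e.g. SYSTEMCTL=echo);
# defaults to the real systemctl binary.
SYSTEMCTL="${SYSTEMCTL:-systemctl}"

# restart_with_reset UNIT
# Clears the unit's failed state first, then restarts it. The function's exit
# status is that of the restart, so a genuine restart failure is still reported.
restart_with_reset() {
    unit="$1"
    $SYSTEMCTL reset-failed "$unit"
    $SYSTEMCTL restart "$unit"
}

# Example call (assumes MGR_NAME is set, as in the template):
#   restart_with_reset "ceph-mgr@${MGR_NAME}"
```

Running with SYSTEMCTL=echo prints the two commands in order without touching
any unit, which may help when checking the rendered script by hand.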

Fixes: #2768

Signed-off-by: Christian Zunker <christian.zunker@codecentric.cloud>
---
 roles/ceph-defaults/templates/restart_mgr_daemon.sh.j2 | 1 +
 1 file changed, 1 insertion(+)

diff --git a/roles/ceph-defaults/templates/restart_mgr_daemon.sh.j2 b/roles/ceph-defaults/templates/restart_mgr_daemon.sh.j2
index 472cdf459..891b4e52b 100644
--- a/roles/ceph-defaults/templates/restart_mgr_daemon.sh.j2
+++ b/roles/ceph-defaults/templates/restart_mgr_daemon.sh.j2
@@ -8,6 +8,7 @@ SOCKET=/var/run/ceph/{{ cluster }}-mgr.${MGR_NAME}.asok
 DOCKER_EXEC="docker exec ceph-mgr-{{ ansible_hostname }}"
 {% endif %}
 
+systemctl reset-failed ceph-mgr@${MGR_NAME}
 # First, restart the daemon
 systemctl restart ceph-mgr@${MGR_NAME}