ceph-ansible/roles
Guillaume Abrioux 52ff9ce5d1 facts: add a retry on get current fsid task
sometimes it can happen the following task fails:

```
TASK [ceph-facts : get current fsid] *******************************************
task path: /home/jenkins-build/build/workspace/ceph-ansible-prs-dev-centos-container-update/roles/ceph-facts/tasks/facts.yml:78
Wednesday 19 June 2019  18:12:49 +0000 (0:00:00.203)       0:02:39.995 ********
fatal: [mon2 -> mon1]: FAILED! => changed=true
  cmd:
  - timeout
  - --foreground
  - -s
  - KILL
  - 600s
  - docker
  - exec
  - ceph-mon-mon1
  - ceph
  - --cluster
  - ceph
  - daemon
  - mon.mon1
  - config
  - get
  - fsid
  delta: '0:00:00.239339'
  end: '2019-06-19 18:12:49.812099'
  msg: non-zero return code
  rc: 22
  start: '2019-06-19 18:12:49.572760'
  stderr: 'admin_socket: exception getting command descriptions: [Errno 2] No such file or directory'
  stderr_lines: <omitted>
  stdout: ''
  stdout_lines: <omitted>
```

not sure exactly why since just before this task, mon1 seems to be well
UP otherwise it wouldn't have passed the task `waiting for the
containerized monitor to join the quorum`.

As a quick fix/workaround, let's add a retry which allows us to get
around this situation:

```
TASK [ceph-facts : get current fsid] *******************************************
task path: /home/jenkins-build/build/workspace/ceph-ansible-scenario/roles/ceph-facts/tasks/facts.yml:78
Thursday 20 June 2019  15:35:07 +0000 (0:00:00.201)       0:03:47.288 *********
FAILED - RETRYING: get current fsid (3 retries left).
changed: [mon2 -> mon1] => changed=true
  attempts: 2
  cmd:
  - timeout
  - --foreground
  - -s
  - KILL
  - 600s
  - docker
  - exec
  - ceph-mon-mon1
  - ceph
  - --cluster
  - ceph
  - daemon
  - mon.mon1
  - config
  - get
  - fsid
  delta: '0:00:00.290252'
  end: '2019-06-20 15:35:13.960188'
  rc: 0
  start: '2019-06-20 15:35:13.669936'
  stderr: ''
  stderr_lines: <omitted>
  stdout: |-
    {
        "fsid": "153e159d-7ade-42a7-842c-4d04348b901e"
    }
  stdout_lines: <omitted>
```

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 46a2683944)
2019-06-20 14:01:33 -04:00
..
ceph-client ansible: use 'bool' filter on boolean conditionals 2019-06-07 16:05:51 +02:00
ceph-common ansible: use 'bool' filter on boolean conditionals 2019-06-07 16:05:51 +02:00
ceph-config tests: Update ansible ssh_args variable 2019-06-17 16:45:38 +02:00
ceph-container-common ansible: use 'bool' filter on boolean conditionals 2019-06-07 16:05:51 +02:00
ceph-container-engine container-common: support podman on Ubuntu 2019-06-07 10:12:36 -04:00
ceph-dashboard dashboard: fix dashboard_url setting 2019-06-12 17:04:57 -04:00
ceph-defaults align cephfs pool creation 2019-06-18 09:17:13 +02:00
ceph-facts facts: add a retry on get current fsid task 2019-06-20 14:01:33 -04:00
ceph-fetch-keys meta: set the right minimum ansible version required for galaxy 2018-12-11 09:59:25 +01:00
ceph-grafana dashboard: add allow_embedding support 2019-06-12 17:05:26 -04:00
ceph-handler ceph-handler: replace fuser by /proc/net/unix 2019-06-12 23:00:36 +02:00
ceph-infra ceph-infra: make chronyd default NTP daemon 2019-06-14 12:21:02 +00:00
ceph-iscsi-gw iscsi: assign application (rbd) to pool 'rbd' 2019-06-13 14:51:19 -04:00
ceph-mds align cephfs pool creation 2019-06-18 09:17:13 +02:00
ceph-mgr podman: Add systemd dependency on network.target 2019-06-07 16:06:26 +02:00
ceph-mon mon: enforce mon0 delegation for initial_mon_key register 2019-06-13 07:39:07 +02:00
ceph-nfs align cephfs pool creation 2019-06-18 09:17:13 +02:00
ceph-node-exporter ceph-node-exporter: Fix systemd template 2019-06-13 07:37:26 +02:00
ceph-osd osd: remove legacy task 2019-06-19 08:41:25 +00:00
ceph-prometheus Fix units and add ability to have a dedicated instance 2019-06-12 11:48:12 +02:00
ceph-rbd-mirror podman: Add systemd dependency on network.target 2019-06-07 16:06:26 +02:00
ceph-rgw podman: Add systemd dependency on network.target 2019-06-07 16:06:26 +02:00
ceph-rgw-loadbalancer Add role definitions of ceph-rgw-loadbalancer 2019-06-06 19:44:30 +00:00
ceph-validate validate: fail in check_devices at the right task 2019-06-10 08:11:39 +02:00