ceph-ansible/roles
Matthew Vernon 04f4991648 restart_osd_daemon.sh.j2 - consider active+clean+* pgs as OK
After restarting each OSD, restart_osd_daemon.sh checks that the
cluster is in a good state before moving on to the next one. One of
these checks is that the number of pgs in the state "active+clean"
equals the total number of pgs in the cluster.

On large clusters (e.g. we have 173,696 pgs), it is likely that at
least one pg will be scrubbing and/or deep-scrubbing at any one
time. These pgs are in state "active+clean+scrubbing" or
"active+clean+scrubbing+deep", so the script wrongly excluded them
from the "good" count. The same applies to "active+clean+snaptrim"
and "active+clean+snaptrim_wait".

Fix this by counting as good any pg whose state contains
"active+clean", and comparing that count as an integer against
num_pgs in pgmap.
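
The check described above can be sketched in Python as follows. This is
an illustration only, not the code from the commit: the function name
all_pgs_active_clean and the sample data are hypothetical, and the JSON
layout (a "pgmap" object with "num_pgs" and a "pgs_by_state" list of
"state_name"/"count" entries) is assumed to match the output of
`ceph -s -f json`.

```python
import json


def all_pgs_active_clean(status: dict) -> bool:
    """Return True if every pg is in a state containing "active+clean".

    `status` is assumed to be the parsed output of `ceph -s -f json`.
    """
    pgmap = status["pgmap"]
    # Sum the pg counts of every state whose name contains "active+clean",
    # e.g. "active+clean", "active+clean+scrubbing+deep",
    # "active+clean+snaptrim_wait".
    good = sum(
        entry["count"]
        for entry in pgmap.get("pgs_by_state", [])
        if "active+clean" in entry["state_name"]
    )
    # Integer comparison against the total number of pgs.
    return good == int(pgmap["num_pgs"])


# Hypothetical sample: a large cluster with a few pgs being scrubbed.
scrubbing = json.loads("""
{"pgmap": {"num_pgs": 173696, "pgs_by_state": [
    {"state_name": "active+clean", "count": 173690},
    {"state_name": "active+clean+scrubbing", "count": 4},
    {"state_name": "active+clean+scrubbing+deep", "count": 2}
]}}
""")

# Hypothetical sample: one pg is degraded, so the cluster is not yet OK.
degraded = json.loads("""
{"pgmap": {"num_pgs": 128, "pgs_by_state": [
    {"state_name": "active+clean", "count": 127},
    {"state_name": "active+undersized+degraded", "count": 1}
]}}
""")

print(all_pgs_active_clean(scrubbing))
print(all_pgs_active_clean(degraded))
```

With these samples the first call reports a healthy cluster even though
six pgs are scrubbing, while the second does not, because one pg is in a
state that does not contain "active+clean".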

(could this be backported to at least stable-3.0 please?)

Closes: #2008
Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk>
2018-09-24 10:33:46 +00:00
ceph-agent ceph-agent: Add support for the openSUSE Leap distributions 2017-11-14 10:51:23 +00:00
ceph-client Revert "client: add quotes to the dict values" 2018-09-07 17:21:55 +00:00
ceph-common defaults: fix rgw_hostname 2018-08-22 17:46:00 +02:00
ceph-common-coreos syntax: change local_action syntax 2018-01-31 10:45:34 +01:00
ceph-config ceph-config: calculate num_osds for the lvm batch scenario 2018-09-20 15:41:52 +00:00
ceph-defaults restart_osd_daemon.sh.j2 - consider active+clean+* pgs as OK 2018-09-24 10:33:46 +00:00
ceph-docker-common ceph-defaults: add the nautilus 14.x entry to ceph_release_num 2018-06-01 16:51:23 +02:00
ceph-fetch-keys Add ceph_keyring_permissions variable to control permissions for 2018-06-28 15:48:39 +00:00
ceph-iscsi-gw igw: enable and start rbd-target-api 2018-09-13 19:35:45 +00:00
ceph-mds ceph-mds: enable application pool 2018-07-02 10:28:34 +00:00
ceph-mgr mgr: improve/fix disabled modules check 2018-08-20 11:23:58 +02:00
ceph-mon mon: fix `ExecStartPre` option in systemd unit file 2018-09-11 10:48:21 +02:00
ceph-nfs nfs: ignore error on semanage command for ganesha_t 2018-09-13 13:06:47 +02:00
ceph-osd ceph_volume: adds the osds_per_device parameter 2018-09-12 20:27:14 +00:00
ceph-rbd-mirror rbd-mirror: bring back compatibility with jewel deployment 2018-07-26 18:47:10 +00:00
ceph-restapi common: switch from docker module to docker_container 2018-07-10 20:08:07 +00:00
ceph-rgw roles: ceph-rgw: Enable the ceph-radosgw target 2018-09-03 15:48:58 +02:00
ceph-validate/tasks validate: fail if fqdn deployment attempted 2018-08-13 10:04:24 +02:00
ceph.ceph-common rollback previous change for ceph-common change 2016-03-02 18:44:36 +01:00
ceph.ceph-docker-common ceph-docker-common: add symlink to ceph.ceph-docker-common 2017-01-16 10:42:42 -06:00