ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Christian Zunker	48394597c9	reset failed count of ceph-mgr Depending on your setup, ceph-mgr might get restarted multiple times. When this is done to fast, systemd will prevent further restarts because of configured limits in the ceph-mgr systemd unit file. Resetting the failure count will prevent this problem. The reset is done before the restart so in case of a real problem during the restart it still fails. Fixes: #2768 Signed-off-by: Christian Zunker <christian.zunker@codecentric.cloud>	2018-06-20 13:59:16 +02:00
Sébastien Han	91bf53ee93	ceph-iscsi: support for containerize deployment We now have the ability to deploy a containerized version of ceph-iscsi. The result is similar to the non-containerized version, you simply have 3 containers running for the following services: * rbd-target-api * rbd-target-gw * tcmu-runner Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1508144 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-06-08 10:21:54 +02:00
Sébastien Han	2f43e9dab5	defaults: restart_osd_daemon unit spaces Extra space in systemctl list-units can cause restart_osd_daemon.sh to fail It looks like if you have more services enabled in the node space between "loaded" and "active" get more space as compared to one space given in command the command[1]. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1573317 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-05-18 17:53:47 +02:00
Simone Caronni	b12bf62c36	Make sure the restart_mds_daemon script is created with the correct MDS name	2018-05-08 20:53:15 +02:00
Sébastien Han	65ba85aff6	Expose /var/run/ceph Useful for softwares that do data collection/monitoring like collectd. They can connect to the socket and then retrieve information. Even though the sockets are exposed now, I'm keeping the docker exec to check the socket, this will allow newer version of ceph-ansible to work with older versions. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1563280 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-20 15:48:32 +02:00
Sébastien Han	18402b636f	defaults: add useful info if daemon are not restarted properly If OSDs don't restart normally we now also dump info of the crush map, crush rules, crush tree and pools. If the monitors don't restart normally we also print the socket status by calling mon_status and quorum_status. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Major Hayden	5676fa23b1	Convert interface names to underscores for facts If a deployer uses an interface name with a dash/hyphen in it, such as 'br-storage' for the monitor_interface group_var, the ceph.conf.j2 template fails to find the right facts. It looks for 'ansible_br-storage' but only 'ansible_br_storage' exists. This patch converts the interface name to underscores when the template does the fact lookup.	2017-12-12 09:03:40 +01:00
Guillaume Abrioux	44df3f9102	defaults: fix rgw restart script in handlers Like `80d32dec`, the path to the fact is not correct. In any case, we will retrieve the IP address in hostvars, the variable is the way we get the interface name according where it has been set (eg.: inventory host file vs. group_vars/) Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1510906 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-13 16:30:03 +01:00
Sébastien Han	4413511b66	all: backward compatibility between stable-2.2 and 3.0 stable-3.0 brought numerous changes in ceph-ansible variables, this PR aims to maintain backward compatibility for someone running stable-2.2 upgrading to stable-3.0 but keeps its groups_vars untouched. We will then determine the right options to make sure the upgrade works but we are expecting that new variables should be used. We will drop this in a near future, maybe 3.1 or 3.2. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-20 11:54:10 +02:00
Guillaume Abrioux	c2850b11be	rbd: fix restart script for jewel In Jewel, we don't use bootstrap-rbd keyring for rbd-mirror nodes, it results with a socket path/name different according to which ceph release you are deploying. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-18 11:10:49 +02:00
Sébastien Han	7054abef99	Merge pull request #2009 from ceph/fix-clean-pg [skip ci] handler: do not test if pgs_num = 0	2017-10-07 03:39:26 +02:00
Sébastien Han	ac29e8f977	Merge pull request #1983 from jprovaznik/suffix Allow to override systemd service instance id	2017-10-06 22:40:57 +02:00
Sébastien Han	d5ae0a3340	handler: do not test if pgs_num = 0 We don't need to wait if they are no PGS. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-06 16:57:46 +02:00
Jan Provaznik	3c16af5ef2	Allow to override systemd service instance id It's useful to have constant service instance id when ceph-nfs is managed by pacemaker.	2017-10-06 08:20:37 +02:00
Sébastien Han	5968cf09b1	ci: add collocation scenario Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-04 11:19:12 +02:00
Sébastien Han	e121bc58e9	defaults: add missing handlers for rbd mirorr and mgr Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-29 02:38:24 +02:00
Sébastien Han	341c9e077b	nfs: fix container setup and re-arrange files Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-29 02:38:24 +02:00
Sébastien Han	8f71c08e7b	handler: display ceph status properly Fix bash error, doing ceph "$CEPH_CLI" -s gives us ceph '--name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/test.keyring --cluster test' -s which results in a wrongly formatted command. Removing the double quotes expands the array properly. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-22 17:45:35 +02:00
Sébastien Han	ed3003cf41	defaults: restart docker daemon higher delay Use default delay since the mon (in particular) can take more time to restart. Solves error with: STDERR: Error response from daemon: No such container: ceph-mon-mon0 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-14 13:38:11 -06:00
Sébastien Han	3dd47a45cb	ceph-defaults: fix handlers for mds and rgw The way we handle the restart for both mds and rgw is not ideal, it will try to restart the daemon on the host that don't run the daemon, resulting in a service file being created (see bug description). Now we restart each daemon precisely and in a serialized fashion. Note: the current implementation does NOT support multiple mds or rgw on the same node. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1469781 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-31 19:02:21 +02:00
Sébastien Han	4f0ecb7f30	switch-from-non-containerized-to-containerized: simplify This commit eases the use of the infrastructure-playbooks/switch-from-non-containerized-to-containerized-ceph-daemons.yml playbook. We basically run it with a couple of pre-tasks and then we let the playbook run the docker roles. It obviously expect to have proper variables configured in order to work. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-23 18:39:45 +02:00
Guillaume Abrioux	7a333d05ce	Add handlers for containerized deployment Until now, there is no handlers for containerized deployments. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-08-02 17:12:20 +02:00

22 Commits (ea9e60d48d6631631ac9294d4ef291f8d7a30d78)