ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Dimitri Savineau	d408c75d76	podman: always remove container on start In case of failure, the systemd ExecStop isn't executed so the container isn't removed. After a reboot of a failed node, the container doesn't start because the old container is still present in created state. We should always try to remove the container in ExecStartPre for this situation. A normal reboot doesn't trigger this issue and this also doesn't affect nodes running containers via docker. This behaviour was introduced by `d43769d`. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1858865 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `47b7c00287`)	2020-07-24 12:47:21 -04:00
Dimitri Savineau	eb3f065d03	podman: Add Type and PIDFile value to unit files This changes the way we are running the podman containers via systemd. They are now in dettached mode and Type/PIDFile set. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1834974 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `d43769dc2a`)	2020-06-23 17:35:01 +02:00
Dimitri Savineau	09453e22f4	docker: Add Requires on docker service When using docker container engine then the systemd unit scripts only use a dependency on the docker daemon via the After parameter. But if docker is restarted on a live system then the ceph systemd units should wait for the docker daemon to be fully restarted. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1846830 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `bd22f1d1ec`)	2020-06-22 19:11:20 -04:00
Dimitri Savineau	a97e24fee9	docker2podman: manage dashboard nodes The dashboard nodes (alertmanager, grafana, node-exporter, and prometheus) were not manage during the docker to podman migration. This adds the systemd container template of those services to a dedicated file (systemd.yml) in order to include it in the docker2podman playbook. This also adds the dashboard container images pull from docker to podman. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1829389 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `252e78b4e4`)	2020-06-03 13:20:24 -04:00
Dimitri Savineau	3617543517	containers: add KillMode=none to systemd templates Because we are relying on docker\|podman for managing containers then we don't need systemd to manage the process (like kill). Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `5a03e0ee1c`)	2020-02-18 12:10:35 -05:00
Guillaume Abrioux	6295a33912	dashboard: run node_export as privileged container Typical error: ``` type=AVC msg=audit(1575367499.582:3210): avc: denied { search } for pid=26680 comm="node_exporter" name="1" dev="proc" ino=11528 scontext=system_u:system_r:container_t:s0:c100,c1014 tcontext=system_u:system_r:init_t:s0 tclass=dir permissive=0 ``` node_exporter needs to be run as privileged to avoid avc denied error since it gathers lot of information on the host. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1762168 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `d245eb7e7d`)	2019-12-09 17:27:51 +01:00
Guillaume Abrioux	1f30327688	purge: remove docker_* task All containers are removed when systemd stops them. There is no need to call this module in purge container playbook. This commit also removes all docker_image task and remove all container images in the final cleanup play. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1776736 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `d23383a820`)	2019-12-03 09:57:11 -05:00
Guillaume Abrioux	1a9043128c	dashboard: remove cfg80211 module installation According to this comment [1], this seems to be needed to detect wifi devices. In node exporter we can see this: ``` --collector.wifi Enable the wifi collector (default: disabled). ``` since it's enabled by default and we don't even change this in our systemd templates for node-exporter, we can easily assume in the end it's not needed. Therefore, let's remove this. [1] `dbf81b6b5b (diff-961545214e21efed3b84a9e178927a08L21-L23)` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `b9cdf341be`)	2019-07-29 15:46:58 +02:00
Dimitri Savineau	87db5aa55c	dashboard: use variables for port value The current port value for alertmanager, grafana, node-exporter and prometheus is hardcoded in the roles so it's not possible to change the port binding of those services. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `8ab9b719fa`)	2019-07-19 20:33:42 +00:00
Dimitri Savineau	f71e8f249f	ceph-node-exporter: Fix systemd template `069076b` introduced a bug in the systemd unit script template. This commit fixes the options used by the node-exporter container. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `d0840217f3`)	2019-06-13 07:37:26 +02:00
Dimitri Savineau	7c6a09152d	ceph-node-exporter: use modprobe ansible module Instead of using the modprobe command from the path in the systemd unit script, we can use the modprobe ansible module. That way we don't have to manage the binary path based on the linux distribution. Resolves: #4072 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `dbf81b6b5b`)	2019-06-12 10:02:54 -04:00
fmount	138fa19ccf	Fix units and add ability to have a dedicated instance Few fixes on systemd unit templates for node_exporter and alertmanager container parameters. Added the ability to use a dedicated instance to deploy the dashboard components (prometheus and grafana). This commit also introduces the grafana_group_name variable to refer grafana group and keep consistency with the other groups. During the integration with TripleO some grafana/prometheus template variables resulted undefined. This commit adds the ability to check if the group exist and create, accordingly, different job groups in prometheus template. Signed-off-by: fmount <fpantano@redhat.com> (cherry picked from commit `069076bbfd`)	2019-06-12 11:48:12 +02:00
Dimitri Savineau	e9edb5a92a	podman: Add systemd dependency on network.target When using podman, the systemd unit scripts don't have a dependency on the network. So we're not sure that the network is up and running when the containers are starting. With docker this behaviour is already handled because the systemd unit scripts depend on docker service which is started after the network. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `f49090df7e`)	2019-06-07 16:06:26 +02:00
Guillaume Abrioux	1e2f8cd909	dashboard: move defaults variables to ceph-defaults There is no need to have default values for these variables in each roles since there is no corresponding host groups Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `9f0d4d6847`)	2019-05-17 16:05:58 +02:00
Guillaume Abrioux	aa80895d19	dashboard: align the way containers are managed This commit aligns the way the different containers are managed with how it's currently done with the other ceph daemon. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `cc285c417a`)	2019-05-17 16:05:58 +02:00
Guillaume Abrioux	fe5bcc2f9f	dashboard: do not call ceph-container-common from other role use site.yml to deploy ceph-container-common in order to install docker even in non-containerized deployments since there's no RPM available to deploy the differents applications needed for ceph-dashboard. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `cdff0da7d4`)	2019-05-17 16:05:58 +02:00
Guillaume Abrioux	997d179b7c	dashboard: rename template files add .j2 to all templates file related to dashboard roles. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `3578d576a4`)	2019-05-17 16:05:58 +02:00
Boris Ranto	db3f0088fc	dashboard: Support podman This adds support for podman in dashboard-related roles. It also drops the creation of custom network for the dashboard-related roles as this functionality works in a different way with podman. Signed-off-by: Boris Ranto <branto@redhat.com> (cherry picked from commit `b4d1c3693b`)	2019-05-17 16:05:58 +02:00
Boris Ranto	5ac7559736	Merge cephmetrics/dashboard-ansible repo This commit will merge dashboard-ansible installation scripts with ceph-ansible. This includes several new roles to setup ceph-dashboard and the underlying technologies like prometheus and grafana server. Signed-off-by: Boris Ranto & Zack Cerza <team-gmeno@redhat.com> Co-authored-by: Zack Cerza <zcerza@redhat.com> Co-authored-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `2f141a6e80`)	2019-05-17 16:05:58 +02:00

19 Commits (bd5cde631bf366b4a12eb36f146aba9c257b6b75)