ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Seena Fallah	92d1c81173	systemd: export params as a variable This can help to have extra params or modify the existing ones via group vars. Signed-off-by: Seena Fallah <seenafallah@gmail.com>	2024-03-07 20:21:47 +01:00
Seena Fallah	84e10bfd03	container: cleanup container systemd units * Make common params of container args in a var to avoid duplication * The /var/lib/ceph/crash mount was missing after `637ca81c9c` * Add CEPH_USE_RANDOM_NONCE as it's needed when running inside container (can be removed for squid later) * Add NODE_NAME as some part of ceph code relies on this var * add default logging opts for Signed-off-by: Seena Fallah <seenafallah@gmail.com>	2024-02-19 23:14:26 +01:00
Guillaume Abrioux	18da10bb7a	address Ansible linter errors This addresses all errors reported by the Ansible linter. Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>	2024-02-16 00:38:19 +01:00
Guillaume Abrioux	32c2a18f8c	common: enable crb repository on mgr hosts This is needed in order to install `ceph-mgr-dashboard` as it has a dependency on `python3-grpcio-tools` which comes from crb repo. Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>	2024-02-14 09:54:13 +01:00
insatomcat	271fd82942	do not update debian cache or try to install packages when package-install is disabled When deploying with --skip-tags=package-install (when there is no access to a repository), the playbook is still trying to update the package cache, or to install ceph-mgr packages, which makes the playbook fail. This change prevents the playbook from trying to update the cache or install ceph-mgr packages when the package-install tag is skipped. Signed-off-by: Florent CARLI <florent.carli@rte-france.com>	2023-08-21 14:01:15 +02:00
Guillaume Abrioux	b03de38f39	mgr: do not use ceph/daemon entrypoint This changes the entrypoint used for ceph-mgr containerized daemons in the systemd template. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2023-05-31 23:07:13 +02:00
René Højbjerg Larsen	09590c0683	ceph-mgr: Fix reference to copy_admin_key variable Enabling installation of the admin key to mgr nodes by setting "copy_admin_key: true" is broken. This is because the variable is not referenced correctly (using inline Jinja2 templating). Signed-off-by: René Højbjerg Larsen <rhl@jfm.dk>	2023-03-16 13:14:07 +01:00
Lorenz Bausch	2f5e21b631	mgr: fix a typo This commit fixes a typo in `roles/ceph-mgr/defaults/main.yml` (s/mpdules/modules) Signed-off-by: Lorenz Bausch <info@lorenzbausch.de>	2023-03-15 16:14:44 +01:00
Teoman ONAY	d25fa6757c	Fix selinux label issues Add --security-opt label=disable to all containers accessing /var/lib/ceph. podman selinux relabeling behaviour changed since version podman-3:4.2.0-1 which prevents some containers from accessing files in these subdirectories. Signed-off-by: Teoman ONAY <tonay@ibm.com>	2023-03-15 15:51:00 +01:00
Teoman ONAY	637ca81c9c	Collocated mgr with mon fails to start on RHEL 8.7 With podman version podman-3:4.2.0-4.module+el8.7.0+17064+3b31f55c and later, mgr fails to start if mon is already running. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=2169767 Signed-off-by: Teoman ONAY <tonay@ibm.com>	2023-02-19 01:03:03 +01:00
Dmitriy Rabotyagov	2eb0a88a67	Use upstream config_template collection In order to reduce need of module internal maintenance and to join forces on plugin development, it's proposed to switch to using upstream version of config_template module. As it's shipped as collection, its installation for end-users is trivial and aligns with general approach of shipping extra modules. Signed-off-by: Dmitriy Rabotyagov <noonedeadpunk@ya.ru>	2022-01-18 20:22:10 +01:00
Guillaume Abrioux	f01536ea19	container: align systemd units with rpm Update `After=` and `Wants=` parameters in container systemd units and make them be aligned with the systemd units that come from the packaging. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=2027440 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-12-14 13:46:27 +01:00
Guillaume Abrioux	09ef465f62	containers: introduce target systemd unit This adds ceph-*.target systemd unit files support for containerized deployments. This also fixes a regression introduced by PR #6719 (rgw and nfs systemd units not getting purged) Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1962748 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-08-18 11:08:50 -04:00
Guillaume Abrioux	1db8fa8989	roles: remove leftover from pr #4319 pr #4319 introduced some useless `become: true` on systemd tasks. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-08-18 09:10:15 +02:00
Guillaume Abrioux	7511195738	common: do not log keyring secret let's not display any keyring secret by default in ansible log. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1980744 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-08-11 17:33:34 +02:00
Teoman ONAY	9b5d97adb9	podman pids.max default value is 2048, docker's one is 4096 which are sufficient for the default value (512) of rgw thread pool size. But if its value is increased near to the pids-limit value, it does not leave place for the other processes to spawn and run within the container and the container crashes. pids-limit set to unlimited regardless of the container engine. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1987041 Signed-off-by: Teoman ONAY <tonay@redhat.com>	2021-08-04 10:20:25 +02:00
Dimitri Savineau	cd06e7c046	ceph-mgr: move mgr module list to common Populating the ceph_mgr_modules list in the mgr_modules doesn't make sense since that file is only executed if the list isn't empty or we're using the dashboard. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2021-07-19 18:23:38 +02:00
Dimitri Savineau	9758e3c513	container: set tcmalloc value by default All ceph daemons need to have the TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES environment variable set to 128MB by default in container setup. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1970913 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2021-06-30 20:30:55 +02:00
Neelaksh Singh	d18a9860cd	Sensitive key data now hidden in output log Fixes: #6529 Signed-off-by: Neelaksh Singh <neelaksh48@gmail.com>	2021-06-08 20:46:37 +02:00
Guillaume Abrioux	bab403b603	container/systemd: ensure /var/log/ceph exists This adds a `ExecStartPre=-/usr/bin/mkdir -p /var/log/ceph` in all systemd service templates for all ceph daemon. This is specific to RHCS after a Leapp upgrade is done. Indeed, the `/var/log/ceph` seems to be removed after the upgrade. In order to work around this issue let's ensure the directory is present before trying to start the containers with podman. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1949489 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-04-14 16:37:33 +02:00
Alex Schultz	a7f2fa73e6	Use ansible_facts It has come to our attention that using ansible_* vars that are populated with INJECT_FACTS_AS_VARS=True is not very performant. In order to be able to support setting that to off, we need to update the references to use ansible_facts[<thing>] instead of ansible_<thing>. Related: ansible#73654 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1935406 Signed-off-by: Alex Schultz <aschultz@redhat.com>	2021-03-08 20:54:02 +01:00
Guillaume Abrioux	c68b124ba8	container: remove `--ignore` from `podman rm` command As of podman 2.0.5, `--ignore` param conflicts with `--storage`. ``` Nov 30 13:53:10 magna089 podman[164443]: Error: --storage conflicts with --volumes, --all, --latest, --ignore and --cidfile ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-11-30 12:24:11 -05:00
Dimitri Savineau	eaf0ebfc85	library: add ceph_mgr_module module This adds ceph_mgr_module ansible module for replacing the command module usage with the ceph mgr module enable/disable commands. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-11-30 16:52:02 +01:00
Guillaume Abrioux	f5ba6d9b01	containers: modify bindmount option This commit changes the bind mount option for the mount point `/var/lib/ceph` in the systemd template for mon and mgr containers. This is needed in case of collocating mon/mgr with osds using dmcrypt scenario. Once mon/mgr got converted to containers, the dmcrypt layer sub mount is still seen in `/var/lib/ceph`. For some reason it makes the corresponding devices busy so any other container can't open/close it. As a result, it prevents osds from starting properly. Since it only happens on the nodes converted before the OSD play, the idea is to bind mount `/var/lib/ceph` on mon and mgr with the `rshared` option so once the sub mount is unmounted, it is propagated inside the container so it doesn't see that mount point. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1896392 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-11-17 09:19:23 -05:00
Guillaume Abrioux	5ba7824c55	container: force rm --storage on ExecStartPre This is a workaround to avoid error like following: ``` Error: error creating container storage: the container name "ceph-mgr-magna022" is already in use by "4a5f674e113f837a0cc561dea5d2cd55d16ca159a647b7794ab06c4c276ef701" ``` that doesn't seem to be 100% reproducible but it shows up after a reboot. The only workaround we came up with at the moment is to run `podman rm --storage <container>` before starting it. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1887716 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-11-16 10:38:40 -05:00
Dimitri Savineau	59ecddcdd0	keyring: use ceph_key module for auth get command Instead of using ceph auth get command via the ansible command module then we can use the ceph_key module and the info state. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-11-02 17:17:29 +01:00
Dimitri Savineau	16cd183b9c	podman: force log driver to journald Since we've changed to podman configuration using the detach mode and systemd type to forking then the container logs aren't present in the journald anymore. The default conmon log driver is using k8s-file. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1890439 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-11-02 15:49:27 +01:00
Benoît Knecht	54ba38e35e	Fix Ansible check mode for site.yml.sample playbook Make sure the `site.yml.sample` playbook can be run in check mode by skipping tasks that try to read the output of commands that have been skipped. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch>	2020-10-07 00:29:44 +02:00
Dimitri Savineau	50104650e7	add missing boolean filter Otherwise this will generate an ansible warning about the missing filter. [DEPRECATION WARNING]: evaluating xxx as a bare variable, this behaviour will go away and you might need to add \|bool to the expression in the future. Also see CONDITIONAL_BARE_VARS configuration toggle.. This feature will be removed in version 2.12. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-28 20:45:01 +02:00
Dimitri Savineau	abb4023d76	ceph_key: set state as optional Most ansible module using a state parameter default to the present value (when available) instead of using it as a mandatory option. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-14 14:12:21 -04:00
Dimitri Savineau	47b7c00287	podman: always remove container on start In case of failure, the systemd ExecStop isn't executed so the container isn't removed. After a reboot of a failed node, the container doesn't start because the old container is still present in created state. We should always try to remove the container in ExecStartPre for this situation. A normal reboot doesn't trigger this issue and this also doesn't affect nodes running containers via docker. This behaviour was introduced by `d43769d`. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1858865 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-07-23 17:00:38 +02:00
Jonathan Rosser	92288c11c5	Install python routes package as a dependency rather than directly This is now a dependency of ceph-mgr so will be installed automatically and does not need a specific task. This change means that ceph-mgr installs correctly on Ubuntu Focal where the python3-routes package is necessary. Signed-off-by: Jonathan Rosser <jonathan.rosser@rd.bbc.co.uk>	2020-06-26 12:26:25 -04:00
Dimitri Savineau	d43769dc2a	podman: Add Type and PIDFile value to unit files This changes the way we are running the podman containers via systemd. They are now in detached mode and Type/PIDFile set. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1834974 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-06-23 09:37:50 +02:00
Dimitri Savineau	bd22f1d1ec	docker: Add Requires on docker service When using docker container engine then the systemd unit scripts only use a dependency on the docker daemon via the After parameter. But if docker is restarted on a live system then the ceph systemd units should wait for the docker daemon to be fully restarted. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1846830 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-06-22 23:08:50 +02:00
Dimitri Savineau	2547ab601a	Readd CentOS 7 with conditions The CentOS 7 distribution could still be used for deploying ceph if - it's a containerized deployment - it's a non containerized deployment without the dashboard (due to missing python3 libraries). The ceph_stable_redhat_distro variable has been removed because we can rely on the ansible_distribution_major_version fact instead. The copr el8 repository configuration is only applied for CentOS 8. The ceph-mgr-dashboard package is only installed when the dashboard_enabled variable is set to true. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-04-23 13:31:11 +02:00
Dimitri Savineau	6617d90733	ceph-mgr: add saml python lib for dashboard SSO The dashboard SSO mgr module requires the saml python library to be installed. This is only a valid scenario for RHCS deployment because the saml python library isn't available in other classic repositories. This package is present in RHCS Tools repository so we also need to enable it on the mgr nodes. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1820233 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-04-06 10:11:00 -04:00
Dimitri Savineau	5a03e0ee1c	containers: add KillMode=none to systemd templates Because we are relying on docker\|podman for managing containers then we don't need systemd to manage the process (like kill). Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-02-13 16:11:33 +01:00
Guillaume Abrioux	483adb5d79	common: add a default value for ceph_directories_mode Since this variable makes it possible to customize the mode for ceph directories, let's make it a bit more explicit by adding a default value in ceph-defaults. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-22 09:35:35 +01:00
Dmitriy Rabotyagov	2478a7b948	Fix undefined running_mon Since commit [1] running_mon introduced, it can be not defined which results in fatal error [2]. This patch defines default value which was used before patch [1] Signed-off-by: Dmitriy Rabotyagov <drabotyagov@vexxhost.com> [1] `8dcbcecd71` [2] https://zuul.opendev.org/t/openstack/build/c82a73aeabd64fd583694ed04b947731/log/job-output.txt#14011	2020-01-16 17:03:25 -05:00
Guillaume Abrioux	3e262e072b	containers: use --cpus instead --cpu-quota When using docker 1.13.1, the current condition: ``` {% if (container_binary == 'docker' and ceph_docker_version.split('.')[0] is version_compare('13', '>=')) or container_binary == 'podman' -%} ``` is wrong because it compares the first digit (1) whereas it should compare the second one. It means we always use `--cpu-quota` although documentation recommend using `--cpus` when docker version is 1.13.1 or higher. From the doc: > --cpu-quota=<value> Impose a CPU CFS quota on the container. The number of > microseconds per --cpu-period that the container is limited to before > throttled. As such acting as the effective ceiling. > If you use Docker 1.13 or higher, use --cpus instead. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-16 13:51:43 -05:00
Guillaume Abrioux	8dcbcecd71	remove container_exec_cmd_mgr fact Iterating over all monitors in order to delegate a ` {{ container_binary }}` fails when collocating mgrs with mons, because ceph-facts reset `container_exec_cmd` to point to the first member of the monitor group. The idea is to force `container_exec_cmd` to be reset in ceph-mgr. This commit also removes the `container_exec_cmd_mgr` fact. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1791282 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-15 14:03:49 -05:00
Guillaume Abrioux	cb80231725	mgr: do not copy all keyrings on all mgr There is no need to loop over all mgr nodes to set this fact, it's even breaking deployments because it tries to copy all mgr keyring on all mgr. Closes: #4602 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-10-15 15:06:46 -04:00
Guillaume Abrioux	161170524d	mgr: improve mgr keyring creation Delegating on remote node isn't necessary here since we are already iterating over the right nodes. Closes: #4518 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-10-11 09:40:07 -04:00
Guillaume Abrioux	9bad239d77	common: improve keyrings generation There is no need to get n * number of nodes the different keyrings. Adding a `run_once: true` here avoid running a ceph command too many times which could be impacting large cluster deployment. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-10-02 13:09:50 +02:00
Guillaume Abrioux	bd64167469	container: isolate systemd tasks This commit isolates the systemd unit files generation for containers into separate yml files in order to be able to import each corresponding role without playing all tasks. This is needed so we can run ceph-ansible to render systemd unit files so they call podman instead of docker. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-10-01 10:27:51 -04:00
Guillaume Abrioux	ab370b6ad8	global: remove fetch_directory dependency This commit drops the fetch_directory dependency. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1622688 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-09-26 11:35:24 +02:00
Artur Fijalkowski	011270ca69	global: make directories mode parameterizable This commit makes it possible to parametrize the ceph directories modes. So it changes hardcoded mode for ceph related directories from 0755 to customizable with `ceph_directories_mode` variable. Closes: #2920 Signed-off-by: Artur Fijalkowski <artur.fijalkowski@ing.com> Co-authored-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-08-23 09:38:17 +02:00
Guillaume Abrioux	327d564106	lint: fix error [301], add `changed_when: false` when needed This commit fixes the error [301]: `[301] Commands should not change things if nothing needs doing` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-08-23 00:23:47 +02:00
Guillaume Abrioux	5b9b841108	mgr: refact 'wait for all mgr to be up' task There's no need to use `shell` module here. Instead of using `\| python -c`, let's use `from_json` filter. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-08-07 10:33:54 +02:00
Guillaume Abrioux	ec33ee7574	mgr: fix a typo this task isn't using the right container_exec_cmd, that's delegating to the wrong node. Let's use the right fact to fix this command. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-07-29 14:42:45 +02:00

133 Commits (3c0e06ea0cdf3098497274dd8ff5a63b117d20c5)