ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	34086ec233	osd: support numactl options on OSD activate This commit adds OSD containers activate with numactl support. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1684146 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `b3eb9206fa`)	2019-03-11 09:50:29 +00:00
Guillaume Abrioux	de3465b6a3	osd: add ipc=host in systemd template for containers in addition to `15812970f0` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `d5be83e504`)	2019-02-28 13:48:39 +00:00
Guillaume Abrioux	303cc85754	osd: bind mount /var/run/udev/ without this, the command `ceph-volume lvm list --format json` hangs and takes a very long time to complete. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `7ade032807`)	2019-02-06 00:37:11 +00:00
Sébastien Han	8d1c67beb2	osd: discover osd_objectstore on the fly Applying and passing the OSD_BLUESTORE/FILESTORE on the fly is wrong for existing clusters as their config will be changed. Typically, if an OSD was prepared with ceph-disk on filestore and we change the default objectstore to bluestore, the activation will fail. The flag osd_objectstore should only be used for the preparation, not activation. The activate in this case detects the osd objecstore which prevents failures like the one described above. Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `4c51130198`)	2018-12-04 09:01:50 +00:00
Sébastien Han	1151521784	ceph-osd: change jinja condition If an existing cluster runs this config, and has ceph-disk OSD, the `expose_partitions` won't be expected by jinja since it's inside the 'old' if. We need it as part of the osd_scenario != 'lvm' condition. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1640273 Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `bef522627e`)	2018-12-04 09:01:50 +00:00
Sébastien Han	a948677de1	osd: ceph-volume activate, just pass the OSD_ID We don't need to pass the device and discover the OSD ID. We have a task that gathers all the OSD ID present on that machine, so we simply re-use them and activate them. This also handles the situation when you have multiple OSDs running on the same device. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Sébastien Han	ece9e9812e	osd: do not use expose_partitions on lvm expose_partitions is only needed on ceph-disk OSDs so we don't need to activate this code when running lvm prepared OSDs. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Sébastien Han	3ddcc9af16	ceph_volume: try to get ride of the dummy container If we run on a containerized deployment we pass an env variable which contains the container image. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Sébastien Han	aa2c1b27e3	ceph-osd: ceph-volume container support Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Noah Watkins	306e308f13	Avoid using tests as filter Fixes the deprecation warning: [DEPRECATION WARNING]: Using tests as filters is deprecated. Instead of using `result\|search` use `result is search`. Signed-off-by: Noah Watkins <nwatkins@redhat.com>	2018-10-10 04:26:33 +00:00
Sébastien Han	2ca8c51906	osd: do not remove expose_partition container The container runs with --rm which means it will be deleted by Docker when exiting. Also 'docker rm -f' is not idempotent and returns 1 if the container does not exist. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1609007 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-07-30 10:38:15 +02:00
Guillaume Abrioux	7b387b506a	osd: clean legacy syntax in ceph-osd-run.sh.j2 Quick clean on a legacy syntax due to `e0a264c7e` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-05-09 07:29:33 +02:00
Sébastien Han	65ba85aff6	Expose /var/run/ceph Useful for softwares that do data collection/monitoring like collectd. They can connect to the socket and then retrieve information. Even though the sockets are exposed now, I'm keeping the docker exec to check the socket, this will allow newer version of ceph-ansible to work with older versions. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1563280 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-20 15:48:32 +02:00
Sébastien Han	641f141c0f	selinux: remove chcon calls We know bindmount with the :z option at the end of the -v command so this will basically run the exact same command as we used to run. So to speak: chcon -Rt svirt_sandbox_file_t /var/lib/ceph Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-19 14:59:37 +02:00
Guillaume Abrioux	e537779bb3	osd: fix osd restart when dmcrypt This commit fixes a bug that occurs especially for dmcrypt scenarios. There is an issue where the 'disk_list' container can't reach the ceph cluster because it's not launched with `--net=host`. If this container can't reach the cluster, it will hang on this step (when trying to retrieve the dm-crypt key) : ``` +common_functions.sh:448: open_encrypted_part(): ceph --cluster abc12 --name \ client.osd-lockbox.9138767f-7445-49e0-baad-35e19adca8bb --keyring \ /var/lib/ceph/osd-lockbox/9138767f-7445-49e0-baad-35e19adca8bb/keyring \ config-key get dm-crypt/osd/9138767f-7445-49e0-baad-35e19adca8bb/luks +common_functions.sh:452: open_encrypted_part(): base64 -d +common_functions.sh:452: open_encrypted_part(): cryptsetup --key-file \ -luksOpen /dev/sdb1 9138767f-7445-49e0-baad-35e19adca8bb ``` It means the `ceph-run-osd.sh` script won't be able to start the `osd_disk_activate` process in ceph-container because he won't have filled the `$DOCKER_ENV` environment variable properly. Adding `--net=host` to the 'disk_list' container fixes this issue. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1543284 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-02-08 15:45:13 +01:00
Sébastien Han	bbc79765f3	osd: best effort if no device is found during activation We have a scenario when we switch from non-container to containers. This means we don't know anything about the ceph partitions associated to an OSD. Normally in a containerized context we have files containing the preparation sequence. From these files we can get the capabilities of each OSD. As a last resort we use a ceph-disk call inside a dummy bash container to discover the ceph journal on the current osd. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1525612 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-12-19 14:40:48 +01:00
Christian Berendt	50a848dc40	Rename fact docker_version to ceph_docker_version The name docker_version is very generic and is also used by other roles. As a result, there may be name conflicts. To avoid this a ceph_ prefix should be used for this fact. Since it is an internal fact renaming is not a problem.	2017-12-15 20:12:21 +01:00
John Fulton	8cba44262c	Add flags for OSD 'docker run --cpuset-{cpus,mems}' Add the variables ceph_osd_docker_cpuset_cpus and ceph_osd_docker_cpuset_mems, so that a user may specify the CPUs and memory nodes of NUMA systems on which OSD containers are run. Provides a example in osds.yaml.sample to guide user based on sample `lscpu` output since cpuset-mems refers to the memory by NUMA node only while cpuset-cpus can refer to individual vCPUs within a NUMA node.	2017-12-14 16:39:35 +01:00
Guillaume Abrioux	591d77220e	osd: always run disk_list test there is no need to have a condition on this task, this test should be always run since the result will be interpreted later. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-09 11:51:16 +01:00
Sébastien Han	d4ed9a2064	osd: enhance backward compatibility During the initial implementation of this 'old' thing we were falling into this issue without noticing https://github.com/moby/moby/issues/30341 and where blindly using --rm, now this is fixed the prepare container disappears and thus activation fail. I'm fixing this for old jewel images. Also this fixes the machine reboot case where the docker logs are purgend. In the old scenario, we now store the log locally in the same directory as the ceph-osd-run.sh script. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-11-03 11:15:23 +01:00
Sébastien Han	5f9e50dabe	Merge pull request #2103 from andymcc/tcmalloc_settings Option to set TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES	2017-10-25 17:36:04 +02:00
Andy McCrae	7f6c39102d	Option to set TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES Use "ceph_tcmalloc_max_total_thread_cache" to set the TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES value inside /etc/default/ceph for Debian installs, or /etc/sysconfig/ceph for Red Hat/CentOS installs. By default this is set to 0, so the default package value will be used, if specified this value will be changed to match the variable, and ceph osd services will be restarted.	2017-10-25 14:38:36 +01:00
Sébastien Han	968ef04324	osd: bring backward compatibility with old Jewel images There was a huge resync from luminous to jewel in ceph-docker: https://github.com/ceph/ceph-docker/pull/797 This change brought a new handy function to discover partitions tight to an OSD. This function doesn't exist in the old image so the ceph-osd-run.sh script breaks when trying to deploy Jewel OSD with that old Jewel image version. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-20 16:26:41 +02:00
Christian Berendt	cf901f0171	In docker start scripts replace \u00a0 with \u0020 This will solve the following issue when starting docker containers on ubuntu: invalid argument "1\u00a0" for --cpus=1 : failed to parse 1 as a rational number Closes-bug: #2056	2017-10-16 15:16:48 +02:00
Sébastien Han	d0a9e57bfc	osd: rollback bindmount of /run/udev This is causing unknown issues when trying to start a dmcrypt container. Basically the container is stuck at mount opening the LUKS device. This is still unknown why this is causing trouble but we need to move forward. Also, this doesn't seem to help in any ways to fix the race condition we've seen. Here is the log for dmcrypt: cryptsetup 1.7.4 processing "cryptsetup --debug --verbose --key-file key luksClose fbf8887d-8694-46ca-b9ff-be79a668e2a9" Running command close. Locking memory. Installing SIGINT/SIGTERM handler. Unblocking interruption on signal. Allocating crypt device context by device fbf8887d-8694-46ca-b9ff-be79a668e2a9. Initialising device-mapper backend library. dm version [ opencount flush ] [16384] (1) dm versions [ opencount flush ] [16384] (1) Detected dm-crypt version 1.14.1, dm-ioctl version 4.35.0. Device-mapper backend running with UDEV support enabled. dm status fbf8887d-8694-46ca-b9ff-be79a668e2a9 [ opencount flush ] [16384] (1) Releasing device-mapper backend. Trying to open and read device /dev/sdc1 with direct-io. Allocating crypt device /dev/sdc1 context. Trying to open and read device /dev/sdc1 with direct-io. Initialising device-mapper backend library. dm table fbf8887d-8694-46ca-b9ff-be79a668e2a9 [ opencount flush securedata ] [16384] (1) Trying to open and read device /dev/sdc1 with direct-io. Crypto backend (gcrypt 1.5.3) initialized in cryptsetup library version 1.7.4. Detected kernel Linux 3.10.0-693.el7.x86_64 x86_64. Reading LUKS header of size 1024 from device /dev/sdc1 Key length 32, device size 1943016847 sectors, header size 2050 sectors. Deactivating volume fbf8887d-8694-46ca-b9ff-be79a668e2a9. dm status fbf8887d-8694-46ca-b9ff-be79a668e2a9 [ opencount flush ] [16384] (1) Udev cookie 0xd4d14e4 (semid 32769) created Udev cookie 0xd4d14e4 (semid 32769) incremented to 1 Udev cookie 0xd4d14e4 (semid 32769) incremented to 2 Udev cookie 0xd4d14e4 (semid 32769) assigned to REMOVE task(2) with flags (0x0) dm remove fbf8887d-8694-46ca-b9ff-be79a668e2a9 [ opencount flush retryremove ] [16384] (1) fbf8887d-8694-46ca-b9ff-be79a668e2a9: Stacking NODE_DEL [verify_udev] Udev cookie 0xd4d14e4 (semid 32769) decremented to 1 Udev cookie 0xd4d14e4 (semid 32769) waiting for zero Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-11 13:21:37 +02:00
Sébastien Han	bf99751ce1	osd: bindmount /run/udev Ensures that "udevadm" is able to check the status of udev's event queue. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-09 17:25:45 +02:00
Sébastien Han	3bd341f6c0	osd: container use id instead of dev name Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1494127 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-03 14:44:00 +02:00
Sébastien Han	46a01df434	osd: add cluster name support I forgot to add cluster name support so some partition were never mounted correctly. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-29 20:30:54 +02:00
Sébastien Han	45797ab968	osd: fix container reboot It's sad but we can not rely on the prepare container anymore since the log are flushed after reboot. So inpecting the container does not return anything. Now, instead we use a ephemeral container to look up for the journal/block.db/block.wal (depending if filestore or bluestore) and build the activate command accordingly. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-25 13:34:47 +02:00
Sébastien Han	2fa151b9e8	container: introduce resource limitation for containers This can be controlled via 2 options: * ceph_$DAEMON_docker_memory_limit * ceph_$DAEMON_docker_cpu_limit All daemons default to 1GB for memory and 1 CPU by default. Recommendations from: https://access.redhat.com/documentation/en-us/red_hat_ceph_storage/2/html/red_hat_ceph_storage_hardware_guide/minimum_recommendations Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-06 14:52:21 +02:00
Sébastien Han	e0a264c7e9	osd: allow multi dedicated journals for containers Fix: https://bugzilla.redhat.com/show_bug.cgi?id=1475820 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-30 12:34:06 +02:00
Sébastien Han	30991b1c0a	osd: simplify scenarios There is only two main scenarios now: * collocated: everything remains on the same device: - data, db, wal for bluestore - data and journal for filestore * non-collocated: dedicated device for some of the component Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-03 10:20:39 +02:00
Guillaume Abrioux	30a0fa31e3	Docker: Fix bug "waiting for /dev/XXX to show up" Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-07-12 15:02:39 +02:00
Guillaume Abrioux	0a38bfaadc	Osd: Fix bug 'uniq' command not found Due to a breaking space introduced by `d2320e412e` the command here is broken. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-07-12 15:02:39 +02:00
Sébastien Han	d2320e412e	osd: docker, refactor ceph-osd-run.sh.j2 Easier to read and enhance. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-07-06 15:49:14 +02:00
Guillaume Abrioux	ddfe019342	Refact code `ceph-docker-common`: At the moment there is a lot of duplicated tasks in each `./roles/ceph-<role>/tasks/docker/main.yml` that could be refactored in `./roles/ceph-docker-common/tasks/main.yml`. `_containerized_deployment` variables: All `_containerized_deployment` have been refactored to a single variable `containerized_deployment` duplicate `cephx` variables in `group_vars/* have been removed. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-05-24 15:55:41 +02:00
John McEleney	f1388dc2c2	Apparmor on Ubuntu Xenial will not permit containers to mount devices, even with CAP SYS_ADMIN.	2017-04-19 19:22:02 +01:00
Tobias Florek	931027e6f7	harmonize docker names Created containers now are named more or less in the form of <ansible role>-<ansible_hostname>	2017-02-23 09:15:05 +01:00
Sébastien Han	73cf0378c2	docker: osd, do not use priviledged container anymore Oh yeah! This patch adds more fine grained control on how we run the activation osd container. We now use --device to give a read, write and mknodaccess to a specific device to be consumed by Ceph. We also use SYS_ADMIN cap to allow mount operations, ceph-disk needs to temporary mount the osd data directory during the activation sequence. This patch also enables the support of dedicated journal devices when deploying ceph-docker with ceph-ansible. Depends on https://github.com/ceph/ceph-docker/pull/478 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-02-21 15:54:36 -05:00

39 Commits (aba3d64b873907b0fa0dd6e3f90b9081aec724e3)