ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	0bb106045e	ceph-volume: refresh lvm metadata cache When running rhel8 containers on a rhel7 host, after zapping an OSD there's a discrepancy with the lvmetad cache that needs to be refreshed. Otherwise, the host still sees the lv and can makes the user confused. If user tries to redeploy an OSD, it will fail because the LV isn't present and need to be recreated. ie: ``` stderr: lsblk: ceph-block-8/block-8: not a block device stderr: blkid: error: ceph-block-8/block-8: No such file or directory stderr: Unknown device, --name=, --path=, or absolute path in /dev/ or /sys expected. usage: ceph-volume lvm prepare [-h] --data DATA [--data-size DATA_SIZE] [--data-slots DATA_SLOTS] [--filestore] [--journal JOURNAL] [--journal-size JOURNAL_SIZE] [--bluestore] [--block.db BLOCK_DB] [--block.db-size BLOCK_DB_SIZE] [--block.db-slots BLOCK_DB_SLOTS] [--block.wal BLOCK_WAL] [--block.wal-size BLOCK_WAL_SIZE] [--block.wal-slots BLOCK_WAL_SLOTS] [--osd-id OSD_ID] [--osd-fsid OSD_FSID] [--cluster-fsid CLUSTER_FSID] [--crush-device-class CRUSH_DEVICE_CLASS] [--dmcrypt] [--no-systemd] ceph-volume lvm prepare: error: Unable to proceed with non-existing device: ceph-block-8/block-8 ``` Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1886534 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-19 15:07:32 -04:00
Guillaume Abrioux	ec52e93cba	ceph-volume: dirty hack ceph-volume recently introduced a breaking change because of a `lvm batch` refactor. when rerunning `lvm batch --report --format json` on existing OSDs, it doesn't output a valid json on stdout. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-04 11:19:15 +02:00
Wong Hoi Sing Edison	268a39ca0e	library: flake8 ceph-ansible modules This commit ensure all ceph-ansible modules pass flake8 properly. Signed-off-by: Wong Hoi Sing Edison <hswong3i@gmail.com> Co-authored-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-01 11:23:52 +02:00
Benoît Knecht	85dd405814	library: Fix new-style modules check mode Running the `ceph_crush.py`, `ceph_key.py` or `ceph_volume.py` modules in check mode resulted in the following error: ``` New-style module did not handle its own exit ``` This was due to the fact that they simply returned a `dict` in that case, instead of calling `module.exit_json()`. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch>	2020-09-25 19:57:35 +02:00
Guillaume Abrioux	f402ab2b87	ceph_volume: fix regression do not skip zapping if osd_fsid is passed Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-07-08 09:52:53 -04:00
Jan Fajerski	d90834b77f	ceph-volume.py: add support for batch refactored code See https://github.com/ceph/ceph/pull/34740 for the batch changes. Signed-off-by: Jan Fajerski <jfajerski@suse.com>	2020-06-30 09:46:27 +02:00
Guillaume Abrioux	3f47236470	ceph_volume: make zap function idempotent This commit makes the zap function idempotent, especially when using lvm_volumes variable. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1845668 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-06-22 22:16:29 -04:00
Rishabh Dave	4249d1e02d	library/ceph_volume: look for error messages in stderr Error message were moved to from stdout in stderr here - `b8d6dcbe9f (diff-20f7c578a4e69ec61a5869d706567a24R137)`. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1793542 Signed-off-by: Rishabh Dave <ridave@redhat.com>	2020-04-20 15:28:40 +02:00
Dimitri Savineau	64701437de	container: remove ulimit nofile parameter Since Ceph Octopus is python3 only we don't need to specify the max open files anymore with the container engine. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-03-30 09:54:23 +02:00
Dimitri Savineau	760b6cd7b0	ceph_volume: fix multiple db/wal/journal devices When using the lvm batch ceph-volume subcommand with dedicated devices for filestore (journal) or bluestore (db/wal) then the list of devices is convert to a string instead of being extended via an iterable. This was working with only one dedicated device but starting with more then the ceph_volume module fails. TASK [ceph-osd : use ceph-volume lvm batch to create bluestore osds] ** fatal: [xxxxxx]: FAILED! => changed=true cmd: - ceph-volume - --cluster - ceph - lvm - batch - --bluestore - --yes - --prepare - --osds-per-device - '4' - /dev/nvme2n1 - /dev/nvme3n1 - /dev/nvme4n1 - /dev/nvme5n1 - /dev/nvme6n1 - --db-devices - /dev/nvme0n1 /dev/nvme1n1 - --report - --format=json msg: non-zero return code rc: 2 stderr: \|2- stderr: lsblk: /dev/nvme0n1 /dev/nvme1n1: not a block device stderr: error: /dev/nvme0n1 /dev/nvme1n1: No such file or directory stderr: Unknown device, --name=, --path=, or absolute path in /dev/ or /sys expected. usage: ceph-volume lvm batch [-h] [--db-devices [DB_DEVICES [DB_DEVICES ...]]] [--wal-devices [WAL_DEVICES [WAL_DEVICES ...]]] [--journal-devices [JOURNAL_DEVICES [JOURNAL_DEVICES ...]]] [--no-auto] [--bluestore] [--filestore] [--report] [--yes] [--format {json,pretty}] [--dmcrypt] [--crush-device-class CRUSH_DEVICE_CLASS] [--no-systemd] [--osds-per-device OSDS_PER_DEVICE] [--block-db-size BLOCK_DB_SIZE] [--block-wal-size BLOCK_WAL_SIZE] [--journal-size JOURNAL_SIZE] [--prepare] [--osd-ids [OSD_IDS [OSD_IDS ...]]] [DEVICES [DEVICES ...]] ceph-volume lvm batch: error: Unable to proceed with non-existing device: /dev/nvme0n1 /dev/nvme1n1 So the dedicated device list is considered as a single string. This commit also adds the journal_devices, block_db_devices and wal_devices documentation to the ceph_volume module. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1816713 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-03-30 09:49:54 +02:00
Guillaume Abrioux	50939369ca	library: fix bug in ceph_volume This commit fixes a regression introduced by `0326d992c2`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-03-03 13:23:57 -05:00
Guillaume Abrioux	0326d992c2	osd: add journal option in ceph_volume call (batch) This commit adds the journal option to the ceph_volume call when scenario is lvm batch Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-02-28 17:29:59 -05:00
Guillaume Abrioux	aabba3baab	ceph_volume: support filestore to bluestore migration This commit adds the filestore to bluestore migration support in ceph_volume module. We must append to the executed command only the relevant options according to what is passed in `osd_objectostore` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-08 11:48:21 +01:00
Guillaume Abrioux	0dcacdbed0	ceph_volume: add destroy option support The zap action from ceph_volume module always implies `--destroy`. This commit adds the destroy option support so we can ask ceph-volume to not use `--destroy` when zapping a device. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-12-11 09:04:41 -05:00
Guillaume Abrioux	09e04a9197	osd: add wal_devices option support to ceph_volume module This commit adds the `wal_devices` option support to the ceph_volume module. passing a devices list in `bluestore_wal_devices` will make ceph-volume creating 1 vg using these devices to create block.wal partitions. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-09-26 11:35:24 +02:00
Guillaume Abrioux	7b836eaa47	osd: add block_db_devices option support to ceph_volume module This commit adds the `block_db_devices` option support to the ceph_volume module. passing a devices list in `dedicated_devices` will make ceph-volume creating 1 vg using these devices to create block.db partitions for data devices. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-09-26 11:35:24 +02:00
Dimitri Savineau	9a4ac46d19	ceph-osd: Add ulimit nofile on container start On containerized deployment, the OSD entrypoint runs some ceph-volume commands (lvm/simple scan and/or activate) which perform badly without the ulimit option. This option was added for all previous ceph-volume commands but not on the ceph-osd container startup. Also updating hard limit value to 4096 to reflect default baremetal value. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-08-22 16:59:08 +02:00
Dimitri Savineau	a64a61429d	library/ceph_volume.py: remove six dependency The ceph nodes couldn't have the python six library installed which could lead to error during the ceph_volume custom module execution. ImportError: No module named six The six library isn't useful in this module if we're sure that all action variables passed to the build_ceph_volume_cmd function are a list and not a string. Resolves: #4071 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-07-18 15:57:28 +02:00
Dimitri Savineau	b987534881	ceph-volume: Set max open files limit on container The ceph-volume lvm list command takes ages to complete when having a lot of LV devices on containerized deployment. For instance, with 25 OSDs on a node it takes 3 mins 44s to list the OSD. Adding the max open files limit to the container engine cli when executing the ceph-volume command seems to improve a lot thee execution time ~30s. This was impacting the OSDs creation with ceph-volume (both filestore and bluestore) when using multiple LV devices. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1702285 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-06-20 22:37:40 +02:00
Rishabh Dave	ba949acab7	don't use os.path.join() on a single path component Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-03-14 22:35:12 +00:00
Noah Watkins	15812970f0	cv: expose host ipc namespace to ceph-volume container this is needed to properly handle semaphore synchronization for udev actions via dmcrypt/cryptsetup. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1683770 Signed-off-by: Noah Watkins <noahwatkins@gmail.com>	2019-02-28 12:01:18 +00:00
Guillaume Abrioux	16efdbc59b	podman: support podman installation on rhel8 Add required changes to support podman on rhel8 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1667101 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-02-05 18:14:28 +01:00
Noah Watkins	fce9f6ef60	cv: support zap by osd fsid Signed-off-by: Noah Watkins <noahwatkins@gmail.com>	2019-01-24 16:34:13 +01:00
Noah Watkins	ba0af03b43	ceph-volume: add support for inventory command Signed-off-by: Noah Watkins <nwatkins@redhat.com>	2018-12-18 10:51:31 +01:00
Sébastien Han	a42ba03d71	ceph_volume: fix unit tests Fix the container_binary to use by mocking the CEPH_CONTAINER_BINARY env variable. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-12-03 14:39:43 +01:00
Sébastien Han	a96e910114	Add new container scenario Test with podman instead of docker and also support for python 3 only. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-11-27 16:47:40 +00:00
Andrew Schoen	e13f32c1c5	ceph-volume: be idempotent when the batch strategy changes If you deploy with 2 HDDs and 1 SDD then each subsequent deploy both HDD drives will be filtered out, because they're already used by ceph. ceph-volume will report this as a 'strategy change' because the device list went from a mixed type of HDD and SDD to a single type of only SDD. This situation results in a non-zero exit code from ceph-volume. We want to handle this situation gracefully and report that nothing will be changed. A similar json structure to what would have been given by ceph-volume is returned in the 'stdout' key. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1650306 Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-11-26 23:23:50 +00:00
Sébastien Han	997667a873	osd: expose udev into the container In order to be able to retrieve udev information, we must expose its socket. As per, https://github.com/ceph/ceph/pull/25201 ceph-volume will start consuming udev output. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-11-26 18:57:12 +00:00
Maciej Naruszewicz	252d0f9cf2	ceph-volume: fix TypeError exception when setting osds-per-device > 1 osds-per-device needs to be passed to run_command as a string. Otherwise, expandvars method will try to iterate over an integer. Signed-off-by: Maciej Naruszewicz <maciej.naruszewicz@intel.com>	2018-10-29 21:56:37 +01:00
Sébastien Han	1df0a7acce	ceph_volume: add container support for batch https://tracker.ceph.com/issues/36363 has been resolved and the patch has been backported to luminous and mimic so let's enable the container support. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1541415 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-29 18:31:17 +01:00
Sébastien Han	91385e4ff6	ceph_volume: better error handling When loading the json, if invalid, we should fail with a meaningful error. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-26 11:19:24 +02:00
Sébastien Han	c58100002b	ceph_volume: expose ceph-volume logs on the host This will tremendously help debugging failures while performing any ceph-volume command in containers. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-26 11:19:24 +02:00
Sébastien Han	31a0438cb2	ceph_volume: refactor This commit does a couple of things: * Avoid code duplication * Clarify the code * add more unit tests * add myself to the author of the module Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Sébastien Han	e39fc4f6ce	ceph_volume: add container support for batch command The batch option got recently added, while rebasing this patch it was necessary to implement it. So now, the batch option can work on containerized environments. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1630977 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Sébastien Han	3ddcc9af16	ceph_volume: try to get ride of the dummy container If we run on a containerized deployment we pass an env variable which contains the container image. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Sébastien Han	aa2c1b27e3	ceph-osd: ceph-volume container support Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-10 16:08:41 -04:00
Andrew Schoen	a63ca220e6	ceph-volume: if --report fails to load json, fail with better info This handles the case gracefully where --report does not return any JSON because a validator might have failed. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-10-09 10:09:50 -04:00
Andrew Schoen	5ee305d1a0	ceph-volume: make the batch action idempotent The command is run with --report first to see if any OSDs will be created or not. If they will be, then the command is run. If not, then changed is set to False and the module exits. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-10-09 10:09:50 -04:00
Andrew Schoen	2ffad1b43a	ceph-volume: adds `lvm list` support to the ceph_volume module Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-10-09 10:09:50 -04:00
Andrew Schoen	8afef3d0de	ceph-config: use the ceph_volume module to get num_osds for lvm batch This gives us an accurate number of how many osds will be created. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-10-09 10:09:50 -04:00
Andrew Schoen	07a384ba56	ceph_volume: adds the report parameter Will pass the --report command to ceph-volume lvm batch. Results will be returned in json format. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-10-09 10:09:50 -04:00
Andrew Schoen	8bb131c712	ceph-volume: add the journal_size and block_db_size options These can be used for the the --journal-size and --block-db-size options of `lvm batch`. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-10-09 10:09:50 -04:00
Andrew Schoen	b36f3e06b5	ceph_volume: adds the osds_per_device parameter If this is set to anything other than the default value of 1 then the --osds-per-device flag will be used by the batch command to define how many osds will be created per device. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-09-12 20:27:14 +00:00
Andrew Schoen	6d431ec22d	ceph-volume: implement the 'lvm batch' subcommand This adds the action 'batch' to the ceph-volume module so that we can run the new 'ceph-volume lvm batch' subcommand. A functional test is also included. If devices is defind and osd_scenario is lvm then the 'ceph-volume lvm batch' command will be used to create the OSDs. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-08-09 09:41:58 -04:00
Andrew Schoen	4a4fb1a4df	ceph_volume: objectstore should default to 'bluestore' Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-04-10 14:19:21 +02:00
Andrew Schoen	08f4875533	ceph_volume: refactor to not run ceph osd destroy This changes state to action and gives the options 'create' or 'zap'. The zap parameter is also removed. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-04-10 14:19:21 +02:00
Andrew Schoen	36e71f6532	ceph_volume: perserve newlines in stdout and stderr when zapping Because we have many commands we might need to run the ANSIBLE_STDOUT_CALLBACK won't format these nicely because we're not reporting these back at the root level of the json result. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-04-10 14:19:21 +02:00
Andrew Schoen	a8b0d3f045	ceph_volume: rc should be 0 on successful runs Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-04-10 14:19:21 +02:00
Andrew Schoen	dbd527411c	ceph_volume: defines the zap param in module_args Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-04-10 14:19:21 +02:00
Andrew Schoen	a9b4c01b7c	ceph_volume: make state not required so I can provide a default I want a default value of 'present' for state, so it can not be made required. Othewise it'll throw a 'Module alias error' from ansible. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-04-10 14:19:21 +02:00

1 2

59 Commits (b02589ad50994be320966af78369dbb85a534dbc)