ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Dimitri Savineau	3bba1fd203	monitor: use quorum_status instead of ceph status The ceph status command returns a lot of information stored in variables and/or facts which could consume resources for nothing. When checking the quorum status, we're only using the quorum_names structure in the ceph status output. To optimize this, we could use the ceph quorum_status command which contains the same needed information. This command returns less information. $ ceph status -f json \| wc -c 2001 $ ceph quorum_status -f json \| wc -c 957 $ time ceph status -f json > /dev/null real 0m0.577s user 0m0.538s sys 0m0.029s $ time ceph quorum_status -f json > /dev/null real 0m0.544s user 0m0.527s sys 0m0.016s Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `88f91d8c12`)	2020-11-03 14:32:09 +01:00
wangxiaotong	b4c1f325a8	osds: use ceph osd stat instead of ceph status Improve the checked way of the OSD created checking process. This replaces the ceph status command by the ceph osd stat command. The osdmap structure isn't needed anymore. $ ceph status -f json \| wc -c 2001 $ ceph osd stat -f json \| wc -c 132 $ time ceph status -f json > /dev/null real 0m0.563s user 0m0.526s sys 0m0.036s $ time ceph osd stat -f json > /dev/null real 0m0.457s user 0m0.411s sys 0m0.045s Signed-off-by: wangxiaotong <wangxiaotong@fiberhome.com> (cherry picked from commit `b9cb0f12e9`)	2020-11-03 14:32:09 +01:00
Guillaume Abrioux	04d47d68fd	common: follow up on #5948 In addition to `f7e2b2c608` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `371d854a5c`)	2020-11-03 09:43:51 +01:00
Gaudenz Steinlin	2550e44e2f	openstack: use ceph_keyring_permissions by default Otherwise this task fails if no permission is set on the item. Previously the code omited the mode parameter if it was not set, but this was lost with commit `ab370b6ad8`. Signed-off-by: Gaudenz Steinlin <gaudenz.steinlin@cloudscale.ch> (cherry picked from commit `79ff79c422`)	2020-11-02 18:41:53 -05:00
Dimitri Savineau	ec8903f4af	podman: force log driver to journald Since we've changed to podman configuration using the detach mode and systemd type to forking then the container logs aren't present in the journald anymore. The default conmon log driver is using k8s-file. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1890439 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `16cd183b9c`)	2020-11-02 17:46:39 -05:00
Benoît Knecht	4e15d10e22	ceph-mon: Don't set monitor directory mode recursively After rolling updates performed with `infrastructure-playbooks/rolling_updates.yml`, files located in `/var/lib/ceph/mon/{{ cluster }}-{{ monitor_name }}` had mode 0755 (including the keyring), making them world-readable. This commit separates the task that configured permissions recursively on `/var/lib/ceph/mon/{{ cluster }}-{{ monitor_name }}` into two separate tasks: 1. Set the ownership and mode of the directory itself; 2. Recursively set ownership in the directory, but don't modify the mode. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch> (cherry picked from commit `0d76826bbb`)	2020-11-02 17:03:04 -05:00
Dimitri Savineau	fa83929b8e	ceph-handler: fix curl ipv6 command with rgw When using the curl command with ipv6 address and brackets then we need to use the -g option otherwise the command fails. $ curl http://[fdc2:328:750b:6983::6]:8080 curl: (3) [globbing] error: bad range specification after pos 9 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `cdb7b09cd7`)	2020-11-02 16:34:51 -05:00
Guillaume Abrioux	b5985d2e83	common: drop `fetch_directory` feature This commit drops the `fetch_directory` feature. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `1cc9666c09`)	2020-10-21 18:28:25 -04:00
Guillaume Abrioux	76be9a4292	ceph-config: ceph.conf rendering refactor This commit cleans up the `main.yml` task file of `ceph-config`. It drops the local ceph.conf generation. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `900c0f4492`)	2020-10-21 18:28:25 -04:00
Guillaume Abrioux	3eed44907b	iscsi: fix ownership on iscsi-gateway.cfg This file is currently deployed with '0644' ownership making this file readable by any user on the system. Since it contains sensitive information it should be readable by the owner only. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1890119 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `a822f77300`)	2020-10-21 18:27:50 -04:00
Guillaume Abrioux	a6dac8c93d	crash: refact caps definition there is no need to use `{{ }}` syntax here. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `a8bd947c7d`)	2020-10-20 09:09:14 +02:00
Benoît Knecht	5e67492ef4	ceph-osd: Fix check mode for start osds tasks Correctly set `osd_ids_non_container.stdout_lines` to an empty list if it's undefined (i.e. in check mode). Signed-off-by: Benoît Knecht <bknecht@protonmail.ch> (cherry picked from commit `8b0023cb77`)	2020-10-19 22:53:20 +02:00
Benoît Knecht	9f5ec22d34	ceph-mon: Fix check mode for deploy monitor tasks Skip the `get initial keyring when it already exists` task when both commands whose `stdout` output it requires have been skipped (e.g. when running in check mode). Signed-off-by: Benoît Knecht <bknecht@protonmail.ch> (cherry picked from commit `8f436ab5d8`)	2020-10-19 22:53:20 +02:00
Gaudenz Steinlin	f8a64ce452	ceph-crash: Only deploy key to targeted hosts The current task installs the ceph-crash key to "most" hosts via "delegate_to". This key is only used by the ceph-crash daemon and should just be installed on all hosts targeted by this role. There is no need for using a delegated task. Signed-off-by: Gaudenz Steinlin <gaudenz.steinlin@cloudscale.ch> (cherry picked from commit `68cc93fb18`)	2020-10-19 20:20:25 +02:00
Guillaume Abrioux	0c66f90968	ceph-osd: start osd after systemd overrides The service should be started after the ceph-osd systemd overrides has been added, otherwise, the latter isn't considered. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1860739 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `59d0f01992`)	2020-10-15 13:52:35 +02:00
Dimitri Savineau	3f610811fe	ceph-osd: don't start the OSD services twice Using the + operation on two lists doesn't filter out the duplicate keys. Currently each OSDs is started (via systemd) twice. Instead we could use the union filter. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `4eaa65c362`)	2020-10-14 09:57:52 -04:00
Guillaume Abrioux	d258bf4d2d	handler: refact check_socket_non_container the `stat --printf=%n` returns something like following: ``` ok: [osd0] => changed=false cmd: \|- stat --printf=%n /var/run/ceph/ceph-osd*.asok delta: '0:00:00.009388' end: '2020-10-06 06:18:28.109500' failed_when_result: false rc: 0 start: '2020-10-06 06:18:28.100112' stderr: '' stderr_lines: <omitted> stdout: /var/run/ceph/ceph-osd.2.asok/var/run/ceph/ceph-osd.5.asok stdout_lines: <omitted> ``` it makes the next task "check if the ceph osd socket is in-use" grep like this: ``` ok: [osd0] => changed=false cmd: - grep - -q - /var/run/ceph/ceph-osd.2.asok/var/run/ceph/ceph-osd.5.asok - /proc/net/unix ``` which will obviously fail because this path never exists. It makes the OSD handler broken. Let's use `find` module instead. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `46d4d97da9`)	2020-10-09 13:55:28 +02:00
Benoît Knecht	c733af9d43	Fix Ansible check mode for site.yml.sample playbook Make sure the `site.yml.sample` playbook can be run in check mode by skipping tasks that try to read the output of commands that have been skipped. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch> (cherry picked from commit `54ba38e35e`)	2020-10-07 07:06:19 +02:00
Dimitri Savineau	2185a2201d	library: add radosgw_zone module This adds radosgw_zone ansible module for replacing the command module usage with the radosgw-admin zone command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `1281e8bcc8`)	2020-10-06 15:00:17 +02:00
Dimitri Savineau	5a371b3607	library: add radosgw_zonegroup module This adds radosgw_zonegroup ansible module for replacing the command module usage with the radosgw-admin zonegroup command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `65dbe0782e`)	2020-10-06 15:00:17 +02:00
Dimitri Savineau	1643210ca6	library: add radosgw_realm module This adds radosgw_realm ansible module for replacing the command module usage with the radosgw-admin realm command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `d171f4068d`)	2020-10-06 15:00:17 +02:00
Dimitri Savineau	858169a27d	library: add radosgw_user module This adds radosgw_user ansible module for replacing the command module usage with the radosgw-admin user command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `235c7e27cc`)	2020-10-06 15:00:17 +02:00
Dimitri Savineau	4fc2d788b4	library: add ceph_fs module This adds the ceph_fs ansible module for replacing the command module usage with the ceph fs command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `bd611a785b`)	2020-10-06 14:59:49 +02:00
Guillaume Abrioux	2a3b563c7e	rgw: fix multi instances scaleout in baremetal When rgw and osd are collocated, the current workflow prevents from scaling out the radosgw_num_instances parameter when rerunning the playbook in baremetal deployments. When ceph-osd notifies handlers, it means rgw handlers are triggered too. The issue with this is that they are triggered before the role ceph-rgw is run. In the case a scaleout operation is expected on `radosgw_num_instances` it causes an issue because keyrings haven't been created yet so the new instances won't start. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1881313 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `a802fa2810`)	2020-10-06 10:31:34 +02:00
Dimitri Savineau	a5f19b7864	ceph_key: remove backward compatibility It's time to remove this backward compatibility. Users had enough time to convert their openstack_keys and key values. We now fail in ceph-validate if the caps key isn't set. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `c960362639`)	2020-10-06 10:09:16 +02:00
Guillaume Abrioux	32be163360	ceph-osd: refact `docker_exec_start_osd` This commit drops nested jinja construction in this set_fact task. It also rename it to `container_exec_start_osd` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `ff95fa9c32`)	2020-10-06 09:54:50 +02:00
Guillaume Abrioux	80879df44d	defaults: change defaults value this commit changes defaults value in default pool definitions. there's no need to define `pg_num`, `pgp_num`, `size` and `min_size`, `ceph_pool` module will use the current default if needed. This also drops the 3 following `set_fact` in `ceph-facts`: - osd_pool_default_pg_num, - osd_pool_default_pgp_num, - osd_pool_default_size_num Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `c101cb3931`)	2020-10-02 09:32:53 +02:00
Guillaume Abrioux	cb44f655fc	ceph_pool: refact module remove complexity about current defaults in running cluster Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `29fc115f4a`)	2020-10-02 09:32:53 +02:00
Seena Fallah	10fc2d1d92	ceph-facts: add get default crush rule from running monitor In case of deploying new monitor node to an existing cluster, osd_pool_default_crush_rule should be taken from running monitor because ceph-osd role won't be run and the new monitor will have different osd_pool_default_crush_role from other monitors. Signed-off-by: Seena Fallah <seenafallah@gmail.com> (cherry picked from commit `ff9f4d138f`)	2020-09-29 12:15:09 -04:00
Ali Maredia	9f58d4a3d1	rgw multisite: check connection for realm endpoint This commit adds connection checks before realm pulls Curls are performed on the endpoint being pulled from the mons and the rgws Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1731158 Signed-off-by: Ali Maredia <amaredia@redhat.com> (cherry picked from commit `902575369c`)	2020-09-29 09:24:59 -04:00
Tyler Bishop	e3284b20ac	facts: support device aliases for (dedicated\|bluestore_wal)_devices Just likve `devices`, this commit adds the support for linux device aliases for `dedicated_devices` and `bluestore_wal_devices`. Signed-off-by: Tyler Bishop <tbishop@liquidweb.com> (cherry picked from commit `ee4b8804ae`)	2020-09-29 09:24:17 -04:00
Dimitri Savineau	6294244c4f	ceph-handler: set handler on xxx_stat result In non containerized deployment we check if the service is running via the socket file presence. This is done via the xxx_socket_stat variable that check the file socket in the /var/run/ceph/ directory. In some scenarios, we could have the socket file still present in that directory but not used by any process. That's why we have the xxx_stat variable which clean those leftovers. The problem here is that we're set the variable for the handlers status (like handler_mon_status) based on xxx_socket_stat instead of xxx_stat. That means we will trigger the handlers if there's an old socket file present on the system without any process associated. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1866834 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `733596582d`)	2020-09-29 09:23:45 -04:00
Dimitri Savineau	77c5115ad2	ceph-iscsi: create pool once from monitor `af9f6684` introduced a regression on the ceph iscsi pool creation because it was delegated to the first monitor node before that change. This patch restores the initial worflow. When the iscsi node doesn't have the admin keyring then the pool creation fails. This commit also ensures that the pool creation is only executed once when having multiple iscsi nodes. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `501b8e0fd3`)	2020-09-29 09:23:14 -04:00
Dimitri Savineau	babc8a05fd	ceph-mds: remove unused block condition Since `af9f6684` the cephfs pool(s) creation don't use the fs_pools_created variable anymore because the ceph_pool module is idempotent. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `1db4dc807c`)	2020-09-28 20:39:25 -04:00
Seena Fallah	9b0f45431d	ceph-facts: check for mon socket in its own host delegate to its own host after checking mon socket to findout if mon socket is in-use or not. Signed-off-by: Seena Fallah <seenafallah@gmail.com> (cherry picked from commit `69f7e35382`)	2020-09-28 20:38:52 -04:00
Guillaume Abrioux	5538dd8b3b	Revert "ceph-rgw: remove ceph_pool state and default value" This reverts commit `ba3512a8fc`. (cherry picked from commit `bf7b044c9a`) Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-09-28 16:47:45 -04:00
Dimitri Savineau	7e2e11320d	ceph-rgw: remove ceph_pool state and default value Since the state is now optional and default values are handled in the ceph_pool module itself. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `ba3512a8fc`)	2020-09-25 14:05:34 -04:00
Dimitri Savineau	8d49d97582	ceph-config: remove ceph_release from ceph.conf.j2 We don't use ceph_release variable in the ceph.conf jinja template. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `62bd41f0d4`)	2020-09-25 13:37:36 -04:00
Dmitriy Rabotyagov	f996a232d7	Remove libjemalloc1 installation task libjemalloc1 package is not required neither for ganesha dependency nor for the package build process. So this task can be simply dropped. Signed-off-by: Dmitriy Rabotyagov <noonedeadpunk@ya.ru> (cherry picked from commit `297532ca41`)	2020-09-24 14:02:27 -04:00
Guillaume Abrioux	b714e04d91	facts: refact `ceph_uid` fact There's no need to set this fact with a `set_fact` We can achieve this in `ceph-defaults` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `bcc673f66c`)	2020-09-21 12:26:19 -04:00
Dimitri Savineau	c0654fa37a	container: quote registry password When using a quote in the registry password then we have the following error: The error was: ValueError: No closing quotation To fix this we need to use the quote filter. Close: https://bugzilla.redhat.com/show_bug.cgi?id=1880252 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `6dcfdf17d4`)	2020-09-18 14:25:44 -04:00
Guillaume Abrioux	7e29b22c76	facts: fix 'set_fact rgw_instances with rgw multisite' the current condition doesn't work, as soon as the first iteration is done the condition makes next iterations skip since `rgw_instances` got set with the first iteration. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1859872 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `ff19c1d851`)	2020-09-18 10:35:20 -04:00
Dimitri Savineau	fb43400903	ceph-infra: include iscsi nodes for logrotate The iscsi nodes aren't included in the logrotate condition. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `85643edfe3`)	2020-09-17 14:49:35 -04:00
Guillaume Abrioux	3e53d2a811	infra: support log rotation for tcmu-runner This commit adds the log rotation support for tcmu-runner. ceph-container related PR: ceph/ceph-container#1726 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1873915 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f576c02ff7`)	2020-09-16 21:35:48 -04:00
Dimitri Savineau	8f175eb60c	container: add optional http(s) proxy option When using a http(s) proxy with either docker or podman we can rely on the HTTP_PROXY, HTTPS_PROXY and NO_PROXY environment variables. But with ansible, even if those variables are defined in a source file then they aren't loaded during the container pull/login tasks. This implements the http(s) proxy support with docker/podman. Both implementations are different: 1/ docker doesn't rely en the environment variables with the CLI. Thos are needed by the docker daemon via systemd. 2/ podman uses the environment variables so we need to add them to the login/pull tasks. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1876692 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `bda3581294`)	2020-09-16 11:32:14 -04:00
Dimitri Savineau	95a073cb3b	ceph-prometheus: update pool stat counter Since [1] The bytes_used pool counter in prometheus has been renamed to stored. Closes: #5781 [1] https://github.com/ceph/ceph/commit/71fe9149 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `e54b924eaf`)	2020-09-16 10:08:46 -04:00
Dimitri Savineau	25ba7f5314	node-exporter: exclude client nodes We don't need to install node-exporter on client node because there's no ceph services running on them. This also makes sure we use the group name variables in the prometheus service template instead of hardcoding the values. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `b105549ed8`)	2020-09-14 16:13:11 -04:00
Dimitri Savineau	24698e7f4b	dashboard: use run_once at block level Instead of using run_once: true on each tasks in a block section, we can use the run_once statement at the block level. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `2c4af70abd`)	2020-09-14 15:54:22 -04:00
Dimitri Savineau	23522a11e4	ceph_key: set state as optional Most ansible module using a state parameter default to the present value (when available) instead of using it as a mandatory option. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `abb4023d76`)	2020-09-14 15:37:56 -04:00
Dimitri Savineau	e785654632	ceph_pool: set state as optional Most ansible module using a state parameter default to the present value (when available) instead of using it as a mandatory option. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `3a05aeb6cb`)	2020-09-14 15:37:56 -04:00

1 2 3 4 5 ...

2714 Commits (3bba1fd203ccf10fc9726570f20d875846c827d9)