ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	a8bd947c7d	crash: refact caps definition there is no need to use `{{ }}` syntax here. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-19 18:53:54 -04:00
Benoît Knecht	8b0023cb77	ceph-osd: Fix check mode for start osds tasks Correctly set `osd_ids_non_container.stdout_lines` to an empty list if it's undefined (i.e. in check mode). Signed-off-by: Benoît Knecht <bknecht@protonmail.ch>	2020-10-19 20:22:08 +02:00
Benoît Knecht	8f436ab5d8	ceph-mon: Fix check mode for deploy monitor tasks Skip the `get initial keyring when it already exists` task when both commands whose `stdout` output it requires have been skipped (e.g. when running in check mode). Signed-off-by: Benoît Knecht <bknecht@protonmail.ch>	2020-10-19 20:22:08 +02:00
Gaudenz Steinlin	68cc93fb18	ceph-crash: Only deploy key to targeted hosts The current task installs the ceph-crash key to "most" hosts via "delegate_to". This key is only used by the ceph-crash daemon and should just be installed on all hosts targeted by this role. There is no need for using a delegated task. Signed-off-by: Gaudenz Steinlin <gaudenz.steinlin@cloudscale.ch>	2020-10-19 16:54:06 +02:00
Guillaume Abrioux	59d0f01992	ceph-osd: start osd after systemd overrides The service should be started after the ceph-osd systemd overrides has been added, otherwise, the latter isn't considered. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1860739 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-15 09:19:56 +02:00
Dimitri Savineau	9252b75173	container: remove container_binding_name variable The container_binding_name package was only mandatory when we were using the docker modules (docker_image and docker_container) but since we manage both docker and podman containers without using the dedicated module then we can remove it. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-10-14 10:32:31 +02:00
Dimitri Savineau	4eaa65c362	ceph-osd: don't start the OSD services twice Using the + operation on two lists doesn't filter out the duplicate keys. Currently each OSDs is started (via systemd) twice. Instead we could use the union filter. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-10-14 10:30:39 +02:00
Guillaume Abrioux	46d4d97da9	handler: refact check_socket_non_container the `stat --printf=%n` returns something like following: ``` ok: [osd0] => changed=false cmd: \|- stat --printf=%n /var/run/ceph/ceph-osd*.asok delta: '0:00:00.009388' end: '2020-10-06 06:18:28.109500' failed_when_result: false rc: 0 start: '2020-10-06 06:18:28.100112' stderr: '' stderr_lines: <omitted> stdout: /var/run/ceph/ceph-osd.2.asok/var/run/ceph/ceph-osd.5.asok stdout_lines: <omitted> ``` it makes the next task "check if the ceph osd socket is in-use" grep like this: ``` ok: [osd0] => changed=false cmd: - grep - -q - /var/run/ceph/ceph-osd.2.asok/var/run/ceph/ceph-osd.5.asok - /proc/net/unix ``` which will obviously fail because this path never exists. It makes the OSD handler broken. Let's use `find` module instead. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-08 17:37:50 -04:00
Benoît Knecht	54ba38e35e	Fix Ansible check mode for site.yml.sample playbook Make sure the `site.yml.sample` playbook can be run in check mode by skipping tasks that try to read the output of commands that have been skipped. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch>	2020-10-07 00:29:44 +02:00
Dimitri Savineau	1281e8bcc8	library: add radosgw_zone module This adds radosgw_zone ansible module for replacing the command module usage with the radosgw-admin zone command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-10-06 10:07:58 +02:00
Dimitri Savineau	65dbe0782e	library: add radosgw_zonegroup module This adds radosgw_zonegroup ansible module for replacing the command module usage with the radosgw-admin zonegroup command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-10-06 10:07:58 +02:00
Dimitri Savineau	d171f4068d	library: add radosgw_realm module This adds radosgw_realm ansible module for replacing the command module usage with the radosgw-admin realm command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-10-06 10:07:58 +02:00
Dimitri Savineau	235c7e27cc	library: add radosgw_user module This adds radosgw_user ansible module for replacing the command module usage with the radosgw-admin user command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-10-06 10:07:58 +02:00
Dimitri Savineau	bd611a785b	library: add ceph_fs module This adds the ceph_fs ansible module for replacing the command module usage with the ceph fs command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-10-06 08:02:58 +02:00
Dimitri Savineau	c960362639	ceph_key: remove backward compatibility It's time to remove this backward compatibility. Users had enough time to convert their openstack_keys and key values. We now fail in ceph-validate if the caps key isn't set. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-10-06 07:59:38 +02:00
Guillaume Abrioux	a802fa2810	rgw: fix multi instances scaleout in baremetal When rgw and osd are collocated, the current workflow prevents from scaling out the radosgw_num_instances parameter when rerunning the playbook in baremetal deployments. When ceph-osd notifies handlers, it means rgw handlers are triggered too. The issue with this is that they are triggered before the role ceph-rgw is run. In the case a scaleout operation is expected on `radosgw_num_instances` it causes an issue because keyrings haven't been created yet so the new instances won't start. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1881313 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-06 07:38:44 +02:00
Guillaume Abrioux	ff95fa9c32	ceph-osd: refact `docker_exec_start_osd` This commit drops nested jinja construction in this set_fact task. It also rename it to `container_exec_start_osd` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-04 21:18:10 +02:00
Guillaume Abrioux	c101cb3931	defaults: change defaults value this commit changes defaults value in default pool definitions. there's no need to define `pg_num`, `pgp_num`, `size` and `min_size`, `ceph_pool` module will use the current default if needed. This also drops the 3 following `set_fact` in `ceph-facts`: - osd_pool_default_pg_num, - osd_pool_default_pgp_num, - osd_pool_default_size_num Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-02 07:42:40 +02:00
Guillaume Abrioux	29fc115f4a	ceph_pool: refact module remove complexity about current defaults in running cluster Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-02 07:42:40 +02:00
Seena Fallah	ff9f4d138f	ceph-facts: add get default crush rule from running monitor In case of deploying new monitor node to an existing cluster, osd_pool_default_crush_rule should be taken from running monitor because ceph-osd role won't be run and the new monitor will have different osd_pool_default_crush_role from other monitors. Signed-off-by: Seena Fallah <seenafallah@gmail.com>	2020-09-29 09:27:58 -04:00
Guillaume Abrioux	eefe11d90c	defaults: change default grafana-server name This change default value of grafana-server group name. Adding some tasks in ceph-defaults in order to keep backward compatibility. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-09-29 07:42:26 +02:00
Ali Maredia	902575369c	rgw multisite: check connection for realm endpoint This commit adds connection checks before realm pulls Curls are performed on the endpoint being pulled from the mons and the rgws Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1731158 Signed-off-by: Ali Maredia <amaredia@redhat.com>	2020-09-29 07:37:21 +02:00
Dimitri Savineau	e11453c6f5	Remove unused centos docker tasks The `enable extras on centos` task just doesn't work when using the variable ceph_docker_enable_centos_extra_repo to true. fatal: [xxx]; FAILED! => {"changed": false, "msg": "Parameter 'baseurl', 'metalink' or 'mirrorlist' is required."} The CentOS extras repository is enabled by default so it's pretty safe to remove this task and the associated variable. This also removes the ceph_docker_on_openstack variable as it's a leftover and it is unused. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-29 07:35:10 +02:00
Dimitri Savineau	733596582d	ceph-handler: set handler on xxx_stat result In non containerized deployment we check if the service is running via the socket file presence. This is done via the xxx_socket_stat variable that check the file socket in the /var/run/ceph/ directory. In some scenarios, we could have the socket file still present in that directory but not used by any process. That's why we have the xxx_stat variable which clean those leftovers. The problem here is that we're set the variable for the handlers status (like handler_mon_status) based on xxx_socket_stat instead of xxx_stat. That means we will trigger the handlers if there's an old socket file present on the system without any process associated. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1866834 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-29 07:32:10 +02:00
Dimitri Savineau	501b8e0fd3	ceph-iscsi: create pool once from monitor `af9f6684` introduced a regression on the ceph iscsi pool creation because it was delegated to the first monitor node before that change. This patch restores the initial worflow. When the iscsi node doesn't have the admin keyring then the pool creation fails. This commit also ensures that the pool creation is only executed once when having multiple iscsi nodes. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-29 07:31:24 +02:00
Seena Fallah	69f7e35382	ceph-facts: check for mon socket in its own host delegate to its own host after checking mon socket to findout if mon socket is in-use or not. Signed-off-by: Seena Fallah <seenafallah@gmail.com>	2020-09-29 00:21:12 +02:00
Dimitri Savineau	50104650e7	add missing boolean filter Otherwise this will generate an ansible warning about the missing filter. [DEPRECATION WARNING]: evaluating xxx as a bare variable, this behaviour will go away and you might need to add \|bool to the expression in the future. Also see CONDITIONAL_BARE_VARS configuration toggle.. This feature will be removed in version 2.12. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-28 20:45:01 +02:00
Guillaume Abrioux	bf7b044c9a	Revert "ceph-rgw: remove ceph_pool state and default value" This reverts commit `ba3512a8fc`.	2020-09-28 16:56:33 +02:00
Dimitri Savineau	1db4dc807c	ceph-mds: remove unused block condition Since `af9f6684` the cephfs pool(s) creation don't use the fs_pools_created variable anymore because the ceph_pool module is idempotent. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-28 10:22:35 +02:00
Tyler Bishop	ee4b8804ae	facts: support device aliases for (dedicated\|bluestore_wal)_devices Just likve `devices`, this commit adds the support for linux device aliases for `dedicated_devices` and `bluestore_wal_devices`. Signed-off-by: Tyler Bishop <tbishop@liquidweb.com>	2020-09-25 19:59:45 +02:00
Dimitri Savineau	ba3512a8fc	ceph-rgw: remove ceph_pool state and default value Since the state is now optional and default values are handled in the ceph_pool module itself. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-25 19:18:07 +02:00
Dimitri Savineau	4808523403	rolling_update: remove msgr2 migration In Pacific we're are sure that users already achieved the msgr2 because that was introduced in Nautilus. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-25 19:14:42 +02:00
Dimitri Savineau	62bd41f0d4	ceph-config: remove ceph_release from ceph.conf.j2 We don't use ceph_release variable in the ceph.conf jinja template. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-25 19:13:57 +02:00
Dmitriy Rabotyagov	297532ca41	Remove libjemalloc1 installation task libjemalloc1 package is not required neither for ganesha dependency nor for the package build process. So this task can be simply dropped. Signed-off-by: Dmitriy Rabotyagov <noonedeadpunk@ya.ru>	2020-09-24 13:56:16 +02:00
Dimitri Savineau	6dcfdf17d4	container: quote registry password When using a quote in the registry password then we have the following error: The error was: ValueError: No closing quotation To fix this we need to use the quote filter. Close: https://bugzilla.redhat.com/show_bug.cgi?id=1880252 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-18 11:14:00 -04:00
Guillaume Abrioux	ff19c1d851	facts: fix 'set_fact rgw_instances with rgw multisite' the current condition doesn't work, as soon as the first iteration is done the condition makes next iterations skip since `rgw_instances` got set with the first iteration. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1859872 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-09-18 10:14:34 -04:00
Dimitri Savineau	85643edfe3	ceph-infra: include iscsi nodes for logrotate The iscsi nodes aren't included in the logrotate condition. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-17 20:34:56 +02:00
Guillaume Abrioux	f576c02ff7	infra: support log rotation for tcmu-runner This commit adds the log rotation support for tcmu-runner. ceph-container related PR: ceph/ceph-container#1726 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1873915 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-09-16 20:23:22 -04:00
Dimitri Savineau	e54b924eaf	ceph-prometheus: update pool stat counter Since [1] The bytes_used pool counter in prometheus has been renamed to stored. Closes: #5781 [1] https://github.com/ceph/ceph/commit/71fe9149 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-16 09:50:42 -04:00
Dimitri Savineau	bda3581294	container: add optional http(s) proxy option When using a http(s) proxy with either docker or podman we can rely on the HTTP_PROXY, HTTPS_PROXY and NO_PROXY environment variables. But with ansible, even if those variables are defined in a source file then they aren't loaded during the container pull/login tasks. This implements the http(s) proxy support with docker/podman. Both implementations are different: 1/ docker doesn't rely en the environment variables with the CLI. Thos are needed by the docker daemon via systemd. 2/ podman uses the environment variables so we need to add them to the login/pull tasks. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1876692 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-16 06:52:26 +02:00
Dimitri Savineau	abb4023d76	ceph_key: set state as optional Most ansible module using a state parameter default to the present value (when available) instead of using it as a mandatory option. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-14 14:12:21 -04:00
Guillaume Abrioux	f0fc59258a	Revert "ceph_pool: use default size/min_size and rule_name" This reverts commit `142934057f`. This is already handled in the ceph_pool module itself Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-09-14 14:12:21 -04:00
Dimitri Savineau	2c4af70abd	dashboard: use run_once at block level Instead of using run_once: true on each tasks in a block section, we can use the run_once statement at the block level. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-14 13:47:36 +02:00
Dimitri Savineau	b105549ed8	node-exporter: exclude client nodes We don't need to install node-exporter on client node because there's no ceph services running on them. This also makes sure we use the group name variables in the prometheus service template instead of hardcoding the values. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-14 13:46:51 +02:00
Dimitri Savineau	3a05aeb6cb	ceph_pool: set state as optional Most ansible module using a state parameter default to the present value (when available) instead of using it as a mandatory option. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-11 10:26:15 +02:00
Dimitri Savineau	ee6f0547ba	library: add ceph_dashboard_user module This adds the ceph_dashboard_user ansible module for replacing the command module usage with the ceph dashboard ac-user-xxx command. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-11 10:16:08 +02:00
Dimitri Savineau	142934057f	ceph_pool: use default size/min_size and rule_name Before [1] we were using default value for - size - min_size - rule_name when the key wasn't present in the pool dict. The commit [1] changed this by defaulting to omit. This patch restores the original workflow by using facts: - osd_pool_default_size - osd_pool_default_min_size - ceph_osd_pool_default_crush_rule_name [1] `af9f6684f2` Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-11 10:15:28 +02:00
Dimitri Savineau	f63022dfec	ceph-facts: only get fsid when monitor are present When running the rolling_update playbook with an inventory without monitor nodes defined (like external scenario) then we can't retrieve the cluster fsid from the running monitor. In this scenario we have to pass this information manually (group_vars or host_vars). Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1877426 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-10 13:19:44 -04:00
Dimitri Savineau	8dacbce68f	ceph-rgw: use ceph_pool module Since [1] we can use the ceph_pool module instead of using the command module combined with ceph osd pool commands. [1] `bddcb439ce` Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-10 15:16:58 +02:00
Guillaume Abrioux	657e6c8c3b	tests: clean legacy clean some legacies since quay.ceph.io migration Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-09-09 14:42:41 +02:00

1 2 3 4 5 ...

2714 Commits (guits-clean_main_playbook)