ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	8d9ebf2d0b	container: align systemd units with rpm Update `After=` and `Wants=` parameters in container systemd units and make them be aligned with the systemd units that come from the packaging. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=2027440 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f01536ea19`) (cherry picked from commit `690c879aef`)	2022-05-19 17:55:17 +02:00
Guillaume Abrioux	fe4e04f779	dashboard: allow collecting stats from the host This commit makes podman bindmount `/:/rootfs:ro` so the container can collect data from the host. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=2028775 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `0f34cd16d8`) (cherry picked from commit `2e2d893d28`)	2022-05-09 13:46:05 +02:00
Guillaume Abrioux	26b396cb6c	common: support setting pg autoscaler to off The current implementation doesn't allow to disable the pg autoscaler on created pools. This allows only 'on' or 'warn'. With this commit, this is now possible to disable it. Valid values would be ['on', 'yes', 'true', 'off', 'no', 'false'] ``` openstack_glance_pool: name: "images" pg_num: 128 pgp_num: 128 rule_name: "replicated_rule" type: 1 application: "rbd" size: 3 pg_autoscale_mode: off ``` Closes: https://bugzilla.redhat.com/show_bug.cgi?id=2062621 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `9d1ff8f236`)	2022-05-09 13:44:31 +02:00
Teoman ONAY	b22e1b87d1	Turn off SELinux separation for containers MON and RGW Initially MONs and RGW binded /etc/pki/ca-trust/extracted using the :z flag (introduced to solve an OSP TripleO issue on RHEL - #3638) but using this flag prevents local services (like sssd) running on the host from accessing the certificates/files in that folder. Signed-off-by: Teoman ONAY <tonay@redhat.com> (cherry picked from commit `7e8ce2567e`) (cherry picked from commit `cf44ad76f6`)	2022-05-09 13:44:04 +02:00
Guillaume Abrioux	45b3e50e25	facts: follow up on `aa0cc93` when these variables are defined in the inventory host file, all tasks are skipped then because the node being played isn't aware about the values from the rgw nodes. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=2063029 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `328bd7c975`)	2022-04-21 11:34:18 +02:00
Guillaume Abrioux	8d39897f38	facts: fix mon/mgr collocation `service dump` hangs when no active mgr is available. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `617dce5e10`)	2022-04-20 06:47:23 +02:00
Guillaume Abrioux	1453915ed9	dashboard: fix regression introduced by ceph/ceph-ansible/pull/7150 when no rgw is present, it fails. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=2076192 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `1a56fd6a21`)	2022-04-20 06:47:16 +02:00
Guillaume Abrioux	c4ec2c4e54	dashboard: support --limit execution with rgw When the following conditions are met: - rgw is deployed, - dashboard is deployed, - playbook is called with --limit, - a node being processed is collocated on either a mon or mgr. The playbook fails because `rgw_instances` is undefined. The idea here is to make sure this variable is always defined. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=2063029 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `aa0cc9381d`)	2022-04-14 08:50:33 +02:00
Guillaume Abrioux	49b8e0d89c	dashboard: always set `dashboard_server_addr` When running the playbook with `--limit`, if the play targeted doesn't match hosts present in the mgr group the playbook can fail. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=2063029 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `72e4654aae`) (cherry picked from commit `d1e4b83106`)	2022-03-28 10:40:36 +02:00
Guillaume Abrioux	3c05ac6c46	dashboard: fix radosgw system user creation The radosgw system user creation will fail when `rgw_instances` is set at the host_var level because this variable won't bet set on monitor nodes, given that this is where the tasks is delegated, it fails. The idea here is to check over all rgw instances that are defined and set a boolean fact in order to check if at least one instance has `rgw_zonemaster` set to `True` Closes: https://bugzilla.redhat.com/show_bug.cgi?id=2034595 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2022-01-03 10:22:09 +01:00
Guillaume Abrioux	de14c6aeb2	validate: fix bug when using vault since a variable encrypted with vault is no longer a string but a encrypted object we can't use the filter \| length, we have to convert it to a string before. Fixes: #6991 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `6ad7e52869`)	2021-11-29 13:42:43 +01:00
Guillaume Abrioux	f5dd0a8c37	mgr: append balancer module to ceph_mgr_modules otherwise the osd play in rolling_update can fail when it tries to disable it before upgrading osd nodes. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `45a1d634d8`)	2021-11-10 14:10:30 +01:00
Guillaume Abrioux	a9a7c35a74	update: support --limit on monitor nodes Change needed in order to support --limit on mon nodes. Otherwise, a call to `hostvars[groups[mon_group_name][0]]['_current_monitor_address']` throws an error: ``` "The error was: 'ansible.vars.hostvars.HostVarsVars object' has no attribute '_current_monitor_address'" ``` Closes: https://bugzilla.redhat.com/show_bug.cgi?id=2014304#c28 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `82eee4303b`)	2021-10-29 01:41:13 +02:00
Guillaume Abrioux	b8db1166c5	nfs/rgw: support enforcing keys if one sets `ceph_nfs_rgw_access_key` and/or `ceph_nfs_rgw_secret_key`, the nfs/rgw user creation won't take those variables into account and it will generate a user with automatically generated credentials. It ends up with a mismatch between what will be set in ganesha.conf and the created user. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=2010754 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-10-26 16:39:58 +02:00
Guillaume Abrioux	a890d6a043	tests: remove all references to ceph_stable_release this is legacy and not needed anymore. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f277a39dfe`)	2021-10-02 15:48:31 +02:00
Seena Fallah	7495de3120	ceph-defaults: set ceph_stable_release default to the stable branch release ceph_stable_release is a legacy from the time where a single branch of ceph-ansible supported more than one release of ceph Signed-off-by: Seena Fallah <seenafallah@gmail.com> (cherry picked from commit `fb99626987`)	2021-10-02 15:48:31 +02:00
Dimitri Savineau	ce80ed946c	ceph-defaults: set quay.io as the default registry Because the ceph container images are now only pushed to the quay.io registry then this updates the default registry value. The docker.io registry can still be used but doesn't receive updated container images. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `e7b43c1fc6`)	2021-09-09 13:48:14 +02:00
Seena Fallah	e7b0af31c3	ceph-container-engine: allow override container_package_name and container_service_name Only include specific variables when they are undefined Signed-off-by: Seena Fallah <seenafallah@gmail.com> (cherry picked from commit `95bce32270`)	2021-09-09 13:25:00 +02:00
Dimitri Savineau	8f0b0d1285	container: explicitly pull monitoring images We don't pull the monitoring container images (alertmanager, prometheus, node-exporter and grafana) in a dedicated task like we're doing for the ceph container image. This means that the container image pull is done during the start of the systemd service. By doing this, pulling the image behind a proxy isn't working with podman. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1995574 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `5bb7240f87`)	2021-08-23 16:21:19 -04:00
Guillaume Abrioux	c7cd688f2e	iscsi: don't set default value for trusted_ip_list It restricts access to the iSCSI API. It can be left empty if the API isn't going to be access from outside the gateway node Even though this seems to be a limited use case, it's better to leave it empty by default than having a meaningless default value. We could make this variable mandatory but that would be a breaking change. Let's just add a logic in the template in order to set this variable in the configuration file only if it was specified by users. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1994930 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> Co-authored-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `6802b8dddd`)	2021-08-19 12:06:58 -04:00
Guillaume Abrioux	20583e83dd	containers: introduce target systemd unit This adds ceph-*.target systemd unit files support for containerized deployments. This also fixes a regression introduced by PR #6719 (rgw and nfs systemd units not getting purged) Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1962748 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `09ef465f62`)	2021-08-18 13:43:01 -04:00
Guillaume Abrioux	6ebbda8cef	roles: remove leftover from pr #4319 pr #4319 introduced some uesless `become: true` on systemd tasks. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `1db8fa8989`)	2021-08-18 11:08:39 -04:00
Dimitri Savineau	ffe01c7ff5	ceph-mon: do not log monitor keyring We don't want to display the keyring in the ansible log. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `e44075abd6`)	2021-08-12 13:31:12 +02:00
Guillaume Abrioux	c55c87d3c5	common: do not log keyring secret let's not display any keyring secret by default in ansible log. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1980744 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `7511195738`)	2021-08-11 17:01:22 -04:00
Benoît Knecht	a8346af4f7	ceph-rgw: Work around Jinja2 < 2.8 missng eq test EL7 ships with Jinja2 version 2.7, which is missing the `eq` test. Work around this by using `match` instead. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch>	2021-08-11 13:53:44 +02:00
Benoît Knecht	66426e1316	ceph-rgw: Set pg_num on RGW pool if required If the `pg_num` value specified in `rgw_create_pools` is different from the actual value in the cluster, apply it with `ceph osd pool set`. This corresponds to the behavior of the `ceph_pool` module used in Ceph Ansible 5.0 onward. Also avoid setting the pool application if it's already done. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch>	2021-08-11 13:53:44 +02:00
Dimitri Savineau	6897153ab7	ceph-dashboard: fix TLS cert openssl generation With OpenSSL version prior 1.1.1 (like CentOS 7 with 1.0.2k), the -addext doesn't exist. As a solution, this uses the default openssl.cnf configuration file as a template and add the subjectAltName in the v3_ca section. This temp openssl configuration file is removed after the TLS certificate creation. This patch also move the run_once statement at the block level. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1978869 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `5e0ace7e54`)	2021-08-09 15:14:48 -04:00
Guillaume Abrioux	02750a94cc	dashboard: subj_alt_names fact refactor the current way the variable is built results in: ``` 2021-08-03 04:18:23,020 - ceph.ceph - INFO - ok: [ceph-sangadi-4x-indpt6-node1-installer] => changed=false ansible_facts: subj_alt_names: \|- subjectAltName=ceph-sangadi-4x-indpt6-node1-installer/subjectAltName=10.0.210.223/subjectAltName=ceph-sangadi-4x-indpt6-node1-installersubjectAltName=ceph-sangadi-4x-indpt6-node2/subjectAltName=10.0.210.252/subjectAltName=ceph-sangadi-4x-indpt6-node2/ ``` which is incorrect. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1978869 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `6f1a0634f7`)	2021-08-09 15:14:48 -04:00
Teoman ONAY	3d4e15cebf	podman pids.max default value is 2048, docker's one is 4096 which are sufficient for the default value (512) of rgw thread pool size. But if its value is increased near to the pids-limit value, it does not leave place for the other processes to spawn and run within the container and the container crashes. pids-limit set to unlimited regardless of the container engine. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1987041 Signed-off-by: Teoman ONAY <tonay@redhat.com> (cherry picked from commit `9b5d97adb9`)	2021-08-05 11:04:31 -04:00
Dimitri Savineau	1044940304	osds: use osd pool ls instead of osd dump command The ceph osd pool ls detail command is a subset of the ceph osd dump command. $ ceph osd dump --format json\|wc -c 10117 $ ceph osd pool ls detail --format json\|wc -c 4740 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `06471a4b82`)	2021-08-03 14:03:35 -04:00
Benoît Knecht	9668137daf	ceph-handler: Fix osd handler in check mode Run the Ceph commands that only gather information (without making any changes to the cluster) when running Ansible in check mode. This allows the tasks that depend on the variables set by those tasks to succeed in check mode. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch> (cherry picked from commit `498acd7527`)	2021-08-02 15:54:34 +02:00
Dimitri Savineau	deac21c6bf	ceph-defaults: add missing grafana dashboards The radosgw-sync-overview and rbd-details grafana dashboars were missing from the list. Closes: #6758 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `f0ccf3ebf0`)	2021-07-27 10:53:54 -04:00
Dimitri Savineau	c39e7cb151	alertmanager: allow disable dashboard tls verify When using self-signed/untrusted CA certificates, alertmanager displays an error in logs. With this commit this should make those messages disappear. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1936299 Co-authored-by: Guillaume Abrioux <gabrioux@redhat.com> Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `9f77b929d1`)	2021-07-26 13:19:13 -04:00
Guillaume Abrioux	72bbc8285e	dashboard: support dedicated network for the dashboard This introduces a new variable `dashboard_network` in order to support deploying the dashboard on a different subnet. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1927574 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f4f73b6197`)	2021-07-26 13:19:03 -04:00
Dimitri Savineau	00e0ebc911	multisite: use node fqdn for endpoints when https When the rgw_multisite_proto variable is set to https then we shoudn't use the IP address in the zone endpoints list but the node FQDN to match the TLS certificate CN. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1965504 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `ad05a08160`)	2021-07-26 17:54:13 +02:00
Dimitri Savineau	eba580320c	ceph-mgr: don't install dashboard pkg by default This is a partial backport of `2547ab60`. We are currently installing the ceph-mgr-dashboard package even if the dashboard_enabled variable is set to false. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2021-07-26 17:50:42 +02:00
Dimitri Savineau	8d58c50f45	ceph-mgr: move mgr module list to common Populating the ceph_mgr_modules list in the mgr_modules doesn't make sense since that file is only executed if the list isn't empty or we're using the dashboard. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `cd06e7c046`)	2021-07-26 17:50:35 +02:00
Dimitri Savineau	364186a86e	ceph-nfs: allow overriding NFS_CORE_PARAM We already have config override variables for existing block (like ganesha_ceph_export_overrides, ganesha_log_overrides, etc...) or a global one (ganesha_conf_overrides) but redefining the NFS_CORE_PARAM block in that variable will erase all previous values (currently only Bind_Addr). ganesha_core_param_overrides: \| Enable_UDP = false; NFS_Port = 2050; Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1941775 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `9817d29543`)	2021-07-26 17:50:05 +02:00
Dimitri Savineau	ddc3df9f9a	ceph-facts: move device facts to its own file Instead of reusing the condition 'inventory_hostname in groups[osds]' on each device facts tasks then we can move all the tasks into a dedicated file and set the condition on the import_tasks statement. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `d704b05e52`)	2021-07-26 17:49:03 +02:00
Dimitri Savineau	50447e89fb	ceph-validate: check logical volumes We currently don't check if the logical volume used in lvm_volumes list for either bluestore data/db/wal or filestore data/journal exist. We're only doing this on raw devices for batch scenario. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `55bca07cb6`)	2021-07-26 17:49:03 +02:00
Dimitri Savineau	ceca225344	ceph-validate: check db/journal/wal devices too When using dedicated devices for db/journal/wal objecstore with ceph-volume lvm batch then we should also validate that those devices exist and don't use a gpt partition table in addition of the devices and lvm_volume.data variables. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `808e7106de`)	2021-07-26 17:49:03 +02:00
Dimitri Savineau	fe070fc19d	ceph-validate: use root device from ansible_mounts Instead of using findmnt command to find the device associated to the root mount point then we can use the ansible_mounts fact. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `7e50380f7f`)	2021-07-26 17:49:03 +02:00
Dimitri Savineau	f317df92ac	ceph-validate: do not resolve devices This is already done in the ceph-facts role. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `0df99dda8d`)	2021-07-26 17:49:03 +02:00
Dimitri Savineau	c67bfe84eb	ceph-validate: check block presence first Instead of doing two parted calls we can check first if the device exist and then test the partition table. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `14d458b3b4`)	2021-07-26 17:49:03 +02:00
Dimitri Savineau	5ef1d630d8	ceph-validate: check devices from lvm_volumes `2888c08` introduced a regression as the check_devices tasks file was only included based on the devices variable. But that file also validate some devices from the lvm_volumes variable. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1906022 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `ac0342b72e`)	2021-07-26 17:49:03 +02:00
Dimitri Savineau	4695df6d2b	monitoring: use config_template module for config The alertmanager, grafana and prometheus configuration file are generated with the template module which doesn't allow for using config overrides. Instead we could use the config_template plugin action and add a new variable for overrides (one for each component). With this patch, one should be able to add configuration to prometheus with the following: --- alertmanager_conf_overrides: global: smtp_smarthost: 'localhost:25' ... Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1902999 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `5a41026347`)	2021-07-26 17:47:51 +02:00
Dimitri Savineau	17b9ff03d2	common: fix py2 pool_list from_json when skipped When using python 2 and the task with a loop is skipped then it generates an error. Unexpected templating type error occurred on ({{ (pool_list.stdout \| from_json)['pools'] }}): expected string or buffer Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `cf6e33346e`)	2021-07-21 09:54:46 -04:00
Guillaume Abrioux	f7882bbc02	common: disable/enable pg_autoscaler The PG autoscaler can disrupt the PG checks so the idea here is to disable it and re-enable it back after the restart is done. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `13036115e2`)	2021-07-21 09:40:18 -04:00
Neelaksh Singh	5213612eaf	Sensitive key data now hidden in output log Fixes: #6529 Signed-off-by: Neelaksh Singh <neelaksh48@gmail.com> (cherry picked from commit `d18a9860cd`)	2021-07-12 09:43:12 +02:00
Dimitri Savineau	58dddf586e	Revert "ceph-validate: check devices from lvm_volumes" This reverts commit `3557497336`. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2021-07-07 17:19:35 +02:00

1 2 3 4 5 ...

2794 Commits (6485e1a69ed90b446c2cf2d9fdbf703cd8105d6d)