ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	13036115e2	common: disable/enable pg_autoscaler The PG autoscaler can disrupt the PG checks so the idea here is to disable it and re-enable it back after the restart is done. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-07-20 07:37:07 +02:00
Dimitri Savineau	d704b05e52	ceph-facts: move device facts to its own file Instead of reusing the condition 'inventory_hostname in groups[osds]' on each device facts tasks then we can move all the tasks into a dedicated file and set the condition on the import_tasks statement. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2021-07-02 14:02:30 +02:00
Guillaume Abrioux	8279d14d32	multisite: fix bug during switch2containers When running the switch-to-containers playbook with multisite enabled, the fact "rgw_instances" is only set for the node being processed (serial: 1), the consequence of that is that the set_fact of 'rgw_instances_all' can't iterate over all rgw node in order to look up each 'rgw_instances_host'. Adding a condition checking whether hostvars[item]["rgw_instances_host"] is defined fixes this issue. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1967926 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-06-17 01:49:29 +02:00
Guillaume Abrioux	70f19be367	docker2podman: skip some role imports from handler when running docker-to-podman playbook, there's no need to call `ceph-config` and `ceph-rgw` from the role `ceph-handler`. It can even have side effects when coming from a baremetal cluster that was previously migrated using the switch-to-containers playbook. Indeed it might complain about missing .target systemd unit since they are removed during that migration. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1944999 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-04-09 15:28:50 +02:00
Guillaume Abrioux	0163ecc924	convert some missed `ansible_*` calls to `ansible_facts['*']` This converts some calls to `ansible_*` that were missed in initial PR #6312 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-03-25 15:19:13 +01:00
Alex Schultz	db031a4993	Disable facts by default in ansible.cfg As a continuation of `a7f2fa73e6`, this change switches fact injection to off by default in the provided ansible.cfg. Signed-off-by: Alex Schultz <aschultz@redhat.com>	2021-03-24 13:44:33 +01:00
Guillaume Abrioux	ccd1cbb732	facts: fix nfs/external cluster scenario These tasks shouldn't be run when at least 1 monitor isn't present in the inventory. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1937997 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2021-03-17 16:05:48 +01:00
Alex Schultz	a7f2fa73e6	Use ansible_facts It has come to our attention that using ansible_* vars that are populated with INJECT_FACTS_AS_VARS=True is not very performant. In order to be able to support setting that to off, we need to update the references to use ansible_facts[<thing>] instead of ansible_<thing>. Related: ansible#73654 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1935406 Signed-off-by: Alex Schultz <aschultz@redhat.com>	2021-03-08 20:54:02 +01:00
Dimitri Savineau	7208a39e57	ceph-facts: set rgw_instances_all fact once There's no need to set the rgw_instances_all fact for each node. We can rely on run_once for that one. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2021-02-01 13:49:12 +01:00
Dimitri Savineau	2aeab882f3	ceph-facts: fix grafana group conversion The conversion fact task was only executed when the grafana_server_group_name variable was explicitly set in the user configuration. If a user was using the default value then the conversion wasn't executed. This also adds back the default grafana_server_group_name value in case the user was using the default value and to avoid an undefined variable error. Instead of hardcoding the "monitoring" group name, we can reuse the monitoring_group_name variable. There's no need to override the monitoring_group_name variable, it's either using the default value or the one defined by the user. Finally removing the delegate_to statement on the add_host task since it's always executed on the ansible controller. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1903732 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-12-10 16:51:16 +01:00
Dimitri Savineau	2e417ab901	library: add ceph_crush_rule module This adds ceph_crush_rule ansible module for replacing the command module usage with the ceph osd crush rule commands. This module can manage both erasure and replicated crush rules. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-12-01 17:52:41 +01:00
Dimitri Savineau	e150df789e	ceph-facts: fix read osd pool default crush fact We don't need to use run_once on that task when having running monitors otherwise the read task could be skipped and the set task will fail. The conditional check 'crush_rule_variable.rc == 0' failed. The error was: error while evaluating conditional (crush_rule_variable.rc == 0): 'dict object' has no attribute 'rc' Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1898856 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-11-18 12:55:43 -05:00
Benoît Knecht	c5f7343a2f	ceph-facts: Fix osd_pool_default_crush_rule fact The `osd_pool_default_crush_rule` is set based on `crush_rule_variable`, which is the output of a `grep` command. However, two consecutive tasks can set that variable, and if the second task is skipped, it still overwrites the `crush_rule_variable`, leading the `osd_pool_default_crush_rule` to be set to `ceph_osd_pool_default_crush_rule` instead of the output of the first task. This commit ensures that the fact is set right after the `crush_rule_variable` is assigned, before it can be overwritten. Closes #5912 Signed-off-by: Benoît Knecht <bknecht@protonmail.ch>	2020-11-13 09:36:49 +01:00
Dimitri Savineau	3f9081931f	rgw/rbdmirror: use service dump instead of ceph -s The ceph status command returns a lot of information stored in variables and/or facts which could consume resources for nothing. When checking the rgw/rbdmirror services status, we're only using the servicemap structure in the ceph status output. To optimize this, we could use the ceph service dump command which contains the same needed information. This command returns less information and is slightly faster than the ceph status command. $ ceph status -f json | wc -c 2001 $ ceph service dump -f json | wc -c 1105 $ time ceph status -f json > /dev/null real 0m0.557s user 0m0.516s sys 0m0.040s $ time ceph service dump -f json > /dev/null real 0m0.454s user 0m0.434s sys 0m0.020s Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-11-03 09:05:33 +01:00
Guillaume Abrioux	1cc9666c09	common: drop `fetch_directory` feature This commit drops the `fetch_directory` feature. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-21 13:22:16 +02:00
Guillaume Abrioux	c101cb3931	defaults: change defaults value this commit changes defaults value in default pool definitions. there's no need to define `pg_num`, `pgp_num`, `size` and `min_size`, `ceph_pool` module will use the current default if needed. This also drops the 3 following `set_fact` in `ceph-facts`: - osd_pool_default_pg_num, - osd_pool_default_pgp_num, - osd_pool_default_size_num Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-10-02 07:42:40 +02:00
Seena Fallah	ff9f4d138f	ceph-facts: add get default crush rule from running monitor In case of deploying new monitor node to an existing cluster, osd_pool_default_crush_rule should be taken from running monitor because ceph-osd role won't be run and the new monitor will have different osd_pool_default_crush_rule from other monitors. Signed-off-by: Seena Fallah <seenafallah@gmail.com>	2020-09-29 09:27:58 -04:00
Guillaume Abrioux	eefe11d90c	defaults: change default grafana-server name This change default value of grafana-server group name. Adding some tasks in ceph-defaults in order to keep backward compatibility. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-09-29 07:42:26 +02:00
Seena Fallah	69f7e35382	ceph-facts: check for mon socket in its own host delegate to its own host after checking mon socket to find out if mon socket is in use or not. Signed-off-by: Seena Fallah <seenafallah@gmail.com>	2020-09-29 00:21:12 +02:00
Dimitri Savineau	50104650e7	add missing boolean filter Otherwise this will generate an ansible warning about the missing filter. [DEPRECATION WARNING]: evaluating xxx as a bare variable, this behaviour will go away and you might need to add |bool to the expression in the future. Also see CONDITIONAL_BARE_VARS configuration toggle.. This feature will be removed in version 2.12. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-28 20:45:01 +02:00
Tyler Bishop	ee4b8804ae	facts: support device aliases for (dedicated|bluestore_wal)_devices Just like `devices`, this commit adds the support for linux device aliases for `dedicated_devices` and `bluestore_wal_devices`. Signed-off-by: Tyler Bishop <tbishop@liquidweb.com>	2020-09-25 19:59:45 +02:00
Guillaume Abrioux	ff19c1d851	facts: fix 'set_fact rgw_instances with rgw multisite' the current condition doesn't work, as soon as the first iteration is done the condition makes next iterations skip since `rgw_instances` got set with the first iteration. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1859872 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-09-18 10:14:34 -04:00
Dimitri Savineau	f63022dfec	ceph-facts: only get fsid when monitor are present When running the rolling_update playbook with an inventory without monitor nodes defined (like external scenario) then we can't retrieve the cluster fsid from the running monitor. In this scenario we have to pass this information manually (group_vars or host_vars). Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1877426 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-10 13:19:44 -04:00
Guillaume Abrioux	f0fe193d8e	facts: refact and optimize memory consumption there's no need to run this task on all nodes. This uses too much memory for nothing. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1856981 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-08-20 11:16:26 -04:00
Ali Maredia	5c1f4b1a1e	rgw: allow rgws to be concurrently with or without multisite Allows rgws in a ceph cluster to be run with multisite and without multisite at the same time. Signed-off-by: Ali Maredia <amaredia@redhat.com>	2020-08-17 11:11:11 +02:00
raul	110eaf5f9f	rgw: support 1+ rgw instance in `radosgw_frontend_port` Change the radosgw_frontend_port to take into account more than 1 RGW instance. In its original form, `radosgw_frontend_port: radosgw_frontend_port | int`, it configured the 8080 port for all instances. With the following modification, `radosgw_frontend_port: radosgw_frontend_port | int + item|int`, the port is incremented by 1 per instance. Co-authored-by: Daniel Parkes <dparkes@redhat.com> Signed-off-by: raul <rmahique@redhat.com>	2020-08-11 14:05:43 +02:00
Dimitri Savineau	4e84b4beed	ceph-facts: remove mds_name fact The mds_name fact always gets the ansible_hostname value so we don't need to have a dedicated fact for this and use the ansible_hostname fact instead. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-07-23 17:02:43 +02:00
Guillaume Abrioux	f8a951f50c	facts: fix broken facts when using --limit This commit fixes these tasks when --limit is used. It makes sure the fact is set on right nodes even when the playbook is run with `--limit` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-07-20 10:56:10 -04:00
Guillaume Abrioux	bcc673f66c	facts: refact `ceph_uid` fact There's no need to set this fact with a `set_fact` We can achieve this in `ceph-defaults` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-07-09 13:37:29 +02:00
Guillaume Abrioux	e5e81843e9	facts: always set ceph_run_cmd and ceph_admin_command always set these facts on monitor nodes whatever we run with `--limit`. Otherwise, playbook will fail when using `--limit` on nodes where these facts are used on a delegated task to monitor. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-15 10:53:15 +02:00
Dimitri Savineau	ed4f23d530	ceph-facts: fix IPv6 _radosgw_address interface When using radosgw_interface and IPv6 setup then the _radosgw_address fact doesn't use square brackets compared to the radosgw_address and radosgw_address_block configuration. Closes: #5325 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-04-28 14:35:16 -04:00
Guillaume Abrioux	378405e328	mds: fix --limit run against mds nodes This commit fixes --limit runs against mds nodes. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-04-14 10:42:43 -04:00
Guillaume Abrioux	4bcc52cb2a	osd: fix monitor_name error when scaling out OSDs This commit fixes a bug when trying to scale out osd nodes with `crush_rule_config` is enabled. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1822599 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-04-09 13:46:40 -04:00
Guillaume Abrioux	1bb9860dfd	osd: use default crush rule name when needed When `rule_name` isn't set in `crush_rules` the osd pool creation will fail. This commit adds a new fact `ceph_osd_pool_default_crush_rule_name` with the default crush rule name. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1817586 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-03-31 14:49:38 -04:00
Dimitri Savineau	0487d21938	ceph-facts: fix rgw_instances_all fact The rgw_instances_all fact is supposed to be the list of all radosgw instances from all rgw nodes. But the fact is always using the local rgw_instances variable so this won't work on multiple nodes. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-03-25 08:02:13 +01:00
Guillaume Abrioux	1a7f3caecb	facts: fix typo This commit fixes a typo in some task titles Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-03-23 14:03:52 -04:00
petruha	73b3fadb0e	ceph-facts: Fix system_secret_key variable handling This commit fixes the system_secret_key variable not substituted by the right value and always using the 'system_secret_key' string instead. $ egrep 'system_(access|secret)_key' group_vars/all.yml system_access_key: foofoofoofoofoofoofo system_secret_key: barbarbarbarbarbarbarbarbarbarbarbarbarb $ ansible-playbook -vv -i hosts site.yml.sample -e rgw_multisite=true (...) - hostname: storage0 endpoint: http://192.168.100.42:8080 instance_name: rgw0 radosgw_address: 192.168.50.3 radosgw_frontend_port: 8085 rgw_realm: canada rgw_zone: montreal rgw_zone_user: justin.trudeau rgw_zone_user_display_name: Justin Trudeau rgw_zonegroup: quebec system_access_key: foofoofoofoofoofoofo system_secret_key: system_secret_key Fixes https://github.com/ceph/ceph-ansible/issues/5150 Signed-off-by: petruha <5363545+p37ruh4@users.noreply.github.com>	2020-03-16 17:38:52 -04:00
Guillaume Abrioux	60a2e28189	rgw: add multi-instances support when deploying multisite This commit adds the multi-instances when deploying rgw multisite Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> Co-authored-by: Dimitri Savineau <dsavinea@redhat.com>	2020-03-12 16:44:48 -04:00
Ali Maredia	71f55bd54d	rgw multisite: enable more than 1 realm per cluster Make it so that more than one realm, zonegroup, or zone can be created during a run of the rgw multisite ansible playbooks. The rgw hosts now need to be grouped into zones and realms in the inventory. .yml files need to be created in group_vars for the realms and zones. Sample yaml files are available. Also remove multisite destroy playbook and add --cluster before radosgw-admin commands. Remove manually added rgw_zone_endpoints var and have ceph-ansible automatically add the correct endpoints of all the rgws in a rgw_zone from the information provided in that rgws hostvars. Signed-off-by: Ali Maredia <amaredia@redhat.com>	2020-03-04 12:58:13 -05:00
Dimitri Savineau	9d3b49293d	purge: stop rgw instances by iteration It looks like that the service module doesn't support wildcard anymore for stopping/disabling multiple services. fatal: [rgw0]: FAILED! => changed=false msg: 'This module does not currently support using glob patterns, found '''' in service name: ceph-radosgw@' ...ignoring Instead we should iterate over the rgw_instances list. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-03-02 16:32:06 +01:00
Dimitri Savineau	c6e96699f7	dashboard: allow configuring multiple grafana host When using multiple grafana hosts then we set the grafana and prometheus URL and push the dashboard layout to a single node. grafana_server_addrs is the list of all grafana nodes and used during the ceph-dashboard role (on mgr/mon nodes). grafana_server_addr is the current grafana node used during the ceph-grafana and ceph-prometheus role (on grafana-server nodes). We don't have the grafana_server_addr fact duplication code between external vs collocated nodes. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1784011 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-02-10 11:18:45 -05:00
Dimitri Savineau	298ba0bf03	ceph-facts: set devices osd_auto_discovery on OSDs We only need to set the devices fact with osd_auto_discovery on OSD nodes. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-02-03 16:23:38 +01:00
Dimitri Savineau	ed461544a7	ceph-facts: remove is_podman fact This was used before the CentOS 8 requirement when using CentOS 7 atomic which has both docker and podman installed. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-02-03 10:11:03 -05:00
Dimitri Savineau	1fcafffdad	ceph-facts: fix _container_exec_cmd fact value When using different name between the inventory_hostname and the ansible_hostname then the _container_exec_cmd fact will get a wrong value based on the inventory_hostname instead of the ansible_hostname. This happens when the ceph cluster is already running (update/upgrade). Later the container exec commands will fail because the container name is wrong. We should always set the _container_exec_cmd based on the ansible_hostname fact. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1795792 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-29 08:44:59 +01:00
Vytenis Sabaliauskas	ed1eaa1f38	ceph-facts: Fix for 'running_mon is undefined' error, so that fact 'running_mon' is set once 'grep' successfully exits with 'rc == 0' Signed-off-by: Vytenis Sabaliauskas <vytenis.sabaliauskas@protonmail.com>	2020-01-23 16:27:11 +01:00
Dimitri Savineau	7f997e623a	ceph-facts: move facts to defaults value There's no need to define a variable via a fact if we can do it via a default value. Using a fact could be interesting to override the default value on some condition. - ceph_uid could be set to 167 by default because it's only different on non containerized deployment on Debian/Ubuntu. - rbd_client_directory_{owner,group,mode} could be set to ceph,ceph,0770 by default instead of null as we are doing in the facts. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-16 13:57:11 -05:00
Dimitri Savineau	e790b0851d	group_vars: remove useless files Delete legacy files that aren't used anymore. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-16 13:53:12 -05:00
Dimitri Savineau	4e7fb5d45a	drop use_fqdn variables This has been deprecated in the previous releases. Let's drop it. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-15 11:32:39 +01:00
Guillaume Abrioux	2592a1e1e8	facts: fix osp/ceph external use case `d6da508a9b` broke the osp/ceph external use case. We must skip these tasks when no monitor is present in the inventory. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1790508 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-01-13 12:06:06 -05:00
Dimitri Savineau	f940e695ab	ceph-facts: move grafana fact to dedicated file We don't need to execute the grafana fact every time but only during the dashboard deployment. Especially for ceph-grafana, ceph-prometheus and ceph-dashboard roles. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1790303 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-01-13 12:05:57 -05:00
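
Commit 110eaf5f9f above describes giving each RGW instance its own frontend port by adding the loop item to the base port. The arithmetic can be sketched in Python as follows. This is only an illustration, not code from the repository: the function name `frontend_ports` is hypothetical, and it assumes the loop item is a zero-based instance index.

```python
def frontend_ports(base_port, instance_count):
    """Return one frontend port per RGW instance (illustrative helper).

    Mirrors the Jinja expression from the commit message,
    `radosgw_frontend_port | int + item | int`, under the assumption
    that `item` is a zero-based instance index.
    """
    base = int(base_port)  # same role as the `| int` cast on the base port
    return [base + int(item) for item in range(instance_count)]


if __name__ == "__main__":
    # With a base port of 8080 and three instances, every instance gets
    # a distinct port instead of all sharing 8080.
    print(frontend_ports("8080", 3))
```

Casting both operands before adding matters because inventory variables may arrive as strings, in which case the addition would not produce a port number.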

103 Commits (b02cc6931f467a37b32ca69dc050a7fb486119e3)