ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	18240d8b99	library: remove legacy file This file is a leftover and should have been removed when we dropped the validate module. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `8603cba9ab`)	2020-10-02 09:01:33 -04:00
Guillaume Abrioux	4a56537680	fs2bs: support `osd_auto_discovery` scenario This commit adds the `osd_auto_discovery` scenario support in the filestore-to-bluestore playbook. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1881523 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> Co-authored-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `8b1eeef18a`)	2020-09-29 10:48:36 -04:00
Seena Fallah	eebed2990d	ceph-facts: add get default crush rule from running monitor In case of deploying new monitor node to an existing cluster, osd_pool_default_crush_rule should be taken from running monitor because ceph-osd role won't be run and the new monitor will have different osd_pool_default_crush_role from other monitors. Signed-off-by: Seena Fallah <seenafallah@gmail.com> (cherry picked from commit `ff9f4d138f`)	2020-09-29 16:38:38 +02:00
Ali Maredia	b753e7db15	rgw multisite: check connection for realm endpoint This commit adds connection checks before realm pulls Curls are performed on the endpoint being pulled from the mons and the rgws Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1731158 Signed-off-by: Ali Maredia <amaredia@redhat.com> (cherry picked from commit `902575369c`)	2020-09-29 16:33:20 +02:00
Dimitri Savineau	fabaec6351	ceph-handler: set handler on xxx_stat result In non containerized deployment we check if the service is running via the socket file presence. This is done via the xxx_socket_stat variable that check the file socket in the /var/run/ceph/ directory. In some scenarios, we could have the socket file still present in that directory but not used by any process. That's why we have the xxx_stat variable which clean those leftovers. The problem here is that we're set the variable for the handlers status (like handler_mon_status) based on xxx_socket_stat instead of xxx_stat. That means we will trigger the handlers if there's an old socket file present on the system without any process associated. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1866834 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `733596582d`)	2020-09-29 16:33:08 +02:00
Seena Fallah	0dd5036f6c	ceph-facts: check for mon socket in its own host delegate to its own host after checking mon socket to findout if mon socket is in-use or not. Signed-off-by: Seena Fallah <seenafallah@gmail.com> (cherry picked from commit `69f7e35382`)	2020-09-29 16:32:54 +02:00
Guillaume Abrioux	f9a6f775e9	mds: support enabling pg autoscaler on rerun This commit add the pg autoscaler enablement support on ceph-ansible rerun. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1836431 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-09-29 16:32:29 +02:00
Dimitri Savineau	7ffd3baa95	ceph-config: remove ceph_release from ceph.conf.j2 We don't use ceph_release variable in the ceph.conf jinja template. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `62bd41f0d4`)	2020-09-29 16:32:17 +02:00
Guillaume Abrioux	25e23b052b	ansible.cfg: remove cfg file in infrastructure-playbooks There's no need ot have a copy of this file in infrastructure-playbooks directory. playbooks in that directory can be run from the root dir of ceph-ansible. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f906caa6da`)	2020-09-29 16:31:33 +02:00
Guillaume Abrioux	c0755b1820	ansible.cfg: set force_valid_group_names param As of 2.10, group names containing a dash are invalid. However, setting this option makes it still possible to use a dash in group names and prevent this warning to show up. It might need to be definitely addressed in a future ansible release. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1880476 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `6938ed1302`)	2020-09-29 16:31:33 +02:00
Dimitri Savineau	a47a8f8543	library/ceph_key: set no_log on secret We don't need to show this information during the module execution. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `a3f4e2b4d1`)	2020-09-29 16:31:14 +02:00
Dmitriy Rabotyagov	6d5c74aa98	Remove libjemalloc1 installation task libjemalloc1 package is not required neither for ganesha dependency nor for the package build process. So this task can be simply dropped. Signed-off-by: Dmitriy Rabotyagov <noonedeadpunk@ya.ru> (cherry picked from commit `297532ca41`)	2020-09-29 16:30:36 +02:00
Benoît Knecht	564a33c0c9	README-MULTISITE: Fix syntax issues from markdownlint This commit makes the following changes: - Remove trailing whitespace; - Use consistent header levels; - Fix code blocks; - Remove hard tabs; - Fix ordered lists; - Fix bare URLs; - Use markdown list of sections. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch> (cherry picked from commit `2c244425ec`)	2020-09-25 14:48:38 -04:00
Kefu Chai	0d8ba2e60c	docs: update URLs to point to the RTD links Fixes #5798 Signed-off-by: Kefu Chai <tchaikov@gmail.com> (cherry picked from commit `f3a78371d9`)	2020-09-25 10:47:38 -04:00
Guillaume Abrioux	f9d4eb8b41	facts: refact `ceph_uid` fact There's no need to set this fact with a `set_fact` We can achieve this in `ceph-defaults` Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1875058 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `bcc673f66c`)	2020-09-21 13:49:03 -04:00
Dimitri Savineau	1385d2fdd0	ceph-facts: move facts to defaults value There's no need to define a variable via a fact if we can do it via a default value. Using a fact could be interesseting to override the default value on some condition. - ceph_uid could be set to 167 by default because it's only different on non containerized deployment on Debian/Ubuntu. - rbd_client_directory_{owner,group,mode} could be set to ceph,ceph,0770 by default install of null as we are doing in the facts. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1875058 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `7f997e623a`)	2020-09-21 13:49:03 -04:00
Dimitri Savineau	9412c44906	container: quote registry password When using a quote in the registry password then we have the following error: The error was: ValueError: No closing quotation To fix this we need to use the quote filter. Close: https://bugzilla.redhat.com/show_bug.cgi?id=1880252 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `6dcfdf17d4`)	2020-09-18 15:21:32 -04:00
Guillaume Abrioux	1527b9b12a	facts: fix 'set_fact rgw_instances with rgw multisite' the current condition doesn't work, as soon as the first iteration is done the condition makes next iterations skip since `rgw_instances` got set with the first iteration. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1859872 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `ff19c1d851`)	2020-09-18 10:35:28 -04:00
Dimitri Savineau	195ce88e26	ceph-infra: include iscsi nodes for logrotate The iscsi nodes aren't included in the logrotate condition. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `85643edfe3`)	2020-09-17 14:49:56 -04:00
Guillaume Abrioux	c60a7ad4f6	infra: support log rotation for tcmu-runner This commit adds the log rotation support for tcmu-runner. ceph-container related PR: ceph/ceph-container#1726 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1873915 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f576c02ff7`)	2020-09-16 22:37:18 -04:00
Dimitri Savineau	fbc375387a	container: add optional http(s) proxy option When using a http(s) proxy with either docker or podman we can rely on the HTTP_PROXY, HTTPS_PROXY and NO_PROXY environment variables. But with ansible, even if those variables are defined in a source file then they aren't loaded during the container pull/login tasks. This implements the http(s) proxy support with docker/podman. Both implementations are different: 1/ docker doesn't rely en the environment variables with the CLI. Thos are needed by the docker daemon via systemd. 2/ podman uses the environment variables so we need to add them to the login/pull tasks. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1876692 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `bda3581294`)	2020-09-16 11:32:24 -04:00
Dimitri Savineau	13fb83fc93	ceph-prometheus: update pool stat counter Since [1] The bytes_used pool counter in prometheus has been renamed to stored. Closes: #5781 [1] https://github.com/ceph/ceph/commit/71fe9149 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `e54b924eaf`)	2020-09-16 10:08:54 -04:00
Dimitri Savineau	05d4e76d42	switch2container: chown symlink for devices If the OSD directory is using symlinks for referencing devices (like block, db, wal for bluestore and journal for filestore) then the chown command could fail to change the owner:group on some system. $ ls -hl /var/lib/ceph/osd/ceph-0/ total 28K lrwxrwxrwx 1 ceph ceph 92 Sep 15 01:53 block -> /dev/ceph-45113532-95ca-471b-bd75-51de46f1339c/osd-data-570a1aee-60c0-44c9-8036-ffed7d67a4e6 -rw------- 1 ceph ceph 37 Sep 15 01:53 ceph_fsid -rw------- 1 ceph ceph 37 Sep 15 01:53 fsid -rw------- 1 ceph ceph 55 Sep 15 01:53 keyring -rw------- 1 ceph ceph 6 Sep 15 01:53 ready -rw------- 1 ceph ceph 3 Sep 15 02:00 require_osd_release -rw------- 1 ceph ceph 10 Sep 15 01:53 type -rw------- 1 ceph ceph 2 Sep 15 01:53 whoami $ find /var/lib/ceph/osd/ceph-0 -not -user 167 -execdir chown 167:167 {} + chown: cannot dereference './block': Permission denied $ find /var/lib/ceph/osd/ceph-0 -not -user 167 /var/lib/ceph/osd/ceph-0/block Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `da4280e243`)	2020-09-15 15:30:21 -04:00
Dimitri Savineau	dac0415d75	switch2container: remove deb systemd units When running the switch2container playbook on a Debian based system then the systemd unit path isn't the same than Red Hat based system. Because the systemd unit files aren't removed then the new container systemd unit isn't take in count. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `c1af69a7e7`)	2020-09-15 15:30:21 -04:00
Dimitri Savineau	fd0b9491b6	ansible: bump to ansible 2.9 Prior this commit we were supporting both ansible 2.8 and 2.9. Let's drop 2.8 now. Closes: #5459 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1879178 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-15 13:13:09 -04:00
Guillaume Abrioux	a88f911155	purge: remove potential socket leftover This commit ensure we remove any socket left by ceph and the `ceph-osd-run.sh` script. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1861755 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `5e91e0f3e2`)	2020-09-14 16:51:00 -04:00
Guillaume Abrioux	f31258d604	tests: do not run node_exporter test on clients We need to skip these tests on client nodes since we don't deploy node_exporter on them anymore Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `5650a6d7d0`)	2020-09-14 16:13:25 -04:00
Dimitri Savineau	5cbbc904c1	node-exporter: exclude client nodes We don't need to install node-exporter on client node because there's no ceph services running on them. This also makes sure we use the group name variables in the prometheus service template instead of hardcoding the values. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `b105549ed8`)	2020-09-14 16:13:25 -04:00
Guillaume Abrioux	edb7bdd911	Revert "Make 'disable ssl for dashboard task' idempotent." This reverts commit `f607857f2a`. > That commit [1] introduced a regression in the dashboard configuration > because the ceph config get mgr xxxx command doesn't work with > nautilus. > In that release the get operation needs an entity. > [1] `f607857` Signed-off-by: Dimitri Savineau dsavinea@redhat.com	2020-09-11 09:37:23 -04:00
Guillaume Abrioux	44e3195ded	facts: refact and optimize memory consumption there's no need to run this task on all nodes. This uses too much memory for nothing. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1856981 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f0fe193d8e`)	2020-09-11 09:37:23 -04:00
Guillaume Abrioux	448f36fbbd	config: only add related rgw section there's no need to add each rgw section on all rgw nodes. With this commit, only related rgw section are rendered. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `0a581a6e60`)	2020-09-10 20:55:07 -04:00
Dimitri Savineau	6177a87185	ceph-iscsi: remove python rtslib shaman repository The rtslib python library is now available in the distribution so we shouldn't have to use the shaman repository Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `254ab54f80`)	2020-09-10 20:38:34 -04:00
Dimitri Savineau	47f24ec047	Add CentOS 8 support for rpm deployment We were only supporting CentOS 8 for containerized deployment. Since Nautilus 14.2.10 we now have el8 rpm packages so we should be able to deploy a nautilus ceph cluster with el8. Note that the nfs-ganesha isn't supported because there's no el8 rpm packages for nfs-ganesha V2.8. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-10 20:38:34 -04:00
Niko Smeds	67d505af82	Enable HAProxy backend checks for Ceph RGW Add the `check` option to server definitions to enable basic HAProxy health checks for Ceph RADOS gateway backends. Currently traffic will be forwarded to unhealthly `radosgw.service` servers. These changes resolve the issue. Signed-off-by: Niko Smeds nikosmeds@gmail.com (cherry picked from commit `a951c1a3f0`)	2020-09-10 20:38:01 -04:00
Guillaume Abrioux	97a2640714	dashboard: refact admin user creation task this commit splits this task in order to avoid using a `shell` module. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `54d3e9650f`)	2020-09-10 20:37:42 -04:00
George Shuklin	f607857f2a	Make 'disable ssl for dashboard task' idempotent. This should reduce number of 'changed' tasks during convergence test. Signed-off-by: George Shuklin <george.shuklin@gmail.com> (cherry picked from commit `73d4bb6bd6`)	2020-09-10 20:37:26 -04:00
Rafał Wądołowski	db71eabeef	Comment out ceph_custom_key Since there is a check if ceph_custom_key is defined, there is no reason to define it by default. Signed-off-by: Rafał Wądołowski <rwadolowski@cloudferro.com> (cherry picked from commit `55cd6e83e4`)	2020-09-10 20:37:15 -04:00
Anthony Rusdi	46e4d2aeeb	ceph_custom_repo: define apt and rpm key for custom repo This commit also remove the notify on new added debian repo, force update_cache to yes and define sample ceph_custom_key vars. Signed-off-by: Anthony Rusdi <33247310+antrusd@users.noreply.github.com> (cherry picked from commit `4c592066b7`)	2020-09-10 20:37:15 -04:00
Dimitri Savineau	df70345e6a	ceph-rgw: allow specifying crush rule on pool We already support specifiying a custom crush rule during pool creation in ceph-osd role but not in ceph-rgw role. This patch adds the missing code to implement this feature. Note this is only available for replicated pool not erasure. The rule must also exist prior the pool creation. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1855439 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `cb8f0237e1`)	2020-09-10 20:36:54 -04:00
Dimitri Savineau	43da364188	container: run engine/common roles on first client We already do this in the site-container.yml playbook because we don't need docker/podman installed on all client nodes and having the container image only on the first client node. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `8ecbdc6ede`)	2020-09-10 20:36:08 -04:00
Dimitri Savineau	62f70f96ca	container: don't install the engine on all clients We only need the container engine to be installed on the first clients node in order to execute the pools/keys operation. We already do the same worflow with the ceph-container-common role which pull the ceph container image. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `9805589ef9`)	2020-09-10 20:36:08 -04:00
Dimitri Savineau	69b09f9336	Allow updating crush rule on existing pool The crush rule value was only set once during the pool creation. It was not possible to update the crush rule value by updating the value in the configuration. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1847166 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-09-10 20:35:44 -04:00
Ali Maredia	30d08e1302	rgw: allow rgws to be concurrently with or without multisite Allows rgws in a ceph cluster to be run with multisite and without multisite at the same time. Signed-off-by: Ali Maredia <amaredia@redhat.com> (cherry picked from commit `5c1f4b1a1e`)	2020-09-10 20:35:28 -04:00
Guillaume Abrioux	851a89b8fc	purge-cluster: use sysfs method for unmapping rbd devices This way we keep consistency with purge-container-cluster.yml playbook. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `f77fa6e2a4`)	2020-09-10 20:35:16 -04:00
Dimitri Savineau	0f7da8b9d1	pytest: register ceph_crash mark Otherwise we see some pytest warning. PytestUnknownMarkWarning: Unknown pytest.mark.ceph_crash - is this a typo? You can register custom marks to avoid this warning - for details, see https://docs.pytest.org/en/latest/mark.html Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `03d4620269`)	2020-09-10 20:35:04 -04:00
Dimitri Savineau	182319d58c	ceph-handler: add missing condition on ceph-crash The ceph-crash tasks present in the ceph-handler role don't need to be executed on all nodes. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `18e3c7a0a2`)	2020-09-10 20:35:04 -04:00
Guillaume Abrioux	e0ad8194db	crash: rm container in ExecPreStart even with docker We should ensure the container is removed in `ExecPreStart` even when `{{ container_binary }}` is docker. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `39bb279a53`)	2020-09-10 20:35:04 -04:00
Guillaume Abrioux	66dde0034b	ceph-crash: introduce new role ceph-crash This commit introduces a new role `ceph-crash` in order to deploy everything needed for the ceph-crash daemon. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `9d2f2108e1`)	2020-09-10 20:35:04 -04:00
Dimitri Savineau	b745c76491	ceph-facts: only get fsid when monitor are present When running the rolling_update playbook with an inventory without monitor nodes defined (like external scenario) then we can't retrieve the cluster fsid from the running monitor. In this scenario we have to pass this information manually (group_vars or host_vars). Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1877426 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `f63022dfec`)	2020-09-10 17:42:28 -04:00
Dimitri Savineau	d461631c86	tests: use grafana from quay.io This changes the grafana container image regitry from docker.io to quay.io to avoid rate limit. This also adds the missing container image values for docker2podman and podman scenarios. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `dd05d8ba90`)	2020-09-10 21:37:06 +02:00

1 2 3 4 5 ...

5238 Commits (18240d8b99952a4089caa027c3dfdd0c85aeed22) All Branches Search

5238 Commits (18240d8b99952a4089caa027c3dfdd0c85aeed22)

All Branches