ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	1525990f39	requirements: exclude ansible 2.9.10 ansible 2.9.10 seems to have introduced a bug. See https://github.com/ansible/ansible/issues/70168 This commit excludes this version from ceph-ansible requirements. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-06-19 17:32:33 -04:00
Dimitri Savineau	e41487dbce	docs: Add upgrade operation. This commit adds a chapter about the ceph upgrade process. Closes: #5393 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-06-18 17:56:53 +02:00
Dimitri Savineau	829990e60d	ceph-osd: remove ceph-osd-run.sh script Since we only have one scenario since nautilus then we can just move the container start command from ceph-osd-run.sh to the systemd unit service. As a result, the ceph-osd-run.sh.j2 template and the ceph_osd_docker_run_script_path variable are removed. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-06-18 17:51:13 +02:00
Dimitri Savineau	d67759611e	library/ceph_pool: set name parameter as required The name parameter is required. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-06-17 16:29:39 +02:00
Dimitri Savineau	0f8a61a3ae	debian/uca: remove the handler notification The "update apt cache" in the ceph-handler role was never called and the handler trigger after adding the uca repository doesn't exist at all. Instead of using a handler for that we can just set the update_cache parameter to true like the other apt_repository tasks. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-06-17 10:14:03 +02:00
Guillaume Abrioux	b91d60d384	switch_to_containers: don't set noup flag We shouldn't set this flag when running switch_to_containers playbook. Otherwise the playbook fails waiting for pgs to be clean. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1843569 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-06-17 01:32:18 +02:00
Jan Fajerski	1fe8e819f9	lvm_setup: lookup device from inventory, default to /dev/sd* names This fixes a long standing fail in ceph-volumes lvm test suite. Otherwise the default behaviour should not change. Signed-off-by: Jan Fajerski <jfajerski@suse.com>	2020-06-16 18:17:34 +02:00
Dimitri Savineau	cdb30bd125	container: inspect Id field instead of RepoDigests When a container image managed by podman isn't tagged anymore then the RepoDigests field when inspecting the image doesn't return any value. This is different from the docker workflow and it breaks the ceph-ansible container upgrade when collocating multiple services and using a non-fixed container tag (like latest or 4). $ podman images REPOSITORY TAG IMAGE ID CREATED SIZE docker.io/ceph/daemon latest 680c9c0d38c3 8 days ago 957 MB <none> <none> 011ee108bfc9 2 months ago 1.01 GB $ podman inspect 680c9c0d38c3 | jq .[0].RepoDigests[0] "docker.io/ceph/daemon@sha256:20cf789235e23ddaf38e109b391d1496bb88011239d16862c4c106d0e05fea9e" $ podman inspect 011ee108bfc9 | jq .[0].RepoDigests[0] null Because this field returns "null" then the ansible task trying to determine this value is failing ----------------------------- fatal: [foo]: FAILED! => msg: |- The task includes an option with an undefined variable. The error was: None has no element 0 The error appears to be in 'roles/ceph-container-common/tasks/fetch_image.yml': line 137, column 3, but may be elsewhere in the file depending on the exact syntax problem. The offending line appears to be: - name: set_fact ceph_osd_image_repodigest_before_pulling ^ here ----------------------------- We don't have this behaviour with docker. $ docker images REPOSITORY TAG IMAGE ID CREATED SIZE docker.io/ceph/daemon latest 680c9c0d38c3 8 days ago 928 MB docker.io/ceph/daemon <none> 011ee108bfc9 2 months ago 986 MB $ docker inspect 680c9c0d38c3 | jq .[0].RepoDigests[0] "docker.io/ceph/daemon@sha256:45e6f28bb67c81b826acb64fad5c0da1cac3dffb41a88992fe4ca2be79575fa6" $ docker inspect 011ee108bfc9 | jq .[0].RepoDigests[0] "docker.io/ceph/daemon@sha256:b393a73309d72e43ca7d65cd3519036007947671e373eb59aa75a46185c52231" Instead we should just get the Id field. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1844496 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-06-16 17:06:25 +02:00
Dimitri Savineau	50140c9b5d	switch_to_container: fix osd systemd regex The systemd LOAD and ACTIVE fields could have more than one space between both values. This updates the systemd regex the same way we're using it in other parts of the code. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1843500 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-06-16 17:04:06 +02:00
Ali Maredia	0175c205fa	rgw multisite: add master zone endpoints to zonegroup We were only adding the endpoints to the master zone but not to the zonegroup. This patch fixes the issue. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1839228 Signed-off-by: Ali Maredia <amaredia@redhat.com>	2020-06-09 09:50:18 -04:00
Dimitri Savineau	2f17f36638	mergify: remove merge on skip ci This rule will probably never be applied and at the moment this is creating a cancelled job in the CI status. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-06-09 09:30:10 -04:00
Ansible Deployment User	3f906e0c26	rgwloadbalancer undefined index variable The vrrp_instances variable is using a loop with index but the index_var wasn't defined. As a result, the fact task was failing on this undefined index variable. The task includes an option with an undefined variable. The error was: 'index' is undefined Closes: #5395 Signed-off-by: Florian Faltermeier <florian.faltermeier@uibk.ac.at>	2020-05-26 10:03:25 -04:00
Dimitri Savineau	44e1ebaaff	ceph-nfs: add stable noarch repository When using the stable nfs ganesha repository, we need to have both arch and noarch repositories enabled. Currently the noarch repository is missing, which causes the non containerized deployment to fail. Closes: #5375 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-05-16 07:34:08 +02:00
Guillaume Abrioux	8aed824f71	switch_to_container: refact wait for pg check There is no need to make this check with several steps. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-16 07:31:57 +02:00
Guillaume Abrioux	8d556b0787	tests: report coverage status for unittests This commit adds pytest-cov usage in unittests Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-16 07:31:57 +02:00
Guillaume Abrioux	886b5256fd	ceph_pool: add tests Add unit tests for ceph_pool module Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-16 07:31:57 +02:00
Guillaume Abrioux	fa3aa5a03c	ceph_pool: support setting application at pool creation This commit adds the required changes in order to support setting application pool at initial pool creation. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-16 07:31:57 +02:00
Guillaume Abrioux	c4b7d89c18	ceph_pool: refact exec_commands() We never run multiple ceph commands at a time, so there's no need to have this design. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-16 07:31:57 +02:00
Guillaume Abrioux	83faf94351	tests: update pools definitions Setting attributes with empty string is a bad user input. Also, removing `rule_name` attribute when creating a code erasure pool. (this rule isn't intended for code erasure pool type). Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-16 07:31:57 +02:00
Guillaume Abrioux	af9f6684f2	common: introduce ceph_pool module calls This commit calls the `ceph_pool` module for creating ceph pools everywhere it's needed in the playbook. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-16 07:31:57 +02:00
Guillaume Abrioux	bddcb439ce	library: add ceph_pool module This commit adds a new module `ceph_pool` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-16 07:31:57 +02:00
Guillaume Abrioux	8c7a48832c	common: fix target_size_ratio task enablement The condition on this task is wrong, we have to check whether `target_size_ratio` is set in the pool definition instead. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-15 20:57:32 +02:00
Guillaume Abrioux	e5e81843e9	facts: always set ceph_run_cmd and ceph_admin_command always set these facts on monitor nodes whatever we run with `--limit`. Otherwise, playbook will fail when using `--limit` on nodes where these facts are used on a delegated task to monitor. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-15 10:53:15 +02:00
Dimitri Savineau	5407e898a6	tests/library: parametrize ceph_volume objectstore This adds the objectstore testing for both filestore and bluestore on the ceph_volume module. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-05-14 17:46:43 +02:00
Dimitri Savineau	a8e458c452	tests/library: define container cmd once In containerized deployment, the ceph_volume module always uses the same container command prefix for all actions. Instead of duplicating this code in all container tests we can define it once. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-05-14 17:46:43 +02:00
Guillaume Abrioux	4fb9722c48	tests: force using the more recent build We should use `latest-master-devel` for switch_to_containers job. Otherwise it might happen we actually downgrade the ceph version when the image used is older than the rpm initially used for installing ceph. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-14 11:34:51 -04:00
Guillaume Abrioux	6d9acb5e6d	test: set sitepackages=false in tox Otherwise it might try to use the system installed version of ansible when there's one available. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-14 13:09:42 +02:00
Dimitri Savineau	252e78b4e4	docker2podman: manage dashboard nodes The dashboard nodes (alertmanager, grafana, node-exporter, and prometheus) were not managed during the docker to podman migration. This adds the systemd container template of those services to a dedicated file (systemd.yml) in order to include it in the docker2podman playbook. This also adds the dashboard container images pull from docker to podman. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1829389 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-05-13 12:02:00 +02:00
Dimitri Savineau	d38f21aeba	docker2podman: pull images from docker daemon The docker2podman playbook only installs the podman package and updates the systemd units with the right container_binary value. We never pull the container image so if one service is restarted then the container image will be pulled first before the service can start which could cause longer downtime. To avoid downloading the container image from the internet again we can just pull it from the local docker daemon. The container_{binding,package,service}_name variables are removed because they are only used in the ceph-container-engine role which isn't called in this playbook. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-05-13 12:02:00 +02:00
Dimitri Savineau	c0a213f928	rolling_update: fix rbdmirror group name The rbdmirror group name was using the wrong variable definition. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-05-13 11:57:42 +02:00
Dimitri Savineau	b20519efd0	dashboard: allow disabling grafana api ssl verify When using an untrusted TLS certificate (like self-signed) on grafana then the grafana dashboards update subcommand will fail. One solution could be to trust the TLS certificate. The other one is to disable the TLS verification on the grafana API. Closes: #5324 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-05-13 11:56:57 +02:00
Dimitri Savineau	222fe4abd8	ceph-nfs: bind mount ganesha log directory The current ganesha log directory is only present in the container and not bind mounted on the host. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-05-13 11:55:38 +02:00
Benoît Knecht	444b46ea24	ceph-validate: Expand templates in rgw_create_pools Same fix as `ceph-rgw` for `rgw_create_pools` pool names that contain Jinja templates. See #5348 for details. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch>	2020-05-11 11:51:27 -04:00
Benoît Knecht	d2b7670c7d	ceph-rgw: Make sure pool name templates are expanded It is common to set templated pool names in `rgw_create_pools`, e.g. ```yaml rgw_create_pools: "{{ rgw_zone }}.rgw.buckets.index": pg_num: 16 size: 3 type: replicated ``` This worked fine with Ansible 2.8, but broke in Ansible 2.9 due to a change in the way `with_dict` works [1]. This commit replaces the use of `with_dict` with ```yaml loop: "{{ rgw_create_pools | dict2items }}" ``` which works as intended and expands the template in the pool name. [1]: https://docs.ansible.com/ansible/latest/porting_guides/porting_guide_2.9.html#loops Closes #5348 Signed-off-by: Benoît Knecht <bknecht@protonmail.ch>	2020-05-11 11:51:27 -04:00
Ali Maredia	bd1440f2cd	docs: minor fixes to README-MULTISITE.md Make all of the hosts start at 1 and not 0, also make some minor changes in scenario 3 to remove an inconsistency. Signed-off-by: Ali Maredia <amaredia@redhat.com>	2020-05-08 12:06:45 -04:00
Benoît Knecht	b7efca1785	ceph-validate: Fix "fail on unsupported CentOS release" The `dashboard_enabled` condition used a `true` filter (which doesn't exist) instead of the `bool` filter. Signed-off-by: Benoît Knecht <bknecht@protonmail.ch>	2020-05-08 10:21:11 -04:00
Dimitri Savineau	34e6e8e06c	ceph-rgw: use match instead of equalto from jinja2 The '==' jinja2 operator (or 'equalto') has been introduced in jinja2 2.8. On EL7, jinja2 version is 2.7 so the operator isn't present creating templating error like: The error was: TemplateRuntimeError: no test named '==' Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1747206 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-05-06 14:23:10 -04:00
Dimitri Savineau	8a890306ad	ceph-nfs: fix internal ganesha deployment Since `ea2b654d9` we're not running the rados command from the monitor nodes but from the ganesha node. Unfortunately we don't have the required keyring on that node to run the rados command as we don't import the right keyring. This commit restores the workflow for internal ganesha deployment like before `ea2b654d9` but keeps the rados commands from the ganesha node for external deployment until we have a better design. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-05-06 11:10:08 -04:00
Dimitri Savineau	748ac4b928	ceph-nfs: fix keyring copy for external ganesha Fix the condition on the keyring copy task that prevents the ganesha keyring from being created in the /var/lib/ceph directory. Also ensure that the directory exists first. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1831285 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-05-06 11:10:08 -04:00
Guillaume Abrioux	cf460274c7	nfs: fix 2 typo The condition is missing an index here which makes the playbook fail. Typical error: ``` The conditional check 'not item.get('skipped', False)' failed. The error was: error while evaluating conditional (not item.get('skipped', False)): 'list object' has no attribute 'get'", ``` Also, adds the missing '/keyring' on the `exec_cmd_nfs` fact. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1831342 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-05-06 11:10:08 -04:00
Dimitri Savineau	ed4f23d530	ceph-facts: fix IPv6 _radosgw_address interface When using radosgw_interface and IPv6 setup then the _radosgw_address fact doesn't use square brackets compared to the radosgw_address and radosgw_address_block configuration. Closes: #5325 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-04-28 14:35:16 -04:00
fmount	5eb363e033	Refresh ceph dashboard user role This change allows the operator to refresh the ceph dashboard admin role on multiple ceph-ansible executions. In the current state the role is set only when the user is created, and there's no way to change it if the user exists. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1826002 Signed-off-by: fmount <fpantano@redhat.com>	2020-04-23 16:28:49 -04:00
Dimitri Savineau	f1728929cd	ceph-dashboard: fix mgr dashboard IPv6 fact `15ed9ee` introduced a regression for the mgr dashboard daemon using IPv6 since the mgr dashboard configuration doesn't support brackets. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1827299 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-04-23 14:44:46 -04:00
Ali Maredia	2b32604577	docs: fix multisite docs add endpoints var in rgw_instances section + Mention of this variable was missing in the original version. + Minor revisions around the concept of secondary zone. Signed-off-by: Ali Maredia <amaredia@redhat.com>	2020-04-23 11:24:11 -04:00
Dimitri Savineau	2547ab601a	Readd CentOS 7 with conditions The CentOS 7 distribution can still be used for deploying ceph if - it's a containerized deployment - it's a non containerized deployment without the dashboard (due to missing python3 libraries). The ceph_stable_redhat_distro variable has been removed because we can rely on the ansible_distribution_major_version fact instead. The copr el8 repository configuration is only applied for CentOS 8. The ceph-mgr-dashboard package is only installed when the dashboard_enabled variable is set to true. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-04-23 13:31:11 +02:00
Guillaume Abrioux	86959abf9b	tests: add back nfs testing on master This commit adds back nfs testing on master branch (containerized scenario only). Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-04-23 13:27:48 +02:00
Guillaume Abrioux	86dc6f8206	mds: don't enable application pool on cephfs pools This commit removes the task which enables application on cephfs pools. See: https://tracker.ceph.com/issues/43761 Fixes: #5278 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-04-23 13:23:10 +02:00
ianwatsonrh	ccf6a7f153	typo: updating type check on rc Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1826884 Signed-off-by: ianwatsonrh <ianwatson@redhat.com>	2020-04-23 13:20:35 +02:00
Guillaume Abrioux	7e800303e9	doc: add day-2 operations documentation This commit is the first of a series in order to describe all day-2 operations that are possible via ceph-ansible using a set of playbooks provided in `infrastructure-playbooks` directory. Fixes: #5061 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2020-04-21 09:52:01 -04:00
Dimitri Savineau	2b9edba131	filestore-to-bluestore: fix py2 on skipped tasks When using skipped variables with from_json filter and python2 then we need to have a default value otherwise the skipped task will fail. Unexpected templating type error occurred on ({{ (ceph_volume_lvm_list.stdout | from_json) }}): expected string or buffer Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1790472 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2020-04-20 16:19:18 +02:00
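
Note on cdb30bd125 (container: inspect Id field instead of RepoDigests): the failure mode described in that commit can be sketched in a few lines of Python. This is an illustration only, not the code from ceph-ansible. The function names, the trimmed inspect payloads, and the padded Id and digest values are all hypothetical; only the field names `Id` and `RepoDigests` and the null value for an untagged podman image come from the commit message.

```python
import json

# Hypothetical `inspect` output, trimmed to the two fields discussed.
# The Id and digest values are made up for illustration.
TAGGED = json.dumps([{
    "Id": "680c9c0d38c3" + "0" * 52,
    "RepoDigests": ["docker.io/ceph/daemon@sha256:" + "a" * 64],
}])
UNTAGGED = json.dumps([{
    "Id": "011ee108bfc9" + "0" * 52,
    "RepoDigests": None,  # what the commit reports for an untagged podman image
}])


def repo_digest(inspect_json):
    """Return RepoDigests[0], or None when the field is null or empty."""
    image = json.loads(inspect_json)[0]
    digests = image.get("RepoDigests") or []
    return digests[0] if digests else None


def image_id(inspect_json):
    """Return the Id field of the first inspected image."""
    return json.loads(inspect_json)[0]["Id"]


def image_changed(before_json, after_json):
    """Compare the Ids captured before and after an image pull."""
    return image_id(before_json) != image_id(after_json)
```

Indexing `RepoDigests[0]` directly on the untagged payload is the analogue of the "None has no element 0" error quoted in the commit, whereas comparing `Id` values works for both payloads.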


5650 Commits (cc6a10bd029c574f707e2fe5a675e685ac603dad)