ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Guillaume Abrioux	5748215325	dashboard: fix hosts sections in main playbook ceph-dashboard should be deployed on either a dedicated mgr node or a mon if they are collocated. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `bdc870cbf5`)	2019-06-17 15:54:54 -04:00
Dimitri Savineau	b1f8518ef9	tests: Update ansible ssh_args variable Because we're using vagrant, a ssh config file will be created for each nodes with options like user, host, port, identity, etc... But via tox we're override ANSIBLE_SSH_ARGS to use this file. This remove the default value set in ansible.cfg. Also adding PreferredAuthentications=publickey because CentOS/RHEL servers are configured with GSSAPIAuthenticationis enabled for ssh server forcing the client to make a PTR DNS query. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `34f9d51178`)	2019-06-17 16:45:38 +02:00
Guillaume Abrioux	82ab98326c	tests: increase docker pull timeout CI is facing issues where docker pull reach the timeout, let's increase this to avoid CI failures. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `1019e3b3dc`)	2019-06-14 17:51:32 +00:00
Rishabh Dave	dc66a5e65a	ceph-infra: make chronyd default NTP daemon Since timesyncd is not available on RHEL-based OSs, change the default to chronyd for RHEL-based OSs. Also, chronyd is chrony on Ubuntu, so set the Ansible fact accordingly. Fixes: https://github.com/ceph/ceph-ansible/issues/3628 Signed-off-by: Rishabh Dave <ridave@redhat.com> (cherry picked from commit `9d88d3199f`)	2019-06-14 12:21:02 +00:00
Guillaume Abrioux	6805eb3184	iscsi: assign application (rbd) to pool 'rbd' if we don't assign the rbd application tag on this pool, the cluster will get `HEALTH_WARN` state like following: ``` HEALTH_WARN application not enabled on 1 pool(s) POOL_APP_NOT_ENABLED application not enabled on 1 pool(s) application not enabled on pool 'rbd' ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `4cf17a6fdd`)	2019-06-13 14:51:19 -04:00
Rishabh Dave	34e3b3f0e4	ceph-infra: update cache for Ubuntu Ubuntu-based CI jobs often fail with error code 404 while installing NTP daemons. Updating cache beforehand should fix the issue. Signed-off-by: Rishabh Dave <ridave@redhat.com> (cherry picked from commit `d1c266e6c7`)	2019-06-13 14:50:19 -04:00
Guillaume Abrioux	b1a3b6e2f1	mon: enforce mon0 delegation for initial_mon_key register since this task is designed to be always run on the first monitor, let's enforce the container name accordingly otherwise it could fail like following: ``` fatal: [mon1 -> mon0]: FAILED! => changed=true cmd: - docker - exec - ceph-mon-mon1 - ceph - --cluster - ceph - --name - mon. - -k - /var/lib/ceph/mon/ceph-mon0/keyring - auth - get-key - mon. delta: '0:00:00.085025' end: '2019-06-12 06:12:27.677936' msg: non-zero return code rc: 1 start: '2019-06-12 06:12:27.592911' stderr: 'Error response from daemon: No such container: ceph-mon-mon1' stderr_lines: <omitted> stdout: '' stdout_lines: <omitted> ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `905c2256bd`)	2019-06-13 07:39:07 +02:00
Dimitri Savineau	f71e8f249f	ceph-node-exporter: Fix systemd template `069076b` introduced a bug in the systemd unit script template. This commit fixes the options used by the node-exporter container. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `d0840217f3`)	2019-06-13 07:37:26 +02:00
Guillaume Abrioux	5e392d1a60	dashboard: add allow_embedding support Add a variable to support the allow_embedding support. See ceph/ceph-ansible/issues/4084 for details. Fixes: #4084 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `27856cc499`)	2019-06-12 17:05:26 -04:00
Guillaume Abrioux	dfdaef4158	dashboard: fix dashboard_url setting This setting must be set to something resolvable. See: ceph/ceph-ansible/issues/4085 for details Fixes: #4085 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `2c9cd9d9e7`)	2019-06-12 17:04:57 -04:00
Dimitri Savineau	3815add534	ceph-handler: replace fuser by /proc/net/unix We're using fuser command to see if a process is using a ceph unix socket file. But the fuser command runs through every PID present in /proc/<PID> to see if one of them is using the file. On a system running thousands processes, the fuser command can take a long time to finish. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1717011 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `da9891da1e`)	2019-06-12 23:00:36 +02:00
Dimitri Savineau	053ab50f69	tox-dashboard: update for nautilus We don't need to use dev_setup playbook on stable branch. We also need to remove the dev container image variables and update the value to match nautilus. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-06-12 21:56:42 +02:00
Dimitri Savineau	7c6a09152d	ceph-node-exporter: use modprobe ansible module Instead of using the modprobe command from the path in the systemd unit script, we can use the modprobe ansible module. That way we don't have to manage the binary path based on the linux distribution. Resolves: #4072 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `dbf81b6b5b`)	2019-06-12 10:02:54 -04:00
fmount	138fa19ccf	Fix units and add ability to have a dedicated instance Few fixes on systemd unit templates for node_exporter and alertmanager container parameters. Added the ability to use a dedicated instance to deploy the dashboard components (prometheus and grafana). This commit also introduces the grafana_group_name variable to refer grafana group and keep consistency with the other groups. During the integration with TripleO some grafana/prometheus template variables resulted undefined. This commit adds the ability to check if the group exist and create, accordingly, different job groups in prometheus template. Signed-off-by: fmount <fpantano@redhat.com> (cherry picked from commit `069076bbfd`)	2019-06-12 11:48:12 +02:00
Guillaume Abrioux	d36bab5557	validate: fail in check_devices at the right task see https://bugzilla.redhat.com/show_bug.cgi?id=1648168#c17 for details. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1648168#c17 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `771648304d`)	2019-06-10 08:11:39 +02:00
Guillaume Abrioux	12651433ba	spec: bring back possibility to install ceph with custom repo This can be seen as a regression for customers who were used to deploy in offline environment with custom repositories. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1673254 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `c933645bf7`)	2019-06-10 08:10:26 +02:00
Dimitri Savineau	376cb86db2	container-common: support podman on Ubuntu Currently we're only able to use podman on ubuntu if podman's installation is done manually before the ceph-ansible execution because the deb package is present in an external repository. We already manage the docker-ce installation via an external repository so we should be able to allow the podman installation with the same mechanism too. https://github.com/containers/libpod/blob/master/install.md#ubuntu Resolves: #3947 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `518ab794fb`)	2019-06-07 10:12:36 -04:00
Dimitri Savineau	e9edb5a92a	podman: Add systemd dependency on network.target When using podman, the systemd unit scripts don't have a dependency on the network. So we're not sure that the network is up and running when the containers are starting. With docker this behaviour is already handled because the systemd unit scripts depend on docker service which is started after the network. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `f49090df7e`)	2019-06-07 16:06:26 +02:00
L3D	1daca1ba83	ansible: use 'bool' filter on boolean conditionals By running ceph-ansible there are a lot ``[DEPRECATION WARNING]`` like these: ``` [DEPRECATION WARNING]: evaluating containerized_deployment as a bare variable, this behaviour will go away and you might need to add \|bool to the expression in the future. Also see CONDITIONAL_BARE_VARS configuration toggle.. This feature will be removed in version 2.12. Deprecation warnings can be disabled by setting deprecation_warnings=False in ansible.cfg. ``` Now appended ``\| bool`` on a lot of the affected variables. Sometimes the coding style from ``variable\|bool`` changed to ``variable \| bool`` (with spaces at the pipe). Closes: #4022 Signed-off-by: L3D <l3d@c3woc.de> (cherry picked from commit `ab54fe20ec`)	2019-06-07 16:05:51 +02:00
Dimitri Savineau	7a384e7ec2	purge-cluster: clean all ceph repo files We currently only purge rh_storage yum repository file but depending on the ceph_repository value we are using, the ceph repository file could have a different name. Resolves: #4056 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `44c63903ca`)	2019-06-07 12:05:40 +00:00
guihecheng	a6312ba9bc	Add section for purging rgw loadbalancer in purge-cluster.yml Signed-off-by: guihecheng <guihecheng@cmiot.chinamobile.com> (cherry picked from commit `59e702ec39`)	2019-06-06 19:44:30 +00:00
guihecheng	606b2e2082	Add section for rgw loadbalancer in site.yml This drives ceph rgw loadbalancer stuff to run. Signed-off-by: guihecheng <guihecheng@cmiot.chinamobile.com> (cherry picked from commit `96c346743b`)	2019-06-06 19:44:30 +00:00
guihecheng	c52020a4db	Add role definitions of ceph-rgw-loadbalancer This add support for rgw loadbalancer based on HAProxy and Keepalived. We define a single role ceph-rgw-loadbalancer and include HAProxy and Keepalived configurations all in this. A single haproxy backend is used to balance all RGW instances and a single frontend is exported via a single port, default 80. Keepalived is used to maintain the high availability of all haproxy instances. You are free to use any number of VIPs. A single VIP is shared across all keepalived instances and there will be one master for one VIP, selected sequentially, and others serve as backups. This assumes that each keepalived instance is on the same node as one haproxy instance and we use a simple check script to detect the state of each haproxy instance and trigger the VIP failover upon its failure. Signed-off-by: guihecheng <guihecheng@cmiot.chinamobile.com> (cherry picked from commit `35d40c65f8`)	2019-06-06 19:44:30 +00:00
Guillaume Abrioux	6449d8fd56	validate: add a check for nfs standalone if `nfs_obj_gw` is True when deploying an internal ganesha with an external ceph cluster, `ceph_nfs_rgw_access_key` and `ceph_nfs_rgw_secret_key` must be provided so the ganesha configuration file can be generated. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `003aeea45a`)	2019-06-06 12:44:37 +00:00
Guillaume Abrioux	cb125fa4c8	nfs: support internal Ganesha with external ceph cluster This commits allows to deploy an internal ganesha with an external ceph cluster. This requires to define `external_cluster_mon_ips` with a comma separated list of external monitors. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1710358 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `6a6785b719`)	2019-06-06 12:44:37 +00:00
Guillaume Abrioux	61a52a97e3	ceph-osd: do not relabel /run/udev in containerized context Otherwise content in /run/udev is mislabeled and prevent some services like NetworkManager from starting. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `80875adba7`)	2019-06-04 22:09:27 +00:00
Guillaume Abrioux	3b40380870	tests: test podman against atomic os instead rhel8 the rhel8 image used is an outdated beta version, it is not worth it to maintain this image upstream, since it's possible to test podman with a newer version of centos/atomic-host image. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `a78fb209b1`)	2019-06-04 22:09:27 +00:00
Dimitri Savineau	cd80299b55	site-container: update container-engine role Since the split between container-engine and container-common roles, the tags and condition were not updated to reflect the change. - ceph-container-engine needs with_pkg tag - ceph-container-common needs fetch_container_images - we don't need to pull the container image in a dedicated task for atomic host. We can now use the ceph-container-common role. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `2d375e1aa7`)	2019-06-04 11:33:04 -04:00
Dimitri Savineau	b8bcbacdbb	ceph-nfs: use template module for configuration `789cef7` introduces a regression in the ganesha configuration file generation. The new config_template module version broke it. But the ganesha.conf file isn't an ini file and doesn't really need to use the config_template module. Instead we can use the classic template module. Resolves: #4045 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `616c484698`)	2019-06-04 14:18:51 +02:00
Dimitri Savineau	acef6665ca	ceph-facts: generate fsid on mon node The fsid generation is done via a python command. When the ansible controller node only have python3 available (like RHEL 8) then the python command isn't necessarily present causing the fsid generation to fail. We already do some resource creation (like ceph keyring secret) with the python command too but from the mon node so we should do the same for fsid. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1714631 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `daf92a9e1f`)	2019-06-03 09:01:33 -04:00
Dimitri Savineau	69835e85e7	vagrant: Default box to centos/7 We don't use ceph/ubuntu-xenial anymore but only centos/7 and centos/atomic-host. Changing the default to centos/7. Resolves: #4036 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `24d0fd7003`)	2019-05-31 15:34:38 +00:00
Kevin Carter	8b8546a46b	Sync config_template from upstream This change pulls in the most recent release of the config_template module into the ceph_ansible action plugins. Signed-off-by: Kevin Carter <kecarter@redhat.com> (cherry picked from commit `789cef7621`)	2019-05-28 16:48:23 -04:00
Guillaume Abrioux	769e0d2f5c	tests: add retries on failing tests in testinfra This commit adds `pytest-rerunfailures` in requirements.txt so we can retry failing test in testinfra to avoid false positive. (eg: sometimes it can happen for some reason a service takes too much time to start) Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `4708b7615f`)	2019-05-22 15:24:57 -04:00
Guillaume Abrioux	16c6d530c6	roles: introduce `ceph-container-engine` role This commit splits the current `ceph-container-common` role. This introduces a new role `ceph-container-engine` which handles the tasks specific to the installation of containers tools (docker/podman). This is needed for the ceph-dashboard implementation for 2 main reasons: 1/ Since the ceph-dashboard stack is only containerized, we must install everything needed to run containers even in non containerized deployments. Splitting this role allows us to not have to call the full `ceph-container-common` role which would run a bunch of unneeded tasks that would have been skipped anyway. 2/ The current implementation would have required to run `ceph-container-common` on all ceph-clients nodes which would have been conflicting with `9d3517c670` (we don't want to run ceph-container-common on all client nodes, see mentioned commit for more details) Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `55420d6253`)	2019-05-22 15:24:11 -04:00
Dimitri Savineau	27bd7df5cf	ceph-mgr: install python-routes for dashboard The ceph mgr dashboard requires routes python library to be installed on the system. Resolves: #3995 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `f37edfa113`)	2019-05-22 13:07:17 +02:00
Dimitri Savineau	6d521f1516	ceph-prometheus: fix error in templates - remove trailing double quotes in jinja templates - add jinja filename without .j2 suffix Resolves: #4011 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `29b0d47c8c`)	2019-05-22 08:45:31 +02:00
Dimitri Savineau	1fd81e8d42	common: use gnupg instead of gpg gpg package isn't available for all Debian/Ubuntu distribution but gnupg is. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `622d9feae9`)	2019-05-21 16:28:51 -04:00
Guillaume Abrioux	5982e17315	config: fix ipv6 As of nautilus, if you set `ms bind ipv6 = True` you must explicitly set `ms bind ipv4 = False` too, otherwise OSDs will still try to pick up an IPv4 address. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1710319 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `6ca7372a2d`)	2019-05-21 16:26:54 -04:00
Dimitri Savineau	63f99fb965	tests: update testinfra release In order to support ansible 2.8 with testinfra we need to use the latest release (3.0.x). Adding ssh-config option to py.test. Also bumping the pytest and xdist version. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `de147469d7`)	2019-05-21 09:17:46 +02:00
Dimitri Savineau	6e917da52a	ceph-nfs: apply selinux fix anyway Because ansible_distribution_version doesn't return minor version on CentOS with ansible 2.8 we can apply the selinux anyway but only for CentOS/RHEL 7. Starting RHEL 8, there's a dedicated package for selinux called nfs-ganesha-selinux [1]. Also replace the command module + semanage by the selinux_permissive module. [1] https://github.com/nfs-ganesha/nfs-ganesha/commit/a7911f Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `0ee833432e`)	2019-05-21 09:17:46 +02:00
Dimitri Savineau	78ce0aa0b5	ceph-validate: use kernel validation for iscsi Ceph iSCSI gateway requires Red Hat Enterprise Linux or CentOS 7.5 or later. Because we can not check the ansible_distribution_version fact for CentOS with ansible 2.8 (returns only the major version) we can fallback by checking the kernel option. - CONFIG_TARGET_CORE=m - CONFIG_TCM_USER2=m - CONFIG_ISCSI_TARGET=m http://docs.ceph.com/docs/master/rbd/iscsi-target-cli-manual-install/ Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `0c7fd79865`)	2019-05-21 09:17:46 +02:00
Guillaume Abrioux	d83db2c8ed	switch to ansible 2.8 - remove private attribute with import_role. - update documentation. - update rpm spec requirement. - fix MagicMock python import in unit tests. Closes: #3765 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `72d8315299`)	2019-05-21 09:17:46 +02:00
Guillaume Abrioux	8c47e063e0	dashboard: move the call to ceph-node-exporter This moves the call to ceph-node-exporter role after ceph-container-common, otherwise it will try to run container before docker or podman are installed. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `7c6a3bf825`)	2019-05-20 14:11:39 +02:00
Dimitri Savineau	bcafb182c4	common: install dependencies for apt modules When using a minimal Debian/Ubuntu distribution there's no ca-certificates and gpg packages installed so the apt modules will fail: Failed to find required executable gpg in paths: /usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin apt.cache.FetchFailedException: W:https://download.ceph.com/debian-luminous/dists/bionic/InRelease: No system certificates available. Try installing ca-certificates. Resolves: #3994 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `494746b7a6`)	2019-05-20 10:45:46 +02:00
Dimitri Savineau	32e966db73	tox: Don't copy infrastructure playbook Since `a1a871c` we don't need to copy the infrastructure playbooks under the ceph-ansible root directory. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `0f89a3f7a5`)	2019-05-20 09:38:35 +02:00
Dimitri Savineau	023cdffd95	purge-docker-cluster: don't remove data on atomic Because we don't manage the docker service on atomic (yet) via the ceph-container-common role then we can't stop docker dans remove the data. For now let's do that only for non atomic hosts. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com> (cherry picked from commit `638604929b`)	2019-05-17 10:44:52 -04:00
Guillaume Abrioux	1e2f8cd909	dashboard: move defaults variables to ceph-defaults There is no need to have default values for these variables in each roles since there is no corresponding host groups Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `9f0d4d6847`)	2019-05-17 16:05:58 +02:00
Guillaume Abrioux	e29fd842a6	rename docker_exec_cmd variable This commit renames the `docker_exec_cmd` variable to `container_exec_cmd` so it's more generic. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `e74d80e72f`)	2019-05-17 16:05:58 +02:00
Guillaume Abrioux	71a85405b7	dashboard: fix a typo `6f0643c8e` introduced a typo, the role that should be run is ceph-container-common, not ceph-common Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `acac24d984`)	2019-05-17 16:05:58 +02:00
Guillaume Abrioux	deb5f2a1bd	tests: add dashboard scenario testing This commit add a new scenario to test the dashboard deployment via ceph-ansible. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `17634fc3df`)	2019-05-17 16:05:58 +02:00

... 3 4 5 6 7 ...

4844 Commits (5568692340553f5df19f3d19d9ec6e0dbf7deda9) All Branches Search

4844 Commits (5568692340553f5df19f3d19d9ec6e0dbf7deda9)

All Branches