ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Eduard Egorov	7d7080df6c	crush: create rack type buckets and build crush tree according to {{ osd_crush_location }}. Currently, we can define crush location for each host but only crush roots and crush rules are created. This commit automates other routines for a complete solution: 1) Creates rack type crush buckets defined in {{ ceph_crush_rack }} of each osd host. If it's not defined by user then a rack named 'default_rack_{{ ceph_crush_root }}' would be added and used in next steps. 2) Move rack type crush buckets defined in {{ ceph_crush_rack }} into crush roots defined in {{ ceph_crush_root }} of each osd host. 3) Move hosts defined in {{ ceph_crush_rack }} into crush roots defined in {{ ceph_crush_root }} of each osd host. Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>	2018-01-11 17:42:18 +01:00
Sébastien Han	6db4aea453	osd: skip devices marked as '/dev/dead' On a non-collocated scenario, if a drive is faulty we can't really remove it from the list of 'devices' without messing up or having to re-arrange the order of the 'dedicated_devices'. We want to keep this device list ordered. This will prevent the activation failing on a device that we know is failing but we can't remove it yet to not mess up the dedicated_devices mapping with devices. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-01-11 17:34:32 +01:00
Guillaume Abrioux	70401f955b	container: trigger handlers on systemd file change When a systemd unit file is changed we should trigger handlers to restart the services. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-10 16:46:42 +01:00
Guillaume Abrioux	b29a42cba6	handlers: avoid duplicate handler Having handlers in both ceph-defaults and ceph-docker-common roles can make the playbook restarting two times services. Handlers can be triggered first time because of a change in ceph.conf and a second time because a new image has been pulled. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-10 16:46:42 +01:00
Sébastien Han	8a19a83354	container: restart container when there is a new image This wasn't any good choice to implement this. We had several options and none of them were ideal since handlers can not be triggered cross-roles. We could have achieved that by doing: * option 1 was to add a dependancy in the meta of the ceph-docker-common role. We had that long ago and we decided to stop so everything is managed via site.yml * option 2 was to import files from another role. This is messy and we don't that anywhere in the current code base. We will continue to do so. There is option 3 where we pull the image from the ceph-config role. This is not suitable as well since the docker command won't be available unless you run Atomic distro. This would also mean that you're trying to pull twice. First time in ceph-config, second time in ceph-docker-common The only option I came up with was to duplicate a bit of the ceph-config handlers code. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1526513 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-01-10 16:46:42 +01:00
Guillaume Abrioux	900f447c82	containers: fix bug when looking for existing cluster When containerized deployment, `docker_exec_cmd` is not set before the task which try to retrieve the current fsid is played, it means it considers there is no existing fsid and try to generate a new one. Typical error: ``` ok: [mon0 -> mon0] => { "changed": false, "cmd": [ "ceph", "--connect-timeout", "3", "--cluster", "test", "fsid" ], "delta": "0:00:00.179909", "end": "2018-01-09 10:36:58.759846", "failed": false, "failed_when_result": false, "rc": 1, "start": "2018-01-09 10:36:58.579937" } STDERR: Error initializing cluster client: Error('error calling conf_read_file: errno EINVAL',) ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-10 16:23:18 +01:00
Sébastien Han	c2e04623a5	container: change the way we force no logs inside the container Previously we were using ceph_conf_overrides however this doesn't play nice for softwares like TripleO that uses ceph_conf_overrides inside its own code. For now, and since this is the only occurence of this, we can ensure no logs through the ceph conf template. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1532619 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-01-10 16:21:47 +01:00
Guillaume Abrioux	acfbebe67e	defaults: rename check_socket files for containers When containerized deployment, we are not looking for a socket but for a running container. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-10 15:44:47 +01:00
Sébastien Han	f0787e64da	mon: use crush rules for non-container too There is no reasons why we can't use crush rules when deploying containers. So moving the inlcude in the main.yml so it can be called. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-01-10 15:21:36 +01:00
Sébastien Han	97f520bc74	containers: bump memory limit A default value of 4GB for MDS is more appropriate and 3GB for OSD also. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1531607 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-01-09 11:26:50 +01:00
Sébastien Han	0b55abe3d0	mon: always run ceph-create-keys ceph-create-keys is idempotent so it's not an issue to run it each time we play ansible. This also fix issues where the 'creates' arg skips the task and no keys get generated on newer version, e.g during an upgrade. Closes: https://github.com/ceph/ceph-ansible/issues/2228 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-12-21 13:50:01 +01:00
Sébastien Han	ad54e19262	rgw: disable legacy rgw service unit When upgrading from OSP11 to OSP12 container, ceph-ansible attempts to disable the RGW service provided by the overcloud image. The task attempts to stop/disable ceph-rgw@{{ ansible-hostname }} and ceph-radosgw@{{ ansible-hostname }}.service. The actual service name is ceph-radosgw@radosgw.$name Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1525209 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-12-21 13:48:42 +01:00
Guillaume Abrioux	895949d6c4	osd: fix check gpt the gpt label creation doesn't work even with parted module. This commit fixes the gpt label creation by using parted command instead. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-12-20 17:42:45 +01:00
Sébastien Han	bbc79765f3	osd: best effort if no device is found during activation We have a scenario when we switch from non-container to containers. This means we don't know anything about the ceph partitions associated to an OSD. Normally in a containerized context we have files containing the preparation sequence. From these files we can get the capabilities of each OSD. As a last resort we use a ceph-disk call inside a dummy bash container to discover the ceph journal on the current osd. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1525612 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-12-19 14:40:48 +01:00
Sébastien Han	dfbef8361d	nfs: fix package install for debian/suss systems This resolves the following error: E: There were unauthenticated packages and -y was used without --allow-unauthenticated Signed-off-by: Sébastien Han <seb@redhat.com>	2017-12-19 13:30:49 +01:00
Christian Berendt	50a848dc40	Rename fact docker_version to ceph_docker_version The name docker_version is very generic and is also used by other roles. As a result, there may be name conflicts. To avoid this a ceph_ prefix should be used for this fact. Since it is an internal fact renaming is not a problem.	2017-12-15 20:12:21 +01:00
Markos Chandras	162b7d2b23	roles: ceph-mgr: Install the ceph-mgr package on SUSE The ceph-mgr package name is identical to RedHat so add the SUSE family to the existing task.	2017-12-15 09:22:14 +01:00
Guillaume Abrioux	a24fd1cfd9	client: don't make `osd_pool_default_pg_num` mandatory making `osd_pool_default_pg_num` mandatory is a bit agressive and is unrelated when you just want to create users keyrings. Closes: #2241 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-12-14 17:22:07 +01:00
Guillaume Abrioux	ab1dd3027a	client: don't try to generate keys the entrypoint to generate users keyring is `ceph-authtool`, therefore, it can expand the `$(ceph-authtool --gen-print-key)` inside the container. Users must generate a keyring themselves. This commit also adds a check to ensure keyring are properly filled when `user_config: true`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-12-14 17:22:07 +01:00
Guillaume Abrioux	26afe46e13	docker: add missing condition for selinux tasks on `client` and `mds` roles, it tries to set selinux even on non rhel based distributions.` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-12-14 17:00:14 +01:00
Sébastien Han	7eaf444328	default: look for the right return code on socket stat in-use As reported in https://github.com/ceph/ceph-ansible/issues/2254, the check with fuser is not ideal. If fuser is not available the return code is 127. Here we want to make sure that we looking for the correct return code, so 1. Closes: https://github.com/ceph/ceph-ansible/issues/2254 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-12-14 16:59:14 +01:00
John Fulton	8cba44262c	Add flags for OSD 'docker run --cpuset-{cpus,mems}' Add the variables ceph_osd_docker_cpuset_cpus and ceph_osd_docker_cpuset_mems, so that a user may specify the CPUs and memory nodes of NUMA systems on which OSD containers are run. Provides a example in osds.yaml.sample to guide user based on sample `lscpu` output since cpuset-mems refers to the memory by NUMA node only while cpuset-cpus can refer to individual vCPUs within a NUMA node.	2017-12-14 16:39:35 +01:00
Eduard Egorov	a8a2c13f6a	firewall: add mds, nfs, restapi and iscsi ports, remove 'configure_firewall' variable used for conditional execution. Include the task only on rpm-based systems. Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>	2017-12-12 23:44:55 +01:00
Eduard Egorov	6a5e0da30d	firewall: configure firewalld if it's already installed on the host (#2192 ). Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>	2017-12-12 23:44:55 +01:00
Major Hayden	5676fa23b1	Convert interface names to underscores for facts If a deployer uses an interface name with a dash/hyphen in it, such as 'br-storage' for the monitor_interface group_var, the ceph.conf.j2 template fails to find the right facts. It looks for 'ansible_br-storage' but only 'ansible_br_storage' exists. This patch converts the interface name to underscores when the template does the fact lookup.	2017-12-12 09:03:40 +01:00
Konstantin Shalygin	d7dadc3e7b	ceph-osd: respect nvme partitions when device is a disk.	2017-12-12 09:03:18 +01:00
Guillaume Abrioux	6a9b5c9632	defaults: fix CI issue with ceph_uid fact The CI complains because of `ceph_uid` fact which doesn't exist since the docker image tag used in the CI doesn't match with this condition. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-12-12 09:02:37 +01:00
Andrew Schoen	788c3f351a	ceph-osd: adds osd_objectstore to the name when using the ceph_volume module This allows for easier debugging if verbosity is not set high enough. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-12-11 09:58:06 -06:00
Andrew Schoen	5e3d8dbf63	ceph-osd: use the cluster param with the ceph_volume module Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-12-11 09:58:06 -06:00
Andrew Schoen	423166f671	ceph-osd: use the new ceph_volume module for the lvm scenario Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-12-11 09:58:06 -06:00
Sébastien Han	0ea1811f6f	Merge pull request #2226 from andymcc/gpt_mklabel Skip mklabel gpt if already gpt	2017-12-11 03:12:46 -06:00
Andy McCrae	4f1e854c79	Use parted module instead of command	2017-12-11 17:33:40 +10:00
John Fulton	ffae294288	Set tighter permissions on keyrings when containerized During a containerized deployment, set the permissions of ceph.client.admin.keyring and other keyrings to chmod 600 and chown it to ceph.	2017-12-06 19:22:28 -05:00
Guillaume Abrioux	b449b16edd	Merge pull request #2215 from squidboylan/support_loopback_devices Add support for using loopback devices as OSDs	2017-11-28 14:04:47 +01:00
Sébastien Han	f94b9040eb	Merge pull request #2214 from ceph/bz-1510555 handlers: restart daemons only if docker is running	2017-11-28 12:22:50 +01:00
Sébastien Han	ef581f807d	Merge pull request #2202 from ceph/remove_leftover osd: remove leftover and fix a typo	2017-11-28 12:21:13 +01:00
wintamute	ebe0e60235	Openstack: replaced hardcoded pool names with variables for openstack (nova) user (cherry picked from commit 2bf48f1)	2017-11-28 09:06:51 +01:00
Caleb Boylan	8f02bb007f	Add support for using loopback devices as OSDs This is particularly useful in CI environments where you dont have the option of adding extra devices or volumes to the host. It is also a simple change to support loopback devices	2017-11-27 16:02:36 -08:00
Guillaume Abrioux	b26a840002	handlers: restart daemons only if docker is running In case where docker CLI is available but docker is not running, we don't want to trigger the restart of the daemons. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1510555 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-27 14:59:30 +01:00
Sébastien Han	d9cfe5f6df	Merge pull request #2177 from jprovaznik/rados Allow to use rados for ganesha exports	2017-11-23 10:36:58 +01:00
Sébastien Han	bb7b29a9fc	common: install ceph-common on all the machines Since some daemons now install their own packages the task checking the ceph version fails on Debian systems. So the 'ceph-common' package must be installed on all the machines. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-11-22 17:11:50 +01:00
Jan Provaznik	2435c48cd5	Allow to use rados for ganesha exports	2017-11-21 15:21:32 +01:00
Guillaume Abrioux	1cba626484	osd: remove leftover and fix a typo This task was originally needed to fix a docker installation issue (see: #1030). This has been fixed, therefore it can be removed. Fixes: #2199 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-21 11:11:34 +01:00
Guillaume Abrioux	efe06be10f	osd: ensure a gpt label is set on device ceph-disk prepare will fail on jewel if a GPT label is not present on device. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-17 17:32:23 +01:00
Guillaume Abrioux	3c6f2854fe	Merge pull request #2189 from fultonj/empty-acl Make openstack_keys param support no acls list	2017-11-16 19:39:01 +01:00
John Fulton	d73f751b63	Make openstack_keys param support no acls list A recent change [1] required that the openstack_keys param always containe an acls list. However, it's possible it might not contain that list. Thus, this param sets a default for that list to be empty if it is not in the structure as defined by the user. [1] `d65cbaa539`	2017-11-16 11:29:59 -05:00
Sébastien Han	f31d8557dd	Merge pull request #2182 from ceph/fix_reboot_rbd rbd: enable ceph-rbd-mirror.target on releases prior to luminous	2017-11-16 16:55:39 +01:00
Sébastien Han	932345ab2a	osd: remove leftover from osd partition We used to support osds that are a partition. This is long gone so removing this task. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-11-16 14:58:40 +01:00
Sébastien Han	b1c1322357	osd: remove failed_when on activation There is no need to continue if the activation fails. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-11-16 14:57:49 +01:00
Sébastien Han	80d3a242d0	osd: fix bad activation for dmcrypt We were activating dmcrypt devices with the wrong command. Basically the first task execute the wrong activate command. The task fails but continues because of the 'failed_when: false'. Then the right activation sequence is being done by the next task. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-11-16 14:55:08 +01:00

1 2 3 4 5 ...

1653 Commits (875e14cabf7206c12d7808261bada31d3dd8939d)