ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Sébastien Han	37117071eb	common: add tools repo for iscsi gw To install iscsi gw packages we need to enable the tools repo. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1547849 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-12 13:38:34 +02:00
Douglas Fuller	c8573fe0d7	Remove deprecated allow_multimds allow_multimds will be officially deprecated in Mimic, specify it only for all versions of Ceph where it was declared stable. Going forward, specify only max_mds. Signed-off-by: Douglas Fuller <dfuller@redhat.com>	2018-04-12 10:29:17 +02:00
vasishta p shastry	020e66c1b4	Fixed a typo (extra space)	2018-04-11 14:21:15 +02:00
vasishta p shastry	e1a1f81b6f	osd: to support copy_admin_key	2018-04-11 14:21:15 +02:00
vasishta p shastry	db3a5ce6d9	mds: to support copy_admin_keyring	2018-04-11 14:21:15 +02:00
vasishta p shastry	6b59416f75	nfs: to support copy_admin_key - containerized	2018-04-11 14:21:15 +02:00
Ali Maredia	01c58695fc	nfs: ensure nfs-server server is stopped NFS-ganesha cannot start is the nfs-server service is running. This commit stops nfs-server in case it is running on a (debian, redhat, suse) node before the nfs-ganesha service starts up fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1508506 Signed-off-by: Ali Maredia <amaredia@redhat.com>	2018-04-11 14:00:48 +02:00
Ramana Raja	4a430ae29a	ceph-nfs: allow disabling ganesha caching Add a variable, ceph_nfs_disable_caching, that if set to true disables ganesha's directory and attribute caching as much as possible. Also, disable caching done by ganesha, when 'nfs_file_gw' variable is true, i.e., when Ganesha is used as CephFS's gateway. This is the recommended Ganesha setting as libcephfs already caches information. And doing so helps avoid cache incoherency issues especially with clustered ganesha over CephFS. Fixes: https://tracker.ceph.com/issues/23393 Signed-off-by: Ramana Raja <rraja@redhat.com>	2018-04-11 13:56:40 +02:00
Sébastien Han	82ccbdafbc	ceph-defaults: bring backward compatibility for old syntax If people keep on using the mon_cap, osd_cap etc the playbook will translate this old syntax on the flight. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-11 12:18:34 +02:00
Sébastien Han	9657e4d6fa	ceph_key: use ceph_key in the playbook Replaced all the occurence of raw command using the 'command' module with the ceph_key module instead. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-11 12:18:34 +02:00
Guillaume Abrioux	66c4118dcd	defaults: fix backward compatibility backward compatibility with `ceph_mon_docker_interface` and `ceph_mon_docker_subnet` was not working since there wasn't lookup on `monitor_interface` and `public_network` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-04-10 00:19:11 +02:00
Ken Dreyer	3752cc6f38	common: upgrade/install ceph-test RPM first Prior to this change, if a user had ceph-test-12.2.1 installed, and upgraded to ceph v12.2.3 or newer, the RPM upgrade process would fail. The problem is that the ceph-test RPM did not depend on an exact version of ceph-common until v12.2.3. In Ceph v12.2.3, ceph-{osdomap,kvstore,monstore}-tool binaries moved from ceph-test into ceph-base. When ceph-test is not yet up-to-date, Yum encounters package conflicts between the older ceph-test and newer ceph-base. When all users have upgraded beyond Ceph < 12.2.3, this is no longer relevant.	2018-04-09 18:09:52 +02:00
Sébastien Han	bb60f2fea4	ceph-defaults: fix ceoh_uid for container image tag latest According to our recent change, we now use "CentOS" as a latest container image. We need to reflect this on the ceph_uid. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-09 13:54:55 +02:00
Zack Cerza	0123d790cd	Use the CentOS repo for Red Hat dev packages No use even trying to use something that doesn't exist. Signed-off-by: Zack Cerza <zack@redhat.com>	2018-04-09 10:05:57 +02:00
Attila Fazekas	ecd3563c21	Deploying without managed monitors failed Tripleo deployment failed when the monitors not manged by tripleo itself with: FAILED! => {"msg": "list object has no element 0"} The failing play item was introduced by `f46217b69a` . fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1552327 Signed-off-by: Attila Fazekas <afazekas@redhat.com>	2018-04-04 18:16:46 +02:00
Guillaume Abrioux	dcf6a246a4	defaults: remove `run_once: true` when creating fetch_directory because of `serial: 1`, it can be an issue when the playbook is being run on client nodes. Since the refact of `ceph-client` we skip the role `ceph-defaults` on every node except the first client node, it means that the task is not going to be played because of `run_once: true`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-04-04 10:51:17 +02:00
Guillaume Abrioux	18c0c7a508	config: use fact `ceph_uid` Use fact `ceph_uid` in the task which ensures `/etc/ceph` exists in containerized deployments. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-04-04 10:51:17 +02:00
Guillaume Abrioux	9c979c6390	clients: refact `ceph-clients` role This commit refacts this role so we don't have to pull container image on client nodes just to create pools and keys. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1550977 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-04-04 10:51:17 +02:00
Guillaume Abrioux	cefd471967	client: remove legacy code This seems to be a leftover. This commit removes an unnecessary 'set linux permissions' on `/var/lib/ceph` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-04-04 10:51:17 +02:00
Guillaume Abrioux	cf27c5e941	move selinux check to `ceph-defaults` This check is alone in `ceph-docker-common` since a previous code refactor. Moving this check in `ceph-defaults` allows us to run `ceph-clients` without having to run `ceph-docker-common` even in non-containerized deployment. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-04-04 10:51:17 +02:00
Sébastien Han	f3caee8460	ceph-iscsi: fix certificates generation and distribution Prior to this patch, the certificates where being generated on a single node only (because of the run_once: true). Thus certificates were not distributed on all the gateway nodes. This would require a second ansible run to work. This patches fix the creation and keys's distribution on all the nodes. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1540845 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-04 09:27:39 +02:00
Randy J. Martinez	ca572a11f1	ceph-mds: delete duplicate tasks which cause multimds container deployments to fail. This update will resolve error['cephfs' is undefined.] in multimds container deployments. See: roles/ceph-mon/tasks/create_mds_filesystems.yml. The same last two tasks are present there, and actully need to happen in that role since "{{ cephfs }}" gets defined in roles/ceph-mon/defaults/main.yml, and not roles/ceph-mds/defaults/main.yml. Signed-off-by: Randy J. Martinez <ramartin@redhat.com>	2018-03-29 09:32:40 +02:00
Alfredo Deza	3fcf966803	ceph-osd note that some scenarios use ceph-disk vs. ceph-volume Signed-off-by: Alfredo Deza <adeza@redhat.com>	2018-03-29 09:11:33 +02:00
John Fulton	e6e6bd078a	Refer to expected-num-ojects as expected_num_objects, not size Follow up patch to PR 2432 [1] which replaces "size" (sorry if the original bug used that term, which can be confusing) with expected_num_objects as is used in the Ceph documentation [2]. [1] https://github.com/ceph/ceph-ansible/pull/2432/files [2] http://docs.ceph.com/docs/jewel/rados/operations/pools	2018-03-26 15:41:51 +02:00
Ning Yao	691ddf5349	cleanup osd.conf.j2 in ceph-osd osd crush location is set by ceph_crush in the library, osd.conf.j2 is not used any more. Signed-off-by: Ning Yao <yaoning@unitedstack.com>	2018-03-26 15:57:37 +08:00
Patrick Donnelly	7f91547304	setup cephx keys when not nfs_obj_gw Copy the admin key when configured nfs_file_gw (but not nfs_obj_gw). Also, copy/setup RGW related directories only when configured as nfs_obj_gw. Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>	2018-03-22 14:01:08 +01:00
Andrew Schoen	6cffbd5409	ceph-defaults: set is_atomic variable This variable is needed for containerized clusters and is required for the ceph-docker-common role. Typically the is_atomic variable is set in site-docker.yml.sample though so if ceph-docker-common is used outside of that playbook it needs set in another way. Moving the creation of the variable inside this role means playbooks don't need to worry about setting it. fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1558252 Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-03-21 19:16:11 +01:00
Andy McCrae	388562a4af	Simplify ceph.conf generation Since the approach to creating a ceph.conf file has changed, and now no-longer relies on assembling config file fragments in /etc/ceph/ceph.d we can avoid the conf_overrides rendering on the local host and skip out the tasks related to that, instead using just the config_template task to configure the file directly.	2018-03-15 15:47:41 +01:00
Sébastien Han	e3275c1ca1	osd: add fs.aio-max-nr tuning The number of osds per nodes is limited by aio-max-nr, default is low, so we need to increase it. Full story: http://lists.ceph.com/pipermail/ceph-users-ceph.com/2017-August/020408.html Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1553407 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-15 14:06:26 +01:00
Sébastien Han	f432819c1e	osd: apply systcl right away Without sysctl_set: yes the sysctm tuning will only get applied on the systctl.conf but not on the fly. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-15 14:06:26 +01:00
Sébastien Han	0f8a4251ba	move system tuning to osd role The changes from these tasks only apply to osd nodes so there is no reason to have them in ceph-common. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-15 14:06:26 +01:00
Sébastien Han	f119b25bbe	client: implement proper pools creation Just like we did for the monitor and openstack_config we now have the ability to precisely create pools. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Sébastien Han	e302c1baae	mon: add support for erasure code pool You can now specify type: erasure and erasure_profile to use when declaring the pool dictionnary. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Sébastien Han	277d885bc9	mon: add support for pgp, pool type and rule name When creating pools, it's crucial to expose all the options available as part of the pool creation command. As explained in: http://docs.ceph.com/docs/jewel/rados/operations/pools/ Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Sébastien Han	26bc00fb74	mon: fail if pool creation fails There is no reason to continue the deployment if these tasks fail. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1546185 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Sébastien Han	0011edd2bc	mon: add support for expected-num-objects This commit adds the support for expected-num-objects when creating a pool. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1541520 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Sébastien Han	18402b636f	defaults: add useful info if daemon are not restarted properly If OSDs don't restart normally we now also dump info of the crush map, crush rules, crush tree and pools. If the monitors don't restart normally we also print the socket status by calling mon_status and quorum_status. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
jtudelag	691f7c5146	Adds handy ceph aliases whe containerized installations. Same approach as openshift-ansible etcdctl: * https://github.com/openshift/openshift-ansible/blob/release-3.7/roles/etcd/tasks/auxiliary/drop_etcdctl.yml * https://github.com/openshift/openshift-ansible/blob/release-3.7/roles/etcd/etcdctl.sh	2018-03-08 13:56:39 +01:00
Guillaume Abrioux	9181c94adf	client: fix pgs num for client pool creation The `pools` dict defined in `roles/ceph-client/defaults/main.yml` shouldn't have `{{ ceph_conf_overrides.global.osd_pool_default_pg_num }}` as default value for `pgs` keys. For instance, if you want some pools to be created but without explicitely specifying the pgs for these pools (it means you want to use the `osd_pool_default_pg_num`), you will be obliged to define `{{ ceph_conf_overrides.global.osd_pool_default_pg_num }}` anyway while you wanted to use the current default value already defined in the cluster which is retrieved early in the playbook and stored in the `{{ osd_pool_default_pg_num }}` fact. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-03-07 11:18:04 +01:00
Sébastien Han	96c049be5b	common: run updatedb task on debian systems only The command doesn't exist on Red Hat systems so it's better to skip it instead of ignoring the error. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-06 15:24:31 +00:00
Sébastien Han	a52ed43093	mon: fix osd_pool_default_crush_rule persistence and effectiveness Running the last portion (insert new default and add new default crush tasks) of crush_rules.yml only on the last monitor is wrong since ceph CLI calls usually end up on the master having the quorum, which is by default the one with the lower IP. So if we run the command and end up on another mon the creation will happen on the default crush rule because the particular mon hasn't been updated. To fix this we remove the \|last on the include and use run_once: true on certain tasks, then we let the final two tasks run on all the monitors. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-06 15:24:31 +00:00
Sébastien Han	47cef7a41d	mon: fix set crush default rule On releases after jewel the option 'osd_pool_default_crush_replicated_ruleset' does not exist anymore, it's called osd_pool_default_crush_rule. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-06 15:24:31 +00:00
Sébastien Han	3261ab23b8	osd: remove old crush_location implementation This was causing a lot of pain with the handlers. Also the implementation was not ideal since we were assembling files. Everything can now be done with the ceph_crush module so let's remove that. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-06 15:24:31 +00:00
Sébastien Han	73c4846744	mon: use ceph_crush module in the playbook Instead of creating the CRUSH hierarchy with Ansible tasks using the command module we now rely on the ceph_crush module. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-06 15:24:31 +00:00
Greg Charot	78c1f1938f	mons: Current crush_rule playbook does not work if there is no default rule defined (default: true). One could want to add new crush rules while keeping his current default rule. Fixed it so that it works with all rules defined as "default: false". If multiple rules are defined as default (should not be) then the last rule listed in "crush_rules" is taken as default.	2018-03-06 15:24:31 +00:00
Greg Charot	77f9c1df10	no reason the ceph-ansible ansible default provided crush_rule_hdd rule should be set as rack root + default ruleset	2018-03-06 15:24:31 +00:00
Greg Charot	50afc3fbf3	We don't want to automatically move the rbd pool to the new default crush rule. This operation shall be performed by the cluster operator.	2018-03-06 15:24:31 +00:00
Andy McCrae	04ca685ba7	Remove vars that are no longer used As part of `fcba2c801a` these vars were removed and no longer do anything: radosgw_dns_name radosgw_resolve_cname This patch removes them from the group_vars files and defaults/main.yml	2018-03-06 09:16:25 +01:00
jtudelag	c3267b77b7	Makes use of docker_exec_cmd in ceph-mon role. Keeps consistency inside the role and among roles. Makes the code more readable.	2018-03-05 12:48:35 +00:00
Sébastien Han	cb0f598965	common: run updatedb task on debian systems only The command doesn't exist on Red Hat systems so it's better to skip it instead of ignoring the error. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-02 20:59:10 +00:00
Sébastien Han	7f19df8196	rgw: add cluster name option to the handler If the cluster name is different than 'ceph', the command will fail so we need to pass the cluster name. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-02 20:59:10 +00:00
Sébastien Han	9c85280602	rgw: ability to copy ceph admin key on containerized If we now set copy_admin_key while running a containerized scenario, the ceph admin key will be copied on the node. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-02 20:59:10 +00:00
Sébastien Han	67f46d8ec3	rgw: run the handler on a mon host In case the admin wasn't copied over to the node this command would fail. So it's safer to run it from a monitor directly. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-02 20:59:10 +00:00
Guillaume Abrioux	6d35bc9bde	client: use `ceph_uid` fact to set uid/gid on admin key That task is failing on containerized deployment because `ceph:ceph` doesn't exist. The idea here is to use the `{{ ceph_uid }}` to set the ownerships for the admin keyring when containerized_deployment. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1540578 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-02-26 15:52:05 +01:00
Grant Slater	1e1b26ca4d	mds: fix ansible_service_mgr typo This commit fixes a typo introduced by `4671b9e74e`	2018-02-26 13:05:14 +01:00
Andy McCrae	c33dae7509	Revert "[TEST] Test setting up correct systemd file for nfs-ganesha" The nfs-ganesha package has been fixed as part of this commit: `963b6681df` Once the package is rebuilt this should be good to merge. This reverts commit `e88af3c4cb`.	2018-02-26 10:23:42 +01:00
Giulio Fidente	a83e1aeea3	Make rule_name optional when defining items in openstack_pools Previously it was necessary to provide a value (eventually an empty string) for the "rule_name" key for each item in openstack_pools. This change makes that optional and defaults to empty string when not given.	2018-02-23 15:11:53 +01:00
Sébastien Han	165d9dec10	remove kernel.pid_max This is now managed by Ceph packages. See: https://github.com/ceph/ceph/pull/18544/files http://tracker.ceph.com/issues/21929 Closes: https://github.com/ceph/ceph-ansible/issues/2410 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-02-23 13:57:57 +01:00
Andy McCrae	2779d2a850	Adjust /etc/updatedb.conf to not parse /var/lib/ceph Using updatedb -e doesnt make a permanent change, but will updatedb without the passed path. To make this change more permanent we should update the /etc/updatedb.conf file to include /var/lib/ceph.	2018-02-20 11:32:56 +01:00
Andy McCrae	e88af3c4cb	[TEST] Test setting up correct systemd file for nfs-ganesha Don't merge this. Test to see if we copy over the nfs-ganesha-lock.service.debian8 file properly, whether the Xenial CI job will work. The upstream download.ceph.com nfs-ganesha package should be fixed for xenial (which is in progress).	2018-02-20 10:49:37 +01:00
Paul Bourke	463b5c6b22	Remove redundant task to check if atomic This fact is already set in site-docker.yml so there's no need to check it again in ceph-docker-common Signed-off-by: Paul Bourke <paul.bourke@oracle.com>	2018-02-19 10:10:46 +01:00
Andy McCrae	59a4335a56	Restart services if handler called This patch fixes an issue where if hosts have different service lists, it will prevent restarting changes on services that run later on. For example, hostA in the mons and rgws group would initiate a config change and restart of services on all mons and rgws hosts, even though a separate hostB (which is only in the rgws group) has not had its configuration changed yet. Additionally, when the second host has its coniguration changed as part of the ceph-rgw role, it will not initiate a restart since its inventory name != the first hosts. To fix this we should run the restart once (using run_once: True) as long as the host has called the handler. This will ensure that even if only 1 host has called the handler it will initiate a restart on all hosts that have called the handler. Additionally, we add a var that is set when the handler runs, this will ensure that only hosts that have called the handler get restarted. Includes minor fix to remove unrequired "inventory_hostname in play_hosts" when: clause. This is no longer required since the handlers were changed. The host calling the handler will be in play_hosts already.	2018-02-16 10:40:20 +01:00
Sébastien Han	c816a9282c	container: osd remove run_once When used along with delegate, run_once does not belong well. Thus, using \| last always brings the desired result. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-02-14 02:01:29 +01:00
Sébastien Han	d47d02a5eb	docker-common: fix container restart on new image We now look for any excisting containers, if any we compare their running image with the latest pulled container image. For OSDs, we iterate over the list of running OSDs, this handles the case where the first OSD of the list has been updated (runs the new image) and not the others. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1526513 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-02-14 02:01:29 +01:00
Sébastien Han	ebc195487c	default: remove duplicate code This is already defined in ceph-defaults. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-02-14 02:01:29 +01:00
Caleb Boylan	0be60456ce	osd: Add support for multipath disks Multipath disks have partitions with a different format than what ceph-ansible currently supports, this update makes ceph-ansible aware of that format so multipath disks can be used as OSDs Signed-off-by: Caleb Boylan <caleb.boylan@ormuco.com>	2018-02-09 18:06:25 +01:00
Andy McCrae	b4dbc862d6	Set application for OpenStack pools Since Luminous we need to set the application tag for each pool, otherwise a CEPH_WARNING is generated when the pools are in use. We should assign the OpenStack pools to their default which would be "rbd". When updating to Luminous this would happen automatically to the vms, images, backups and volumes pools, but for new deploys this is not the case.	2018-02-09 17:15:55 +01:00
Sébastien Han	22f843e3d4	default: define 'osd_scenario' variable osd_scenario does not exist in the ceph-default role so if we try to play ceph-default on an OSD node, the playbook will fail with undefined variable. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-02-08 17:42:12 +01:00
Guillaume Abrioux	e537779bb3	osd: fix osd restart when dmcrypt This commit fixes a bug that occurs especially for dmcrypt scenarios. There is an issue where the 'disk_list' container can't reach the ceph cluster because it's not launched with `--net=host`. If this container can't reach the cluster, it will hang on this step (when trying to retrieve the dm-crypt key) : ``` +common_functions.sh:448: open_encrypted_part(): ceph --cluster abc12 --name \ client.osd-lockbox.9138767f-7445-49e0-baad-35e19adca8bb --keyring \ /var/lib/ceph/osd-lockbox/9138767f-7445-49e0-baad-35e19adca8bb/keyring \ config-key get dm-crypt/osd/9138767f-7445-49e0-baad-35e19adca8bb/luks +common_functions.sh:452: open_encrypted_part(): base64 -d +common_functions.sh:452: open_encrypted_part(): cryptsetup --key-file \ -luksOpen /dev/sdb1 9138767f-7445-49e0-baad-35e19adca8bb ``` It means the `ceph-run-osd.sh` script won't be able to start the `osd_disk_activate` process in ceph-container because he won't have filled the `$DOCKER_ENV` environment variable properly. Adding `--net=host` to the 'disk_list' container fixes this issue. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1543284 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-02-08 15:45:13 +01:00
Giulio Fidente	bdcc52b96d	Check for docker sockets named after both _hostname or _fqdn While hostname -f will always return an hostname including its domain part and -s without the domain part, the behavior when no arguments are given can include or not include the domain part depending on how the system is configured; the socket name might not match the instance name then.	2018-02-06 14:16:54 +01:00
Greg Charot	a6d1922a2e	mon: Fixed crush_rule_config for containerised deployment. Was called too early, container was not yet started so the commands failed. Moved the section after include docker/main.yml Signed-off-by: Greg Charot <gcharot@redhat.com>	2018-02-06 05:12:59 +01:00
Guillaume Abrioux	dd0c98c5a2	common: do not use `shell` module when it is not needed There is no need here to use `shell` instead of `command` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-31 10:45:34 +01:00
Guillaume Abrioux	deaf273b25	syntax: change local_action syntax Use a nicer syntax for `local_action` tasks. We used to have oneliner like this: ``` local_action: wait_for port=22 host={{ hostvars[inventory_hostname]['ansible_default_ipv4']['address'] }} state=started delay=10 timeout=500 }} ``` The usual syntax: ``` local_action: module: wait_for port: 22 host: "{{ hostvars[inventory_hostname]['ansible_default_ipv4']['address'] }}" state: started delay: 10 timeout: 500 ``` is nicer and kind of way to keep consistency regarding the whole playbook. This also fix a potential issue about missing quotation : ``` Traceback (most recent call last): File "/tmp/ansible_wQtWsi/ansible_module_command.py", line 213, in <module> main() File "/tmp/ansible_wQtWsi/ansible_module_command.py", line 185, in main rc, out, err = module.run_command(args, executable=executable, use_unsafe_shell=shell, encoding=None, data=stdin) File "/tmp/ansible_wQtWsi/ansible_modlib.zip/ansible/module_utils/basic.py", line 2710, in run_command File "/usr/lib64/python2.7/shlex.py", line 279, in split return list(lex) File "/usr/lib64/python2.7/shlex.py", line 269, in next token = self.get_token() File "/usr/lib64/python2.7/shlex.py", line 96, in get_token raw = self.read_token() File "/usr/lib64/python2.7/shlex.py", line 172, in read_token raise ValueError, "No closing quotation" ValueError: No closing quotation ``` writing `local_action: shell echo {{ fsid }} \| tee {{ fetch_directory }}/ceph_cluster_uuid.conf` can cause trouble because it's complaining with missing quotes, this fix solves this issue. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1510555 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-31 10:45:34 +01:00
Sébastien Han	6f9dd26caa	config: remove any spaces in public_network or cluster_network With two public networks configured - we found that with "NETWORK_ADDR_1, NETWORK_ADDR_2" install process consistently became broken, trying to find docker registry on second network, and not finding mon container. but without spaces "NETWORK_ADDR_1,NETWORK_ADDR_2" install succeeds so, containerized install is more peculiar with formatting of this line Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1534003 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-01-30 17:47:15 +01:00
Sébastien Han	5132cc3de4	Do not search osd ids if ceph-volume Description of problem: The 'get osd id' task goes through all the 10 times (and its respective timeouts) to make sure that the number of OSDs in the osd directory match the number of devices. This happens always, regardless if the setup and deployment is correct. Version-Release number of selected component (if applicable): Surely the latest. But any ceph-ansible version that contains ceph-volume support is affected. How reproducible: 100% Steps to Reproduce: 1. Use ceph-volume (LVM) to deploy OSDs 2. Avoid using anything in the 'devices' section 3. Deploy the cluster Actual results: TASK [ceph-osd : get osd id _uses_shell=True, _raw_params=ls /var/lib/ceph/osd/ \| sed 's/.-//'] ********************************************************************************************************************************************* task path: /Users/alfredo/python/upstream/ceph/src/ceph-volume/ceph_volume/tests/functional/lvm/.tox/xenial-filestore-dmcrypt/tmp/ceph-ansible/roles/ceph-osd/tasks/start_osds.yml:6 FAILED - RETRYING: get osd id (10 retries left). FAILED - RETRYING: get osd id (9 retries left). FAILED - RETRYING: get osd id (8 retries left). FAILED - RETRYING: get osd id (7 retries left). FAILED - RETRYING: get osd id (6 retries left). FAILED - RETRYING: get osd id (5 retries left). FAILED - RETRYING: get osd id (4 retries left). FAILED - RETRYING: get osd id (3 retries left). FAILED - RETRYING: get osd id (2 retries left). FAILED - RETRYING: get osd id (1 retries left). ok: [osd0] => { "attempts": 10, "changed": false, "cmd": "ls /var/lib/ceph/osd/ \| sed 's/.*-//'", "delta": "0:00:00.002717", "end": "2018-01-21 18:10:31.237933", "failed": true, "failed_when_result": false, "rc": 0, "start": "2018-01-21 18:10:31.235216" } STDOUT: 0 1 2 Expected results: There aren't any (or just a few) timeouts while the OSDs are found Additional info: This is happening because the check is mapping the number of "devices" defined for ceph-disk (in this case it would be 0) to match the number of OSDs found. Basically this line: until: osd_id.stdout_lines\|length == devices\|unique\|length Means in this 2 OSD case it is trying to ensure the following incorrect condition: until: 2 == 0 Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1537103	2018-01-30 14:44:38 +01:00
Andy McCrae	481173f203	Add default for radosgw_keystone_ssl This should default to False. The default for Keystone is not to use PKI keys, additionally, anybody using this setting had to have been manually setting it before. Fixes: #2111	2018-01-30 11:30:23 +01:00
Guillaume Abrioux	f1232b33fd	Revert "monitor_interface: document need to use monitor_address when using IPv6" This reverts commit `10b91661ce`. This reverts also the same comment added in `1359869497`	2018-01-29 14:43:24 +01:00
Eduard Egorov	93e9f3723b	config: add host-specific ceph_conf_overrides evaluation and generation. This allows us to use host-specific variables in ceph_conf_overrides variable. For example, this fixes usage of such variables (e.g. 'nss db path' having {{ ansible_hostname }} inside) in ceph_conf_overrides for rados gateway configuration (see profiles/rgw-keystone-v3) - issue #2157. Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>	2018-01-26 10:15:03 +01:00
Guillaume Abrioux	ec16cbdb1a	defaults: avoid getting stuck (ceph --connect-timeout) Sometime the playbook gets stuck because even with `--connect-timeout=` option, the connexion to the existing ceph cluster never timeout. As a workaround, using `timeout` command provided by coreutils will actually timeout if we can't connect to the cluster. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1537003 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-25 10:15:59 +01:00
Andrew Schoen	79473badfe	ceph-osd: adds dmcrypt to the lvm scenario Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-01-24 14:10:08 +01:00
Guillaume Abrioux	9306a1789c	osds: change default value for `dedicated_devices` This is to keep backward compatibility with stable-2.2 and satisfy the check "verify dedicated devices have been provided" in `check_mandatory_vars.yml`. This check is looking for `dedicated_devices` so we need to default it's value to `raw_journal_devices` when `raw_multi_journal` is set to `True`. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1536098 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-22 18:02:51 +01:00
Sébastien Han	f88795e843	rgw: disable legacy unit Some systems that were deployed with old tools can leave units named "ceph-radosgw@radosgw.gateway.service". As a consequence, they will prevent the new unit to start. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1509584 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-01-18 14:12:18 +01:00
Andrew Schoen	fb4a6dc9a4	docs for the crush_device_class option of lvm_volumes Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-01-17 13:49:29 +01:00
Andrew Schoen	6cbb56a3b6	ceph-osd: adds the crush_device_class param to the lvm scenario Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-01-17 13:49:29 +01:00
Eduard Egorov	7d7080df6c	crush: create rack type buckets and build crush tree according to {{ osd_crush_location }}. Currently, we can define crush location for each host but only crush roots and crush rules are created. This commit automates other routines for a complete solution: 1) Creates rack type crush buckets defined in {{ ceph_crush_rack }} of each osd host. If it's not defined by user then a rack named 'default_rack_{{ ceph_crush_root }}' would be added and used in next steps. 2) Move rack type crush buckets defined in {{ ceph_crush_rack }} into crush roots defined in {{ ceph_crush_root }} of each osd host. 3) Move hosts defined in {{ ceph_crush_rack }} into crush roots defined in {{ ceph_crush_root }} of each osd host. Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>	2018-01-11 17:42:18 +01:00
Sébastien Han	6db4aea453	osd: skip devices marked as '/dev/dead' On a non-collocated scenario, if a drive is faulty we can't really remove it from the list of 'devices' without messing up or having to re-arrange the order of the 'dedicated_devices'. We want to keep this device list ordered. This will prevent the activation failing on a device that we know is failing but we can't remove it yet to not mess up the dedicated_devices mapping with devices. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-01-11 17:34:32 +01:00
Guillaume Abrioux	70401f955b	container: trigger handlers on systemd file change When a systemd unit file is changed we should trigger handlers to restart the services. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-10 16:46:42 +01:00
Guillaume Abrioux	b29a42cba6	handlers: avoid duplicate handler Having handlers in both ceph-defaults and ceph-docker-common roles can make the playbook restarting two times services. Handlers can be triggered first time because of a change in ceph.conf and a second time because a new image has been pulled. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-10 16:46:42 +01:00
Sébastien Han	8a19a83354	container: restart container when there is a new image This wasn't any good choice to implement this. We had several options and none of them were ideal since handlers can not be triggered cross-roles. We could have achieved that by doing: * option 1 was to add a dependancy in the meta of the ceph-docker-common role. We had that long ago and we decided to stop so everything is managed via site.yml * option 2 was to import files from another role. This is messy and we don't that anywhere in the current code base. We will continue to do so. There is option 3 where we pull the image from the ceph-config role. This is not suitable as well since the docker command won't be available unless you run Atomic distro. This would also mean that you're trying to pull twice. First time in ceph-config, second time in ceph-docker-common The only option I came up with was to duplicate a bit of the ceph-config handlers code. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1526513 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-01-10 16:46:42 +01:00
Guillaume Abrioux	900f447c82	containers: fix bug when looking for existing cluster When containerized deployment, `docker_exec_cmd` is not set before the task which try to retrieve the current fsid is played, it means it considers there is no existing fsid and try to generate a new one. Typical error: ``` ok: [mon0 -> mon0] => { "changed": false, "cmd": [ "ceph", "--connect-timeout", "3", "--cluster", "test", "fsid" ], "delta": "0:00:00.179909", "end": "2018-01-09 10:36:58.759846", "failed": false, "failed_when_result": false, "rc": 1, "start": "2018-01-09 10:36:58.579937" } STDERR: Error initializing cluster client: Error('error calling conf_read_file: errno EINVAL',) ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-10 16:23:18 +01:00
Sébastien Han	c2e04623a5	container: change the way we force no logs inside the container Previously we were using ceph_conf_overrides however this doesn't play nice for softwares like TripleO that uses ceph_conf_overrides inside its own code. For now, and since this is the only occurence of this, we can ensure no logs through the ceph conf template. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1532619 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-01-10 16:21:47 +01:00
Guillaume Abrioux	acfbebe67e	defaults: rename check_socket files for containers When containerized deployment, we are not looking for a socket but for a running container. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-10 15:44:47 +01:00
Sébastien Han	f0787e64da	mon: use crush rules for non-container too There is no reasons why we can't use crush rules when deploying containers. So moving the inlcude in the main.yml so it can be called. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-01-10 15:21:36 +01:00
Sébastien Han	97f520bc74	containers: bump memory limit A default value of 4GB for MDS is more appropriate and 3GB for OSD also. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1531607 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-01-09 11:26:50 +01:00
Sébastien Han	0b55abe3d0	mon: always run ceph-create-keys ceph-create-keys is idempotent so it's not an issue to run it each time we play ansible. This also fix issues where the 'creates' arg skips the task and no keys get generated on newer version, e.g during an upgrade. Closes: https://github.com/ceph/ceph-ansible/issues/2228 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-12-21 13:50:01 +01:00
Sébastien Han	ad54e19262	rgw: disable legacy rgw service unit When upgrading from OSP11 to OSP12 container, ceph-ansible attempts to disable the RGW service provided by the overcloud image. The task attempts to stop/disable ceph-rgw@{{ ansible-hostname }} and ceph-radosgw@{{ ansible-hostname }}.service. The actual service name is ceph-radosgw@radosgw.$name Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1525209 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-12-21 13:48:42 +01:00
Guillaume Abrioux	895949d6c4	osd: fix check gpt the gpt label creation doesn't work even with parted module. This commit fixes the gpt label creation by using parted command instead. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-12-20 17:42:45 +01:00
Sébastien Han	bbc79765f3	osd: best effort if no device is found during activation We have a scenario when we switch from non-container to containers. This means we don't know anything about the ceph partitions associated to an OSD. Normally in a containerized context we have files containing the preparation sequence. From these files we can get the capabilities of each OSD. As a last resort we use a ceph-disk call inside a dummy bash container to discover the ceph journal on the current osd. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1525612 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-12-19 14:40:48 +01:00
Sébastien Han	dfbef8361d	nfs: fix package install for debian/suss systems This resolves the following error: E: There were unauthenticated packages and -y was used without --allow-unauthenticated Signed-off-by: Sébastien Han <seb@redhat.com>	2017-12-19 13:30:49 +01:00
Christian Berendt	50a848dc40	Rename fact docker_version to ceph_docker_version The name docker_version is very generic and is also used by other roles. As a result, there may be name conflicts. To avoid this a ceph_ prefix should be used for this fact. Since it is an internal fact renaming is not a problem.	2017-12-15 20:12:21 +01:00
Markos Chandras	162b7d2b23	roles: ceph-mgr: Install the ceph-mgr package on SUSE The ceph-mgr package name is identical to RedHat so add the SUSE family to the existing task.	2017-12-15 09:22:14 +01:00
Guillaume Abrioux	a24fd1cfd9	client: don't make `osd_pool_default_pg_num` mandatory making `osd_pool_default_pg_num` mandatory is a bit agressive and is unrelated when you just want to create users keyrings. Closes: #2241 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-12-14 17:22:07 +01:00
Guillaume Abrioux	ab1dd3027a	client: don't try to generate keys the entrypoint to generate users keyring is `ceph-authtool`, therefore, it can expand the `$(ceph-authtool --gen-print-key)` inside the container. Users must generate a keyring themselves. This commit also adds a check to ensure keyring are properly filled when `user_config: true`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-12-14 17:22:07 +01:00
Guillaume Abrioux	26afe46e13	docker: add missing condition for selinux tasks on `client` and `mds` roles, it tries to set selinux even on non rhel based distributions.` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-12-14 17:00:14 +01:00
Sébastien Han	7eaf444328	default: look for the right return code on socket stat in-use As reported in https://github.com/ceph/ceph-ansible/issues/2254, the check with fuser is not ideal. If fuser is not available the return code is 127. Here we want to make sure that we looking for the correct return code, so 1. Closes: https://github.com/ceph/ceph-ansible/issues/2254 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-12-14 16:59:14 +01:00
John Fulton	8cba44262c	Add flags for OSD 'docker run --cpuset-{cpus,mems}' Add the variables ceph_osd_docker_cpuset_cpus and ceph_osd_docker_cpuset_mems, so that a user may specify the CPUs and memory nodes of NUMA systems on which OSD containers are run. Provides a example in osds.yaml.sample to guide user based on sample `lscpu` output since cpuset-mems refers to the memory by NUMA node only while cpuset-cpus can refer to individual vCPUs within a NUMA node.	2017-12-14 16:39:35 +01:00
Eduard Egorov	a8a2c13f6a	firewall: add mds, nfs, restapi and iscsi ports, remove 'configure_firewall' variable used for conditional execution. Include the task only on rpm-based systems. Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>	2017-12-12 23:44:55 +01:00
Eduard Egorov	6a5e0da30d	firewall: configure firewalld if it's already installed on the host (#2192 ). Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>	2017-12-12 23:44:55 +01:00
Major Hayden	5676fa23b1	Convert interface names to underscores for facts If a deployer uses an interface name with a dash/hyphen in it, such as 'br-storage' for the monitor_interface group_var, the ceph.conf.j2 template fails to find the right facts. It looks for 'ansible_br-storage' but only 'ansible_br_storage' exists. This patch converts the interface name to underscores when the template does the fact lookup.	2017-12-12 09:03:40 +01:00
Konstantin Shalygin	d7dadc3e7b	ceph-osd: respect nvme partitions when device is a disk.	2017-12-12 09:03:18 +01:00
Guillaume Abrioux	6a9b5c9632	defaults: fix CI issue with ceph_uid fact The CI complains because of `ceph_uid` fact which doesn't exist since the docker image tag used in the CI doesn't match with this condition. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-12-12 09:02:37 +01:00
Andrew Schoen	788c3f351a	ceph-osd: adds osd_objectstore to the name when using the ceph_volume module This allows for easier debugging if verbosity is not set high enough. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-12-11 09:58:06 -06:00
Andrew Schoen	5e3d8dbf63	ceph-osd: use the cluster param with the ceph_volume module Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-12-11 09:58:06 -06:00
Andrew Schoen	423166f671	ceph-osd: use the new ceph_volume module for the lvm scenario Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-12-11 09:58:06 -06:00
Sébastien Han	0ea1811f6f	Merge pull request #2226 from andymcc/gpt_mklabel Skip mklabel gpt if already gpt	2017-12-11 03:12:46 -06:00
Andy McCrae	4f1e854c79	Use parted module instead of command	2017-12-11 17:33:40 +10:00
John Fulton	ffae294288	Set tighter permissions on keyrings when containerized During a containerized deployment, set the permissions of ceph.client.admin.keyring and other keyrings to chmod 600 and chown it to ceph.	2017-12-06 19:22:28 -05:00
Guillaume Abrioux	b449b16edd	Merge pull request #2215 from squidboylan/support_loopback_devices Add support for using loopback devices as OSDs	2017-11-28 14:04:47 +01:00
Sébastien Han	f94b9040eb	Merge pull request #2214 from ceph/bz-1510555 handlers: restart daemons only if docker is running	2017-11-28 12:22:50 +01:00
Sébastien Han	ef581f807d	Merge pull request #2202 from ceph/remove_leftover osd: remove leftover and fix a typo	2017-11-28 12:21:13 +01:00
wintamute	ebe0e60235	Openstack: replaced hardcoded pool names with variables for openstack (nova) user (cherry picked from commit 2bf48f1)	2017-11-28 09:06:51 +01:00
Caleb Boylan	8f02bb007f	Add support for using loopback devices as OSDs This is particularly useful in CI environments where you dont have the option of adding extra devices or volumes to the host. It is also a simple change to support loopback devices	2017-11-27 16:02:36 -08:00
Guillaume Abrioux	b26a840002	handlers: restart daemons only if docker is running In case where docker CLI is available but docker is not running, we don't want to trigger the restart of the daemons. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1510555 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-27 14:59:30 +01:00
Sébastien Han	d9cfe5f6df	Merge pull request #2177 from jprovaznik/rados Allow to use rados for ganesha exports	2017-11-23 10:36:58 +01:00
Sébastien Han	bb7b29a9fc	common: install ceph-common on all the machines Since some daemons now install their own packages the task checking the ceph version fails on Debian systems. So the 'ceph-common' package must be installed on all the machines. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-11-22 17:11:50 +01:00
Jan Provaznik	2435c48cd5	Allow to use rados for ganesha exports	2017-11-21 15:21:32 +01:00
Guillaume Abrioux	1cba626484	osd: remove leftover and fix a typo This task was originally needed to fix a docker installation issue (see: #1030). This has been fixed, therefore it can be removed. Fixes: #2199 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-21 11:11:34 +01:00
Guillaume Abrioux	efe06be10f	osd: ensure a gpt label is set on device ceph-disk prepare will fail on jewel if a GPT label is not present on device. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-17 17:32:23 +01:00
Guillaume Abrioux	3c6f2854fe	Merge pull request #2189 from fultonj/empty-acl Make openstack_keys param support no acls list	2017-11-16 19:39:01 +01:00
John Fulton	d73f751b63	Make openstack_keys param support no acls list A recent change [1] required that the openstack_keys param always containe an acls list. However, it's possible it might not contain that list. Thus, this param sets a default for that list to be empty if it is not in the structure as defined by the user. [1] `d65cbaa539`	2017-11-16 11:29:59 -05:00
Sébastien Han	f31d8557dd	Merge pull request #2182 from ceph/fix_reboot_rbd rbd: enable ceph-rbd-mirror.target on releases prior to luminous	2017-11-16 16:55:39 +01:00
Sébastien Han	932345ab2a	osd: remove leftover from osd partition We used to support osds that are a partition. This is long gone so removing this task. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-11-16 14:58:40 +01:00
Sébastien Han	b1c1322357	osd: remove failed_when on activation There is no need to continue if the activation fails. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-11-16 14:57:49 +01:00
Sébastien Han	80d3a242d0	osd: fix bad activation for dmcrypt We were activating dmcrypt devices with the wrong command. Basically the first task execute the wrong activate command. The task fails but continues because of the 'failed_when: false'. Then the right activation sequence is being done by the next task. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-11-16 14:55:08 +01:00
Sébastien Han	cc264d6ba6	Merge pull request #2151 from hwoarang/add-opensuse Add openSUSE Leap 42.3 support	2017-11-16 14:35:28 +01:00
Sébastien Han	a98f14784a	Merge pull request #2172 from ceph/lvm-raw-device lvm: add support for --data to be a raw device or partition	2017-11-16 14:14:23 +01:00
Guillaume Abrioux	ccad0ebf26	rbd: enable ceph-rbd-mirror.target for releases <= luminous when `ceph-rbd-mirror.target` is not enabled, the service won't start after a reboot because there is a dependency between these two units. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-16 14:12:59 +01:00
Yixing Yan	097249371f	fix: remove the duplicated code	2017-11-16 16:45:03 +08:00
Andrew Schoen	3c604f1115	lvm: support --data as a raw device or partition in ceph-volume Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-11-15 09:36:17 -06:00
Andrew Schoen	04f02910a9	lvm: ensure the data_vg exists before using it Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-11-15 09:36:17 -06:00
John Fulton	d65cbaa539	Set permissions and ACLs of OpenStack keys on all ceph-mons If ceph-ansible deploys a Ceph cluster with "openstack_config: true" and sets the openstack_keys map to have certain ACLs or permissions, the requested ACLs or permissions are only set on one of the monitor nodes [2] when they should be set on all of them. This patch solves [3] the above issue by having the chmod and setfacl tasks iterate the list of mon nodes (including the mon node that the task was delegated to) to apply the chmod of setfacl to the keys in openstack_keys. [1] ``` openstack_keys: - { name: client.openstack, key: "$(ceph-authtool --gen-print-key)", mon_cap: "allow r", osd_cap: "allow class-read object_prefix rbd_children, allow rwx pool=images, allow rwx pool=vms, allow rwx pool=volumes, allow rwx pool=backups", mode: "0600", acls: ["u:nova:r--", "u:cinder:r--", "u:glance:r--", "u:gnocchi:r--"] } ``` [2] ``` $ ansible mons -m shell -b -a "ls -l /etc/ceph/ceph.client.openstack.keyring ; getfacl /etc/ceph/ceph.client.openstack.keyring" 192.168.1.26 \| SUCCESS \| rc=0 >> -rw-r-----+ 1 root root 253 Nov 3 20:30 /etc/ceph/ceph.client.openstack.keyring user::rw- user:glance:r-- user:nova:r-- user:cinder:r-- user:gnocchi:r-- group::--- mask::r-- other::---getfacl: Removing leading '/' from absolute path names 192.168.1.29 \| SUCCESS \| rc=0 >> -rw-r--r--. 1 root root 253 Nov 3 20:30 /etc/ceph/ceph.client.openstack.keyring user::rw- group::r-- other::r--getfacl: Removing leading '/' from absolute path names 192.168.1.23 \| SUCCESS \| rc=0 >> -rw-r--r--. 1 root root 253 Nov 3 20:30 /etc/ceph/ceph.client.openstack.keyring user::rw- group::r-- other::r--getfacl: Removing leading '/' from absolute path names $ ``` [3] ``` (undercloud) [stack@hci-director ceph-ansible]$ ansible mons -m shell -b -a "ls -l /etc/ceph/ceph.client.openstack.keyring ; getfacl /etc/ceph/ceph.client.openstack.keyring" 192.168.1.25 \| SUCCESS \| rc=0 >> -rw-r-----+ 1 root root 253 Nov 14 01:12 /etc/ceph/ceph.client.openstack.keyring user::rw- user:glance:r-- user:nova:r-- user:cinder:r-- user:gnocchi:r-- group::--- mask::r-- other::---getfacl: Removing leading '/' from absolute path names 192.168.1.29 \| SUCCESS \| rc=0 >> -rw-r-----+ 1 root root 253 Nov 14 01:12 /etc/ceph/ceph.client.openstack.keyring user::rw- user:glance:r-- user:nova:r-- user:cinder:r-- user:gnocchi:r-- group::--- mask::r-- other::---getfacl: Removing leading '/' from absolute path names 192.168.1.27 \| SUCCESS \| rc=0 >> -rw-r-----+ 1 root root 253 Nov 14 01:12 /etc/ceph/ceph.client.openstack.keyring user::rw- user:glance:r-- user:nova:r-- user:cinder:r-- user:gnocchi:r-- group::--- mask::r-- other::---getfacl: Removing leading '/' from absolute path names (undercloud) [stack@hci-director ceph-ansible]$ ```	2017-11-15 10:09:24 -05:00
Guillaume Abrioux	aa0b1ed118	tests: remove OSD_FORCE_ZAP variable from tests according to ceph/ceph-container#840, this variable is no longer needed. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-14 17:55:01 +01:00
Markos Chandras	f8e3d4bb76	ceph-docker-common: Add support for openSUSE Leap distributions Add support for the openSUSE Leap distributions. Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:23 +00:00
Markos Chandras	8c321b8416	ceph-nfs: Add support for openSUSE Leap distributions Add support for the openSUSE distributions. The required packages are available either in the distribution repositories or in the OBS one. Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:23 +00:00
Markos Chandras	173959cfc7	ceph-rgw: Add support for openSUSE Leap distributions Add support for openSUSE Leap distributions. Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:23 +00:00
Markos Chandras	a868c52f3f	ceph-restapi: Add support for openSUSE Leap distributions Add support for openSUSE Leap distributions. Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:23 +00:00
Markos Chandras	ddb468bfb3	ceph-rbd-mirror: Add support for openSUSE Leap distributions Add support for openSUSE Leap distributions Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:23 +00:00
Markos Chandras	fb46950373	ceph-osd: Add support for openSUSE Leap distributions Add support for openSUSE Leap distributions Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:23 +00:00
Markos Chandras	34a40adcf7	ceph-mon: Add support for openSUSE Leap distributions Add support for openSUSE Leap distributions. Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:23 +00:00
Markos Chandras	f944ee3980	ceph-mgr: Add support for openSUSE Leap distributions Add support for openSUSE Leap distributions. Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:23 +00:00
Markos Chandras	8135638c58	ceph-mds: Add support for openSUSE Leap distributions Add support for openSUSE Leap distributions Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:23 +00:00
Markos Chandras	c6103a0f13	ceph-fetch-keys: Add support for openSUSE Leap distributions Add support for openSUSE Leap distributions Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:23 +00:00
Markos Chandras	3e4a7c8b61	ceph-config: Add support for the openSUSE Leap distributions Add support for the openSUSE Leap distributions Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:23 +00:00
Markos Chandras	211b0c33a0	ceph-client: Add support for the openSUSE Leap distributions Add support for the openSUSE Leap distributions Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:23 +00:00
Markos Chandras	e06c108442	ceph-agent: Add support for the openSUSE Leap distributions Add support for the openSUSE Leap distributions. Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:23 +00:00
Markos Chandras	dd6ee72547	ceph-common: Don't check for ceph_stable_release for distro packages When we consume the distribution packages, we don't have the choise on which version to install, so we shouldn't require that variable to be set. Distributions normally provide only one version of Ceph in the official repositories so we get whatever they provide. Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:23 +00:00
Markos Chandras	849786967a	ceph-common: Add initial support for openSUSE Leap distributions openSUSE Leap 42.3 provides support for Ceph Luminous in both the distribution package and the latest available version in the OBS repository so add these as the only available installation methods for openSUSE. Signed-off-by: Markos Chandras <mchandras@suse.de>	2017-11-14 10:51:22 +00:00
Guillaume Abrioux	44df3f9102	defaults: fix rgw restart script in handlers Like `80d32dec`, the path to the fact is not correct. In any case, we will retrieve the IP address in hostvars, the variable is the way we get the interface name according where it has been set (eg.: inventory host file vs. group_vars/) Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1510906 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-13 16:30:03 +01:00
Guillaume Abrioux	0369bd59e2	Merge pull request #2146 from mslovy/wip-fix-crush-location osd: fix crush location for non-containerized deployment	2017-11-13 12:23:44 +01:00
Sébastien Han	7b0743be52	Merge pull request #2144 from ceph/quick_fix_lvm osd: skip some set_fact when osd_scenario=lvm	2017-11-13 21:50:37 +11:00
Sébastien Han	17d1ff61d5	Merge pull request #2141 from Arano-kai/run_restart_scripts_in_noexec_tmp FIX: run restart scripts in `noexec` /tmp	2017-11-13 21:37:35 +11:00
Guillaume Abrioux	c06faf2deb	Merge pull request #2154 from ceph/fix_auto_discover osd: avoid using non desired loop device in autodiscovery	2017-11-10 01:19:20 +01:00
Guillaume Abrioux	a695b2c08f	Merge pull request #2153 from ceph/fix_disk_list_test osd: always run disk_list test	2017-11-09 23:50:32 +01:00
Guillaume Abrioux	591d77220e	osd: always run disk_list test there is no need to have a condition on this task, this test should be always run since the result will be interpreted later. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-09 11:51:16 +01:00
Guillaume Abrioux	43975a7332	osd: avoid using non desired loop device in autodiscovery This will prevent ceph-ansible from using a loop device while it shouldn't in auto_discovery mode. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-09 10:26:24 +01:00
Guillaume Abrioux	80d32decd3	config: fix config generation The path to the fact is not correct. In any case, we will retrieve the IP address in hostvars, the variable is the way we get the interface name according where it has been set (eg.: inventory host file vs. group_vars/) Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1510906 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-09 08:50:57 +01:00
Guillaume Abrioux	d5dfc63c89	osd: fix automatic prepare when auto_discover Use `devices` variable instead of `ansible_devices`, otherwise it means we are not using the devices which have been 'auto discovered' Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-08 10:20:44 +01:00
yaoning	d82a09dddd	fix crush location for non-containerized deployment crush location only set for containerized deployment Signed-off-by: yaoning <yaoning@unitedstack.com>	2017-11-08 12:05:10 +11:00
Sébastien Han	0930f14915	osd: do not use dm when osd_auto_discovery The current code will also return lvm devices such as /dev/dm-2, this kind of device type is not supported by ceph-disk at the moment. Now we just ignore them. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-11-08 11:33:10 +11:00
Guillaume Abrioux	238754a844	osd: skip some set_fact when osd_scenario=lvm these tasks are not needed when using `osd_scenario: lvm` Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1509230 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-07 15:30:08 +01:00
Guillaume Abrioux	39b584e540	osd: fix a typo in roles/ceph-osd/defaults/main.yml Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-11-07 10:06:16 +01:00
Arano-kai	5cde3175ae	FIX: run restart scripts in `noexec` /tmp - One can not run scripts directly in place, that mounted with `noexec` option. But one can run scripts as arguments for `bash/sh`. Signed-off-by: Arano-kai <captcha.is.evil@gmail.com>	2017-11-06 16:02:47 +02:00
Sébastien Han	d4ed9a2064	osd: enhance backward compatibility During the initial implementation of this 'old' thing we were falling into this issue without noticing https://github.com/moby/moby/issues/30341 and where blindly using --rm, now this is fixed the prepare container disappears and thus activation fail. I'm fixing this for old jewel images. Also this fixes the machine reboot case where the docker logs are purgend. In the old scenario, we now store the log locally in the same directory as the ceph-osd-run.sh script. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-11-03 11:15:23 +01:00
Sébastien Han	ab7eb79212	config: fix monitor_interface when not passed in the inventory file Setting monitor_interface in group_vars/all.yml makes the hostvars[host]['monitor_interface'] non-existing. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1507922 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-11-03 09:25:02 +01:00
Jan Provaznik	589cd27ce4	Include ganesha dbus config file This file was (accidentally) not included in a previous commit `87b1da09e7`.	2017-10-31 08:30:12 +01:00
Sébastien Han	faccd0acf0	Merge pull request #2100 from ceph/lvm-bluestore ceph-volume lvm bluestore support	2017-10-27 17:36:16 +02:00
Alfredo Deza	517a2b3feb	ceph-osd skip lvm creation if they are already in use Signed-off-by: Alfredo Deza <adeza@redhat.com>	2017-10-27 11:33:54 -04:00
Sébastien Han	6ea92756c0	Merge pull request #2117 from ceph/rm-dup default: remove dup variable	2017-10-27 13:49:52 +02:00
Sébastien Han	d2575c7f5e	default: remove dup variable ceph_repository_type was declared multiple times. This commit fixes this. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-27 11:46:15 +02:00
Sébastien Han	d6a0d2f9be	Merge pull request #2071 from jtaleric/master Docker image pull retry	2017-10-27 09:49:03 +02:00
Sébastien Han	5a10b048b0	Merge pull request #2105 from major/really-fix-always-run Really fix always run	2017-10-27 09:33:47 +02:00
John Fulton	ae156e9f34	Make acls and mode parameters of opentack_keys optional Only chmod or setfacl the requested keyring(s) in the opentack_keys data structure when the mode or acls keys of that data structure exist. User may specify four permission combinations for the keyring file(s): 1. only set ACL, 2. only set mode, 3. set neither mode nor ACL, 4. set mode and then ACL. Fixes: #2092	2017-10-26 12:45:17 +00:00
Joe Talerico	ab58764288	Docker image pull retry This change sets a default timeout of 300s for the image pull. If the image pull times out (300s), we will retry 3 times by default. fixes 1954	2017-10-25 13:37:10 -04:00
Sébastien Han	5f9e50dabe	Merge pull request #2103 from andymcc/tcmalloc_settings Option to set TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES	2017-10-25 17:36:04 +02:00
Sébastien Han	613b6a30f1	Merge pull request #2104 from ceph/rgw-section rgw/nfs: fix section duplication	2017-10-25 17:35:01 +02:00
Sébastien Han	07e2a783f8	Merge pull request #2084 from ceph/backward-osd-2.4 osd: bring backward compatibility with old Jewel images	2017-10-25 17:33:49 +02:00
Major Hayden	f73232caa4	Use check_mode instead of always_run This patch changes the `always_run: yes` task option to `check_mode: no` to avoid Ansible warnings.	2017-10-25 09:53:34 -05:00
Major Hayden	c2b5118c1b	Revert "Avoid deprecated always_run" This reverts commit `620fb37dd4`.	2017-10-25 09:48:09 -05:00
Sébastien Han	8670b45ef2	rgw/nfs: fix section duplication Once and for all, hopefully... Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-25 15:45:37 +02:00
Andy McCrae	7f6c39102d	Option to set TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES Use "ceph_tcmalloc_max_total_thread_cache" to set the TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES value inside /etc/default/ceph for Debian installs, or /etc/sysconfig/ceph for Red Hat/CentOS installs. By default this is set to 0, so the default package value will be used, if specified this value will be changed to match the variable, and ceph osd services will be restarted.	2017-10-25 14:38:36 +01:00
Alfredo Deza	d3b427e169	ceph-osd lvm scnearios are no longer limited to filestore Signed-off-by: Alfredo Deza <adeza@redhat.com>	2017-10-25 08:23:45 -04:00
Alfredo Deza	df05e63c10	ceph-osd use --cluster in ceph-volume calls Signed-off-by: Alfredo Deza <adeza@redhat.com>	2017-10-25 08:23:45 -04:00
Alfredo Deza	628d98a92c	ceph-osd add the CEPH_VOLUME_DEBUG env var to all ceph-volume commands Signed-off-by: Alfredo Deza <adeza@redhat.com>	2017-10-25 06:50:22 -04:00
Alfredo Deza	b89309e2a3	ceph-osd update the examples in defaults for lvm bluestore Signed-off-by: Alfredo Deza <adeza@redhat.com>	2017-10-25 06:46:39 -04:00
Alfredo Deza	bbc3672253	ceph-osd: lvm support for bluestore Signed-off-by: Alfredo Deza <adeza@redhat.com>	2017-10-25 06:46:39 -04:00
Guillaume Abrioux	f21859656b	Merge pull request #2102 from yanyixing/fix_miss_word add the miss word	2017-10-25 10:49:38 +02:00
Yixing Yan	b6296c13ac	update sample file	2017-10-25 16:39:08 +08:00
Sébastien Han	049729b8d3	Merge pull request #2097 from fultonj/issue/2095 Require osd_scenario parameter to be provided in containerized deploy	2017-10-24 13:59:51 +02:00
Sébastien Han	751da93b08	Merge pull request #2096 from andymcc/regex_defaults Add regexp check for setting CLUSTER_NAME	2017-10-23 17:24:44 +02:00
John Fulton	7a7ddab6c2	Require osd_scenario parameter to be provided in containerized deploy Fixes: #2095	2017-10-23 15:16:03 +00:00
Andy McCrae	9ebef8ba3c	Add regexp check for setting CLUSTER_NAME Minor fix to ensure that existing CLUSTER_NAME is changed, and avoid duplicates.	2017-10-23 14:42:07 +01:00
Andy McCrae	05a1f965c8	Typo fix for radosgw@ systemd file systemd script for radosgw is radosgw@ not rgw@, the directory needs to match the path.	2017-10-23 14:07:23 +01:00
Jan Provaznik	291e6b604d	ceph-nfs - add bind address variable	2017-10-23 09:34:51 +02:00
Sébastien Han	968ef04324	osd: bring backward compatibility with old Jewel images There was a huge resync from luminous to jewel in ceph-docker: https://github.com/ceph/ceph-docker/pull/797 This change brought a new handy function to discover partitions tight to an OSD. This function doesn't exist in the old image so the ceph-osd-run.sh script breaks when trying to deploy Jewel OSD with that old Jewel image version. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-20 16:26:41 +02:00
Sébastien Han	54de2efc5d	Merge pull request #2082 from ceph/restapi-cephconf common: move restapi template to config	2017-10-20 14:07:48 +02:00
Sébastien Han	4413511b66	all: backward compatibility between stable-2.2 and 3.0 stable-3.0 brought numerous changes in ceph-ansible variables, this PR aims to maintain backward compatibility for someone running stable-2.2 upgrading to stable-3.0 but keeps its groups_vars untouched. We will then determine the right options to make sure the upgrade works but we are expecting that new variables should be used. We will drop this in a near future, maybe 3.1 or 3.2. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-20 11:54:10 +02:00
Sébastien Han	fccb9472cd	mgr: force module addition Some module require --force to be enabled. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-20 11:54:10 +02:00
Sébastien Han	ba5c6e66f0	common: move restapi template to config Closes: github.com/ceph/ceph-ansible/issues/1981 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-20 11:14:13 +02:00
Guillaume Abrioux	5b1087f1e5	mgr: play 'enable modules' sequence only on luminous This feature isn't available before luminous, therefore, we need to play them only on luminous and after otherwise the playbook will fail. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit 3f3d4b9c727d06154c422d445fc2a245aceaed89)	2017-10-19 20:54:23 +02:00
Sébastien Han	c527515502	Merge pull request #2000 from ceph/merge-osd-scenarios [skip ci] ci: new osd scenarios	2017-10-19 09:18:02 +02:00
Guillaume Abrioux	ff228e2d88	mgr: fix broken task on jewel `3a58757` introduced an issue for Jewel deployments, since this role is skipped, `enabled_ceph_mgr_modules.stdout` doesn't exist, therefore, it ends up with an attribute error. Uses `.get()` to retrieve `stdout` with a default value so it won't fail if this attribute doesn't exist (jewel). Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-18 14:11:46 +02:00
Sébastien Han	1579f1c5b1	Merge pull request #2073 from ceph/fix_rbd_handler [skip ci] rbd: fix restart script for jewel	2017-10-18 11:12:05 +02:00
Guillaume Abrioux	c2850b11be	rbd: fix restart script for jewel In Jewel, we don't use bootstrap-rbd keyring for rbd-mirror nodes, it results with a socket path/name different according to which ceph release you are deploying. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-18 11:10:49 +02:00
Sébastien Han	2936d152c9	Merge pull request #2053 from Fbrachere/mgr-modules Add ability to enable ceph mgr modules.	2017-10-18 10:27:31 +02:00
Sébastien Han	a53aa9e8b4	ci: new osd scenarios This commit add new osd scenarios, it aims to simplify the CI setup and brings a better coverage on the OSD scenarios. We decided to differentiate between filestore and bluestore, thinking ahead when filestore won't be supported anymore. So we now have two classes of tests: * Filestore * Bluestore In each of those classes we have container and non-container. Then for each we test the following: * collocated * collocated dmcrypt * non-collocated * non-collocated dmcrypt * auto discovery collocated * auto discovery collocated dmcrypt This gives us a nice coverage and also reduces the footprint on the CI. We are now up to 4 scenarios, each containing 6 OSD VMs. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-18 09:26:06 +02:00
Sébastien Han	90b75185d5	defaults: fix handlers for collocation When doing collocation the condition "inventory_hostname in play_hosts" is breaking the restart workflow. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-17 19:23:16 +02:00
Guillaume Abrioux	2aa53fb0f5	Merge pull request #2055 from ceph/update-mirror-nfs upgrade: support for rbd mirror and nfs	2017-10-17 14:51:39 +02:00
Christian Berendt	4c380c9ef8	Cleanup readme files in roles directories The contents of the README files are no longer up to date. Documentation for all roles is located below the docs directory.	2017-10-17 11:22:06 +02:00
Sébastien Han	d920d4839d	upgrade: support for rbd mirror and nfs - Add upgrade support for rbd mirror and nfs daemons. - Only works with systemd (remove sysvinit and upstart occurence) - A bit of cleanup Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-17 10:54:47 +02:00
Christian Berendt	cf901f0171	In docker start scripts replace \u00a0 with \u0020 This will solve the following issue when starting docker containers on ubuntu: invalid argument "1\u00a0" for --cpus=1 : failed to parse 1 as a rational number Closes-bug: #2056	2017-10-16 15:16:48 +02:00
Fabien Brachere	3a587575d7	Add ability to enable ceph mgr modules.	2017-10-16 15:04:23 +02:00
Guillaume Abrioux	7ee9aa94b5	Merge pull request #1963 from ceph/pull-in-para site-docker.yml try to fetch images in //	2017-10-13 19:35:11 +02:00
Sébastien Han	71d819620c	mds: fix fs pool creation 1. add the variables to docker_collocation 2. trigger the check when a MDS is part of the inventory file, not when we run on an MDS... Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-13 16:03:04 +02:00
Sébastien Han	b34a04ea41	site-docker.yml try to fetch images in // The container deployment is serialized, adding this task as a best effort. If docker is already present we pull the image otherwise we wait for the role to play. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-13 11:24:40 +02:00
Guillaume Abrioux	7d4b3f9989	Merge pull request #2047 from ceph/enable_ceph-rbd-mirror.target rbd-mirror: enable ceph-rbd-mirror.target	2017-10-13 10:34:10 +02:00
Sébastien Han	f7832e5eb9	Merge pull request #2031 from major/simplify-ntp Simplify NTP checks/install	2017-10-13 09:16:20 +02:00
Guillaume Abrioux	59ca1065e9	rbd-mirror: enable ceph-rbd-mirror.target on jewel `ceph-rbd-mirror.target` isn't enabled, therefore, if the node is rebooted, the service doesn't get started. from ceph-rbd-mirror unit file: ``` [Install] WantedBy=ceph-rbd-mirror.target ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-13 08:27:43 +02:00
Sébastien Han	b685aceede	Merge pull request #2044 from major/avoid-jinja-in-when Remove jinja2 delimiters from `when` keys	2017-10-12 22:23:06 +02:00
Major Hayden	a1c76e834c	Simplify NTP checks/install This patch simplifies the checks and installation tasks for NTP. Debian and Red Hat had a check for NTP's presence but would then install NTP right afterwards anyways. In addition, there were tasks for atomic that weren't used anywhere else in the role. This patch also uses a dynamic include to reduce delays from skipped tasks.	2017-10-12 12:31:07 -05:00
Sébastien Han	9c3d749f7c	Merge pull request #2038 from major/fix-cmd-warning Suppress yum/dnf/rpm command warnings	2017-10-12 18:46:52 +02:00
Major Hayden	c01851325e	Remove jinja2 delimiters from `when` keys This patch changes the `when:` keys so that they have no jinja2 delimiters. This avoids Ansible warnings which could turn into errors in a future Ansible release.	2017-10-12 11:27:42 -05:00
Guillaume Abrioux	17623a2157	Merge pull request #2036 from ceph/cephfs-pool mds: precisely define cephfs pool	2017-10-12 17:47:10 +02:00
Sébastien Han	b49f9bda21	mds: precisely define cephfs pool We now have a variable called ceph_pools that is mandatory when deploying a MDS. It's a dictionnary that contains a pool name and a PG count. PG count is mandatory and must be set, the playbook will fail otherwise. Closes: https://github.com/ceph/ceph-ansible/issues/2017 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-12 15:56:04 +02:00
Major Hayden	33b200d43a	Suppress yum/dnf/rpm command warnings Ansible throws warnings when using yum/dnf/rpm with the command module: [WARNING]: Consider using yum module rather than running yum This patch adds the `warn: no` argument to suppress the warnings in the Ansible output.	2017-10-12 08:38:05 -05:00
Major Hayden	620fb37dd4	Avoid deprecated always_run The `always_run` key is deprecated and being removed in Ansible 2.4. Using it causes a warning to be displayed: [DEPRECATION WARNING]: always_run is deprecated. This patch changes all instances of `always_run` to use the `always` tag, which causes the task to run each time the playbook runs.	2017-10-12 08:29:44 -05:00
Sébastien Han	739a41ae91	Merge pull request #2030 from major/ceph-common-pass-pkgs-as-list Pass list of packages instead of with_items	2017-10-12 09:15:58 +02:00
Major Hayden	9d62630303	Pass list of packages instead of with_items Modern versions of Ansible can handle a list of packages passed directly to the package modules. This patch optimizes the package install process by passing the list of packages directly to the module.	2017-10-11 12:18:15 -05:00
Sébastien Han	aa70b07ae2	config: proper render ceph.conf when doing collocation Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-11 18:29:34 +02:00
Sébastien Han	f50b170a49	Merge pull request #2022 from ceph/fix-purge-iscis [skip ci] purge-iscsi: fix group name	2017-10-11 14:21:19 +02:00
Sébastien Han	d0a9e57bfc	osd: rollback bindmount of /run/udev This is causing unknown issues when trying to start a dmcrypt container. Basically the container is stuck at mount opening the LUKS device. This is still unknown why this is causing trouble but we need to move forward. Also, this doesn't seem to help in any ways to fix the race condition we've seen. Here is the log for dmcrypt: cryptsetup 1.7.4 processing "cryptsetup --debug --verbose --key-file key luksClose fbf8887d-8694-46ca-b9ff-be79a668e2a9" Running command close. Locking memory. Installing SIGINT/SIGTERM handler. Unblocking interruption on signal. Allocating crypt device context by device fbf8887d-8694-46ca-b9ff-be79a668e2a9. Initialising device-mapper backend library. dm version [ opencount flush ] [16384] (1) dm versions [ opencount flush ] [16384] (1) Detected dm-crypt version 1.14.1, dm-ioctl version 4.35.0. Device-mapper backend running with UDEV support enabled. dm status fbf8887d-8694-46ca-b9ff-be79a668e2a9 [ opencount flush ] [16384] (1) Releasing device-mapper backend. Trying to open and read device /dev/sdc1 with direct-io. Allocating crypt device /dev/sdc1 context. Trying to open and read device /dev/sdc1 with direct-io. Initialising device-mapper backend library. dm table fbf8887d-8694-46ca-b9ff-be79a668e2a9 [ opencount flush securedata ] [16384] (1) Trying to open and read device /dev/sdc1 with direct-io. Crypto backend (gcrypt 1.5.3) initialized in cryptsetup library version 1.7.4. Detected kernel Linux 3.10.0-693.el7.x86_64 x86_64. Reading LUKS header of size 1024 from device /dev/sdc1 Key length 32, device size 1943016847 sectors, header size 2050 sectors. Deactivating volume fbf8887d-8694-46ca-b9ff-be79a668e2a9. dm status fbf8887d-8694-46ca-b9ff-be79a668e2a9 [ opencount flush ] [16384] (1) Udev cookie 0xd4d14e4 (semid 32769) created Udev cookie 0xd4d14e4 (semid 32769) incremented to 1 Udev cookie 0xd4d14e4 (semid 32769) incremented to 2 Udev cookie 0xd4d14e4 (semid 32769) assigned to REMOVE task(2) with flags (0x0) dm remove fbf8887d-8694-46ca-b9ff-be79a668e2a9 [ opencount flush retryremove ] [16384] (1) fbf8887d-8694-46ca-b9ff-be79a668e2a9: Stacking NODE_DEL [verify_udev] Udev cookie 0xd4d14e4 (semid 32769) decremented to 1 Udev cookie 0xd4d14e4 (semid 32769) waiting for zero Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-11 13:21:37 +02:00
Major Hayden	10e1d464e5	Remove duplicate 'package' key This patch fixes a typo where "package:" was used twice in the same task.	2017-10-10 15:39:20 -05:00
Sébastien Han	f6d1be269f	Merge pull request #2015 from ceph/fix_nfs-ganesha-repos nfs: move repository configuration in ceph-nfs role	2017-10-10 17:15:33 +02:00
Guillaume Abrioux	5dc9c640e8	nfs: add missing condition for debian_rhcs in addition to `c4dcdaa20` this commit adds the missing condition on install tasks for debian_rhcs deployment. Without them, these tasks are played on any kind of deployment. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-10 16:27:00 +02:00
Jan Provaznik	87b1da09e7	Ceph-nfs dynamic exports fixes * DBus on host should include ganesha service file * to allow ganesha container to respond on DBus it needs to run in --privileged mode (ganesha folks contacted to look at this) * ceph_nfs_include_exports_dir variable replaced with more general ceph_nfs_dynamic_exports	2017-10-10 13:59:01 +02:00
Guillaume Abrioux	fbd1a57b11	iscsi-gw: move repository configuration to ceph-iscsi-gw This is something that has nothing to do in `ceph-common`, this is too specific to `ceph-iscsi-gw` role. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-10 11:36:03 +02:00
Guillaume Abrioux	c4dcdaa201	nfs: move repository configuration in ceph-nfs role This is something that has nothing to do in `ceph-common`, this is too specific to `ceph-nfs` role. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-10 11:35:58 +02:00
Guillaume Abrioux	9e8204d9e8	nfs: move packages installation to own role Make role `ceph-nfs` handling itself the installation of nfs packages. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-09 19:10:15 +02:00
Guillaume Abrioux	3c64abe07d	mds: move installation packages in role itself Make role `ceph-mds` handling itself the installation of `ceph-mds` package. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-09 17:25:46 +02:00
Sébastien Han	4032f102fe	iscsi: move package install to ceph-iscsi-role Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-09 17:25:46 +02:00
Guillaume Abrioux	1581a1c078	mgr: move installation packages in role itself Make role `ceph-mgr` handling itself the installation of `ceph-mgr` package because it's complicated to manage it regarding we are going to install `jewel vs. luminous` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-09 17:25:45 +02:00
Sébastien Han	bf99751ce1	osd: bindmount /run/udev Ensures that "udevadm" is able to check the status of udev's event queue. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-09 17:25:45 +02:00
Sébastien Han	1bd891232c	config: do not duplicate sections when doing collocation Prior to this commit, when collocating a RGW and NFS on the same box the ceph.conf layout was the following: [client.rgw.rgw0] host = mds0 host = rgw0 rgw frontends = civetweb port=192.168.15.50:8080 num_threads=100[client.rgw.mds0] rgw frontends = civetweb port=192.168.15.70:8080 num_threads=100 rgw frontends = civetweb port=192.168.15.50:8080 num_threads=100 keyring = /var/lib/ceph/radosgw/test-rgw.mds0/keyring keyring = /var/lib/ceph/radosgw/test-rgw.rgw0/keyring rgw data = /var/lib/ceph/radosgw/test-rgw.rgw0 log file = /var/log/ceph/test-rgw-mds0.log log file = /var/log/ceph/test-rgw-rgw0.log [mds.mds0] host = mds0 [global] rgw override bucket index max shards = 16 fsid = 70e1d368-57b3-4978-b746-cbffce6e56b5 rgw bucket default quota max objects = 1638400 osd_pool_default_size = 1 public network = 192.168.15.0/24 mon host = 192.168.15.10,192.168.15.11,192.168.15.12 osd_pool_default_pg_num = 8 cluster network = 192.168.16.0/24 [mds.rgw0] host = rgw0 [client.rgw.mds0] host = mds0 rgw data = /var/lib/ceph/radosgw/test-rgw.mds0 keyring = /var/lib/ceph/radosgw/test-rgw.mds0/keyring rgw frontends = civetweb port=192.168.15.70:8080 num_threads=100 log file = /var/log/ceph/test-rgw-mds0.log Basically appending all the sections. This commits solves that. Now the sections appear like this: -bash-4.2# cat /etc/ceph/test.conf [client.rgw.rgw0] log file = /var/log/ceph/test-rgw-rgw0.log host = rgw0 keyring = /var/lib/ceph/radosgw/test-rgw.rgw0/keyring rgw frontends = civetweb port=192.168.15.50:8080 num_threads=100 [client.rgw.mds0] log file = /var/log/ceph/test-rgw-mds0.log host = mds0 keyring = /var/lib/ceph/radosgw/test-rgw.mds0/keyring rgw frontends = civetweb port=192.168.15.70:8080 num_threads=100 [global] cluster network = 192.168.16.0/24 mon host = 192.168.15.10,192.168.15.11,192.168.15.12 osd_pool_default_size = 1 public network = 192.168.15.0/24 rgw bucket default quota max objects = 1638400 osd_pool_default_pg_num = 8 rgw override bucket index max shards = 16 fsid = 77a21980-3033-4174-9264-1abc7185bcb3 [mds.rgw0] host = rgw0 [mds.mds0] host = mds0 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-09 17:25:44 +02:00
Sébastien Han	7054abef99	Merge pull request #2009 from ceph/fix-clean-pg [skip ci] handler: do not test if pgs_num = 0	2017-10-07 03:39:26 +02:00
Sébastien Han	9f1bd3d6dd	handler: add serial restart back We now restart daemons on each machine in a serialized fashion. Closes: https://github.com/ceph/ceph-ansible/issues/1989 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-07 03:39:10 +02:00
Sébastien Han	a4dcef73d4	common: fix debian rhcs installation Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-07 03:39:09 +02:00
Sébastien Han	c693e95cbf	purge-docker: rework device detection we don't need "devices" and other device variable anymore, the playbook detects that for us. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-07 03:39:04 +02:00
Sébastien Han	ac29e8f977	Merge pull request #1983 from jprovaznik/suffix Allow to override systemd service instance id	2017-10-06 22:40:57 +02:00
Sébastien Han	5d39f378da	Merge pull request #1984 from jprovaznik/exportdir Include exports dir in ceph-nfs config file	2017-10-06 22:38:13 +02:00
Ali Maredia	28862a99d9	nfs: missing conditional for setting rgw key permissions Signed-off-by: Ali Maredia <amaredia@redhat.com>	2017-10-06 15:21:35 -04:00
Sébastien Han	11f51df1fc	Merge pull request #2005 from ceph/wip-nfs-export-id nfs: config var changes	2017-10-06 17:05:21 +02:00
Sébastien Han	779f642fa8	use get to check stdout_lines During the initial play, the docker command doesn't not exist and then there is no stdout_lines to the command. So get allows us to fix this by declaring an array if the command fails. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-06 16:57:46 +02:00
Sébastien Han	d5ae0a3340	handler: do not test if pgs_num = 0 We don't need to wait if they are no PGS. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-06 16:57:46 +02:00
Guillaume Abrioux	6b027557e6	osd: fix `set_fact build dedicated_devices` Use an intermediate variable to build the final `dedicated_devices` list to avoid duplicate entry in that array. (We need a 1:1 relation between `dedicated_devices` and `devices` since we are using a `with_together` later. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-06 15:00:32 +02:00
Guillaume Abrioux	d363b0f741	rbd: fix bug when trying to fetch key With jewel, `bootstrap_rbd_keyring` is not set because of this condition: ``` when: - ceph_release_num.{{ ceph_release }} >= ceph_release_num.luminous ``` Therefore, the task `try to fetch ceph config and keys` will fail. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-06 11:34:29 +02:00
Jan Provaznik	3c16af5ef2	Allow to override systemd service instance id It's useful to have constant service instance id when ceph-nfs is managed by pacemaker.	2017-10-06 08:20:37 +02:00
Ali Maredia	0c09cd3e2e	nfs: config var changes - remove unused ganesha config vars, - set different default Export_ids for each FSAL Signed-off-by: Ali Maredia <amaredia@redhat.com>	2017-10-05 16:51:23 -04:00
Sébastien Han	1121a840ef	Merge pull request #2003 from ceph/debian-iso [skip ci] common: iso install on Debian is supported by rhcs	2017-10-05 18:57:47 +02:00
Sébastien Han	feaf5ff9c6	common: iso install on Debian is supported by rhcs Also adds support for RCSH installation on Debian. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-05 18:57:09 +02:00
Sébastien Han	425ecb3c7d	common: fix ga verison for debian rhcs Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-05 18:45:30 +02:00
Sébastien Han	639389b9cd	Merge pull request #1985 from ceph/debian-rhcs [skip ci] common: fix rhcs installation on debian	2017-10-05 18:42:46 +02:00
Sébastien Han	0d833657c1	Merge pull request #2001 from ceph/iscsi iscsi: fix wrong group name for iscsi	2017-10-05 18:29:06 +02:00
Sébastien Han	29888649e5	osd: do not do unique on dedicated_devices This is needed later, if we do unique, only the first OSD will get a journal. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-05 18:20:18 +02:00
Sébastien Han	9193e88878	common: fix rhcs installation on debian * Change version from 2 to 3. * use ceph_rhcs_cdn_debian_repo_version to use other repositories along * with ceph_rhcs_cdn_debian_repo Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-05 17:42:21 +02:00
Sébastien Han	b6b24a5ca9	iscsi: fix wrong group name for iscsi Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1498490 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-05 17:25:32 +02:00
Sébastien Han	9304bb6c74	Merge pull request #1997 from rrmichel/osd_fragment Fixing path to osd_fragment.yml	2017-10-05 15:58:49 +02:00
Sébastien Han	164c77acd1	Merge pull request #1995 from ceph/remove-rbd-check jewel: remove rbd check	2017-10-05 15:31:48 +02:00
Guillaume Abrioux	8fb68297a2	common: remove unusuable conditions `ceph_release` isn't available at this step of the playbook because it is set later based on the installed binaries. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1486062 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-05 14:59:37 +02:00
Sébastien Han	c803dedec8	Merge pull request #1993 from jprovaznik/log Fix bind mount for /var/lib/nfs/ganesha directory	2017-10-05 14:43:26 +02:00
Michel Rode	b462b68e65	Fixing path to osd_fragment.yml	2017-10-05 14:42:10 +02:00
Jan Provaznik	b8916ecbc1	Include exports dir in ceph-nfs config file Exports dir is used when dynamic exports creation is enabled.	2017-10-05 14:37:15 +02:00
Sébastien Han	b545080d71	Merge pull request #1988 from ceph/fix_keyrings docker: fix keyrings copied on all nodes	2017-10-05 14:30:09 +02:00
Sébastien Han	bbf6bebe32	jewel: remove rbd check The value of doing this is fairly low compare to the added value. So we remove these tasks, if rbd pool on Jewel doesn't have the right PG value you can always increase it. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-05 14:21:37 +02:00
Jan Provaznik	62ea6f6e7f	Fix bind mount for /var/lib/nfs/ganesha directory	2017-10-05 13:44:43 +02:00
Jan Provaznik	43e57abfd8	Evaluate cephfs pool variables Otherwise pools with names 'cephfs_data' and 'cephfs_metadata' are created.	2017-10-05 10:00:20 +02:00
Guillaume Abrioux	70e2787fe2	docker: fix keyrings copied on all nodes All keyring are getting copied to all nodes. This commit fixes a leftover from a previous code refactor. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1498583 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-05 09:23:22 +02:00
Guillaume Abrioux	8fac8f54a6	iscsi-gw: Create a rbd pool if it doesn't exist iscsi-gw needs a 'rbd' pool to configure iscsi target. Note: I could have used the facts already set in `ceph-mon` but I voluntarily didn't do it to not create a dependancy between these two roles. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-04 15:40:10 +02:00
Guillaume Abrioux	2c4258a0fd	Refact code for set_osd_pool_default_* This commit refacts the code regarding all `set_osd_pool_default_*` related tasks by avoiding usage of useless `set_fact` to determine whether a key is present in `ceph_conf_overrides`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-04 15:40:10 +02:00
Al Lau	6aca67bc9c	Only perform actions on the rbd pool after it has been created The rbd pool is the default pool that gets created during ceph cluster initializaiton. If we act on the rbd related operations too early, the rbd pool does not exist yet. Move the call to perform rbd operations to a later stage after other pools have been created. The rbd_pool.yml playbook has all the operations related to the rbd pool. Replace the always_run (deprecated) directive with check_mode. Most of the ceph related tasks only need to run once. The run_once directive executes the task on the first host. The ceph sub-command to delete a pool is delete (not rm). The changes submitted here were tested with this ceph version. ceph version 0.94.9-9.el7cp (b83334e01379f267fb2f9ce729d74a0a8fa1e92c) This upload includes these changes: - Use the fail module (instead of assert). - From luminous release, the rbd pool is no longer created by default. Delete the code to create the rbd pool for luminous release - Conform the .yml files to use the suggested syntax. The commands are executed on the mcp nodes and I think shell ansible module is the right one to use. The command module is used to execute commands on remote nodes. I can make the change to use command module if that is prefrerred.	2017-10-04 15:40:10 +02:00
Sébastien Han	cac7d034bf	defaults: fix check socket non-container handler Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-04 15:33:52 +02:00
Sébastien Han	c751c2dc6b	nfs: add run once to user creation The create user call is idempotent but it's also blocking for some reasons. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-04 15:01:13 +02:00
Guillaume Abrioux	784cc73da0	set docker_exec_cmd fact early in each role This is to ensure `docker_exec_cmd` fact is set with the correct value in case of daemons collocation Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-04 11:31:09 +02:00
Sébastien Han	5968cf09b1	ci: add collocation scenario Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-04 11:19:12 +02:00
Sébastien Han	f37e014a65	Merge pull request #1974 from ceph/mgr-upgrade-luminous upgrade: a support for mgrs	2017-10-03 19:57:31 +02:00
Sébastien Han	0ce76113bf	Merge pull request #1956 from ceph/osd-container-id Osd container	2017-10-03 18:52:24 +02:00
Sébastien Han	99466e79a1	upgrade: a support for mgrs Also we now play ceph-config to have everything being generated for new daemons bootstrap during upgrade. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1497959 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-03 16:57:31 +02:00
Sébastien Han	27808a64a4	iscsi: fix when condition generate_crt\|bool\|default(false) won't apply the default value, this generate_crt\|default(false)\|bool will Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-03 16:48:17 +02:00
Sébastien Han	3bd341f6c0	osd: container use id instead of dev name Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1494127 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-03 14:44:00 +02:00
Sébastien Han	ba42894516	osd: do not copy admin key on collocated scenario ceph-disk used to have a bug requiring the admin key to store the encrypted key in the mon kv store. This was reported in: http://tracker.ceph.com/issues/17849 Fixed and backported here: https://github.com/ceph/ceph/pull/11996 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-03 14:44:00 +02:00
Guillaume Abrioux	081f226106	defaults: change running order in main.yml The task which sets `ceph_current_fsid` in `facts.yml` in case of containerized deployment, will definitely fail because `docker_exec_cmd` is not set yet. This commits simply makes `facts.yml` played after `check_socket.yml` so `docker_exec_cmd` is set properly. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-02 18:42:43 +02:00
Sébastien Han	30ce781c79	Merge pull request #1968 from ceph/bz-1488999 refact MDS role	2017-10-02 14:42:08 +02:00
Guillaume Abrioux	62770cd7de	refact MDS role This commits refacts the role ceph-mds The goal here is to create cephfs in `ceph-mon` for both containerized and non-containerized cases so we don't need the admin keyring on mds nodes anymore. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1488999 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-02 09:12:31 +02:00
Sébastien Han	46a01df434	osd: add cluster name support I forgot to add cluster name support so some partition were never mounted correctly. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-29 20:30:54 +02:00
Sébastien Han	0da6d8e356	Merge pull request #1967 from ceph/use_systemd_module Use systemd module instead of service.	2017-09-29 16:35:10 +02:00
Guillaume Abrioux	466f6f35b7	Use systemd module instead of service. Using systemd module allows us to do in one task what we did in three tasks: - enable unit file, - issue a `daemon-reload`, - start the service Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-29 14:54:00 +02:00
Sébastien Han	e121bc58e9	defaults: add missing handlers for rbd mirorr and mgr Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-29 02:38:24 +02:00
Sébastien Han	048b55be4a	defaults: only run socket checks on their specific roles Running the socket check on all the hosts will override the default value of docker_exec_cmd, leaving it with the last value (currently rbd-mirror), as a result the subsequent docker_exec_cmd usage for the :x Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-29 02:38:24 +02:00
Sébastien Han	341c9e077b	nfs: fix container setup and re-arrange files Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-29 02:38:24 +02:00
Sébastien Han	fc29ccd0ad	rbd-mirror: force sercice enable ceph-rbd-mirror.target There is a bug in the rbd mirror unit file, the upstream fix is here: https://github.com/ceph/ceph/pull/17969. This should be reverted once the patch is merged and backport is done. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-29 02:38:24 +02:00
Sébastien Han	deb5d3ba1f	Merge pull request #1962 from ceph/fix_mgr_sestatus [skip ci] mgr: add condition to run selinux tasks only on rhel os family	2017-09-29 02:37:03 +02:00
Guillaume Abrioux	913ad53709	docker: add condition to run selinux tasks only on rhel os family This fixes the error : ``` The conditional check 'sestatus.stdout != 'Disabled'' failed. ``` that occurs when running on non rhel based system since the `sestatus` fact is registered only on rhel based distribution. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-29 02:35:07 +02:00
Sébastien Han	77fc8ba87f	Merge pull request #1931 from ceph/re-enable-iscsi iscsi: re-enable the scenario	2017-09-28 19:44:52 +02:00
Sébastien Han	67c78da056	iscsi: re-enable the scenario CentOS 7.4 vagrant box is now available so re-enabling this scenario. For more info: https://seven.centos.org/2017/09/updated-centos-vagrant-images-available-v1708-01/ Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-28 18:46:28 +02:00
Sébastien Han	0010979412	Merge pull request #1641 from fullerdj/wip-djf-key-timeout mon/ceph_keys: Add timeout flag to ceph-create-keys	2017-09-28 09:40:50 +02:00
Guillaume Abrioux	d20dc54202	docker-common: fix wrong syntax there is no need to backslash the quotes here. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-28 00:30:08 +02:00
Douglas Fuller	9bcbf748a3	mon/ceph_keys: Add timeout flag to ceph-create-keys Specify the timeout flag to ceph-create-keys, which causes it to time out if a monitor quorum isn't achieved. This overrides the default timeout of 10 minutes, causing ceph-ansible to fail faster in the event of cluster network issues. Signed-off-by: Douglas Fuller <dfuller@redhat.com>	2017-09-27 18:05:59 -04:00
Zack Cerza	70b321f34c	ceph-common: Fix logic for ceph_repository_type It's failing if a valid choice is specified. Signed-off-by: Zack Cerza <zack@redhat.com>	2017-09-25 15:28:27 -06:00
Sébastien Han	e4ac736071	Merge pull request #1943 from ceph/mgr-site handler: enhance socket detection	2017-09-25 18:43:32 +02:00
Sébastien Han	4266bb5d3f	Merge pull request #1933 from ceph/osd-container-reboot [skip ci] osd: fix container reboot	2017-09-25 18:36:25 +02:00
Sébastien Han	8b6456dc8a	handler: enhance socket detection We have seen issues with leftover socker. So now, if a socket is found we also check if it's accessed by a process. If so, we can run the handler, if not we remove it and continue the playbook. Signed-off-by: Sébastien Han <seb@redhat.com> Co-Authored-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-25 13:44:51 +02:00
Sébastien Han	45797ab968	osd: fix container reboot It's sad but we can not rely on the prepare container anymore since the log are flushed after reboot. So inpecting the container does not return anything. Now, instead we use a ephemeral container to look up for the journal/block.db/block.wal (depending if filestore or bluestore) and build the activate command accordingly. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-25 13:34:47 +02:00
Guillaume Abrioux	be757122f1	config: fix path to set `interface` in ceph.conf need to use `hostvars[host]['XXX']` to retrieve the monitor interface and/or radosgw interface. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1493920 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-23 14:18:28 +02:00
Sébastien Han	f3851df0c7	Merge pull request #1940 from ceph/rgw-interface config: fix rgw interface when using different interfaces	2017-09-22 18:52:51 +02:00
Sébastien Han	8f71c08e7b	handler: display ceph status properly Fix bash error, doing ceph "$CEPH_CLI" -s gives us ceph '--name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/test.keyring --cluster test' -s which results in a wrongly formatted command. Removing the double quotes expands the array properly. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-22 17:45:35 +02:00
Sébastien Han	2e0c2928e9	nfs: fix docker_exec_cmd_nfs default value the default is not an array, default is empty. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-22 16:22:36 +02:00
Sébastien Han	4a55085914	config: fix rgw interface when using different interfaces Conf file generation failing on rgw nodes when nodes have different interface names. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1493552 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-22 15:41:11 +02:00
Sébastien Han	64824baa83	nfs: fix undefined variable This is what happens when you don't run all the jobs from the CI... Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-22 15:37:43 +02:00
Sébastien Han	839bc11df0	Merge pull request #1923 from ceph/nfs-container [skip ci] tests: add nfs container test	2017-09-22 12:22:10 +02:00
Sébastien Han	aa5c36f19c	nfs: several fixes - move the file fetch/push to the existing task - rename the include - generate the ganesha template from ansible - re-arrange role structure - re-use tasks for non-container and container - configure keys for non-container and container - fix rgw container key collection; Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-22 00:37:32 +02:00
Guillaume Abrioux	599429dd31	common: fix debian install in addition to #1926 this commit fixes the error when trying to include `install_debian_rhcs_packages.yml` Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1493231 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-21 13:26:29 +02:00
Guillaume Abrioux	b8c3fa9727	nfs: change ownership on /var/log/ganesha to fix selinux capability issue that prevent nfs to start. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-21 09:56:37 +02:00
Guillaume Abrioux	1886a69b8b	docker-common: refact `stat_ceph_files.yml` there is no need to build the `ceph_config_keys` fact in several steps for rbd-mirror keyring. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-21 09:56:37 +02:00
Guillaume Abrioux	62cd0bae54	rbd: fix missing keyring on nodes the rbd key was not pushed on rbd nodes because its keyring path was not added in `ceph_config_keys`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-21 09:56:37 +02:00
Guillaume Abrioux	295c1b0610	docker-common: fix ceph_health check `docker ps` will always return `0` (see: https://github.com/docker/cli/issues/538). Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-21 09:56:37 +02:00
Guillaume Abrioux	6c9f3a08a7	rgw: refact start_docker_rgw.yml remove usage of `shell` module in favor of `systemd` module. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-21 09:56:37 +02:00
Guillaume Abrioux	90c4066ce5	mgr: add missing admin key for mgr container Followup on #1761. Add missing admin key for mgr node in containerized deployment. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-21 09:56:37 +02:00
Sébastien Han	adf5017924	config: remove max open file This is only used by the old sysvinit scripts Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-20 23:06:01 +02:00
Sébastien Han	a4baed1025	config: no not generate osd section if bluestore This section is not needed when running a bluestore osd. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-20 18:14:48 +02:00
Sébastien Han	cb05172605	docker: we don't need to copy the ceph.conf on all the nodes We generate the ceph.conf on all the nodes through the ceph-docker-common so there is no need to push it to the Ansible file. Also this is breaking the ceph.conf template generation since we only generate sections based on the host the ansible task is running on. For example, what's typically happening, we bootstrap the monitor, we get a ceph.conf generated for a mon only, we go on an osd, we generate the ceph.conf with osd section (done by ceph-docker-common) but this gets overwritten by the copy_config task of the ceph-osd role. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-20 16:33:29 +02:00
Sébastien Han	7aab133617	Merge pull request #1920 from jprovaznik/ganesha Make ceph-nfs service enablement/start optional	2017-09-20 14:48:36 +02:00
Sébastien Han	a89363b0ae	Merge pull request #1926 from ceph/rhcs-debina common: fix rhcs debian install	2017-09-19 19:50:40 +02:00
Sébastien Han	75e77f5948	common: fix rhcs debian install Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1493231 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-19 19:49:44 +02:00
Ali Maredia	3ba1a68cf5	nfs: ganesha.conf template fixes - Change capitalization of config options to be in line with what config.txt in the nfs-ganesha tree says Signed-off-by: Ali Maredia <amaredia@redhat.com>	2017-09-19 12:45:24 -04:00
Sébastien Han	dd7f21bd92	common: fix rhcs installation and rgw package for nfs RHCS install wasn't working at all prior to this commit as the name of the include was pointing to a non-existing file. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1492056 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-19 12:12:22 +02:00
Sébastien Han	87e2dae9d8	Merge pull request #1919 from ceph/iscsi-check common: fix rhel check	2017-09-19 12:10:10 +02:00
Sébastien Han	ace97e8720	Merge pull request #1904 from ceph/name-include-fact name includes and set_fact for clarity	2017-09-19 12:09:25 +02:00
Jan Provaznik	8c510ab9f9	Make ceph-nfs service enablement/start optional When ceph-nfs service is managed by pacemaker, it's useful to not enable and start ceph-nfs service through systemd but let pacemaker to start the service in a next step.	2017-09-19 11:59:54 +02:00
Sébastien Han	dbe64f66f7	common: fix rhel check Looks like Ansible is now using "RedHat" instead of "Red Hat Enterprise Linux" Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-19 11:55:26 +02:00
Sébastien Han	773010ee49	Merge pull request #1911 from fghaas/1910 Introduce ceph_nfs_ceph_user	2017-09-19 10:03:46 +02:00
Florian Haas	ada2f147f5	Introduce ceph_nfs_ceph_user In analogy to ceph_nfs_rgw_user, we should be able to define a user with which the nfs-ganesha Ceph FSAL connects to the cluster. Introduce a ceph_nfs_ceph_user variable, setting its default to "admin" (which preserves the prior behavior of always connecting as client.admin). Fixes #1910.	2017-09-19 09:07:28 +02:00
Sébastien Han	d100b4e596	name includes and set_fact for clarity When Ansible is not run with verbose options it's difficult to see which include and/or set_fact does what. So adding a name for each clarifies. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-18 23:39:46 +02:00
Sébastien Han	66d41f342d	Merge pull request #1889 from ceph/client-containers client: ability to create keys and pool with no ceph binaries	2017-09-18 17:27:32 +02:00
Sébastien Han	2749368a2d	Merge pull request #1915 from ceph/state-leftover docker-common: re-introduce state for leftover files	2017-09-18 15:46:07 +02:00
Sébastien Han	aa5d94fc87	docker-common: re-introduce state for leftover files The variable "statleftover" was removed by commit `a60c74f61e` and never added back to the new playbook, yet it is still being referenced. Adding it back Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1492224 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-18 15:01:32 +02:00
Sébastien Han	85d73e3be2	client: ability to create keys and pool with no cpeh binaries On a container env, machines don't have any ceph binaries so we need to use a container to run the commands. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-18 14:41:52 +02:00
Sébastien Han	68a1390dc9	Merge pull request #1898 from ceph/restart-mon defaults: restart docker daemon higher delay	2017-09-15 06:23:51 -06:00
Sébastien Han	ed3003cf41	defaults: restart docker daemon higher delay Use default delay since the mon (in particular) can take more time to restart. Solves error with: STDERR: Error response from daemon: No such container: ceph-mon-mon0 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-14 13:38:11 -06:00
Sébastien Han	fb02b1d9d3	mon: create the mgr key for release >= luminous This fixes RHCS builds. We know which Ceph version we are running on. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-14 11:06:44 -06:00
Sébastien Han	6f0b1fe803	rgw: remove old variables Since the only support civetweb these variables are obsolete. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-14 09:42:50 -06:00
Sébastien Han	660893e70e	osd: add meaningful message for journal_size Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-13 23:49:15 -06:00
Sébastien Han	ef8d37dd0d	Merge pull request #1800 from ceph/wip-osd-start-fix ceph-osd: Fix osd start sequence	2017-09-13 17:20:10 -06:00
Sébastien Han	2f51f0de28	Merge pull request #1880 from ceph/wip-rgw-nfs nfs: configure RGW FSAL to start up correctly	2017-09-13 14:20:14 -06:00
Sébastien Han	f67b47d056	Merge pull request #1882 from ceph/multi-journal osd: drop support for device partition	2017-09-13 11:43:48 -06:00
Sébastien Han	ac62437609	Merge pull request #1883 from ceph/quick_refact osd: refact include of `activate_osds.yml`	2017-09-12 22:11:31 -06:00
Sébastien Han	c3866fc4bd	Merge pull request #1747 from ceph/add-iscsi resync ceph-iscsi-gw with old upstream	2017-09-13 02:06:50 +02:00
Sébastien Han	aa364264cd	resync ceph-iscsi-gw with old upstream Taken from https://github.com/pcuzner/ceph-iscsi-ansible/tree/tcmu-fixes Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1454945 and https://bugzilla.redhat.com/show_bug.cgi?id=1484083 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-12 18:06:10 -06:00
Sébastien Han	2a1b8a1997	Merge pull request #1884 from ceph/mon-container-ip mon: add support for monitor_address block for containers	2017-09-13 01:46:18 +02:00
Sébastien Han	fdf924401f	osd: drop support for device partition We have been struggling with this, it's still broken and breaking other things too now. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1490283 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-12 17:42:07 -06:00
Guillaume Abrioux	49ad8528e5	osd: refact include of `activate_osds.yml` remove duplicate code. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-12 16:53:11 -06:00
Sébastien Han	02ba65dbbe	mon: add support for monitor_address block for containers Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-12 16:28:08 -06:00
Sébastien Han	6b8ed0440e	Merge pull request #1761 from ceph/split_copy_keys docker: split the task 'copy ceph configs&keys'	2017-09-13 00:21:50 +02:00
Ali Maredia	52efe92a87	nfs: configure RGW FSAL to start up correctly - Add RGW keyring to nfs node - Add RGW section to ganesha.conf - Add RGW section to ceph.conf onf nfs node Signed-off-by: Ali Maredia <amaredia@redhat.com>	2017-09-12 16:27:16 -04:00
Guillaume Abrioux	20946f7220	ceph-osd: remove deprecated comment in sample file Since #1724 has been merged, this comment is deprecated Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-12 16:48:10 +02:00
Guillaume Abrioux	0f506f4f0a	Docker: split the task 'copy ceph configs&keys' All keys are copied to all nodes. This commit split that task in each roles so keys are copied to their respective nodes. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1488999 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-11 21:14:13 +02:00
Sébastien Han	2ea7f287fa	docker: simplify variable declaration Less configuration for the user, the container inherit from the global variables. No more container specific variables. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-09 01:22:06 +02:00
Sébastien Han	4767eaaab3	Merge pull request #1878 from ceph/add-rbd-mirror Add rbd mirror	2017-09-09 01:21:12 +02:00
Sébastien Han	7054615551	ci: deploy rbd mirror Deploy rbd mirorr in cluster scenario Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-09 01:17:10 +02:00
Sébastien Han	477f86e305	switch to container: fix ceph nfs The service is nfs-ganesha where ceph-nfs@{{ ansible_hostname }} will be the name of the container. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-08 22:43:50 +02:00
Sébastien Han	d46d453b83	Merge pull request #1780 from ceph/wip-rgw-nfs Wip RGW NFS	2017-09-08 19:26:02 +02:00
Guillaume Abrioux	b59e9cc732	Merge pull request #1871 from ceph/handler-collocate defaults: do not restart unconfigured (yet) daemons	2017-09-08 18:15:02 +02:00
Sébastien Han	a05c58ba37	Merge pull request #1874 from ceph/rbd-mirror-mem ceph-rbd-mirror; docker fix typo	2017-09-08 17:50:55 +02:00
Sébastien Han	7a93d88025	ceph-rbd-mirror; docker fix typo Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-08 17:47:48 +02:00
Ali Maredia	f8171e8b4a	nfs: rename host to have ceph- prefix Signed-off-by: Ali Maredia <amaredia@redhat.com>	2017-09-08 11:38:05 -04:00
Ali Maredia	f3e2235b3a	nfs-ganesha: add config overrides section Signed-off-by: Ali Maredia <amaredia@redhat.com>	2017-09-08 11:37:58 -04:00
Sébastien Han	d53f55e807	Merge pull request #1870 from Logan2211/omit-default-release Omit the apt default_release if it is not needed	2017-09-08 16:55:03 +02:00
Guillaume Abrioux	44fd928e23	mds: rename mds_socket fact Rename this fact to keep consistency with handlers in `ceph-defaults`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-08 15:57:58 +02:00
Ali Maredia	55724c6e93	nfs-ganesha: add dev, stable, and rhcs nfs-ganesha's for ceph-nfs role Signed-off-by: Ali Maredia <amaredia@redhat.com>	2017-09-08 09:13:20 -04:00
Sébastien Han	12f6e53090	defaults: do not restart unconfigured (yet) daemons In a collocated scenario, where you might put a rgw, a mds and a mon on the same node you don't want the handler blindly restart all the daemons on the node. Indeed some of them might not be configured yet. Implementing a more precise socket detection, for each daemon type. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1488813 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-08 12:02:37 +02:00
Logan V	d8cb62c981	Omit the apt default_release if it is not needed The apt module will fail to downgrade packages properly when defualt release is unnecessarily defined. Closes #1869	2017-09-07 11:50:57 -05:00
Sébastien Han	3753e6cfa7	ceph-osd: fix autodetection activation Prior to this patch this activation sequence for autodetection was always skipped because we were asking to activate on device without partitions, which doesn't make sense. We also fix the way we lookup for a device, since the data partition is always numbered 1, we take the min element of the dict. Closes: https://github.com/ceph/ceph-ansible/issues/1782 Signed-off-by: Sébastien Han <seb@redhat.com> Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-07 17:47:37 +02:00
Sébastien Han	27b3f9a7d4	Merge pull request #1850 from fultonj/issue/1848 Add option to create client keyring file but not import it	2017-09-07 13:51:11 +02:00
Sébastien Han	cf88c136f5	Merge pull request #1859 from ceph/container-limit container: introduce resource limitation for containers	2017-09-07 12:51:34 +02:00
Sébastien Han	d2032c92af	Merge pull request #1862 from ceph/fail-ansible fail if ansible version < 2.3	2017-09-07 08:44:01 +02:00
Sébastien Han	fc3300ea4f	fail if ansible version < 2.3 We only test and support 2.3.x at the moment. Closes: https://github.com/ceph/ceph-ansible/issues/1858 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-07 07:53:17 +02:00
John Fulton	a57f61efd9	Add option to create client keyring file but not import it Add new boolean parameter for client config create_key_file_only with a default of false. When create_key_file_only is true, the client tasks to connect to the external ceph cluster to verify the key `ceph auth import` the key are skipped. Fixes: #1848	2017-09-06 13:56:06 +00:00
Sébastien Han	2fa151b9e8	container: introduce resource limitation for containers This can be controlled via 2 options: * ceph_$DAEMON_docker_memory_limit * ceph_$DAEMON_docker_cpu_limit All daemons default to 1GB for memory and 1 CPU by default. Recommendations from: https://access.redhat.com/documentation/en-us/red_hat_ceph_storage/2/html/red_hat_ceph_storage_hardware_guide/minimum_recommendations Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-06 14:52:21 +02:00
Sébastien Han	b7db600caa	switch-from-non-containerized-to-containerized: mask unit files We must mask the image so we are sure that even if the system reboots then the OSDs won't start. Also remove Ceph udev rules if found on the system prior to deploy containers. If we don't do this we are exposed to conflicts between udev rules and sytemd unit files. Also add the CI will now test the migration from a non-containerized cluster to a containerized cluster. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-05 15:20:31 +02:00
Sébastien Han	b6c1a0c68f	Merge pull request #1853 from ceph/fix-prepare ceph-osd: do not re-prepare if already prepared	2017-09-05 13:59:40 +02:00
Sébastien Han	5ed1a91aeb	Merge pull request #1819 from ceph/no-container-log ceph-docker-common: do not log inside the container	2017-09-05 11:47:11 +02:00
Sébastien Han	1dd976d28e	ceph-osd: do not re-prepare if alreadyy prepared I forgot to re-add the partition check while refactoring the osd Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-05 09:51:57 +02:00
Sébastien Han	23a0c26c4f	client: do not copy admin key by default Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-02 00:54:17 +02:00
Sébastien Han	58f664fd17	ceph-rgw: fix systemd unit layout Signed-off-by: Sébastien Han <seb@redhat.com>	2017-09-01 19:02:48 +02:00
Sébastien Han	967e875fd0	Merge pull request #1827 from andymcc/rgw_systemd_fix Fix RGW systemd directory	2017-09-01 18:12:23 +02:00
Alfredo Deza	98d107cebb	common do not filter by distro when dev is set for figuring out ceph_release Signed-off-by: Alfredo Deza <adeza@redhat.com>	2017-08-31 16:18:08 -04:00
Sébastien Han	673938ec96	Merge pull request #1839 from ceph/colonwq-update-docker-rgw-exec Update ceph_rgw_docker_extra_env to add bind ip	2017-08-31 19:47:16 +02:00
Sébastien Han	ea9b6395cb	Merge pull request #1838 from ceph/rgw-units Rgw units	2017-08-31 19:38:23 +02:00
Andrew Schoen	29df79e54e	Merge pull request #1841 from ceph/lvm-partitions lvm-osds: test with a partition and an lv as journals	2017-08-31 12:09:19 -05:00
Sébastien Han	3dd47a45cb	ceph-defaults: fix handlers for mds and rgw The way we handle the restart for both mds and rgw is not ideal, it will try to restart the daemon on the host that don't run the daemon, resulting in a service file being created (see bug description). Now we restart each daemon precisely and in a serialized fashion. Note: the current implementation does NOT support multiple mds or rgw on the same node. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1469781 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-31 19:02:21 +02:00
Sébastien Han	7ee1f88ee5	ceph-common: remove useless changed task There is no need to show a "changed" at the end of the play for a "command" module task. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-31 18:27:43 +02:00
Keith Schincke	eaccc12797	Update ceph_rgw_docker_extra_env to add bind ip This patch adds passing the RGW_CIVETWEB_IP to the docker container. This IP defaults to the value of radosgw_civetweb_bind_ip. radosgw_civetweb_bind_ip default to ipv4.default Without this value, the RGW containter will bind to 0.0.0.0	2017-08-31 15:50:34 +02:00
Sébastien Han	e581539e20	ceph-rgw: do not run a privileged rgw container There is no need for a privileged rgw container Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-31 15:50:15 +02:00
Sébastien Han	7ccd10a15e	rgw: cleanup old code and remove systemd condition Remove the old check prior systemd. We only support systemd so there is no need to run a condition on systemd. The playbook will fail if systemd is not present. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-31 08:29:30 +02:00
Andrew Schoen	fcba9d17f0	ceph-osd: add support for --journal vg/lv for lvm osds This also updates the tests Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-08-30 15:55:16 -05:00
Alfredo Deza	da90edce3e	common dev repos should not need to specify a 'release' Signed-off-by: Alfredo Deza <adeza@redhat.com>	2017-08-30 13:37:24 -04:00
Alfredo Deza	6565c38238	common: ceph_repository should not be rhcs or dev Signed-off-by: Alfredo Deza <adeza@redhat.com>	2017-08-30 13:33:04 -04:00
Alfredo Deza	8fd2bf7e2c	common: use the value of ceph_repository in the error message Signed-off-by: Alfredo Deza <adeza@redhat.com>	2017-08-30 13:33:04 -04:00
Sébastien Han	13aac5027a	Merge pull request #1741 from ceph/refactor-installation common: refactor installation method	2017-08-30 17:42:29 +02:00
Sébastien Han	b05271f464	Merge pull request #1724 from ceph/container-multi-journal osd: allow multi dedicated journals for containers	2017-08-30 17:41:42 +02:00
Sébastien Han	a60c74f61e	ceph-docker-common: re-organize stat ceph file Use a single file to run the checks instead of duplicating code. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-30 14:44:34 +02:00
Sébastien Han	e0a264c7e9	osd: allow multi dedicated journals for containers Fix: https://bugzilla.redhat.com/show_bug.cgi?id=1475820 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-30 12:34:06 +02:00
Sébastien Han	ae2fd45994	common: refactor installation method The installation process is now described as follow: * you still have to choose a 'ceph_origin' installation method. The origin can be a 'repository' (add a new repository), distro (it will use the packages provided by the native repo source of your distribution), local (only available on redhat system, it installs locally built packages). This option is not well tested, so use it carefully * if ceph_origin == 'repository' you will have to decide what kind of repository you want to enable: - community: corresponds to the stable upstream/community version - enterprise: corresponds to the stable enterprise/downstream version (basically you are a red hat customer) - dev: it will install ceph from packages built out of the github development branches Signed-off-by: Sébastien Han <seb@redhat.com> Co-Authored-by: Guillaume Abrioux <gabrioux@redhat.com> Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-08-30 10:52:01 +02:00
Andy McCrae	a9d91c3d69	Fix RGW systemd directory The ceph RGW systemd services are actually named "ceph-radosgw" and not "ceph-rgw", this patch fixes that for the systemd overrides file.	2017-08-29 17:24:52 +01:00
Sébastien Han	5743916092	common: add mimic release facts Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-29 17:21:37 +02:00
Sébastien Han	fa9f2313d5	Merge pull request #1822 from ceph/rhcs-container-release ceph-docker-common: detect ceph version	2017-08-29 12:16:20 +02:00
Sébastien Han	d0515cb704	Merge pull request #1825 from ceph/fix-item ceph-docker-common: fix empty array	2017-08-29 12:15:46 +02:00
Sébastien Han	b3e5206289	Merge pull request #1814 from ceph/handler-defaults handler: default to empty array if task skipped	2017-08-29 11:09:35 +02:00
Sébastien Han	cfddd2903c	ceph-docker-common: fix empty array The list can not be evaluated properly if it containers '[]', which is the case when using the filter "default([])". To fix this, we have to properly merge the lists. This is fixing the issue: "list object has no element 1" Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-29 10:25:46 +02:00
Sébastien Han	764e697186	ceph-docker-common: detect ceph version By detecting the ceph version running in the container we can easily apply conditions like: ceph_release_num.{{ ceph_release }} >= ceph_release_num.luminous We do that already, in ceph-docker-common/tasks/fetch_configs.yml. This fixes the error: TASK [ceph-docker-common : register rbd bootstrap key] ****************************************************** fatal: [magna005]: FAILED! => {"failed": true, "msg": "The conditional check 'ceph_release_num.{{ ceph_release }} >= ceph_release_num.luminous' failed. The error was: error while evaluating conditional (ceph_release_num.{{ ceph_release }} >= ceph_release_num.luminous): 'dict object' has no attribute 'dummy'\n\nThe error appears to have been in '/home/ubuntu/ceph-ansible/roles/ceph-docker-common/tasks/fetch_configs.yml': line 2, column 3, but may\nbe elsewhere in the file depending on the exact syntax problem.\n\nThe offending line appears to be:\n\n---\n- name: register rbd bootstrap key\n ^ here\n"} Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1486062 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-28 23:28:47 +02:00
Sébastien Han	aa69c2c007	ceph-docker-common: do not log inside the container Logging inside the container is not useful since it writes to the overlayfs partition, resulting in potential performance degradation on the container. If you need to check the logs, just look at journald. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-28 12:04:49 +02:00
Sébastien Han	29753da05c	handler: default to empty array if task skipped with_items is evaluated before the when condition so if the task that registers the 'results' is skipped the task will fail with: {"failed": true, "msg": "'dict object' has no attribute 'results'"} Defaulting to an empty array fixes the issue. Reverts: `abdd66619e` Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1482061 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-25 18:39:00 +02:00
Sébastien Han	972eb45d31	ceph-docker-common: apply 0600 to key permissions Keys should only be readable and writable by their respective owners and that's all. Closes: https://github.com/ceph/ceph-ansible/issues/1760 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-25 18:14:28 +02:00
Boris Ranto	5f1b8fcd75	ceph-osd: Fix osd start sequence The script can fail to get the osd id because the osds are activated by udev and it can take a while for them to activate. This commit fixes that by trying to get all the osds per node in a loop. This commit also makes the osd services enabled so that they are available after reboot. Signed-off-by: Boris Ranto <branto@redhat.com>	2017-08-25 13:40:04 +02:00
Sébastien Han	1f4082f200	update meta for ansible galaxy Closes: https://github.com/ceph/ceph-ansible/issues/1637 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-25 00:05:44 +02:00
Sébastien Han	aee8267be4	Merge pull request #1808 from ceph/role-path ceph-mon: detect ANSIBLE_ROLES_PATH if present	2017-08-24 23:49:41 +02:00
Andrew Schoen	910bb036c6	ceph-config: when using local_action set become: false There should be no need to use sudo when writing or using these files. It creates an issue when the user running ansible-playbook does not have sudo privs. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-08-24 10:07:03 -05:00
Sébastien Han	76ac9b077b	ceph-mon: detect ANSIBLE_ROLES_PATH if present Some deployments can't copy infrastructure playbooks outside of the infrastructure-playbooks directory. Thus they use ANSIBLE_ROLES_PATH to overcome this. However some roles have 'playbook_dir' hardcoded, which results in wrong path since the execution comes from infrastructure-playbooks. Basically the role triggered by a playbook from infrastructure-playbooks believes that the roles are in infrastructure-playbooks/roles. This commit fixes that. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-24 16:19:39 +02:00
Andrew Schoen	d0a3034857	ceph-config: write ceph_conf_overrides_temp to fetch_directory because /tmp is not always writable, but we can assume that the fetch_directory will be Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-08-24 11:33:03 +02:00
Sébastien Han	80dc5eead7	ceph-config: add missing meta and files for the galaxy Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-24 11:33:03 +02:00
Guillaume Abrioux	539197a2fc	Introduce new role ceph-config. This will give us more flexibility and the possibility to deploy a client node for an external ceph-cluster. related BZ: https://bugzilla.redhat.com/show_bug.cgi?id=1469426 Fixes: #1670 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-08-24 11:33:03 +02:00
Sébastien Han	6d894e556c	ceph-mon: remove hardcoded ipv4 in containers Before this commit we were forcing ipv4 which might not be available. Now setting ip_version to ipv4 or ipv6 will give you the right support. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1484189 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-24 11:33:02 +02:00
Andrew Schoen	758c31b1cd	ceph-osd: ceph-volume requires --data to be in vg/lv format Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-08-23 13:43:31 -05:00
Alfredo Deza	e651469a2a	Merge pull request #1797 from ceph/purge-lvm adds purge support for the lvm_osds osd scenario	2017-08-23 14:28:29 -04:00
Sébastien Han	f2499ff5ac	Merge pull request #1788 from ceph/improve-switch switch-from-non-containerized-to-containerized: simplify	2017-08-23 19:47:26 +02:00
Sébastien Han	4f0ecb7f30	switch-from-non-containerized-to-containerized: simplify This commit eases the use of the infrastructure-playbooks/switch-from-non-containerized-to-containerized-ceph-daemons.yml playbook. We basically run it with a couple of pre-tasks and then we let the playbook run the docker roles. It obviously expect to have proper variables configured in order to work. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-08-23 18:39:45 +02:00
Andrew Schoen	594d5e017a	ceph-osd: restructure lvm_volumes variable for more flexiblity The lvm_volumes variable is now a list of dictionaries that represent each OSD you'd like to deploy using ceph-volume. Each dictionary must have the following keys: data, journal and data_vg. Each dictionary also can optionaly provide a journal_vg key. The 'data' key represents the lv name used for the OSD and the 'data_vg' key is the vg name that the given lv resides on. The 'journal' key is either an lv, device or partition. The 'journal_vg' key is optional and must be the vg name for the journal lv if given. This key is mainly used for purging of the journal lv if purge-cluster.yml is run. For example: lvm_volumes: - data: data_lv1 journal: journal_lv1 data_vg: vg1 journal_vg: vg2 - data: data_lv2 journal: /dev/sdc data_vg: vg1 Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-08-23 10:14:14 -05:00
Sébastien Han	d9b3d4a981	Merge pull request #1731 from SirishaGuduru/rgw-civetwebIP-conf Common: changed civetweb line in rgw section(conf)	2017-08-23 15:33:08 +02:00
Sébastien Han	e0c43ccc53	Merge pull request #1784 from ceph/fix-restart-osd-container ceph-defaults: fix handler for osd container	2017-08-23 12:40:01 +02:00
SirishaGuduru	1359869497	Common: changed civetweb line in rgw section(conf) Resolves issue: Multiple RGW Ceph.conf Issue #1258 In multi-RGW setup, in ceph.conf the RGW sections contain identical bind IP in civetweb line. So this modification fixes that issue and puts the right IP for each RGW. Signed-off-by: SirishaGuduru SGuduru@walmartlabs.com Modified ceph-defaults and ran generate_group_vars_sample.sh group_vars/osds.yml.sample and group_vars/rhcs.yml.sample are not part of the changes. But they got modified when generate_group_vars_sample.sh is ran to generate group_vars/ all.yml.sample. Uncommented added variables in ceph-defaults Updated tests by adding value for radosgw_interface Added radosgw_interface to centos cluster tests Modified ceph-rgw role,rebased and ran generate_group_vars_sample.sh In ceph-rgw role removed check_mandatory_vars.yml. Rebased on master. Ran generate_group_vars_sample.sh and then the below files got modified.	2017-08-23 15:03:37 +05:30
Jason Dillaman	b70d54ac80	rbd-mirror should use per-host user id keyring The rbd-mirror daemon will be HA under luminous and new daemon health features require a way to uniquely identify rbd-mirror instances. Signed-off-by: Jason Dillaman <dillaman@redhat.com>	2017-08-22 18:55:29 -04:00
Jason Dillaman	70c2b934ca	distribute rbd bootstrap key if available Signed-off-by: Jason Dillaman <dillaman@redhat.com>	2017-08-22 18:55:29 -04:00
Sébastien Han	07821d9bb1	Merge pull request #1786 from ceph/re-arrange-skipped mon, osd: fix skipped condition	2017-08-22 19:44:48 +02:00

... 7 8 9 10 11 ...

2137 Commits (33c09af2507101289e8524c91f2b48c5adbdf904)