If there are no services on the cluster, then the 'rgw' attribute could be
missing and the task fails with the following error:
msg": "The task includes an option with an undefined variable.
The error was: 'dict object' has no attribute 'rgw'
We fix this by checking the existence of the 'rgw' attribute. If it's
missing, we skip the task since the role already contains code to set
a good default rgw_hostname.
Signed-off-by: Markos Chandras <mchandras@suse.de>
Since commit f422efb1d6 ("config: ensure
rgw section has the correct name") we observe the following failures in
new Ceph deployments with OpenStack-Ansible:
fatal: [aio1_ceph-rgw_container-fc588f0a]: FAILED! => {"changed": false,
"cmd": "ceph --cluster ceph -s -f json", "msg": "[Errno 2] No such file
or directory"
This is because the task executes 'ceph' but at this point no package
installation has happened. Packages are normally installed in the
'ceph-common' role, which runs after the 'ceph-defaults' one.
Since we are looking to obtain cluster information, the task should be
delegated to a monitor node, as other tasks in that role already are.
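A sketch of such a delegation, assuming the usual `mon_group_name` group variable; only the `delegate_to` line is the point here:
```
- name: check if the ceph cluster is already running
  command: "ceph --cluster {{ cluster }} -s -f json"
  register: ceph_current_status
  changed_when: false
  delegate_to: "{{ groups[mon_group_name][0] }}"
```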
Signed-off-by: Markos Chandras <mchandras@suse.de>
Follow up on 36942af698
"disabled_modules" is always a list, it's the items in the list that
can be dicts in mimic. Many ways to fix this, here's one.
Signed-off-by: Dardo D Kleiner <dardokleiner@gmail.com>
This reverts commit e84f11e99e.
This commit introduced a new failure later during the rolling_update
process. Basically, it was modifying the list of devices and started
impacting the ceph-osd role itself. The modification to accommodate the
osd_auto_discovery parameter should happen outside of ceph-osd.
Also, we are trying not to play the ceph-osd role during the rolling_update
process so we can speed up the upgrade.
Signed-off-by: Sébastien Han <seb@redhat.com>
The fqdn configuration possibility caused a lot of trouble; it adds a
lot of complexity because of the multiple cases and the relation between
ceph-ansible and ceph-container. Moreover, there is no benefit to such
a feature.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1613155
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
the ceph.conf.j2 template always assumes the hostname used to register the
radosgw in the servicemap is equivalent to `{{ ansible_hostname }}`,
which returns the shortname form.
We need to detect which form of the hostname was used in the case of an
already deployed cluster and update the ceph.conf accordingly.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1580408
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
there is no need to have all these conditions.
for instance, assuming `mds_group_name` is set to 'mdss':
- `if groups[mds_group_name] is defined` checks if `'mdss'` is present in `{{ groups }}`
- `if {{ mds_group_name }} in group_names` checks if the current node is part
of the group `'mdss'`
- `if inventory_hostname in groups.get(mds_group_name, [])` checks if
the current node is part of the group 'mdss'
The third condition is enough to cover the need of ensuring we are
running on an mds node.
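For illustration, a task guarded by that single condition would look like this (the task and file names are arbitrary):
```
- name: run mds-only tasks
  include_tasks: mds_tasks.yml
  when: inventory_hostname in groups.get(mds_group_name, [])
```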
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
If calamari is already installed and ceph has been upgraded to a higher
version, the initialisation will fail later on. So if we detect that the
calamari-server is too old compared to ceph_rhcs_version, we try to update
it.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1601755
Signed-off-by: Sébastien Han <seb@redhat.com>
rolling_update relies on the list of devices when performing the restart
of the OSDs. The task that builds the devices list out of the
ansible_devices dict only runs when there are no partitions on the
drives. However, during an upgrade the OSDs are already configured: they
have been prepared and have partitions, so this task won't run and thus
the devices list will be empty, skipping the restart during
rolling_update. We now run the same task under different requirements
when rolling_update is true and build a list when:
* osd_auto_discovery is true
* rolling_update is true
* ansible_devices exists
* no dm/lv are part of the discovery
* the device is not removable
* the device has more than 1 sector
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1613626
Signed-off-by: Sébastien Han <seb@redhat.com>
This is used with the lvm osd scenario. When using devices you need the
option to set the crush device class for all of the OSDs that are
created from those devices.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
This adds the action 'batch' to the ceph-volume module so that we can
run the new 'ceph-volume lvm batch' subcommand. A functional test is
also included.
If devices is defined and osd_scenario is lvm then the 'ceph-volume lvm
batch' command will be used to create the OSDs.
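A hypothetical invocation of the new action; the parameter names below follow the commit description and may not match the module's actual interface:
```
- name: create OSDs with ceph-volume lvm batch
  ceph_volume:
    cluster: "{{ cluster }}"
    objectstore: bluestore
    action: batch
    batch_devices: "{{ devices }}"
    crush_device_class: "{{ crush_device_class | default(omit) }}"
```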
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Since the container now simply reads the ceph.conf, we remove all the
unnecessary options.
Also this PR is the foundation to support multiple backends, such as the
new 'beast' backend from Ceph Mimic.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1582411
Signed-off-by: Sébastien Han <seb@redhat.com>
The include does not need a condition on containerized_deployment since
we are already in an include that has the same condition.
Signed-off-by: Sébastien Han <seb@redhat.com>
In environments where we wish to have manual/greater control over
how the bootstrap keyrings are used, we need to be able to externally
define what the mgr keyring secret will be and have ceph-ansible
use it, instead of it being autogenerated.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1610213
Signed-off-by: Graeme Gillies <ggillies@akamai.com>
restart_osd_daemon.sh is used to discover and restart all OSDs on a
host. To do that, the script loops over the list of ceph-osd@ services in the
system. This commit fixes a bug in the regular expression responsible for
extracting the OSD ids: the prior version used the expression `[0-9]{1,2}`,
which ignores all OSDs whose numbers are greater than 99 (thus
longer than 2 digits). The fix removes the upper limit on the number of digits.
This problem existed in two places in the script.
Closes: #2964
Signed-off-by: Artur Fijalkowski <artur.fijalkowski@ing.com>
This commit ensures we are backward compatible with fqdn deployments.
Since ceph-container enforces deployment to be done with shortname, we
must keep backward compatibility with clusters already deployed with an
fqdn configuration.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This was introduced by
59ee2e8d3b
and made our socket checks impossible to run. The PID could be found,
but the cctid could not.
This happens during an upgrade to mimic and on clusters already running mimic.
So let's force the admin socket back to the way it was so we can properly check
for existing instances. Also, the line $cluster-$name.$pid.$cctid.asok
is only needed when running multiple instances of the same daemon,
something ceph-ansible cannot do at the time of writing.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1610220
Signed-off-by: Sébastien Han <seb@redhat.com>
Instead of failing the entire purge operation when the rbd command fails
just log an error. This will allow the higher level target and config
cleanup to complete, and the user only has to manually delete the rbd
images.
Signed-off-by: Mike Christie <mchristi@redhat.com>
We were not passing the ceph conf info into the rbd image removal
command, so if the clustername was not the default, the igw purge would fail
because the rbd rm command failed.
This just fixes the bug by passing in the ceph conf info, which has the
clustername to use.
This fixes Red Hat bugzilla:
https://bugzilla.redhat.com/show_bug.cgi?id=1601949
Signed-off-by: Mike Christie <mchristi@redhat.com>
The container runs with --rm which means it will be deleted by Docker
when exiting. Also 'docker rm -f' is not idempotent and returns 1 if the
container does not exist.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1609007
Signed-off-by: Sébastien Han <seb@redhat.com>
rbd-mirror can't start when deploying jewel because it needs the admin
keyring.
Bringing back this task restores backward compatibility for jewel
deployments.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
If we want to be backward compatible with releases prior to luminous, we
have to set the rule name according to the default values used in jewel.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This task would be run on both containerized *and* non-containerized
deployments.
Let's have a proper title to avoid confusion.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
In containerized deployments we now inherit the
radosgw_civetweb_options when bootstrapping the container.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1582411
Signed-off-by: Sébastien Han <seb@redhat.com>
When distributing ceph-nfs role, creation of rados index object
fails as it assumes availability of client.admin locally.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1607970
Signed-off-by: Giulio Fidente <gfidente@redhat.com>
Check if the interface provided:
* exists in the gathered facts (thus on the system)
* is active
* has an IP address (depending on ip_version)
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1600227
Signed-off-by: Sébastien Han <seb@redhat.com>
Ansible 2.4 is currently end-of-life.
Ansible 2.5 will go end-of-life after Ansible 2.7 is released.
Fixes: #2901
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Do not run device validation on every hosts, only on OSD nodes.
Signed-off-by: Sébastien Han <seb@redhat.com>
Co-authored-by: Guillaume Abrioux <gabrioux@redhat.com>
We now make sure that:
* devices are actually block special files
* the length of dedicated_devices is identical to that of devices
Signed-off-by: Sébastien Han <seb@redhat.com>
Since `V2.6-stable` is available and has packages for `mimic`, let's
update this default value accordingly so nfs nodes can be deployed with
mimic.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Relying on `copy_admin_key` to import the created keys on client nodes forces
us to copy the admin key to those nodes, which is not something we might
want.
We should use the fact `condition_copy_admin_key`, which will be set to
`True` when the delegated node is a mon, which means we can import keys
without having to take care of the admin keyring.
Fixes: #2867
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Follow up on #2784
We must check the generated fact `_disabled_ceph_mgr_modules` to
enable a disabled mgr module.
Otherwise, this task will be skipped because it's not comparing the
right list.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1600155
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
On containerized deployments, if a mon is stopped, the socket is not
purged and can cause a failure when a cluster is redeployed after the
purge playbook has been run.
Typical error:
```
fatal: [osd0]: FAILED! => {}
MSG:
'dict object' has no attribute 'osd_pool_default_pg_num'
```
the fact is not set because of this earlier failure:
```
ok: [mon0] => {
"changed": false,
"cmd": "docker exec ceph-mon-mon0 ceph --cluster test daemon mon.mon0 config get osd_pool_default_pg_num",
"delta": "0:00:00.217382",
"end": "2018-07-09 22:25:53.155969",
"failed_when_result": false,
"rc": 22,
"start": "2018-07-09 22:25:52.938587"
}
STDERR:
admin_socket: exception getting command descriptions: [Errno 111] Connection refused
MSG:
non-zero return code
```
This failure happens when the ceph-mon service is stopped: since
the socket isn't purged, it's a leftover which confuses the process.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
When you delete a zone without removing it from the zonegroup, the period update
fails since that command needs to load the zone and zonegroup to be able to
update the master. The period update fails with an error like this:
radosgw-admin period update --commit
-1 Cannot find zone id= (name=), switching to local zonegroup configuration
-1 Cannot find zone id= (name=)
Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
As of Kraken, the journal code does not use the hdparm command anymore
so we can remove it from our package dependency list.
Fixes: https://github.com/ceph/ceph-ansible/issues/1402
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit f6910efa24389c264062963b2054c7cd29ffebb3)
The container image recently merged both cluster and mon log into a
single stream. Following this, we now see this warning coming from the
container image:
2018-06-19 13:44:01.542990 7ff75b024700 1 mon.vm02@1(peon).log
v57928205 unable to write to '/var/log/ceph/ceph.log' for channel
'cluster': (2) No such file or directory
So we now tell the mon not to log the cluster log on the filesystem.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1591771
Signed-off-by: Sébastien Han <seb@redhat.com>
We forgot to add mgr_group_name when checking for the mon repo, thus the
conditional on the next task was failing.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1598185
Signed-off-by: Sébastien Han <seb@redhat.com>
The data structure has slightly changed on mimic.
Prior to mimic, it used to be:
```
{
"enabled_modules": [
"status"
],
"disabled_modules": [
"balancer",
"dashboard",
"influx",
"localpool",
"prometheus",
"restful",
"selftest",
"zabbix"
]
}
```
From mimic it looks like this:
```
{
"enabled_modules": [
"status"
],
"disabled_modules": [
{
"name": "balancer",
"can_run": true,
"error_string": ""
},
{
"name": "dashboard",
"can_run": true,
"error_string": ""
}
]
}
```
This means we can't simply check whether `item` is in
`_ceph_mgr_modules.disabled_modules`.
The idea here is to use the `map(attribute='name')` filter to build a list
of module names when deploying mimic.
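A minimal sketch of that filter in use; the fact names are the ones mentioned above, while the release check is illustrative:
```
- name: build list of disabled mgr module names (mimic and later)
  set_fact:
    _disabled_ceph_mgr_modules: "{{ _ceph_mgr_modules.disabled_modules | map(attribute='name') | list }}"
  when: ceph_release_num[ceph_release] >= ceph_release_num['mimic']
```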
Fixes: #2766
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The container runs for 300 sec, then dies and removes itself thanks to
the '--rm' option, so there is no point in removing it. Also, this was
causing failures under some circumstances.
Closing: https://bugzilla.redhat.com/show_bug.cgi?id=1568157
Signed-off-by: Sébastien Han <seb@redhat.com>
We now add a default 'rbd' application type to each pool we create. This
will remove the warning: " application not enabled on N pool(s) "
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1590275
Signed-off-by: Sébastien Han <seb@redhat.com>
The script ceph-osd-run.sh holds the config options to start the
container; if one of these options is modified we must restart the
container. This was not the case before because the 'notify' flag
wasn't present.
Closing: https://bugzilla.redhat.com/show_bug.cgi?id=1596061
Signed-off-by: Sébastien Han <seb@redhat.com>
When using a module there is no need to apply this Ansible option. The
module will handle the idempotency on its own. So the module decides
whether or not the task has changed during the execution.
Signed-off-by: Sébastien Han <seb@redhat.com>
keyring files in /etc/ceph. The default value is the same as it was (0600),
but this variable allows the user to override it (e.g. set it to 0640).
Signed-off-by: George Shuklin <george.shuklin@gmail.com>
During 226f80c22b only Debian package
installs had the correct state set to ensure packages were upgraded when
the "upgrade_ceph_packages" var was set to true.
Signed-off-by: Andy McCrae <andy.mccrae@gmail.com>
--net=host was hardcoded in the startup line so even though
mon_docker_net_host was set to False the net option would always be
activated.
mon_docker_net_host is set to True by default so this commit does not
change the behaviour.
Signed-off-by: Sébastien Han <seb@redhat.com>
Depending on your setup, ceph-mgr might get restarted multiple times.
When this is done too fast, systemd will prevent further restarts because of
the limits configured in the ceph-mgr systemd unit file.
Resetting the failure count will prevent this problem. The reset is done before
the restart so in case of a real problem during the restart it still fails.
Fixes: #2768
Signed-off-by: Christian Zunker <christian.zunker@codecentric.cloud>
Currently, if configure_firewall is set to True, we expect firewalld to be
enabled and running. Let's enforce that.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1589146
Signed-off-by: Sébastien Han <seb@redhat.com>
As discussed with the cores, the current limits are too low and should
be bumped to higher values.
So now by default monitors get 3GB and OSDs get 5GB.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1591876
Signed-off-by: Sébastien Han <seb@redhat.com>
The 'dummy' container is created only on the first client node, which means we
must seek to destroy this container only on that node, otherwise this
can cause a failure like the following:
```
fatal: [192.168.24.8]: FAILED! => {"changed": false, "cmd": ["docker", "rm",
"-f", "ceph-create-keys"], "delta": "0:00:00.023692", "end": "2018-06-12
20:56:07.261278", "msg": "non-zero return code", "rc": 1, "start":
"2018-06-12 20:56:07.237586", "stderr": "Error response from daemon: No such
container: ceph-create-keys", "stderr_lines": ["Error response from daemon: No
such container: ceph-create-keys"], "stdout": "", "stdout_lines": []}
```
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1590746
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Prior to this patch, if you were running on a Red Hat system,
ceph-ansible would try to configure firewalld for you without the
operator's consent.
Now you can enable or disable the fw configuration by setting
configure_firewall to either true or false.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1589146
Signed-off-by: Sébastien Han <seb@redhat.com>
The current secure cluster play runs with all the monitors. The rerun
of this task is unnecessary and can be skipped.
Fixes: #2737
Signed-off-by: Vishal Kanaujia <vishal.kanaujia@flipkart.com>
combining `run_once: true` with `inventory_hostname ==
groups.get(client_group_name) | first` might cause a bug when the only
node being run is not the first in the group.
In a deployment with a single client node it might cause an issue because
sometimes the keyring won't be created since the task could be definitively
skipped.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1588093
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Let's try to avoid using dashes, as testinfra needs to be able to read
the groups.
Typically, with iscsi-gws we can't add a marker for these iscsi nodes;
using an underscore fixes the issue.
Signed-off-by: Sébastien Han <seb@redhat.com>
We now have the ability to deploy a containerized version of ceph-iscsi.
The result is similar to the non-containerized version, you simply have
3 containers running for the following services:
* rbd-target-api
* rbd-target-gw
* tcmu-runner
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1508144
Signed-off-by: Sébastien Han <seb@redhat.com>
Potential error if someone doesn't pass the mode in the `keys` dict for
client nodes:
```
fatal: [client2]: FAILED! => {}
MSG:
The task includes an option with an undefined variable. The error was: 'dict object' has no attribute 'mode'
The error appears to have been in '/home/guits/ceph-ansible/roles/ceph-client/tasks/create_users_keys.yml': line 117, column 3, but may
be elsewhere in the file depending on the exact syntax problem.
The offending line appears to be:
- name: get client cephx keys
^ here
exception type: <class 'ansible.errors.AnsibleUndefinedVariable'>
exception: 'dict object' has no attribute 'mode'
```
Adding a default value will avoid the deployment failing on this.
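For example, the optional key can be guarded with a Jinja `default` filter; the surrounding task is only a sketch and its fields are illustrative, the `default` usage is the point:
```
- name: copy client cephx keyring
  copy:
    src: "{{ item.src }}"
    dest: "{{ item.dest }}"
    mode: "{{ item.mode | default('0600') }}"
  with_items: "{{ keys }}"
```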
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Functional tests are broken when testing against 'dev' release (ceph).
Adding a dummy value here will make it possible to run ceph-ansible CI
against dev ceph release.
Typical error:
```
> if request.node.get_marker("from_luminous") and ceph_release_num[ceph_stable_release] < ceph_release_num['luminous']:
E KeyError: 'dev'
```
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit fd1487d93f21b609a637053f5b33cd2a4e408d00)
We need to do this because on dev or rhcs installs ceph_stable_release
is not mandatory and the firewall check tasks have a task that is
conditional based on the installed version of ceph. If we perform those
checks after package install then they will not fail on dev or rhcs
installs.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
the `docker_exec_cmd` fact set in the client role when there is no monitor
in the inventory is wrong; `ceph-client-{{ hostname }}` is never created so
it will fail anyway.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
When configuring openstack, the created keyrings aren't copied over to
all monitor nodes.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1588093
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Refact of 8704144e31
There is no need to have duplicated tasks for this. The rgw pools
creation should be delegated to a monitor node so we don't have to care
whether the admin keyring is present on the rgw node.
By the way, only one task is needed to create the pools, we just need to
use the `docker_exec_cmd` fact already defined in `ceph-defaults` to
achieve it.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1550281
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The initial keyring is generated locally on the ansible server and the snippet works well for both v2 and v3 of python.
I don't see any reason why we should explicitly invoke `python2` instead of just `python`.
In some setups, `python2` is not symlinked to `python`, while `python` and `python3` refer to v2 and v3 respectively.
Signed-off-by: Ha Phan <thanhha.work@gmail.com>
Prior to this commit the firewall tasks were not opening the ceph-mgr
ports. This would lead to an unclean configuration since the ceph-mgr
daemons cannot connect to the OSDs.
This commit opens the right ports on the ceph-mgr nodes to talk with the
OSDs.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1526400
Signed-off-by: Sébastien Han <seb@redhat.com>
The ceph command has to be executed from one of the monitor containers
if no admin copy is present on the RGWs. The task has to be delegated then.
Adds test to check proper RGW pool creation for Docker container scenarios.
Signed-off-by: Jorge Tudela <jtudelag@redhat.com>
Since the openstack_config.yml has been moved to `ceph-osd` we must move
this `set_fact` to ceph-osd, otherwise the tasks in
`openstack_config.yml` using `openstack_keys` will actually use the
default value from `ceph-defaults`.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1585139
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The first 14.x tag has been cut so this needs to be added so that
version detection will still work on the master branch of ceph.
Fixes: https://github.com/ceph/ceph-ansible/issues/2671
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
This is a follow up on #2628.
Even with the openstack pools creation moved later in the playbook,
there is still an issue because OSDs are not all UP when trying to
create pools.
Adding a task which checks for all OSDs to be UP with a `retries/until`
condition should definitively fix this issue.
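A sketch of the kind of wait task described here; the command, JSON field names and retry counts are illustrative, the `retries/until` mechanism is the point:
```
- name: wait for all osds to be up
  command: "ceph --cluster {{ cluster }} osd stat -f json"
  register: osd_stat
  delegate_to: "{{ groups[mon_group_name][0] }}"
  retries: 60
  delay: 10
  until: >
    (osd_stat.stdout | from_json).num_osds | default(1) ==
    (osd_stat.stdout | from_json).num_up_osds | default(0)
```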
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1578086
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
When playing ceph-mds role, mon nodes have set a fact with the default
pg num for osd pools, we can simply default to this value for cephfs
pools (`cephfs_pools` variable).
At the moment the variable definition for `cephfs_pools` looks like:
```
cephfs_pools:
- { name: "{{ cephfs_data }}", pgs: "" }
- { name: "{{ cephfs_metadata }}", pgs: "" }
```
and we have a task in `ceph-validate` to ensure `pgs` has been set to a
valid value.
We could simply avoid this check by setting the default value of `pgs`
to `hostvars[groups[mon_group_name][0]]['osd_pool_default_pg_num']` and
leave users the possibility to override this value.
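With that default, the variable definition would look something like this:
```
cephfs_pools:
  - { name: "{{ cephfs_data }}", pgs: "{{ hostvars[groups[mon_group_name][0]]['osd_pool_default_pg_num'] }}" }
  - { name: "{{ cephfs_metadata }}", pgs: "{{ hostvars[groups[mon_group_name][0]]['osd_pool_default_pg_num'] }}" }
```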
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1581164
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
in `ceph-osd` there is no need to set `docker_exec_cmd` since the only
place where this fact is used is in `openstack_config.yml`, which
delegates all docker commands to a monitor node. It means we need the
`docker_exec_cmd` fact that has been set referring to the `ceph-mon-*`
containers; this fact is already set earlier in `ceph-defaults`.
By the way, when collocating an OSD with a MON it fails because the container
`ceph-osd-{{ ansible_hostname }}` doesn't exist.
Removing this task will allow collocating an OSD with a MON.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1584179
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
When collocating an mds on a monitor node, the cephfs setup will fail
because `docker_exec_cmd` is reset to `ceph-mds-monXX`, which is
incorrect because we need to delegate the task to `ceph-mon-monXX`.
In addition, it wouldn't have worked since the `ceph-mds-monXX` container
isn't started yet.
Moving the task earlier in the `ceph-mds` role will fix this issue.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1578086
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
You can now use RGW_ZONE and RGW_ZONEGROUP on each rgw host in your
inventory and assign them a value. Once the rgw container starts it'll
pick up the info and add itself to the right zone.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1551637
Signed-off-by: Sébastien Han <seb@redhat.com>
When deploying a large number of OSD nodes it can be an issue because the
protection check [1] won't pass since it tries to create pools before all
OSDs are active.
The idea here is to move cephfs pools creation in `ceph-mds` role.
[1] e59258943b/src/mon/OSDMonitor.cc (L5673)
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1578086
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
When deploying a large number of OSD nodes it can be an issue because the
protection check [1] won't pass since it tries to create pools before all
OSDs are active.
The idea here is to move openstack pools creation at the end of `ceph-osd` role.
[1] e59258943b/src/mon/OSDMonitor.cc (L5673)
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1578086
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The NSS PKI database is needed only if radosgw_keystone_ssl
is explicitly set to true, otherwise the SSL integration is
not enabled.
It is worth noting that the PKI support was removed from Keystone
starting from the Ocata release, so some code paths should be
changed anyway.
Also, remove radosgw_keystone, which is not useful anymore.
This variable was used until fcba2c801a.
Now the profiles drive the setting of the rgw keystone * options.
Signed-off-by: Luigi Toscano <ltoscano@redhat.com>
The LVM lvcreate fails if the disk already has a GPT header.
We create a GPT header regardless of the OSD scenario. The fix is to
skip header creation for the lvm scenario.
fixes: https://github.com/ceph/ceph-ansible/issues/2592
Signed-off-by: Vishal Kanaujia <vishal.kanaujia@flipkart.com>
During a rolling update, OSDs are restarted twice currently. Once, by the
handler in roles/ceph-defaults/handlers/main.yml and a second time by tasks
in the rolling_update playbook. This change turns off restarts by the handler.
Further, the restart initiated by the rolling_update playbook is more
efficient as it restarts all the OSDs on a host as one operation and waits
for them to rejoin the cluster. The restart task in the handler restarts one
OSD at a time and waits for it to join the cluster.
A dev or rhcs install does not require ceph_stable_release to be set and
instead generates that by looking at the installed ceph version.
However, at this point in the playbook ceph may not have been installed
yet and ceph-common has not been run.
Fixes: https://github.com/ceph/ceph-ansible/issues/2618
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
The validation module does not get config options with the template
syntax rendered, so we're going to remove that and just default it to
False. The backwards compat was scheduled to be removed in 3.1 anyway.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
When devices is not defined because you want to use the 'lvm'
osd_scenario but you've made a mistake selecting that scenario these
tasks should not fail.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
An extra space in the systemctl list-units output can cause restart_osd_daemon.sh to
fail.
It looks like if you have more services enabled on the node, the space
between "loaded" and "active" becomes wider than the single space
assumed by the command [1].
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1573317
Signed-off-by: Sébastien Han <seb@redhat.com>
Check whether a mgr module is supposed to be disabled before disabling
it and whether it is already enabled before enabling it.
Signed-off-by: Michael Vollman <michael.b.vollman@gmail.com>
We can simply reference the template name since it exists within the
role that we are calling. We don't need to check the ANSIBLE_ROLE_PATH
or playbooks directory for the file.
To make the package installation more efficient we should install
packages as a list rather than as individual tasks or using a
"with_items" loop. The package managers can handle a list passed to them
to install in one go.
We can use a specified list and substitute any packages that are not to
be installed with the ceph-common package, which is installed on every
package install, then apply the unique filter to the package install
list.
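A minimal sketch of the pattern with illustrative variable names: one package task fed a de-duplicated list instead of a with_items loop:
```
- name: install ceph packages
  package:
    name: "{{ ceph_pkgs | unique }}"
    state: "{{ (upgrade_ceph_packages | bool) | ternary('latest', 'present') }}"
```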
There is no need to stat for created mgr keyrings since they are created
anyway when deploying a ceph cluster > jewel. In case of a jewel
deployment we won't enter that block.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This file is a leftover from PR ceph/ceph-ansible#2516
It is not used anymore so it can be removed.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Until all the mons have been updated to Luminous, there is no way to
create a key. So we should do the key creation in the mon role only if
we are not part of an update.
If we are then the key creation is done after the mons upgrade to
Luminous.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1574995
Signed-off-by: Sébastien Han <seb@redhat.com>
Trying to mask a target when `/etc/systemd/system/target.service` doesn't
exist seems to be a bug.
There is no need to mask a unit file which doesn't exist.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The order of fs.aio-max-nr (which is hard-coded to 1048576) means that
if you set fs.aio-max-nr in os_tuning_params it will effectively be
ignored for bluestore scenarios.
To resolve this we should move the setting of fs.aio-max-nr above the
setting of os_tuning_params, in this way the operator can define the
value of fs.aio-max-nr to be something other than 1048576 if they want
to.
Additionally, we can make the sysctl settings happen in 1 task rather
than multiple.
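A sketch of the single-task approach, assuming fs.aio-max-nr is simply prepended to the operator's list so that a later os_tuning_params entry overrides it:
```
- name: apply operating system tuning
  sysctl:
    name: "{{ item.name }}"
    value: "{{ item.value }}"
    state: present
    sysctl_file: /etc/sysctl.d/ceph-tuning.conf
  with_items: "{{ [{'name': 'fs.aio-max-nr', 'value': '1048576'}] + os_tuning_params }}"
```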
Trying to set the default value for pg_num to
`hostvars[groups[mon_group_name][0]]['osd_pool_default_pg_num']` will
break in the case of a deployment with external client nodes.
The `pg_num` attribute should be mandatory and be tested in the future
`ceph-validate` role.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
On containerized deployments,
when upgrading from jewel to luminous, mgr keyring creation fails because the
command to create the mgr keyring is executed on a container that is still
running jewel (the container is restarted later to run the new
image), therefore it fails with a bad entity error.
To get around this situation, we can delegate the command to create
these keyrings to the first monitor when we are running the playbook on the last monitor.
That way we ensure we will issue the command on a container that has
already been restarted with the new image.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1574995
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The Debian and SuSE installs for nfs-ganesha on the non-rhcs repository
require you to allow_unauthenticated for Debian, and disable_gpg_check
for SuSE. The nfs-ganesha-rgw package already does this, but the
nfs-ganesha-ceph package will fail to install because of this same
issue.
This PR moves the installations to happen when the appropriate flags are
set to True (nfs_obj_gw & nfs_file_gw), but does it per distro (one for
SuSE and one for Debian) so that the appropriate flag can be passed to
ignore the GPG check.
When 'ceph_nfs_disable_caching' is set to True, disable attribute
caching done by Ganesha for all Ganesha exports.
Signed-off-by: Ramana Raja <rraja@redhat.com>
If we are in the middle of an update we want to get the new package
version being installed, so the task that copies the repo files should
not be skipped.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1572032
Signed-off-by: Sébastien Han <seb@redhat.com>
The apt-cache update can fail due to transient issues related to the
action being a network operation. To reduce the impact of these
transient failures this patch adds a retry to the update_cache task.
However, the apt_repository tasks which would perform an apt_update
won't retry the apt_update on a failure in the same way, as such this PR
moves the apt_update into an individual task, once per role.
Finally, the apt_repository tasks no longer have a changed_when: false,
and the apt_cache update is only performed once per role, if the
repositories change. Otherwise the cache is updated on the "apt" install
tasks if the cache_timeout has been reached.
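The retry described above boils down to something like this (retry counts are illustrative):
```
- name: update apt cache
  apt:
    update_cache: yes
  register: result
  until: result is succeeded
  retries: 5
  delay: 2
```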
the value in `docker_exec_client_cmd` doesn't allow checking for
existing pools because it's set with a wrong value for the entrypoint
that is going to be used.
It means the check was going to fail anyway even if the pools actually exist.
Using jinja syntax to set `docker_exec_cmd` allows handling the case
where you don't have monitors in your inventory.
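A sketch of that Jinja-based fact, assuming the usual `mon_group_name` variable and the ceph-mon container naming used elsewhere in ceph-ansible:
```
- name: set docker_exec_cmd fact
  set_fact:
    docker_exec_cmd: "docker exec ceph-mon-{{ hostvars[groups[mon_group_name][0]]['ansible_hostname'] }}"
  when: groups.get(mon_group_name, []) | length > 0
```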
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
If openstack_pools contains an application key it will be used to apply
this application pool type to a pool.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1562220
Signed-off-by: Sébastien Han <seb@redhat.com>
As of ceph 12.2.5 the type of the parameter `type` is not a name anymore but
an id, therefore an `int` is expected otherwise it will fail with the
following error
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The last mon creates the keys with a particular mode; while copying them
to the other mons (first and second) we must re-use the mode that was
set.
The same applies for the client node: the slurp preserves the initial
'item' so we can get the mode for the copy.
Signed-off-by: Sébastien Han <seb@redhat.com>
This key is created after the last mon is up, so there is no need to try
to push it from the first mon. The initial mon container does not create
the mgr key; ansible does. So this key will never exist there.
The key will go into the fetch dir once the last mon is up, then when
the ceph-mgr plays it will try to get it from the fetch directory.
Signed-off-by: Sébastien Han <seb@redhat.com>
During the initial bootstrap of the first mon, the monmap file is
destroyed so it's not available and ansible will never find it.
Signed-off-by: Sébastien Han <seb@redhat.com>
Useful for software that does data collection/monitoring, like collectd.
They can connect to the socket and then retrieve information.
Even though the sockets are exposed now, I'm keeping the docker exec to
check the socket, this will allow newer version of ceph-ansible to work
with older versions.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1563280
Signed-off-by: Sébastien Han <seb@redhat.com>
We now have the ability to detect the uid/gid of the ceph user depending
on the distribution we are running on, even when we are doing non-container
deployments.
Signed-off-by: Sébastien Han <seb@redhat.com>
We now bindmount with the :z option at the end of the -v command, so
this will basically run the exact same command as we used to run. So to
speak:
chcon -Rt svirt_sandbox_file_t /var/lib/ceph
Signed-off-by: Sébastien Han <seb@redhat.com>
This fixes the case where the playbook died and never removed the
container. So now, once the container exits it will remove itself from
the container list.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1568157
Signed-off-by: Sébastien Han <seb@redhat.com>
If the user has set copy_admin_key to true we assume he/she wants to
import the key in Ceph and not only create the key on the filesystem.
Signed-off-by: Sébastien Han <seb@redhat.com>
ceph-authtool does not support raw arguments so we have to quote caps
declarations, like this: allow 'bla bla' instead of allow bla bla
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1568157
Signed-off-by: Sébastien Han <seb@redhat.com>
This commit does a couple of things:
* use a common.yml file that contains things that can be played on both
container and non-container
* refactor the ability to copy the admin key to the nodes
Signed-off-by: Sébastien Han <seb@redhat.com>
Red Hat is now using the tags [3,latest] for the image rhceph/rhceph-3-rhel7.
Because of this, the ceph_uid conditional passes for Debian
when 'ceph_docker_image_tag: latest' on RH deployments.
I've added an additional task to check for the rhceph image specifically,
and also updated the RH family task for the ceph/daemon [centos|fedora] tags.
Signed-off-by: Randy J. Martinez <ramartin@redhat.com>
When installing rhcs on Debian systems the Red Hat repos must have the
highest priority so we avoid package conflicts and install the rhcs
version.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1565850
Signed-off-by: Sébastien Han <seb@redhat.com>
There is no need to check for a running cluster n*nodes times in
`ceph-defaults`, so let's add a `run_once: true` to save some resources
and time.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Regardless of whether the partition is 'ceph' or something else, we don't want
to be as strict as checking for a particular partition.
If the drive has a partition, we just don't do anything.
This solves the case where the server reboots and disks get a different
/dev/sda (node) allocation. In this case, prior to restarting the server
/dev/sda was an OSD, but now it's /dev/sdb and the other way around.
In such a scenario, we would try to prepare the OSD and create a new
partition, so let's not mess around with devices that have partitions.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1498303
Signed-off-by: Sébastien Han <seb@redhat.com>
allow_multimds will be officially deprecated in Mimic, specify it
only for all versions of Ceph where it was declared stable. Going
forward, specify only max_mds.
Signed-off-by: Douglas Fuller <dfuller@redhat.com>
NFS-ganesha cannot start if the nfs-server service
is running. This commit stops nfs-server, in case it
is running on a (debian, redhat, suse) node, before
the nfs-ganesha service starts up.
fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1508506
Signed-off-by: Ali Maredia <amaredia@redhat.com>
Add a variable, ceph_nfs_disable_caching, that if set to true
disables ganesha's directory and attribute caching as much as
possible.
Also, disable caching done by ganesha, when 'nfs_file_gw'
variable is true, i.e., when Ganesha is used as CephFS's gateway.
This is the recommended Ganesha setting as libcephfs already caches
information. And doing so helps avoid cache incoherency issues
especially with clustered ganesha over CephFS.
Fixes: https://tracker.ceph.com/issues/23393
Signed-off-by: Ramana Raja <rraja@redhat.com>
If people keep on using mon_cap, osd_cap, etc., the playbook will
translate this old syntax on the fly.
Signed-off-by: Sébastien Han <seb@redhat.com>
backward compatibility with `ceph_mon_docker_interface` and
`ceph_mon_docker_subnet` was not working since there was no lookup on
`monitor_interface` and `public_network`
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Prior to this change, if a user had ceph-test-12.2.1 installed, and
upgraded to ceph v12.2.3 or newer, the RPM upgrade process would
fail.
The problem is that the ceph-test RPM did not depend on an exact version
of ceph-common until v12.2.3.
In Ceph v12.2.3, ceph-{osdomap,kvstore,monstore}-tool binaries moved
from ceph-test into ceph-base. When ceph-test is not yet up-to-date, Yum
encounters package conflicts between the older ceph-test and newer
ceph-base.
When all users have upgraded beyond Ceph < 12.2.3, this is no longer
relevant.
According to our recent change, we now use "CentOS" as the latest
container image. We need to reflect this in the ceph_uid.
Signed-off-by: Sébastien Han <seb@redhat.com>
Tripleo deployments failed when the monitors are not managed
by tripleo itself, with:
FAILED! => {"msg": "list object has no element 0"}
The failing play item was introduced by
f46217b69a.
fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1552327
Signed-off-by: Attila Fazekas <afazekas@redhat.com>
because of `serial: 1`, it can be an issue when the playbook is being
run on client nodes.
Since the refactor of `ceph-client` we skip the role `ceph-defaults` on
every node except the first client node; it means that the task is not
going to be played because of `run_once: true`.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This commit refactors this role so we don't have to pull the container image
on client nodes just to create pools and keys.
Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1550977
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This seems to be a leftover.
This commit removes an unnecessary 'set linux permissions' on
`/var/lib/ceph`
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This check is alone in `ceph-docker-common` since a previous code
refactor.
Moving this check to `ceph-defaults` allows us to run `ceph-clients`
without having to run `ceph-docker-common`, even in non-containerized
deployments.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Prior to this patch, the certificates were being generated on a single
node only (because of the run_once: true). Thus certificates were not
distributed to all the gateway nodes.
This would require a second ansible run to work. This patch fixes the
creation and distribution of the keys to all the nodes.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1540845
Signed-off-by: Sébastien Han <seb@redhat.com>
This update will resolve the error ['cephfs' is undefined] in multimds container deployments.
See: roles/ceph-mon/tasks/create_mds_filesystems.yml. The same last two tasks are present there, and actually need to happen in that role since "{{ cephfs }}" gets defined in
roles/ceph-mon/defaults/main.yml, and not roles/ceph-mds/defaults/main.yml.
Signed-off-by: Randy J. Martinez <ramartin@redhat.com>
Copy the admin key when configured as nfs_file_gw (but not nfs_obj_gw). Also,
copy/set up RGW-related directories only when configured as nfs_obj_gw.
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
This variable is needed for containerized clusters and is required for
the ceph-docker-common role. Typically the is_atomic variable is set in
site-docker.yml.sample, though, so if ceph-docker-common is used outside
of that playbook it needs to be set in another way. Moving the creation of
the variable inside this role means playbooks don't need to worry
about setting it.
fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1558252
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Since the approach to creating a ceph.conf file has changed, and now
no-longer relies on assembling config file fragments in /etc/ceph/ceph.d
we can avoid the conf_overrides rendering on the local host and skip out
the tasks related to that, instead using just the config_template task
to configure the file directly.
When creating pools, it's crucial to expose all the options available as
part of the pool creation command. As explained in:
http://docs.ceph.com/docs/jewel/rados/operations/pools/
Signed-off-by: Sébastien Han <seb@redhat.com>
If OSDs don't restart normally we now also dump info of the crush map,
crush rules, crush tree and pools.
If the monitors don't restart normally we also print the socket status
by calling mon_status and quorum_status.
Signed-off-by: Sébastien Han <seb@redhat.com>
The `pools` dict defined in `roles/ceph-client/defaults/main.yml`
shouldn't have `{{ ceph_conf_overrides.global.osd_pool_default_pg_num
}}` as default value for `pgs` keys.
For instance, if you want some pools to be created but without explicitly
specifying the pgs for these pools (it means you want to use the
`osd_pool_default_pg_num`), you will be obliged to define
`{{ ceph_conf_overrides.global.osd_pool_default_pg_num }}` anyway while you
wanted to use the current default value already defined in the cluster which is
retrieved early in the playbook and stored in the
`{{ osd_pool_default_pg_num }}` fact.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Running the last portion (insert new default and add new default crush
tasks) of crush_rules.yml only on the last monitor is
wrong since ceph CLI calls usually end up on the master having the
quorum, which is by default the one with the lower IP.
So if we run the command and end up on another mon the creation will
happen on the default crush rule because the particular mon hasn't been
updated.
To fix this we remove the |last on the include and use run_once: true on
certain tasks, then we let the final two tasks run on all the monitors.
Signed-off-by: Sébastien Han <seb@redhat.com>
On releases after jewel the option
'osd_pool_default_crush_replicated_ruleset' does not exist anymore, it's
called osd_pool_default_crush_rule.
Signed-off-by: Sébastien Han <seb@redhat.com>
This was causing a lot of pain with the handlers. Also the
implementation was not ideal since we were assembling files. Everything
can now be done with the ceph_crush module so let's remove that.
Signed-off-by: Sébastien Han <seb@redhat.com>
Instead of creating the CRUSH hierarchy with Ansible tasks using the
command module we now rely on the ceph_crush module.
Signed-off-by: Sébastien Han <seb@redhat.com>
One could want to add new crush rules while keeping the current default rule.
Fixed it so that it works with all rules defined as "default: false". If multiple rules are defined as default (which should not happen) then the last rule listed in "crush_rules" is taken as the default.
As part of fcba2c801a these vars were
removed and no longer do anything:
radosgw_dns_name
radosgw_resolve_cname
This patch removes them from the group_vars files and defaults/main.yml
If we now set copy_admin_key while running a containerized scenario, the
ceph admin key will be copied to the node.
Signed-off-by: Sébastien Han <seb@redhat.com>
In case the admin key wasn't copied over to the node this command would
fail. So it's safer to run it from a monitor directly.
Signed-off-by: Sébastien Han <seb@redhat.com>
That task is failing on containerized deployments because `ceph:ceph`
doesn't exist.
The idea here is to use `{{ ceph_uid }}` to set the ownership of
the admin keyring when containerized_deployment.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1540578
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The nfs-ganesha package has been fixed as part of this commit:
963b6681df
Once the package is rebuilt this should be good to merge.
This reverts commit e88af3c4cb.
Previously it was necessary to provide a value (possibly an
empty string) for the "rule_name" key for each item in
openstack_pools. This change makes that optional and defaults to an
empty string when not given.
Using updatedb -e doesn't make a permanent change, but will run updatedb
without the passed path.
To make this change more permanent we should update the
/etc/updatedb.conf file to include /var/lib/ceph.
Don't merge this.
Test to see if we copy over the nfs-ganesha-lock.service.debian8 file
properly, and whether the Xenial CI job will work.
The upstream download.ceph.com nfs-ganesha package should be fixed for
xenial (which is in progress).
This fact is already set in site-docker.yml so there's no need to check
it again in ceph-docker-common
Signed-off-by: Paul Bourke <paul.bourke@oracle.com>
This patch fixes an issue where, if hosts have different service lists,
it will prevent restarting changes on services that run later on.
For example, hostA in the mons and rgws group would initiate a config
change and restart of services on all mons and rgws hosts, even though
a separate hostB (which is only in the rgws group) has not had its
configuration changed yet. Additionally, when the second host has its
configuration changed as part of the ceph-rgw role, it will not initiate
a restart since its inventory name != the first host's.
To fix this we should run the restart once (using run_once: True)
as long as the host has called the handler. This will ensure that even
if only 1 host has called the handler it will initiate a restart on all
hosts that have called the handler.
Additionally, we add a var that is set when the handler runs; this will
ensure that only hosts that have called the handler get restarted.
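A sketch of that handler pattern (the flag name, group and script path are illustrative):
```
- name: set flag on hosts whose configuration changed
  set_fact:
    _rgw_handler_called: true

- name: restart rgw daemons on hosts that called the handler
  command: /usr/bin/env bash /tmp/restart_rgw_daemon.sh
  run_once: true
  with_items: "{{ groups[rgw_group_name] }}"
  delegate_to: "{{ item }}"
  when: hostvars[item]['_rgw_handler_called'] | default(false) | bool
```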
Includes minor fix to remove unrequired "inventory_hostname in
play_hosts" when: clause. This is no longer required since the handlers
were changed. The host calling the handler will be in play_hosts
already.
When used along with delegation, run_once does not play well. Thus,
using | last always brings the desired result.
Signed-off-by: Sébastien Han <seb@redhat.com>
We now look for any existing containers; if there are any, we compare their
running image with the latest pulled container image.
For OSDs, we iterate over the list of running OSDs; this handles the
case where the first OSD of the list has been updated (runs the new
image) and not the others.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1526513
Signed-off-by: Sébastien Han <seb@redhat.com>
Multipath disks have partitions with a different format than what
ceph-ansible currently supports, this update makes ceph-ansible
aware of that format so multipath disks can be used as OSDs
Signed-off-by: Caleb Boylan <caleb.boylan@ormuco.com>
Since Luminous we need to set the application tag for each pool,
otherwise a CEPH_WARNING is generated when the pools are in use.
We should assign the OpenStack pools to their default which would be
"rbd". When updating to Luminous this would happen automatically to the
vms, images, backups and volumes pools, but for new deploys this is not
the case.
osd_scenario does not exist in the ceph-default role, so if we try to
play ceph-default on an OSD node, the playbook will fail with an undefined
variable error.
Signed-off-by: Sébastien Han <seb@redhat.com>
This commit fixes a bug that occurs especially for dmcrypt scenarios.
There is an issue where the 'disk_list' container can't reach the ceph
cluster because it's not launched with `--net=host`.
If this container can't reach the cluster, it will hang on this step
(when trying to retrieve the dm-crypt key):
```
+common_functions.sh:448: open_encrypted_part(): ceph --cluster abc12 --name \
client.osd-lockbox.9138767f-7445-49e0-baad-35e19adca8bb --keyring \
/var/lib/ceph/osd-lockbox/9138767f-7445-49e0-baad-35e19adca8bb/keyring \
config-key get dm-crypt/osd/9138767f-7445-49e0-baad-35e19adca8bb/luks
+common_functions.sh:452: open_encrypted_part(): base64 -d
+common_functions.sh:452: open_encrypted_part(): cryptsetup --key-file \
-luksOpen /dev/sdb1 9138767f-7445-49e0-baad-35e19adca8bb
```
It means the `ceph-run-osd.sh` script won't be able to start the
`osd_disk_activate` process in ceph-container because it won't have
filled the `$DOCKER_ENV` environment variable properly.
Adding `--net=host` to the 'disk_list' container fixes this issue.
Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1543284
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
While hostname -f will always return a hostname including its
domain part and -s one without the domain part, the behavior when
no arguments are given can include or not include the domain part
depending on how the system is configured; the socket name might
not match the instance name then.
This was called too early; the container was not yet started so the commands failed.
Moved the section after the include of docker/main.yml.
Signed-off-by: Greg Charot <gcharot@redhat.com>
Use a nicer syntax for `local_action` tasks.
We used to have oneliner like this:
```
local_action: wait_for port=22 host={{ hostvars[inventory_hostname]['ansible_default_ipv4']['address'] }} state=started delay=10 timeout=500 }}
```
The usual syntax:
```
local_action:
module: wait_for
port: 22
host: "{{ hostvars[inventory_hostname]['ansible_default_ipv4']['address'] }}"
state: started
delay: 10
timeout: 500
```
is nicer and kind of way to keep consistency regarding the whole
playbook.
This also fix a potential issue about missing quotation :
```
Traceback (most recent call last):
File "/tmp/ansible_wQtWsi/ansible_module_command.py", line 213, in <module>
main()
File "/tmp/ansible_wQtWsi/ansible_module_command.py", line 185, in main
rc, out, err = module.run_command(args, executable=executable, use_unsafe_shell=shell, encoding=None, data=stdin)
File "/tmp/ansible_wQtWsi/ansible_modlib.zip/ansible/module_utils/basic.py", line 2710, in run_command
File "/usr/lib64/python2.7/shlex.py", line 279, in split
return list(lex)
File "/usr/lib64/python2.7/shlex.py", line 269, in next
token = self.get_token()
File "/usr/lib64/python2.7/shlex.py", line 96, in get_token
raw = self.read_token()
File "/usr/lib64/python2.7/shlex.py", line 172, in read_token
raise ValueError, "No closing quotation"
ValueError: No closing quotation
```
writing `local_action: shell echo {{ fsid }} | tee {{ fetch_directory }}/ceph_cluster_uuid.conf`
can cause trouble because it complains about missing quotes; this fix solves the issue.
Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1510555
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
With two public networks configured, we found that with
"NETWORK_ADDR_1, NETWORK_ADDR_2" the install process consistently became
broken, trying to find the docker registry on the second network and not
finding the mon container,
but without spaces
("NETWORK_ADDR_1,NETWORK_ADDR_2") the install succeeds.
So, the containerized install is more particular about the formatting of this line.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1534003
Signed-off-by: Sébastien Han <seb@redhat.com>
Description of problem: The 'get osd id' task goes through all the 10 times (and its respective timeouts) to make sure that the number of OSDs in the osd directory match the number of devices.
This happens always, regardless if the setup and deployment is correct.
Version-Release number of selected component (if applicable): Surely the latest. But any ceph-ansible version that contains ceph-volume support is affected.
How reproducible: 100%
Steps to Reproduce:
1. Use ceph-volume (LVM) to deploy OSDs
2. Avoid using anything in the 'devices' section
3. Deploy the cluster
Actual results:
TASK [ceph-osd : get osd id _uses_shell=True, _raw_params=ls /var/lib/ceph/osd/ | sed 's/.*-//'] **********************************************************************************************************************************************
task path: /Users/alfredo/python/upstream/ceph/src/ceph-volume/ceph_volume/tests/functional/lvm/.tox/xenial-filestore-dmcrypt/tmp/ceph-ansible/roles/ceph-osd/tasks/start_osds.yml:6
FAILED - RETRYING: get osd id (10 retries left).
FAILED - RETRYING: get osd id (9 retries left).
FAILED - RETRYING: get osd id (8 retries left).
FAILED - RETRYING: get osd id (7 retries left).
FAILED - RETRYING: get osd id (6 retries left).
FAILED - RETRYING: get osd id (5 retries left).
FAILED - RETRYING: get osd id (4 retries left).
FAILED - RETRYING: get osd id (3 retries left).
FAILED - RETRYING: get osd id (2 retries left).
FAILED - RETRYING: get osd id (1 retries left).
ok: [osd0] => {
"attempts": 10,
"changed": false,
"cmd": "ls /var/lib/ceph/osd/ | sed 's/.*-//'",
"delta": "0:00:00.002717",
"end": "2018-01-21 18:10:31.237933",
"failed": true,
"failed_when_result": false,
"rc": 0,
"start": "2018-01-21 18:10:31.235216"
}
STDOUT:
0
1
2
Expected results:
There aren't any (or just a few) timeouts while the OSDs are found
Additional info:
This is happening because the check is mapping the number of "devices" defined for ceph-disk (in this case it would be 0) to match the number of OSDs found.
Basically this line:
until: osd_id.stdout_lines|length == devices|unique|length
Means in this 2 OSD case it is trying to ensure the following incorrect condition:
until: 2 == 0
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1537103
This should default to False. The default for Keystone is not to use PKI
keys, additionally, anybody using this setting had to have been manually
setting it before.
Fixes: #2111
This allows us to use host-specific variables in the ceph_conf_overrides variable. For example, this fixes the usage of such variables (e.g. 'nss db path' having {{ ansible_hostname }} inside) in ceph_conf_overrides for the rados gateway configuration (see profiles/rgw-keystone-v3) - issue #2157.
Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>
Sometimes the playbook gets stuck because, even with the `--connect-timeout=`
option, the connection to the existing ceph cluster never times out.
As a workaround, using the `timeout` command provided by coreutils will
actually time out if we can't connect to the cluster.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1537003
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
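A minimal sketch of the workaround, with illustrative timeout values; wrapping the call in coreutils' `timeout` guarantees the task returns even if `--connect-timeout` is ignored:
```
- name: check if a ceph cluster is already running
  command: timeout 5 ceph --connect-timeout 3 --cluster {{ cluster }} fsid
  register: ceph_current_fsid
  failed_when: false
  changed_when: false
```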
This is to keep backward compatibility with stable-2.2 and satisfy the
check "verify dedicated devices have been provided" in
`check_mandatory_vars.yml`. This check is looking for
`dedicated_devices` so we need to default its value to
`raw_journal_devices` when `raw_multi_journal` is set to `True`.
Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1536098
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
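A minimal sketch of the backward-compatibility shim described above, assuming the legacy variables are present in group_vars:
```
- name: default dedicated_devices to the legacy raw_journal_devices
  set_fact:
    dedicated_devices: "{{ raw_journal_devices }}"
  when:
    - raw_multi_journal | default(false) | bool
    - raw_journal_devices is defined
```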
Some systems that were deployed with old tools can leave units named
"ceph-radosgw@radosgw.gateway.service". As a consequence, they will
prevent the new unit from starting.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1509584
Signed-off-by: Sébastien Han <seb@redhat.com>
Currently, we can define crush location for each host but only crush roots and crush rules are created. This commit automates other routines for a complete solution:
1) Creates rack type crush buckets defined in {{ ceph_crush_rack }} of each osd host. If it's not defined by the user then a rack named 'default_rack_{{ ceph_crush_root }}' will be added and used in the next steps.
2) Move rack type crush buckets defined in {{ ceph_crush_rack }} into crush roots defined in {{ ceph_crush_root }} of each osd host.
3) Move hosts defined in {{ ceph_crush_rack }} into crush roots defined in {{ ceph_crush_root }} of each osd host.
Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>
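Illustrative host_vars for an OSD node (bucket names are examples only); when `ceph_crush_rack` is omitted, the 'default_rack_{{ ceph_crush_root }}' fallback described above is used:
```
# host_vars/osd0.yml (example values)
ceph_crush_root: root-hdd
ceph_crush_rack: rack-a1
```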
In a non-collocated scenario, if a drive is faulty we can't really
remove it from the 'devices' list without messing up or having to
re-arrange the order of 'dedicated_devices'. We want to keep this
device list ordered. This will prevent the activation from failing on a
device that we know is failing but can't remove yet, so as not to mess up
the mapping between devices and dedicated_devices.
Signed-off-by: Sébastien Han <seb@redhat.com>
Having handlers in both the ceph-defaults and ceph-docker-common roles can make
the playbook restart services twice. Handlers can be triggered a first
time because of a change in ceph.conf and a second time because a new
image has been pulled.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
There was no ideal way to implement this.
We had several options and none of them were great since handlers cannot
be triggered cross-roles.
We could have achieved that by doing:
* option 1 was to add a dependency in the meta of the ceph-docker-common
role. We had that long ago and we decided to stop, so everything is
managed via site.yml
* option 2 was to import files from another role. This is messy and we
don't do that anywhere in the current code base, and we will keep it that way.
There is option 3 where we pull the image from the ceph-config role.
This is not suitable either since the docker command won't be available
unless you run an Atomic distro. It would also mean pulling twice: a first
time in ceph-config, a second time in ceph-docker-common.
The only option I came up with was to duplicate a bit of the ceph-config
handlers code.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1526513
Signed-off-by: Sébastien Han <seb@redhat.com>
In a containerized deployment, `docker_exec_cmd` is not set before the
task which tries to retrieve the current fsid is played, which means it
considers there is no existing fsid and tries to generate a new one.
Typical error:
```
ok: [mon0 -> mon0] => {
"changed": false,
"cmd": [
"ceph",
"--connect-timeout",
"3",
"--cluster",
"test",
"fsid"
],
"delta": "0:00:00.179909",
"end": "2018-01-09 10:36:58.759846",
"failed": false,
"failed_when_result": false,
"rc": 1,
"start": "2018-01-09 10:36:58.579937"
}
STDERR:
Error initializing cluster client: Error('error calling conf_read_file: errno EINVAL',)
```
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Previously we were using ceph_conf_overrides, however this doesn't play
nice with software like TripleO that uses ceph_conf_overrides inside its
own code. For now, and since this is the only occurrence of this, we can
ensure no logs through the ceph conf template.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1532619
Signed-off-by: Sébastien Han <seb@redhat.com>
There is no reason why we can't use crush rules when deploying
containers. So moving the include into main.yml so it can be called.
Signed-off-by: Sébastien Han <seb@redhat.com>
ceph-create-keys is idempotent so it's not an issue to run it each time
we play ansible. This also fixes issues where the 'creates' arg skips the
task and no keys get generated on newer versions, e.g. during an upgrade.
Closes: https://github.com/ceph/ceph-ansible/issues/2228
Signed-off-by: Sébastien Han <seb@redhat.com>
When upgrading from OSP11 to OSP12 container, ceph-ansible attempts to
disable the RGW service provided by the overcloud image. The task
attempts to stop/disable ceph-rgw@{{ ansible-hostname }} and
ceph-radosgw@{{ ansible-hostname }}.service. The actual service name is
ceph-radosgw@radosgw.$name
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1525209
Signed-off-by: Sébastien Han <seb@redhat.com>
the gpt label creation doesn't work even with the parted module.
This commit fixes the gpt label creation by using the parted command
instead.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
We have a scenario when we switch from non-container to containers. This
means we don't know anything about the ceph partitions associated with an
OSD. Normally in a containerized context we have files containing the
preparation sequence. From these files we can get the capabilities of
each OSD. As a last resort we use a ceph-disk call inside a dummy bash
container to discover the ceph journal on the current osd.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1525612
Signed-off-by: Sébastien Han <seb@redhat.com>
This resolves the following error:
E: There were unauthenticated packages and -y was used without
--allow-unauthenticated
Signed-off-by: Sébastien Han <seb@redhat.com>
The name docker_version is very generic and is also used by other
roles. As a result, there may be name conflicts. To avoid this a
ceph_ prefix should be used for this fact. Since it is an internal
fact renaming is not a problem.
making `osd_pool_default_pg_num` mandatory is a bit aggressive and is
unrelated when you just want to create user keyrings.
Closes: #2241
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
the entrypoint to generate the users keyring is `ceph-authtool`; therefore,
it cannot expand the `$(ceph-authtool --gen-print-key)` inside the
container. Users must generate a keyring themselves.
This commit also adds a check to ensure keyrings are properly filled when
`user_config: true`.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Add the variables ceph_osd_docker_cpuset_cpus and
ceph_osd_docker_cpuset_mems, so that a user may specify
the CPUs and memory nodes of NUMA systems on which OSD
containers are run.
Provides an example in osds.yaml.sample to guide the user,
based on sample `lscpu` output, since cpuset-mems refers
to memory by NUMA node only while cpuset-cpus can
refer to individual vCPUs within a NUMA node.
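Example values only; the right numbers depend on the host's NUMA layout as reported by `lscpu`:
```
# pin OSD containers to vCPUs 0-3 on NUMA node 0
ceph_osd_docker_cpuset_cpus: "0-3"
ceph_osd_docker_cpuset_mems: "0"
```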
If a deployer uses an interface name with a dash/hyphen in it, such
as 'br-storage' for the monitor_interface group_var, the ceph.conf.j2
template fails to find the right facts. It looks for
'ansible_br-storage' but only 'ansible_br_storage' exists.
This patch converts the interface name to underscores when the
template does the fact lookup.
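A hedged sketch of the lookup (the `_mon_addr` fact name is illustrative): normalizing the interface name before resolving the fact makes 'br-storage' map to 'ansible_br_storage'.
```
- name: resolve the monitor address from the interface facts
  set_fact:
    _mon_addr: "{{ hostvars[inventory_hostname]['ansible_' + (monitor_interface | replace('-', '_'))]['ipv4']['address'] }}"
```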
The CI complains because of `ceph_uid` fact which doesn't exist since
the docker image tag used in the CI doesn't match this condition.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This is particularly useful in CI environments where you don't have
the option of adding extra devices or volumes to the host. It is also
a simple change to support loopback devices.
In case where docker CLI is available but docker is not running, we
don't want to trigger the restart of the daemons.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1510555
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Since some daemons now install their own packages the task checking the
ceph version fails on Debian systems. So the 'ceph-common' package must
be installed on all the machines.
Signed-off-by: Sébastien Han <seb@redhat.com>
This task was originally needed to fix a docker installation issue
(see: #1030). This has been fixed, therefore it can be removed.
Fixes: #2199
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
A recent change [1] required that the openstack_keys
param always contain an acls list. However, it's
possible it might not contain that list. Thus, this
param sets a default for that list to be empty if it
is not in the structure as defined by the user.
[1] d65cbaa539
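A hedged sketch of one way such a default can be expressed (task shape illustrative, not the literal patch): with `skip_missing`, keys that carry no acls list are simply skipped instead of breaking the loop.
```
- name: set ACLs on the openstack keyrings
  acl:
    path: "/etc/ceph/{{ cluster }}.{{ item.0.name }}.keyring"
    entry: "{{ item.1 }}"
    state: present
  with_subelements:
    - "{{ openstack_keys }}"
    - acls
    - skip_missing: true
```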
We were activating dmcrypt devices with the wrong command. Basically the
first task executes the wrong activate command. The task fails but
continues because of the 'failed_when: false'. Then the right activation
sequence is being done by the next task.
Signed-off-by: Sébastien Han <seb@redhat.com>
when `ceph-rbd-mirror.target` is not enabled, the service won't start
after a reboot because there is a dependency between these two units.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
If ceph-ansible deploys a Ceph cluster with "openstack_config: true"
and sets the openstack_keys map to have certain ACLs or permissions,
the requested ACLs or permissions are only set on one of the monitor
nodes [2] when they should be set on all of them.
This patch solves [3] the above issue by having the chmod and setfacl
tasks iterate the list of mon nodes (including the mon node that the
task was delegated to) to apply the chmod of setfacl to the keys in
openstack_keys.
[1]
```
openstack_keys:
- { name: client.openstack, key: "$(ceph-authtool --gen-print-key)", mon_cap: "allow r", osd_cap: "allow class-read object_prefix rbd_children, allow rwx pool=images, allow rwx pool=vms, allow rwx pool=volumes, allow rwx pool=backups", mode: "0600", acls: ["u:nova:r--", "u:cinder:r--", "u:glance:r--", "u:gnocchi:r--"] }
```
[2]
```
$ ansible mons -m shell -b -a "ls -l /etc/ceph/ceph.client.openstack.keyring ; getfacl /etc/ceph/ceph.client.openstack.keyring"
192.168.1.26 | SUCCESS | rc=0 >>
-rw-r-----+ 1 root root 253 Nov 3 20:30 /etc/ceph/ceph.client.openstack.keyring
user::rw-
user:glance:r--
user:nova:r--
user:cinder:r--
user:gnocchi:r--
group::---
mask::r--
other::---getfacl: Removing leading '/' from absolute path names
192.168.1.29 | SUCCESS | rc=0 >>
-rw-r--r--. 1 root root 253 Nov 3 20:30 /etc/ceph/ceph.client.openstack.keyring
user::rw-
group::r--
other::r--getfacl: Removing leading '/' from absolute path names
192.168.1.23 | SUCCESS | rc=0 >>
-rw-r--r--. 1 root root 253 Nov 3 20:30 /etc/ceph/ceph.client.openstack.keyring
user::rw-
group::r--
other::r--getfacl: Removing leading '/' from absolute path names
$
```
[3]
```
(undercloud) [stack@hci-director ceph-ansible]$ ansible mons -m shell -b -a "ls -l /etc/ceph/ceph.client.openstack.keyring ; getfacl /etc/ceph/ceph.client.openstack.keyring"
192.168.1.25 | SUCCESS | rc=0 >>
-rw-r-----+ 1 root root 253 Nov 14 01:12 /etc/ceph/ceph.client.openstack.keyring
user::rw-
user:glance:r--
user:nova:r--
user:cinder:r--
user:gnocchi:r--
group::---
mask::r--
other::---getfacl: Removing leading '/' from absolute path names
192.168.1.29 | SUCCESS | rc=0 >>
-rw-r-----+ 1 root root 253 Nov 14 01:12 /etc/ceph/ceph.client.openstack.keyring
user::rw-
user:glance:r--
user:nova:r--
user:cinder:r--
user:gnocchi:r--
group::---
mask::r--
other::---getfacl: Removing leading '/' from absolute path names
192.168.1.27 | SUCCESS | rc=0 >>
-rw-r-----+ 1 root root 253 Nov 14 01:12 /etc/ceph/ceph.client.openstack.keyring
user::rw-
user:glance:r--
user:nova:r--
user:cinder:r--
user:gnocchi:r--
group::---
mask::r--
other::---getfacl: Removing leading '/' from absolute path names
(undercloud) [stack@hci-director ceph-ansible]$
```
Add support for the openSUSE distributions. The required packages
are available either in the distribution repositories or in the
OBS one.
Signed-off-by: Markos Chandras <mchandras@suse.de>
When we consume the distribution packages, we don't have a choice of
which version to install, so we shouldn't require that variable to be
set. Distributions normally provide only one version of Ceph in the
official repositories so we get whatever they provide.
Signed-off-by: Markos Chandras <mchandras@suse.de>
openSUSE Leap 42.3 provides support for Ceph Luminous in both the
distribution package and the latest available version in the OBS
repository so add these as the only available installation methods for
openSUSE.
Signed-off-by: Markos Chandras <mchandras@suse.de>
Like 80d32dec, the path to the fact is not correct.
In any case, we will retrieve the IP address in hostvars; the variable
is the way we get the interface name according to where it has been set
(e.g.: inventory host file vs. group_vars/)
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1510906
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
there is no need to have a condition on this task, this test should be
always run since the result will be interpreted later.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This will prevent ceph-ansible from using a loop device while it
shouldn't in auto_discovery mode.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The path to the fact is not correct.
In any case, we will retrieve the IP address in hostvars; the variable
is the way we get the interface name according to where it has been set
(e.g.: inventory host file vs. group_vars/)
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1510906
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Use `devices` variable instead of `ansible_devices`, otherwise it means
we are not using the devices which have been 'auto discovered'
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The current code will also return lvm devices such as /dev/dm-2; this
kind of device is not supported by ceph-disk at the moment, so we
now just ignore them.
Signed-off-by: Sébastien Han <seb@redhat.com>
- One cannot run scripts in place on a filesystem mounted with the `noexec`
option, but one can run scripts as arguments to `bash`/`sh`.
Signed-off-by: Arano-kai <captcha.is.evil@gmail.com>
During the initial implementation of this 'old' thing we were falling
into this issue without noticing
https://github.com/moby/moby/issues/30341 and were blindly using --rm;
now that this is fixed the prepare container disappears and thus activation
fails.
I'm fixing this for old jewel images.
This also fixes the machine reboot case where the docker logs are
purged. In the old scenario, we now store the log locally in the same
directory as the ceph-osd-run.sh script.
Signed-off-by: Sébastien Han <seb@redhat.com>
Setting monitor_interface in group_vars/all.yml makes the
hostvars[host]['monitor_interface'] non-existing.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1507922
Signed-off-by: Sébastien Han <seb@redhat.com>
Only chmod or setfacl the requested keyring(s) in the
openstack_keys data structure when the mode or acls keys
of that data structure exist.
User may specify four permission combinations for the
keyring file(s): 1. only set ACL, 2. only set mode,
3. set neither mode nor ACL, 4. set mode and then ACL.
Fixes: #2092
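A minimal sketch of the conditional described above (task shape illustrative): the mode is only applied to keyrings for which the user actually set one.
```
- name: set the requested mode on the openstack keyrings
  file:
    path: "/etc/ceph/{{ cluster }}.{{ item.name }}.keyring"
    mode: "{{ item.mode }}"
  with_items: "{{ openstack_keys }}"
  when: item.mode is defined
```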
Use "ceph_tcmalloc_max_total_thread_cache" to set the
TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES value inside /etc/default/ceph for
Debian installs, or /etc/sysconfig/ceph for Red Hat/CentOS installs.
By default this is set to 0, so the default package value will be used,
if specified this value will be changed to match the variable, and ceph
osd services will be restarted.
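An illustrative group_vars setting (the value is an example, in bytes); 0 keeps the package default as described above:
```
# 128 MiB thread cache for tcmalloc
ceph_tcmalloc_max_total_thread_cache: 134217728
```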
There was a huge resync from luminous to jewel in ceph-docker:
https://github.com/ceph/ceph-docker/pull/797
This change brought a new handy function to discover partitions tied to
an OSD. This function doesn't exist in the old image so the
ceph-osd-run.sh script breaks when trying to deploy a Jewel OSD with that
old Jewel image version.
Signed-off-by: Sébastien Han <seb@redhat.com>
stable-3.0 brought numerous changes in ceph-ansible variables; this PR
aims to maintain backward compatibility for someone running stable-2.2
and upgrading to stable-3.0 while keeping their group_vars untouched.
We will then determine the right options to make sure the upgrade works,
but we expect the new variables to be used.
We will drop this in the near future, maybe 3.1 or 3.2.
Signed-off-by: Sébastien Han <seb@redhat.com>
This feature isn't available before luminous; therefore, we need to play
these tasks only on luminous and later, otherwise the playbook will fail.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 3f3d4b9c727d06154c422d445fc2a245aceaed89)
3a58757 introduced an issue for Jewel deployments, since this role is
skipped, `enabled_ceph_mgr_modules.stdout` doesn't exist, therefore, it
ends up with an attribute error.
Uses `.get()` to retrieve `stdout` with a default value so it won't fail
if this attribute doesn't exist (jewel).
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
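A hedged sketch of the pattern (task shape illustrative): `.get()` returns a default when the registered variable comes from a skipped task and therefore has no 'stdout' attribute.
```
- name: enable ceph mgr modules
  command: "{{ docker_exec_cmd | default('') }} ceph mgr module enable {{ item }}"
  with_items: "{{ ceph_mgr_modules }}"
  when: item not in enabled_ceph_mgr_modules.get('stdout', '')
```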
In Jewel, we don't use the bootstrap-rbd keyring for rbd-mirror nodes; this
results in a socket path/name that differs depending on which ceph
release you are deploying.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This commit adds new osd scenarios; it aims to simplify the CI setup and
bring better coverage of the OSD scenarios.
We decided to differentiate between filestore and bluestore, thinking
ahead to when filestore won't be supported anymore.
So we now have two classes of tests:
* Filestore
* Bluestore
In each of those classes we have container and non-container.
Then for each we test the following:
* collocated
* collocated dmcrypt
* non-collocated
* non-collocated dmcrypt
* auto discovery collocated
* auto discovery collocated dmcrypt
This gives us a nice coverage and also reduces the footprint on the CI.
We are now up to 4 scenarios, each containing 6 OSD VMs.
Signed-off-by: Sébastien Han <seb@redhat.com>
When doing collocation the condition "inventory_hostname in play_hosts"
is breaking the restart workflow.
Signed-off-by: Sébastien Han <seb@redhat.com>
- Add upgrade support for rbd mirror and nfs daemons.
- Only works with systemd (remove sysvinit and upstart occurence)
- A bit of cleanup
Signed-off-by: Sébastien Han <seb@redhat.com>
This will solve the following issue when starting docker containers on ubuntu:
invalid argument "1\u00a0" for --cpus=1 : failed to parse 1 as a rational number
Closes-bug: #2056
1. add the variables to docker_collocation
2. trigger the check when an MDS is part of the inventory file, not when
we run on an MDS...
Signed-off-by: Sébastien Han <seb@redhat.com>
The container deployment is serialized, so add this task as a best
effort. If docker is already present we pull the image, otherwise we wait
for the role to play.
Signed-off-by: Sébastien Han <seb@redhat.com>
on jewel `ceph-rbd-mirror.target` isn't enabled, therefore, if the node
is rebooted, the service doesn't get started.
from ceph-rbd-mirror unit file:
```
[Install]
WantedBy=ceph-rbd-mirror.target
```
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This patch simplifies the checks and installation tasks for NTP.
Debian and Red Hat had a check for NTP's presence but would then
install NTP right afterwards anyways. In addition, there were
tasks for atomic that weren't used anywhere else in the role.
This patch also uses a dynamic include to reduce delays from
skipped tasks.
This patch changes the `when:` keys so that they have no jinja2
delimiters. This avoids Ansible warnings which could turn into
errors in a future Ansible release.
We now have a variable called ceph_pools that is mandatory when
deploying an MDS.
It's a dictionary that contains a pool name and a PG count. The PG count is
mandatory and must be set; the playbook will fail otherwise.
Closes: https://github.com/ceph/ceph-ansible/issues/2017
Signed-off-by: Sébastien Han <seb@redhat.com>
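One plausible shape for the variable given the description above (pool names and PG counts are examples only, not the shipped defaults):
```
ceph_pools:
  cephfs_data: 64
  cephfs_metadata: 32
```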
Ansible throws warnings when using yum/dnf/rpm with the command
module:
[WARNING]: Consider using yum module rather than running yum
This patch adds the `warn: no` argument to suppress the warnings
in the Ansible output.
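A minimal sketch of the pattern (task name and command illustrative):
```
- name: check the installed ceph version
  command: rpm -q ceph-common
  args:
    warn: no
  register: ceph_rpm_query
  changed_when: false
```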
The `always_run` key is deprecated and being removed in Ansible 2.4.
Using it causes a warning to be displayed:
[DEPRECATION WARNING]: always_run is deprecated.
This patch changes all instances of `always_run` to use the `always`
tag, which causes the task to run each time the playbook runs.
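A minimal sketch of the replacement (task content illustrative):
```
# before: always_run: true
- name: set the cluster name fact
  set_fact:
    cluster: "{{ cluster | default('ceph') }}"
  tags:
    - always
```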
Modern versions of Ansible can handle a list of packages passed
directly to the package modules. This patch optimizes the package
install process by passing the list of packages directly to the
module.
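A minimal sketch (package names illustrative): one task call with a list replaces a with_items loop over individual installs.
```
- name: install ceph packages
  package:
    name:
      - ceph-common
      - ceph-mon
    state: present
```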
This is causing unknown issues when trying to start a dmcrypt container.
Basically the container is stuck at mount, opening the LUKS device. It
is still unknown why this causes trouble but we need to move
forward. Also, this doesn't seem to help in any way to fix the race
condition we've seen.
Here is the log for dmcrypt:
cryptsetup 1.7.4 processing "cryptsetup --debug --verbose --key-file
key luksClose fbf8887d-8694-46ca-b9ff-be79a668e2a9"
Running command close.
Locking memory.
Installing SIGINT/SIGTERM handler.
Unblocking interruption on signal.
Allocating crypt device context by device
fbf8887d-8694-46ca-b9ff-be79a668e2a9.
Initialising device-mapper backend library.
dm version [ opencount flush ] [16384] (*1)
dm versions [ opencount flush ] [16384] (*1)
Detected dm-crypt version 1.14.1, dm-ioctl version 4.35.0.
Device-mapper backend running with UDEV support enabled.
dm status fbf8887d-8694-46ca-b9ff-be79a668e2a9 [ opencount flush ]
[16384] (*1)
Releasing device-mapper backend.
Trying to open and read device /dev/sdc1 with direct-io.
Allocating crypt device /dev/sdc1 context.
Trying to open and read device /dev/sdc1 with direct-io.
Initialising device-mapper backend library.
dm table fbf8887d-8694-46ca-b9ff-be79a668e2a9 [ opencount flush
securedata ] [16384] (*1)
Trying to open and read device /dev/sdc1 with direct-io.
Crypto backend (gcrypt 1.5.3) initialized in cryptsetup library
version 1.7.4.
Detected kernel Linux 3.10.0-693.el7.x86_64 x86_64.
Reading LUKS header of size 1024 from device /dev/sdc1
Key length 32, device size 1943016847 sectors, header size 2050
sectors.
Deactivating volume fbf8887d-8694-46ca-b9ff-be79a668e2a9.
dm status fbf8887d-8694-46ca-b9ff-be79a668e2a9 [ opencount flush ]
[16384] (*1)
Udev cookie 0xd4d14e4 (semid 32769) created
Udev cookie 0xd4d14e4 (semid 32769) incremented to 1
Udev cookie 0xd4d14e4 (semid 32769) incremented to 2
Udev cookie 0xd4d14e4 (semid 32769) assigned to REMOVE task(2) with
flags (0x0)
dm remove fbf8887d-8694-46ca-b9ff-be79a668e2a9 [ opencount flush
retryremove ] [16384] (*1)
fbf8887d-8694-46ca-b9ff-be79a668e2a9: Stacking NODE_DEL [verify_udev]
Udev cookie 0xd4d14e4 (semid 32769) decremented to 1
Udev cookie 0xd4d14e4 (semid 32769) waiting for zero
Signed-off-by: Sébastien Han <seb@redhat.com>
in addition to c4dcdaa20 this commit adds the missing condition on
install tasks for the debian_rhcs deployment. Without it, these tasks are
played on any kind of deployment.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
* DBus on host should include ganesha service file
* to allow ganesha container to respond on DBus it needs to run
in --privileged mode (ganesha folks contacted to look at this)
* ceph_nfs_include_exports_dir variable replaced with more general
ceph_nfs_dynamic_exports
This is something that does not belong in `ceph-common`; it
is too specific to the `ceph-iscsi-gw` role.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This is something that does not belong in `ceph-common`; it
is too specific to the `ceph-nfs` role.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Make the `ceph-mgr` role handle the installation of the `ceph-mgr`
package itself, because it's complicated to manage depending on whether
we are installing jewel vs. luminous.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
During the initial play, the docker command does not exist yet and so the
command has no stdout_lines. Using get() allows us to fix this by
falling back to an empty array if the command fails.
Signed-off-by: Sébastien Han <seb@redhat.com>
Use an intermediate variable to build the final `dedicated_devices` list
to avoid duplicate entries in that array. (We need a 1:1 relation between
`dedicated_devices` and `devices` since we are using a `with_together`
later.)
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
With jewel, `bootstrap_rbd_keyring` is not set because of this condition:
```
when:
- ceph_release_num.{{ ceph_release }} >= ceph_release_num.luminous
```
Therefore, the task `try to fetch ceph config and keys` will fail.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
* Change version from 2 to 3.
* use ceph_rhcs_cdn_debian_repo_version to use other repositories along
* with ceph_rhcs_cdn_debian_repo
Signed-off-by: Sébastien Han <seb@redhat.com>
`ceph_release` isn't available at this step of the playbook because it
is set later based on the installed binaries.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1486062
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The value of doing this is fairly low compared to the complexity it adds.
So we remove these tasks; if the rbd pool on Jewel doesn't have the right PG
value you can always increase it.
Signed-off-by: Sébastien Han <seb@redhat.com>
All keyrings are getting copied to all nodes.
This commit fixes a leftover from a previous code refactor.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1498583
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
iscsi-gw needs an 'rbd' pool to configure the iscsi target.
Note: I could have used the facts already set in `ceph-mon` but I deliberately
didn't, to avoid creating a dependency between these two roles.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This commit refactors the code regarding all `set_osd_pool_default_*`
related tasks by avoiding usage of useless `set_fact` to determine
whether a key is present in `ceph_conf_overrides`.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The rbd pool is the default pool that gets created during ceph cluster
initialization. If we act on the rbd related operations too early, the
rbd pool does not exist yet. Move the call to perform rbd operations
to a later stage after other pools have been created.
The rbd_pool.yml playbook has all the operations related to the rbd pool.
Replace the always_run (deprecated) directive with check_mode.
Most of the ceph related tasks only need to run once. The run_once directive
executes the task on the first host.
The ceph sub-command to delete a pool is delete (not rm).
The changes submitted here were tested with this ceph version.
ceph version 0.94.9-9.el7cp (b83334e01379f267fb2f9ce729d74a0a8fa1e92c)
This upload includes these changes:
- Use the fail module (instead of assert).
- From the luminous release, the rbd pool is no longer created by default.
Delete the code that creates the rbd pool for the luminous release.
- Conform the .yml files to the suggested syntax.
The commands are executed on the mcp nodes and I think the shell ansible module
is the right one to use. The command module is used to execute commands on
remote nodes. I can make the change to use the command module if that is
preferred.
This is to ensure `docker_exec_cmd` fact is set with the correct value
in case of daemons collocation
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Also we now play ceph-config to have everything being generated for new
daemons bootstrap during upgrade.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1497959
Signed-off-by: Sébastien Han <seb@redhat.com>
generate_crt|bool|default(false) won't apply the default value, this
generate_crt|default(false)|bool will
Signed-off-by: Sébastien Han <seb@redhat.com>
The task which sets `ceph_current_fsid` in `facts.yml` in case of containerized
deployment, will definitely fail because `docker_exec_cmd` is not set
yet.
This commit simply makes `facts.yml` played after `check_socket.yml` so
`docker_exec_cmd` is set properly.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This commit refactors the ceph-mds role.
The goal here is to create cephfs in `ceph-mon` for both containerized
and non-containerized cases so we don't need the admin keyring on mds
nodes anymore.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1488999
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Using systemd module allows us to do in one task what we did in three
tasks:
- enable unit file,
- issue a `daemon-reload`,
- start the service
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
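A minimal sketch of the consolidated task (unit name illustrative):
```
- name: enable and start the rbd mirror service
  systemd:
    name: "ceph-rbd-mirror@{{ ansible_hostname }}"
    state: started
    enabled: yes
    daemon_reload: yes
```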
Running the socket check on all the hosts will override the default
value of docker_exec_cmd, leaving it with the last value (currently
rbd-mirror); as a result the subsequent docker_exec_cmd usage targets the
wrong container.
Signed-off-by: Sébastien Han <seb@redhat.com>
There is a bug in the rbd mirror unit file, the upstream fix is here:
https://github.com/ceph/ceph/pull/17969.
This should be reverted once the patch is merged and backport is done.
Signed-off-by: Sébastien Han <seb@redhat.com>
This fixes the error :
```
The conditional check 'sestatus.stdout != 'Disabled'' failed.
```
that occurs when running on non-rhel based systems, since the
`sestatus` fact is registered only on rhel based distributions.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Specify the timeout flag to ceph-create-keys, which causes it to time out
if a monitor quorum isn't achieved. This overrides the default timeout
of 10 minutes, causing ceph-ansible to fail faster in the event of cluster
network issues.
Signed-off-by: Douglas Fuller <dfuller@redhat.com>
We have seen issues with leftover sockets. So now, if a socket is found
we also check whether it's accessed by a process. If so, we can run the
handler; if not, we remove it and continue the playbook.
Signed-off-by: Sébastien Han <seb@redhat.com>
Co-Authored-by: Guillaume Abrioux <gabrioux@redhat.com>
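A hedged sketch of the extra check (the `mon_socket` register name is illustrative): `fuser` tells us whether a process still holds the socket; if nothing does, the leftover file can be removed instead of triggering a restart.
```
- name: check if the ceph mon socket is in use
  shell: fuser --silent {{ mon_socket.stdout }}
  register: mon_socket_in_use
  failed_when: false
  changed_when: false
  when: mon_socket.rc == 0
```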
It's sad but we cannot rely on the prepare container anymore since the
logs are flushed after reboot, so inspecting the container does not return
anything.
Now, instead, we use an ephemeral container to look up the
journal/block.db/block.wal (depending on filestore or bluestore) and
build the activate command accordingly.
Signed-off-by: Sébastien Han <seb@redhat.com>
need to use `hostvars[host]['XXX']` to retrieve the monitor
interface and/or radosgw interface.
Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1493920
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
- move the file fetch/push to the existing task
- rename the include
- generate the ganesha template from ansible
- re-arrange role structure
- re-use tasks for non-container and container
- configure keys for non-container and container
- fix rgw container key collection;
Signed-off-by: Sébastien Han <seb@redhat.com>
the rbd key was not pushed on rbd nodes because its keyring path was not
added in `ceph_config_keys`.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
We generate the ceph.conf on all the nodes through the
ceph-docker-common so there is no need to push it to the Ansible file.
Also this is breaking the ceph.conf template generation since we only
generate sections based on the host the ansible task is running on.
For example, what's typically happening, we bootstrap the monitor, we
get a ceph.conf generated for a mon only, we go on an osd, we generate
the ceph.conf with osd section (done by ceph-docker-common) but this
gets overwritten by the copy_config task of the ceph-osd role.
Signed-off-by: Sébastien Han <seb@redhat.com>
- Change capitalization of config options to be
in line with what config.txt in the nfs-ganesha
tree says
Signed-off-by: Ali Maredia <amaredia@redhat.com>
RHCS install wasn't working at all prior to this commit as the name of
the include was pointing to a non-existing file.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1492056
Signed-off-by: Sébastien Han <seb@redhat.com>
When the ceph-nfs service is managed by pacemaker, it's useful not to
enable and start the ceph-nfs service through systemd, but to let
pacemaker start the service in a later step.
In analogy to ceph_nfs_rgw_user, we should be able to define a user
with which the nfs-ganesha Ceph FSAL connects to the cluster.
Introduce a ceph_nfs_ceph_user variable, setting its default to
"admin" (which preserves the prior behavior of always connecting as
client.admin).
Fixes: #1910
When Ansible is not run with verbose options it's difficult to see which
include and/or set_fact does what. So adding a name for each clarifies.
Signed-off-by: Sébastien Han <seb@redhat.com>
The variable "statleftover" was removed by commit
a60c74f61e
and never added back to the new playbook,
yet it is still being referenced.
Adding it back
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1492224
Signed-off-by: Sébastien Han <seb@redhat.com>
On a container env, machines don't have any ceph binaries so we need to
use a container to run the commands.
Signed-off-by: Sébastien Han <seb@redhat.com>
Use default delay since the mon (in particular) can take more time to
restart.
Solves error with:
STDERR:
Error response from daemon: No such container: ceph-mon-mon0
Signed-off-by: Sébastien Han <seb@redhat.com>
All keys are copied to all nodes.
This commit split that task in each roles so keys are copied to their
respective nodes.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1488999
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Less configuration for the user, the container inherit from the global
variables. No more container specific variables.
Signed-off-by: Sébastien Han <seb@redhat.com>
In a collocated scenario, where you might put an rgw, an mds and a mon on
the same node, you don't want the handler to blindly restart all the daemons
on the node. Indeed, some of them might not be configured yet.
Implementing a more precise socket detection, for each daemon type.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1488813
Signed-off-by: Sébastien Han <seb@redhat.com>
Prior to this patch, this activation sequence for autodetection was
always skipped because we were asking to activate a device without
partitions, which doesn't make sense.
We also fix the way we lookup for a device, since the data partition is
always numbered 1, we take the min element of the dict.
Closes: https://github.com/ceph/ceph-ansible/issues/1782
Signed-off-by: Sébastien Han <seb@redhat.com>
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Add a new boolean parameter for the client config, create_key_file_only,
with a default of false. When create_key_file_only is true, the
client tasks that connect to the external ceph cluster to verify
and `ceph auth import` the key are skipped.
Fixes: #1848
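An illustrative client group_vars entry for the behaviour described above:
```
# only write the key file locally, do not contact the external cluster
create_key_file_only: true
```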
We must mask the image so we are sure that even if the system reboots
the OSDs won't start.
Also remove Ceph udev rules if found on the system prior to deploying
containers. If we don't do this we are exposed to conflicts between udev
rules and systemd unit files.
Also, the CI will now test the migration from a non-containerized cluster to a
containerized cluster.
Signed-off-by: Sébastien Han <seb@redhat.com>
The way we handle the restart for both mds and rgw is not ideal; it will
try to restart the daemon on hosts that don't run the daemon,
resulting in a service file being created (see bug description).
Now we restart each daemon precisely and in a serialized fashion.
Note: the current implementation does NOT support multiple mds or rgw on
the same node.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1469781
Signed-off-by: Sébastien Han <seb@redhat.com>
This patch adds passing the RGW_CIVETWEB_IP to the docker
container. This IP defaults to the value of radosgw_civetweb_bind_ip,
which itself defaults to the host's default ipv4 address.
Without this value, the RGW container will bind to 0.0.0.0.
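An illustrative group_vars override (the address is an example only), pinning civetweb, and hence the container, to a specific address instead of 0.0.0.0:
```
radosgw_civetweb_bind_ip: 192.168.10.11
```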
Remove the old check prior systemd.
We only support systemd so there is no need to run a condition on
systemd. The playbook will fail if systemd is not present.
Signed-off-by: Sébastien Han <seb@redhat.com>
The installation process is now described as follow:
* you still have to choose a 'ceph_origin' installation method. The
origin can be a 'repository' (add a new repository), distro (it will use
the packages provided by the native repo source of your distribution),
local (only available on redhat systems; it installs locally built
packages). This option is not well tested, so use it carefully
* if ceph_origin == 'repository' you will have to decide what kind of
repository you want to enable:
- community: corresponds to the stable upstream/community version
- enterprise: corresponds to the stable enterprise/downstream version
(basically you are a red hat customer)
- dev: it will install ceph from packages built out of the github
development branches
Signed-off-by: Sébastien Han <seb@redhat.com>
Co-Authored-by: Guillaume Abrioux <gabrioux@redhat.com>
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The list cannot be evaluated properly if it contains '[]', which is
the case when using the filter "default([])". To fix this, we have to
properly merge the lists.
This fixes the issue: "list object has no element 1"
Signed-off-by: Sébastien Han <seb@redhat.com>
By detecting the ceph version running in the container we can easily
apply conditions like:
ceph_release_num.{{ ceph_release }} >= ceph_release_num.luminous
We do that already, in ceph-docker-common/tasks/fetch_configs.yml.
This fixes the error:
TASK [ceph-docker-common : register rbd bootstrap key]
******************************************************
fatal: [magna005]: FAILED! => {"failed": true, "msg": "The conditional
check 'ceph_release_num.{{ ceph_release }} >= ceph_release_num.luminous'
failed. The error was: error while evaluating conditional
(ceph_release_num.{{ ceph_release }} >= ceph_release_num.luminous):
'dict object' has no attribute 'dummy'\n\nThe error appears to have been
in
'/home/ubuntu/ceph-ansible/roles/ceph-docker-common/tasks/fetch_configs.yml':
line 2, column 3, but may\nbe elsewhere in the file depending on the
exact syntax problem.\n\nThe offending line appears to be:\n\n---\n-
name: register rbd bootstrap key\n ^ here\n"}
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1486062
Signed-off-by: Sébastien Han <seb@redhat.com>
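A hedged sketch of how the running version can be detected from the image itself (the image variables are the usual ceph-ansible ones; the exact task shape is illustrative):
```
- name: discover the ceph version shipped in the container image
  command: >
    docker run --rm --entrypoint /usr/bin/ceph
    {{ ceph_docker_registry }}/{{ ceph_docker_image }}:{{ ceph_docker_image_tag }}
    --version
  register: ceph_version_in_container
  changed_when: false
```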
Logging inside the container is not useful since it writes to the
overlayfs partition, resulting in potential performance degradation on
the container.
If you need to check the logs, just look at journald.
Signed-off-by: Sébastien Han <seb@redhat.com>
with_items is evaluated before the when condition so if the task that
registers the 'results' is skipped the task will fail with:
{"failed": true, "msg": "'dict object' has no attribute 'results'"}
Defaulting to an empty array fixes the issue.
Reverts: abdd66619e
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1482061
Signed-off-by: Sébastien Han <seb@redhat.com>
The script can fail to get the osd id because the osds are activated by
udev and it can take a while for them to activate. This commit fixes
that by trying to get all the osds per node in a loop.
This commit also makes the osd services enabled so that they are
available after reboot.
Signed-off-by: Boris Ranto <branto@redhat.com>
There should be no need to use sudo when writing or using these files.
It creates an issue when the user running ansible-playbook does not
have sudo privs.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Some deployments can't copy infrastructure playbooks outside of the
infrastructure-playbooks directory. Thus they use ANSIBLE_ROLES_PATH to
overcome this. However some roles have 'playbook_dir' hardcoded, which
results in wrong path since the execution comes from
infrastructure-playbooks. Basically the role triggered by a playbook
from infrastructure-playbooks believes that the roles are in
infrastructure-playbooks/roles. This commit fixes that.
Signed-off-by: Sébastien Han <seb@redhat.com>
This will give us more flexibility and the possibility to deploy a client node
for an external ceph-cluster.
related BZ:
https://bugzilla.redhat.com/show_bug.cgi?id=1469426
Fixes: #1670
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Before this commit we were forcing ipv4 which might not be available.
Now setting ip_version to ipv4 or ipv6 will give you the right support.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1484189
Signed-off-by: Sébastien Han <seb@redhat.com>
This commit eases the use of the
infrastructure-playbooks/switch-from-non-containerized-to-containerized-ceph-daemons.yml
playbook. We basically run it with a couple of pre-tasks and then we let
the playbook run the docker roles.
It obviously expects to have proper variables configured in order to
work.
Signed-off-by: Sébastien Han <seb@redhat.com>
The lvm_volumes variable is now a list of dictionaries that represent
each OSD you'd like to deploy using ceph-volume. Each dictionary must
have the following keys: data, journal and data_vg. Each dictionary can
also optionally provide a journal_vg key.
The 'data' key represents the lv name used for the OSD and the 'data_vg'
key is the vg name that the given lv resides on. The 'journal' key is
either an lv, device or partition. The 'journal_vg' key is optional and
must be the vg name for the journal lv if given. This key is mainly used
for purging of the journal lv if purge-cluster.yml is run.
For example:
lvm_volumes:
- data: data_lv1
journal: journal_lv1
data_vg: vg1
journal_vg: vg2
- data: data_lv2
journal: /dev/sdc
data_vg: vg1
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Resolves issue: Multiple RGW Ceph.conf Issue #1258
In a multi-RGW setup, the RGW sections in ceph.conf
contain identical bind IPs in the civetweb line. This
modification fixes that issue and puts the right IP
for each RGW.
Signed-off-by: SirishaGuduru <SGuduru@walmartlabs.com>
Modified ceph-defaults and ran generate_group_vars_sample.sh
group_vars/osds.yml.sample and group_vars/rhcs.yml.sample are
not part of the changes, but they got modified when
generate_group_vars_sample.sh was run to generate group_vars/
all.yml.sample.
Uncommented added variables in ceph-defaults
Updated tests by adding value for radosgw_interface
Added radosgw_interface to centos cluster tests
Modified ceph-rgw role,rebased and ran generate_group_vars_sample.sh
In ceph-rgw role removed check_mandatory_vars.yml.
Rebased on master.
Ran generate_group_vars_sample.sh and then the files below got
modified.
The rbd-mirror daemon will be HA under luminous and new daemon health
features require a way to uniquely identify rbd-mirror instances.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
To be properly evaluated, the "skipped" condition must always come first
in the list of conditions, otherwise the other conditions are
evaluated before it and make the task fail.
Closes: https://github.com/ceph/ceph-ansible/issues/1733
Signed-off-by: Sébastien Han <seb@redhat.com>
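A minimal sketch of the ordering (names illustrative): in a `when` list the conditions are evaluated in order, so the skipped test has to come first or the following conditions dereference attributes that do not exist on skipped items.
```
- name: restart the daemon when its socket check succeeded
  command: /usr/bin/true   # placeholder action
  with_items: "{{ socket_checks.results | default([]) }}"
  when:
    - item.skipped is not defined
    - item.rc == 0
```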
Problem: task "check for a ceph socket in containerized deployment" will
be skipped if we are not an OSD.
with_items is still evaluated before the when condition, so if the task was
skipped the dict will be empty and the play will then fail.
Adding a "not skipped" condition skips the execution of the task.
Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1482061
Signed-off-by: Sébastien Han <seb@redhat.com>
This commit forces ceph-common to be installed early in the deployment on
nodes.
For instance, ceph-rbdmirror doesn't have the CLI installed while it is
needed for some tasks which use it to set some facts.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
ceph services can fail to start under certain circumstances (for
example, when running in a container) because the default systemd
service configuration causes namespace issues.
To work around this we can override the system service settings by
placing an overrides file in the ceph-<service>@.service.d directory.
This can be generic so as to allow any potential changes required to
the ceph-<service> service files.
The overrides file is only setup when the
"ceph_<service>_systemd_overrides" config_template override variable is
specified.
The available service systemd override files are as follows:
ceph_mds_systemd_overrides
ceph_mgr_systemd_overrides
ceph_mon_systemd_overrides
ceph_osd_systemd_overrides
ceph_rbd_mirror_systemd_overrides
ceph_rgw_systemd_overrides
The original fix to issue #1755 only set the permissions on
the monitors to which the key was copied, but not the original
monitor where the key was created. Thus, we use a separate task
to set the permission of the key.
The openstack_keys structure now supports a key called mode
whose value is a string that one could pass to chmod to set
the mode of the key file. The ansible file module applies the
mode to all openstack keys with this property.
Fixes: #1755
This is under the MDS role instead of the mon role because that role
does not create the filesystem under docker.
Signed-off-by: Douglas Fuller <dfuller@redhat.com>
This scenario will create OSDs using ceph-volume and is only available
in ceph releases greater than Luminous.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
There is only two main scenarios now:
* collocated: everything remains on the same device:
- data, db, wal for bluestore
- data and journal for filestore
* non-collocated: dedicated device for some of the component
Signed-off-by: Sébastien Han <seb@redhat.com>
in containerized deployment, if you try to update your `ceph.conf` file
it won't be actually updated on your nodes because it is overwritten by
the copy of the file which is present in your fetch directory.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Move `fsid`,`monitor_name`,`docker_exec_cmd` and `ceph_release` set_fact
to `ceph-defaults` role.
It will allow to reuse these facts without having to play `ceph-common`
or `ceph-docker-common`.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This will give us more flexibility and avoid a lot of useless when
skipping all tasks from a non-desired role.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Add a new role `ceph-defaults`.
This role aims to handle all defaults vars for `ceph-docker-common` and
`ceph-common` and set basic facts (eg. `fsid`)
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Merge `ceph-docker-common` and `ceph-common` defaults vars in
`ceph-defaults` role.
Remove redundant variables declaration in `ceph-mon` and `ceph-osd` roles.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
We don't want to have heterogeous ceph.conf anymore and believe that we
should have the right section for the running daemon.
If we don't do this and use profiles, e.g: rgw, we will get a new rgw
section on some of the nodes.
Signed-off-by: Sébastien Han <seb@redhat.com>
It is mandatory now to set the Ceph version you want to install, e.g:
ceph_stable_release: luminous
To find the release names, you can look at the release notes doc:
http://docs.ceph.com/docs/master/release-notes/
Signed-off-by: Sébastien Han <seb@redhat.com>
ceph-disk is responsible for enabling the unit file if needed. Actually,
since https://github.com/ceph/ceph/pull/12241 it seems that it's not
even needed. In the event of a restart, udev rules will be triggered and
they will run ceph-disk activate on the device too, so the 'enabled' is not
needed.
Closes: https://github.com/ceph/ceph-ansible/issues/1142
Signed-off-by: Sébastien Han <seb@redhat.com>
OSDs get started by ceph-disk before the ceph.conf file is written
with a crush location. That results in a crush map without configured
crush location.
To prevent this, we have to restart the OSDs during the initial setup
after the crush location was added to the ceph.conf file.
We have multiple issues with ceph-disk's cli with bluestore and Ceph
releases. This is mainly due to cli changes with Luminous. Luminous
introduced --bluestore and --filestore options which do
not exist on releases older than Luminous. The default store being
bluestore on Luminous, simply checking for the store is not enough, so we
have to build a specific command line for ceph-disk depending on the
Ceph version we are running and the desired osd_store.
Signed-off-by: Sébastien Han <seb@redhat.com>
The keys and openstack_keys structure now supports an optional
key called acls whose value is a list of strings one could pass
to setfacl. The ansible ACL module applies the ACLs to all
openstack keys with this property.
Fixes: #1688
Remove `rgw enable static website` and `rgw enable usage log` from
ceph.conf and make it usable with ceph_config_overrides as profiles.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This commit introduces a new directory called "profiles" which
contains some set of variables for a particular use case. These profiles
provide guidance for certain scenarios such as:
* configuring rgw with keystone v3
Signed-off-by: Sébastien Han <seb@redhat.com>
According to Alfredo, this was used for gitbuilders. Right now shaman/chacra
dev repos are unsigned.
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Move condition at task level and not at include level to make `fsid`
variable available for all roles.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Some tasks fetch file to `{{ fetch_directory }}/docker_mon_files` and
then try to copy from `{{ fetch_directory }}/{{ fsid }}`. That causes
the playbook to fail.
Fixes: #1683
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
To keep consistency between `{{ openstack_keys }}` and `{{ keys }}`
respectively in `ceph-mon` and `ceph-client` roles.
This commit also add the possibility to set mds caps.
Fixes: #1680
Co-Authored-by: John Fulton <johfulto@redhat.com>
Co-Authored-by: Giulio Fidente <gfidente@redhat.com>
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The ceph.conf template needs to look for the value of monitor_interface
in hostvars[host] because there might be different values set per host.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
In Luminous, ceph-disk defaults to bluestore so all our scenarios are
using bluestore, we need to force testing both.
Signed-off-by: Sébastien Han <seb@redhat.com>
Co-Authored-by: Guillaume Abrioux <gabrioux@redhat.com>
In addition to ceph/ceph-docker@69d9aa6, this explains how to deploy a
containerized cluster with a custom admin secret.
Basically, just need to pass the `admin_secret` defined in your
`group_vars/all.yml` to the `ceph_mon_docker_extra_env` variable.
Eg:
`ceph_mon_docker_extra_env: -e CLUSTER={{ cluster }} -e FSID={{ fsid }}
-e MON_NAME={{ monitor_name }} -e ADMIN_SECRET={{ admin_secret }}`
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Fail with a sane message if the devices or raw_journal_devices variables
are strings instead of lists during manual device assignment.
Signed-off-by: Douglas Fuller <dfuller@redhat.com>
rsync is required by the ansible synchronize package. Ensure
it is installed when local installation is selected.
Signed-off-by: Douglas Fuller <dfuller@redhat.com>
Add a new parameter `admin_secret` that allows deploying a ceph cluster
with a custom admin secret.
Fix: #1630
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This commit refactors how we deploy bluestore. We have existing
scenarios that we don't want to change too much. This commit eases the
user experience by not changing the way you use scenarios. Bluestore is
just a different interface to store objects but the scenarios more or
less remain the same.
If you set osd_objectstore == 'bluestore' along with
journal_collocation: true, you will get an OSD running bluestore with DB
and WAL partitions on the same device.
If you set osd_objectstore == 'bluestore' along with
raw_multi_journal: true, you will get an OSD running bluestore with a
dedicated drive for the rocksdb DB, then the remaining
drives (used with 'devices') will have WAL and DATA collocated.
If you set osd_objectstore == 'bluestore' along with
raw_multi_journal: true and declare bluestore_wal_devices you will get
an OSD running bluestore with a dedicated drive for rocksdb db, a
dedicated drive partition for rocksdb WAL and a dedicated drive for
DATA.
Signed-off-by: Sébastien Han <seb@redhat.com>
There is no need for 2 variables to enable bluestore, prior to this
patch one had to do the following to activate bluestore:
osd_objectstore: bluestore
bluestore: true
Now you just need to set `osd_objectstore: bluestore`.
Fixes: https://github.com/ceph/ceph-ansible/issues/1475
Signed-off-by: Sébastien Han <seb@redhat.com>
remove `ceph_mon_docker_interface` and use `monitor_interface` instead
for both containerized and non-containerized deployment.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Some variables are missing from ceph-docker-common role since the
include of check_mandatory_vars.yml has been re-added in the ceph-mon
role.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The check regarding the networking scenario configuration has been
moved from ceph-common to ceph-mon in 1de8176 but the include was not re-added
in 189f4fe
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
e8187f6 does not fix ipv6 as expected since `ansible_default_*` is
filled with the IP address carried by the network interface used by the
default gateway route. In other words, it assumes that the MON_IP address will
be this IP address, which is not always the case.
We need to keep using the previous fact but add some intelligence to the
template to determine how to retrieve the ipv4|ipv6 address, since the path
to the fact in `hostvars` is not the same for ipv4 vs. ipv6.
Fix: 1569
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Add an extra variable to the openstack pools, which creates them with
defined rules. This will allow to place different pools on e.g.
different type of disks.
This commit will also set a new default rule when defined and move
the rbd pool to the new rule.
Somehow the shell module will return an error if the command line is not
next to it.
Plus fixed the import with the right path.
Signed-off-by: Sébastien Han <seb@redhat.com>
Followup on https://github.com/ceph/ceph-ansible/pull/1469 where we
merged most of the container code from roles/ceph-*/task/docker/*.yml
into roles/ceph-docker-common/tasks/
It seems that we forgot to remove the original files.
Signed-off-by: Sébastien Han <seb@redhat.com>
The new PG check no longer works on distributions
where /bin/sh isn't linked to /bin/bash.
Fix: #1619
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
OpenStack's Gnocchi service expects to have a pool called "metrics".
This change adds "metrics" to the list of `openstack_pools` and
creates a corresponding key. It is only run if the user sets
`openstack_config: false`.
The current handler only restarts one OSD on each OSD server. After
the first one the handler stops, no matter what results the checks had.
Co-Authored-By: Gaudenz Steinlin (@gaudenz)
Remove "osd mkfs type" and the other pre-Bluestore parameters from the
generated ceph.conf so that disk activation on OSDs will work. The
current default xfs config results in a failed deployment and
incorrect partition metadata.
For a newly created cluster the command: ceph --cluster {{ cluster }} osd
pool get rbd size does not respond properly.
We only want to check if the rbd pool exists, so we now use an ls |
grep approach.
Closes: https://github.com/ceph/ceph-ansible/issues/1547
Signed-off-by: Sébastien Han <seb@redhat.com>
Rewrite the check_pgs by using json parsing instead of complex regexp to
parse the `ceph -s` output.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
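A hedged sketch of the JSON-based check (task shape illustrative): parsing `ceph -s -f json` and rejecting any state other than active+clean is more robust than a regexp over the human readable output.
```
- name: wait until all PGs are active+clean
  command: "{{ docker_exec_cmd | default('') }} ceph --cluster {{ cluster }} -s -f json"
  register: ceph_pgs
  changed_when: false
  retries: 10
  delay: 10
  until: >
    (ceph_pgs.stdout | from_json).pgmap.pgs_by_state
    | rejectattr('state_name', 'equalto', 'active+clean')
    | list | length == 0
```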
Add a default value for `ceph_docker_on_openstack` to avoid a
conditional check error for the task `pause after docker install before starting` in
`roles/ceph-docker-common/tasks/pre_requisites/prerequisites.yml`
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The fact ['ansible_$interface']['ipv4'] is a dictionary whereas
['ansible_$interface']['ipv6'] is a list. If we use
ansible_default_ipv6|ipv4 it is always a dictionary, which allows us to
get the ipv6 and ipv4 address without adding more complexity to the
template.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
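A hedged sketch of the lookup in task form (the `_mon_address` fact name is illustrative); because both facts are dicts, the same 'address' key works for either value of `ip_version`:
```
- name: resolve the monitor address for the configured ip_version
  set_fact:
    _mon_address: "{{ hostvars[inventory_hostname]['ansible_default_' + ip_version]['address'] }}"
```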
For some reason we changed the check of pgs but it appears it could be
dangerous because the current check might be satisfied as long as 1 PG is
active+clean.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
since `-e CEPH_DAEMON=OSD_CEPH_DISK_ACTIVATE` is already hardcoded in
`ceph-osd-run.sh.j2` there is no need to add `-e
CEPH_DAEMON=OSD_CEPH_DISK_ACTIVATE` as a default value in defaults vars.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
`ceph-docker-common`:
At the moment there are a lot of duplicated tasks in each
`./roles/ceph-<role>/tasks/docker/main.yml` that could be refactored into
`./roles/ceph-docker-common/tasks/main.yml`.
`*_containerized_deployment` variables:
All `*_containerized_deployment` have been refactored to a single
variable `containerized_deployment`
duplicate `cephx` variables in `group_vars/*` have been removed.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
We only check for everything except 'distro' because that
is a valid way of deploying RHCS, with pre-prepared repos
present on the nodes.
Signed-off-by: Sébastien Han <seb@redhat.com>
Problem: we could end up in a situation where we would install a package
on a machine that does not have the right repo enabled. Because the
condition was set to OR we weren't pinning a particular host but just a
condition. Let's say someone sets 'ceph_origin == "distro"', this would
try to install OSD packages on Monitors.
Solution: use an AND condition to first pin to the group_name (which
identifies a set of hosts) AND then, after this, apply one of the
installation conditions.
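Schematically, the when clause goes from a flat OR to a list whose items are
ANDed together (package name and conditions here are illustrative):

    - name: install ceph osd package
      package:
        name: ceph-osd
        state: present
      when:
        - osd_group_name in group_names     # first pin to the set of hosts
        - ceph_origin == 'distro'           # then apply the installation condition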
Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1453119
Co-Authored-By: https://github.com/zhsj
Signed-off-by: Sébastien Han <seb@redhat.com>
Problem: fail to deploy a containerized Ceph cluster with ipv6
Solution: do not hardcode ipv4 when bootstrapping the container.
Now use ip_version: ipv6 to get a containerized cluster deployed with
ipv6.
Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1451786
Signed-off-by: Sébastien Han <seb@redhat.com>
In addition to `196fa7e` this commit checks if a container has already
been launched and deletes it before retrying the ceph osd prepare
process.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
The CI on Docker is reporting the following error:
STDERR:
Error EINVAL: bad entity name
This is due to the fact that this auth entity name does not exist on
Jewel so we should not create that key when running Jewel containers.
Fixes: https://github.com/ceph/ceph-ansible/issues/1514
Signed-off-by: Sébastien Han <seb@redhat.com>
Already documented in the Red Hat Ceph Storage 2 Installation Guide
for Red Hat Enterprise Linux, but not here
Signed-off-by: Florian Klink <flokli@flokli.de>
"rgw override bucket index max shards" and
"rgw bucket default quota max objects" were in the
client section of the ceph.conf and were not being
applied; this commit moves them to the global section.
Resolves: bz#1391500
Signed-off-by: Ali Maredia <amaredia@redhat.com>
We shouldn't need this anymore as the upgrade bug that
debian_ceph_packages was used to workaround should have
been fixed as of jewel.
See https://github.com/ceph/ceph-ansible/issues/1481 for more
detailed information.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Change civetweb_num_thread default to 100
Add capability to override number of pgs for
rgw pools.
Add ceph.conf vars to enable a default bucket
object quota, at the user's choosing, in the
ceph.conf.j2 template
Resolves: rhbz#1437173
Resolves: rhbz#1391500
Signed-off-by: Ali Maredia <amaredia@redhat.com>
Restore the check_socket that was removed by `5bec62b`.
This commit also improves the logging in `restart_*_daemon.sh` scripts
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Prior to this change, ansible was only checking for the existence of the
package; now, if upgrade_ceph_packages is true, this means we are
performing an upgrade.
Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1442016
Signed-off-by: Sébastien Han <seb@redhat.com>
Proof-of-concept clusters or actual production clusters will never want to use this. We also do not test it anywhere for this same reason.
Signed-off-by: Gregory Meno <gmeno@redhat.com>
This is to allow ceph-mgr daemons to remote control
osd and mds daemons with MCommand messages.
Fixes: http://tracker.ceph.com/issues/19713
Signed-off-by: John Spray <john.spray@redhat.com>
Without this, we don't test the mgr role so we need to add it.
Co-Authored-by: Guillaume Abrioux <gabrioux@redhat.com>
Signed-off-by: Sébastien Han <seb@redhat.com>
This is the same fix as bc846b7da6
applied to the other part of the code-base that builds ceph.conf (I'd
missed that 349b9ab3e7 had duplicated
this code).
Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk>
Ansible's assemble module by default will put all files in the src
directory together into dest. We only want to put {{ cluster }}.conf
and osd.conf together, not anything that might have found its way into
/etc/ceph/ceph.d (e.g. files left by the sysadmin taking backups
before an ansible run). So specify a regexp that matches only those
two files.
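A sketch of what such an assemble task could look like (ownership and mode
are illustrative):

    - name: assemble the cluster conf and osd fragments only
      assemble:
        src: /etc/ceph/ceph.d/
        dest: /etc/ceph/{{ cluster }}.conf
        regexp: '^(({{ cluster }})|(osd))\.conf$'
        owner: ceph
        group: ceph
        mode: "0644"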
Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk>
Ansible evaluates the 'with_items' before the 'when' so if the inventory
does not have the group declared it'll fail. To fix this, we set an
empty array to make the with_items happy and then evaluate with the
'when'.
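For example (the task itself is only illustrative; the point is the
default([]) on the loop):

    - name: do something for every mds host
      debug:
        msg: "{{ item }}"
      with_items: "{{ groups[mds_group_name] | default([]) }}"   # empty list keeps with_items happy
      when: mds_group_name in groups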
Signed-off-by: Sébastien Han <seb@redhat.com>
Prior to this change we were deploying a monitor using its fqdn name but
we were checking its state and performing actions on it using its
shortname.
Signed-off-by: Sébastien Han <seb@redhat.com>
The Ceph Manager daemon (ceph-mgr) runs alongside monitor daemons, to
provide additional monitoring and interfaces to external monitoring and
management systems.
Only works as of the Kraken release.
Co-Authored-By: Guillaume Abrioux <gabrioux@redhat.com>
Signed-off-by: Sébastien Han <seb@redhat.com>
ceph-create-keys unit file was removed here:
* 8bcb4646b6
* dc5fe8d415
As a consequence the systemctl preset command now fails to run since the
unit does not exist anymore. Due to the redirection to /dev/null we
don't know what's happening.
Ultimately the mon unit doesn't get enabled and the mon service won't
start after reboot.
Removing the old/non-existent unit makes the command succeed now.
ceph fix: https://github.com/ceph/ceph/pull/14226
Signed-off-by: WingkaiHo <sanguosfiang@163.com>
Co-Authored-By: Sébastien Han <seb@redhat.com>
Until now, only the first task was executed.
The idea here is to use `listen` statement to be able to notify multiple
handler and regroup all of them in `./handlers/main.yml` as notifying an
included handler task is not possible.
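A minimal sketch of the pattern (handler names and script paths are
illustrative):

    # handlers/main.yml
    - name: copy mon restart script
      template:
        src: restart_mon_daemon.sh.j2
        dest: /tmp/restart_mon_daemon.sh
        mode: "0750"
      listen: "restart ceph mons"

    - name: restart ceph mon daemon(s)
      command: /usr/bin/env bash /tmp/restart_mon_daemon.sh
      listen: "restart ceph mons"

    # any task can then simply use:  notify: restart ceph mons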
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
As reported in
https://github.com/ceph/ceph-ansible/issues/1403, when devices are held
by lvm and `osd_auto_discovery` is set to true, it's not enough to check
for a partition count = 0 since Ansible does not report them.
This patch also looks for 'holders', which in the case of lvm corresponds
to the name of the PV. Now we also look for holders = 0.
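An illustrative condition based on the ansible_devices facts (the surrounding
task is a sketch, not the actual one from the role):

    - name: prepare osd disk(s) discovered automatically
      command: ceph-disk prepare /dev/{{ item }}
      with_items: "{{ ansible_devices.keys() | list }}"
      when:
        - osd_auto_discovery
        - ansible_devices[item]['partitions'] | length == 0
        - ansible_devices[item]['holders'] | length == 0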
Fixes: #1403
Signed-off-by: Sébastien Han <seb@redhat.com>
Problem: too many different commands to do the same thing. The 'cut'
command in infrastructure-playbooks/purge-cluster.yml was also wrong.
This sed command from osixia in ceph-docker
https://github.com/ceph/ceph-docker/pull/580/ addresses all the
scenarios.
Signed-off-by: Sébastien Han <seb@redhat.com>
ntp is still installed even if ntp_service_enabled is set to false.
That could be a problem if the time synchronization is managed by
something other than ceph-ansible or if you want to use a different NTP
implementation as suggested in #1354.
Fixes: #1354
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Signed-off-by: Guits <gabrioux@redhat.com>
If a group of hosts is empty (for instance 'mdss', in case of a
deployment without any mds node), the playbook will fail when trying
to restart services with a `"'dict object' has no attribute u'XXX'"` error.
The idea here is to force the `with_items` statements in all included handler tasks
to get at least an empty array.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
sysctl tuning should be in the sysctl.d directory. This creates
a separation between the values set specifically for ceph and the
values set by the operator.
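For instance, with the sysctl module (the file name is illustrative, and an
os_tuning_params list of name/value pairs is assumed):

    - name: apply ceph specific kernel tuning
      sysctl:
        name: "{{ item.name }}"
        value: "{{ item.value }}"
        sysctl_file: /etc/sysctl.d/ceph-tuning.conf
        sysctl_set: yes
        state: present
      with_items: "{{ os_tuning_params }}"   # assumed list of name/value pairs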
Signed-off-by: Tyler Brekke <tbrekke@redhat.com>
With ' in osd_crush_location, systemd will show this error:
ceph-osd-prestart.sh[2931]: Invalid command: invalid chars ' in 'root=
Signed-off-by: Christian Zunker <christian.zunker@codecentric.de>
This fixes issue #1299. According to @ktdreyer's comment in the ticket,
he fixed the web server config so that older (non-SNI) python clients
can also use the uri module here.
After the jewel release the mon startup does not generate keys, but it's
still harmless to call ceph-create-keys with jewel because this task has
a 'creates' argument that will cause it not to run if the keys already
exist.
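Roughly, the task looks like this (keyring path and variable names are
illustrative):

    - name: run ceph-create-keys, harmless on jewel and above
      command: ceph-create-keys --cluster {{ cluster }} --id {{ monitor_name }}
      args:
        creates: /etc/ceph/{{ cluster }}.client.admin.keyring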
Removing this when condition also allows the downstream CI tests to
install kraken or luminous without resetting ceph_stable_release, which does not
pertain to rhcs.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
This is not only for monitors, but also mds, rgw and rbd mirror so
making the var name more generic:
ceph_docker_enable_centos_extra_repo
Signed-off-by: Sébastien Han <seb@redhat.com>
Sometimes the socket appears during the 5th attempt and sometimes not, so
increase the timeout a little bit.
Signed-off-by: Sébastien Han <seb@redhat.com>
Add the possibility to create openstack pools and keys even for containerized deployments
Fix: #1321
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This removes the implicit order requirement when using OSD fragments.
When you use OSD fragments and the ceph-osd role is not the last one,
the fragments get removed from ceph.conf by ceph-common.
It is not nice to have this code in two locations, but this is
necessary to prevent problems when ceph-osd is the last role, as
ceph-common gets executed before ceph-osd.
This could be avoided if ceph-common were explicitly called
at the end of the playbook.
Signed-off-by: Christian Zunker <christian.zunker@codecentric.de>
This option was missing for rgw, mds, rbd mirror and nfs, making these
daemons impossible to run on a kv deployment with containers.
Signed-off-by: Sébastien Han <seb@redhat.com>
Prior to this change, ceph-ansible would install the main NFS Ganesha
server daemon on Ubuntu, but it would skip the Ceph FSALs.
Running "apt-get install nfs-ganesha" will only install the main NFS Ganesha
server. It does *not* pull in the RGW FSAL
(/usr/lib/x86_64-linux-gnu/ganesha/libfsalrgw.so)
Running "apt-get install nfs-ganesha-fsal" will install the RGW FSAL as
well as the main NFS Ganesha server package.
Signed-off-by: Ken Dreyer <kdreyer@redhat.com>
This patch introduces calamari_debug option which will turn on debugging
for calamari before initializing and running it.
Signed-off-by: Boris Ranto <branto@redhat.com>
From Josh Durgin, "I'd recommend not setting vfs_cache_pressure in
ceph-ansible. The syncfs issue is still there, and has caused real
problems in the past, whereas there hasn't been good data showing lower
vfs_cache_pressure is very helpful - the only cases I'm aware of have
shown it makes little difference to performance."
https://bugzilla.redhat.com/show_bug.cgi?id=1395451
Install packages from official repos rather than pip when using RHEL.
This commit fixes https://bugzilla.redhat.com/show_bug.cgi?id=1420855
It also refactors all `roles/ceph-*/tasks/docker/pre_requisite.yml`
to avoid a lot of duplicated code.
Fix: #1303
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
This was needed for Hammer and older versions, not needed anymore since
we have a 'ceph' user to run ceph processes.
Signed-off-by: Sébastien Han <seb@redhat.com>
Check if ceph filesystem already exists before creating it.
If the ceph filesystem doesn't exist, execute the task only on one node.
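A sketch of the two tasks, assuming the usual cephfs, cephfs_metadata and
cephfs_data variables (names are only illustrative here):

    - name: check if the ceph filesystem already exists
      command: ceph --cluster {{ cluster }} fs ls
      register: check_existing_cephfs
      changed_when: false
      run_once: true

    - name: create the ceph filesystem
      command: >
        ceph --cluster {{ cluster }} fs new {{ cephfs }}
        {{ cephfs_metadata }} {{ cephfs_data }}
      run_once: true
      when: cephfs not in check_existing_cephfs.stdout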
Fix: #1314
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Since some distros do not allow /usr/share to be writable (e.g. Atomic),
we let the operator decide where to put that script.
Signed-off-by: Sébastien Han <seb@redhat.com>
Oh yeah! This patch adds more fine grained control on how we run the
activation osd container. We now use --device to give read, write and
mknod access to a specific device to be consumed by Ceph. We also use the
SYS_ADMIN cap to allow mount operations; ceph-disk needs to temporarily
mount the osd data directory during the activation sequence.
This patch also enables the support of dedicated journal devices when
deploying ceph-docker with ceph-ansible.
Depends on https://github.com/ceph/ceph-docker/pull/478
Signed-off-by: Sébastien Han <seb@redhat.com>
As of Infernalis, the Ceph daemons run as an unprivileged "ceph" UID,
and this is by design.
Commit f19b765 altered the default
civetweb port from 80 to 8080 with a comment in the commit log about
"until this gets solved"
Remove the comment about permissions on Infernalis, because this is
always going to be the case on the Ceph versions we support, and it
is just confusing.
If users want to expose civetweb to s3 clients using privileged TCP
ports, they can redirect traffic with iptables, or use a reverse proxy
application like HAproxy.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
This avoids a situation where during a rolling_update we try to talk to
a mon to get the fsid and if that mon is down the playbook hangs
indefinitely.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
This gives us more flexibility than installing the ceph-release package
as we can easily use different mirrors. Also, I noticed an issue when
upgrading from jewel -> kraken as the ceph-release package for those
releases both have the same version number and yum doesn't know to
update anything.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
To configure the kernel, the task was using the "command" module, which does
not respect the ">" operator. So the task just printed to stdout: "never >
/sys/kernel/mm/transparent_hugepage/enabled".
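One possible fix is to use the shell module so the redirection is honoured
(sketch only):

    - name: disable transparent hugepage
      shell: echo never > /sys/kernel/mm/transparent_hugepage/enabled
      changed_when: false
      failed_when: false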
fix: #1319
Signed-off-by: Sébastien Han <seb@redhat.com>
Some playbooks use [0-9]*, others use \d+$.
The latter is more correct since the cluster name may contain numbers.
Signed-off-by: Shengjing Zhu <zsj950618@gmail.com>
Some unit files were stored in /var/lib/ceph, some were in
/etc/systemd/system. Now they are all under /etc/systemd/system.
closes: #1296
Signed-off-by: Sébastien Han <seb@redhat.com>
If cephx is disabled it is not necessary to include `facts_mon_fsid.yml`
in `roles/ceph-common/tasks/facts.yml`.
Fix: #1300
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
We changed the way we declare the image.
Prior to this patch we had to use a "user/image:tag"
format, which is incompatible with non docker-hub registries where you
usually don't have a "user". On the docker hub a "user" is also
identified as a namespace, so for Ceph the user was "ceph".
Variables have been simplified with only:
* ceph_docker_image
* ceph_docker_image_tag
1. For docker hub images: ceph_docker_image: "ceph/daemon" will give
you the 'daemon' image of the 'ceph' user.
2. For non docker hub images: ceph_docker_image: "daemon" will simply
give you the "daemon" image.
Infrastructure playbooks have been modified as well.
The file group_vars/all.docker.yml.sample has also been removed.
It is hard to maintain since we have to generate it manually. If
you want to configure specific variables for a specific daemon simply
edit group_vars/$DAEMON.yml
Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1420207
Signed-off-by: Sébastien Han <seb@redhat.com>
We shouldn't directly test the value of
`ceph_conf_overrides.global.osd_pool_default_pg_num` because this can
cause the playbook to fail if the key `global` is not present in
`ceph_conf_overrides`. Therefore we have to use the facts that have been
defined earlier.
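A sketch of the idea, with an assumed fallback value of 128 and an
illustrative fact name:

    # set once, early in the role; later tasks use the fact instead of
    # dereferencing ceph_conf_overrides.global directly
    - name: set osd_pool_default_pg_num fact
      set_fact:
        osd_pool_default_pg_num: "{{ ceph_conf_overrides.get('global', {}).get('osd_pool_default_pg_num', 128) }}"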
Fix: #1242
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
On Ubuntu systems mkdir is in /bin whereas on Atomic it is in /usr/bin.
We use the shell built-in "command" to find its right location.
Signed-off-by: Sébastien Han <seb@redhat.com>
Since we now only support systemd as an init system we can finally
treat containers as processes managed by systemd, and this for all the
distros.
Signed-off-by: Sébastien Han <seb@redhat.com>
This commit allows us to restart Ceph daemons machine by machine instead
of restarting all the daemons in a single shot.
Rework the structure of the handler for clarity as well.
Signed-off-by: Sébastien Han <seb@redhat.com>
According to #1216, we need to simplify the code by removing
support for anything before Jewel.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Some users purge their environments and leave them in a non-optimal state.
e.g: packages are still installed but /etc/ceph and /var/lib/ceph don't
exist anymore. This will result in multiple failures across the play,
sometimes hard to detect. Populating these directories "just in case"
should help us solve these problems.
Closes: #1253
Signed-off-by: Sébastien Han <seb@redhat.com>
Sometimes, for testing, users tend to delete the whole /var/lib/ceph and
then run ansible again; OSDs will never come up if we do not create their
directories.
Signed-off-by: Sébastien Han <seb@redhat.com>
This patch makes sure we set the proper pool size on the rbd pool.
Usually during bootstrap the rbd pool size is not honoured so we need to
add this workaround.
Signed-off-by: Sébastien Han <seb@redhat.com>
This allows the user to set ip_version to either ipv4 or ipv6. This
resolves a bug where monitor_address is set to an ipv6 address, but the
template fails to render because it's hardcoded to look for an 'ipv4'
key in the ansible facts.
See: https://bugzilla.redhat.com/show_bug.cgi?id=1416010
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Resolves: bz#1416010
We could have a scenario where different openstack components would
use the same pool, but the logic would create the same pool
more than once.
Add a unique filter to account for this.
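For instance (the pool creation command is illustrative; the point is the
unique filter on the loop):

    - name: create openstack pool(s)
      command: >
        ceph --cluster {{ cluster }} osd pool create
        {{ item.name }} {{ item.pg_num }}
      with_items: "{{ openstack_pools | unique }}"
      changed_when: false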
Allow for more operator flexibility in the `rgw frontends` setting
while maintaining backwards compatibility with the old vars. This
allows an operator to, for example, use the civetweb settings for
implementing SSL ports.
For available civetweb configuration parameters, see:
https://github.com/civetweb/civetweb/blob/master/docs/UserManual.md
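For illustration, an operator might end up with something like the following
in group_vars; the variable names are hypothetical and depend on the role
version, only the civetweb options themselves come from the civetweb
documentation:

    # hypothetical group_vars override enabling an SSL port on civetweb
    radosgw_civetweb_port: 443s
    radosgw_civetweb_options: "ssl_certificate=/etc/ceph/private/keyandcert.pem num_threads=100"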
It is not enough to check that the mds exists; it actually always does
because we declare the variable. So we need to make sure that there is an
mds host.
Signed-off-by: Sébastien Han <seb@redhat.com>
Since we introduced config_overrides we removed a lot of options from
the default template. In some cases, like the mds pool, openstack pools, etc.,
we need to know the number of PGs required. The idea here is to skip the
task if ceph_conf_overrides.global.osd_pool_default_pg_num is not defined
in your `group_vars/all.yml`.
Closes: #1145
Signed-off-by: Sébastien Han <seb@redhat.com>
Co-Authored-By: Guillaume Abrioux <gabrioux@redhat.com>
This allows the role to be used with ansible-galaxy and fixes the
include in all the meta/main.yml files in the roles.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
The libcephfs1 package was removed from ceph-common in
cb1c06901e, however it was not synced
to group_vars/all.yml.sample using the `generate_group_vars_sample.sh`
script. This fixes up the comment formatting in the ceph-common
defaults and brings the group_vars sample back into sync.
Prior to this change, a playbook run with '--tags' or '--skip-tags'
would fail, because the ceph-common role would not include the
release.yml task, and this file defines critical things like
ceph_release.
Thanks Andrew Schoen <aschoen@redhat.com> for help with the fix.
The task 'put initial mon keyring in mon kv store' from
ceph-mon/tasks/ceph_keys.yml is failing when cephx is disabled. The root
cause is that the variable monitor_keyring is not populated by any task
from deploy_monitors.yml.
Fixes: #1211
Signed-off-by: Sébastien Han <seb@redhat.com>
Prior to this patch we had several ways to run containers: we could use
ansible's docker module on some distros, and on container distros we were
using systemd. We strongly believe treating containers as services with
systemd is the right approach, so this patch generalizes it to all the
distros. These days most of the distros are running systemd so it's a fair
assumption.
Signed-off-by: Sébastien Han <seb@redhat.com>
Once we have our first monitor up and running we need to add it to the
monitor store as a safety measure. Just in case the local file gets
deleted and you need to add a new monitor. Now you can retrieve this key
like this:
ceph config-key get initial_mon_keyring > initial_mon_keyring.txt
Signed-off-by: Sébastien Han <seb@redhat.com>
There is no need to become root on local_action. This will even trigger
an error on some systems as it will try to run a sudo command. If the
current user does not have passwordless sudo, Ansible will fail. Anyway
using the current user is perfectly fine and no privilege elevation is
needed.
Signed-off-by: Sébastien Han <seb@redhat.com>
The Keystone v2 APIs are deprecated and scheduled to be removed in the
Q release of OpenStack. This adds support for configuring RGW to
use the current Keystone v3 API.
The PKI keys are used to decrypt the Keystone revocation list when
PKI tokens are used. When UUID or Fernet token providers are used in
Keystone, PKI certs may not exist, so we now accommodate this scenario
by allowing the operator to disable the PKI tasks.
Jewel added support for user/pass authentication with Keystone,
allowing deployers to disable Keystone admin token as required
for production deployments.
This implements configuration for the new RGW Keystone user/pass
authentication feature added in Jewel.
See docs here: http://docs.ceph.com/docs/master/radosgw/keystone/