ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Tobias Florek	931027e6f7	harmonize docker names Created containers now are named more or less in the form of <ansible role>-<ansible_hostname>	2017-02-23 09:15:05 +01:00
Sébastien Han	3b633d5ddc	purge-docker: re-implement zap devices We now run the container and waits until it dies. Prior to this we were stopping it before completion so not all the devices where zapped. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-02-21 15:56:09 -05:00
Sébastien Han	a002508a91	purge-docker: also purge journal devices Signed-off-by: Sébastien Han <seb@redhat.com>	2017-02-21 15:54:36 -05:00
Andrew Schoen	5622c94e8b	rolling-update: do not use upstart to stop mons when using systemd Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-02-21 12:31:26 -06:00
Shengjing Zhu	32923fd217	fix grep match pattern for osd ids Some playbooks use [0-9]*, others use \d+$ The latter is more correct since cluster name may contain numbers. Signed-off-by: Shengjing Zhu <zsj950618@gmail.com>	2017-02-20 16:35:56 +08:00
Andrew Schoen	22f52a9dc6	purge-cluster: also purge dmcrypt dedicated journals See: https://bugzilla.redhat.com/show_bug.cgi?id=1414647 Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-02-15 10:27:17 -06:00
Andrew Schoen	3964929a56	rgw-standalone: also fetch keys from mons This is to allow for ceph-installer usage of this playbook and to ensure that you have the correct keys locally when bootstrapping. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-02-14 16:12:59 -06:00
Andrew Schoen	c5f561a4e9	purge-cluster: remove calamari-server package See: https://bugzilla.redhat.com/show_bug.cgi?id=1422134 Signed-off-by: Andrew Schoen <aschoen@redhat.com> Resolves rhbz#1422134	2017-02-14 09:24:02 -06:00
Sébastien Han	c2f1dca823	docker: use a better method to pull images We changed the way we declare image. Prior to this patch we must have a "user/image:tag" format, which is incompatible with non docker-hub registry where you usually don't have a "user". On the docker hub a "user" is also identified as a namespace, so for Ceph the user was "ceph". Variables have been simplified with only: * ceph_docker_image * ceph_docker_image_tag 1. For docker hub images: ceph_docker_name: "ceph/daemon" will give you the 'daemon' image of the 'ceph' user. 2. For non docker hub images: ceph_docker_name: "daemon" will simply give you the "daemon" image. Infrastructure playbooks have been modified as well. The file group_vars/all.docker.yml.sample has been removed as well. It is hard to maintain since we have to generate it manually. If you want to configure specific variables for a specific daemon simply edit group_vars/$DAEMON.yml Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1420207 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-02-09 17:57:18 +01:00
Andrew Schoen	5ddfc4f85c	Merge pull request #1284 from ceph/BZ-1418980 purge-cluster: do not use ceph-detect-init	2017-02-08 08:46:03 -06:00
Andrew Schoen	4ff5908758	Merge pull request #1289 from ceph/fix-1286 rolling-update: detect init system properly	2017-02-08 06:31:30 -06:00
Andrew Schoen	865b4500dc	purge-cluster: set a default value for fetch_directory if not defined Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-02-08 06:25:43 -06:00
Andrew Schoen	adf6aee643	purge-cluster: remove all include tasks Including variables from role defaults or files in a group_vars directory relative to the playbook is a bad practice. We don't want to do this because including these defaults at the task level overrides values that would be set in a group_vars directory relative to the inventory file, which is the correct usage if you wish to override those default values. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-02-08 06:25:43 -06:00
Andrew Schoen	0476b24af1	purge-cluster: do not use ceph-detect-init We can not always ensure that ceph-detect-init will be present on the system. See: https://bugzilla.redhat.com/show_bug.cgi?id=1418980 Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-02-08 06:24:44 -06:00
Sébastien Han	8f94bfb498	rolling-update: detect init system properly Simply use the ansible_service_mgr fact. Closes: #1286 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-02-08 08:52:05 +01:00
Sébastien Han	c34d0a9d28	purge-docker: force image deletion even if non-runnin containers are using this image as a reference. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-02-07 22:14:21 +01:00
Sébastien Han	72cd9199ac	purge: ability to purge client role Signed-off-by: Sébastien Han <seb@redhat.com>	2017-02-07 22:14:18 +01:00
Guillaume Abrioux	76ddcbc271	Remove support of releases prior to Jewel. According to #1216, we need to simply the code by removing the support of anything before Jewel. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-01-31 11:00:54 +01:00
Sébastien Han	d5dd658cfa	purge: do not stop ceph.target on each daemon Doing this cause some all the daemons to go down at the same time. In a scenario where we colocate a monitor and an osd, this osds will take some time to go down which will make the 'umount' task fail. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-01-30 14:31:56 +01:00
Sébastien Han	cb57a359ba	purge: do not fail on purge ceph files On systems running docker there is an issue with lxfs that results in the find command returning 1 but actually did the job. e.g: on a system with docker runnning find /var will give us the following error: find: '/var/lib/lxcfs/cgroup/devices/lxc/x1/system.slice/systemd-update-utmp.service/devices.deny': Permission denied find: '/var/lib/lxcfs/cgroup/devices/lxc/x1/system.slice/dev-random.mount/devices.allow': Permission denied ... ... However ceph files got deleted so we ignore the error. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-01-30 14:31:56 +01:00
Sébastien Han	e371bd591c	purge: fix ubuntu purge when not using systemd We now rely on the cli tool ceph-detect-init which will tell us the init system in used on the distribution. We do this instead of the previous lookup for systemd unit files to call the right task depending on the init system. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-01-30 14:31:56 +01:00
Sébastien Han	0e2e270ab2	purge: allow purge to run multiple times with_items is evaluated before the when so in a second run where the variable is empty if will fail with "'dict object' has no attribute 'stdout_lines'". To fix this we had a default array so with_items does not fail and the task is skipped with the when. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-01-30 14:31:56 +01:00
Sébastien Han	0d2e580768	Merge pull request #1250 from ceph/new-tests CI testing updates	2017-01-27 14:30:45 +01:00
Andrew Schoen	d3cb8dba4e	purge-cluster: fix failure when raw_multi_journal is not defined Because the purge-cluster.yml playbook does not have access to the roles default vars then we can be sure that raw_multi_journal is defined. For example, if this was purging a dmcrypt journal then raw_multi_journal might not be defined at all in group_vars/all.yml or group_vars/osds.yml. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-01-27 05:23:17 -06:00
Ivan Font	0298354137	Update to use consistent docker extra env vars This playbook was still referencing the old version of the ceph__docker_extra_env but only for Ceph MONs and Ceph NFS. This playbook was not kept up-to-date when updating the ceph__docker_extra_env variables to add the '-e' option to docker. That's because the addition of '-e' breaks this playbook as it requires a comma separated list of variables for the 'env:' docker module parameter. Therefore this change just makes the playbook consistently broken by referencing the same variable throughout.	2017-01-26 15:57:34 -08:00
Andrew Schoen	b2a6f095f1	purge-cluster: fix syntax when deleting dmcrypt devices Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-01-26 11:28:30 -06:00
Sébastien Han	73ca1a7a00	purge: remove dm-crypt devices When running encrypted OSDs, an encrypted device mapper is used (because created by the crypsetup tool). So before attempting to remove all the partitions on a device we must delete all the encrypted device mappers, then we can delete all the partitions. Signed-off-by: Sébastien Han <seb@redhat.com> Please enter the commit message for your changes. Lines starting	2017-01-25 22:32:46 +01:00
Sébastien Han	adeb3decf3	purge: remove zap_block_devs variable The name of this variable was a bit confusing since its activation will zap all the block devices no matter which osd scenario we are using. Removing this variable and applying a condition on the OSD scenario is now feasible and easier since we import group_vars variable files for OSDs. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-01-18 10:55:01 +01:00
Sébastien Han	b7fcbe5ca2	purge: cosmetic cleanup Just applying our writing syntax convention in the playbook. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-01-18 10:53:21 +01:00
Andrew Schoen	dd8389cdf7	purge-cluster: do not include ceph-osd and ceph-common defaults for osds When purging OSDs we do not need to include these defaults as nothing in the following tasks uses them. Also, it has the side effect of overwriting any variables defined in group_vars files that are relative to the inventory you are using with the default values. That behavior was causing the CI tests to fail. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-01-10 16:57:58 -06:00
Andrew Schoen	321cea8ba9	purge-cluster: get journal partitions after zapping osd disks In my testing zapping the osd disks deleted the journal partitions, making the 'zap ceph journal partitions' task fail because the partitions it found previously do not exist anymore. This moves the task that finds the journal partitions after 'zap osd disks' to catch any partitions ceph-disk might have missed. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-01-03 15:57:17 -06:00
Andrew Schoen	c9e5914377	purge-cluster: use ignore_errors: true when including group_vars files Using failed_when will still throw an exception and stop the playbook if the file you're trying to include doesn't exist. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2017-01-03 15:57:17 -06:00
Sébastien Han	cb1c06901e	Merge pull request #1171 from cbodley/wip-libcephfs2 bump package version to libcephfs2	2017-01-03 10:48:56 +01:00
Shengjing Zhu	2dc2e1d48c	infrastructure playbook: add make osd partition Signed-off-by: Shengjing Zhu <zsj950618@gmail.com>	2016-12-15 22:03:38 +08:00
Casey Bodley	acaf01ac17	purge-cluster: add new version of libcephfs2 the libcephfs version was bumped to 2, so we need to check for that as well when we're removing all ceph packages Signed-off-by: Casey Bodley <cbodley@redhat.com>	2016-12-09 16:54:06 -05:00
Sébastien Han	9dac195200	take-over: use more precise ceph.conf detection Prior to this patch we were just looking for any *.conf file which sometimes could results in multiple matches. The new command looks for a .conf file that must contain [global] and 'fsid' patterns. This will definitely get us the ceph.conf file. We can not directly use ceph.conf because of a different cluster name. Signed-off-by: Sébastien Han <seb@redhat.com>	2016-12-06 16:02:48 +01:00
Sébastien Han	4444d7d78e	git: update gitignore * ignore yml files in general * refactor based on commit f8e043b6ea5ac4e886532d4f2f675c507b44b955 that changed directory layouts Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit ec5c6f5da566611c4e0b88f925cbd26dc90368d6)	2016-12-06 10:18:19 +01:00
chenyanshan	7eab2529ed	this patch fix the regex pattern in infrastructure-playbooks/shrink-osd.yml when the osd's pid num is bigger than 9999 Signed-off-by: chenyanshan <yanshanchen@139.com>	2016-12-05 13:40:38 +08:00
Guillaume Abrioux	b2b7222b3a	[shrink-mon]: force playbook to fail if there is only one mon The playbook will fail if only 1 mon is in the cluster and advise to use the `purge-cluster` playbook instead. Fix #1083	2016-11-25 11:20:11 +01:00
Guillaume Abrioux	a680707f6f	All `include_vars` need to have `.yml`, `.yaml` or `*.json` extension. As introduced in the following PR: - https://github.com/ansible/ansible/pull/17207 we need to refactor our code.	2016-11-24 14:03:49 +01:00
Sébastien Han	829e2b6598	Merge pull request #1077 from font/rolling_update Support containerized rolling update	2016-11-22 16:56:46 +01:00
Sébastien Han	38e846e542	rolling_update: clarify "serial" usage Prior to this commit the serial variable was poorly documented. Now we are making clear that this value should be left untouched as the rolling update mechanism should happen serially. Solves: bz-1396742 Signed-off-by: Sébastien Han <seb@redhat.com>	2016-11-21 14:42:46 +01:00
Ken Dreyer	adfdf6871e	remove apache support for RGW libfcgi is dead upstream (http://tracker.ceph.com/issues/16784) The RGW developers intend to remove libfcgi support entirely before the Luminous release. Since libfcgi gets little-to-no developer attention or testing, remove it entirely from ceph-ansible.	2016-11-18 13:13:12 -07:00
Ivan Font	255e816e28	Rolling update changes for containerized deployments Separate out systemd restart tasks for containerized and non-containerized deployments Signed-off-by: Ivan Font <ifont@redhat.com>	2016-11-17 11:25:25 -08:00
Ivan Font	e72f08080d	Warn user when upgrading cluster with only one mon Signed-off-by: Ivan Font <ifont@redhat.com>	2016-11-17 11:25:25 -08:00
Ivan Font	3ff17f1c8f	Support containerized rolling update - Update rolling update playbook to support containerized deployments for mons, osds, mdss, and rgws - Skip checking if existing cluster is running when performing a rolling update - Fixed bug where we were failing to start the mds container because it was missing the admin keyring. The admin keyring was missing because it was not being pushed from the mon host to the ansible host due to the keyring not being available before running the copy_configs.yml task include file. Now we forcefully wait for the admin keyring to be generated before continuing with the copy_configs.yml task include file - Skip pre_requisite.yml when running on atomic host. This technically no longer requires specifying to skip tasks containing the with_pkg tag - Add missing variables to all.docker.sample - Misc. cleanup Signed-off-by: Ivan Font <ifont@redhat.com>	2016-11-17 11:25:25 -08:00
Alfredo Deza	60ce2311b8	rolling_update: bump retries for osd_check/retries to 20 minutes Signed-off-by: Alfredo Deza <adeza@redhat.com> Resolves: rhbz#1395073	2016-11-17 10:43:58 -05:00
Sébastien Han	81a72cb85d	Merge pull request #1068 from ceph/v2.2 moving to ansible v2.2 compatibility	2016-11-16 16:33:40 +01:00
Andrew Schoen	5f44b118b8	rolling update: stop RGWs before upgrade and start afterwards Signed-off-by: Andrew Schoen <aschoen@redhat.com> Resolves: rhbz#1394929	2016-11-14 14:47:12 -06:00
Andrew Schoen	ded9d9dfd3	rolling update: stop MDSs before upgrading and start afterwards Signed-off-by: Andrew Schoen <aschoen@redhat.com> Resolves: rhbz#1394929	2016-11-14 14:47:12 -06:00
Andrew Schoen	5429c5f8c5	rolling update: stop MONs before upgrading and start afterwards Signed-off-by: Andrew Schoen <aschoen@redhat.com> Resolves: rhbz#1394929	2016-11-14 14:47:12 -06:00
Andrew Schoen	66f09bdac4	rolling update: stop OSDs before upgrading This avoids a bug where OSDs are sometimes restarted twice on upgrades which leaves the OSD process running but not marked up. See: https://bugzilla.redhat.com/show_bug.cgi?id=1394928 https://bugzilla.redhat.com/show_bug.cgi?id=1391675 https://bugzilla.redhat.com/show_bug.cgi?id=1394929 Signed-off-by: Andrew Schoen <aschoen@redhat.com> Resolves: rhbz#1394929	2016-11-14 14:46:58 -06:00
Sébastien Han	991341f525	rolling_update: add variable to upgrade ceph My stupid self removed this crucial variable here: `217ce3ca` thinking it was another hard coded variable import where this is actually the trigger for the upgrade. Closes: #1071 Signed-off-by: Sébastien Han <seb@redhat.com>	2016-11-04 17:31:02 +01:00
Sébastien Han	a2fcd222d2	moving to ansible v2.2 compatibility Signed-off-by: Sébastien Han <seb@redhat.com> Co-Authored-By: Julien Francoz julien@francoz.net	2016-11-04 10:09:38 +01:00
Andrew Schoen	8262ce5e40	rolling update: fix restarts of radosgw Signed-off-by: Andrew Schoen <aschoen@redhat.com> Resolves: rhbz#1391675 Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2016-11-03 14:36:42 -05:00
Eduard Egorov	ab5c9f2a67	Adjust 'devices' list check for being not defined in purge-cluster playbook (see PR #1024 ) Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>	2016-11-03 06:36:42 +00:00
Leseb	899c8b309f	Merge pull request #1024 from eduardegorov/egorove_make_devices_optional Make {{ devices }} list optional	2016-11-02 15:12:02 +01:00
Eduard Egorov	e5473ee565	Fix typos Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>	2016-11-01 12:29:21 +00:00
Eduard Egorov	3652bb708b	Fix rbd-mirrors group name Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>	2016-11-01 12:21:47 +00:00
Eduard Egorov	645b5efebf	Fix hard-coded host group names in include tasks for group variables' file paths. Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>	2016-11-01 12:21:40 +00:00
Eduard Egorov	f33c1cd2d2	Make {{ devices }} list optional: define it as empty list by default, remove unneccessary 'default([])' checks Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>	2016-11-01 09:57:25 +00:00
Andrew Schoen	0897c965ff	rolling_update: define mon_group_name when upgrading the mons see: https://bugzilla.redhat.com/show_bug.cgi?id=1389456 Signed-off-by: Andrew Schoen <aschoen@redhat.com> Resolves: rhbz#1389456	2016-10-27 14:17:56 -05:00
Sébastien Han	b0989c700f	rolling_update: fix wrong indent Fixing: https://bugzilla.redhat.com/show_bug.cgi?id=1388295 Also add some notes in the README on how to run infrastructure playbooks. Signed-off-by: Sébastien Han <seb@redhat.com>	2016-10-26 12:51:08 -05:00
Ivan Font	534b188396	Update for infrastructure-playbooks execution - Updates to allow running infrastructure-playbooks both from within its directory or root directory of ceph-ansible. Signed-off-by: Ivan Font <ifont@redhat.com>	2016-10-26 09:43:37 -07:00
Andrew Schoen	bebf412c92	infrastructure-playbooks: fix syntax errors in all playbooks Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2016-10-25 16:56:58 -05:00
Sébastien Han	f49bf2832c	rolling_update: improve variables import we now have pointer to default role so we don't miss any of the variables defined. Signed-off-by: Sébastien Han <seb@redhat.com>	2016-10-06 14:08:04 +02:00
Ivan Font	5a5e185e11	Reworked purge cluster playbook - Separated out one large playbook into multiple playbooks to run host-type by host-type i.e. mdss, rgws, rbdmirrors, nfss, osds, mons. - Combined common tasks into one shared task for all hosts where applicable - Fixed various bugs Signed-off-by: Ivan Font <ivan.font@redhat.com>	2016-10-05 21:32:38 -07:00
Leseb	598d78cef3	Merge pull request #961 from ceph/fix-purge purge: only purge ceph partitions	2016-10-04 18:03:21 +02:00
Sébastien Han	e81ec9c138	purge: only purge ceph partitions Prior to this change we were purging all the partitions on the device when using the raw_journal_devices scenario. This was breaking deployments where other partitions are used for other purposes (ie: OS system). Signed-off-by: Sébastien Han <seb@redhat.com>	2016-10-04 17:58:53 +02:00
Leseb	4bf7e8355a	Merge pull request #953 from jsaintrocc/hammerfix Fixes for Hammer install and added numerical release checks	2016-10-04 11:34:26 +02:00
Sébastien Han	ac2cb9ac2c	upgrade: add custom timeout options This commit introduces the ability to configure delays and retries for cluster health checks, for both monitors and OSDs. Signed-off-by: Sébastien Han <seb@redhat.com>	2016-10-03 11:27:02 +02:00
James Saint-Rossy	982c44d41c	Rebased with upstream master	2016-09-25 23:22:16 -04:00
Sébastien Han	b8158a6554	ability to switch from bare metal to containerized daemons Signed-off-by: Sébastien Han <seb@redhat.com>	2016-09-21 18:07:50 +02:00
Leseb	517196ed66	Merge pull request #977 from ceph/switch-bare-metal-to-container ability to switch from bare metal to containerized daemons	2016-09-21 15:04:06 +02:00
Sébastien Han	5bfa1b0d24	ability to switch from bare metal to containerized daemons Signed-off-by: Sébastien Han <seb@redhat.com>	2016-09-21 14:46:57 +02:00
Sébastien Han	21356c653f	rolling updates: remove mon compact command Users have reported this task to hang. Since this command is not required to perform the upgrade, we remove it. Signed-off-by: Sébastien Han <seb@redhat.com>	2016-09-13 10:09:07 +02:00
James Saint-Rossy	666637f715	Replaced is_before is_after is_ booleans with numerical version dictionary	2016-09-09 17:34:26 -04:00
Rachana Patel	ad5805f03e	rolling_update.yml will not work if cluster name is not 'ceph'. Adding --cluster will solve this problem Fixes issue #969 Signed-off-by: Rachana Patel <rachana83.patel@gmail.com>	2016-09-07 15:38:58 -04:00
James Saint-Rossy	f52be23770	Prevent local_action from requiring root	2016-09-02 19:31:59 -04:00
Ivan Font	05c5d1ea91	Update relative path to include vars Signed-off-by: Ivan Font <ivan.font@redhat.com>	2016-08-24 00:27:54 -07:00
Ivan Font	7c9cb0993e	Include group_vars files in purge cluster playbook - Add all relevant group_vars files in containerized purge cluster playbook and ignore errors if file may not exist. - Also fixing indentation issues. Signed-off-by: Ivan Font <ivan.font@redhat.com>	2016-08-19 09:11:56 -07:00
Ivan Font	c1905bfa23	Update for containerized purge cluster playbook - Added support for purging containerized rbd-mirror node Signed-off-by: Ivan Font <ivan.font@redhat.com>	2016-08-19 09:11:56 -07:00
James Saint-Rossy	449d456086	Rebased and moved multisite/rgw playbooks to infrastructure-playbooks	2016-08-17 13:28:01 -04:00
Sébastien Han	fde819d1a8	create a directory for infrastructure playbooks Since we have a couple of infrastructure related playbooks (additionnally to the roles we are using to deploy Ceph), it makes sense to have them located in a separate directory. Signed-off-by: Sébastien Han <seb@redhat.com>	2016-08-17 11:53:34 +02:00

1 2 3

134 Commits (e8187f6a0fba6b158dfec37a198a64f712ee95c6)