ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Mike Christie	db576f6f0e	igw: fix firewall iscsi_group_name check The firewall for igw is not being set up because iscsi_group_name does not exist. It should be iscsi_gw_group_name. Signed-off-by: Mike Christie <mchristi@redhat.com> (cherry picked from commit `a4ff52842c`)	2018-11-12 10:46:41 +00:00
Mike Christie	c843ea1d92	igw: Fix default api port The default igw api port is 5000 in the manual setup docs and ceph-iscsi-config package so this syncs up ansible. Signed-off-by: Mike Christie <mchristi@redhat.com> (cherry picked from commit `a10853c5f8`)	2018-11-12 10:46:41 +00:00
VasishtaShastry	f17140c03d	ceph-validate : Added functions to accept true and false ceph-validate used to throw error for setting flags as 'true' or 'false' for True and False. Now user can set the flags 'dmcrypt' and 'osd_auto_discovery' as 'true' or 'false'. Will fix - Bug 1638325 Signed-off-by: VasishtaShastry <vipin.indiasmg@gmail.com> (cherry picked from commit `098f42f233`)	2018-11-09 16:47:57 +00:00
Rishabh Dave	a74f4204cd	remove configuration files for ceph packages on ubuntu clusters For apt-get, purge command needs to be used, instead of remove command, to remove related configuration files. Otherwise, packages might be shown as installed while running dpkg command even after removing them. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1640061 Signed-off-by: Rishabh Dave <ridave@redhat.com> (cherry picked from commit `640cad3fd8`)	2018-11-09 16:50:25 +01:00
Mike Christie	77de54025b	igw: stop tcmu-runner on iscsi purge When the iscsi purge playbook is run we stop the gw and api daemons but not tcmu-runner which I forgot on the previous PR. Fixes Red Hat BZ: https://bugzilla.redhat.com/show_bug.cgi?id=1621255 Signed-off-by: Mike Christie <mchristi@redhat.com> (cherry picked from commit `b523a44a1a`)	2018-11-09 16:50:04 +01:00
Guillaume Abrioux	93cdbddd78	tests: test ooo_collocation against v3.0.3 ceph-container image Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com> (cherry picked from commit `811f043947`)	2018-11-09 16:48:35 +01:00
Sébastien Han	12ce311da5	rbd-mirror: enable ceph-rbd-mirror.target Without this the daemon will never start after reboot. Signed-off-by: Sébastien Han <seb@redhat.com> (cherry picked from commit `b7a791e902`)	2018-11-09 16:48:35 +01:00
Andrew Schoen	ee883aa9f2	validate: do not validate ceph_repository if deploying containers Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1630975 Signed-off-by: Andrew Schoen <aschoen@redhat.com> (cherry picked from commit `9cd8ecf0cc`)	2018-11-09 15:14:40 +00:00
Guillaume Abrioux	d5409109fb	rgw: move multisite default variables in ceph-defaults Move all rgw multisite variables in ceph-defaults so ceph-validate can go through them. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-30 17:41:35 +01:00
Guillaume Abrioux	f52344300a	tests: add more memory for rgw_multisite scenarios Adding more memory to VMs for rgw_multisite scenarios could avoid this error I have recently hit in the CI: (It is worth it to set 1024Mb since there are only 2 nodes in those scenarios.) ``` fatal: [osd0]: FAILED! => { "changed": false, "cmd": [ "docker", "run", "--rm", "--entrypoint", "/usr/bin/ceph", "docker.io/ceph/daemon:latest-luminous", "--version" ], "delta": "0:00:04.799084", "end": "2018-10-29 17:10:39.136602", "rc": 1, "start": "2018-10-29 17:10:34.337518" } STDERR: Traceback (most recent call last): File "/usr/bin/ceph", line 125, in <module> import rados ImportError: libceph-common.so.0: cannot map zero-fill pages: Cannot allocate memory ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-30 14:00:28 +01:00
Guillaume Abrioux	547e90f281	rgw: move multisite related tasks after docker/main.yml We must play this task after the container has started otherwise rgw_multisite tasks will fail. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-30 14:00:28 +01:00
Guillaume Abrioux	710e11668d	rgw: add rgw_multisite for containerized deployments run commands on containers when containerized deployments. (At the moment, all commands are run on the host only) Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-30 14:00:28 +01:00
Guillaume Abrioux	37970a5b3c	tests: add rgw_multisite functional test Add a playbook that will upload a file on the master then try to get info from the secondary node, this way we can check if the replication is ok. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-30 14:00:28 +01:00
Guillaume Abrioux	4d464c1003	rgw: add testing scenario for rgw multisite This will set up 2 clusters with rgw multisite enabled. First cluster will act as the 'master', the 2nd will be the secondary one. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-30 14:00:28 +01:00
Guillaume Abrioux	fe88c89c9c	validate: remove check on rgw_multisite_endpoint_addr definition since `rgw_multisite_endpoint_addr` has a default value to `{{ ansible_fqdn }}`, it shouldn't be mandatory to set this variable. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-30 14:00:28 +01:00
Ali Maredia	59e6d04f9b	rgw: add ceph-validate tasks for multisite, other fixes - updated README-MULTISITE - re-added destroy.yml - added tasks in ceph-validate to make sure the rgw multisite vars are set Signed-off-by: Ali Maredia <amaredia@redhat.com>	2018-10-30 14:00:28 +01:00
Guillaume Abrioux	77d5d128c3	rgw: add a dedicated variable for multisite endpoint We should give users the possibility to set the IP they want as multisite endpoint, setting the default value to `{{ ansible_fqdn }}` to not force them to set this variable. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-30 14:00:28 +01:00
Ali Maredia	474f151450	rgw: update rgw multisite tasks - remove destroy tasks - cleanup conditionals and syntax - remove unnecessary realm pulls - enable multisite to be tested in automated testing infra - add multisite related vars to main.yml and group_vars - update README-MULTISITE - ensure all `radosgw-admin` commands are being run on a mon Signed-off-by: Ali Maredia <amaredia@redhat.com>	2018-10-30 14:00:28 +01:00
Sébastien Han	9e87a5ae5e	travis: add ansible-galaxy integration This instructs Travis to notify Galaxy when a build completes. Since 3.0 the ansible-galaxy has the ability to build and push roles from repos with multiple roles. Closes: https://github.com/ceph/ceph-ansible/issues/3165 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-30 13:45:30 +01:00
Sébastien Han	49d4b65751	gitignore: add mergify and travis as exceptions Git must notice changes from .travis.yml and .mergify.yml Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-30 13:45:30 +01:00
Sébastien Han	b8a203bacf	contrib: rm script push-roles-to-ansible-galaxy.sh The script is not used anymore and soon Travis CI will do this job of pushing the role into the galaxy. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-30 13:45:30 +01:00
Sébastien Han	0e659caf77	cleanup repos's root Remove old files and move scripts to the contrib directory. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-30 10:37:48 +00:00
Maciej Naruszewicz	252d0f9cf2	ceph-volume: fix TypeError exception when setting osds-per-device > 1 osds-per-device needs to be passed to run_command as a string. Otherwise, expandvars method will try to iterate over an integer. Signed-off-by: Maciej Naruszewicz <maciej.naruszewicz@intel.com>	2018-10-29 21:56:37 +01:00
Sébastien Han	22aed97266	testinfra: change test osds for containers We do not use @<device> anymore so we don't need to perform the readlink check anymore. Also we are making an exception for ooo which is still using ceph-disk. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-29 18:31:17 +01:00
Sébastien Han	1df0a7acce	ceph_volume: add container support for batch https://tracker.ceph.com/issues/36363 has been resolved and the patch has been backported to luminous and mimic so let's enable the container support. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1541415 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-29 18:31:17 +01:00
Sébastien Han	1cdec4069a	test_osd: dynamically get the osd container Do not enforce the container name since this will fail when we have multiple VMs running OSDs. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-29 15:33:12 +01:00
Sébastien Han	876f6ced74	test: convert all the tests to use lvm ceph-disk is now deprecated in ceph-ansible so let's convert all the ci tests to use lvm instead of ceph-disk. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-29 15:33:12 +01:00
Sébastien Han	89e76e5baf	tox: change container image to use master We have a latest-master image which contains builds from upstream ceph so let's use it to verify build. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-29 15:33:12 +01:00
Sébastien Han	2fd7da12bb	test: remove ceph-disk CI tests Since we are removing the ceph-disk test from the ci in master then there is no need to have the functional tests in master anymore. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-29 15:33:12 +01:00
Guillaume Abrioux	748342f5b6	roles: fix _docker_memory_limit default value append 'm' suffix to specify the unit size used in all `_docker_memory_limit`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-29 14:59:09 +01:00
Neha Ojha	b7e4d4eb84	roles: do not limit docker_memory_limit for various daemons Since we do not have enough data to put valid upper bounds for the memory usage of these daemons, do not put artificial limits by default. This will help us avoid failures like OOM kills due to low default values. Whenever required, these limits can be manually enforced by the user. More details in https://bugzilla.redhat.com/show_bug.cgi?id=1638148 Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1638148 Signed-off-by: Neha Ojha <nojha@redhat.com>	2018-10-29 14:59:09 +01:00
Sébastien Han	c5e4e62ab5	Merge branch 'jcsp-wip-rm-calamari'	2018-10-29 14:53:47 +01:00
Sébastien Han	0e63f0f3c9	Merge branch 'master' into wip-rm-calamari	2018-10-29 14:50:37 +01:00
Ali Maredia	219fa8f919	infrastructure playbooks: ensure nvme_device is defined in lv-create.yml Signed-off-by: Ali Maredia <amaredia@redhat.com>	2018-10-29 08:41:42 +00:00
Sébastien Han	5ab90b358c	nfs: do not create the nfs user if already present Check if the user exists and skip its creation if true. Closes: https://github.com/ceph/ceph-ansible/issues/3254 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-26 16:24:38 +00:00
Jairo Llopis	fc20973c2b	Fix problem with ceph_key in python3 Pretty basic problem of iteritems removal. Signed-off-by: Jairo Llopis <yajo.sk8@gmail.com>	2018-10-26 15:29:37 +02:00
Sébastien Han	91385e4ff6	ceph_volume: better error handling When loading the json, if invalid, we should fail with a meaningful error. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-26 11:19:24 +02:00
Sébastien Han	c58100002b	ceph_volume: expose ceph-volume logs on the host This will tremendously help debugging failures while performing any ceph-volume command in containers. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-26 11:19:24 +02:00
Guillaume Abrioux	cd3d6409fe	resync group_vars/*.sample files `ee2d52d33d` missed this sync between ceph-defaults/defaults/main.yml and group_vars/all.yml.sampl Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-26 08:44:08 +00:00
Guillaume Abrioux	a0cceb3e44	tox: fix a typo the line setting `ANSIBLE_CONFIG` obviously contains a typo introduced by `1e283bf69b` `ANSIBLE_CONFIG` has to point to a path only (path to an ansible.cfg) Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-25 14:40:58 +00:00
Mike Christie	0904860032	igw: stop daemons on purge all calls When purging the entire igw config (lio and rbd) stop and disable the api and gw daemons. Fixes Red Hat BZ https://bugzilla.redhat.com/show_bug.cgi?id=1621255 Signed-off-by: Mike Christie <mchristi@redhat.com>	2018-10-25 12:59:18 +02:00
Rishabh Dave	ff4dc83b87	ceph-validate: avoid "list index out of range" error Be sure that error.path has more than one member before using them. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2018-10-24 18:21:48 +00:00
Guillaume Abrioux	4d698ce831	ceph-infra: reload firewall after rules are added We ensure that firewalld is installed and running before adding any rule. It no longer makes sense not to reload firewalld once the rules are added. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-23 09:53:09 +00:00
Rishabh Dave	ee2d52d33d	allow custom pool size Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1596339 Signed-off-by: Rishabh Dave <ridave@redhat.com>	2018-10-22 16:00:21 +02:00
Guillaume Abrioux	c47aa2e83b	tests: remove unnecessary variables definition since we set `configure_firewall: true` in `ceph-defaults/defaults/main.yml` there is no need to explicitly set it in `centos7_cluster` and `docker_cluster` testing scenarios. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-19 15:12:45 +02:00
Guillaume Abrioux	48cfc60722	defaults: set default `configure_firewall` to `True` Let's configure firewalld by default. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1526400 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-19 15:12:45 +02:00
Sébastien Han	44d0da0dd4	rolling_update: fix upgrade when using fqdn Clusters that were deployed using 'mon_use_fqdn' have a different unit name, so during the upgrade this must be used otherwise the upgrade will fail, looking for a unit that does not exist. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1597516 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-10-19 13:06:56 +00:00
Andrew Schoen	a439eb574d	validate: check the version of python-notario If the version of python-notario is < 0.0.13 an error message is given like "TypeError: validate() got an unexpected keyword argument 'defined_keys'", which is not helpful in figuring out you've got an incorrect version of python-notario. This check will avoid that situation by telling the user that they need to upgrade python-notario before they hit that error. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2018-10-19 09:18:39 +00:00
Guillaume Abrioux	8fa437b7bd	iscsi: fix networking issue on containerized env The iscsi-gw containers can't reach monitors without `--net=host` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-19 00:12:43 +00:00
Guillaume Abrioux	1f9090884e	Revert "tests: test `test_all_docker_osds_are_up_and_in()` from mon nodes" This approach doesn't work with all scenarios because it's comparing a local OSD number expected to a global OSD number found in the whole cluster. This reverts commit `b8ad35ceb9`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-10-19 00:12:43 +00:00
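
To illustrate commit 252d0f9cf2 (ceph-volume: fix TypeError exception when setting osds-per-device > 1), here is a minimal Python sketch of the failure mode and the fix. The helper names `build_batch_cmd` and `expand_args` are hypothetical and are not taken from the ceph-ansible module. The sketch assumes, as the commit message suggests, that each command argument is passed through `os.path.expandvars` before execution, so every element of the argument list has to be a string.

```python
import os


def build_batch_cmd(devices, osds_per_device=1):
    """Build an argument list for `ceph-volume lvm batch` (hypothetical helper)."""
    cmd = ["ceph-volume", "lvm", "batch"]
    if osds_per_device > 1:
        # Convert the integer to str so later string processing does not fail.
        cmd.extend(["--osds-per-device", str(osds_per_device)])
    cmd.extend(devices)
    return cmd


def expand_args(cmd):
    """Expand environment variables in each argument, as a command runner might."""
    return [os.path.expandvars(arg) for arg in cmd]


if __name__ == "__main__":
    # With the str() conversion, expansion succeeds.
    print(expand_args(build_batch_cmd(["/dev/sdb", "/dev/sdc"], osds_per_device=2)))

    # Passing the raw integer reproduces the class of error the commit describes.
    try:
        expand_args(["--osds-per-device", 2])
    except TypeError as exc:
        print("TypeError:", exc)
```

Converting at the point where the argument list is built keeps the rest of the command handling unaware of the original type.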

4056 Commits (db576f6f0edaecb9ceef4139a269ea72c6149e2f)