ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Boris Ranto	8f77caa932	dashboard: Add and copy alerting rules This commit adds a list of alerting rules for ceph-dashboard from the old cephmetrics project. It also installs the configuration file so that the rules get recognized by the prometheus server. Signed-off-by: Boris Ranto <branto@redhat.com>	2019-05-16 16:39:13 +02:00
Zack Cerza	9b4339a2ba	purge-docker-cluster.yml: Default lvm_volumes We were failing when that variable is unset; purge-cluster.yml contains this workaround. Signed-off-by: Zack Cerza <zack@redhat.com>	2019-05-16 16:39:13 +02:00
Boris Ranto	2f141a6e80	Merge cephmetrics/dashboard-ansible repo This commit will merge dashboard-ansible installation scripts with ceph-ansible. This includes several new roles to setup ceph-dashboard and the underlying technologies like prometheus and grafana server. Signed-off-by: Boris Ranto & Zack Cerza <team-gmeno@redhat.com> Co-authored-by: Zack Cerza <zcerza@redhat.com> Co-authored-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-05-16 16:39:13 +02:00
wumingqiao	5320aa11c4	shrink_osd: mark all osd(s) out in one command Signed-off-by: wumingqiao <wumingqiao@beyondcent.com>	2019-05-15 16:04:28 +02:00
Guillaume Abrioux	2798774e96	tests: fix a typo in dev_setup.yml `c907ec41ae` introduced a typo. This commit fixes it. ``` [WARNING]: While constructing a mapping from /home/guits/ceph-ansible/tests/functional/dev_setup.yml, line 21, column 9, found a duplicate dict key (replace). ``` Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-05-15 11:33:26 +02:00
Dimitri Savineau	168d7cd016	purge-docker-cluster: remove docker data We never clean the content of /var/lib/docker so we can still have some data present in this directory after run the purge playbook. Pip isn't used anymore. Also update the docker package name (especially the python binding one). Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-05-14 10:55:43 +02:00
Dimitri Savineau	d2ad191eca	container-common: allow podman for other distros Currently podman installation is very tied to RHEL 8 even if we're able to install it on Debian/Ubuntu distribution. This patch changes the way we are starting or not the (fat) container daemon. Before the condition was based on the distribution release and now on the container_service_name variable. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-05-13 16:24:00 +02:00
Bruceforce	c3b0ee30a1	ceph-nfs: fixed with_items If we do this in one line we get the error described in #3968 fixes #3968 Signed-off-by: Bruceforce <markus.greis@gmx.de>	2019-05-13 16:23:43 +02:00
Dimitri Savineau	ea1f8f551c	gather-ceph-logs: fix logs list generation The shell module doesn't have a stdout_lines attributes. Instead of using the shell module, we can use the find modules. Also adding `become: false` to the local tmp directory creation otherwise we won't have enough right to fetch the files into this directory. Resolves: #3966 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-05-13 16:23:05 +02:00
Bruceforce	29f2c953b4	ceph-nfs: fixed condition for "stable repos specific tasks" The old condition would resolve to "when": "nfs_ganesha_stable - ceph_repository == 'community'" now it is "when": [ "nfs_ganesha_stable", "ceph_repository == 'community'" ] Please backport to stable-4.0 Signed-off-by: Bruceforce <markus.greis@gmx.de>	2019-05-13 09:53:54 +02:00
Dimitri Savineau	ba49225eab	Update RHCS version with Nautilus RHCS 4 will be based on Nautilus and only usable on RHEL 8. Updated the default ceph_rhcs_version to 4 and update the rhcs repositories to rhcs 4 with RHEL 8. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-05-13 09:53:18 +02:00
Kevin Coakley	381c58ca3e	Set the rgw_create_pools pools application to rgw Set the application to rgw for pools created from rgw_create_pools. On Ceph Nautilus the heath is set to HEALTH_WARN with the message "application not enabled on X pool(s)" if an application isn't specified for a pool. Signed-off-by: Kevin Coakley <kcoakley@sdsc.edu>	2019-05-13 09:48:25 +02:00
Mike Christie	d7ef12910e	igw: Fix rolling update service ordering We must stop tcmu-runner after the other rbd-target-* services because they may need to interact with tcmu-runner during shutdown. There is also a bug in some kernels where IO can get stuck in the kernel and by stopping rbd-target-* first we can make sure all IO is flushed. Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=1659611 Signed-off-by: Mike Christie <mchristi@redhat.com>	2019-05-10 09:40:52 +02:00
Rishabh Dave	121b5e4184	ceph-rbd-mirror: refactor tasks/main.yml Use blocks for similar tasks in main.yml. And move when keywords before block keywords. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-05-10 09:21:54 +02:00
Rishabh Dave	1a4dccdbb9	ceph-mds: group similar tasks in create_mds_filesystem.yml Group similar tasks together using block keyword. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-05-10 09:21:05 +02:00
Dimitri Savineau	52b9f3fb28	tox: Refact lvm_osds scenario The current lvm_osds only tests filestore on one OSD node. We also have bs_lvm_osds to test bluestore and encryption. Let's use only one scenario to test filestore/bluestore and with or without dmcrypt on four OSD nodes. Also use validate_dmcrypt_bool_value instead of types.boolean on dmcrypt validation via notario. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-05-09 09:38:20 +02:00
Guillaume Abrioux	936c6fca78	facts: fix external cluster bug running an external ceph cluster deployment with (obviously) no monitors defined in inventory breaks with an undefined error because `_monitor_addresses` never get defined. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1707460 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-05-07 17:53:53 +02:00
Rishabh Dave	56bfec7c58	ceph-mgr: create keys for MGRs Add code in ceph-mgr for creating a keyring for manager in so that managers can be deployed on a separate node too. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-05-07 14:13:06 +02:00
Rishabh Dave	d2cfd8b780	allow adding a manager to a deployed cluster Add a playbook that deploys manager on a new node and adds that node to the already deployed Ceph cluster. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1677431 Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-05-07 14:13:06 +02:00
Rishabh Dave	6e8fb2b3ea	remove infrastructure-playbooks/rgw-standalone.yml We don't need infrastructure-playbooks/rgw-standalone.yml since site.yml.sample and site-cotainer.yml.sample can add a new RGW node to an already deployed Ceph cluster. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-05-07 13:05:17 +02:00
Rishabh Dave	89748d579a	don't access other node's docker_exec_cmd variable Except for some corner case, it's not correct to access some other node's copy of variable docker_exec_cmd. Therefore replace "hostvars[groups[mon_group_name][0]]['docker_exec_cmd']" by "docker_exec_cmd". Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-05-07 12:37:48 +02:00
Rishabh Dave	f201222447	allow adding a RGW to already deployed cluster Add a tox scenario that adds a new RGW node as a part of already deployed Ceph cluster and deploys RGW there. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1677431 Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-05-07 12:36:16 +02:00
letterwuyu	d57f6fcdc6	Fix comment content Signed-off-by: lishuhao letterwuyu@gmail.com	2019-05-07 10:54:44 +02:00
Gaudenz Steinlin	3c8987c7a5	Fix check mode support Adds "check_mode: no" to commands which register cluster state in a variable and don't modify anything. These commands have to run in order to support running the playbook in check mode. Signed-off-by: Gaudenz Steinlin <gaudenz.steinlin@cloudscale.ch>	2019-05-07 09:49:20 +02:00
Rishabh Dave	221b2b4988	allow adding a RBD mirror to already deployed cluster Add a tox scenario that adds a new RBD mirror node as a part of already deployed Ceph cluster and deploys RBD mirror there. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1677431 Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-05-07 09:45:20 +02:00
Dimitri Savineau	ae266c6f2b	ansible: remove private and static attribute This will be removed in ansible 2.8 and breaks the playbook execution with this release. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-05-02 14:25:17 -04:00
Dimitri Savineau	1999cf3d19	ceph-mds: Increase cpu limit to 4 In containerized deployment the default mds cpu quota is too low for production environment. This is causing performance degradation compared to bare-metal. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1695850 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-04-24 20:33:02 +02:00
Dimitri Savineau	c17106874c	ceph-osd: Increase cpu limit to 4 In containerized deployment the default osd cpu quota is too low for production environment using NVMe devices. This is causing performance degradation compared to bare-metal. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1695880 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-04-24 17:59:42 +02:00
Jugwan Eom	6c4f48812a	validate: check custom repository config options This adds missing configuration options when the 'custom' repository is used. Signed-off-by: Jugwan Eom <zugwan@gmail.com>	2019-04-24 11:34:12 +02:00
Dimitri Savineau	4ae5ce399b	ceph-iscsi: start tcmu-runner for non-container Only rbd-target-api and rbd-target-gw were started/enabled for non containerized deployment. The issue doesn't happen with containerized setup. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-04-24 10:03:25 +02:00
Dimitri Savineau	564ec9c992	tests: group and parametrize tests Instead of creating a dedicated test and using the same testinfra module we can group them into a single test to avoid multiple ansible connections and testinfra module execution. This patch also adds parametrize pytest decorator when possible. Finally fixing some flake minor issue. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-04-24 10:03:25 +02:00
Dimitri Savineau	8ab6a3391f	tox: Remove update scenario reference update scenario is now handled by tox-update.ini file so we shoudn't have update reference in tox.ini file. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-04-24 10:02:45 +02:00
Dimitri Savineau	1eeddc394d	Update group_vars according to defaults `b2f2426` didn't use the generate_group_vars_sample.sh script so we currently have a difference between the content in group_vars and the ceph-defaults/defaults directories. Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-04-24 09:48:25 +02:00
Dimitri Savineau	f1048627ea	rolling_update: restart all ceph-iscsi services Currently only rbd-target-gw service is restarted during an update. We also need to restart tcmu-runner and rbd-target-api services during the ceph iscsi upgrade. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1659611 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-04-24 07:47:23 +00:00
Guillaume Abrioux	d6e28ffd27	validate: fix a typo `5aa2779461` introduced a typo. This commit fixes it. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-23 10:18:05 -04:00
Rishabh Dave	739a662c80	improve coding style Keywords requiring only one item shouldn't express it by creating a list with single item. Signed-off-by: Rishabh Dave <ridave@redhat.com>	2019-04-23 15:37:07 +02:00
Guillaume Abrioux	2326180bf9	validate: fix notario error Typical error: ``` AttributeError: 'Invalid' object has no attribute 'message' ``` As of python 2.6, `BaseException.message` has been deprecated. When using python3, it fails because it has been removed. Let's use `str(error)` instead so we don't hit this error when using python3. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-23 09:36:19 -04:00
Radu Toader	b2f242660e	Allow CephFS pool to be created with specific rule_name, erasure_profile just like rbd pools Signed-off-by: Radu Toader <radu.m.toader@gmail.com>	2019-04-20 02:26:05 +00:00
Dimitri Savineau	8105a1cefb	ceph-container-common: modify requirement flow Until now it was not possible to install a specific container package because it was somehow hardcoded. This patch allows to override the container package name (docker.io vs docker-ce) and refacts the package installation. This could be achieve via the container_package_name variable. Instead of using one task per distribution we can set the package and service name in vars. This allows to have a unified package task. Also refactorize the debian_prerequisites tasks because the content was outdated. https://docs.docker.com/install/linux/docker-ce/debian/ https://docs.docker.com/install/linux/docker-ce/ubuntu/ Resolves: #3609 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-04-18 16:18:01 +02:00
Florian Haas	37962fec92	doc: update index.rst with current information for stable-4.0 With the stable-4.0 branch nearing release, update docs/source/index.rst with current information about which Ceph releases are supported, and which Ansible versions are required, for each branch. Signed-off-by: Florian Haas <florian@citynetwork.eu>	2019-04-18 16:16:46 +02:00
Guillaume Abrioux	58f3851573	mds: remove legacy task this task has nothing to do in stable-4.0 and after. Let's remove it since stable-4.0 and after aren't intended to deploy luminous. Closes: #3873 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-18 15:55:45 +02:00
Kyle Bader	0bee90b201	rgw: add cpuset support 1/ The OSD already supports cpuset to be used for containerized deployments through the use of the ceph_osd_docker_cpuset_cpus variable. This adds similar support to the RGW service for containerized deployments by setting a new variable named ceph_rgw_docker_cpuset_cpus. Like the OSD, there are times where using distinct cores has advantages over using the CFS in kernel scheduler. ceph_rgw_docker_cpuset_cpus accepts a comma delimited set of CPU ids 2/ Add support for specifying --cpuset-mem variable to restrict the cgroup's memory allocations to a particular numa node, which should typically correspond with the cpu ids of that numa node that were provided with --cpuset-cpus. To ensure the correct cpu ids are used one can run `numactl --hardware` to list the nodes and which cpu ids correspond to each. Signed-off-by: Kyle Bader <kbader@redhat.com> Co-authored-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-18 15:55:19 +02:00
Dimitri Savineau	86315272c7	ceph-mgr: Add extra module packages Since Nautilus there's mgr extra modules not present in ceph-mgr package but in dedicated packages. Resolves: #3860 Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>	2019-04-18 15:31:22 +02:00
Guillaume Abrioux	7eb42c9e8e	update: ensure tasks are executed on an upgraded mon These tasks must be run from a monitor which is upgraded otherwise it might fail. See: https://tracker.ceph.com/issues/39355 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-18 11:16:11 +02:00
Guillaume Abrioux	ed84325b1d	update: ensure ceph command returns 0 these commands could return something else than 0. Let's ensure all retries have been done before actually failing. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-18 11:16:11 +02:00
Guillaume Abrioux	543d1e2e41	update: set osd flags before upgrading any mon Typical error: ``` failed: [mon0 -> mon2] (item=noout) => changed=true cmd: - ceph - --cluster - ceph - osd - set - noout delta: '0:00:00.293756' end: '2019-04-17 06:31:57.552386' item: noout msg: non-zero return code rc: 1 start: '2019-04-17 06:31:57.258630' stderr: \|- Traceback (most recent call last): File "/bin/ceph", line 1222, in <module> retval = main() File "/bin/ceph", line 1146, in main sigdict = parse_json_funcsigs(outbuf.decode('utf-8'), 'cli') File "/usr/lib/python2.7/site-packages/ceph_argparse.py", line 788, in parse_json_funcsigs cmd['sig'] = parse_funcsig(cmd['sig']) File "/usr/lib/python2.7/site-packages/ceph_argparse.py", line 728, in parse_funcsig raise JsonFormat(s) ceph_argparse.JsonFormat: unknown type CephBool stderr_lines: - 'Traceback (most recent call last):' - ' File "/bin/ceph", line 1222, in <module>' - ' retval = main()' - ' File "/bin/ceph", line 1146, in main' - ' sigdict = parse_json_funcsigs(outbuf.decode(''utf-8''), ''cli'')' - ' File "/usr/lib/python2.7/site-packages/ceph_argparse.py", line 788, in parse_json_funcsigs' - ' cmd[''sig''] = parse_funcsig(cmd[''sig''])' - ' File "/usr/lib/python2.7/site-packages/ceph_argparse.py", line 728, in parse_funcsig' - ' raise JsonFormat(s)' - 'ceph_argparse.JsonFormat: unknown type CephBool' stdout: '' stdout_lines: <omitted> ``` Having mixed versions of monitors seems to cause this error. Moving these tasks before any monitor gets upgraded seems to be enough to get around this issue. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-18 11:16:11 +02:00
Guillaume Abrioux	a4bc7bda51	update: refact msgr2 migration this commit refact the msgr2 protocol introduction. If it's a fresh install, let's go with v2 only. If we upgrade to nautilus, we should go with v2+v1 syntax to ensure nothing breaks. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2019-04-18 11:16:11 +02:00
Andrew Schoen	e2529dcd7f	rolling_update: ceph commands should use --cluster Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2019-04-18 10:55:11 +02:00
Andrew Schoen	67453853ff	rolling_update: set num_osds to the number of running osds We do this so that the ceph-config role can most accurately report the number of osds for the generation of the ceph.conf file. We don't want to use ceph-volume to determine the number of osds because in an upgrade to nautilus ceph-volume won't be able to accurately count osds created by ceph-disk. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2019-04-18 10:55:11 +02:00
Andrew Schoen	5e3dfe5021	ceph-osd: do not run lvm batch tasks during update When performing a rolling update do not try to create any new osds with `ceph-volume lvm batch`. This is troublesome because when upgrading to nautilus the devices list might contain devices that are currently being used by ceph-disk and have GPT headers on them, which will cause ceph-volume to fail when trying to use such a device. Any devices originally created by ceph-disk will need to be removed from the devices list before any new osds can be created. Signed-off-by: Andrew Schoen <aschoen@redhat.com>	2019-04-18 10:55:11 +02:00

... 12 13 14 15 16 ...

5238 Commits (44e1ebaaff61b5317bae0a3483f5d485c12bd18b) All Branches Search

5238 Commits (44e1ebaaff61b5317bae0a3483f5d485c12bd18b)

All Branches