ceph-ansible

Commit Graph

Author	SHA1	Message	Date
Douglas Fuller	c8573fe0d7	Remove deprecated allow_multimds allow_multimds will be officially deprecated in Mimic, specify it only for all versions of Ceph where it was declared stable. Going forward, specify only max_mds. Signed-off-by: Douglas Fuller <dfuller@redhat.com>	2018-04-12 10:29:17 +02:00
Sébastien Han	82ccbdafbc	ceph-defaults: bring backward compatibility for old syntax If people keep on using the mon_cap, osd_cap etc the playbook will translate this old syntax on the fly. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-11 12:18:34 +02:00
Sébastien Han	9657e4d6fa	ceph_key: use ceph_key in the playbook Replaced all the occurrences of raw commands using the 'command' module with the ceph_key module instead. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-04-11 12:18:34 +02:00
John Fulton	e6e6bd078a	Refer to expected-num-objects as expected_num_objects, not size Follow up patch to PR 2432 [1] which replaces "size" (sorry if the original bug used that term, which can be confusing) with expected_num_objects as is used in the Ceph documentation [2]. [1] https://github.com/ceph/ceph-ansible/pull/2432/files [2] http://docs.ceph.com/docs/jewel/rados/operations/pools	2018-03-26 15:41:51 +02:00
Sébastien Han	e302c1baae	mon: add support for erasure code pool You can now specify type: erasure and erasure_profile to use when declaring the pool dictionary. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Sébastien Han	277d885bc9	mon: add support for pgp, pool type and rule name When creating pools, it's crucial to expose all the options available as part of the pool creation command. As explained in: http://docs.ceph.com/docs/jewel/rados/operations/pools/ Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Sébastien Han	26bc00fb74	mon: fail if pool creation fails There is no reason to continue the deployment if these tasks fail. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1546185 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
Sébastien Han	0011edd2bc	mon: add support for expected-num-objects This commit adds the support for expected-num-objects when creating a pool. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1541520 Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-14 14:22:00 +01:00
jtudelag	691f7c5146	Adds handy ceph aliases for containerized installations. Same approach as openshift-ansible etcdctl: * https://github.com/openshift/openshift-ansible/blob/release-3.7/roles/etcd/tasks/auxiliary/drop_etcdctl.yml * https://github.com/openshift/openshift-ansible/blob/release-3.7/roles/etcd/etcdctl.sh	2018-03-08 13:56:39 +01:00
Sébastien Han	a52ed43093	mon: fix osd_pool_default_crush_rule persistence and effectiveness Running the last portion (insert new default and add new default crush tasks) of crush_rules.yml only on the last monitor is wrong since ceph CLI calls usually end up on the master having the quorum, which is by default the one with the lower IP. So if we run the command and end up on another mon the creation will happen on the default crush rule because the particular mon hasn't been updated. To fix this we remove the \|last on the include and use run_once: true on certain tasks, then we let the final two tasks run on all the monitors. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-06 15:24:31 +00:00
Sébastien Han	47cef7a41d	mon: fix set crush default rule On releases after jewel the option 'osd_pool_default_crush_replicated_ruleset' does not exist anymore, it's called osd_pool_default_crush_rule. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-06 15:24:31 +00:00
Sébastien Han	73c4846744	mon: use ceph_crush module in the playbook Instead of creating the CRUSH hierarchy with Ansible tasks using the command module we now rely on the ceph_crush module. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-03-06 15:24:31 +00:00
Greg Charot	78c1f1938f	mons: Current crush_rule playbook does not work if there is no default rule defined (default: true). One could want to add new crush rules while keeping his current default rule. Fixed it so that it works with all rules defined as "default: false". If multiple rules are defined as default (should not be) then the last rule listed in "crush_rules" is taken as default.	2018-03-06 15:24:31 +00:00
Greg Charot	50afc3fbf3	We don't want to automatically move the rbd pool to the new default crush rule. This operation shall be performed by the cluster operator.	2018-03-06 15:24:31 +00:00
jtudelag	c3267b77b7	Makes use of docker_exec_cmd in ceph-mon role. Keeps consistency inside the role and among roles. Makes the code more readable.	2018-03-05 12:48:35 +00:00
Giulio Fidente	a83e1aeea3	Make rule_name optional when defining items in openstack_pools Previously it was necessary to provide a value (eventually an empty string) for the "rule_name" key for each item in openstack_pools. This change makes that optional and defaults to empty string when not given.	2018-02-23 15:11:53 +01:00
Andy McCrae	b4dbc862d6	Set application for OpenStack pools Since Luminous we need to set the application tag for each pool, otherwise a CEPH_WARNING is generated when the pools are in use. We should assign the OpenStack pools to their default which would be "rbd". When updating to Luminous this would happen automatically to the vms, images, backups and volumes pools, but for new deploys this is not the case.	2018-02-09 17:15:55 +01:00
Giulio Fidente	bdcc52b96d	Check for docker sockets named after both _hostname or _fqdn While hostname -f will always return a hostname including its domain part and -s without the domain part, the behavior when no arguments are given can include or not include the domain part depending on how the system is configured; the socket name might not match the instance name then.	2018-02-06 14:16:54 +01:00
Greg Charot	a6d1922a2e	mon: Fixed crush_rule_config for containerised deployment. Was called too early, container was not yet started so the commands failed. Moved the section after include docker/main.yml Signed-off-by: Greg Charot <gcharot@redhat.com>	2018-02-06 05:12:59 +01:00
Guillaume Abrioux	deaf273b25	syntax: change local_action syntax Use a nicer syntax for `local_action` tasks. We used to have one-liners like this: ``` local_action: wait_for port=22 host={{ hostvars[inventory_hostname]['ansible_default_ipv4']['address'] }} state=started delay=10 timeout=500 }} ``` The usual syntax: ``` local_action: module: wait_for port: 22 host: "{{ hostvars[inventory_hostname]['ansible_default_ipv4']['address'] }}" state: started delay: 10 timeout: 500 ``` is nicer and helps keep the whole playbook consistent. This also fixes a potential issue with missing quotation: ``` Traceback (most recent call last): File "/tmp/ansible_wQtWsi/ansible_module_command.py", line 213, in <module> main() File "/tmp/ansible_wQtWsi/ansible_module_command.py", line 185, in main rc, out, err = module.run_command(args, executable=executable, use_unsafe_shell=shell, encoding=None, data=stdin) File "/tmp/ansible_wQtWsi/ansible_modlib.zip/ansible/module_utils/basic.py", line 2710, in run_command File "/usr/lib64/python2.7/shlex.py", line 279, in split return list(lex) File "/usr/lib64/python2.7/shlex.py", line 269, in next token = self.get_token() File "/usr/lib64/python2.7/shlex.py", line 96, in get_token raw = self.read_token() File "/usr/lib64/python2.7/shlex.py", line 172, in read_token raise ValueError, "No closing quotation" ValueError: No closing quotation ``` Writing `local_action: shell echo {{ fsid }} \| tee {{ fetch_directory }}/ceph_cluster_uuid.conf` can cause trouble because it fails with a missing-quotation error; this change solves the issue. Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1510555 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-31 10:45:34 +01:00
Eduard Egorov	7d7080df6c	crush: create rack type buckets and build crush tree according to {{ osd_crush_location }}. Currently, we can define crush location for each host but only crush roots and crush rules are created. This commit automates other routines for a complete solution: 1) Creates rack type crush buckets defined in {{ ceph_crush_rack }} of each osd host. If it's not defined by user then a rack named 'default_rack_{{ ceph_crush_root }}' would be added and used in next steps. 2) Move rack type crush buckets defined in {{ ceph_crush_rack }} into crush roots defined in {{ ceph_crush_root }} of each osd host. 3) Move hosts defined in {{ ceph_crush_rack }} into crush roots defined in {{ ceph_crush_root }} of each osd host. Signed-off-by: Eduard Egorov <eduard.egorov@icl-services.com>	2018-01-11 17:42:18 +01:00
Guillaume Abrioux	70401f955b	container: trigger handlers on systemd file change When a systemd unit file is changed we should trigger handlers to restart the services. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2018-01-10 16:46:42 +01:00
Sébastien Han	f0787e64da	mon: use crush rules for non-container too There is no reason why we can't use crush rules when deploying containers. So moving the include into main.yml so it can be called. Signed-off-by: Sébastien Han <seb@redhat.com>	2018-01-10 15:21:36 +01:00
Sébastien Han	0b55abe3d0	mon: always run ceph-create-keys ceph-create-keys is idempotent so it's not an issue to run it each time we play ansible. This also fixes issues where the 'creates' arg skips the task and no keys get generated on newer versions, e.g. during an upgrade. Closes: https://github.com/ceph/ceph-ansible/issues/2228 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-12-21 13:50:01 +01:00
John Fulton	ffae294288	Set tighter permissions on keyrings when containerized During a containerized deployment, set the permissions of ceph.client.admin.keyring and other keyrings to chmod 600 and chown it to ceph.	2017-12-06 19:22:28 -05:00
John Fulton	d73f751b63	Make openstack_keys param support no acls list A recent change [1] required that the openstack_keys param always contain an acls list. However, it's possible it might not contain that list. Thus, this param sets a default for that list to be empty if it is not in the structure as defined by the user. [1] `d65cbaa539`	2017-11-16 11:29:59 -05:00
John Fulton	d65cbaa539	Set permissions and ACLs of OpenStack keys on all ceph-mons If ceph-ansible deploys a Ceph cluster with "openstack_config: true" and sets the openstack_keys map to have certain ACLs or permissions, the requested ACLs or permissions are only set on one of the monitor nodes [2] when they should be set on all of them. This patch solves [3] the above issue by having the chmod and setfacl tasks iterate the list of mon nodes (including the mon node that the task was delegated to) to apply the chmod or setfacl to the keys in openstack_keys. [1] ``` openstack_keys: - { name: client.openstack, key: "$(ceph-authtool --gen-print-key)", mon_cap: "allow r", osd_cap: "allow class-read object_prefix rbd_children, allow rwx pool=images, allow rwx pool=vms, allow rwx pool=volumes, allow rwx pool=backups", mode: "0600", acls: ["u:nova:r--", "u:cinder:r--", "u:glance:r--", "u:gnocchi:r--"] } ``` [2] ``` $ ansible mons -m shell -b -a "ls -l /etc/ceph/ceph.client.openstack.keyring ; getfacl /etc/ceph/ceph.client.openstack.keyring" 192.168.1.26 \| SUCCESS \| rc=0 >> -rw-r-----+ 1 root root 253 Nov 3 20:30 /etc/ceph/ceph.client.openstack.keyring user::rw- user:glance:r-- user:nova:r-- user:cinder:r-- user:gnocchi:r-- group::--- mask::r-- other::---getfacl: Removing leading '/' from absolute path names 192.168.1.29 \| SUCCESS \| rc=0 >> -rw-r--r--. 1 root root 253 Nov 3 20:30 /etc/ceph/ceph.client.openstack.keyring user::rw- group::r-- other::r--getfacl: Removing leading '/' from absolute path names 192.168.1.23 \| SUCCESS \| rc=0 >> -rw-r--r--. 1 root root 253 Nov 3 20:30 /etc/ceph/ceph.client.openstack.keyring user::rw- group::r-- other::r--getfacl: Removing leading '/' from absolute path names $ ``` [3] ``` (undercloud) [stack@hci-director ceph-ansible]$ ansible mons -m shell -b -a "ls -l /etc/ceph/ceph.client.openstack.keyring ; getfacl /etc/ceph/ceph.client.openstack.keyring" 192.168.1.25 \| SUCCESS \| rc=0 >> -rw-r-----+ 1 root root 253 Nov 14 01:12 /etc/ceph/ceph.client.openstack.keyring user::rw- user:glance:r-- user:nova:r-- user:cinder:r-- user:gnocchi:r-- group::--- mask::r-- other::---getfacl: Removing leading '/' from absolute path names 192.168.1.29 \| SUCCESS \| rc=0 >> -rw-r-----+ 1 root root 253 Nov 14 01:12 /etc/ceph/ceph.client.openstack.keyring user::rw- user:glance:r-- user:nova:r-- user:cinder:r-- user:gnocchi:r-- group::--- mask::r-- other::---getfacl: Removing leading '/' from absolute path names 192.168.1.27 \| SUCCESS \| rc=0 >> -rw-r-----+ 1 root root 253 Nov 14 01:12 /etc/ceph/ceph.client.openstack.keyring user::rw- user:glance:r-- user:nova:r-- user:cinder:r-- user:gnocchi:r-- group::--- mask::r-- other::---getfacl: Removing leading '/' from absolute path names (undercloud) [stack@hci-director ceph-ansible]$ ```	2017-11-15 10:09:24 -05:00
Sébastien Han	5a10b048b0	Merge pull request #2105 from major/really-fix-always-run Really fix always run	2017-10-27 09:33:47 +02:00
John Fulton	ae156e9f34	Make acls and mode parameters of openstack_keys optional Only chmod or setfacl the requested keyring(s) in the openstack_keys data structure when the mode or acls keys of that data structure exist. User may specify four permission combinations for the keyring file(s): 1. only set ACL, 2. only set mode, 3. set neither mode nor ACL, 4. set mode and then ACL. Fixes: #2092	2017-10-26 12:45:17 +00:00
Major Hayden	f73232caa4	Use check_mode instead of always_run This patch changes the `always_run: yes` task option to `check_mode: no` to avoid Ansible warnings.	2017-10-25 09:53:34 -05:00
Major Hayden	c2b5118c1b	Revert "Avoid deprecated always_run" This reverts commit `620fb37dd4`.	2017-10-25 09:48:09 -05:00
Sébastien Han	4413511b66	all: backward compatibility between stable-2.2 and 3.0 stable-3.0 brought numerous changes in ceph-ansible variables, this PR aims to maintain backward compatibility for someone running stable-2.2 and upgrading to stable-3.0 while keeping their group_vars untouched. We will then determine the right options to make sure the upgrade works but we are expecting that new variables should be used. We will drop this in the near future, maybe 3.1 or 3.2. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-20 11:54:10 +02:00
Sébastien Han	71d819620c	mds: fix fs pool creation 1. add the variables to docker_collocation 2. trigger the check when a MDS is part of the inventory file, not when we run on an MDS... Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-13 16:03:04 +02:00
Major Hayden	c01851325e	Remove jinja2 delimiters from `when` keys This patch changes the `when:` keys so that they have no jinja2 delimiters. This avoids Ansible warnings which could turn into errors in a future Ansible release.	2017-10-12 11:27:42 -05:00
Guillaume Abrioux	17623a2157	Merge pull request #2036 from ceph/cephfs-pool mds: precisely define cephfs pool	2017-10-12 17:47:10 +02:00
Sébastien Han	b49f9bda21	mds: precisely define cephfs pool We now have a variable called ceph_pools that is mandatory when deploying a MDS. It's a dictionary that contains a pool name and a PG count. PG count is mandatory and must be set, the playbook will fail otherwise. Closes: https://github.com/ceph/ceph-ansible/issues/2017 Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-12 15:56:04 +02:00
Major Hayden	620fb37dd4	Avoid deprecated always_run The `always_run` key is deprecated and being removed in Ansible 2.4. Using it causes a warning to be displayed: [DEPRECATION WARNING]: always_run is deprecated. This patch changes all instances of `always_run` to use the `always` tag, which causes the task to run each time the playbook runs.	2017-10-12 08:29:44 -05:00
Sébastien Han	164c77acd1	Merge pull request #1995 from ceph/remove-rbd-check jewel: remove rbd check	2017-10-05 15:31:48 +02:00
Sébastien Han	b545080d71	Merge pull request #1988 from ceph/fix_keyrings docker: fix keyrings copied on all nodes	2017-10-05 14:30:09 +02:00
Sébastien Han	bbf6bebe32	jewel: remove rbd check The value of doing this is fairly low compared to the added value. So we remove these tasks, if rbd pool on Jewel doesn't have the right PG value you can always increase it. Signed-off-by: Sébastien Han <seb@redhat.com>	2017-10-05 14:21:37 +02:00
Jan Provaznik	43e57abfd8	Evaluate cephfs pool variables Otherwise pools with names 'cephfs_data' and 'cephfs_metadata' are created.	2017-10-05 10:00:20 +02:00
Guillaume Abrioux	70e2787fe2	docker: fix keyrings copied on all nodes All keyrings are getting copied to all nodes. This commit fixes a leftover from a previous code refactor. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1498583 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-05 09:23:22 +02:00
Guillaume Abrioux	2c4258a0fd	Refact code for set_osd_pool_default_* This commit refactors the code regarding all `set_osd_pool_default_*` related tasks by avoiding usage of useless `set_fact` to determine whether a key is present in `ceph_conf_overrides`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-04 15:40:10 +02:00
Al Lau	6aca67bc9c	Only perform actions on the rbd pool after it has been created The rbd pool is the default pool that gets created during ceph cluster initialization. If we act on the rbd related operations too early, the rbd pool does not exist yet. Move the call to perform rbd operations to a later stage after other pools have been created. The rbd_pool.yml playbook has all the operations related to the rbd pool. Replace the always_run (deprecated) directive with check_mode. Most of the ceph related tasks only need to run once. The run_once directive executes the task on the first host. The ceph sub-command to delete a pool is delete (not rm). The changes submitted here were tested with this ceph version. ceph version 0.94.9-9.el7cp (b83334e01379f267fb2f9ce729d74a0a8fa1e92c) This upload includes these changes: - Use the fail module (instead of assert). - From luminous release, the rbd pool is no longer created by default. Delete the code to create the rbd pool for luminous release - Conform the .yml files to use the suggested syntax. The commands are executed on the mcp nodes and I think shell ansible module is the right one to use. The command module is used to execute commands on remote nodes. I can make the change to use command module if that is preferred.	2017-10-04 15:40:10 +02:00
Guillaume Abrioux	784cc73da0	set docker_exec_cmd fact early in each role This is to ensure `docker_exec_cmd` fact is set with the correct value in case of daemons collocation Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-04 11:31:09 +02:00
Guillaume Abrioux	62770cd7de	refact MDS role This commit refactors the ceph-mds role. The goal here is to create cephfs in `ceph-mon` for both containerized and non-containerized cases so we don't need the admin keyring on mds nodes anymore. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1488999 Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-10-02 09:12:31 +02:00
Guillaume Abrioux	466f6f35b7	Use systemd module instead of service. Using systemd module allows us to do in one task what we did in three tasks: - enable unit file, - issue a `daemon-reload`, - start the service Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-29 14:54:00 +02:00
Guillaume Abrioux	913ad53709	docker: add condition to run selinux tasks only on rhel os family This fixes the error : ``` The conditional check 'sestatus.stdout != 'Disabled'' failed. ``` that occurs when running on non rhel based system since the `sestatus` fact is registered only on rhel based distribution. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-29 02:35:07 +02:00
Douglas Fuller	9bcbf748a3	mon/ceph_keys: Add timeout flag to ceph-create-keys Specify the timeout flag to ceph-create-keys, which causes it to time out if a monitor quorum isn't achieved. This overrides the default timeout of 10 minutes, causing ceph-ansible to fail faster in the event of cluster network issues. Signed-off-by: Douglas Fuller <dfuller@redhat.com>	2017-09-27 18:05:59 -04:00
Guillaume Abrioux	62cd0bae54	rbd: fix missing keyring on nodes the rbd key was not pushed on rbd nodes because its keyring path was not added in `ceph_config_keys`. Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>	2017-09-21 09:56:37 +02:00


315 Commits (a98885a71ec63ff129d7001301a0323bfaadad8a)