59 Commits

Author SHA1 Message Date
Josh Durgin
bccdef6650 tasks/ceph: ignore EEXIST for the archive data dir creation
With multiple clusters this will be called multiple times.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 14:55:27 -07:00
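
The idempotent directory creation described in bccdef6650 can be sketched locally as follows. This is a minimal illustration and not the teuthology code itself: the helper name `ensure_dir` and the temporary paths are hypothetical, and the real task creates the directory on remote test hosts rather than locally.

```python
import errno
import os
import tempfile


def ensure_dir(path):
    """Create path, treating 'already exists' as success (hypothetical helper)."""
    try:
        os.makedirs(path)
    except OSError as e:
        # Re-raise anything other than EEXIST (e.g. permission errors).
        if e.errno != errno.EEXIST:
            raise


if __name__ == "__main__":
    base = tempfile.mkdtemp()
    target = os.path.join(base, "archive", "data")
    ensure_dir(target)  # first cluster's setup creates the directory
    ensure_dir(target)  # a second cluster's setup hits EEXIST, which is ignored
    print(os.path.isdir(target))
```

Catching only EEXIST keeps other failures visible, which matters when the same setup step runs once per cluster.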
Josh Durgin
4c2e7309db tasks/ceph: pull each mon dir only once
No need to pull all mon dirs for the host for each mon on the host.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 14:55:26 -07:00
Josh Durgin
3203b76792 tasks/ceph: only run ceph_log and valgrind_post once
These set up and parse logs on all hosts, so they should be run only
for the first cluster setup. This cluster will be torn down last, so
the cleanup happens after all clusters are shut down as well.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 14:55:26 -07:00
Josh Durgin
96e7724e23 tasks/ceph: make scrubbing cluster-aware
Simplify implementation by using manager and teuthology.misc methods
instead of reinventing them here.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 14:55:26 -07:00
Josh Durgin
3948f108a8 tasks/ceph: make restart subtask cluster-aware
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 11:51:59 -07:00
Josh Durgin
9ad65769c7 tasks/ceph: make wait_for_mon_quorum cluster-aware
Accept a 'daemons' list like other ceph subtasks, so it can get an
optional 'cluster' setting too.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 11:51:59 -07:00
Josh Durgin
bb76deaf57 tasks/ceph: make wait_for_osds_up cluster-aware
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 11:51:59 -07:00
Josh Durgin
ef619062be tasks/ceph: update ctx.manager usage to ctx.managers
Not sure this function is ever used (no users in ceph-qa-suite yamls
or tasks).

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 11:51:59 -07:00
Josh Durgin
524e6d7a5e tasks/ceph_manager: add cluster param to write_conf()
Only used by cephfs right now, so don't bother changing callers.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 11:51:59 -07:00
Josh Durgin
141c73d399 tasks/ceph_manager: parameterize CephManager with cluster
Add --cluster arguments, pass cluster to get_daemon() and
iter_daemons_of_role, replace 'ceph' with cluster in paths, and use
ctx.ceph[cluster] instead of ctx.ceph.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 11:51:58 -07:00
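
The kind of parameterization described in 141c73d399 might look roughly like the sketch below. It assumes the standard `ceph --cluster NAME` command-line option and the conventional `/etc/ceph/<cluster>.conf` layout. The helper names `ceph_cmd` and `conf_path` and the sample per-cluster dict are hypothetical and are not taken from `ceph_manager.py`.

```python
def ceph_cmd(cluster, *args):
    """Build a ceph CLI argument list that targets the named cluster."""
    return ["ceph", "--cluster", cluster] + list(args)


def conf_path(cluster):
    """Return the config path with the cluster name in place of a fixed 'ceph'."""
    return "/etc/ceph/{cluster}.conf".format(cluster=cluster)


# Hypothetical stand-in for per-cluster state indexed by name,
# in the spirit of ctx.ceph[cluster] replacing a single ctx.ceph.
per_cluster = {
    "ceph": {"conf": conf_path("ceph")},
    "backup": {"conf": conf_path("backup")},
}

if __name__ == "__main__":
    print(ceph_cmd("backup", "health"))
    print(per_cluster["backup"]["conf"])
```

Passing the cluster name explicitly everywhere avoids hidden dependence on the default name `ceph`.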
Josh Durgin
0acbafe371 tasks/ceph: store cluster config in a per-cluster dict
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 11:51:58 -07:00
Josh Durgin
25ff14af74 tasks/ceph: create a CephManager per cluster
Thrashing tasks will be updated to use ctx.managers indexed by
cluster later.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 11:51:58 -07:00
Josh Durgin
d135470305 tasks/ceph: make healthy() cluster-aware
ceph.healthy may be used as a standalone task, so it may not always
have the cluster name in its configuration.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 11:51:58 -07:00
Josh Durgin
b0dd04736e tasks/ceph: make cephfs_setup() cluster-aware
Note that cephfs tests using the Filesystem abstractions will need to
be converted to understand multiple clusters later. This just updates
the ceph task portion.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 11:51:58 -07:00
Josh Durgin
4409710102 tasks/ceph: make crush_setup() cluster-aware
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 11:51:58 -07:00
Josh Durgin
e19e9e2fa3 tasks/ceph: adapt to cluster-aware daemon.resolve_role_list
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 11:51:57 -07:00
Josh Durgin
26b8a1f3ac tasks/ceph: make run_daemon() cluster-aware
Pass --cluster where appropriate and include the full role in file
names.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 11:51:57 -07:00
Josh Durgin
a54ff597fa tasks/ceph: convert cluster creation to work with multiple clusters
Add a cluster option to the ceph task, and pass that through to
cluster(). Make sure monitors and clients don't collide by adding
their cluster to paths they use.

This assumes there is one ceph task per cluster, and osds from
multiple clusters do not share hosts (or else the block device
assignment won't work).

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2016-05-09 11:51:57 -07:00
Sage Weil
52b13e82a6 tasks/ceph: allow set allow_multiple to fail
This will fail on upgrade tests.

Signed-off-by: Sage Weil <sage@redhat.com>
2016-04-16 09:10:51 -04:00
Greg Farnum
5e7e017d7d cephfs: update tests to enable multimds when needed
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2016-04-06 16:32:44 -07:00
Dan Mick
228f71e176 tasks/ceph.py: Remove *.pid at end of run
http://tracker.ceph.com/issues/15162
Fixes: #15162
Signed-off-by: Dan Mick <dan.mick@redhat.com>
2016-03-16 15:50:30 -07:00
John Spray
f05d977628 tasks/ceph: fix up whitespace
...because otherwise it lights up like a christmas
tree in pycharm.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-11-09 13:20:49 +00:00
Loic Dachary
8f9de175e0 ceph: log which ceph.conf file is written
Signed-off-by: Loic Dachary <ldachary@redhat.com>
2015-10-21 15:49:48 +02:00
John Spray
eab70197a6 tasks/ceph: wait for MDS to be active when creating a cluster
This is the correct implementation of 685d76a77c,
merged while broken in ff1655cb57 and
reverted in 4cccde634f.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-09-21 16:05:51 -07:00
Greg Farnum
4cccde634f Revert "Merge pull request #567 from ceph/ceph_fuse-timeout"
This reverts commit ff1655cb57, reversing
changes made to 2b25080d4f.

Since we haven't actually started the MDS daemons yet, this code is broken.

Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2015-09-17 07:29:28 -07:00
Greg Farnum
685d76a77c ceph: wait for CephFS to be healthy before proceeding
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2015-09-16 13:53:36 -07:00
Zack Cerza
e9847570de Merge pull request #561 from ceph/wip-sudo
sudo ceph
2015-09-11 10:20:50 -06:00
Sage Weil
dad981d339 tasks: sudo ceph for cli
/var/run/ceph is 770.  This is mainly necessary for any
interaction with the daemon sockets, but it is what users do
and it may avoid log noise.

Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-11 12:15:01 -04:00
Sage Weil
9b3f36f91f ceph: add option to expect valgrind errors and fail if there are none
See #10328
Signed-off-by: Sage Weil <sage@redhat.com>
2015-09-11 11:53:37 -04:00
John Spray
9f530092e2 tasks: fix syntax error in ceph.py
From e195f9fa.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-08-06 10:25:03 +01:00
Zack Cerza
e195f9fa1c Set the SELinux context of the logrotate config
Signed-off-by: Zack Cerza <zack@redhat.com>
2015-08-05 10:08:35 -06:00
Greg Farnum
451c5ca79d ceph: fix up log rotation stopper
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2015-06-12 11:19:24 -07:00
Greg Farnum
6573e92fb3 ceph: update log rotation for review comments
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2015-06-09 16:16:50 -07:00
Greg Farnum
96f3eb9dbe ceph: support arbitrarily-named daemons in logrotate
And make it more configurable in terms of sizes.

Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2015-06-08 17:36:49 -07:00
Greg Farnum
5935f86e49 ceph: enable mds log rotation
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2015-06-08 17:36:49 -07:00
Jason Dillaman
8392d7f213 tasks: add support for running fsx under valgrind
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2015-04-28 09:44:05 -04:00
John Spray
ea7c39222a tasks/ceph: refactor legacy FS configuration check
Move up into Filesystem so that this can be used from
the ceph_deploy task as well.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-04-19 17:46:18 -07:00
Andrew Schoen
8cb28ddb8e Revert "ceph: be less weird about passing -f to mkfs"
2015-04-02 15:08:13 -05:00
Sage Weil
182cb63034 ceph: fix mkfs -f bug
Pass -f by default to btrfs instead of first trying without and *then*
trying with.

Among other things, this avoids a confusing failure where we try mkfs.ext4
device (no -f), fail for some reason, and then try again with -f and get
a usage error (-f does not mean force for mke2fs).

Signed-off-by: Sage Weil <sage@redhat.com>
2015-03-31 07:56:53 -07:00
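
The approach described in 182cb63034 (choose the flags once, based on the filesystem type, instead of retrying with `-f`) could be sketched as below. This is an assumption-laden illustration: the function name `mkfs_args` and the device path are hypothetical, and only the btrfs case named in the commit message is handled.

```python
def mkfs_args(fs_type, device):
    """Build a mkfs command line, adding -f only where the commit says to (btrfs)."""
    args = ["mkfs.{}".format(fs_type)]
    if fs_type == "btrfs":
        # mkfs.btrfs uses -f to force overwriting an existing filesystem.
        args.append("-f")
    # Other filesystems get no -f here, so mkfs.ext4 is never retried
    # with a flag that does not mean "force" for mke2fs.
    args.append(device)
    return args


if __name__ == "__main__":
    print(mkfs_args("btrfs", "/dev/sdb"))  # hypothetical device
    print(mkfs_args("ext4", "/dev/sdb"))
```

Deciding the flags up front means a genuine mkfs failure is reported once, rather than being masked by a second attempt that fails with a usage error.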
Sage Weil
1922c61bbf ceph: ugh fix syntax
Signed-off-by: Sage Weil <sage@redhat.com>
2015-02-25 11:37:44 -08:00
Sage Weil
18307be0ca ceph: fix ps axuf lsof line
Signed-off-by: Sage Weil <sage@redhat.com>
2015-02-25 11:05:13 -08:00
Sage Weil
a68281e147 ceph: ps axf too before lsof
Specifically, I want to know *who* is running the ceph-osd that is
holding the files open.

Signed-off-by: Sage Weil <sage@redhat.com>
2015-02-24 06:59:16 -08:00
Sage Weil
077e917af1 ceph: lsof if umount fails
Signed-off-by: Sage Weil <sage@redhat.com>
2015-02-23 13:52:48 -08:00
Sage Weil
ca09683f5f ceph: archive logs at very end
This ensures that we still gather the logs even if the other nested tasks
throw an exception in the finally block.

Signed-off-by: Sage Weil <sage@redhat.com>
2015-02-23 13:52:48 -08:00
Sage Weil
4c22dac311 Merge pull request #302 from ceph/wip-crush-tunables
ceph: set crush tunables
2015-01-27 11:26:10 -08:00
Sage Weil
9e9d7c91eb ceph: set crush tunables
Signed-off-by: Sage Weil <sage@redhat.com>
2015-01-26 13:42:56 -08:00
Sage Weil
cb74060cd4 ceph: no need to stop() before restart()
restart() will stop if the daemon is running.  This will get rid of the
spurious error

2015-01-23 15:19:36,828.828 ERROR:tasks.ceph.osd.0:tried to stop a non-running daemon

when the daemon isn't already running.

Signed-off-by: Sage Weil <sage@redhat.com>
2015-01-23 15:21:31 -08:00
Sage Weil
83dd3d58a1 ceph: add wait_for_failure command
Expect and then ignore a daemon failure.

Signed-off-by: Sage Weil <sage@redhat.com>
2015-01-23 15:20:50 -08:00
John Spray
997c0c3d62 tasks/cephfs: remove config from Filesystem cons
This was only used in get_first_mon, which doesn't actually
need the parameter itself.  Makes it easier to casually
use Filesystem from any place with a ctx to hand.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-09 10:09:53 +00:00
John Spray
213641f315 tasks/ceph: update mds auth caps to 'allow *'
Since the new 'tell' for the MDS was introduced,
caps have to have the '*' to permit running remote
administrative commands.

Signed-off-by: John Spray <john.spray@redhat.com>
2015-01-09 10:07:34 +00:00