When keep_running is true, do not shutdown the cluster, leave it as it
is for other workunits or tasks to use. This effectively allows the
ceph-deploy task to be used as a helper to deploy clusters.
The call to build_ceph_cluster is simplified by giving it the whole
configuration dictionary instead of re-building one with selected arguments.
Signed-off-by: Loic Dachary <loic@dachary.org>
When ceph-deploy fails, run ceph report to get more information about
the state of the cluster at the time of the failure.
Signed-off-by: Loic Dachary <loic@dachary.org>
In our RHCS 1.3 ceph-deploy docs, we tell users to run "ceph-deploy
install --cli" on their calamari admin node, but our smoke test wasn't
actually doing this.
See https://bugzilla.redhat.com/1252929 , "[Ubuntu 1.3.0] - ceph-deploy
install --no-adjust-repos --cli `hostname` is failing with a Traceback
error"
In RHCS 1.2 we don't have a /mnt/MON directory. The intention of
35c6363a1e was to handle this condition,
but in 1.2, the non-zero return code makes Teuthology fail the whole
test.
We don't want *Teuthology* itself to act on the return code here; we
simply want to know what it was and structure the rest of the test
accordingly.
lttng is not yet part of any private repo; since 1.3.0/CentOS is not
a supported product, just grab it from EPEL for this test.
Signed-off-by: Dan Mick <dan.mick@redhat.com>
Otherwise, ceph-deploy will install an apt source that points to
ceph.com, which will override the local ISO repos.
No --mon/--osd yet until 12147 is fixed
Signed-off-by: Dan Mick <dan.mick@redhat.com>
also waits to remove it from dead_osds. this fixes an issue where
do_sighup tries to send a signal to an osd that has not been revived
yet.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
This will call Thrasher.do_sighup which picks a random osd and sends a
signal.SIGHUP to it, delaying for the value of sighup_delay between each
time it picks a new osd to signal.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
This method runs in a separate greenlet than do_thrash and will pick a
random live osd to send a signal.SIGHUP to. There is a config option,
sighup_delay, which controls how long to delay between sending the
signals.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
ceph-deploy enables EPEL on Centos; this caused problems for
Calamari. Undo it after ceph-deploy has installed Ceph.
Signed-off-by: Dan Mick <dan.mick@redhat.com>
1) add a wait time before the mount attempt to let the cluster get set up.
By default this should be skipped, but for VMs and known-slow systems we
can give them 60 seconds.
2) Make the timeout configurable, with a 30-second default, but override it
for VM tests.
http://tracker.ceph.com/issues/12320Fixes: #12320
Signed-off-by: Loic Dachary <loic@dachary.org>
Also, clean up the command handling; failure will throw an exception,
so no need to accumulate the errors in an int
Fixes: #12228
Signed-off-by: Dan Mick <dan.mick@redhat.com>
When client capabilities get released, MDS may update corresponding
inodes' client writable range and mark those inodes dirty. The auto
repair test expects MDS to trim inodes from its cache, but MDS can't
trim dirty inodes. So we should flush journal after umount.
Fixes: #12172
Signed-off-by: Yan, Zheng <zyan@redhat.com>
This is for verifying the new layout-writing behaviour. While
we're at it, test that the pre-existing backtrace behaviours
are really happening (updating old_pools)
Signed-off-by: John Spray <john.spray@redhat.com>