Sage Weil
f84458bd93
ceph_manager: increase osd revival timeout from 75 -> 150 seconds
...
Saw this fail on a firefly run just bc of valgrind slowness. Make the
timeout higher!
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-30 21:34:45 -07:00
Sage Weil
7c3fc404d6
valgrind: ignore ec plugin factory leaks
...
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-30 18:14:08 -07:00
John Spray
9e2c82824b
Merge pull request #300 from ceph/wip-mpi-vers
...
task/mpi: Explicit check for version mismatch
2014-07-30 23:37:39 +01:00
Zack Cerza
094cd5b3ba
Merge pull request #302 from ceph/wip-8850
...
tell us if you are really not running any more
2014-07-30 14:31:27 -06:00
Alfredo Deza
4686115c48
add another call to use "ps aux"
...
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
2014-07-30 16:28:02 -04:00
Alfredo Deza
ec342d8146
tell us if you are really not running any more
...
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
2014-07-30 16:17:01 -04:00
Sage Weil
1c7c2eb4b9
nuke: allow nuking vpm hosts
...
Skip the console check, though.
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-30 11:12:09 -07:00
Sage Weil
ec2f94901f
kcephfs: thrash mds too
...
And shorten names.
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-30 06:55:43 -07:00
John Spray
491817961a
task/mpi: Explicit check for version mismatch
...
Instead of proceeding to have mpiexec fail or hang,
do an up-front check that the MPI version is the same
on all of the nodes in the test.
Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-30 14:29:11 +01:00
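The up-front check described in the commit above can be sketched as follows. This is a minimal illustration, not the code from the task itself: the function name `check_mpi_versions` and the host-to-version mapping are assumptions, and gathering the version string from each node is left out.

```python
def check_mpi_versions(versions_by_host):
    """Fail fast if the hosts do not all report the same MPI version.

    versions_by_host is a hypothetical mapping of hostname -> version
    string; how the strings are collected is outside this sketch.
    """
    distinct = set(versions_by_host.values())
    if len(distinct) > 1:
        details = ", ".join(
            "{0}={1}".format(host, version)
            for host, version in sorted(versions_by_host.items())
        )
        # Raise before mpiexec is ever started, so the mismatch is
        # reported directly instead of showing up as a hang.
        raise RuntimeError("MPI version mismatch: " + details)
    # Return the single agreed version, or None if no hosts were given.
    return distinct.pop() if distinct else None
```

Raising before launch surfaces the mismatch as a clear error rather than a late failure or hang.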
Sage Weil
79357060f1
Merge pull request #53 from ceph/wip-leaks
...
use notcmalloc to fix leak checking
2014-07-29 15:27:54 -07:00
Warren Usui
6d01f11a02
Merge pull request #67 from ceph/wip_fix_emperor-x
...
Fixed -x in the suite, this fixed http://tracker.ceph.com/issues/8862
2014-07-29 14:35:53 -07:00
Yuri Weinstein
5d2ce7fd87
Fixed -x in the suite, this fixed http://tracker.ceph.com/issues/8862
...
Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>
2014-07-29 14:14:32 -07:00
Zack Cerza
b8687a54aa
Fix package_version_for_hash() call
...
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-07-29 14:43:41 -06:00
John Spray
b354631e6e
Merge pull request #62 from ceph/wip-standbyreplay
...
suites/fs: add a standbyreplay configuration
2014-07-29 18:14:02 +01:00
John Spray
0147c74671
task/mds_journal_migration: fix cleanup
...
Previously this was remounting in the background
which meant that if a workunit was executed immediately
afterwards it could put files in the mount dir before
the FS was mounted, causing the mount to fail.
Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-29 13:11:12 +01:00
John Spray
431ae2f583
suites/fs: add a standbyreplay configuration
...
A place for any tests which should be run with
standbyreplay MDSs in play. Initially just
the mds_journal_migration task, to cover
the #8811 case.
Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-29 11:34:50 +01:00
Yuri Weinstein
3f3da880d4
Update README
2014-07-28 16:28:14 -07:00
Yuri Weinstein
f2b742268a
Update README
2014-07-28 16:24:57 -07:00
Josh Durgin
7dcb8bd6b2
dumpling-firefly-x: document structure of suite
...
Hopefully this makes it easier to digest and maintain
than reading yaml fragments.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>
2014-07-28 16:14:36 -07:00
John Spray
cc7c9b2e8b
task/workunit: add sudo to dir delete
...
This failure to delete used to be hidden in
a catch-all exception handler that was
removed in 396dc073.
This was causing failures in kcephfs suite
like:
Command failed on plana96 with status 1: 'rm -rf -- /home/ubuntu/cephtest/mnt.0/client.0'
Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-27 12:04:23 +01:00
Sage Weil
c88e1cb1d6
rados/singleton/all/rest-api: add mds.a
...
The test expects an MDS.
Signed-off-by: Sage Weil <sage@inktank.com>
2014-07-26 09:05:25 -07:00
tamil
05fb534206
added a new test for rgw_readwrite with front end as civetweb instead of apache
...
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2014-07-25 21:04:12 -07:00
tamil
ac8ab1e6fd
added a new test for rgw_readwrite with front end as civetweb instead of apache
2014-07-25 20:43:39 -07:00
Josh Durgin
ef6f0de7c7
Merge pull request #66 from ceph/wip_tune_upgrade-firefly
...
Changed upgrade order and re-enabled rados/tests.sh
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2014-07-25 16:05:53 -07:00
Yuri Weinstein
531f89caf1
Changed upgrade order and re-enabled rados/tests.sh
...
Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>
2014-07-25 15:58:45 -07:00
Zack Cerza
2999a4d492
Merge pull request #296 from ceph/wip-standby-mds
...
Wip standby mds
2014-07-25 16:44:34 -06:00
Sage Weil
883f2a76f4
task/osd_recovery: use rbd instead of metadata pool
...
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-25 11:39:42 -07:00
Sage Weil
4e56521ebe
task/osd_backfill: use 'rbd' instead of 'data' pool
...
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-25 11:33:25 -07:00
John Spray
2ed1131fc9
task/workunit: general tidy
...
* PEP8-ize whitespace
* specialize a catch-all except:
* don't alias 'misc' to 'teuthology'
Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-25 13:33:31 +01:00
John Spray
396dc073cb
task/workunit: clean up dir deletion
...
This was always trying to remove the mountpoint,
and then swallowing the exception and printing
it to the log.
Instead:
* Ensure it's only trying to delete mnt if it created it
* Don't swallow those exceptions: if deleting the dirs
fails then something has gone wrong and it should bubble
up.
Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-25 13:33:31 +01:00
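The cleanup policy described in the commit above (delete only what was created, and let failures propagate) can be sketched like this. The helper names `make_dir_tracked` and `cleanup` are hypothetical, and the local filesystem stands in for the remote commands the real task would run.

```python
import os
import shutil
import tempfile


def make_dir_tracked(path):
    """Create path if it is missing; return True only if this call created it."""
    if os.path.isdir(path):
        return False
    os.makedirs(path)
    return True


def cleanup(path, created_here):
    """Remove path only if we created it.

    There is deliberately no try/except: if deletion fails, something
    has gone wrong and the exception should reach the caller.
    """
    if created_here:
        shutil.rmtree(path)
```

A caller records the return value of `make_dir_tracked` at setup time and passes it to `cleanup` at teardown, so a pre-existing mountpoint is never removed.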
John Spray
a96a7b027c
task/cephfs: generalise Filesystem for multi-MDS
...
This enables tasks like mds_journal_migration to be
run in an environment with standby-replay MDSs present.
Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-25 13:33:31 +01:00
Sage Weil
175057f3bc
rados/verify: increase api tests timeout
...
This has been timing out for weeks to months (hard to say since before that
we were consistently hitting an OSD crash on shutdown). The runs I've looked
at are making progress and seem fine ... just slow because of valgrind and
because of thrashing.
Backport: firefly
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-24 18:57:22 -07:00
Sage Weil
279f39d4c1
use notcmalloc builder for valgrind leak checking
...
This reverts 453d72b294. It turns out that
tcmalloc defeats the leak checking in valgrind.
Signed-off-by: Sage Weil <sage@inktank.com>
2014-07-24 11:57:06 -07:00
Zack Cerza
c2fc561427
Remove oddly-placed, broken import
...
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-07-24 10:03:38 -06:00
Zack Cerza
479a1fe676
Rename CephState to DaemonGroup and move to orchestra
...
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-07-23 19:59:16 -06:00
Zack Cerza
7d0adcee2d
Move DaemonState to teuthology.orchestra
...
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-07-23 19:59:16 -06:00
Sage Weil
16ad68d0fd
clusters/fixed-3-cephfs.yaml: put client.0 on separate node, remove client.1
...
Only one job needs 2 clients; put them on separate hosts. They should not
be colocated with the OSDs!
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-23 18:46:30 -07:00
Sage Weil
a546e20761
rados/objectstore: move objectstore related tests from singleton-nomsgr
...
Just so they're in a nice convenient bunch.
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-23 11:04:02 -07:00
tamil
367d4da083
added debug messages for radosbench.yaml
...
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2014-07-22 16:15:19 -07:00
Zack Cerza
efce34186c
Fail sooner if an invalid config_yaml is passed
...
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-07-22 15:47:53 -06:00
Yuri Weinstein
456aff87fc
Removed rados/test.sh for now
...
Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>
2014-07-22 14:05:38 -07:00
Zack Cerza
0e2149c86e
Remove kcon_most
...
It was unused.
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-07-22 13:57:40 -06:00
Zack Cerza
d6e5895fb2
Merge pull request #293 from ceph/wip-ship-utilities
...
move ship_utilities to install task (from ceph test)
2014-07-22 13:49:57 -06:00
Sage Weil
828bb2a6ea
ceph_deploy: use new ship_utilities location
...
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-22 10:55:10 -07:00
John Spray
344d5974fd
Remove mds role from rados & rgw tests
...
Follow up to a24efaef, remove the mds from
the (mainly singleton) test configs that
explicitly list it.
Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-22 16:44:02 +01:00
Yuri Weinstein
fb6ab3bcf4
Modified print task
...
Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>
2014-07-21 21:35:53 -07:00
Josh Durgin
5f452829a3
Merge pull request #64 from ceph/wip_dumpling-firefly-x_upgrade_suite
...
Wip dumpling firefly x upgrade suite
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2014-07-21 16:22:19 -07:00
Josh Durgin
13824408f0
upgrade/dumpling-firefly-x: remove unnecessary files
...
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2014-07-21 16:18:58 -07:00
Josh Durgin
e962fa7552
upgrade/dumpling-firefly-x: upgrade clients to firefly
...
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2014-07-21 16:14:20 -07:00
Sage Weil
c6ee02d0e1
upgrade/dumpling-firefly-x: use correct tests against final workload
...
No reason to run old tests against new code here.
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-21 14:31:51 -07:00