Commit Graph

3557 Commits

Author SHA1 Message Date
Sage Weil
f84458bd93 ceph_manager: increase osd revival timeout from 75 -> 150 seconds
Saw this fail on a firefly run just bc of valgrind slowness.  Make the
timeout higher!

Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-30 21:34:45 -07:00
Sage Weil
7c3fc404d6 valgrind: ignore ec plugin factory leaks
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-30 18:14:08 -07:00
John Spray
9e2c82824b Merge pull request #300 from ceph/wip-mpi-vers
task/mpi: Explicit check for version mismatch
2014-07-30 23:37:39 +01:00
Zack Cerza
094cd5b3ba Merge pull request #302 from ceph/wip-8850
tell us if you are really not running any more
2014-07-30 14:31:27 -06:00
Alfredo Deza
4686115c48 add another call to use "ps aux"
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
2014-07-30 16:28:02 -04:00
Alfredo Deza
ec342d8146 tell us if you are really not running any more
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
2014-07-30 16:17:01 -04:00
Sage Weil
1c7c2eb4b9 nuke: allow nuking vpm hosts
Skip the console check, though.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-30 11:12:09 -07:00
Sage Weil
ec2f94901f kcephfs: thrash mds too
And shorten names.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-30 06:55:43 -07:00
John Spray
491817961a task/mpi: Explicit check for version mismatch
Instead of proceeding to have mpiexec fail or hang,
do an up-front check that the MPI version is the same
on all of the nodes in the test.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-30 14:29:11 +01:00
Sage Weil
79357060f1 Merge pull request #53 from ceph/wip-leaks
use notcmalloc to fix leak checking
2014-07-29 15:27:54 -07:00
Warren Usui
6d01f11a02 Merge pull request #67 from ceph/wip_fix_emperor-x
Fixed -x in the suite, this fixed http://tracker.ceph.com/issues/8862
2014-07-29 14:35:53 -07:00
Yuri Weinstein
5d2ce7fd87 Fixed -x in the suite, this fixed http://tracker.ceph.com/issues/8862
Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>
2014-07-29 14:14:32 -07:00
Zack Cerza
b8687a54aa Fix package_version_for_hash() call
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-07-29 14:43:41 -06:00
John Spray
b354631e6e Merge pull request #62 from ceph/wip-standbyreplay
suites/fs: add a standbyreplay configuration
2014-07-29 18:14:02 +01:00
John Spray
0147c74671 task/mds_journal_migration: fix cleanup
Previously this was remounting in the background
which meant that if a workunit was executed immediately
afterwards it could put files in the mount dir before
the FS was mounted, causing the mount to fail.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-29 13:11:12 +01:00
John Spray
431ae2f583 suites/fs: add a standbyreplay configuration
A place for any tests which should be run with
standbyreplay MDSs in play.  Initially just
the mds_journal_migration task, to cover
the #8811 case.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-29 11:34:50 +01:00
Yuri Weinstein
3f3da880d4 Update README
2014-07-28 16:28:14 -07:00
Yuri Weinstein
f2b742268a Update README
2014-07-28 16:24:57 -07:00
Josh Durgin
7dcb8bd6b2 dumpling-firefly-x: document structure of suite
Hopefully this makes it easier to digest and maintain
than reading yaml fragments.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>
2014-07-28 16:14:36 -07:00
John Spray
cc7c9b2e8b task/workunit: add sudo to dir delete
This failure to delete used to be hidden in
a catch-all exception handler that was
removed in 396dc073.

This was causing failures in kcephfs suite
like:
Command failed on plana96 with status 1: 'rm -rf -- /home/ubuntu/cephtest/mnt.0/client.0'

Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-27 12:04:23 +01:00
Sage Weil
c88e1cb1d6 rados/singleton/all/rest-api: add mds.a
The test expects an MDS.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-07-26 09:05:25 -07:00
tamil
05fb534206 added a new test for rgw_readwrite with front end as civetweb instead of apache
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2014-07-25 21:04:12 -07:00
tamil
ac8ab1e6fd added a new test for rgw_readwrite with front end as civetweb instead of apache
2014-07-25 20:43:39 -07:00
Josh Durgin
ef6f0de7c7 Merge pull request #66 from ceph/wip_tune_upgrade-firefly
Changed upgrade order and re-enabled rados/tests.sh

Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2014-07-25 16:05:53 -07:00
Yuri Weinstein
531f89caf1 Changed upgrade order and re-enabled rados/tests.sh
Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>
2014-07-25 15:58:45 -07:00
Zack Cerza
2999a4d492 Merge pull request #296 from ceph/wip-standby-mds
Wip standby mds
2014-07-25 16:44:34 -06:00
Sage Weil
883f2a76f4 task/osd_recovery: use rbd instead of metadata pool
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-25 11:39:42 -07:00
Sage Weil
4e56521ebe task/osd_backfill: use 'rbd' instead of 'data' pool
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-25 11:33:25 -07:00
John Spray
2ed1131fc9 task/workunit: general tidy
* PEP8-ize whitespace
* specialize a catch-all except:
* don't alias 'misc' to 'teuthology'

Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-25 13:33:31 +01:00
John Spray
396dc073cb task/workunit: clean up dir deletion
This was always trying to remove the mountpoint,
and then swallowing the exception and printing
it to the log.

Instead:
 * Ensure it's only trying to delete mnt if it created it
 * Don't swallow those exceptions: if deleting the dirs
   fails then something has gone wrong and it should bubble
   up.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-25 13:33:31 +01:00
John Spray
a96a7b027c task/cephfs: generalise Filesystem for multi-MDS
This enables tasks like mds_journal_migration to be
run in an environment with standby-replay MDSs present.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-25 13:33:31 +01:00
Sage Weil
175057f3bc rados/verify: increase api tests timeout
This has been timing out for weeks to months (hard to say since before that
we were consistently hitting an OSD crash on shutdown).  The runs I've looked
at are making progress and seem fine ... just slow because of valgrind and
because of thrashing.

Backport: firefly
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-24 18:57:22 -07:00
Sage Weil
279f39d4c1 use notcmalloc builder for valgrind leak checking
This reverts 453d72b294.  It turns out that
tcmalloc defeats the leak checking in valgrind.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-07-24 11:57:06 -07:00
Zack Cerza
c2fc561427 Remove oddly-placed, broken import
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-07-24 10:03:38 -06:00
Zack Cerza
479a1fe676 Rename CephState to DaemonGroup and move to orchestra
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-07-23 19:59:16 -06:00
Zack Cerza
7d0adcee2d Move DaemonState to teuthology.orchestra
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-07-23 19:59:16 -06:00
Sage Weil
16ad68d0fd clusters/fixed-3-cephfs.yaml: put client.0 on separate node, remove client.1
Only one job needs 2 clients; put them on separate hosts.  They should not
be colocated with the OSDs!

Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-23 18:46:30 -07:00
Sage Weil
a546e20761 rados/objectstore: move objectstore related tests from singleton-nomsgr
Just so they're in a nice convenient bunch.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-23 11:04:02 -07:00
tamil
367d4da083 added debug messages for radosbench.yaml
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2014-07-22 16:15:19 -07:00
Zack Cerza
efce34186c Fail sooner if an invalid config_yaml is passed
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-07-22 15:47:53 -06:00
Yuri Weinstein
456aff87fc Removed rados/test.sh for now
Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>
2014-07-22 14:05:38 -07:00
Zack Cerza
0e2149c86e Remove kcon_most
It was unused.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
2014-07-22 13:57:40 -06:00
Zack Cerza
d6e5895fb2 Merge pull request #293 from ceph/wip-ship-utilities
move ship_utilities to install task (from ceph test)
2014-07-22 13:49:57 -06:00
Sage Weil
828bb2a6ea ceph_deploy: use new ship_utilities location
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-22 10:55:10 -07:00
John Spray
344d5974fd Remove mds role from rados & rgw tests
Follow up to a24efaef, remove the mds from
the (mainly singleton) test configs that
explicitly list it.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-22 16:44:02 +01:00
Yuri Weinstein
fb6ab3bcf4 Modified print task
Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>
2014-07-21 21:35:53 -07:00
Josh Durgin
5f452829a3 Merge pull request #64 from ceph/wip_dumpling-firefly-x_upgrade_suite
Wip dumpling firefly x upgrade suite

Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2014-07-21 16:22:19 -07:00
Josh Durgin
13824408f0 upgrade/dumpling-firefly-x: remove unnecessary files
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2014-07-21 16:18:58 -07:00
Josh Durgin
e962fa7552 upgrade/dumpling-firefly-x: upgrade clients to firefly
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2014-07-21 16:14:20 -07:00
Sage Weil
c6ee02d0e1 upgrade/dumpling-firefly-x: use correct tests against final workload
No reason to run old tests against new code here.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-21 14:31:51 -07:00