Commit Graph

1253 Commits

Author SHA1 Message Date
Sam Lang
c0b03a1d2b fs: Add fuse_use_invalidate_cb option
Enables the fuse_use_invalidate_cb for the
ceph fuse client in the nightlies.  Once these
tests pass reliably, we can flip the boolean
in the config to true.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-01 17:57:09 -06:00
Joe Buck
b33714a5cf test: add hadoop-internal test
Add a test for our branch of the Hadoop code.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by:  Sam Lang <sam.lang@inktank.com>
2013-02-26 21:01:18 -08:00
Joe Buck
f985106d6b test: update java tests
Add additional packages to be installed
for the java tests. This is required
  due to recent changes to teuthology.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
2013-02-26 21:00:00 -08:00
Josh Durgin
0b968f51e8 Add rbd locking/fencing test
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-02-25 18:27:30 -08:00
Sage Weil
8ce9490ed3 rgw: no lockdep on radosgw
This currently crashes because of linking problems, see #4253.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-25 17:32:23 -08:00
Sage Weil
9d92ad6ead rados: crank up mon debugging for the mon thrashing tests
At least for now!

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-25 09:06:53 -08:00
Sage Weil
8dcbf8b685 specify deterministic admin socket path for admin_socket.py tests
Otherwise we don't know the path because the default now includes $pid.
2013-02-24 07:51:11 -08:00
Alex Elder
05a877987e rbd_xfstests: re-enable test 049
Test 049 is now passing, so add it back to the set that runs
nightly.  It only takes 10-15 seconds to run, so it shouldn't
adversely affect the balance of tests run across the three clients.

This resolves:
    http://tracker.ceph.com/issues/4244

Signed-off-by: Alex Elder <elder@inktank.com>
2013-02-22 15:41:30 -06:00
Sage Weil
e747abef4e rbd/librbd: fix the cache mode facet
This appears to have been broken, which means we weren't actually testing
any of the non-default cache modes (writethrough, none).  Whoops!  We were
getting jobs like

task:
- install: null
- ceph:
    conf:
      client:
        rbd cache: false
tasks:
- chef: null
- clock: null
- install: null
- ceph: null
- rbd_fsx:
    clients:
    - client.0
    ops: 5000

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-20 21:17:39 -08:00
Sage Weil
acab068269 add explicit install task before ceph task 2013-02-19 21:02:26 -08:00
Sage Weil
81704665f4 rados: fix ceph_test_filejournal test
Assume it is in the $PATH.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-19 11:26:02 -08:00
Sage Weil
c68f687c80 rbd: use 2 node cluster, mostly.
Not for the collection that includes KVM tests.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:37:29 -08:00
Sage Weil
e250815ace rgw: use 2 node cluster
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:30:28 -08:00
Sage Weil
f05f90b527 rados: use 2 instead of 3 machines
Put the client on the same node as some of the osds.  This shouldn't cause
any problems.  It will make timing a bit different, but if anything that
is probably a good thing.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:24:18 -08:00
Sage Weil
61605f8178 rados: don't pass ceph.conf path to workloadgen
The workunit tasks sets CEPH_CONF as needed, which the binaries already
understand.
2013-02-18 09:31:19 -08:00
Sage Weil
6fa3749302 remove rados-multifs link 2013-02-17 22:40:23 -08:00
Sage Weil
0c8d78236a rados: remove empty multifs collection 2013-02-17 17:13:50 -08:00
tamil
d4d11a505c adding task for rbd cli tests
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-02-11 17:43:09 -08:00
Sage Weil
d5c771a3f3 rados/monthrash: run some mon tests 2013-02-11 07:13:53 -08:00
Sage Weil
8d5ca94331 rados: separate out mon thrasher from thrasher collection
Run a different set of tests.
2013-02-11 07:13:41 -08:00
tamil
bd4f1d5cf7 adding task for iogen
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-02-01 15:39:34 -08:00
Sam Lang
8e566f6f8a marginal/osd_powercycle: OSD powercycle thrashing
Tasks to run while thrashing osds using ipmi to powercycle.
This currently runs in the marginal suite only.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-31 08:55:32 -06:00
Sage Weil
0c87249171 rbd: add rbd_cli_misc with map-snapshot-io.sh 2013-01-30 01:08:23 -08:00
Samuel Just
d50086020f btrfs.yaml: increase osd op thread timeout
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-01-28 19:01:27 -08:00
Josh Durgin
81ed1bc76c rados: add pool_ops workunit to cephtool test 2013-01-28 16:58:54 -08:00
Samuel Just
2f192eaff6 TestRados expects rollback, not snap_rollback
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-01-23 19:42:45 -08:00
Sage Weil
b7cb1b113f rados/thrash: 3 monitors, so that we can thrash them 2013-01-21 15:09:00 -08:00
Sam Lang
ee4a9f25ec marginal/mds_thrasher: Add tests for mds thrasher
Adds a basic set of roles for testing the mds thrasher
with 1 active and 1 standby, and a few basic tests that
stress mds operations.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-18 18:55:22 -06:00
Joao Eduardo Luis
f41b5421f8 add mon_thrash task to kernel and rados thrashers collections
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
2013-01-17 21:00:21 -08:00
Sam Lang
626f6104fd Add a test for the truncate/osd-commit-reply race
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-17 20:57:26 -08:00
Sage Weil
cc7bf1bd18 rados: add osd reply delay injection 2013-01-17 20:54:37 -08:00
Sage Weil
0946a78c01 fix mon clock queue test syntax 2013-01-16 15:40:58 -08:00
Josh Durgin
3c67ee3667 rbd: add test for formatted output from rbd cli 2013-01-16 13:57:10 -08:00
Sage Weil
123a2dc486 rados: adjust socket injection rate down
See #3795.
2013-01-14 16:25:17 -08:00
Joao Eduardo Luis
305cb54a17 suites: rados: multimon: add mon clock skews task yaml files
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
2013-01-10 18:03:03 -08:00
Joe Buck
2fa5d23bb2 test: Hadoop cluster and task config.
Add a 3-node cluster specification and a
task for running wordcount with Hadoop on Ceph.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
2013-01-10 16:58:07 -08:00
Sam Lang
c8f3fd6e25 marginal: Remove broken symlinks
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-10 10:54:23 -06:00
Sam Lang
573f5315ed marginal/multiclient: Matching tests for kclient
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-08 13:57:21 -06:00
Sam Lang
14385a6631 marginal/multiclient: Add three client cluster
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-08 13:54:11 -06:00
Sam Lang
a4df523878 marginal/multiclient: Adding ior test to marginal
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-08 13:51:11 -06:00
Sam Lang
1e03fe18ef marginal/multiclient: Add a test for fsx-mpi
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-08 12:36:03 -06:00
Sam Lang
c07a4cb671 marginal/multiclient: New task to run mdtest
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-01-08 12:23:10 -06:00
Josh Durgin
d6496abfbe remove rbd_header_race test
This no longer works since export does not do a watch, and the race is
being closed a different way not detectable by this test.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-01-04 10:18:48 -08:00
Sage Weil
1a878611ac regression: include nfs suite 2013-01-04 09:54:27 -08:00
David Zafman
aaa03bbcd2 qa: Add knfsd reexport suite
Feature http://tracker.newdream.net/issues/3621

Signed-off-by: David Zafman <david.zafman@inktank.com>
2013-01-03 12:57:20 -08:00
Alex Elder
eee795c0ae rbd_xfstests.yaml: drop test 186
Stop running test 186.  It keeps failing in nightly runs, unable 
to unmount the scratch file system during setup.  As such, the
problem may be in an earlier test.

See http://tracker.newdream.net/issues/3685

Signed-off-by: Alex Elder <elder@inktank.com>
2013-01-03 11:14:13 -06:00
Sage Weil
3686371ed3 rados: add test_filejournal
This writes to /tmp by default; should be ok plana, since it's / and not
tmpfs.
2013-01-02 11:49:38 -08:00
Sage Weil
7d70dd119b Revert "kernel: move fsync test to marginal suite until it works"
This reverts commit acb91f7d0d.

Now fixed in master branch, commit b4d3bd06d4
2012-12-31 10:17:09 -08:00
Sage Weil
acb91f7d0d kernel: move fsync test to marginal suite until it works 2012-12-28 08:09:15 -08:00
Sage Weil
02e4eeff3f kernel: move fsx to marginal suite until it passese 2012-12-28 08:08:42 -08:00
Sage Weil
80bcaa2905 rados: add filestore_idempotent test with journal aio = true 2012-12-27 10:53:54 -08:00
Yehuda Sadeh
677a7a5a57 rgw: add swift tasks
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2012-12-21 14:00:45 -08:00
Sage Weil
0953ce5315 rados: add cephtool test 2012-12-20 10:58:56 -08:00
Greg Farnum
c9b8151072 add an fsync-tester workunit to the fuse and kclient suites
Signed-off-by: Greg Farnum <greg@inktank.com>
2012-12-14 15:28:11 -08:00
Greg Farnum
673b682055 put fsx back in the kernel suite. Looks like this was lost accidentally?
Signed-off-by: Greg Farnum <greg@inktank.com>
2012-12-14 15:27:20 -08:00
Josh Durgin
f7b26958e5 Merge branch 'next' 2012-12-10 12:06:38 -08:00
Josh Durgin
1bdd5c3bc9 Fix qemu options for xfstests
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2012-12-10 12:06:32 -08:00
Josh Durgin
4acc0789c9 Merge branch 'next' 2012-12-05 16:08:51 -08:00
Josh Durgin
50af473485 Add xfstests on rbd inside qemu
We may want to prune these later, since they take a long time.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2012-12-05 15:58:44 -08:00
tamil
8e78cd8232 dropping xfs test 183 for now as it causes nightly failure
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2012-12-04 16:26:34 -08:00
tamil
cea082c690 dropping test 183 as it is causing nightly failures
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2012-12-04 16:20:50 -08:00
Sage Weil
307d291aa0 kernel: add kclient ffsb back in 2012-12-04 16:08:30 -08:00
Sage Weil
9aacaf776e add mon-thrasher singleton 2012-11-29 01:15:44 +00:00
Sage Weil
71361ac67d kernel: add rbd map/unmap test 2012-11-28 15:25:32 -08:00
Sage Weil
e1a9a44027 fix objectcacher stress test 2012-11-27 17:03:12 -08:00
Josh Durgin
a4c41e20f0 Add objectcacher stress testing
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2012-11-26 11:08:44 -08:00
Sage Weil
9f7ab6e811 kernel: remove serial rbd_xfstests job 2012-11-25 14:43:21 -08:00
Sage Weil
7ff17ff03d xfstests: remove 179 and 182 from parallel test
2012-11-25T11:34:51.177 INFO:teuthology.task.rbd.client.2.out:179        [failed, exit status 1] - output mismatch (see 179.out.bad)
2012-11-25T11:34:51.181 INFO:teuthology.task.rbd.client.2.out:--- 179.out       2012-11-25 10:44:54.273586464 -0800
2012-11-25T11:34:51.181 INFO:teuthology.task.rbd.client.2.out:+++ 179.out.bad   2012-11-25 11:34:41.360151613 -0800
2012-11-25T11:34:51.181 INFO:teuthology.task.rbd.client.2.out:@@ -1 +1,5 @@
2012-11-25T11:34:51.181 INFO:teuthology.task.rbd.client.2.out: QA output created by 179
2012-11-25T11:34:51.182 INFO:teuthology.task.rbd.client.2.out:+umount: /tmp/cephtest/scratch_mnt.WNYZwsbUfT: device is busy.
2012-11-25T11:34:51.182 INFO:teuthology.task.rbd.client.2.out:+        (In some cases useful info about processes that use
2012-11-25T11:34:51.182 INFO:teuthology.task.rbd.client.2.out:+         the device is found by lsof(8) or fuser(1))
2012-11-25T11:34:51.182 INFO:teuthology.task.rbd.client.2.out:+failed to unmount /dev/rbd2
2012-11-25T11:34:52.587 INFO:teuthology.task.rbd.client.2.out:182        [failed, exit status 1] - output mismatch (see 182.out.bad)
2012-11-25T11:34:52.591 INFO:teuthology.task.rbd.client.2.out:--- 182.out       2012-11-25 10:44:54.293587298 -0800
2012-11-25T11:34:52.591 INFO:teuthology.task.rbd.client.2.out:+++ 182.out.bad   2012-11-25 11:34:42.770180618 -0800
2012-11-25T11:34:52.592 INFO:teuthology.task.rbd.client.2.out:@@ -1 +1,5 @@
2012-11-25T11:34:52.592 INFO:teuthology.task.rbd.client.2.out: QA output created by 182
2012-11-25T11:34:52.592 INFO:teuthology.task.rbd.client.2.out:+umount: /tmp/cephtest/scratch_mnt.WNYZwsbUfT: device is busy.
2012-11-25T11:34:52.592 INFO:teuthology.task.rbd.client.2.out:+        (In some cases useful info about processes that use
2012-11-25T11:34:52.592 INFO:teuthology.task.rbd.client.2.out:+         the device is found by lsof(8) or fuser(1))
2012-11-25T11:34:52.592 INFO:teuthology.task.rbd.client.2.out:+failed to unmount /dev/rbd2
2012-11-25 14:42:56 -08:00
Sage Weil
ffd19b6448 run xfstests on 3 parallel clients 2012-11-21 07:44:14 -08:00
Alex Elder
8bb3a15089 rbd_xfstests: fix the test list, drop 181
The main purpose of this was to eliminate test 181 from the list.
But I also found the test list that was there was broken (it
that too...
included a spurious 'TESTS="${TESTS}' so I fixed that too.

Signed-off-by: Alex Elder <elder@inktank.com>
2012-11-20 15:48:57 -06:00
Sage Weil
837ab3c7f8 xfstests: disable 174
2012-11-18T11:11:53.032 INFO:teuthology.orchestra.run.out:174    [failed, exit status 1] - output mismatch (see 174.out.bad)
2012-11-18T11:11:53.035 INFO:teuthology.orchestra.run.out:--- 174.out   2012-11-18 09:40:51.869769691 -0800
2012-11-18T11:11:53.035 INFO:teuthology.orchestra.run.out:+++ 174.out.bad       2012-11-18 11:11:51.017195883 -0800
2012-11-18T11:11:53.036 INFO:teuthology.orchestra.run.out:@@ -1,11 +1,5 @@
2012-11-18T11:11:53.036 INFO:teuthology.orchestra.run.out: QA output created by 174
2012-11-18T11:11:53.036 INFO:teuthology.orchestra.run.out:-# testing 8 32 65 3 1 1 0 fail ....
2012-11-18T11:11:53.036 INFO:teuthology.orchestra.run.out:-# streaming
2012-11-18T11:11:53.036 INFO:teuthology.orchestra.run.out:-# sync AGs...
2012-11-18T11:11:53.036 INFO:teuthology.orchestra.run.out:-# checking stream AGs...
2012-11-18T11:11:53.036 INFO:teuthology.orchestra.run.out:-+ expected failure, matching AGs
2012-11-18T11:11:53.036 INFO:teuthology.orchestra.run.out:-# testing 8 32 65 3 1 0 1 fail ....
2012-11-18T11:11:53.036 INFO:teuthology.orchestra.run.out:-# streaming
2012-11-18T11:11:53.036 INFO:teuthology.orchestra.run.out:-# sync AGs...
2012-11-18T11:11:53.037 INFO:teuthology.orchestra.run.out:-# checking stream AGs...
2012-11-18T11:11:53.037 INFO:teuthology.orchestra.run.out:-+ expected failure, matching AGs
2012-11-18T11:11:53.037 INFO:teuthology.orchestra.run.out:+umount: /tmp/cephtest/scratch_mnt.0ZO5vZAvBE: device is busy.
2012-11-18T11:11:53.037 INFO:teuthology.orchestra.run.out:+        (In some cases useful info about processes that use
2012-11-18T11:11:53.037 INFO:teuthology.orchestra.run.out:+         the device is found by lsof(8) or fuser(1))
2012-11-18T11:11:53.037 INFO:teuthology.orchestra.run.out:+failed to unmount /dev/rbd2
2012-11-18 14:35:01 -08:00
Sage Weil
841a2895a2 xfstests: explicitly enumerate test list 2012-11-18 14:34:33 -08:00
Sage Weil
43b57dec98 check ceph-mon for leaks 2012-11-15 22:04:20 -08:00
Samuel Just
2770ef7272 peer.yaml: assumes osd_pool_default_min_size is 1
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2012-11-13 12:29:24 -08:00
Joe Buck
bb2924d431 libcephfs_java.yaml: Adding the libcephfs-java test to the suite.
Signed-off-by: Joe Buck <jbbuck@gmail.com>
2012-11-12 10:07:02 -08:00
Sage Weil
aabca849ac fs/verify: check for ceph-fuse leaks 2012-11-11 07:13:55 -08:00
Josh Durgin
fa1e434812 Add fsstress with btrfs and ext4 on krbd
xfstests already does xfs on krbd
2012-10-24 16:38:32 -07:00
Sage Weil
ea2182b7d8 fix kernel symlink for fs 2012-10-15 11:58:22 -07:00
Sage Weil
2e1296785f regression: update for new kernel collections 2012-10-10 09:46:32 -07:00
Sage Weil
fe313a35ed kernel: add msgr failure facet to rbd collection 2012-10-09 14:22:26 -07:00
Sage Weil
bc6244af6d kernel: break basic collection into rbd, kclient 2012-10-09 14:21:38 -07:00
Yehuda Sadeh
6a0979dc42 radsogw-admin: bump up logging
debug ms = 1
debug objclass = 20

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2012-10-05 16:49:27 -07:00
Sage Weil
f868718eb2 rbd: reduce fsx runtimes
These go way slower now that they are doing the cloning stuff.
2012-10-02 10:05:53 -07:00
Josh Durgin
38f7f3dd02 rbd: add test for reading an up to date header
This is racy, but in all my test runs it has produced the
problem successfully.
2012-09-27 13:55:50 -07:00
Sage Weil
6a2be38885 smoke: add cls unit tests in validator 2012-09-21 08:38:05 -07:00
Sage Weil
8eaaddcfea rados: run class unit tests through validators 2012-09-21 08:36:58 -07:00
Sage Weil
31a2a83cb4 move rgw tasks to separate suite 2012-09-21 08:36:08 -07:00
Sage Weil
675e9f319b rados: test all rados classes 2012-09-20 15:40:50 -07:00
Sage Weil
cbfb0566ba test_cls_rbd has moved 2012-09-20 15:40:12 -07:00
tamil
b6f5d12ffb changed the debug value for mds from 10 to 20
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2012-09-10 15:45:30 -07:00
Sage Weil
e386fb9b57 rbd: add msgr failure injection 2012-08-24 14:49:05 -07:00
Sage Weil
d8bc55eea4 avoid doing filestore idempotency tester 2x w/ and w/o msgr failures 2012-08-24 14:48:52 -07:00
Sage Weil
92d3404f84 rados: add msgr failure injection 2012-08-21 18:04:38 -07:00
Sage Weil
14ce35a197 move kclient + blogbench to marginal
Periodically fails #1945
2012-08-21 13:01:28 -07:00
Sage Weil
035f49a076 include mds debugging on ffsb
hopefully we can track down #1947
2012-08-21 10:57:32 -07:00
Sage Weil
aa4ea5fa4f marginal: remove verify collection (unused) 2012-08-20 13:54:36 -07:00
Sage Weil
c2ff66ebe5 crank up pjd debugging 2012-07-31 20:37:00 -07:00
Sage Weil
cd0b5278e5 separate regression suite into topical categories rados, rbd, fs 2012-07-31 09:55:45 -07:00
Sage Weil
85187edd5e add osd-recovery-incomplete 2012-07-28 10:54:52 -07:00
Sage Weil
0b8b58fffc fix adminsocket test 2012-07-27 13:42:51 -07:00
Sage Weil
61ff894cd1 admin-socket: test generic admin socket commands 2012-07-24 21:36:38 -07:00
Sage Weil
deb7a54a27 marginal kclient+ffsb: enable mds logging to catch badess
See #1947
2012-07-22 20:50:12 -07:00
Sage Weil
32353f1388 move misc, blogbench back into active kernel suite
these were removed from regression ages ago, and only recently put back in
marginal.  they seem fine.
2012-07-22 20:47:39 -07:00
Sage Weil
000fec3b37 move all kernel tests to kernel suite; symlink collections from regression
Make regression a union of other topical suites.
2012-07-22 20:47:32 -07:00
Sage Weil
17e4f751e6 this fails reliably 2012-07-21 20:59:04 -07:00
Sage Weil
ec4ce8bf3e regression: do some tests on ext4 2012-07-20 17:36:43 -07:00
Sage Weil
da770146d8 move cfuse+dbench back to regression for verify, too 2012-07-20 13:14:28 -07:00
Sage Weil
5c5ca4b80e move cfuse + dbench from marginal to regression
Fixed #1737, yay!
2012-07-17 20:05:30 -07:00
Sage Weil
811665a8fc move cfuse + ffsb from marginal to regression
This has had no failures.
2012-07-16 10:35:25 -07:00
Sage Weil
88e2ad3568 move cfuse + fsx back into regression suite
No failures in marginal.  The objectcacher fixes that came out of the
rbd_fsx stuff probably fixed the original problem?
2012-07-16 09:41:35 -07:00
Sage Weil
f6a16f7424 fix wrongly marked down whitelist
This used to have '...or wrong addr' but it doesn't any more.
2012-07-12 16:05:12 -07:00
Josh Durgin
986c0ef9ae rbd: test with layering enabled
RBD_FEATURES=0 hits a bug that's fixed in wip-rbd-parent.

Once that's merged, we can add RBD_FEATURES=0 tests back in.
2012-07-11 10:59:08 -07:00
Sage Weil
fe6ed3e763 ffsb is marginal, remove from smoke suite 2012-07-11 08:27:42 -07:00
Sage Weil
19742ec21c Revert "smoke: add msgr failures"
This reverts commit 9278e231e64f49c3205c2ded8b1f2d3b27265eac.
2012-07-10 20:26:25 -07:00
Sage Weil
657c9db2c5 move cfuse fsx into marginal suite
This should probably pass, given the testing that ObjectCacher gets these
days with librbd_fsx.
2012-07-10 19:58:28 -07:00
Sage Weil
c3e1ab4967 remove suites/stress/basic 2012-07-10 19:58:26 -07:00
Sage Weil
392a6596aa move some old flaky tasks into marginal suite
These were pulled out of regression a while ago.  Put them into the
marginal suite where they will be regularly run and we can evaluate the
severity of the problems they cause.
2012-07-10 19:58:23 -07:00
Sage Weil
98a21cc8f0 move qemu_iozone test to marginal suite 2012-07-06 17:04:44 -07:00
Samuel Just
ed3bd211fe increase thrashosds timeout 2012-07-06 10:02:29 -07:00
Sage Weil
12a1f62364 move other ffsb workloads to marginal suite 2012-07-04 12:47:00 -07:00
Sage Weil
fb9d39d54c move locktest to marginal suite
This fails 1 in 10 times or something like that.
2012-07-03 17:39:59 -07:00
Sage Weil
9278e231e6 smoke: add msgr failures 2012-07-02 14:08:24 -07:00
Sage Weil
b9414b6cf7 fewer hosts for mon tests 2012-07-02 12:26:10 -07:00
Sage Weil
96ccb0605d add rbd_xfstests to kernel suite 2012-07-01 14:27:38 -07:00
Josh Durgin
3321700a9e qemu_iozone: use a larger image
The default is not large enough.
2012-06-29 11:02:38 -07:00
Sage Weil
74b1468fe6 kernel suite 2012-06-29 09:12:51 -07:00
Sage Weil
1db84ddd33 include ceph task in librbd collection 2012-06-25 21:21:33 -07:00
Sage Weil
aa89e6ab32 move kclient_workunit_suites_ffsb to marginal suite
until #1947 is fixed
2012-06-25 15:30:27 -07:00
Josh Durgin
94a6ab8ff3 Add some tests inside qemu for the librbd suite 2012-06-21 18:18:08 -07:00
Josh Durgin
a92306a41a Move librbd tests to rbd suite
This lets us generate jobs with different caching settings instead of
hardcoding them.
2012-06-21 18:16:32 -07:00
Sage Weil
845e6c282f move cfuse + dbench task that triggers #1737 to marginal suite 2012-06-20 11:23:20 -07:00
Sage Weil
a4589c6ab6 don't dup ceph task for new fsx jobs 2012-06-17 08:58:59 -07:00
Josh Durgin
0c40b24c15 Run fsx on rbd with thrashing 2012-06-15 11:59:43 -07:00
Josh Durgin
50e01c18c9 Increase number of ops done by fsx against rbd.
Especially in the no-cache case, this should detect more races. The
fiemap problem is detectable on plana after ~5000 fsx ops.
2012-06-15 11:55:35 -07:00
Sage Weil
9aeac5decd add radosgw-admin test to regression suite
We wrote this test ages ago, but forgot to add it!  Fixed up a few things
that have changed since then.
2012-06-14 14:06:34 -07:00
Josh Durgin
5012b73abb Add test for cls_rbd 2012-06-10 22:37:12 -07:00
Josh Durgin
68f14b400a Test old and new rbd formats 2012-06-10 21:45:59 -07:00
Josh Durgin
04ef5dcc12 Update for new workunit task syntax 2012-06-10 21:26:50 -07:00
Sage Weil
8c08482cc3 regression: fix new rados, rbd test yamls
Don't start cluster twice!
2012-06-08 14:35:56 -07:00
Sage Weil
6df344c7ec run rados, rbd api tests under thrashing 2012-06-08 11:55:30 -07:00
Sage Weil
95ecf40e44 add rados_stress_watch to regression 2012-05-31 16:44:30 -07:00
Sage Weil
43ac8e2c8c rbd_fsx in write-through mode 2012-05-08 16:07:10 -07:00
Sage Weil
c5429bf936 use fewer nodes for the simple singleton tasks 2012-04-30 20:11:44 -07:00
Sage Weil
ff0fe37294 add rbd_fsx_[no]cache jobs to regression suite 2012-04-19 13:33:32 -07:00
Sage Weil
7ae1aefab7 gather logs for cfuse dbench workload, hopefully catch #1737 2012-04-18 15:19:49 -07:00
Sage Weil
6bede298ef dump_stuck: whitelist 'wrongly marked me down'
The test marks the osds down.. they may generate this error if they get
that faster than they get the signal via the daemon-wrapper.
2012-04-15 20:39:56 -07:00
Sage Weil
4498825a48 add rbd_xfstests to regression suite 2012-04-13 22:27:24 -07:00
Sage Weil
55535d04bb move tasks:cfuse_workunit_suites_dbench.yaml to stress pending #1737 fix 2012-04-12 22:56:09 -07:00
Sage Weil
ef17c8c9eb add smoke suite
This could probably be collapsed into a bunch of singleton tasks to make
it simpler to track how many actual jobs result, but it was simpler to
make it a subset of regression.  And probably that'll be easier to maintain
moving forward.

Tried to avoid any jobs that took more than 10 minutes (tho there are a few
in here).  Kept both valgrind and lockdep jobs, and dropped many of those
from the basic collection (esp api tests).

We'll see how long this takes on plana and adjust up/down from there,
depending on how long we want to wait for it.
2012-03-24 21:47:15 -07:00
Sage Weil
24910c3b3b add osd-recovery test 2012-03-24 16:07:47 -07:00
Sage Weil
6bf9c957c9 renamed backfill -> osd_backfill 2012-03-24 16:07:38 -07:00
Sage Weil
01924a22d4 disable rbd thrash workload, #2174 2012-03-16 13:28:44 -07:00
Sage Weil
b4572351a9 Revert "disable rbd thrash workload, #2174"
This reverts commit 1bec416c7c7ff8a6462d94baaba8e7da73e88ab4.

Fixed with #2174
2012-03-15 10:32:39 -07:00
Sage Weil
1bec416c7c disable rbd thrash workload, #2174 2012-03-14 15:51:51 -07:00
Sage Weil
b90354dbab thrash: put client on separate machine from osds
This allows us to run kenrel clients (kclient, rbd) against the thrashing
cluster.
2012-03-13 10:49:33 -07:00
Sage Weil
096427d589 remove dup ceph tasks from new thrash workloads 2012-03-12 15:22:17 -07:00
Sage Weil
51d817fe57 Revert "disable s3tests on valgrind/lockdep until #2103 is fixed"
This reverts commit 9f757ca9511374f6565d74263e242c74e39f8a3f.
2012-03-11 21:32:45 -07:00
Sage Weil
af445189c2 add rbd, kclient workloads to regression thrash collection
This will get us some kernel osd_client osd restart coverage.
2012-03-11 21:28:45 -07:00
Sage Weil
71e6e62ebb fix typo, ceph-fyuse -> ceph-fuse 2012-03-11 13:03:41 -07:00
Sage Weil
b84897e56f use dbench workunit, not the autotest one
The autotest one uses an old tarball that doesn't build.  Workunit assumes
the dbench package is installed.
2012-03-10 20:01:57 -08:00
Sage Weil
9f757ca951 disable s3tests on valgrind/lockdep until #2103 is fixed 2012-03-01 22:04:19 -08:00
Josh Durgin
b2bbede826 dump-stuck: set pg stuck threshold to match test 2012-02-29 15:45:25 -08:00
Sage Weil
722af1a4dd no peer as part of lost_unfound 2012-02-27 14:52:35 -08:00
Sage Weil
9afafdf164 move peer to separate test for now 2012-02-26 17:09:41 -08:00
Sage Weil
6295578f16 lost_unfound: do peer after, until wait_for_clean propagates last_epoch_started
The peer task does wait_for_clean, and then lost_unfound immediately marks
something down.  But the PGs become clean before the replica last_epoch_started
is moved forward in time, which means they block waiting for the now down
OSD.  Needlessly.

Until we fix this, just do the peer test after.
2012-02-25 21:35:31 -08:00
Sage Weil
d944e7ee1e fix lockdep.yaml conf syntax 2012-02-24 21:39:55 -08:00
Sage Weil
5d5a022cc1 run radosgw through valgrind for s3tests 2012-02-24 15:20:00 -08:00
Sage Weil
c9c1a4ab42 do peer test along with lost_unfound 2012-02-24 15:04:27 -08:00
Sage Weil
fc531a9177 rename valgrind -> verify, add in runs under lockdep 2012-02-24 12:49:33 -08:00
Josh Durgin
52a52cf4f2 Add test for 'ceph pg dump_stuck' 2012-02-21 16:21:05 -08:00
Sage Weil
df5f5738e6 add valgrind collection to regression suite
Run a smaller set of tests with valgrind on the mon, osd, and mds.

Valgrind is currently ignoring leaks, but this will pick up use-after-free
and similar badness.
2012-02-21 10:02:44 -08:00
Sage Weil
84bd876cd4 cfuse -> ceph-fuse 2012-02-20 12:49:35 -08:00
Sage Weil
b5668cf64b thrashing: whitelist 'objects unfound and apparently lost' message
This can happen when we mark OSDs down... if the objects are found when
the osds come back up then we're fine.  if not, it won't go clean, and the
test will fail for that reason.
2012-02-18 13:56:47 -08:00
Sage Weil
bc0e4068be add regression/multifs collection; run rgw tests under both xfs and btrfs 2012-02-14 21:49:26 -08:00
Sage Weil
3fbb5714cc rename fs files 2012-02-14 08:58:30 -08:00
Sage Weil
10a94d2b40 regression/thrash on xfs and btrfs both 2012-02-13 16:45:04 -08:00
Sage Weil
04f3e44514 btrfs: 1 -> fs: btrfs 2012-02-13 15:29:52 -08:00
Sage Weil
4fad1317da add snap thrashing covering a small number of objects
The snaps-many-objects has a relatively low density of ops-per-object. This
hammers on a small number of them and does a better job of validating the
correctness wrt snaps.
2012-02-11 13:40:44 -08:00
Sage Weil
e841f9c713 move snap thrashing back into regression suite 2012-02-11 13:39:46 -08:00
Sage Weil
6028b36376 move kclient_workunit_suites_blogbench.yaml to stress suite
This is consistently failing due to an mds/kclient interaction.
2012-02-10 16:40:03 -08:00
Sage Weil
0b68dbca9a add backfill test 2012-01-31 16:37:57 -08:00
Sage Weil
9da01185a4 make 6-osd-2-machine simpler... single monitor 2012-01-28 21:11:32 -08:00
Josh Durgin
06c8fdc9e6 regression: add admin socket test for objecter requests. 2012-01-27 18:08:45 -08:00
Sage Weil
fe2834f603 remove snap thrashing from regression suite for time being 2012-01-25 14:04:04 -08:00
Samuel Just
5bb9a9d615 Add small cluster thrashing tasks
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2012-01-17 15:08:02 -08:00
Sage Weil
b2c07d8a60 add simple thrash workload to regression suite 2012-01-16 15:09:29 -08:00
Sage Weil
8e126db1da mon.0 -> mon.a 2012-01-16 11:08:34 -08:00
Sage Weil
43da161d2b mds.0 -> mds.a 2012-01-16 11:08:19 -08:00
Yehuda Sadeh
2317b9ae91 add rgw readwrite and roundtrip tasks 2012-01-10 15:30:53 -08:00
Sage Weil
fbf791213b do not put monitors on the same nodes as clients
Otherwise, for kernel clients (rbd or kclient), ceph-mon can cause a deadlock when it calls sync(2).
2012-01-07 10:16:39 -08:00
Sage Weil
cfeaef451b move multimon failure thrashing tests into regression
We need to test these nightly.
2012-01-06 15:08:01 -08:00
Josh Durgin
71d5bcbb5a Adjust rados model workloads for new config format 2012-01-03 13:55:36 -08:00
Sage Weil
4da96ff35b rados load-gen workunits 2011-12-13 08:28:33 -08:00
Samuel Just
6b8588b781 Use btrfs for regression tests
Some of the tests (particularly the s3 tests) use very long filenames
which trigger bugs related to ext4 xattr handling.

Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-12-08 15:35:16 -08:00
Tommi Virtanen
dbd7a3b4dc Rename "testrados" task to not begin with "test".
See commit e80c32c442 in
teuthology.git
2011-12-05 10:08:54 -08:00
Josh Durgin
2918b5014c Move kclient multiple_rsync workunit to stress collection.
Bug #1760 keeps being triggered by this.
2011-11-29 16:20:55 -08:00
Sage Weil
24ee09b05c Revert "more logs (yuck) for #1682"
This reverts commit ea00114f08.
2011-11-21 21:58:13 -08:00
Sage Weil
ea00114f08 more logs (yuck) for #1682 2011-11-20 15:24:17 -08:00
Sage Weil
dcab329b5f fix conf thinko
'int' object has no attribute 'iteritems'
2011-11-19 19:28:26 -08:00
Sage Weil
a08e7f126a regression/basic/tasks/kclient_workunit_misc: turn on mds log
Hopefully will catch #1682
2011-11-19 13:56:17 -08:00
Sage Weil
13c98df920 regression/basic/tasks/cfuse_dbench: turn up client debugging
Hopefully we'll hit #1737...
2011-11-19 13:45:28 -08:00
Josh Durgin
7c8a7a89e0 Move multimds tests to a new suite, 'experimental'.
This suite is for testing features that aren't expected to be stable yet.
2011-11-18 10:21:38 -08:00
Josh Durgin
94100ad072 Move collections into separate suites
For now, there are just two suites:
  * regression - tests that should always pass
  * stress     - tests that have problems for one reason or another
2011-11-17 17:57:57 -08:00