Commit Graph

226 Commits

Author SHA1 Message Date
Sage Weil
79357060f1 Merge pull request #53 from ceph/wip-leaks
use notcmalloc to fix leak checking
2014-07-29 15:27:54 -07:00
Sage Weil
c88e1cb1d6 rados/singleton/all/rest-api: add mds.a
The test expects an MDS.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-07-26 09:05:25 -07:00
Sage Weil
175057f3bc rados/verify: increase api tests timeout
This has been timing out for weeks to months (hard to say since before that
we were consistently hitting an OSD crash on shutdown).  The runs I've looked
at are making progress and seem fine ... just slow because of valgrind and
because of thrashing.

Backport: firefly
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-24 18:57:22 -07:00
Sage Weil
279f39d4c1 use notcmalloc builder for valgrind leak checking
This reverts 453d72b294.  It turns out that
tcmalloc defeats the leak checking in valgrind.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-07-24 11:57:06 -07:00
Sage Weil
a546e20761 rados/objectstore: move objectstore related tests from singleton-nomsgr
Just so they're in a nice convenient bunch.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-23 11:04:02 -07:00
tamil
367d4da083 added debug messages for radosbench.yaml
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2014-07-22 16:15:19 -07:00
John Spray
344d5974fd Remove mds role from rados & rgw tests
Follow up to a24efaef, remove the mds from
the (mainly singleton) test configs that
explicitly list it.

Signed-off-by: John Spray <john.spray@redhat.com>
2014-07-22 16:44:02 +01:00
Sage Weil
2b4f740200 rados/singleton-nomsgr/all/alloc-hint: fix config syntax
Put in [osd] section.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-21 08:36:20 -07:00
Sage Weil
02a6c917ec rados/singleton-nomsgr/all/alloc-hint: enable xfs extsize for test
This is off by default; enable it for this test.

Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-18 13:18:53 -07:00
Sage Weil
0ee2d54533 Merge pull request #46 from ceph/wip-max-backfills
rados/thrash: set osd_max_backfill = 1 sometimes
2014-07-14 14:43:29 -07:00
Sage Weil
3150bba803 rados: test ceph-post-file
Signed-off-by: Sage Weil <sage@redhat.com>
2014-07-11 15:35:56 -07:00
Sage Weil
5e880228fd rados/thrash: set osd_max_backfill = 1 sometimes
Hopefully this will help catch leaks in the recovery reservations.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-07-01 16:09:53 -07:00
Sage Weil
f8282df84d rados/singleton/all/thrash_cache_writeback_forward_none: thrash cache pool drain
Enable/disable cache pool against a running workload.

Backport: firefly
Signed-off-by: Sage Weil <sage@inktank.com>
2014-06-30 13:34:21 -07:00
Samuel Just
77670aeadf thrashers/mapgap.yaml: ignore osd_map_cache_size warning
Signed-off-by: Samuel Just <sam.just@inktank.com>
2014-06-24 10:19:48 -07:00
Samuel Just
09d02abe45 Revert "rados: Fix cache_flush, cache_try_flush, cache_evict and add hit_set_list"
cache_evict is kinda broken at the moment due to snapdir nonsense.  I'm reverting this
test for now until it's fixed so it doesn't obscure the test results.

Bug: 8629
This reverts commit 3dae48c7cd.
2014-06-19 14:25:30 -07:00
David Zafman
3dae48c7cd rados: Fix cache_flush, cache_try_flush, cache_evict and add hit_set_list
Signed-off-by: David Zafman <david.zafman@inktank.com>
2014-06-05 18:24:59 -07:00
Sage Weil
c427af0701 osd_heartbeat_grace from 20 -> 40 when running valgrind
This will mask occasional 'wrongly marked me down' errors when valgrind
is running (it makes things generally slow).

Signed-off-by: Sage Weil <sage@inktank.com>
2014-05-28 10:11:05 -07:00
Sage Weil
a4f00abaa6 rados/singleton-nomsgr/all/objectstore: skip ceph cluster setup
Not necessary; we just need it installed.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-05-20 22:09:48 -07:00
Sage Weil
5be30f0d9b rados/singleton-nomsgr/all/objectstore: clean up
ceph_test_objectstore scribbles in $CWD.  Keep it inside $TESTDIR and
clean up when we are done.

Fixes: #8395
Signed-off-by: Sage Weil <sage@inktank.com>
2014-05-20 22:09:48 -07:00
David Zafman
99e67abc94 rados/thrash: Add pool_snaps variants
Fixes: #8284

Signed-off-by: David Zafman <david.zafman@inktank.com>
2014-05-05 11:18:14 -07:00
Samuel Just
d2d7b941dd cache-snaps.yaml: set target_max_objects to test snap flush/evict
Signed-off-by: Samuel Just <sam.just@inktank.com>
2014-05-02 17:04:09 -07:00
David Zafman
18334ea3d3 rados/thrash: Fix workload of cache-agent-big
Create a log of objects and operate on some of them
(Initial object creation counts against total operations specified)
Add limit to 20 minutes

Fixes: 8263

Signed-off-by: David Zafman <david.zafman@inktank.com>
2014-05-01 14:39:18 -07:00
David Zafman
b080355902 Fix syntax of erasure coded pool creation
Signed-off-by: David Zafman <david.zafman@inktank.com>
2014-05-01 10:52:57 -07:00
Samuel Just
3a2c8886c3 rados: add ec and rep lost_unfound_delete tests
Signed-off-by: Samuel Just <sam.just@inktank.com>
2014-04-22 17:42:17 -07:00
Sage Weil
387110b119 rados/singleton/all/cephtool: whitelist scrub vs split vs agent issue
Signed-off-by: Sage Weil <sage@inktank.com>
2014-04-22 12:44:48 -07:00
Sage Weil
c623b3dd97 rados/thrash: whitelist 'must scrub before tier agent can activate'
Signed-off-by: Sage Weil <sage@inktank.com>
2014-04-18 06:59:27 -07:00
Josh Durgin
7afc277736 rados: include objectstore tests
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2014-04-17 18:58:33 -07:00
Sage Weil
b97c3803c5 rados/thrash: rejection backfill reservations sometimes while thrashing 2014-04-03 18:11:01 -07:00
Sage Weil
946a9685f1 rados/singleton-nomsgr: add multi-backfill reservation rejection test
This triggers #7922.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-04-03 18:08:19 -07:00
Sage Weil
cb699f6a1a rados/monthrash: shorten mon pings so that freezing triggers reconnects
This exercises (tho it doesn't necessarily verify) the mon ping reconnect
code when the thrasher freezes the mons.

See #7888.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-28 16:17:00 -07:00
Samuel Just
68343ee193 rados/thrash/workloads: make cache-agent-big use an ec base pool
Signed-off-by: Samuel Just <sam.just@inktank.com>
2014-03-27 11:51:44 -07:00
Sage Weil
a42e197f5d rados/monthrash: add pool create/delete tests
This is sufficient to trigger #7736 when combined with mon thrashing and
mon msg delay injection.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-03-17 16:16:50 -07:00
Ilya Dryomov
0815e4fa70 rados/singleton-nomsgr: add allocation hint test
Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
2014-03-07 18:32:12 +02:00
Samuel Just
c6a22b33e3 rados/thrash/workloads: enable copy_from for the ec workloads
Signed-off-by: Samuel Just <sam.just@inktank.com>
2014-02-23 14:56:10 -08:00
Samuel Just
b9e1341f10 rados/thrash: add ec-radosbench workload
Signed-off-by: Samuel Just <sam.just@inktank.com>
2014-02-18 13:49:12 -08:00
Samuel Just
386650d983 rados/thrash: add ec workloads
Adds ec workloads for snaps, small objects, read-write.

copy-from is not yet enabled.

Signed-off-by: Samuel Just <sam.just@inktank.com>
2014-02-18 13:45:39 -08:00
Sage Weil
5f8eedf01d rados/thrash: test rados against cache pool with agents
Run rados model against a pool with target_max_objects set so that the
tiering agent is working against us.

Signed-off-by: Sage Weil <sage@inktank.com>
2014-02-17 16:41:25 -08:00
Sage Weil
453d72b294 don't use notcmalloc flavor for valgrind
We now have the suppressions to avoid this.  Yay!

Signed-off-by: Sage Weil <sage@inktank.com>
2014-02-11 09:44:17 -08:00
Sage Weil
47dcdb324e rados/thrash: add cache + snaps workload
Signed-off-by: Sage Weil <sage@inktank.com>
2014-01-23 07:08:10 -08:00
Josh Durgin
04fe727130 rados: add rados tool test
For now this just tests listomapvals, but it may be expanded in the
future.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-12-31 11:44:46 -08:00
Sage Weil
a47d16ec8c rados/trash: add cache workload
Signed-off-by: Sage Weil <sage@inktank.com>
2013-12-22 09:46:03 -08:00
Sage Weil
018164ad04 rados/thrash: add small objects ceph_test_rados workload
With small objects we can generate many more ops more quickly and stress
the cluster a bit differently.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-12-19 21:47:19 -08:00
David Zafman
20b7998222 Increase timeout by 50% in thrashers/mapgap.yaml
Seen in /a/teuthology-2013-12-17_23:00:03-rados-next-distro-basic-plana/7188

Signed-off-by: David Zafman <david.zafman@inktank.com>
2013-12-18 16:59:05 -08:00
tamil
7b392a815a white listed "wrongly marked me down" to avoid false alarm in the nightlies
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-12-02 12:52:08 -08:00
Sage Weil
39c4674b27 test copy_from when running ceph_test_rados
Skip most of the upgrade suites, except the test that ends up on next.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-10-15 12:20:39 -07:00
Sage Weil
3615c149c2 make xfs.yaml a symlink 2013-10-02 13:30:13 -07:00
Sage Weil
886c4fdd45 make ext4.yaml a symlink 2013-10-02 13:30:13 -07:00
Sage Weil
92cc980be4 make btrfs.yaml a symlink 2013-10-02 13:30:13 -07:00
Sage Weil
30c56cc1cb mark all existing collections with %
This is ignored by old teuthology, but used by newer code to tell us when
to generate a product of all directory items.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-08-28 11:24:29 -07:00
Sage Weil
f2a688dcc2 rados/verify: crank up debugging on system tests
Looking for #5925

Signed-off-by: Sage Weil <sage@inktank.com>
2013-08-27 07:33:50 -07:00
Sage Weil
794d22443c rados/singleton: add 'wrong cluster addr' whitelist
Missed this the first time around.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-08-20 13:09:53 -07:00
Sage Weil
a8965a6a5d rados/monthash: fix 5925 yaml
Signed-off-by: Sage Weil <sage@inktank.com>
2013-08-14 21:37:16 -07:00
David Zafman
0e9a937936 monthrash workloads: Add task to reproduce 5925 with extra logging
Signed-off-by: David Zafman <david.zafman@inktank.com>
2013-08-13 17:15:16 -07:00
Sage Weil
2b6ac0faac rados/.../cephtool: let us run test_daemon.sh too
This needs the mon to be mon.a; rename it.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-30 23:10:41 -07:00
Sage Weil
4c15d73f19 rados: add rest-api test
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-26 18:23:25 -07:00
Sage Weil
5849853c5f rados/thrash: add in ext4 2013-07-26 16:27:11 -07:00
Samuel Just
99ae947109 rados: add thrasher which causes map gaps
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-07-26 10:48:55 -07:00
Sage Weil
a0edf7aaa5 rados: trim more aggressively 2013-07-23 10:46:32 -07:00
Sage Weil
f528108d14 rados: keep fewer osdmaps around
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-23 09:24:35 -07:00
David Zafman
748d0c80b1 Add more stressful thrashing by doing more pg splitting
Signed-off-by: David Zafman <david.zafman@inktank.com>
2013-07-22 10:20:25 -07:00
Sage Weil
9b3f59dfef rados: add in the mon/caps.sh tests 2013-07-22 09:29:43 -07:00
Sage Weil
e8c58d3ee9 rados: use 'ms inject intenral delays' during thrashing
This tries to help hit unusual races in the messenger.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-16 21:03:17 -07:00
Sage Weil
0a13124c72 rados/mon_thrash: inject random delays in mon messages
More useful than osd delays for this collection
2013-07-08 15:01:50 -07:00
Sage Weil
94ae48cf2c rados/monthrash: reduce matrix size a bit
we don't need so many osd workloads
2013-07-08 15:01:49 -07:00
Sage Weil
dbf12a31aa rados/monthrash: refactor a bit
Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-08 15:01:49 -07:00
Sage Weil
5295406dbb rados: expand mon thrashing tests
- test sync (full and recent)
- test multiple mons down
- test other cluster sizes (3 and 9)

Signed-off-by: Sage Weil <sage@inktank.com>
2013-07-08 15:01:49 -07:00
Sage Weil
e7fa5fcab7 rgw: move radosgw-admin test into rgw suite 2013-07-01 14:08:12 -07:00
Sage Weil
d5acff2888 rados: dump-stick: remove configs from yaml; let task set them
The mon osd report timeout was wrong; see #5440.
2013-06-25 12:48:32 -07:00
Sage Weil
8e4c292ecd rados: whitelist 'had wrong client addr' for cephtool test
Recently added stuff to cephtool/test.sh workunit that triggers this.

fd769c0f21
2013-06-21 11:50:48 -07:00
Sage Weil
35f0478e4a use correct branch for admin_socket tests 2013-06-19 10:37:43 -07:00
Sage Weil
d64f264c89 rados: don't use ceph-fuse for workunits
They can run on the local mount/dir.
2013-06-17 16:41:30 -07:00
Sage Weil
ae04f3c06f rados: whitelist 'wrongly marked me down' for cephtool tests
These test 'ceph osd down ...'; whitelist the resulting log message for
these tests.
2013-06-16 20:12:26 -07:00
Sage Weil
38f82f61c1 rados: no cephtool test on ceph-fuse 2013-06-16 10:03:08 -07:00
Sage Weil
73413642d7 rados: do not run cli tests on top of ceph-fuse
Some of the commands we test break teh mds, which break the mount, which
stall teh test.

Fixes: #5370
2013-06-16 10:00:22 -07:00
Sage Weil
1bf7a2ca2d rados: thrash under valgrind
This will get us better coverage in the peering and recovery code.
2013-06-14 15:17:55 -07:00
Sage Weil
8badb90494 move powercycle tests into a separate suite
We may not want to run these nightly, or on the same hardware, and we want
to separate out core rados problems from powercycle-specific issues (like
hw failure and ipmi suckage).
2013-06-12 21:03:16 -07:00
Sage Weil
03b12b8a37 rados: add btrfs to osd powercycle tests
So we can get some testing coverage on btrfs, and identify whether the
xfs corruption issues are related to the fs or ceph.
2013-06-07 08:54:29 -07:00
Sage Weil
ff8f9d3068 rados: only 1 client for objectcacher stress test
Otherwise these may step on each other.
2013-06-04 17:22:39 -07:00
Sage Weil
ed8e3c348f rados: increase recovery timeout when powercycling
Machine reboots take a while, and i've observed recovery making progress
but not finishing in time.
2013-06-01 13:33:08 -07:00
Sage Weil
8cec56dd79 valgrind: select notcmalloc ceph install flavor 2013-05-27 14:06:35 -07:00
Sage Weil
6205c3da81 rados/osd-powrcycle: turn up mds logging
To catch #4832

Signed-off-by: Sage Weil <sage@inktank.com>
2013-05-10 18:03:56 -07:00
Samuel Just
c838e1f449 Revert "Revert "Specify xfs for osd powercycle testing""
Pushed a fix to teuthology, should work now.

This reverts commit 853e8fdb731e863e4703d86d8852ecdb4160275f.
2013-05-01 13:52:04 -07:00
Samuel Just
853e8fdb73 Revert "Specify xfs for osd powercycle testing"
This is currently broken.

This reverts commit 79abc44205.
2013-04-30 18:38:21 -07:00
Samuel Just
79abc44205 Specify xfs for osd powercycle testing
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-04-30 14:32:38 -07:00
Samuel Just
526863ee25 remove ext4 from rados thrashing for now
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-04-23 12:50:17 -07:00
Sage Weil
bbcba29271 set 'filestore flush min = 0' for all ffsb jobs
Until we fix #4579
2013-04-22 20:59:25 -07:00
Sage Weil
21d6af2365 rados: whitelist quota warning in cluster log for pool quota test
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-03 14:04:54 -07:00
Sage Weil
8b49eb19ff rados: test mon config keys
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-01 15:56:14 -07:00
Sage Weil
b4a276e6bc rados: test pool quotas when testing api
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-29 18:04:40 -07:00
caleb miles
c170022b19 radosgw-admin: Adjust garbage collection settings.
Signed-off-by: caleb miles <caleb.miles@inktank.com>
2013-03-19 14:37:15 -04:00
Sage Weil
d02967922c move osd powercycling tests to rados suite
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-15 16:00:02 -07:00
Samuel Just
2230500eda rados/thrashers: enable split testing
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-03-15 12:23:37 -07:00
caleb miles
cbfeac9910 radosgw-admin: Adjust garbage collection settings.
Signed-off-by: caleb miles <caleb.miles@inktank.com>
2013-03-08 22:07:25 -05:00
Sage Weil
9d92ad6ead rados: crank up mon debugging for the mon thrashing tests
At least for now!

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-25 09:06:53 -08:00
Sage Weil
8dcbf8b685 specify deterministic admin socket path for admin_socket.py tests
Otherwise we don't know the path because the default now includes $pid.
2013-02-24 07:51:11 -08:00
Sage Weil
acab068269 add explicit install task before ceph task 2013-02-19 21:02:26 -08:00
Sage Weil
81704665f4 rados: fix ceph_test_filejournal test
Assume it is in the $PATH.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-19 11:26:02 -08:00
Sage Weil
f05f90b527 rados: use 2 instead of 3 machines
Put the client on the same node as some of the osds.  This shouldn't cause
any problems.  It will make timing a bit different, but if anything that
is probably a good thing.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:24:18 -08:00
Sage Weil
61605f8178 rados: don't pass ceph.conf path to workloadgen
The workunit tasks sets CEPH_CONF as needed, which the binaries already
understand.
2013-02-18 09:31:19 -08:00
Sage Weil
0c8d78236a rados: remove empty multifs collection 2013-02-17 17:13:50 -08:00
Sage Weil
d5c771a3f3 rados/monthrash: run some mon tests 2013-02-11 07:13:53 -08:00
Sage Weil
8d5ca94331 rados: separate out mon thrasher from thrasher collection
Run a different set of tests.
2013-02-11 07:13:41 -08:00
Samuel Just
d50086020f btrfs.yaml: increase osd op thread timeout
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-01-28 19:01:27 -08:00
Josh Durgin
81ed1bc76c rados: add pool_ops workunit to cephtool test 2013-01-28 16:58:54 -08:00
Samuel Just
2f192eaff6 TestRados expects rollback, not snap_rollback
Signed-off-by: Samuel Just <sam.just@inktank.com>
2013-01-23 19:42:45 -08:00
Sage Weil
b7cb1b113f rados/thrash: 3 monitors, so that we can thrash them 2013-01-21 15:09:00 -08:00
Joao Eduardo Luis
f41b5421f8 add mon_thrash task to kernel and rados thrashers collections
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
2013-01-17 21:00:21 -08:00
Sage Weil
cc7bf1bd18 rados: add osd reply delay injection 2013-01-17 20:54:37 -08:00
Sage Weil
0946a78c01 fix mon clock queue test syntax 2013-01-16 15:40:58 -08:00
Sage Weil
123a2dc486 rados: adjust socket injection rate down
See #3795.
2013-01-14 16:25:17 -08:00
Joao Eduardo Luis
305cb54a17 suites: rados: multimon: add mon clock skews task yaml files
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
2013-01-10 18:03:03 -08:00
Sage Weil
3686371ed3 rados: add test_filejournal
This writes to /tmp by default; should be ok plana, since it's / and not
tmpfs.
2013-01-02 11:49:38 -08:00
Sage Weil
80bcaa2905 rados: add filestore_idempotent test with journal aio = true 2012-12-27 10:53:54 -08:00
Sage Weil
0953ce5315 rados: add cephtool test 2012-12-20 10:58:56 -08:00
Sage Weil
9aacaf776e add mon-thrasher singleton 2012-11-29 01:15:44 +00:00
Sage Weil
e1a9a44027 fix objectcacher stress test 2012-11-27 17:03:12 -08:00
Josh Durgin
a4c41e20f0 Add objectcacher stress testing
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2012-11-26 11:08:44 -08:00
Sage Weil
43b57dec98 check ceph-mon for leaks 2012-11-15 22:04:20 -08:00
Samuel Just
2770ef7272 peer.yaml: assumes osd_pool_default_min_size is 1
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2012-11-13 12:29:24 -08:00
Yehuda Sadeh
6a0979dc42 radsogw-admin: bump up logging
debug ms = 1
debug objclass = 20

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2012-10-05 16:49:27 -07:00
Sage Weil
8eaaddcfea rados: run class unit tests through validators 2012-09-21 08:36:58 -07:00
Sage Weil
31a2a83cb4 move rgw tasks to separate suite 2012-09-21 08:36:08 -07:00
Sage Weil
675e9f319b rados: test all rados classes 2012-09-20 15:40:50 -07:00
Sage Weil
d8bc55eea4 avoid doing filestore idempotency tester 2x w/ and w/o msgr failures 2012-08-24 14:48:52 -07:00
Sage Weil
92d3404f84 rados: add msgr failure injection 2012-08-21 18:04:38 -07:00
Sage Weil
cd0b5278e5 separate regression suite into topical categories rados, rbd, fs 2012-07-31 09:55:45 -07:00