Mark Nelson
31762c0003
lock: Improved logging when there aren't enough nodes available to lock-many.
2012-03-07 12:55:54 -08:00
Mark Nelson
05a07dda7d
lock: Added a --locked flag to teuthology-lock.
...
Can be used to restrict searches based on lock status, e.g.
'teuthology-lock --list -a --locked false --status up' shows available nodes.
2012-03-07 12:55:33 -08:00
Sage Weil
2a18c3e1d0
nuke: unmount osd data directories
...
This helps us avoid reboot to clean up osd data directories that are left
mounted.
2012-03-06 09:34:38 -08:00
Josh Durgin
1493674735
Use non-zero exit status if any tests failed
...
Fixes : #1989
2012-03-05 13:34:33 -08:00
Sage Weil
dc1abab211
github.com/NewDreamNetwork -> github.com/ceph
2012-03-02 10:55:56 -08:00
Josh Durgin
a80246c17f
dump_stuck: note required ceph configuration
2012-02-29 15:47:17 -08:00
Josh Durgin
85cc96c11a
dump_stuck: verify that 'ceph health' mentions the right number of inactive/unclean/stale pgs
2012-02-28 13:55:46 -08:00
Sage Weil
999e21928c
peer: ignore +scrubbing portion of pg state
...
It can cause the mon state and osd states to not match.
2012-02-28 09:50:29 -08:00
Sage Weil
84cd4ed6c3
peer: wait for peering to complete, or block
...
We need to wait for peering to either complete, or block because it is
waiting for another PG. _Then_ look at all the PG states and compare the
mon values with what we get from qeurying the OSDs directly.
2012-02-25 21:05:00 -08:00
Josh Durgin
b8739585a0
peer: remove unused variable
2012-02-24 15:01:34 -08:00
Josh Durgin
62bda12711
misc: always return a usable result from get_valgrind_args
2012-02-24 14:56:43 -08:00
Josh Durgin
e4801819f2
rgw: simplify valgrind args
2012-02-24 14:56:42 -08:00
Sage Weil
edbb41e1f8
add peer task
...
Force a pg to get stuck in 'down' state, verify we can query the peering
state, then start the OSD so it can recover.
2012-02-24 15:05:17 -08:00
Sage Weil
7ac04a422a
lost_unfound: list missing/unfound for each pg and verify the unfound counts
...
This also tests the pg list_missing functionality.
2012-02-24 12:42:39 -08:00
Sage Weil
c43e87d118
ceph_manager: list_pg_missing
...
List missing objects for the given pgid.
2012-02-24 12:42:39 -08:00
Josh Durgin
c93a08eda0
Whitespace and unnecessary formatting fixes
2012-02-24 12:05:35 -08:00
Josh Durgin
3bfb8d696e
ceph, ceph-fuse: simplify valgrind argument additions
2012-02-24 12:05:35 -08:00
Sage Weil
9ec047226f
refactor all valgrind users to use a get_valgrind_args() helper
...
This avoids much annoying, duplicated code.
2012-02-24 12:05:35 -08:00
Sage Weil
90fdc84086
ceph: always create valgrind logs dir
...
Other tasks use it too. It's more annoying to conditionally create it.
2012-02-24 12:05:35 -08:00
Sage Weil
7af6e46c94
ceph: always try to process valgrind logs
...
Check for errors in valgrind logs even if there is no valgrind option
the ceph task config stanza. Other tasks can run via valgrind (ceph-fuse,
rgw). If the logs aren't there, this is harmless.
2012-02-24 12:05:35 -08:00
Sage Weil
e2ea73d1a5
rgw: add valgrind support
...
tasks:
- ceph:
- rgw:
client.a:
valgrind: [--tool=memcheck]
2012-02-24 12:05:35 -08:00
Sage Weil
7bf64b73ee
rgw: accept dict
...
e.g.,
tasks:
...
- rgw:
client.0:
client.1:
2012-02-24 12:05:35 -08:00
Sage Weil
d40a9b275f
lost_unfound: new mark_unfound_lost syntax
2012-02-23 20:09:09 -08:00
Josh Durgin
81a46c462a
dump_stuck: flush stats before waiting for recovery/clean
2012-02-23 17:07:26 -08:00
Josh Durgin
995dc1f751
Add a task for testing stuck pg visibility.
2012-02-21 15:12:48 -08:00
Josh Durgin
2a1c74c5f5
Move duration calculation to an internal task
...
This excludes all generic start up costs, like waiting for locks,
rebooting into a new kernel, etc.
2012-02-21 15:12:26 -08:00
Josh Durgin
eb434a507a
Add necessary imports for s3 tasks, and keep them alphabetical.
2012-02-21 15:04:00 -08:00
Yehuda Sadeh
11073e505f
s3roundtrip, s3readwrite: access key uses url safe chars
...
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com>
2012-02-21 12:23:38 -08:00
Yehuda Sadeh
6e1b3a5644
rgw: access key uses url safe chars
...
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com>
2012-02-21 12:12:03 -08:00
Sage Weil
c5688e6570
ceph: valgrind trumps coverage when picking a flavor
...
valgrind will crash if we don't use notcmalloc; coverage will silently
fail to collect coverage info.
2012-02-20 15:17:52 -08:00
Sage Weil
5216d3c7a9
ceph.conf: no lockdep by default
2012-02-20 14:54:10 -08:00
Sage Weil
5f9445c88b
suite.results: include test duration in output
2012-02-20 13:38:06 -08:00
Sage Weil
71d0d97a97
cfuse -> ceph-fuse
2012-02-20 07:12:53 -08:00
Sage Weil
7ff9f044e7
ceph: allow valgrind per-type (not just per-name)
2012-02-20 07:04:45 -08:00
Sage Weil
eb93fa744d
lost_unfound: mark osds in when we revive them
...
so that we test what we meant to. It also lets us actually go clean at the
very end.
2012-02-19 19:40:45 -08:00
Sage Weil
45b6189b7d
ceph_manager: ignore stale states when counting
...
also remove assumptions about ordering of states
2012-02-18 14:44:53 -08:00
Sage Weil
196d4a1f16
wait_till_clean -> wait_for_clean and wait_for_recovery
...
Clean now also means the correct number of replicas, whereas recovered
means we have done all the work we can do given the replicas/osds we have.
For example, degraded and clean are now mutually exclusive.
Also move away from 'till'.
2012-02-17 21:53:25 -08:00
Sage Weil
ad9d7fb6e1
backfill: wait for clean before writing+blackholing
...
If we have straggler pgs and blackhole osd.1, we can deadlock because we
need info from that osd to repeer and continue. Make sure we're clean, and
then start the write + blackhole + kill test.
2012-02-14 15:24:11 -08:00
Sage Weil
50cc60f02d
nuke: nuke testrados too
...
Slightly fewer nuke -r's
2012-02-14 15:23:19 -08:00
Sage Weil
6f3abc6ced
ceph_manager: mark in a bit more often than out
...
Otherwise we can get into cases where many/most nodes are out, and things
don't work as well. e.g., crush may start to fail.
2012-02-13 15:28:24 -08:00
Sage Weil
af4ce44233
ceph: use any fs, not just btrfs, on scratch devices
...
The
btrfs: true
syntax is replaced with
fs: btrfs
or ext4, xfs.
2012-02-13 15:28:24 -08:00
Sage Weil
975d73a2bb
nuke: nuke testrados and rados processes, too
...
So that -r is needed slightly less often.
2012-02-13 15:28:24 -08:00
Sage Weil
46b612efa4
misc: make get_scratch_devices look for (almost) any disk that's not mounted
2012-02-13 15:28:24 -08:00
Sage Weil
2adad559bd
hammer.sh: assume path is set
2012-02-11 14:19:49 -08:00
Josh Durgin
0cd16cf03d
ceph: always add logger for daemons
...
The extra log function added redundant info and didn't allow different
levels.
2012-02-02 09:36:04 -08:00
Josh Durgin
7af7c66bd0
ceph: rename type parameter to type_
...
type is a built-in and shouldn't be aliased.
2012-02-02 09:35:58 -08:00
Josh Durgin
7146db9215
ceph: use the correct comparison operator
...
is compares identity (i.e. address in cpython), not value.
2012-02-02 09:27:04 -08:00
Josh Durgin
e7672b6433
ceph: sync before unmounting btrfs devices
...
There may still be writes in flight, since the osds may not have
shutdown cleanly. This should prevent EBUSY when unmounting.
Fixes : #1997
2012-02-02 09:26:45 -08:00
Josh Durgin
1364b8826f
ceph: delay raising exceptions until all daemons are stopped
...
If a daemon crashes, the exception is raised when we stop it. This
caused some daemons to continue running during cleanup, since the rest
of the daemons of the same type would not be shut down. Also log each
daemon that crashed, for easier debugging.
Fixes : #1744
2012-02-02 09:26:25 -08:00
Sage Weil
0236dc0f5e
add backfill task
...
This does a basic test of backfill functionality, including a divergent
log on a backfill target (#1983 ).
2012-01-31 16:25:53 -08:00