Commit Graph

720 Commits

Author SHA1 Message Date
Mark Nelson
31762c0003 lock: Improved logging when there aren't enough nodes available to lock-many. 2012-03-07 12:55:54 -08:00
Mark Nelson
05a07dda7d lock: Added a --locked flag to teuthology-lock.
Can be used to restrict searches based on lock status, e.g.
'teuthology-lock --list -a --locked false --status up' shows available nodes.
2012-03-07 12:55:33 -08:00
Sage Weil
2a18c3e1d0 nuke: unmount osd data directories
This helps us avoid reboot to clean up osd data directories that are left
mounted.
2012-03-06 09:34:38 -08:00
Josh Durgin
1493674735 Use non-zero exit status if any tests failed
Fixes: #1989
2012-03-05 13:34:33 -08:00
Sage Weil
dc1abab211 github.com/NewDreamNetwork -> github.com/ceph 2012-03-02 10:55:56 -08:00
Josh Durgin
a80246c17f dump_stuck: note required ceph configuration 2012-02-29 15:47:17 -08:00
Josh Durgin
85cc96c11a dump_stuck: verify that 'ceph health' mentions the right number of inactive/unclean/stale pgs 2012-02-28 13:55:46 -08:00
Sage Weil
999e21928c peer: ignore +scrubbing portion of pg state
It can cause the mon state and osd states to not match.
2012-02-28 09:50:29 -08:00
Sage Weil
84cd4ed6c3 peer: wait for peering to complete, or block
We need to wait for peering to either complete, or block because it is
waiting for another PG.  _Then_ look at all the PG states and compare the
mon values with what we get from qeurying the OSDs directly.
2012-02-25 21:05:00 -08:00
Josh Durgin
b8739585a0 peer: remove unused variable 2012-02-24 15:01:34 -08:00
Josh Durgin
62bda12711 misc: always return a usable result from get_valgrind_args 2012-02-24 14:56:43 -08:00
Josh Durgin
e4801819f2 rgw: simplify valgrind args 2012-02-24 14:56:42 -08:00
Sage Weil
edbb41e1f8 add peer task
Force a pg to get stuck in 'down' state, verify we can query the peering
state, then start the OSD so it can recover.
2012-02-24 15:05:17 -08:00
Sage Weil
7ac04a422a lost_unfound: list missing/unfound for each pg and verify the unfound counts
This also tests the pg list_missing functionality.
2012-02-24 12:42:39 -08:00
Sage Weil
c43e87d118 ceph_manager: list_pg_missing
List missing objects for the given pgid.
2012-02-24 12:42:39 -08:00
Josh Durgin
c93a08eda0 Whitespace and unnecessary formatting fixes 2012-02-24 12:05:35 -08:00
Josh Durgin
3bfb8d696e ceph, ceph-fuse: simplify valgrind argument additions 2012-02-24 12:05:35 -08:00
Sage Weil
9ec047226f refactor all valgrind users to use a get_valgrind_args() helper
This avoids much annoying, duplicated code.
2012-02-24 12:05:35 -08:00
Sage Weil
90fdc84086 ceph: always create valgrind logs dir
Other tasks use it too.  It's more annoying to conditionally create it.
2012-02-24 12:05:35 -08:00
Sage Weil
7af6e46c94 ceph: always try to process valgrind logs
Check for errors in valgrind logs even if there is no valgrind option
the ceph task config stanza.  Other tasks can run via valgrind (ceph-fuse,
rgw).  If the logs aren't there, this is harmless.
2012-02-24 12:05:35 -08:00
Sage Weil
e2ea73d1a5 rgw: add valgrind support
tasks:
- ceph:
- rgw:
   client.a:
     valgrind: [--tool=memcheck]
2012-02-24 12:05:35 -08:00
Sage Weil
7bf64b73ee rgw: accept dict
e.g.,

tasks:
...
- rgw:
    client.0:
    client.1:
2012-02-24 12:05:35 -08:00
Sage Weil
d40a9b275f lost_unfound: new mark_unfound_lost syntax 2012-02-23 20:09:09 -08:00
Josh Durgin
81a46c462a dump_stuck: flush stats before waiting for recovery/clean 2012-02-23 17:07:26 -08:00
Josh Durgin
995dc1f751 Add a task for testing stuck pg visibility. 2012-02-21 15:12:48 -08:00
Josh Durgin
2a1c74c5f5 Move duration calculation to an internal task
This excludes all generic start up costs, like waiting for locks,
rebooting into a new kernel, etc.
2012-02-21 15:12:26 -08:00
Josh Durgin
eb434a507a Add necessary imports for s3 tasks, and keep them alphabetical. 2012-02-21 15:04:00 -08:00
Yehuda Sadeh
11073e505f s3roundtrip, s3readwrite: access key uses url safe chars
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com>
2012-02-21 12:23:38 -08:00
Yehuda Sadeh
6e1b3a5644 rgw: access key uses url safe chars
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com>
2012-02-21 12:12:03 -08:00
Sage Weil
c5688e6570 ceph: valgrind trumps coverage when picking a flavor
valgrind will crash if we don't use notcmalloc; coverage will silently
fail to collect coverage info.
2012-02-20 15:17:52 -08:00
Sage Weil
5216d3c7a9 ceph.conf: no lockdep by default 2012-02-20 14:54:10 -08:00
Sage Weil
5f9445c88b suite.results: include test duration in output 2012-02-20 13:38:06 -08:00
Sage Weil
71d0d97a97 cfuse -> ceph-fuse 2012-02-20 07:12:53 -08:00
Sage Weil
7ff9f044e7 ceph: allow valgrind per-type (not just per-name) 2012-02-20 07:04:45 -08:00
Sage Weil
eb93fa744d lost_unfound: mark osds in when we revive them
so that we test what we meant to.  It also lets us actually go clean at the
very end.
2012-02-19 19:40:45 -08:00
Sage Weil
45b6189b7d ceph_manager: ignore stale states when counting
also remove assumptions about ordering of states
2012-02-18 14:44:53 -08:00
Sage Weil
196d4a1f16 wait_till_clean -> wait_for_clean and wait_for_recovery
Clean now also means the correct number of replicas, whereas recovered
means we have done all the work we can do given the replicas/osds we have.
For example, degraded and clean are now mutually exclusive.

Also move away from 'till'.
2012-02-17 21:53:25 -08:00
Sage Weil
ad9d7fb6e1 backfill: wait for clean before writing+blackholing
If we have straggler pgs and blackhole osd.1, we can deadlock because we
need info from that osd to repeer and continue.  Make sure we're clean, and
then start the write + blackhole + kill test.
2012-02-14 15:24:11 -08:00
Sage Weil
50cc60f02d nuke: nuke testrados too
Slightly fewer nuke -r's
2012-02-14 15:23:19 -08:00
Sage Weil
6f3abc6ced ceph_manager: mark in a bit more often than out
Otherwise we can get into cases where many/most nodes are out, and things
don't work as well.  e.g., crush may start to fail.
2012-02-13 15:28:24 -08:00
Sage Weil
af4ce44233 ceph: use any fs, not just btrfs, on scratch devices
The

  btrfs: true

syntax is replaced with

  fs: btrfs

or ext4, xfs.
2012-02-13 15:28:24 -08:00
Sage Weil
975d73a2bb nuke: nuke testrados and rados processes, too
So that -r is needed slightly less often.
2012-02-13 15:28:24 -08:00
Sage Weil
46b612efa4 misc: make get_scratch_devices look for (almost) any disk that's not mounted 2012-02-13 15:28:24 -08:00
Sage Weil
2adad559bd hammer.sh: assume path is set 2012-02-11 14:19:49 -08:00
Josh Durgin
0cd16cf03d ceph: always add logger for daemons
The extra log function added redundant info and didn't allow different
levels.
2012-02-02 09:36:04 -08:00
Josh Durgin
7af7c66bd0 ceph: rename type parameter to type_
type is a built-in and shouldn't be aliased.
2012-02-02 09:35:58 -08:00
Josh Durgin
7146db9215 ceph: use the correct comparison operator
is compares identity (i.e. address in cpython), not value.
2012-02-02 09:27:04 -08:00
Josh Durgin
e7672b6433 ceph: sync before unmounting btrfs devices
There may still be writes in flight, since the osds may not have
shutdown cleanly. This should prevent EBUSY when unmounting.

Fixes: #1997
2012-02-02 09:26:45 -08:00
Josh Durgin
1364b8826f ceph: delay raising exceptions until all daemons are stopped
If a daemon crashes, the exception is raised when we stop it. This
caused some daemons to continue running during cleanup, since the rest
of the daemons of the same type would not be shut down. Also log each
daemon that crashed, for easier debugging.

Fixes: #1744
2012-02-02 09:26:25 -08:00
Sage Weil
0236dc0f5e add backfill task
This does a basic test of backfill functionality, including a divergent
log on a backfill target (#1983).
2012-01-31 16:25:53 -08:00