Commit Graph

509 Commits

Author SHA1 Message Date
Josh Durgin
c56ab97442 reconnect: ignore SSHExceptions before the timeout expires
Fixes: #1587
2011-10-06 17:18:35 -07:00
Samuel Just
4722d468c6 task/watch_notify_stress: watch_notify_stress now thrashes clients
This should exercise the watch notify timeout code.

Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-10-06 14:34:44 -07:00
Sage Weil
4e61e4835e rgw: keep radosgw in foreground
It defaults to a daemon now.
2011-10-06 12:50:12 -07:00
Josh Durgin
107db6a913 Retry listing machines if the lock server goes down. 2011-10-04 17:21:00 -07:00
Sage Weil
39a1e76065 rgw: use normal logging mechanism
Keep capturing stdout/err, even though it should end up empty.

Signed-off-by: Sage Weil <sage@newdream.net>
2011-10-04 16:09:51 -07:00
Josh Durgin
7b7ff6e8ce teuthology-worker: clean up last_in_suite jobs
There's no reason not to delete them once they start.
2011-10-04 12:32:58 -07:00
Josh Durgin
3d3ba1ebb1 daemon-helper: detect the signal actually sent
I thought I fixed this when I implemented coverage collection, but I
guess it got lost in a rebase or something.
2011-10-04 12:17:19 -07:00
Josh Durgin
d305d61b86 ceph_manager: remove unused raw_pg_status method 2011-10-03 17:49:53 -07:00
Josh Durgin
8e031730c1 ceph_manager: run ceph -s as a normal program
This allows failures from it to be detected better.
2011-10-03 17:49:13 -07:00
Josh Durgin
bad609e63e teuthology-results: include passed tests in email 2011-10-03 17:11:53 -07:00
Josh Durgin
8bcd2a74ca teuthology-results: include reasons for failure in email 2011-10-03 17:08:29 -07:00
Josh Durgin
030161ed8d teuthology-ls: show reasons for failures with -v 2011-10-03 17:07:41 -07:00
Josh Durgin
1cad309d65 Add failure_reason to summary for the first failure detected.
For now, this is the exception raised during a task, the error found
in the central log, or coredumps found. More specific errors
(i.e. s3-tests had 3 failures) can be added later as exceptions raised
by tasks.
2011-10-03 17:07:41 -07:00
Josh Durgin
817b950494 radosbench: get coverage and cores 2011-10-03 17:07:41 -07:00
Samuel Just
fe1a271d69 watch_notify_stress.py: add ceph flags option
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-10-03 14:26:08 -07:00
Samuel Just
28d60172f6 ceph.py: add btrfs option
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-10-03 14:26:04 -07:00
Josh Durgin
ae196028ad nuke: keep up with renaming cfuse -> ceph-fuse 2011-10-03 09:56:42 -07:00
Sage Weil
2b601a32d0 radosgw-admin: test additional keys, log list/show/rm
Signed-off-by: Sage Weil <sage@newdream.net>
2011-10-03 09:45:11 -07:00
Sage Weil
b93a00771f tasks/radosgw-admin: test radosgw-admin tool
Not yet complete...
2011-10-03 09:45:11 -07:00
Sage Weil
afc2dc085a nuke: killall apache2 and radosgw too 2011-10-03 09:44:58 -07:00
Greg Farnum
9b44469e5e s3-tests: use radosgw-admin instead of radosgw_admin
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-09-30 09:26:42 -07:00
Josh Durgin
52427589a6 ceph_manager: parse osd numbers with dots
This is necessary since wip-dot-names was merged.
2011-09-29 09:09:31 -07:00
Sage Weil
a92fef77dc rename c* -> ceph-*
Leave cfuse task name unchanged for now...
2011-09-23 08:57:18 -07:00
Josh Durgin
c7ff5fc38b queue: results_timeout needs to be converted to a string 2011-09-22 18:23:42 -07:00
Samuel Just
ef56a72b73 task/watch_notify_stress.py: add simple watch_notify stress test
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-09-22 13:25:21 -07:00
Josh Durgin
c3c262656d schedule: put results timeout in the job
The default was always being used instead.
2011-09-21 11:05:33 -07:00
Greg Farnum
e4dfe3d4bd lockfile: increase interval to prevent incorrect locking orders
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-09-20 10:04:01 -07:00
Greg Farnum
5ff88d1902 lockfile: don't fail cleanup if no lock procs exist
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-09-20 10:03:33 -07:00
Tommi Virtanen
0d5dbfa27e workunit: Fetch source from github.
Needed an elaborate dance because Github won't let us download
an archive of a subdirectory.

Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
2011-09-16 11:32:15 -07:00
Tommi Virtanen
5583fac383 s3tests: Clone repository from github.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
2011-09-16 11:09:45 -07:00
Tommi Virtanen
4d92c355ec coverage: Fetch source from github.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
2011-09-16 11:08:38 -07:00
Samuel Just
4a0f8fee54 ceph.py: remove unused variables mds_daemons and mon_daemons
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-09-15 17:26:38 -07:00
Samuel Just
a3c886af19 ceph.py/cephmanager.py: add ctx.daemons for restarting daemons
ctx.daemons will now be an instance of CephState.

ctx.daemons.get_daemon(role, id).stop() to stop daemon, retart() to
restart the daemon, etc.

Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-09-15 17:08:34 -07:00
Samuel Just
85cb29d345 testsnaps: LD_PRELOAD needed for librados
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-09-14 16:28:06 -07:00
Tommi Virtanen
a2372fce12 Move orchestra to teuthology.orchestra so there's just one top-level package. 2011-09-13 14:53:02 -07:00
Tommi Virtanen
bf46fd5394 Merge orchestra into teuthology.
There are too many things called Orchestra out there,
including Ubuntu's new multi-machine service orchestration
framework. The code might still be beneficial outside of
teuthology, but it can be spun off at that time.

Conflicts:
	bootstrap
	requirements.txt
	setup.py
2011-09-13 14:10:12 -07:00
Tommi Virtanen
cc72fe6cf3 Callers of task s3tests.create_users don't need to provide dummy "fixtures" dict. 2011-09-09 13:22:03 -07:00
Josh Durgin
1970bad9d9 thrashosds: fix timeout when no options are specified 2011-09-09 10:31:08 -07:00
Josh Durgin
8dd52f9941 thrashosds: fail if cluster doesn't finally become clean in 5 minutes 2011-09-08 18:09:11 -07:00
Josh Durgin
fc1b14ddcc thrasher: get coverage and cores from calling ceph commands 2011-09-08 14:09:13 -07:00
Josh Durgin
b72c5a8363 thrashosds: wait for every pg to go active and clean before exiting 2011-09-08 14:07:23 -07:00
Josh Durgin
08747c5bfb thrasher: clean up a bit 2011-09-08 12:58:59 -07:00
Josh Durgin
091b0ae3de autotest: allow tests to be run on all clients 2011-09-07 17:50:12 -07:00
Josh Durgin
e45109b645 rbd: allow specifying all clients 2011-09-07 16:54:24 -07:00
Greg Farnum
655e4a4cfe locktest: don't fail cleanup if the dir doesn't exist
We're doing this the cheapest way possible: make the dir!

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-09-06 12:39:21 -07:00
Sage Weil
d4a876f3e3 teuthology: do a deep merge of input yaml fragments
Concatenate lists, and recursively combine dicts.

If you specify inputs like

 foo:
 - a
 - b

and

 foo:
 - c

you should get

 foo:
 - a
 - b
 - c

Dicts should also be merged (last one wins), and the merging is deep. E.g.

 foo:
   a:
     b:
       c: 1

and

 foo:
   a:
     b:
       c: 2

is

 foo:
   a:
     b:
       c: 2

Fixes: #1497
2011-09-03 15:07:21 -07:00
Josh Durgin
8f2ef43b9e lock: default to only listing machines you have locked
--all removes this restriction
2011-09-02 19:12:16 -07:00
Josh Durgin
5c99f9f264 rgw: run as an external fastcgi server to match dho 2011-09-02 17:58:19 -07:00
Sage Weil
e66dffc3d3 don't eat exceptions for breakfast
fixes 0c2bee1514
2011-09-02 11:07:10 -07:00
Greg Farnum
7c4a5ac83b locktest: make it actually run the executable test
This was missing an argument (the file to run on!) and apparently
that didn't cause the command to output a failure return code.

Additionally, the ceph wrappers were blocking a crash and falsely
reporting success back to teuthology. (Yikes!)

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-09-01 14:47:48 -07:00