Josh Durgin
409c57170d
coverage: don't remove ceph tarball
...
We want to keep it for examining core files, and we're already
fetching it here, once per suite run.
2011-10-20 16:28:32 -07:00
Sage Weil
4ec37b2391
add lost_unfound task
...
Also some misc useful bits to ceph_manager.
2011-10-17 15:32:22 -07:00
Josh Durgin
bcded7f163
ceph: add whitelist for cluster log errors
...
Some messages are expected when thrashing osds or creating unfound
objects.
Fixes : #1622
2011-10-17 14:42:08 -07:00
Josh Durgin
fba220ecaa
nuke: reset syslog configuration after rebooting
...
Previously we removed a file and rebooted without syncing, so the file
was never deleted.
2011-10-17 10:40:19 -07:00
Yehuda Sadeh
493596a7fd
radosgw-admin: test swift keys creation/removal
2011-10-12 15:37:33 -07:00
Josh Durgin
321381d75f
teuthology-worker: remove --keep-locked-on-error
2011-10-07 14:51:46 -07:00
Josh Durgin
3d3eb0efea
Remove --keep-locked-on-error, and behave as if it were specified
...
This will help prevent machines with cephtest dirs still present from
being used. It's easy to unlock machines - the targets yaml fragment
is output during a run.
2011-10-07 14:49:53 -07:00
Josh Durgin
c56ab97442
reconnect: ignore SSHExceptions before the timeout expires
...
Fixes : #1587
2011-10-06 17:18:35 -07:00
Samuel Just
4722d468c6
task/watch_notify_stress: watch_notify_stress now thrashes clients
...
This should exercise the watch notify timeout code.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-10-06 14:34:44 -07:00
Sage Weil
4e61e4835e
rgw: keep radosgw in foreground
...
It defaults to a daemon now.
2011-10-06 12:50:12 -07:00
Josh Durgin
107db6a913
Retry listing machines if the lock server goes down.
2011-10-04 17:21:00 -07:00
Sage Weil
39a1e76065
rgw: use normal logging mechanism
...
Keep capturing stdout/err, even though it should end up empty.
Signed-off-by: Sage Weil <sage@newdream.net>
2011-10-04 16:09:51 -07:00
Josh Durgin
7b7ff6e8ce
teuthology-worker: clean up last_in_suite jobs
...
There's no reason not to delete them once they start.
2011-10-04 12:32:58 -07:00
Josh Durgin
3d3ba1ebb1
daemon-helper: detect the signal actually sent
...
I thought I fixed this when I implemented coverage collection, but I
guess it got lost in a rebase or something.
2011-10-04 12:17:19 -07:00
Josh Durgin
d305d61b86
ceph_manager: remove unused raw_pg_status method
2011-10-03 17:49:53 -07:00
Josh Durgin
8e031730c1
ceph_manager: run ceph -s as a normal program
...
This allows failures from it to be detected better.
2011-10-03 17:49:13 -07:00
Josh Durgin
bad609e63e
teuthology-results: include passed tests in email
2011-10-03 17:11:53 -07:00
Josh Durgin
8bcd2a74ca
teuthology-results: include reasons for failure in email
2011-10-03 17:08:29 -07:00
Josh Durgin
030161ed8d
teuthology-ls: show reasons for failures with -v
2011-10-03 17:07:41 -07:00
Josh Durgin
1cad309d65
Add failure_reason to summary for the first failure detected.
...
For now, this is the exception raised during a task, the error found
in the central log, or coredumps found. More specific errors
(i.e. s3-tests had 3 failures) can be added later as exceptions raised
by tasks.
2011-10-03 17:07:41 -07:00
Josh Durgin
817b950494
radosbench: get coverage and cores
2011-10-03 17:07:41 -07:00
Samuel Just
fe1a271d69
watch_notify_stress.py: add ceph flags option
...
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-10-03 14:26:08 -07:00
Samuel Just
28d60172f6
ceph.py: add btrfs option
...
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-10-03 14:26:04 -07:00
Josh Durgin
ae196028ad
nuke: keep up with renaming cfuse -> ceph-fuse
2011-10-03 09:56:42 -07:00
Sage Weil
2b601a32d0
radosgw-admin: test additional keys, log list/show/rm
...
Signed-off-by: Sage Weil <sage@newdream.net>
2011-10-03 09:45:11 -07:00
Sage Weil
b93a00771f
tasks/radosgw-admin: test radosgw-admin tool
...
Not yet complete...
2011-10-03 09:45:11 -07:00
Sage Weil
afc2dc085a
nuke: killall apache2 and radosgw too
2011-10-03 09:44:58 -07:00
Greg Farnum
9b44469e5e
s3-tests: use radosgw-admin instead of radosgw_admin
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-09-30 09:26:42 -07:00
Josh Durgin
52427589a6
ceph_manager: parse osd numbers with dots
...
This is necessary since wip-dot-names was merged.
2011-09-29 09:09:31 -07:00
Sage Weil
a92fef77dc
rename c* -> ceph-*
...
Leave cfuse task name unchanged for now...
2011-09-23 08:57:18 -07:00
Josh Durgin
c7ff5fc38b
queue: results_timeout needs to be converted to a string
2011-09-22 18:23:42 -07:00
Samuel Just
ef56a72b73
task/watch_notify_stress.py: add simple watch_notify stress test
...
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-09-22 13:25:21 -07:00
Josh Durgin
c3c262656d
schedule: put results timeout in the job
...
The default was always being used instead.
2011-09-21 11:05:33 -07:00
Greg Farnum
e4dfe3d4bd
lockfile: increase interval to prevent incorrect locking orders
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-09-20 10:04:01 -07:00
Greg Farnum
5ff88d1902
lockfile: don't fail cleanup if no lock procs exist
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-09-20 10:03:33 -07:00
Tommi Virtanen
0d5dbfa27e
workunit: Fetch source from github.
...
Needed an elaborate dance because Github won't let us download
an archive of a subdirectory.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
2011-09-16 11:32:15 -07:00
Tommi Virtanen
5583fac383
s3tests: Clone repository from github.
...
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
2011-09-16 11:09:45 -07:00
Tommi Virtanen
4d92c355ec
coverage: Fetch source from github.
...
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
2011-09-16 11:08:38 -07:00
Samuel Just
4a0f8fee54
ceph.py: remove unused variables mds_daemons and mon_daemons
...
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-09-15 17:26:38 -07:00
Samuel Just
a3c886af19
ceph.py/cephmanager.py: add ctx.daemons for restarting daemons
...
ctx.daemons will now be an instance of CephState.
ctx.daemons.get_daemon(role, id).stop() to stop daemon, retart() to
restart the daemon, etc.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-09-15 17:08:34 -07:00
Samuel Just
85cb29d345
testsnaps: LD_PRELOAD needed for librados
...
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-09-14 16:28:06 -07:00
Tommi Virtanen
a2372fce12
Move orchestra to teuthology.orchestra so there's just one top-level package.
2011-09-13 14:53:02 -07:00
Tommi Virtanen
bf46fd5394
Merge orchestra into teuthology.
...
There are too many things called Orchestra out there,
including Ubuntu's new multi-machine service orchestration
framework. The code might still be beneficial outside of
teuthology, but it can be spun off at that time.
Conflicts:
bootstrap
requirements.txt
setup.py
2011-09-13 14:10:12 -07:00
Tommi Virtanen
cc72fe6cf3
Callers of task s3tests.create_users don't need to provide dummy "fixtures" dict.
2011-09-09 13:22:03 -07:00
Josh Durgin
1970bad9d9
thrashosds: fix timeout when no options are specified
2011-09-09 10:31:08 -07:00
Josh Durgin
8dd52f9941
thrashosds: fail if cluster doesn't finally become clean in 5 minutes
2011-09-08 18:09:11 -07:00
Josh Durgin
fc1b14ddcc
thrasher: get coverage and cores from calling ceph commands
2011-09-08 14:09:13 -07:00
Josh Durgin
b72c5a8363
thrashosds: wait for every pg to go active and clean before exiting
2011-09-08 14:07:23 -07:00
Josh Durgin
08747c5bfb
thrasher: clean up a bit
2011-09-08 12:58:59 -07:00
Josh Durgin
091b0ae3de
autotest: allow tests to be run on all clients
2011-09-07 17:50:12 -07:00