Commit Graph

377 Commits

Author SHA1 Message Date
Josh Durgin
6e3e0d7cdc connection: allow the caller to specify whether keep-alive should be used 2011-11-03 13:07:21 -07:00
Josh Durgin
b1a0c1adea locker: fix race in locking
The isolation level is lower than I thought. This made it possible for
two clients to think they both locked the same machines, since the
update would still be modifying each row to change the locked_since
time.
2011-11-03 11:29:18 -07:00
Samuel Just
a2f406ef49 testrados: set CEPH_CLIENT_ID without a ;
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-11-02 11:33:37 -07:00
Samuel Just
810cae1a1d testrados: specify CEPH_CONF directly
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-10-31 14:54:24 -07:00
Yehuda Sadeh
10c3508741 rgw: add user suspend/enable test 2011-10-27 12:11:28 -07:00
Yehuda Sadeh
86aa940ffb rgw: log-to-stderr is now a binary flag 2011-10-27 11:32:12 -07:00
Samuel Just
8d0a7c5977 testrados: rename testsnaps to testrados and make snap testing optional
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-10-24 14:25:22 -07:00
Josh Durgin
a1249d07ca workunit: set PYTHONPATH so we can test python bindings 2011-10-24 13:52:58 -07:00
Sage Weil
61cbb3218e ceph.conf: python parser doens't like ; comments 2011-10-23 10:30:27 -07:00
Sage Weil
3ed065625b ceph.conf: more frequent osd scrubbing; remove old cruft 2011-10-22 22:16:39 -07:00
Sage Weil
b8beff3dd5 ceph_manager: count active+clean+<somjething else> as active+clean
In my case, one pg was active+clean+scrubbing.

Signed-off-by: Sage Weil <sage@newdream.net>
2011-10-21 10:54:05 -07:00
Josh Durgin
409c57170d coverage: don't remove ceph tarball
We want to keep it for examining core files, and we're already
fetching it here, once per suite run.
2011-10-20 16:28:32 -07:00
Sage Weil
4ec37b2391 add lost_unfound task
Also some misc useful bits to ceph_manager.
2011-10-17 15:32:22 -07:00
Josh Durgin
bcded7f163 ceph: add whitelist for cluster log errors
Some messages are expected when thrashing osds or creating unfound
objects.

Fixes: #1622
2011-10-17 14:42:08 -07:00
Josh Durgin
fba220ecaa nuke: reset syslog configuration after rebooting
Previously we removed a file and rebooted without syncing, so the file
was never deleted.
2011-10-17 10:40:19 -07:00
Yehuda Sadeh
493596a7fd radosgw-admin: test swift keys creation/removal 2011-10-12 15:37:33 -07:00
Josh Durgin
321381d75f teuthology-worker: remove --keep-locked-on-error 2011-10-07 14:51:46 -07:00
Josh Durgin
3d3eb0efea Remove --keep-locked-on-error, and behave as if it were specified
This will help prevent machines with cephtest dirs still present from
being used. It's easy to unlock machines - the targets yaml fragment
is output during a run.
2011-10-07 14:49:53 -07:00
Josh Durgin
c56ab97442 reconnect: ignore SSHExceptions before the timeout expires
Fixes: #1587
2011-10-06 17:18:35 -07:00
Samuel Just
4722d468c6 task/watch_notify_stress: watch_notify_stress now thrashes clients
This should exercise the watch notify timeout code.

Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-10-06 14:34:44 -07:00
Sage Weil
4e61e4835e rgw: keep radosgw in foreground
It defaults to a daemon now.
2011-10-06 12:50:12 -07:00
Josh Durgin
107db6a913 Retry listing machines if the lock server goes down. 2011-10-04 17:21:00 -07:00
Sage Weil
39a1e76065 rgw: use normal logging mechanism
Keep capturing stdout/err, even though it should end up empty.

Signed-off-by: Sage Weil <sage@newdream.net>
2011-10-04 16:09:51 -07:00
Josh Durgin
7b7ff6e8ce teuthology-worker: clean up last_in_suite jobs
There's no reason not to delete them once they start.
2011-10-04 12:32:58 -07:00
Josh Durgin
3d3ba1ebb1 daemon-helper: detect the signal actually sent
I thought I fixed this when I implemented coverage collection, but I
guess it got lost in a rebase or something.
2011-10-04 12:17:19 -07:00
Josh Durgin
d305d61b86 ceph_manager: remove unused raw_pg_status method 2011-10-03 17:49:53 -07:00
Josh Durgin
8e031730c1 ceph_manager: run ceph -s as a normal program
This allows failures from it to be detected better.
2011-10-03 17:49:13 -07:00
Josh Durgin
bad609e63e teuthology-results: include passed tests in email 2011-10-03 17:11:53 -07:00
Josh Durgin
8bcd2a74ca teuthology-results: include reasons for failure in email 2011-10-03 17:08:29 -07:00
Josh Durgin
030161ed8d teuthology-ls: show reasons for failures with -v 2011-10-03 17:07:41 -07:00
Josh Durgin
1cad309d65 Add failure_reason to summary for the first failure detected.
For now, this is the exception raised during a task, the error found
in the central log, or coredumps found. More specific errors
(i.e. s3-tests had 3 failures) can be added later as exceptions raised
by tasks.
2011-10-03 17:07:41 -07:00
Josh Durgin
817b950494 radosbench: get coverage and cores 2011-10-03 17:07:41 -07:00
Samuel Just
fe1a271d69 watch_notify_stress.py: add ceph flags option
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-10-03 14:26:08 -07:00
Samuel Just
28d60172f6 ceph.py: add btrfs option
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-10-03 14:26:04 -07:00
Josh Durgin
ae196028ad nuke: keep up with renaming cfuse -> ceph-fuse 2011-10-03 09:56:42 -07:00
Sage Weil
2b601a32d0 radosgw-admin: test additional keys, log list/show/rm
Signed-off-by: Sage Weil <sage@newdream.net>
2011-10-03 09:45:11 -07:00
Sage Weil
b93a00771f tasks/radosgw-admin: test radosgw-admin tool
Not yet complete...
2011-10-03 09:45:11 -07:00
Sage Weil
afc2dc085a nuke: killall apache2 and radosgw too 2011-10-03 09:44:58 -07:00
Greg Farnum
9b44469e5e s3-tests: use radosgw-admin instead of radosgw_admin
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-09-30 09:26:42 -07:00
Josh Durgin
52427589a6 ceph_manager: parse osd numbers with dots
This is necessary since wip-dot-names was merged.
2011-09-29 09:09:31 -07:00
Sage Weil
a92fef77dc rename c* -> ceph-*
Leave cfuse task name unchanged for now...
2011-09-23 08:57:18 -07:00
Josh Durgin
c7ff5fc38b queue: results_timeout needs to be converted to a string 2011-09-22 18:23:42 -07:00
Samuel Just
ef56a72b73 task/watch_notify_stress.py: add simple watch_notify stress test
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-09-22 13:25:21 -07:00
Josh Durgin
c3c262656d schedule: put results timeout in the job
The default was always being used instead.
2011-09-21 11:05:33 -07:00
Greg Farnum
e4dfe3d4bd lockfile: increase interval to prevent incorrect locking orders
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-09-20 10:04:01 -07:00
Greg Farnum
5ff88d1902 lockfile: don't fail cleanup if no lock procs exist
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-09-20 10:03:33 -07:00
Tommi Virtanen
0d5dbfa27e workunit: Fetch source from github.
Needed an elaborate dance because Github won't let us download
an archive of a subdirectory.

Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
2011-09-16 11:32:15 -07:00
Tommi Virtanen
5583fac383 s3tests: Clone repository from github.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
2011-09-16 11:09:45 -07:00
Tommi Virtanen
4d92c355ec coverage: Fetch source from github.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
2011-09-16 11:08:38 -07:00
Samuel Just
4a0f8fee54 ceph.py: remove unused variables mds_daemons and mon_daemons
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
2011-09-15 17:26:38 -07:00