Commit Graph

501 Commits

Author SHA1 Message Date
Josh Durgin
3d69965c42 workunits: remove unused variable 2011-08-31 16:49:05 -07:00
Josh Durgin
697f3419d0 nuke: add option to reboot all nodes 2011-08-31 14:37:16 -07:00
Josh Durgin
ec768ba3ca Fix pyflakes warnings. 2011-08-31 14:36:01 -07:00
Josh Durgin
5dd50b3990 coverage: remove debugging 2011-08-30 17:21:36 -07:00
Josh Durgin
5b42b08527 workunit: save coverage and coredumps
Anything that runs a ceph utility should be using these commands.
2011-08-30 17:13:44 -07:00
Greg Farnum
6d91915217 workunits: rework a little bit to allow "all" clients in a run
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-30 15:49:10 -07:00
Sage Weil
ec97dd8203 cfuse: support running through valgrind
Also switch up the config code so we can take per-client options.
2011-08-30 13:34:52 -07:00
Greg Farnum
0c2bee1514 valgrind: don't run valgrind_post if there's no valgrind
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-29 16:47:22 -07:00
Greg Farnum
3a3c859f5b valgrind: scan logs for bad results
It's not sophisticated but it will warn you about a node
if at least one node has issues.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-29 14:03:02 -07:00
Greg Farnum
50a648bdfc valgrind: use xml output for tools that support it
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-29 14:03:02 -07:00
Josh Durgin
7be9eaa030 suite: add option to send an email if the entire suite passed 2011-08-29 12:42:45 -07:00
Josh Durgin
4f4227a44d Generate coverage at the end of a suite run,
and optionally email failures and ongoing jobs.
2011-08-29 10:23:12 -07:00
Josh Durgin
2b66938232 queue: delete every job when it finishes, so only running jobs are buried 2011-08-29 10:23:12 -07:00
Josh Durgin
b582764b62 Add teuthology-coverage for analyzing test coverage for a suite run. 2011-08-29 10:23:12 -07:00
Josh Durgin
0545e2fa68 Add scripts to analyze coverage for a single teuthology run. 2011-08-29 10:23:12 -07:00
Greg Farnum
fb33ef3c69 thrasher: improve documentation a little
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-25 15:27:30 -07:00
Greg Farnum
83e263425a thrasher: add option to mark OSDs down instead of out.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-25 15:19:30 -07:00
Greg Farnum
0f9b74e28c thrasher: allow a config to set values
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-25 15:18:42 -07:00
Greg Farnum
5d5de0e70c thrasher: remove redundant wait_till_clean()
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-25 14:38:34 -07:00
Greg Farnum
1130e5fe37 coverage: create dir conditionally
We don't need to create the dir if we aren't using coverage.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-24 16:48:14 -07:00
Greg Farnum
0840d05a8f lockfile: add a lockfile task
This allows pretty highly configurable testing of
fcntl locking via a teuthology task.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-24 16:39:23 -07:00
Sage Weil
c2c5ac0c54 lock: --list-targets: list locks and dump result in targets: yaml format. 2011-08-24 10:03:43 -07:00
Sage Weil
42318c57cb check ceph cluster log for badness (ERR, WRN, SEC) 2011-08-23 21:00:26 -07:00
Sage Weil
21d04419b8 ceph: copy cluster log file to archive/ceph.log 2011-08-22 22:04:57 -07:00
Sage Weil
e79dda9a9d workunits: set CEPH_CONF environment
This allows any ceph util we run (including the rados-api tests) to find
the config and keyrings they need.
2011-08-21 17:26:15 -07:00
Sage Weil
aa575c1318 rbd: make default image 10G instead of 1G 2011-08-21 15:14:02 -07:00
Sage Weil
cfd5456f3f suite: support a suite consisting of multiple collections
suite = many collections, and maybe some shared files
collection = a collection of facets
facet = a config fragment
2011-08-18 16:30:08 -07:00
Greg Farnum
e20bae2a7f valgrind: Document!
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-17 10:35:37 -07:00
Greg Farnum
73de620c9e Merge branch 'wip-valgrind' 2011-08-17 10:32:57 -07:00
Greg Farnum
4efc95fa57 include log in valgrind log file names
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-17 10:30:26 -07:00
Greg Farnum
d5eb2c2b77 ceph task: split up arguments a little more
This allows selective daemon kill signal changes. With valgrind
daemons we want term instead of kill, for instance.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-17 10:30:24 -07:00
Greg Farnum
5323e1796f valgrind: move valgrind logs to log dir
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-17 10:29:54 -07:00
Greg Farnum
aa74481728 ceph: split up daemon-running arguments and insert valgrind ones
This setup should let us insert other kinds of things too, if we
need them.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-15 15:35:42 -07:00
Greg Farnum
9ec19f13df ceph: Set up valgrind as a flavor, and create a dir for logging.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-15 15:32:23 -07:00
Greg Farnum
98ac89a54e ceph task: pass the full config to the daemon startup subs
So far as I can tell there is no reason to reduce them to
the coverage config, and I want the full config for my
soon-to-exist valgrind options.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-15 15:31:18 -07:00
Tommi Virtanen
747deecaf6 Add assert to catch simple typos in roles list.
Input of "roles:\n- [mds,1]" used to make teuthology crash
in a non-obvious way.
2011-08-15 09:36:06 -07:00
Greg Farnum
0139323e51 Merge branch 'wip-nuke'
Conflicts:
	teuthology/task/kernel.py
2011-08-10 16:16:25 -07:00
Greg Farnum
6938946a19 manypools: remove commented-out code
This accidentally got left in from my development.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 16:12:53 -07:00
Greg Farnum
1ea825c643 teuthology-nuke: split the big main function
It was getting a bit big, but now all the functions fit on
one screen each.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 16:06:45 -07:00
Greg Farnum
af0d7c5e44 teuthology-nuke: move it into its own file.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 15:38:57 -07:00
Greg Farnum
453a0f99d4 teuthology-nuke: identify and reboot machines with kernel mounts
This includes untested code for just force-unmounting them
when that works again, but for now it does a full reboot-and-
reconnect cycle.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:46 -07:00
Greg Farnum
9566008468 teuthology-nuke: use a more robust cfuse mount finder
This way it can remove cfuse mounts in any location on
the system.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:41 -07:00
Greg Farnum
257d63137f teuthology-nuke: split out different pieces into different loops
This will let us behave more intelligently on things like
nuking kernel mounts.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:36 -07:00
Greg Farnum
b5859f877a Move reconnect function from kernel task to misc.py
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:24 -07:00
Tommi Virtanen
7fd798a347 Configure grub to default to the right kernel, not the greatest installed one.
This is sticky; that is, even if you install other kernels (manually/via fab/etc),
grub will keep booting up the one that was last enabled via teuthology config.
Use teuthology to switch kernels and it'll just work.

If the kernel the grub default points to is removed, grub will fall back to
booting the kernel with the greatest version number.

Closes: http://tracker.newdream.net/issues/1364
2011-08-10 13:40:00 -07:00
Tommi Virtanen
39e22e4c0a Handle socket.timeout when waiting for a reconnect.
Now it gets ignored, just like the other harmless socket errors.
2011-08-10 13:22:14 -07:00
Tommi Virtanen
742109f4d9 Wait up to 300 seconds for a reboot.
At least sepia86 was reliably slower than the previous 180 second default.
2011-08-10 13:21:39 -07:00
Sage Weil
bf7b1dd4a7 ceph: fix max_mds calculation
Signed-off-by: Sage Weil <sage@newdream.net>
2011-08-10 12:47:20 -07:00
Greg Farnum
a1f3cac0b6 kernel: comment reconnect task, clean up reporting
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 09:07:48 -07:00
Greg Farnum
663bbf8b2b manypools: remove commented-out code
This accidentally got left in from my development.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-09 16:53:46 -07:00
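
Commit 747deecaf6 ("Add assert to catch simple typos in roles list") describes a check for input such as "roles:\n- [mds,1]". The following is a minimal sketch of that kind of check, not the code from the commit. A YAML parser reads `[mds,1]` as a list holding the string `mds` and the integer `1`, so checking that every role is a string catches the typo before it causes an obscure failure later. The function name `validate_roles` and the error messages are hypothetical, and the sketch is written for Python 3.

```python
def validate_roles(roles):
    """Fail early if the parsed roles list is not a list of lists of strings.

    Hypothetical helper for illustration; `roles` is assumed to be the
    already-parsed value of the `roles:` key from a job config.
    """
    assert isinstance(roles, list), "roles must be a list, got %r" % (roles,)
    for index, node_roles in enumerate(roles):
        # Each entry lists the roles assigned to one node.
        assert isinstance(node_roles, list), (
            "roles[%d] must be a list, got %r" % (index, node_roles)
        )
        for role in node_roles:
            # A typo like [mds,1] yields the int 1 instead of a string role.
            assert isinstance(role, str), (
                "roles[%d] contains a non-string role %r" % (index, role)
            )


# Well-formed input passes silently.
validate_roles([["mon.a", "mds.1"], ["client.0"]])

# The typo from the commit message parses to [["mds", 1]] and is rejected.
try:
    validate_roles([["mds", 1]])
except AssertionError as exc:
    print("rejected:", exc)
```

Because `assert` statements are skipped when Python runs with `-O`, a check like this suits catching configuration typos during normal runs rather than enforcing a hard guarantee.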