Josh Durgin
3d69965c42
workunits: remove unused variable
2011-08-31 16:49:05 -07:00
Josh Durgin
697f3419d0
nuke: add option to reboot all nodes
2011-08-31 14:37:16 -07:00
Josh Durgin
ec768ba3ca
Fix pyflakes warnings.
2011-08-31 14:36:01 -07:00
Josh Durgin
5dd50b3990
coverage: remove debugging
2011-08-30 17:21:36 -07:00
Josh Durgin
5b42b08527
workunit: save coverage and coredumps
...
Anything that runs a ceph utility should be using these commands.
2011-08-30 17:13:44 -07:00
Greg Farnum
6d91915217
workunits: rework a little bit to allow "all" clients in a run
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-30 15:49:10 -07:00
Sage Weil
ec97dd8203
cfuse: support running through valgrind
...
Also switch up the config code so we can take per-client options.
2011-08-30 13:34:52 -07:00
Greg Farnum
0c2bee1514
valgrind: don't run valgrind_post if there's no valgrind
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-29 16:47:22 -07:00
Greg Farnum
3a3c859f5b
valgrind: scan logs for bad results
...
It's not sophisticated but it will warn you about a node
if at least one node has issues.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-29 14:03:02 -07:00
Greg Farnum
50a648bdfc
valgrind: use xml output for tools that support it
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-29 14:03:02 -07:00
Josh Durgin
7be9eaa030
suite: add option to send an email if the entire suite passed
2011-08-29 12:42:45 -07:00
Josh Durgin
4f4227a44d
Generate coverage at the end of a suite run,
...
and optionally email failures and ongoing jobs.
2011-08-29 10:23:12 -07:00
Josh Durgin
2b66938232
queue: delete every job when it finishes, so only running jobs are buried
2011-08-29 10:23:12 -07:00
Josh Durgin
b582764b62
Add teuthology-coverage for analyzing test coverage for a suite run.
2011-08-29 10:23:12 -07:00
Josh Durgin
0545e2fa68
Add scripts to analyze coverage for a single teuthology run.
2011-08-29 10:23:12 -07:00
Greg Farnum
fb33ef3c69
thrasher: improve documentation a little
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-25 15:27:30 -07:00
Greg Farnum
83e263425a
thrasher: add option to mark OSDs down instead of out.
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-25 15:19:30 -07:00
Greg Farnum
0f9b74e28c
thrasher: allow a config to set values
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-25 15:18:42 -07:00
Greg Farnum
5d5de0e70c
thrasher: remove redundant wait_till_clean()
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-25 14:38:34 -07:00
Greg Farnum
1130e5fe37
coverage: create dir conditionally
...
We don't need to create the dir if we aren't using coverage.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-24 16:48:14 -07:00
Greg Farnum
0840d05a8f
lockfile: add a lockfile task
...
This allows pretty highly configurable testing of
fcntl locking via a teuthology task.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-24 16:39:23 -07:00
Sage Weil
c2c5ac0c54
lock: --list-targets: list locks and dump result in targets: yaml format.
2011-08-24 10:03:43 -07:00
Sage Weil
42318c57cb
check ceph cluster log for badness (ERR, WRN, SEC)
2011-08-23 21:00:26 -07:00
Sage Weil
21d04419b8
ceph: copy cluster log file to archive/ceph.log
2011-08-22 22:04:57 -07:00
Sage Weil
e79dda9a9d
workunits: set CEPH_CONF environment
...
This allows any ceph util we run (including the rados-api tests) find
the config and keyrings they need.
2011-08-21 17:26:15 -07:00
Sage Weil
aa575c1318
rbd: make default image 10G instead of 1G
2011-08-21 15:14:02 -07:00
Sage Weil
cfd5456f3f
suite: support a suite consisting of multiple collections
...
suite = many collections, and maybe some shared files
collection = a collection of facets
facet = a config fragment
2011-08-18 16:30:08 -07:00
Greg Farnum
e20bae2a7f
valgrind: Document!
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-17 10:35:37 -07:00
Greg Farnum
73de620c9e
Merge branch 'wip-valgrind'
2011-08-17 10:32:57 -07:00
Greg Farnum
4efc95fa57
include log in valgrind log file names
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-17 10:30:26 -07:00
Greg Farnum
d5eb2c2b77
ceph task: split up arguments a little more
...
This allows selective daemon kill signal changes. With valgrind
daemons we want term instead of kill, for instance.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-17 10:30:24 -07:00
Greg Farnum
5323e1796f
valgrind: move valgrind logs to log dir
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-17 10:29:54 -07:00
Greg Farnum
aa74481728
ceph: split up daemon-running arguments and insert valgrind ones
...
This setup should let us insert other kinds of things too, if we
need them.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-15 15:35:42 -07:00
Greg Farnum
9ec19f13df
ceph: Set up valgrind as a flavor, and create a dir for logging.
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-15 15:32:23 -07:00
Greg Farnum
98ac89a54e
ceph task: pass the full config to the daemon startup subs
...
So far as I can tell there is no reason to reduce them to
the coverage config, and I want the full config for my
soon-to-exist valgrind options.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-15 15:31:18 -07:00
Tommi Virtanen
747deecaf6
Add assert to catch simple typos in roles list.
...
Input of "roles:\n- [mds,1]" used to make teuthology crash
in a non-obviou way.
2011-08-15 09:36:06 -07:00
Greg Farnum
0139323e51
Merge branch 'wip-nuke'
...
Conflicts:
teuthology/task/kernel.py
2011-08-10 16:16:25 -07:00
Greg Farnum
6938946a19
manypools: remove commented-out code
...
This accidentally got left in from my development.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 16:12:53 -07:00
Greg Farnum
1ea825c643
teuthology-nuke: split the big main function
...
It was getting a bit big, but now all the functions fit on
one screen each.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 16:06:45 -07:00
Greg Farnum
af0d7c5e44
teuthology-nuke: move it into its own file.
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 15:38:57 -07:00
Greg Farnum
453a0f99d4
teuthology-nuke: identify and reboot machines with kernel mounts
...
This includes untested code for just force-unmounting them
when that works again, but for now it does a full reboot-and-
reconnect cycle.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:46 -07:00
Greg Farnum
9566008468
teuthology-nuke: use a more robust cfuse mount finder
...
This way it can remove cfuse mounts in any location on
the system.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:41 -07:00
Greg Farnum
257d63137f
teuthology-nuke: split out different pieces into different loops
...
This will let us behave more intelligently on things like
nuking kernel mounts.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:36 -07:00
Greg Farnum
b5859f877a
Move reconnect function from kernel task to misc.py
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:24 -07:00
Tommi Virtanen
7fd798a347
Configure grub to default to the right kernel, not the greatest installed one.
...
This is sticky; that is, even if you install other kernels (manually/via fab/etc),
grub will keep booting up the one that was last enabled via teuthology config.
Use teuthology to switch kernels and it'll just work.
If the kernel the grub default points to is removed, grub will fall back to
booting the kernel with the greatest version number.
Closes: http://tracker.newdream.net/issues/1364
2011-08-10 13:40:00 -07:00
Tommi Virtanen
39e22e4c0a
Handle socket.timeout when waiting for a reconnect.
...
Now it gets ignored, just like the other harmless socket errors.
2011-08-10 13:22:14 -07:00
Tommi Virtanen
742109f4d9
Wait up to 300 seconds for a reboot.
...
At least sepia86 was reliably slower than the previous 180 second default.
2011-08-10 13:21:39 -07:00
Sage Weil
bf7b1dd4a7
ceph: fix max_mds calculation
...
Signed-off-by: Sage Weil <sage@newdream.net>
2011-08-10 12:47:20 -07:00
Greg Farnum
a1f3cac0b6
kernel: comment reconnect task, clean up reporting
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 09:07:48 -07:00
Greg Farnum
663bbf8b2b
manypools: remove commented-out code
...
This accidentally got left in from my development.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-09 16:53:46 -07:00