Greg Farnum
d5eb2c2b77
ceph task: split up arguments a little more
...
This allows selective daemon kill signal changes. With valgrind
daemons we want term instead of kill, for instance.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-17 10:30:24 -07:00
Greg Farnum
5323e1796f
valgrind: move valgrind logs to log dir
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-17 10:29:54 -07:00
Greg Farnum
aa74481728
ceph: split up daemon-running arguments and insert valgrind ones
...
This setup should let us insert other kinds of things too, if we
need them.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-15 15:35:42 -07:00
Greg Farnum
9ec19f13df
ceph: Set up valgrind as a flavor, and create a dir for logging.
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-15 15:32:23 -07:00
Greg Farnum
98ac89a54e
ceph task: pass the full config to the daemon startup subs
...
So far as I can tell there is no reason to reduce them to
the coverage config, and I want the full config for my
soon-to-exist valgrind options.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-15 15:31:18 -07:00
Tommi Virtanen
747deecaf6
Add assert to catch simple typos in roles list.
...
Input of "roles:\n- [mds,1]" used to make teuthology crash
in a non-obviou way.
2011-08-15 09:36:06 -07:00
Greg Farnum
0139323e51
Merge branch 'wip-nuke'
...
Conflicts:
teuthology/task/kernel.py
2011-08-10 16:16:25 -07:00
Greg Farnum
6938946a19
manypools: remove commented-out code
...
This accidentally got left in from my development.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 16:12:53 -07:00
Greg Farnum
1ea825c643
teuthology-nuke: split the big main function
...
It was getting a bit big, but now all the functions fit on
one screen each.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 16:06:45 -07:00
Greg Farnum
af0d7c5e44
teuthology-nuke: move it into its own file.
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 15:38:57 -07:00
Greg Farnum
453a0f99d4
teuthology-nuke: identify and reboot machines with kernel mounts
...
This includes untested code for just force-unmounting them
when that works again, but for now it does a full reboot-and-
reconnect cycle.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:46 -07:00
Greg Farnum
9566008468
teuthology-nuke: use a more robust cfuse mount finder
...
This way it can remove cfuse mounts in any location on
the system.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:41 -07:00
Greg Farnum
257d63137f
teuthology-nuke: split out different pieces into different loops
...
This will let us behave more intelligently on things like
nuking kernel mounts.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:36 -07:00
Greg Farnum
b5859f877a
Move reconnect function from kernel task to misc.py
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:24 -07:00
Tommi Virtanen
7fd798a347
Configure grub to default to the right kernel, not the greatest installed one.
...
This is sticky; that is, even if you install other kernels (manually/via fab/etc),
grub will keep booting up the one that was last enabled via teuthology config.
Use teuthology to switch kernels and it'll just work.
If the kernel the grub default points to is removed, grub will fall back to
booting the kernel with the greatest version number.
Closes: http://tracker.newdream.net/issues/1364
2011-08-10 13:40:00 -07:00
Sage Weil
e98b218d48
restructure in terms of collections
2011-08-10 13:36:34 -07:00
Tommi Virtanen
39e22e4c0a
Handle socket.timeout when waiting for a reconnect.
...
Now it gets ignored, just like the other harmless socket errors.
2011-08-10 13:22:14 -07:00
Tommi Virtanen
742109f4d9
Wait up to 300 seconds for a reboot.
...
At least sepia86 was reliably slower than the previous 180 second default.
2011-08-10 13:21:39 -07:00
Sage Weil
23874712a6
add workunits on top of rbd
2011-08-10 12:55:12 -07:00
Sage Weil
bf7b1dd4a7
ceph: fix max_mds calculation
...
Signed-off-by: Sage Weil <sage@newdream.net>
2011-08-10 12:47:20 -07:00
Greg Farnum
a1f3cac0b6
kernel: comment reconnect task, clean up reporting
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 09:07:48 -07:00
Greg Farnum
663bbf8b2b
manypools: remove commented-out code
...
This accidentally got left in from my development.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-09 16:53:46 -07:00
Tommi Virtanen
1ccdcb9896
Make rbd task use mnt.N not mnt.client.N as mountpoint.
...
Everything else expects this, so e.g. workunits wouldn't work with rbd.
2011-08-09 16:25:00 -07:00
Tommi Virtanen
780ebcdf1b
Make sure workunit task does not create mnt.N by itself.
...
This used to hide a bug in the rbd task, where rbd
created the mountpoint with the wrong name. The workunits
ended up running against the local filesystem.
2011-08-09 16:11:32 -07:00
Tommi Virtanen
4e2ec6fd8b
Add interactive-on-error, to pause and explore on error.
...
Closes: http://tracker.newdream.net/issues/1291
2011-08-09 15:42:17 -07:00
Stephon Striplin
eee1d9a9e4
allow s3tests.create_users defaults be overridden
2011-08-09 14:28:08 -07:00
Tommi Virtanen
4241185ad7
Add simple unit test for get_clients.
2011-08-09 13:40:56 -07:00
Sage Weil
b5ba155c17
Revert "fix get_clients"
...
This reverts commit 83b6678e79
. The bug I was
hitting was actually fxied by 06e3e69c29
.
2011-08-09 13:23:58 -07:00
Gregory Farnum
137f36d533
teuthology: add task manypools
...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-08 15:13:21 -07:00
Sage Weil
01fac3e2c6
new gitbuilder ref/branch naming
...
no origin_ prefix
2011-08-05 14:35:44 -07:00
Sage Weil
3f2ad30aca
cfuse, kclient: print remote host
2011-08-05 14:35:44 -07:00
Sage Weil
83b6678e79
fix get_clients
...
Only return the clients that are listed (not _all_ clients). There might
be a combination of cfuse and kclient (or other) clients here!
2011-08-05 14:35:44 -07:00
Sage Weil
06e3e69c29
tasks/kclient: don't clobber remote
2011-08-05 14:35:43 -07:00
Sage Weil
ef2b80910a
use coverage_dir
2011-08-05 14:35:43 -07:00
Josh Durgin
f38c3697fd
kernel: install in parallel
2011-08-05 11:17:28 -07:00
Josh Durgin
f66c010ef5
kernel: debug weird socket exceptions
2011-08-05 11:08:02 -07:00
Josh Durgin
6df0d71abf
kernel: reboot immediately after installing
...
This hides the latency of rebooting when installing on many machines.
2011-08-05 11:07:40 -07:00
Josh Durgin
3e6b17f1b8
Down machines shouldn't be considered free.
2011-08-05 10:59:16 -07:00
Josh Durgin
68e6f2b77e
Make scheduled tasks leave some machines free.
2011-08-04 18:32:57 -07:00
Josh Durgin
4e399da700
Log connections to targets
...
This way you can tell which machines have problems in case of an
error.
2011-08-04 18:25:43 -07:00
Josh Durgin
ac0a469b78
teuthology-worker: log to a file with timestamps
2011-08-03 15:28:46 -07:00
Josh Durgin
5897d7b95d
teuthology-nuke: run in parallel, and print each node being nuked
2011-08-03 14:52:55 -07:00
Josh Durgin
30a8dac323
Set success at the beginning of a run.
...
This way internal tasks like locking can tell whether the run
succeeded, and unlock nodes if it did.
2011-08-03 14:03:13 -07:00
Josh Durgin
e8676ce0eb
teuthology-nuke: reset rsyslog config
2011-08-03 11:21:32 -07:00
Josh Durgin
d079d51037
teuthology-worker: keep machines locked on error
...
This prevents a failure to clean up in one case from affecting the
rest of the tests.
2011-08-02 17:59:08 -07:00
Josh Durgin
c7b62feac8
teuthology-lock: update usage
2011-08-02 16:15:57 -07:00
Josh Durgin
cd04423541
teuthology-lock: allow list of locks to be filtered by owner and status
2011-08-02 15:56:14 -07:00
Greg Farnum
6ac6f7ab38
teuthology: convert from bzip2 to gzip.
...
gzip is much, much faster on large log files. With a 7.7GB client log, gzip
took 2:45 to compress it to 624MB. bzip2 took 34:38 to compress it to
366MB. For our purposes the space savings are not worth the time loss.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-07-29 10:35:02 -07:00
Sage Weil
277c4ff7aa
set max_mds based on non-standbys
2011-07-28 10:25:30 -07:00
Sage Weil
07745f8a51
no ++ in python
2011-07-27 11:45:20 -07:00