Commit Graph

244 Commits

Author SHA1 Message Date
Sage Weil
cfd5456f3f suite: support a suite consisting of multiple collections
suite = many collections, and maybe some shared files
collection = a collection of facets
facet = a config fragment
2011-08-18 16:30:08 -07:00
Greg Farnum
e20bae2a7f valgrind: Document!
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-17 10:35:37 -07:00
Greg Farnum
73de620c9e Merge branch 'wip-valgrind' 2011-08-17 10:32:57 -07:00
Greg Farnum
4efc95fa57 include log in valgrind log file names
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-17 10:30:26 -07:00
Greg Farnum
d5eb2c2b77 ceph task: split up arguments a little more
This allows selective daemon kill signal changes. With valgrind
daemons we want term instead of kill, for instance.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-17 10:30:24 -07:00
Greg Farnum
5323e1796f valgrind: move valgrind logs to log dir
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-17 10:29:54 -07:00
Greg Farnum
aa74481728 ceph: split up daemon-running arguments and insert valgrind ones
This setup should let us insert other kinds of things too, if we
need them.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-15 15:35:42 -07:00
Greg Farnum
9ec19f13df ceph: Set up valgrind as a flavor, and create a dir for logging.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-15 15:32:23 -07:00
Greg Farnum
98ac89a54e ceph task: pass the full config to the daemon startup subs
So far as I can tell there is no reason to reduce them to
the coverage config, and I want the full config for my
soon-to-exist valgrind options.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-15 15:31:18 -07:00
Tommi Virtanen
747deecaf6 Add assert to catch simple typos in roles list.
Input of "roles:\n- [mds,1]" used to make teuthology crash
in a non-obviou way.
2011-08-15 09:36:06 -07:00
Greg Farnum
0139323e51 Merge branch 'wip-nuke'
Conflicts:
	teuthology/task/kernel.py
2011-08-10 16:16:25 -07:00
Greg Farnum
6938946a19 manypools: remove commented-out code
This accidentally got left in from my development.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 16:12:53 -07:00
Greg Farnum
1ea825c643 teuthology-nuke: split the big main function
It was getting a bit big, but now all the functions fit on
one screen each.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 16:06:45 -07:00
Greg Farnum
af0d7c5e44 teuthology-nuke: move it into its own file.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 15:38:57 -07:00
Greg Farnum
453a0f99d4 teuthology-nuke: identify and reboot machines with kernel mounts
This includes untested code for just force-unmounting them
when that works again, but for now it does a full reboot-and-
reconnect cycle.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:46 -07:00
Greg Farnum
9566008468 teuthology-nuke: use a more robust cfuse mount finder
This way it can remove cfuse mounts in any location on
the system.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:41 -07:00
Greg Farnum
257d63137f teuthology-nuke: split out different pieces into different loops
This will let us behave more intelligently on things like
nuking kernel mounts.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:36 -07:00
Greg Farnum
b5859f877a Move reconnect function from kernel task to misc.py
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 14:37:24 -07:00
Tommi Virtanen
7fd798a347 Configure grub to default to the right kernel, not the greatest installed one.
This is sticky; that is, even if you install other kernels (manually/via fab/etc),
grub will keep booting up the one that was last enabled via teuthology config.
Use teuthology to switch kernels and it'll just work.

If the kernel the grub default points to is removed, grub will fall back to
booting the kernel with the greatest version number.

Closes: http://tracker.newdream.net/issues/1364
2011-08-10 13:40:00 -07:00
Tommi Virtanen
39e22e4c0a Handle socket.timeout when waiting for a reconnect.
Now it gets ignored, just like the other harmless socket errors.
2011-08-10 13:22:14 -07:00
Tommi Virtanen
742109f4d9 Wait up to 300 seconds for a reboot.
At least sepia86 was reliably slower than the previous 180 second default.
2011-08-10 13:21:39 -07:00
Sage Weil
bf7b1dd4a7 ceph: fix max_mds calculation
Signed-off-by: Sage Weil <sage@newdream.net>
2011-08-10 12:47:20 -07:00
Greg Farnum
a1f3cac0b6 kernel: comment reconnect task, clean up reporting
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-10 09:07:48 -07:00
Greg Farnum
663bbf8b2b manypools: remove commented-out code
This accidentally got left in from my development.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-09 16:53:46 -07:00
Tommi Virtanen
1ccdcb9896 Make rbd task use mnt.N not mnt.client.N as mountpoint.
Everything else expects this, so e.g. workunits wouldn't work with rbd.
2011-08-09 16:25:00 -07:00
Tommi Virtanen
780ebcdf1b Make sure workunit task does not create mnt.N by itself.
This used to hide a bug in the rbd task, where rbd
created the mountpoint with the wrong name. The workunits
ended up running against the local filesystem.
2011-08-09 16:11:32 -07:00
Tommi Virtanen
4e2ec6fd8b Add interactive-on-error, to pause and explore on error.
Closes: http://tracker.newdream.net/issues/1291
2011-08-09 15:42:17 -07:00
Stephon Striplin
eee1d9a9e4 allow s3tests.create_users defaults be overridden 2011-08-09 14:28:08 -07:00
Tommi Virtanen
4241185ad7 Add simple unit test for get_clients. 2011-08-09 13:40:56 -07:00
Sage Weil
b5ba155c17 Revert "fix get_clients"
This reverts commit 83b6678e79904793bf31e82bbecad7bf16c1b2b5.  The bug I was
hitting was actually fxied by 06e3e69c293b20c0ce5df526fa923a979c1d8cfc.
2011-08-09 13:23:58 -07:00
Gregory Farnum
137f36d533 teuthology: add task manypools
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-08-08 15:13:21 -07:00
Sage Weil
01fac3e2c6 new gitbuilder ref/branch naming
no origin_ prefix
2011-08-05 14:35:44 -07:00
Sage Weil
3f2ad30aca cfuse, kclient: print remote host 2011-08-05 14:35:44 -07:00
Sage Weil
83b6678e79 fix get_clients
Only return the clients that are listed (not _all_ clients).  There might
be a combination of cfuse and kclient (or other) clients here!
2011-08-05 14:35:44 -07:00
Sage Weil
06e3e69c29 tasks/kclient: don't clobber remote 2011-08-05 14:35:43 -07:00
Sage Weil
ef2b80910a use coverage_dir 2011-08-05 14:35:43 -07:00
Josh Durgin
f38c3697fd kernel: install in parallel 2011-08-05 11:17:28 -07:00
Josh Durgin
f66c010ef5 kernel: debug weird socket exceptions 2011-08-05 11:08:02 -07:00
Josh Durgin
6df0d71abf kernel: reboot immediately after installing
This hides the latency of rebooting when installing on many machines.
2011-08-05 11:07:40 -07:00
Josh Durgin
3e6b17f1b8 Down machines shouldn't be considered free. 2011-08-05 10:59:16 -07:00
Josh Durgin
68e6f2b77e Make scheduled tasks leave some machines free. 2011-08-04 18:32:57 -07:00
Josh Durgin
4e399da700 Log connections to targets
This way you can tell which machines have problems in case of an
error.
2011-08-04 18:25:43 -07:00
Josh Durgin
ac0a469b78 teuthology-worker: log to a file with timestamps 2011-08-03 15:28:46 -07:00
Josh Durgin
5897d7b95d teuthology-nuke: run in parallel, and print each node being nuked 2011-08-03 14:52:55 -07:00
Josh Durgin
30a8dac323 Set success at the beginning of a run.
This way internal tasks like locking can tell whether the run
succeeded, and unlock nodes if it did.
2011-08-03 14:03:13 -07:00
Josh Durgin
e8676ce0eb teuthology-nuke: reset rsyslog config 2011-08-03 11:21:32 -07:00
Josh Durgin
d079d51037 teuthology-worker: keep machines locked on error
This prevents a failure to clean up in one case from affecting the
rest of the tests.
2011-08-02 17:59:08 -07:00
Josh Durgin
c7b62feac8 teuthology-lock: update usage 2011-08-02 16:15:57 -07:00
Josh Durgin
cd04423541 teuthology-lock: allow list of locks to be filtered by owner and status 2011-08-02 15:56:14 -07:00
Greg Farnum
6ac6f7ab38 teuthology: convert from bzip2 to gzip.
gzip is much, much faster on large log files. With a 7.7GB client log, gzip
took 2:45 to compress it to 624MB. bzip2 took 34:38 to compress it to
366MB. For our purposes the space savings are not worth the time loss.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
2011-07-29 10:35:02 -07:00