Commit Graph

614 Commits

Author SHA1 Message Date
Sam Lang
5ce4d70e4b fix error on teardown failing to unmount /mnt 2012-09-18 15:56:08 -07:00
tamil
78b7b02c07 imported subprocess module in nuke script
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2012-09-14 15:04:40 -07:00
Josh Durgin
d27806a293 nuke: add missing import 2012-09-13 14:31:46 -07:00
Josh Durgin
c8c7014fc0 rbd: fix typo and cast to int before comparing format 2012-09-13 14:29:43 -07:00
Josh Durgin
055bf73d50 rbd: only specify --format if not using the default
This lets older versions that do not support --format still work with
format 1 images.
2012-09-12 11:31:28 -07:00
Tommi Virtanen
79607eed3c Don't lose tracebacks of exceptions raised in a greenlet.
Exception objects don't contain the traceback of where they were
raised from (to avoid cyclic data structures wrecking gc and causing
mem leaks), so the singular "raise obj" form creates a new traceback
from the current execution location, thus losing the original location
of the error.

Gevent explicitly wants to throw away the traceback, to release any
objects the greenlet may still be referring to, closing files,
releasing locks etc. In this case, we think it's safe, so stash the
exception info away in a holder object, and resurrect it on the other
side of the results queue.

http://stackoverflow.com/questions/9268916/how-to-capture-a-traceback-in-gevent

This can be reproduced easily with

	from teuthology.parallel import parallel
	def f():
	    raise RuntimeError("bork")
	with parallel() as p:
	    p.spawn(f)

and looking at the resulting traceback with and without this change.
2012-09-11 11:25:21 -07:00
Alex Elder
f64cedf4db rbd: allow xfstests task to specify rbd image formats
This adds the ability to specify the rbd image format to use for the
scratch and test devices for the rbd.xfstests task.

Signed-off-by: Alex Elder <elder@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
2012-09-10 19:38:21 -05:00
Alex Elder
73a29cdf91 rbd: allow image format to be specified
This adds the ability to specify an rbd image format (either 1 or 2)
for an rbd image.

Signed-off-by: Alex Elder <elder@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
2012-09-10 19:37:25 -05:00
tamil
39efbbcc2d Suppress valgrind error "Invalid write 8"
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2012-09-10 15:02:47 -07:00
Yehuda Sadeh
d6c2ded087 radosgw-admin: update task for new usage reporting
Usage reporting output has been modified, also use the new
--categories input param.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
2012-09-10 11:24:25 -07:00
Sage Weil
3473c2ed1d s3tests: run against arbitrary branch/sha1 of s3-tests.git 2012-09-10 11:08:57 -07:00
Sage Weil
db8037d998 debian ntp servers 2012-09-09 14:23:12 -07:00
Mike Ryan
f8e1f5c222 task: die on ceph error or coredump
This task allows ceph to signal to teuth that it should die immediately
by touching a file under /tmp/cephtest

Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
2012-09-04 09:52:38 -07:00
Sage Weil
dc1c247abc disable lockdep recursive warnings until #3040 is fixed 2012-08-24 19:23:34 -07:00
Sage Weil
b6b302890f internal: fix escaping of \b in syslog grep 2012-08-23 11:00:39 -07:00
Sage Weil
82cefa2477 suppress this valgrind error
<error>
  <unique>0x4</unique>
  <tid>1</tid>
  <kind>InvalidWrite</kind>
  <what>Invalid write of size 8</what>
  <stack>
    <frame>
      <ip>0x400A299</ip>
      <obj>/lib/x86_64-linux-gnu/ld-2.15.so</obj>
      <fn>do_lookup_x</fn>
      <dir>/build/buildd/eglibc-2.15/elf</dir>
      <file>dl-lookup.c</file>
      <line>250</line>
    </frame>
    <frame>
      <ip>0x403122F</ip>
    </frame>
    <frame>
      <ip>0x400A522</ip>
      <obj>/lib/x86_64-linux-gnu/ld-2.15.so</obj>
      <fn>_dl_lookup_symbol_x</fn>
      <dir>/build/buildd/eglibc-2.15/elf</dir>
      <file>dl-lookup.c</file>
      <line>739</line>
    </frame>
  </stack>
  <auxwhat>Address 0x7feffeec8 is on thread 1's stack</auxwhat>
</error>

pops up recently
2012-08-23 11:00:39 -07:00
Sage Weil
b800496bb4 ceph: fix cpu_profile default 2012-08-19 20:16:43 -07:00
Sage Weil
7d50411ca9 rbd.xfstests: default to 1gb (not 250mb) image 2012-08-18 20:10:54 -07:00
Mike Ryan
5b7ec43e0e task: run osd/mds/mon with Google CPU profiler via cpu_profile option
Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
2012-08-17 13:47:13 -07:00
Mike Ryan
7f6591b556 ceph: support tmpfs_journal option to put journal on tmpfs
Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
2012-08-16 15:50:10 -07:00
Sage Weil
6dbbcf03fc queue: fix logging of child return code 2012-08-14 15:08:21 -07:00
Tommi Virtanen
99ac6b0b3e Disable asynchronous DNS lookups.
Especially on older hosts, we keep triggering errors::

  ServerNotFoundError: Unable to find the server at
  teuthology.front.sepia.ceph.com: [Errno 3] name does not exist

That comes from libevent's evdns via gevent.dns and httplib2. The rate
of these errors is low enough that they seem to be perhaps timeouts,
or more arbitrary. Busy looping on DNS resolution calls has never
triggered them, so far.

With ``monkey.patch_all(dns=False)``, the teuthology process will
block as a whole whenever doing DNS resolution. This will hopefully be
rare enough that it won't matter.

The only real "fix" seems to be upgrading libraries and hoping for the
best; this commit can be reverted after that is done.
2012-08-13 16:18:33 -07:00
Tommi Virtanen
273a43eda8 Flush data to temp file before reading it in another process. 2012-08-09 09:42:35 -07:00
Tommi Virtanen
8aaf21d537 Oops tempfile now gives us file objects not fds. 2012-08-09 09:42:13 -07:00
Tommi Virtanen
99e99758e5 In teuthology-worker, shuffle the child stdout/stderr into our log.
Otherwise, child can suffer a failure that does not get logged by
it's own exception handling machinery, and we have no idea why.
2012-08-08 14:48:21 -07:00
Tommi Virtanen
05007f7e0f Minimize scope of try-except.
os.write and list.append won't raise CalledProcessError, and now
we don't need to try to contain them for temp file clean up reasons.
2012-08-08 14:45:49 -07:00
Tommi Virtanen
4b9e17626d Use tempfile.NamedTemporaryFile instead of mkstemp.
Simpler code, no manual cleanup needed. We see a littering of
zero-length temp files from teuthology-worker, and this seems
like a likely source.
2012-08-08 14:44:47 -07:00
Mike Ryan
3b85b2311b task: verify scrub detects files whose contents changed
Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
2012-08-02 11:14:51 -07:00
Mike Ryan
8665bdc164 task: scrub OSDs periodically
Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
2012-08-02 11:14:51 -07:00
Sage Weil
e4e239e268 kernel: push a local .deb instead of using gitbuilder
This lets you specify a path to an existing kernel deb
to be pushed and installed on the remote node.

Limitations:
 - We don't build the deb for you.  figuring out what
   filename 'make deb-pkg' is building is annoying.
 - We need to be able to figure out the sha1 from the provided
   path.  It shouldn't be a problem, given the way make deb-pkg
   names the debs.
2012-07-29 12:40:13 -07:00
Sage Weil
1c93d5ab4d syslog check: fix false-positive BUG matches in random strings 2012-07-29 12:15:51 -07:00
Sage Weil
a0847694a5 osd_recovery: also test unfound discovery
This tests for bug #2866.
2012-07-28 10:53:09 -07:00
Sage Weil
8dd09cb21d osd_recovery: test incomplete pg recovery
4-osd test to reproduce #2860 and confirm the fix.
2012-07-28 10:23:18 -07:00
Sage Weil
a9f2bf622f ceph_manager: wait_for_active 2012-07-28 10:23:18 -07:00
Sage Weil
731d520900 ceph_manager: count 'incomplete' as 'down' 2012-07-28 10:23:18 -07:00
tamil
0d6ce42405 Fixed the code to pass 'yes' during mkfs
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2012-07-26 13:48:11 -07:00
tamil
2b75ddef63 Added '-y' option for mkfs.ext4
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2012-07-25 16:38:25 -07:00
Sage Weil
9bc86171ac admin_socket: make test optional
If it's not there, we just verify the output is valid json.
2012-07-24 15:26:06 -07:00
Sage Weil
f70b825042 ceph: fix mkfs/mount option defaults
Later code expects a list, not None.
2012-07-21 20:18:24 -07:00
Samuel Just
e1c98e7d19 tasks: add multibench task for testing pool creation
Also adds support for specifying a pool for radosbench
to create and then cleanup instead of "data".

Signed-off-by: Samuel Just <sam.just@inktank.com>
2012-07-19 15:51:55 -07:00
Sage Weil
c49daeca2f clock: print skew with ntp servers to log to help debug time issues 2012-07-18 13:44:59 -07:00
Sage Weil
55847fc298 nuke: log what pid we are killing when we kill it 2012-07-18 11:04:30 -07:00
Sage Weil
12dc0ad101 ceph: archive mon data to a .tgz
Saves bandwidth, time, and space.
2012-07-17 10:00:59 -07:00
Sage Weil
ff0f4742fe set machine description to ctx.archive when auto-locking machines for a run 2012-07-16 10:53:25 -07:00
Sage Weil
042edcbe1e schedule/suite: schedule job, suite N times 2012-07-14 13:51:51 -07:00
Sage Weil
f37214e114 kernel: fix kernel installation when kdb: is specified
This normalize check would only trigger if a *single* key was specified.
Change it so that it triggers as long as all keys are in the list of valid
keys. This lets us specify both kdb: true and a sha1/branch/tag.

Phew!
2012-07-13 13:57:22 -07:00
Sage Weil
b0985e446b ceph: add default btrfs mkfs options 2012-07-13 11:30:21 -07:00
Sage Weil
014fb97f5f ceph: cleanup/simplify mount/mkfs options 2012-07-13 11:30:07 -07:00
Sage Weil
d0c9e49bb8 workunit: allow overrides
Pull top-level overrides into our config.  This lets you do:

overrides:
  workunit:
    branch: foo
tasks:
...
- workunit:
  clients:
    all:
    - foo
...
2012-07-13 11:13:41 -07:00
Sage Weil
a50bf67223 workunit: allow branch/sha1/tag to be specified
Pull the workunit(s) from the branch/tag/sha1 specified in the config.
2012-07-13 11:13:41 -07:00