Commit Graph

1087 Commits

Author SHA1 Message Date
Sam Lang
fe513353c6 Merge pull request #11 from ceph/wip-4717
misc: Use tempfile.mkstemp() instead of tempnam
2013-04-15 11:44:59 -07:00
Sam Lang
3b0d91533e misc: Use tempfile.mkstemp() instead of tempnam
tempnam() is considered an unsafe security risk because the filename
generated is easy to guess and can be symlinked in advance.  Use
mkstemp() instead.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
Reviewed-by: Joe Buck <jbbuck@gmail.com>
2013-04-15 13:43:20 -05:00
Dan Mick
f69ddafde7 Revert "Revert "Install.py: Prevent prompts from breaking apt""
This reverts commit 67a616a979.

Sigh.  As it turns out, /etc/default/grub being hacked also
causes the same problem.  I think there's a way to fix that cleanly
as well, but until then, replacing the "accept installed version"
hack here so jobs can run.
2013-04-15 11:24:31 -07:00
Sam Lang
50aaece463 misc: Use pythonic 'is not None' for jobid case
The conditional 'if global_jobid:' evaluates to true
in some cases even when global_jobid is None.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-13 10:12:45 -05:00
Sam Lang
715245e100 misc: Fix name parsing
Use last two digits of year.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-12 19:04:19 -05:00
Dan Mick
67a616a979 Revert "Install.py: Prevent prompts from breaking apt"
This reverts commit 5995ae7e78.

With the changes to ceph-qa-chef and the teuthology kernel task,
we're no longer touching packaged file /etc/grub.d/10_linux, which
was the reason for this apt forcing.  Remove so that we find other
package problems that might be masked by this; we can always
put it back if there are such problems until we can fix those as well.

Signed-off-by: Dan Mick <dan.mick@inktank.com>
(cherry picked from commit c2b0828b19)
2013-04-12 15:49:24 -07:00
Dan Mick
52cdaae683 kernel.py: put submenu name in 01_ceph_kernel if necessary
We had been writing 01_ceph_kernel with the kernel title, and
relying on the fact that grub.cfg would never have submenus in it
(implemented by a hack to /etc/grub.d/10_linux which neutered its
submenu creation).  However, that hack was modifying a package file,
and got in the way of later apt commands.  Rather than doing it
that way, this divines the title of the submenu and sets the
default variable to "submenu>kernel", which works to select the
desired kernel.

It depends on there being only one level of submenu, and on the
format of the menuentry and submenu commands, dictated by grub2.
None of this is likely to work at all outside Ubuntu.

Fixes: #4496
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
(cherry picked from commit 52aec32a7d)
2013-04-12 15:49:15 -07:00
Samuel Just
2c7b1f3d62 peer.py: we can't assume pg query state will match mon pg state
The pg state could easily have changed in the mean time,
for example, from recovery_wait to recovering.

Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-12 15:02:16 -07:00
Sam Lang
35e6db72a1 Merge pull request #10 from ceph/wip-fix-importcycle
lock: Fix import cycle breakage

Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-04-12 13:06:55 -07:00
Dan Mick
c2b0828b19 Revert "Install.py: Prevent prompts from breaking apt"
This reverts commit 5995ae7e78.

With the changes to ceph-qa-chef and the teuthology kernel task,
we're no longer touching packaged file /etc/grub.d/10_linux, which
was the reason for this apt forcing.  Remove so that we find other
package problems that might be masked by this; we can always
put it back if there are such problems until we can fix those as well.

Signed-off-by: Dan Mick <dan.mick@inktank.com>
2013-04-12 10:57:41 -07:00
Sam Lang
a6b84a5613 lock: Fix import cycle breakage
fa2049f caused an import cycle between lock.py and misc.py.  Move the
needed functions from lock.py to lockstatus.py so that we can avoid the
import cycle.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-12 12:57:33 -05:00
wusui
a246f1e8ad Merge pull request #9 from ceph/wip-short-testdir
misc: Use job id and make short path for testdir

Reviewed-by: Warren Usui <warren.usui@inktank.com>
2013-04-12 10:22:15 -07:00
Dan Mick
52aec32a7d kernel.py: put submenu name in 01_ceph_kernel if necessary
We had been writing 01_ceph_kernel with the kernel title, and
relying on the fact that grub.cfg would never have submenus in it
(implemented by a hack to /etc/grub.d/10_linux which neutered its
submenu creation).  However, that hack was modifying a package file,
and got in the way of later apt commands.  Rather than doing it
that way, this divines the title of the submenu and sets the
default variable to "submenu>kernel", which works to select the
desired kernel.

It depends on there being only one level of submenu, and on the
format of the menuentry and submenu commands, dictated by grub2.
None of this is likely to work at all outside Ubuntu.

Fixes: #4496
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
2013-04-11 13:41:45 -07:00
Sam Lang
fa2049ffbb misc: Use job id and make short path for testdir
Nightlies run on teuthology currently use a testdir of
/home/ubuntu/cephtest, but this causes stale job errors occasionally
from the previous tests not getting properly cleaned up, which prevents
the nightlies from running successfully.

The misc.py get_testdir() function can specify a testdir that is
specific to the job, but previously the path was too long and would
cause separate job failures.

This patch does two things to resolve that.  First, it uses the job id
from the teuthology run if one exists.  This should be a relatively
short number that will identify the job run effectively.  Second,
if the job id isn't available, it creates a shortened form of the
job's name, for example the job name:

teuthology-2013-04-09_23:51:49-rgw-next-testing-basic

becomes:

te1304092351rntb

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-04-11 13:53:38 -05:00
Sandon Van Ness
e2938f4026 Fix for kdb: doesn't work on mira nodes
This is a fix for issue #4677 which was caused by kdb output being
hard-coded to ttyS1 which is fine for all our hardware except mira
machines. This change just checks to see if mira is in the host's
name and uses ttyS2 instead (simple fix).
2013-04-09 13:12:58 -07:00
Sandon Van Ness
9c9baef680 Fix: kdb: doesn't work on mira nodes
Change kernel.py to use ttyS2 for kdb output instead of ttyS1 when
the node is a mira machine. This is a fix for issue #4677
2013-04-09 13:09:39 -07:00
Joe Buck
826ad0d816 teuthology: fix for ssh-keys-task
Resolves an issue where we
were not properly escaping the generated
public key when doing matches against it.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewd-by: Sam Lang <sam.lang@inktank.com>
2013-04-08 11:54:55 -07:00
caleb miles
fa70eb8f67 radosgw-admin: Test bucket list for bucket starting with underscore.
Signed-off-by: caleb miles <caleb.miles@inktank.com>
2013-04-05 10:18:49 -07:00
Sandon Van Ness
41028847f8 Install.py: Prevent prompts from breaking apt
Change apt commands to prevent prompts from coming up (forcing
non-interactive mode) so things like grub or other stuff doesn't
break teuthology runs.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-04-04 19:40:21 -07:00
Sandon Van Ness
5995ae7e78 Install.py: Prevent prompts from breaking apt
Change apt commands to prevent prompts from coming up (forcing
non-interactive mode) so things like grub or other stuff doesn't
break teuthology runs.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-04-04 19:15:14 -07:00
Joe Buck
4824dafb3c teuthology: extend Hadoop task to support branches
Modify the Hadoop task to support branches
being specified for both the Apache and Inktank
Hadoop branches.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewd-by: Sam Lang <sam.lang@inktank.com>
2013-04-04 15:57:31 -07:00
Joe Buck
4258807bfc teuthology: remove previous test ssh keys
Updated the ssh-keys task to cleanup
any left-over keys from previous tasks
(indicated by the user being 'ssh-keys-user').
Also, some of the functions in the ssh_keys task seem
like they could be useful in general.
This patch refactors them into misc.py.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewd-by: Sam Lang <sam.lang@inktank.com>
2013-04-04 15:55:54 -07:00
Sage Weil
939223aa4f workunit: sudo rm -rf ...
Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-03 22:01:01 -07:00
Warren Usui
ba536a8353 Merge branch 'wip-teuthologyvminstall-wusui' 2013-04-03 12:34:01 -07:00
Warren Usui
8129bffb17 Implement full reinstallation of a VM system.
Downburst create is used to reinstall a VM when it is locked.
Downburst destroy is used to remove a VM when it is unlocked.
Host keys are regenerated on each vm instantiation, so the keys
need to be checked prior to use.
If needed, qa-ceph-chef is run on newly installed systems to insure that
they are fully functional.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-04-03 12:29:47 -07:00
Sage Weil
cf4bf09b2c ceph.conf: lower mon disk avail warning threshold
Only wanr when we hit 90% instead of default 70%

Signed-off-by: Sage Weil <sage@inktank.com>
2013-04-03 10:03:09 -07:00
caleb miles
982e1ed18f radosgw-admin: Add test of duplicate user email specification.
Signed-off-by: caleb miles <caleb.miles@inktank.com>
2013-04-03 08:12:56 -07:00
caleb miles
fb34c38332 radosgw-admin: Test subuser mask durability when creating new key.
Signed-off-by: caleb miles <caleb.miles@inktank.com>
2013-04-03 06:04:01 -07:00
caleb miles
7b3973fff2 radosgw-admin: cluster info -> zone info
Signed-off-by caleb.miles <caleb.miles@inktank.com>
2013-04-01 20:46:30 -07:00
Samuel Just
d81babffe5 repair_test: add test for repairing read errs and truncations
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
2013-04-01 16:38:33 -07:00
Josh Durgin
2a1cdda90d locker: try to make up for apache timeouts
If the lock request succeeds in updating the db, but the client gets a
timeout from apache, they can now try again and get back the machines
they just locked.

Only automatic runs have a description set when locking several
machines, so this does not affect users of teuthology-lock
--lock-many, where no description can be set in the same request.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-03-29 16:34:15 -07:00
Sage Weil
aeb1bbe414 do not archive on pass if 'archive-on-error: True'
Optional flag makes us suck down the archive (mostly, the logs, which
might be huge for some debugging tests) unless the test has failed.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-29 14:27:20 -07:00
Sage Weil
a40b850eb3 locker: log desc too
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-29 14:27:13 -07:00
Sage Weil
9f46f47b6b run: clean up machine_type thing
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-29 12:19:05 -07:00
Sage Weil
e8afa454d8 ceph_manager: retry set_pool_property on EAGAIN
Retry indefinitely, for now.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-28 15:25:10 -07:00
Sage Weil
b815268b58 run: machine-type: foo, not machine_type: foo
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-28 15:25:10 -07:00
Sam Lang
6fd7ebd44d task/mds_thrash: Log mds dump after long delay
In cases where the mds thrasher continuously loops
waiting for an mds to be removed from the map, or
for a new mds to become active, we want to start logging
the mds state for debugging.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-27 08:48:45 -05:00
Sage Weil
bc54a8bfaa locker: make desc optional
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-26 13:27:53 -07:00
Sage Weil
07e324ae65 ceph.conf: osd debug op order = true
Debug the osd op ordering by default.  Most of the runs have a small number
of clients, which makes the STL maps cheap.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-26 12:21:13 -07:00
Sage Weil
8eabe1bb4a locker/api: fix DELETE
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-26 11:40:13 -07:00
Sage Weil
0b72c8dbd2 lock: pass desc to lock operation; leave on unlock
Pass the desc to the lock operation.

The unlock operation now clears desc for us; no need to do it outselves.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-25 16:47:02 -07:00
Sage Weil
56820b3bad locker: set desc on lock
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-25 16:47:02 -07:00
Sage Weil
6208af93ba locker: clear desc on unlock
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-25 16:47:02 -07:00
Samuel Just
c50b143e92 thrashosds: add test_backfill_full
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
2013-03-25 15:39:12 -07:00
Samuel Just
97a5c05141 thrashosds.py: fix line length
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
2013-03-25 15:39:11 -07:00
Josh Durgin
e594fcb30f locker: log updates
Note whenever locks are acquired/released, or a machine's description is updated.
Under apache, these will go to error.log.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-03-25 15:42:08 -07:00
Sage Weil
5d3fed47f7 ceph-deploy: purge /var/lib/ceph data on finish
The install task does this now that the package doesn't; we
need to too.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-24 15:12:59 -07:00
Sage Weil
4e68c2033c verify /var/lib/ceph not present on start
Verify there is no /var/lib/ceph, just like we do with the cephtest
directory.  We will need to change this (or make it optional) when we
allow runs against an existing cluster, but then a whole bunch of other
things will need to change then as well.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-23 20:58:46 -07:00
Sage Weil
466fede77b install: need sudo when purging /var/lib/ceph
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-23 20:53:51 -07:00
Sage Weil
4a6e3b97e3 install, nuke: explicitly purge /var/lib/ceph
The packages won't do this anymore.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-22 15:22:38 -07:00
Joe Buck
2ed9f648a8 teuthology: cleanup client dirs for workunit task
This patch corrects an issue where a workunit task is
not cleaning up generated directories
if the 'all' key is used to specify clients.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
2013-03-22 13:02:32 -07:00
Sage Weil
4d28a65798 stop ignoring osd leaks
Note that the mds is the only one left that we are ignoring.
2013-03-21 20:40:48 -07:00
tamil
8f27fa6194 moving client.keyring creation out of ceph task
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-21 16:14:54 -07:00
Dan Mick
e7cd598e60 lock: make do_summary() respect --machine-type
Signed-off-by: Dan Mick <dan.mick@inktank.com>
2013-03-20 18:31:11 -07:00
tamil
8ff32eb630 stop and restart daemons as restart only starts.
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-20 17:40:46 -07:00
Sam Lang
fabf36d2b6 task/ceph: Revert extra check for running status
Don't use exit status info to track daemon state.  We need to find
a better way to do this for the restart task.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-19 15:46:34 -05:00
Sage Weil
66e27fa589 suite: shorten subject a bit
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-19 12:12:56 -07:00
David Zafman
4e1737b914 Merge branch 'wip-4197' 2013-03-19 11:30:41 -07:00
David Zafman
4d7e1e9188 osd: data loss: low space handling
Automated test cases for feature #4197

Signed-off-by: David Zafman <david.zafman@inktank.com>
Reported-by: Sam Just <sam.just@inktank.com>
2013-03-19 11:27:00 -07:00
Warren Usui
511f04f2a9 Fixed so that installation works on a brand new CentOS system.
Do yum install rather than yum reinstall for CentOS.
When exiting CentOS, yum erase the ceph-release rpm.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-03-19 10:03:33 -07:00
Sam Lang
1b0369bd95 task/restart: Handle error from script correctly
The exitstatus on the process is a gevent.AsyncResult
(not an int).  Use the try/except pattern for handling
errors instead.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-19 08:10:30 -05:00
tamil
23557907d2 s/dist-upgrade/upgrade
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-18 16:29:18 -07:00
Warren Usui
c55f12887b Fixed ceph-fuse mount point cleanup bug
Tested for the existence of /sys/fs/fuse/connections/*/abort
before clobbering it.  This problem was generated when all
the machines were virtual CentOS machines.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-03-18 13:35:58 -07:00
Sam Lang
42e98497c0 task/restart: Cleanup in finally
Need to cleanup the files created for this test from
the testdir.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-18 11:28:51 -05:00
Sam Lang
cd98efe591 task/restart: Fix check for done
The last command a restart script outputs is 'done'
indicating the script does not require being restarted
further.  Handle this case properly.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-18 11:27:11 -05:00
Sam Lang
6fe1deadbf task/restart: Restart task for testing daemon kill
The ceph daemons support being killed at a specific code point
with a config option.  In some cases, we want to test a kill point
only once for a given daemon run (such as replay that only occurs
during daemon startup).  This task allows running a script or executable
and (when the script sends a command to the task) restarting it with
a temporary config that has the appropriate kill point set.  Once
the daemon asserts and gets restarted, the original config is used.

Adds a specific restart_with_args() method to the DaemonState in the
ceph task.

Right now this task follows the workunit task closely, but uses stdout/stdin
to specify when to restart a daemon.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-18 11:16:01 -05:00
tamil
9e81ff58e5 added ceph_health check and a few log messages
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-15 15:50:52 -07:00
Warren Usui
4300f42689 Fixed 'clock:' on Centos
ntpdc commands were formerly returning -127 on CentOS

Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
2013-03-15 10:15:43 -07:00
tamil
fa1faa1f95 added install.upgrade task
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-14 18:26:03 -07:00
tamil
aaf02ab05d added task for ceph.restart
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-14 18:18:39 -07:00
Sage Weil
85a8baae38 install: do debs for 'Debian'
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-14 16:27:01 -07:00
Sage Weil
46a78afabc install: el6 -> rpm
Slightly more general.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-14 16:26:25 -07:00
Warren Usui
0c75c6b1f7 Added el6 install functionality for CentOS systems.
install_packages, remove_packages and remove_sources are now the
installation and removal functions used by teuthology.  Debian
references have been removed outside of tasks/install.py.  CentOS
functionality parallel to Debian have been added to tasks/install.py,
and el6 references have been added to nuke.py, task/ceph-fuse.y and
task/install.py.

Some files created by CentOS are removed with rm -fr.  This should
be changed once the installation/removal rpm procedure is implemented.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
2013-03-14 16:25:18 -07:00
Warren Usui
01a40cfbf1 Use service instead of initctl to restart rsyslog.
This change is needed to make sure teuthology works on CentOS when the
-a option is specified.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-03-13 18:37:25 -07:00
Sage Weil
6dfea4bbed Merge remote-tracking branch 'gh/wip-powercycle-fixes' 2013-03-13 18:09:06 -07:00
Sage Weil
67f0aa11fe ceph.conf: auth debug = true
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-13 13:39:54 -07:00
Sage Weil
740fb85aaa ceph-deploy: uninstall even when no archive
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-13 09:50:09 -07:00
Sam Lang
13d0d5ab96 task/ceph_manager: Only reconnect if powercycled
We don't need to reconnect for remotes
that haven't been powercycled.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-13 11:15:51 -05:00
Sam Lang
6be6f6c607 task/thrashosds: Ipmi checking/setup in thrashosds
We don't need to setup the ipmi console on runs that
don't use powercycling, so delay setup of the RemoteConsole
with ipmi to the thrashosd task and only then if the powercycle
config is set.  This avoids spurious test failures from flaky
ipmi.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-13 10:19:48 -05:00
Sam Lang
8791b3740c task/ceph_manager: Check that ipmi is enabled
If powercycling was requested for the osd thrasher
we should ensure that we are able to reach the
ipmi console.  This helps us avoid weird errors.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-12 22:21:12 -05:00
Sam Lang
5ef2a04137 task/ceph: Recreate /var/run/ceph on powercycle
If a node is powecycled during testing, we need to
recreate /var/run/ceph once the node restarts.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-12 22:21:01 -05:00
Sage Weil
fd1e083c61 lock: tolerate description of None
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-12 19:50:07 -07:00
Joe Buck
6511950847 teuthology: update hadoop task for new code layout
Update the teuthology Hadoop task to use
the new code layout.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
2013-03-12 17:45:51 -07:00
tamil
bdd1feca0d increasing the default memory of vm to 4 MB
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-12 12:12:58 -07:00
Sage Weil
a5e1ed286e ceph-deploy: no need to call chef explicitly
The chef task does this.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-11 16:22:02 -07:00
Sage Weil
e9c26063ee ceph-deploy: use ceph.com git mirror
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-11 16:21:47 -07:00
Sage Weil
c6a427be55 ceph: sudo grep /var/log/ceph/ceph.log
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-10 17:08:55 -07:00
Sage Weil
d9a233c9bc mds_thrash: requery mds status inside the wait loop
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-09 22:17:15 -08:00
Sage Weil
f1d66d2dd5 schedule_suite.sh: check, but do not sync, clock for each run
Trust NTP to actually sync things.  But check the clock skew so we can
verify it if something seems off.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-09 21:34:24 -08:00
Sage Weil
5df43e64ab mpi: substitute $TESTDIR
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-09 21:20:43 -08:00
Sage Weil
b4d2234376 exec/pexec: substitute $TESTDIR into command
It might be better to make this unescaped, but that's trickier.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-09 18:07:06 -08:00
Sage Weil
36b877add3 Revert "ceph.conf: osd debug op order = true"
This reverts commit 4f68e3ee12.

Until we fix 4392... the kernel is sending ops out of order.
2013-03-08 09:33:51 -08:00
Samuel Just
e7c079f790 rados.py: add option to do many short runs
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-03-07 18:06:55 -08:00
tamil
09e14bf474 ceph-deploy task
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-07 17:56:29 -08:00
tamil
e0ad6faa5e added 'extras' to install extra packages excluding ceph
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-07 17:43:40 -08:00
Josh Durgin
cef6e5d5e9 lock: remove description when unlocking
This will hopefully make it easier to track down what's leaving nodes unlocked.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-03-07 09:40:29 -08:00
Sage Weil
4f68e3ee12 ceph.conf: osd debug op order = true
Debug the osd op ordering by default.  Most of the runs have a small number
of clients, which makes the STL maps cheap.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-06 21:35:41 -08:00
Joe Buck
da1e63bddd testing: fix logic in determining Hadoop master
Signed-off-by: Joe Buck <jbbuck@gmail.com>
2013-03-05 18:48:09 -08:00
Joe Buck
6b0a8d3db7 testing: specify ceph.conf path in core-site.xml
Signed-off-by: Joe Buck <jbbuck@gmail.com>
2013-03-05 18:47:55 -08:00
Sage Weil
1660d896b0 install: install libcephfs-java
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-05 15:12:02 -08:00
Sage Weil
daad53fc4f ceph-fuse: install via install.py
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-05 14:14:12 -08:00
Warren Usui
09979541ca Implement email task.
Email.py was added so that the emailto attribute could be passed,
and to prevent 'module object has no attribute: email' errors from
happening.  Run.py actual performs the email operation and calls
suite.email_results to do the actual send mail operation.  The
information passed right now is the summary and config information.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-02-27 12:28:59 -08:00
Warren Usui
c5b55f9b76 Fix pass/fail display on exit.
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-02-27 12:28:59 -08:00
Sage Weil
1e6e8aeb7f ceph-fuse: fix install_debs call with new syntax
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-27 10:40:50 -08:00
Sage Weil
c0c33834cf install: poll for packages if wait-for-package: true
If the yaml has

 wait-for-package: true

then block and poll for the packages to appear if they are not already
there.  This is only useful for new branches or explicit sha1's, obviously.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-26 21:02:14 -08:00
Sage Weil
454e161829 install: fix branch/tag/sha1 selection
We were only allowing selection of packages by branch!

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-26 21:01:11 -08:00
Sage Weil
d908c30d60 ceph_manager: use an exception type
Otherwise this obscures the error

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-26 16:47:39 -08:00
Joe Buck
e00bff2278 teuthology: remove CEPH_JAVA_PATH
Since teuthology now uses debian packages,
we do not need to set this in workunit.
The one test that uses this now tests for
it locally.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
2013-02-26 10:18:25 -08:00
Joe Buck
9af61cb617 teuthology: add an extra_packages flag to install
Some tests require additional packages
(e.g., java bindings, hadoop bindings).
Extend the install task to allow for those
packages to be specified in the yaml files.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
2013-02-26 10:15:45 -08:00
Sage Weil
51fa5fb2c1 nuke: blow away /home/ubuntu/cephtest too
(along with /tmp/cephtest)

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-25 19:15:15 -08:00
Warren Usui
3ebabb346f Add timer.py and display summary info in run.py.
Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-02-25 19:15:15 -08:00
caleb miles
b744f42361 radosgw-admin: Bug fixes for issue 4251.
Fixes errors introduced by d44fb14762.

Signed-off-by caleb.miles <caleb.miles@inktank.com>
2013-02-25 19:14:21 -08:00
Sage Weil
24eeb0da95 rbd: drop udev
The packages have these rules now.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-25 15:12:56 -06:00
Sage Weil
d7337364a3 exec, pexec: set TESTDIR
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-25 15:12:56 -06:00
Sage Weil
31ef4a1276 ceph.conf: debug ms = 1 for monitors
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-25 15:12:56 -06:00
Sage Weil
a60e7b599a rbd: set TESTDIR when running xfstests
[elder@inktank.com: fixed typo]

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-25 15:12:41 -06:00
Sage Weil
d8021a1aa0 nuke: sudo for killall
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-22 10:51:51 -08:00
Joao Eduardo Luis
045a86680d task: ceph: create monitor data directories prior to --mkfs
The new monitor store does not create the data directory on --mkfs. We
must create it instead, much like what happens with the osds.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-02-22 01:04:06 +00:00
Joao Eduardo Luis
c1b75c6b0c task: mon_thrash: Thrash multiple monitors and 'maintain-quorum' option
We now add a new option 'thrash-many' that by being set to true will break
the default behaviour of killing only one monitor at a time.  Instead,
this option will select up to the maximum number of killable monitors to
kill in each round.

We also add a new 'maintain-quorum' option that will limit the amount of
monitors that can be killed in each thrashing round.  If set to true, this
option will limit the amount of killable monitors up to (n/2-1).  This
means that if we are running a configuration that only has up to two
configured monitors, if 'maintain-quorum' is set to true, this task won't
run as there are no killable monitors -- in such a scenario, this option
should be set to false.

Furthermore, if 'store-thrash' is set to true, then 'maintain-quorum' must
also be set to true, as we cannot let the task to thrash all the monitor
stores, or we wouldn't be able to sync from other monitors, nor can we
let quorum be dropped, or we won't be able to resync our way into quorum.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-02-22 00:44:49 +00:00
Joao Eduardo Luis
d28bb05a65 task: mon_thrash: Add 'seed' and 'store-thrash' options
This patch introduces an option to thrash a monitor store when we thrash
the monitors, as well as a 'store-thrash-probability' option (defaulting
to 50%).

We also took this opportunity to introduce a new 'seed' option, that ought
to allow a given run of this task to be reproducible.  This might come in
hand when attempting to reproduce a given behavior that would otherwise
be randomly triggered.

You should note that while the 'seed' option will indeed mimic past
behaviors, this only applies to a past behavior of this task: other tasks
are not affected by this value, nor are any workunits or even ceph daemons.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
2013-02-22 00:44:25 +00:00
Sage Weil
278be217b6 ceph.conf: log file, not log dir
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-21 16:21:33 -08:00
Sage Weil
c85ba56397 install: ignore apt-get update failures on package removal
At that point we don't really care.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-21 15:08:22 -08:00
Sage Weil
17be13b2b4 ceph: fix log, /var/run/ceph stupid
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-21 15:08:14 -08:00
Josh Durgin
a862d8bf77 Fix unused vars, unused imports, and aliasing
Found by pyflakes

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-02-21 14:47:00 -08:00
Josh Durgin
23669be9af radosgw-admin: fix errors found by pyflakes
Using the wrong variable, and true/false are capitalized in python.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-02-21 14:37:46 -08:00
Sage Weil
8b0eef29c5 ceph_manager: fix asok string formatting
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-21 12:52:30 -08:00
Sage Weil
5c0a2f40de ceph: make /var/run/ceph writeable by non-root too
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-21 12:45:26 -08:00
Sage Weil
1d62baf107 ceph: fix /var/log/ceph chown/cmod typo
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-21 12:35:55 -08:00
Sage Weil
e1573ea181 ceph.conf: use default locations for admin socket; fix client log file
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-21 11:35:36 -08:00
Sage Weil
2574d87562 make /var/log/ceph writeable by non-root; make clients log to it
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-21 11:05:30 -08:00
Sage Weil
4431e1efab install: be slightly more efficient
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-20 21:49:22 -08:00
Sage Weil
00986ce080 install: be more careful about package removal
- call apt separately for each package; it will error out annoyingly if
there is one in the list not in the APT sources.
- use dpkg with appropriate force to clean up broken half-installs.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-20 21:46:37 -08:00
Sage Weil
7d8a72a379 ceph-fuse: install -dbg package, too
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-20 21:36:47 -08:00
Sage Weil
c64cd20103 radosgw-admin: fix sleep syntax error
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-20 21:02:16 -08:00
Sage Weil
ecb563b221 install: install libcephfs1[-dbg]
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-20 16:27:52 -08:00
Sage Weil
28f11d037f ceph_manager: drop extra line
Ooops, from 0c990f3add.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-20 16:21:42 -08:00
Sage Weil
9996bdbe6e run: print pass/FAIL as final line
Makes it easy to tell at a glance if your last test passed or not.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-20 15:27:23 -08:00
Yehuda Sadeh
6c2f4a2bbe Merge remote-tracking branch 'origin/wip-3634' 2013-02-20 14:10:50 -08:00
Sage Weil
0c990f3add ceph_manager: eventually time out while waiting for admin socket
Otherwise we get stuck in a loop if an osd crashes unexpectedly, the
task never fails, and we don't collect all the evidence.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-19 22:24:27 -08:00
Sage Weil
9617dcc6b2 Merge branch 'unstable'
Conflicts:
	teuthology/task/workunit.py
2013-02-19 21:04:24 -08:00
Sage Weil
1a0e201d65 ceph: fix valgrind log check
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 16:43:04 -08:00
Sage Weil
2653b5a07b install: clean up flavor, distro, arch detection
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 15:40:43 -08:00
Joe Buck
84d7f37fdb testing: export TESTDIR in workunit task.
Some command-line tools need to reference the path
to the test directory, which is created at run-time.
We export this as TESTDIR

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by:  Sam Lang <sam.lang@inktank.com>
2013-02-18 15:22:52 -08:00
Sage Weil
3f7c9bcaa4 move the install to a separate task.
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 15:06:52 -08:00
Sage Weil
176a3408c9 testrados -> ceph_test_rados
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:20 -08:00
Sage Weil
4869b494c9 lock: allow filtering by description, description substring
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:06 -08:00
Sage Weil
2a344c1079 rgw: sudo
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:06 -08:00
Sage Weil
ed82d87566 fix a few archive/log stragglers
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:06 -08:00
Sage Weil
38b30c646f ceph: make gitbuilder host configurable
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:06 -08:00
Sage Weil
b1c9864a7a ceph: install -dbg packages, too
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:06 -08:00
Sage Weil
d8b6bf94ec ceph: create /var/run/ceph
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:05 -08:00
Sage Weil
969a6def3b ceph-fuse: sudo
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:05 -08:00
Sage Weil
4eb047b4ea sudo for admin socket commands
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:05 -08:00
Sage Weil
0ad55b3604 cfuse -> ceph-fuse 2013-02-18 13:39:05 -08:00
Sage Weil
88ebc27f16 ceph: store logs in normal location
We need to switch around how these are compressed and pulled, since they
aren't in the regular archive dir anymore.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:05 -08:00
Sage Weil
d1d36241b7 ceph: use default data, keyring locations
This required reordering the cluster setup so that we do the ceph-osd
--mkfs --mkkey prior to gathering keys and initializing the monitors.

Also, run daemons as root.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:05 -08:00
Sage Weil
7a61b237d6 ceph: don't uninstall librados, librbd
This forces uninstall of kvm too, which is expensive.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:05 -08:00
Sage Weil
bc9ecf7813 ceph: pass package version to apt-get install
This avoids problems when a different or newer version of the package is
already installed.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:05 -08:00
Sage Weil
c9d3dea320 avoid secretfile, except for kclient
Only mount.ceph needs the secret in a standalone file.  Remove other users,
and simplify that one.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:05 -08:00
Sage Weil
45ddbe636e rgw: specify keyring location
Otherwise we look at the default /var/lib/ceph/radosgw/ceph-$id/keyring.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:05 -08:00
Sage Weil
a54200d444 nuke: tolerate failed dpkg --configure -a/apt-get -f install
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:05 -08:00
Sage Weil
4dc6c8b275 install radosgw
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:05 -08:00
Sage Weil
333c7260bc rbd: remove merge cruft
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:05 -08:00
Sage Weil
8b971611f2 ceph: simplify apt-key management
Run apt-key as root. No need to initialize ubuntu user's gpg.  Fix
whitespace.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:04 -08:00
Sage Weil
eadefec8ff ceph: put client keyrings in /etc/ceph/ceph.$name.keyring
And make it world readable, for ubuntu's sake.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:04 -08:00
Sage Weil
149be93639 nuke: dpkg --configure -a and apt-get -f install
Installing debs means we are more likely to hit a case where we interrupt
apt/dpkg.  Try to mop up as best we can in nuke.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:04 -08:00
Sage Weil
3400ea39ba nuke: whitespace
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:04 -08:00
Sage Weil
27fec2377f ceph: simpilfy package removal
apt-get doesn't have a nice way to tell if the package is not install and
we don't need to purge it.  Well, not one I found in 5 minutes.  Just
do a big purge and assume it works, or failed because there was nothing to
be done.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:04 -08:00
Sage Weil
28116db6a0 nuke: remove librados, librbd
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:04 -08:00
Sage Weil
a529bb725f ceph: install ceph-mds, ceph-common
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:04 -08:00
Sage Weil
5235fc18a0 ceph: fix purge
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:39:03 -08:00
Sander Pool
c525e1061b Install ceph debs and use installed debs
The ceph task installs ceph using the debian
packages now, and all invocations of binaries installed
in {tmpdir}/binary/usr/local/bin/ are replace with
the use of the binaries installed in standard locations
by the debs.

Author:    Sander Pool <sander.pool@inktank.com>
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-02-18 13:39:03 -08:00
Sage Weil
d790eeb451 nuke: testrados -> ceph_test_rados
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-18 13:38:54 -08:00
Sage Weil
7a5fd05edd misc: replace : with - in testdir name
The :'s break the list in $PATH.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-17 22:13:45 -08:00
Sage Weil
9513f2f206 rbd_fsx: binary name now has ceph_ prefix
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-15 09:12:25 -08:00
Sage Weil
5d6d6884fe rados: testrados -> ceph_test_rados
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-13 14:10:33 -08:00
Sage Weil
6e3c2d93fb peer: add recovery delay to make test behave
Otherwise it was (very) racy!
2013-02-11 06:59:17 -08:00
Sandon Van Ness
a56eb88c16 Merge to include --machine-type and changes to --summary
Added the ability to support multiple types of machines with
--machine-type added to teuthology-lock when used with --lock-many
or --machine-type with teuthology --lock (automated tests). It
defaults to 'plana' and the 'vps' type is currently unused but
should be in the future.

Also updated teutholoy-lock --summary to be machine type aware
and sort things in a nice output.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-02-07 16:34:14 -08:00
Sandon Van Ness
75d86e47fd Made teuthology-lock --summary machine type aware.
Signed-off-by: Sandon Van Ness <sandon@van-ness.com>
2013-02-07 16:06:21 -08:00
Sandon Van Ness
030bc7c23d Added support for multiple types of machines.
Added the ability to support multiple types of machines with
--machine-type added to teuthology-lock when used with --lock-many
or --machine-type with teuthology --lock (automated tests). It
defaults to 'plana' and the 'vps' type is currently unused but
should be in the future.

Signed-off-by: Sandon Van Ness <sandon@van-ness.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
2013-02-07 13:26:37 -08:00
Sage Weil
ed9103aad5 rgw: parse testdir into apache.conf
Also fix up the template to use {{field}} for stuff we don't want to parse.
There is probably a better way...

Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-06 22:02:10 -08:00
Sage Weil
67bbb9c77b osd_recovery: add missing testdir arg 2013-02-06 21:44:10 -08:00
Sage Weil
561ea14c6e ceph_manager: take int or string to osd_admin_socket
This fixes a failure on dump_stuck.
2013-02-06 17:14:24 -08:00
Sage Weil
3fbb552240 radosbench: fix missing format value
tdir is substituted in at the end.  There is probably a better way to do
this.
2013-02-06 09:06:35 -08:00
Sage Weil
936f314a63 rgw: fix testdir format on f
Format the path, not filehandle
2013-02-06 09:04:37 -08:00
Josh Durgin
ed3c3615c3 nuke: don't try unmount if we're rebooting everything anyway
This can cause issues when unmount hangs. Our automatic runs reboot
everything unconditionally, so this caused a bunch of unecessary hangs
when an fs was accidentally rendered un-unmountable.
2013-02-05 23:31:39 -08:00
Josh Durgin
c6504bab9a nuke: make tmpfs check only umount tmpfs
This would catch things like /tmp/cephtest/mnt.client.0, which are
used by cfuse, rbd, and kclient.
2013-02-05 23:28:12 -08:00
Sage Weil
82273e951b rbd: fix rbd image unmount
The testdir param was missing.  Avoid this class of errors by unmounting
exactly what we mounted.
2013-02-05 23:19:23 -08:00
Sage Weil
6099045990 rbd: set env before running sudo
Signed-off-by: Sage Weil <sage@inktank.com>
2013-02-05 23:01:25 -08:00
Sam Lang
100e9056ed misc: Close connections on reboot
When nodes are rebooted, the connections remain open
even after calling reconnect and setting up new ssh
sessions to the rebooted nodes.  This causes ECONNRESET
errors to show up in the teuthology output.

Close the existing connections before trying to reconnect.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-02-05 16:20:52 -06:00
Sam Lang
da10b58d65 task/ceph_manager: Fix NoneType config issue
kill_mon is getting a config set to None, which blows
up now due to the check for powercycle.  Initialize
the config to an empty dict if we don't get anything
on init.  This is the error showing up in teuthology:

2013-02-04T15:04:16.595 ERROR:teuthology.run_tasks:Manager failed: <contextlib.GeneratorContextManager object at 0x1fcafd0>
Traceback (most recent call last):
  File "/var/lib/teuthworker/teuthology-master/teuthology/run_tasks.py", line 45, in run_tasks
    suppress = manager.__exit__(*exc_info)
  File "/usr/lib/python2.7/contextlib.py", line 24, in __exit__
    self.gen.next()
  File "/var/lib/teuthworker/teuthology-master/teuthology/task/mon_thrash.py", line 142, in task
    thrash_proc.do_join()
  File "/var/lib/teuthworker/teuthology-master/teuthology/task/mon_thrash.py", line 69, in do_join
    self.thread.get()
  File "/var/lib/teuthworker/teuthology-master/virtualenv/local/lib/python2.7/site-packages/gevent/greenlet.py", line 308, in get
    raise self._exception
AttributeError: 'NoneType' object has no attribute 'get'

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-02-05 10:38:48 -06:00
Josh Durgin
2f41f81dfa misc: don't use colon in default run name
LD_LIBRARY_PATH does not work with colons (and backslash does not escape them.)
2013-02-04 10:39:40 -08:00
Sam Lang
55c1bcf6b0 Add testdir param to get_valgrind_args() calls
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-02-03 22:09:15 -06:00
Sam Lang
a5ba4f6a94 Merge branch 'wip-misc-fixes'
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-02-03 11:38:29 -06:00
Sam Lang
887e93e7e5 nuke.py: Allow name of job/run to be specified
Nuke will cleanup the base test directory by default, but can
cleanup the test directory for a given run if specified.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-02-03 11:09:49 -06:00
Sam Lang
46d3ff94f5 run.py: Add target name to logging info
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-02-03 11:09:04 -06:00
Sage Weil
ada803db0f rbd: fix .format() call with {1} syntax
IndexError: tuple index out of range
2013-02-03 08:18:52 -08:00
Sage Weil
fe9fb49e27 ceph_manager: use get() for self.config powercycle checks
I think this is what is going on...

Traceback (most recent call last):
  File "/var/lib/teuthworker/teuthology-master/teuthology/contextutil.py", line 27, in nested
    yield vars
  File "/var/lib/teuthworker/teuthology-master/teuthology/task/ceph.py", line 1158, in task
    yield
  File "/var/lib/teuthworker/teuthology-master/teuthology/run_tasks.py", line 25, in run_tasks
    manager = _run_one_task(taskname, ctx=ctx, config=config)
  File "/var/lib/teuthworker/teuthology-master/teuthology/run_tasks.py", line 14, in _run_one_task
    return fn(**kwargs)
  File "/var/lib/teuthworker/teuthology-master/teuthology/task/dump_stuck.py", line 93, in task
    manager.kill_osd(id_)
  File "/var/lib/teuthworker/teuthology-master/teuthology/task/ceph_manager.py", line 665, in kill_osd
    if 'powercycle' in self.config and self.config['powercycle']:
TypeError: argument of type 'NoneType' is not iterable
2013-02-02 21:01:08 -08:00
Sam Lang
7280980f34 Fixup latest commits that use /tmp/cephtest.
Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-02-02 11:00:17 -06:00
Sam Lang
d9fff40f6b task/chdir-coredump: Use readlink -e
realpath isn't available everywhere, use readlink -e instead.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-02-01 16:07:29 -06:00