Commit Graph

901 Commits

Author SHA1 Message Date
Sam Lang
6fd7ebd44d task/mds_thrash: Log mds dump after long delay
In cases where the mds thrasher continuously loops
waiting for an mds to be removed from the map, or
for a new mds to become active, we want to start logging
the mds state for debugging.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-27 08:48:45 -05:00
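
The behavior described in this commit can be sketched as a polling loop that stays quiet at first and starts logging state once the wait has run long. This is a minimal illustration, not the thrasher's actual code: `wait_until`, `dump_state`, `log_after` and `interval` are hypothetical names, and the clock and sleep functions are injectable only so the loop can be exercised without real delays.

```python
import logging
import time

log = logging.getLogger(__name__)


def wait_until(condition, dump_state, log_after=60.0, interval=2.0,
               clock=time.monotonic, sleep=time.sleep):
    """Poll condition() until it is true and return the number of failed polls.

    Once log_after seconds have elapsed, dump_state() is logged on every
    further failed poll so a stuck wait leaves a trail for debugging.
    All names and defaults here are assumptions made for illustration.
    """
    start = clock()
    polls = 0
    while not condition():
        polls += 1
        waited = clock() - start
        if waited >= log_after:
            log.info("still waiting after %.0fs, mds dump: %s",
                     waited, dump_state())
        sleep(interval)
    return polls
```

A caller would pass a check such as "has the mds left the map" as `condition` and a function returning the current mds map as `dump_state`.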
Sage Weil
bc54a8bfaa locker: make desc optional
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-26 13:27:53 -07:00
Sage Weil
07e324ae65 ceph.conf: osd debug op order = true
Debug the osd op ordering by default.  Most of the runs have a small number
of clients, which makes the STL maps cheap.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-26 12:21:13 -07:00
Sage Weil
8eabe1bb4a locker/api: fix DELETE
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-26 11:40:13 -07:00
Sage Weil
0b72c8dbd2 lock: pass desc to lock operation; leave on unlock
Pass the desc to the lock operation.

The unlock operation now clears desc for us; no need to do it ourselves.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-25 16:47:02 -07:00
Sage Weil
56820b3bad locker: set desc on lock
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-25 16:47:02 -07:00
Sage Weil
6208af93ba locker: clear desc on unlock
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-25 16:47:02 -07:00
Samuel Just
c50b143e92 thrashosds: add test_backfill_full
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
2013-03-25 15:39:12 -07:00
Samuel Just
97a5c05141 thrashosds.py: fix line length
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
2013-03-25 15:39:11 -07:00
Josh Durgin
e594fcb30f locker: log updates
Note whenever locks are acquired/released, or a machine's description is updated.
Under apache, these will go to error.log.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
2013-03-25 15:42:08 -07:00
Sage Weil
5d3fed47f7 ceph-deploy: purge /var/lib/ceph data on finish
The install task does this now that the package doesn't; we
need to too.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-24 15:12:59 -07:00
Sage Weil
4e68c2033c verify /var/lib/ceph not present on start
Verify there is no /var/lib/ceph, just like we do with the cephtest
directory.  We will need to change this (or make it optional) when we
allow runs against an existing cluster, but a whole bunch of other
things will need to change as well.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-23 20:58:46 -07:00
Sage Weil
466fede77b install: need sudo when purging /var/lib/ceph
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-23 20:53:51 -07:00
Sage Weil
4a6e3b97e3 install, nuke: explicitly purge /var/lib/ceph
The packages won't do this anymore.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-22 15:22:38 -07:00
Joe Buck
2ed9f648a8 teuthology: cleanup client dirs for workunit task
This patch corrects an issue where a workunit task is
not cleaning up generated directories
if the 'all' key is used to specify clients.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
2013-03-22 13:02:32 -07:00
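
The kind of fix described in this commit can be illustrated by expanding the 'all' key to the full client list before cleanup. This is a sketch under assumptions, not the patch itself: `clients_to_clean` and both of its parameters are hypothetical names, and the config is assumed to map client roles (or 'all') to lists of workunits.

```python
def clients_to_clean(config_clients, cluster_clients):
    """Return the client roles whose generated directories need cleanup.

    config_clients: mapping of role (or the key 'all') to workunits.
    cluster_clients: every client role present in the run.
    Both shapes are assumptions made for this illustration.
    """
    if "all" in config_clients:
        # 'all' stands for every client, so every client dir was created.
        return sorted(cluster_clients)
    return sorted(config_clients)
```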
Sage Weil
4d28a65798 stop ignoring osd leaks
Note that the mds is the only one left that we are ignoring.
2013-03-21 20:40:48 -07:00
tamil
8f27fa6194 moving client.keyring creation out of ceph task
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-21 16:14:54 -07:00
Dan Mick
e7cd598e60 lock: make do_summary() respect --machine-type
Signed-off-by: Dan Mick <dan.mick@inktank.com>
2013-03-20 18:31:11 -07:00
tamil
8ff32eb630 stop and restart daemons as restart only starts.
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-20 17:40:46 -07:00
Sam Lang
fabf36d2b6 task/ceph: Revert extra check for running status
Don't use exit status info to track daemon state.  We need to find
a better way to do this for the restart task.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-19 15:46:34 -05:00
Sage Weil
66e27fa589 suite: shorten subject a bit
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-19 12:12:56 -07:00
David Zafman
4e1737b914 Merge branch 'wip-4197'
2013-03-19 11:30:41 -07:00
David Zafman
4d7e1e9188 osd: data loss: low space handling
Automated test cases for feature #4197

Signed-off-by: David Zafman <david.zafman@inktank.com>
Reported-by: Sam Just <sam.just@inktank.com>
2013-03-19 11:27:00 -07:00
Warren Usui
511f04f2a9 Fixed so that installation works on a brand new CentOS system.
Do yum install rather than yum reinstall for CentOS.
When exiting CentOS, yum erase the ceph-release rpm.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-03-19 10:03:33 -07:00
Sam Lang
1b0369bd95 task/restart: Handle error from script correctly
The exitstatus on the process is a gevent.AsyncResult
(not an int).  Use the try/except pattern for handling
errors instead.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-19 08:10:30 -05:00
tamil
23557907d2 s/dist-upgrade/upgrade
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-18 16:29:18 -07:00
Warren Usui
c55f12887b Fixed ceph-fuse mount point cleanup bug
Tested for the existence of /sys/fs/fuse/connections/*/abort
before clobbering it.  This problem was generated when all
the machines were virtual CentOS machines.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-03-18 13:35:58 -07:00
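
The existence check described in this commit might look like the sketch below. It assumes that writing to a connection's `abort` file is how the connection gets aborted, and it takes the glob pattern as a parameter so it can be tried against a scratch directory. `abort_fuse_connections` is a hypothetical name, not a teuthology function.

```python
import glob


def abort_fuse_connections(pattern="/sys/fs/fuse/connections/*/abort"):
    """Write to each existing abort file and return the paths written.

    glob.glob() only returns paths that exist, so when no FUSE
    connections are present the loop body never runs and nothing
    is clobbered.
    """
    paths = sorted(glob.glob(pattern))
    for path in paths:
        with open(path, "w") as f:
            f.write("1")
    return paths
```

On a real host this would need root privileges. On a machine with no FUSE connections it returns an empty list instead of failing.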
Sam Lang
42e98497c0 task/restart: Cleanup in finally
Need to cleanup the files created for this test from
the testdir.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-18 11:28:51 -05:00
Sam Lang
cd98efe591 task/restart: Fix check for done
The last command a restart script outputs is 'done'
indicating the script does not require being restarted
further.  Handle this case properly.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-18 11:27:11 -05:00
Sam Lang
6fe1deadbf task/restart: Restart task for testing daemon kill
The ceph daemons support being killed at a specific code point
with a config option.  In some cases, we want to test a kill point
only once for a given daemon run (such as replay that only occurs
during daemon startup).  This task allows running a script or executable
and (when the script sends a command to the task) restarting it with
a temporary config that has the appropriate kill point set.  Once
the daemon asserts and gets restarted, the original config is used.

Adds a specific restart_with_args() method to the DaemonState in the
ceph task.

Right now this task follows the workunit task closely, but uses stdout/stdin
to specify when to restart a daemon.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-18 11:16:01 -05:00
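
The stdout-driven control flow described in this commit can be sketched as a loop over the script's output lines. The only command the log actually documents is 'done'. The `restart <daemon> [args...]` line format, the function name `drive_restart_script` and the `restart_daemon` callback are assumptions made for illustration and are not the task's real protocol.

```python
def drive_restart_script(lines, restart_daemon):
    """Consume script output lines; return True once 'done' is seen.

    lines: iterable of text lines read from the script's stdout.
    restart_daemon: callback taking (daemon_name, extra_args), standing
    in for something like DaemonState.restart_with_args().
    The 'restart <daemon> [args...]' format is hypothetical.
    """
    for raw in lines:
        words = raw.split()
        if not words:
            continue  # ignore blank lines
        if words[0] == "done":
            return True  # script needs no further restarts
        if words[0] == "restart" and len(words) >= 2:
            restart_daemon(words[1], words[2:])
    return False  # output ended without 'done'
```

Keeping the parser separate from process handling means it can be exercised with a plain list of strings.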
tamil
9e81ff58e5 added ceph_health check and a few log messages
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-15 15:50:52 -07:00
Warren Usui
4300f42689 Fixed 'clock:' on Centos
ntpdc commands were formerly returning -127 on CentOS

Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
2013-03-15 10:15:43 -07:00
tamil
fa1faa1f95 added install.upgrade task
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-14 18:26:03 -07:00
tamil
aaf02ab05d added task for ceph.restart
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-14 18:18:39 -07:00
Sage Weil
85a8baae38 install: do debs for 'Debian'
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-14 16:27:01 -07:00
Sage Weil
46a78afabc install: el6 -> rpm
Slightly more general.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-14 16:26:25 -07:00
Warren Usui
0c75c6b1f7 Added el6 install functionality for CentOS systems.
install_packages, remove_packages and remove_sources are now the
installation and removal functions used by teuthology.  Debian
references have been removed outside of tasks/install.py.  CentOS
functionality parallel to Debian's has been added to tasks/install.py,
and el6 references have been added to nuke.py, task/ceph-fuse.py and
task/install.py.

Some files created by CentOS are removed with rm -fr.  This should
be changed once the installation/removal rpm procedure is implemented.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
2013-03-14 16:25:18 -07:00
Warren Usui
01a40cfbf1 Use service instead of initctl to restart rsyslog.
This change is needed to make sure teuthology works on CentOS when the
-a option is specified.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
2013-03-13 18:37:25 -07:00
Sage Weil
6dfea4bbed Merge remote-tracking branch 'gh/wip-powercycle-fixes'
2013-03-13 18:09:06 -07:00
Sage Weil
67f0aa11fe ceph.conf: auth debug = true
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-13 13:39:54 -07:00
Sage Weil
740fb85aaa ceph-deploy: uninstall even when no archive
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-13 09:50:09 -07:00
Sam Lang
13d0d5ab96 task/ceph_manager: Only reconnect if powercycled
We don't need to reconnect for remotes
that haven't been powercycled.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-13 11:15:51 -05:00
Sam Lang
6be6f6c607 task/thrashosds: Ipmi checking/setup in thrashosds
We don't need to setup the ipmi console on runs that
don't use powercycling, so delay setup of the RemoteConsole
with ipmi to the thrashosds task, and only when the powercycle
config is set.  This avoids spurious test failures from flaky
ipmi.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-13 10:19:48 -05:00
Sam Lang
8791b3740c task/ceph_manager: Check that ipmi is enabled
If powercycling was requested for the osd thrasher
we should ensure that we are able to reach the
ipmi console.  This helps us avoid weird errors.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-12 22:21:12 -05:00
Sam Lang
5ef2a04137 task/ceph: Recreate /var/run/ceph on powercycle
If a node is powercycled during testing, we need to
recreate /var/run/ceph once the node restarts.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
2013-03-12 22:21:01 -05:00
Sage Weil
fd1e083c61 lock: tolerate description of None
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-12 19:50:07 -07:00
Joe Buck
6511950847 teuthology: update hadoop task for new code layout
Update the teuthology Hadoop task to use
the new code layout.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
2013-03-12 17:45:51 -07:00
tamil
bdd1feca0d increasing the default memory of vm to 4 MB
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
2013-03-12 12:12:58 -07:00
Sage Weil
a5e1ed286e ceph-deploy: no need to call chef explicitly
The chef task does this.

Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-11 16:22:02 -07:00
Sage Weil
e9c26063ee ceph-deploy: use ceph.com git mirror
Signed-off-by: Sage Weil <sage@inktank.com>
2013-03-11 16:21:47 -07:00