ceph/teuthology/task/thrashosds.py

import contextlib
import logging
import ceph_manager
from teuthology import misc as teuthology


log = logging.getLogger(__name__)

@contextlib.contextmanager
def task(ctx, config):
    """
    "Thrash" the OSDs by randomly marking them out/down (and then back
    in) until the task is ended. This loops: every op_delay seconds it
    randomly chooses (even odds) to add or remove an OSD, subject to
    the min_in and min_out limits described below.

    All commands are run on the first monitor, and thrashing stops when
    the context manager exits (__exit__ is called).

    The config is optional, and is a dict containing some or all of:

    min_in: (default 2) the minimum number of OSDs to keep in the
       cluster

    min_out: (default 0) the minimum number of OSDs to keep out of the
       cluster

    op_delay: (5) the length of time to sleep between changing an
       OSD's status

    clean_interval: (60) the approximate length of time to loop before
       waiting until the cluster goes clean. (In reality this is used
       to probabilistically choose when to wait, and the method used
       makes it closer to -- but not identical to -- the half-life.)

    chance_down: (0) the probability that the thrasher will mark an
       OSD down rather than marking it out. (The thrasher will not
       consider that OSD out of the cluster, since presently an OSD
       wrongly marked down will mark itself back up again.) This value
       can be either an integer (e.g. 75) or a float probability (e.g.
       0.75).

    timeout: (360) the number of seconds to wait for the cluster
       to become clean before the task exits. If this doesn't happen,
       an exception will be raised.

    example:

    tasks:
    - ceph:
    - thrashosds:
        chance_down: 10
        op_delay: 3
        min_in: 1
        timeout: 600
    - interactive:
    """
    if config is None:
        # The config is documented as optional; fall back to an empty
        # dict so that config.get('timeout', ...) below cannot fail.
        config = {}
    log.info('Beginning thrashosds...')
    first_mon = teuthology.get_first_mon(ctx, config)
    # Exactly one remote is expected for the first monitor.
    (mon,) = ctx.cluster.only(first_mon).remotes.keys()
    manager = ceph_manager.CephManager(
        mon,
        logger=log.getChild('ceph_manager'),
        )
    thrash_proc = ceph_manager.Thrasher(
        manager,
        config,
        logger=log.getChild('thrasher')
        )
    try:
        yield
    finally:
        log.info('joining thrashosds')
        thrash_proc.do_join()
        manager.wait_till_clean(config.get('timeout', 360))