ceph/tasks/osd_backfill.py

"""
Osd backfill test
"""
import logging
import ceph_manager
import time
from teuthology import misc as teuthology


log = logging.getLogger(__name__)


def rados_start(ctx, remote, cmd):
    """
    Run a remote rados command (currently used to only write data)
    """
    log.info("rados %s" % ' '.join(cmd))
    testdir = teuthology.get_testdir(ctx)
    pre = [
        'adjust-ulimits',
        'ceph-coverage',
        '{tdir}/archive/coverage'.format(tdir=testdir),
        'rados',
        ];
    pre.extend(cmd)
    proc = remote.run(
        args=pre,
        wait=False,
        )
    return proc

def task(ctx, config):
    """
    Test backfill
    """
    if config is None:
        config = {}
    assert isinstance(config, dict), \
        'thrashosds task only accepts a dict for configuration'
    first_mon = teuthology.get_first_mon(ctx, config)
    (mon,) = ctx.cluster.only(first_mon).remotes.iterkeys()

    num_osds = teuthology.num_instances_of_type(ctx.cluster, 'osd')
    log.info('num_osds is %s' % num_osds)
    assert num_osds == 3

    manager = ceph_manager.CephManager(
        mon,
        ctx=ctx,
        logger=log.getChild('ceph_manager'),
        )

    while len(manager.get_osd_status()['up']) < 3:
        manager.sleep(10)
    manager.raw_cluster_cmd('tell', 'osd.0', 'flush_pg_stats')
    manager.raw_cluster_cmd('tell', 'osd.1', 'flush_pg_stats')
    manager.raw_cluster_cmd('tell', 'osd.2', 'flush_pg_stats')
    manager.wait_for_clean()

    # write some data
    p = rados_start(ctx, mon, ['-p', 'rbd', 'bench', '15', 'write', '-b', '4096',
                          '--no-cleanup'])
    err = p.wait()
    log.info('err is %d' % err)

    # mark osd.0 out to trigger a rebalance/backfill
    manager.mark_out_osd(0)

    # also mark it down to it won't be included in pg_temps
    manager.kill_osd(0)
    manager.mark_down_osd(0)

    # wait for everything to peer and be happy...
    manager.raw_cluster_cmd('tell', 'osd.1', 'flush_pg_stats')
    manager.raw_cluster_cmd('tell', 'osd.2', 'flush_pg_stats')
    manager.wait_for_recovery()

    # write some new data
    p = rados_start(ctx, mon, ['-p', 'rbd', 'bench', '30', 'write', '-b', '4096',
                          '--no-cleanup'])

    time.sleep(15)

    # blackhole + restart osd.1
    # this triggers a divergent backfill target
    manager.blackhole_kill_osd(1)
    time.sleep(2)
    manager.revive_osd(1)

    # wait for our writes to complete + succeed
    err = p.wait()
    log.info('err is %d' % err)

    # cluster must recover
    manager.raw_cluster_cmd('tell', 'osd.1', 'flush_pg_stats')
    manager.raw_cluster_cmd('tell', 'osd.2', 'flush_pg_stats')
    manager.wait_for_recovery()

    # re-add osd.0
    manager.revive_osd(0)
    manager.raw_cluster_cmd('tell', 'osd.1', 'flush_pg_stats')
    manager.raw_cluster_cmd('tell', 'osd.2', 'flush_pg_stats')
    manager.wait_for_clean()
Added docstrings, and improved some of the comments on several tasks. 2013-10-12 08:28:27 +00:00			`"""`
			`Osd backfill test`
			`"""`
add backfill task This does a basic test of backfill functionality, including a divergent log on a backfill target (#1983). 2012-02-01 00:25:53 +00:00			`import logging`
			`import ceph_manager`
			`import time`
			`from teuthology import misc as teuthology`


			`log = logging.getLogger(__name__)`


Replace /tmp/cephtest/ with configurable path Teuthology uses /tmp/cephtest/ as the scratch test directory for a run. This patch replaces /tmp/cephtest/ everywhere with a per-run directory: {basedir}/{rundir} where {basedir} is a directory configured in .teuthology.yaml (/tmp/cephtest if not specified), and {rundir} is the name of the run, as given in --name. If no name is specified, {user}-{timestamp} is used. To get the old behavior (/tmp/cephtest), set test_path: /tmp/cephtest in .teuthology.yaml. This change was modivated by #3782, which requires a test dir that survives across reboots, but also resolves #3767. Signed-off-by: Sam Lang <sam.lang@inktank.com> Reviewed-by: Josh Durgin <josh.durgin@inktank.com> 2013-01-23 20:37:39 +00:00			`def rados_start(ctx, remote, cmd):`
Added docstrings, and improved some of the comments on several tasks. 2013-10-12 08:28:27 +00:00			`"""`
			`Run a remote rados command (currently used to only write data)`
			`"""`
add backfill task This does a basic test of backfill functionality, including a divergent log on a backfill target (#1983). 2012-02-01 00:25:53 +00:00			`log.info("rados %s" % ' '.join(cmd))`
Replace /tmp/cephtest/ with configurable path Teuthology uses /tmp/cephtest/ as the scratch test directory for a run. This patch replaces /tmp/cephtest/ everywhere with a per-run directory: {basedir}/{rundir} where {basedir} is a directory configured in .teuthology.yaml (/tmp/cephtest if not specified), and {rundir} is the name of the run, as given in --name. If no name is specified, {user}-{timestamp} is used. To get the old behavior (/tmp/cephtest), set test_path: /tmp/cephtest in .teuthology.yaml. This change was modivated by #3782, which requires a test dir that survives across reboots, but also resolves #3767. Signed-off-by: Sam Lang <sam.lang@inktank.com> Reviewed-by: Josh Durgin <josh.durgin@inktank.com> 2013-01-23 20:37:39 +00:00			`testdir = teuthology.get_testdir(ctx)`
add backfill task This does a basic test of backfill functionality, including a divergent log on a backfill target (#1983). 2012-02-01 00:25:53 +00:00			`pre = [`
Helper scripts live in /usr/local/bin now! 2013-09-06 20:08:01 +00:00			`'adjust-ulimits',`
Install ceph debs and use installed debs The ceph task installs ceph using the debian packages now, and all invocations of binaries installed in {tmpdir}/binary/usr/local/bin/ are replace with the use of the binaries installed in standard locations by the debs. Author: Sander Pool <sander.pool@inktank.com> Signed-off-by: Sam Lang <sam.lang@inktank.com> 2013-02-06 19:16:52 +00:00			`'ceph-coverage',`
Replace /tmp/cephtest/ with configurable path Teuthology uses /tmp/cephtest/ as the scratch test directory for a run. This patch replaces /tmp/cephtest/ everywhere with a per-run directory: {basedir}/{rundir} where {basedir} is a directory configured in .teuthology.yaml (/tmp/cephtest if not specified), and {rundir} is the name of the run, as given in --name. If no name is specified, {user}-{timestamp} is used. To get the old behavior (/tmp/cephtest), set test_path: /tmp/cephtest in .teuthology.yaml. This change was modivated by #3782, which requires a test dir that survives across reboots, but also resolves #3767. Signed-off-by: Sam Lang <sam.lang@inktank.com> Reviewed-by: Josh Durgin <josh.durgin@inktank.com> 2013-01-23 20:37:39 +00:00			`'{tdir}/archive/coverage'.format(tdir=testdir),`
Install ceph debs and use installed debs The ceph task installs ceph using the debian packages now, and all invocations of binaries installed in {tmpdir}/binary/usr/local/bin/ are replace with the use of the binaries installed in standard locations by the debs. Author: Sander Pool <sander.pool@inktank.com> Signed-off-by: Sam Lang <sam.lang@inktank.com> 2013-02-06 19:16:52 +00:00			`'rados',`
add backfill task This does a basic test of backfill functionality, including a divergent log on a backfill target (#1983). 2012-02-01 00:25:53 +00:00			`];`
			`pre.extend(cmd)`
			`proc = remote.run(`
			`args=pre,`
			`wait=False,`
			`)`
			`return proc`

			`def task(ctx, config):`
			`"""`
			`Test backfill`
			`"""`
			`if config is None:`
			`config = {}`
			`assert isinstance(config, dict), \`
			`'thrashosds task only accepts a dict for configuration'`
			`first_mon = teuthology.get_first_mon(ctx, config)`
Revert "Lines formerly of the form '(remote,) = ctx.cluster.only(role).remotes.keys()'" This reverts commit d693b3f8950ffd1f2492a4db0f8234fee31f00f0. 2014-03-27 16:35:28 +00:00			`(mon,) = ctx.cluster.only(first_mon).remotes.iterkeys()`
Update users of the teuthology.orchestra.run APIs Signed-off-by: Zack Cerza <zack.cerza@inktank.com> 2014-05-30 19:32:38 +00:00
add backfill task This does a basic test of backfill functionality, including a divergent log on a backfill target (#1983). 2012-02-01 00:25:53 +00:00			`num_osds = teuthology.num_instances_of_type(ctx.cluster, 'osd')`
			`log.info('num_osds is %s' % num_osds)`
Update users of the teuthology.orchestra.run APIs Signed-off-by: Zack Cerza <zack.cerza@inktank.com> 2014-05-30 19:32:38 +00:00			`assert num_osds == 3`
add backfill task This does a basic test of backfill functionality, including a divergent log on a backfill target (#1983). 2012-02-01 00:25:53 +00:00
			`manager = ceph_manager.CephManager(`
			`mon,`
			`ctx=ctx,`
			`logger=log.getChild('ceph_manager'),`
			`)`

fix misc checks that wait for N osds to be up These all cut&pasted broken code, blah! 2012-04-19 19:43:54 +00:00			`while len(manager.get_osd_status()['up']) < 3:`
add backfill task This does a basic test of backfill functionality, including a divergent log on a backfill target (#1983). 2012-02-01 00:25:53 +00:00			`manager.sleep(10)`
			`manager.raw_cluster_cmd('tell', 'osd.0', 'flush_pg_stats')`
			`manager.raw_cluster_cmd('tell', 'osd.1', 'flush_pg_stats')`
			`manager.raw_cluster_cmd('tell', 'osd.2', 'flush_pg_stats')`
wait_till_clean -> wait_for_clean and wait_for_recovery Clean now also means the correct number of replicas, whereas recovered means we have done all the work we can do given the replicas/osds we have. For example, degraded and clean are now mutually exclusive. Also move away from 'till'. 2012-02-18 05:53:25 +00:00			`manager.wait_for_clean()`
add backfill task This does a basic test of backfill functionality, including a divergent log on a backfill target (#1983). 2012-02-01 00:25:53 +00:00
			`# write some data`
Replace /tmp/cephtest/ with configurable path Teuthology uses /tmp/cephtest/ as the scratch test directory for a run. This patch replaces /tmp/cephtest/ everywhere with a per-run directory: {basedir}/{rundir} where {basedir} is a directory configured in .teuthology.yaml (/tmp/cephtest if not specified), and {rundir} is the name of the run, as given in --name. If no name is specified, {user}-{timestamp} is used. To get the old behavior (/tmp/cephtest), set test_path: /tmp/cephtest in .teuthology.yaml. This change was modivated by #3782, which requires a test dir that survives across reboots, but also resolves #3767. Signed-off-by: Sam Lang <sam.lang@inktank.com> Reviewed-by: Josh Durgin <josh.durgin@inktank.com> 2013-01-23 20:37:39 +00:00			`p = rados_start(ctx, mon, ['-p', 'rbd', 'bench', '15', 'write', '-b', '4096',`
osd_backfill: --no-cleanup for rados bench 2013-01-29 03:53:34 +00:00			`'--no-cleanup'])`
Update users of the teuthology.orchestra.run APIs Signed-off-by: Zack Cerza <zack.cerza@inktank.com> 2014-05-30 19:32:38 +00:00			`err = p.wait()`
add backfill task This does a basic test of backfill functionality, including a divergent log on a backfill target (#1983). 2012-02-01 00:25:53 +00:00			`log.info('err is %d' % err)`

			`# mark osd.0 out to trigger a rebalance/backfill`
			`manager.mark_out_osd(0)`

			`# also mark it down to it won't be included in pg_temps`
			`manager.kill_osd(0)`
			`manager.mark_down_osd(0)`

backfill: wait for clean before writing+blackholing If we have straggler pgs and blackhole osd.1, we can deadlock because we need info from that osd to repeer and continue. Make sure we're clean, and then start the write + blackhole + kill test. 2012-02-14 23:24:11 +00:00			`# wait for everything to peer and be happy...`
			`manager.raw_cluster_cmd('tell', 'osd.1', 'flush_pg_stats')`
			`manager.raw_cluster_cmd('tell', 'osd.2', 'flush_pg_stats')`
wait_till_clean -> wait_for_clean and wait_for_recovery Clean now also means the correct number of replicas, whereas recovered means we have done all the work we can do given the replicas/osds we have. For example, degraded and clean are now mutually exclusive. Also move away from 'till'. 2012-02-18 05:53:25 +00:00			`manager.wait_for_recovery()`
backfill: wait for clean before writing+blackholing If we have straggler pgs and blackhole osd.1, we can deadlock because we need info from that osd to repeer and continue. Make sure we're clean, and then start the write + blackhole + kill test. 2012-02-14 23:24:11 +00:00
add backfill task This does a basic test of backfill functionality, including a divergent log on a backfill target (#1983). 2012-02-01 00:25:53 +00:00			`# write some new data`
task/osd_backfill: use 'rbd' instead of 'data' pool Signed-off-by: Sage Weil <sage@redhat.com> 2014-07-25 18:33:14 +00:00			`p = rados_start(ctx, mon, ['-p', 'rbd', 'bench', '30', 'write', '-b', '4096',`
osd_backfill: --no-cleanup for rados bench 2013-01-29 03:53:34 +00:00			`'--no-cleanup'])`
add backfill task This does a basic test of backfill functionality, including a divergent log on a backfill target (#1983). 2012-02-01 00:25:53 +00:00
			`time.sleep(15)`

			`# blackhole + restart osd.1`
			`# this triggers a divergent backfill target`
			`manager.blackhole_kill_osd(1)`
			`time.sleep(2)`
			`manager.revive_osd(1)`

			`# wait for our writes to complete + succeed`
Update users of the teuthology.orchestra.run APIs Signed-off-by: Zack Cerza <zack.cerza@inktank.com> 2014-05-30 19:32:38 +00:00			`err = p.wait()`
add backfill task This does a basic test of backfill functionality, including a divergent log on a backfill target (#1983). 2012-02-01 00:25:53 +00:00			`log.info('err is %d' % err)`

			`# cluster must recover`
			`manager.raw_cluster_cmd('tell', 'osd.1', 'flush_pg_stats')`
			`manager.raw_cluster_cmd('tell', 'osd.2', 'flush_pg_stats')`
wait_till_clean -> wait_for_clean and wait_for_recovery Clean now also means the correct number of replicas, whereas recovered means we have done all the work we can do given the replicas/osds we have. For example, degraded and clean are now mutually exclusive. Also move away from 'till'. 2012-02-18 05:53:25 +00:00			`manager.wait_for_recovery()`
add backfill task This does a basic test of backfill functionality, including a divergent log on a backfill target (#1983). 2012-02-01 00:25:53 +00:00
			`# re-add osd.0`
			`manager.revive_osd(0)`
			`manager.raw_cluster_cmd('tell', 'osd.1', 'flush_pg_stats')`
			`manager.raw_cluster_cmd('tell', 'osd.2', 'flush_pg_stats')`
wait_till_clean -> wait_for_clean and wait_for_recovery Clean now also means the correct number of replicas, whereas recovered means we have done all the work we can do given the replicas/osds we have. For example, degraded and clean are now mutually exclusive. Also move away from 'till'. 2012-02-18 05:53:25 +00:00			`manager.wait_for_clean()`
add backfill task This does a basic test of backfill functionality, including a divergent log on a backfill target (#1983). 2012-02-01 00:25:53 +00:00