ceph/qa/tasks/osd_backfill.py

"""
OSD backfill test
"""
import logging
import time
from tasks import ceph_manager
from teuthology import misc as teuthology


log = logging.getLogger(__name__)
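

# Illustrative sketch, not part of the original task: task() below builds the
# same ``rados bench`` argument list twice, differing only in the duration.
# A small helper could build that list in one place. The name
# ``bench_write_args`` and its parameters are hypothetical; the flags are
# exactly the ones task() already passes.
def bench_write_args(seconds, pool='rbd', block_size=4096):
    """
    Return rados arguments for a write benchmark that keeps its objects.
    """
    return ['-p', pool, 'bench', str(seconds), 'write', '-b', str(block_size),
            '--no-cleanup']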


def rados_start(ctx, remote, cmd):
    """
    Run a rados command on the remote without waiting for it to finish
    (currently used only to write data). Returns the running process.
    """
    log.info("rados %s" % ' '.join(cmd))
    testdir = teuthology.get_testdir(ctx)
    pre = [
        'adjust-ulimits',
        'ceph-coverage',
        '{tdir}/archive/coverage'.format(tdir=testdir),
        'rados',
        ]
    pre.extend(cmd)
    proc = remote.run(
        args=pre,
        wait=False,
        )
    return proc
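

# Illustrative sketch, not part of the original task: a bounded variant of the
# "wait for N OSDs to be up" loop in task() below, which otherwise polls
# forever if an OSD never comes up. It assumes only that
# ``manager.get_osd_status()`` returns a dict with an 'up' list, as task()
# already relies on. The name ``wait_for_osds_up`` and its ``timeout`` and
# ``interval`` parameters are hypothetical.
def wait_for_osds_up(manager, count, timeout=300, interval=10):
    """
    Poll until at least ``count`` OSDs are up; raise RuntimeError on timeout.
    """
    deadline = time.time() + timeout
    while True:
        num_up = len(manager.get_osd_status()['up'])
        if num_up >= count:
            return num_up
        if time.time() >= deadline:
            raise RuntimeError(
                'only %d of %d osds up after %d seconds'
                % (num_up, count, timeout))
        time.sleep(interval)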

def task(ctx, config):
    """
    Test backfill
    """
    if config is None:
        config = {}
    assert isinstance(config, dict), \
        'osd_backfill task only accepts a dict for configuration'
    first_mon = teuthology.get_first_mon(ctx, config)
    (mon,) = ctx.cluster.only(first_mon).remotes.keys()

    num_osds = teuthology.num_instances_of_type(ctx.cluster, 'osd')
    log.info('num_osds is %s' % num_osds)
    assert num_osds == 3

    manager = ceph_manager.CephManager(
        mon,
        ctx=ctx,
        logger=log.getChild('ceph_manager'),
        )

    while len(manager.get_osd_status()['up']) < 3:
        time.sleep(10)
    manager.flush_pg_stats([0, 1, 2])
    manager.wait_for_clean()

    # write some data
    p = rados_start(ctx, mon, ['-p', 'rbd', 'bench', '15', 'write', '-b', '4096',
                               '--no-cleanup'])
    err = p.wait()
    log.info('err is %d' % err)

    # mark osd.0 out to trigger a rebalance/backfill
    manager.mark_out_osd(0)

    # also mark it down so it won't be included in pg_temps
    manager.kill_osd(0)
    manager.mark_down_osd(0)

    # wait for everything to peer and be happy...
    manager.flush_pg_stats([1, 2])
    manager.wait_for_recovery()

    # write some new data
    p = rados_start(ctx, mon, ['-p', 'rbd', 'bench', '30', 'write', '-b', '4096',
                               '--no-cleanup'])

    time.sleep(15)

    # blackhole + restart osd.1
    # this triggers a divergent backfill target
    manager.blackhole_kill_osd(1)
    time.sleep(2)
    manager.revive_osd(1)

    # wait for our writes to complete + succeed
    err = p.wait()
    log.info('err is %d' % err)

    # wait for osd.1 and osd.2 to be up
    manager.wait_till_osd_is_up(1)
    manager.wait_till_osd_is_up(2)

    # cluster must recover
    manager.flush_pg_stats([1, 2])
    manager.wait_for_recovery()

    # re-add osd.0
    manager.revive_osd(0)
    manager.flush_pg_stats([1, 2])
    manager.wait_for_clean()