ceph/qa/tasks/lost_unfound.py

"""
Lost_unfound
"""
import logging
import time
import ceph_manager
from teuthology import misc as teuthology
from teuthology.orchestra import run
from util.rados import rados

log = logging.getLogger(__name__)

def task(ctx, config):
    """
    Test handling of lost objects.

    A pretty rigid cluseter is brought up andtested by this task
    """
    POOL = 'unfound_pool'
    if config is None:
        config = {}
    assert isinstance(config, dict), \
        'lost_unfound task only accepts a dict for configuration'
    first_mon = teuthology.get_first_mon(ctx, config)
    (mon,) = ctx.cluster.only(first_mon).remotes.iterkeys()

    manager = ceph_manager.CephManager(
        mon,
        ctx=ctx,
        logger=log.getChild('ceph_manager'),
        )

    while len(manager.get_osd_status()['up']) < 3:
        time.sleep(10)

    manager.wait_for_clean()

    manager.create_pool(POOL)

    # something that is always there
    dummyfile = '/etc/fstab'

    # take an osd out until the very end
    manager.kill_osd(2)
    manager.mark_down_osd(2)
    manager.mark_out_osd(2)

    # kludge to make sure they get a map
    rados(ctx, mon, ['-p', POOL, 'put', 'dummy', dummyfile])

    manager.flush_pg_stats([0, 1])
    manager.wait_for_recovery()

    # create old objects
    for f in range(1, 10):
        rados(ctx, mon, ['-p', POOL, 'put', 'existing_%d' % f, dummyfile])
        rados(ctx, mon, ['-p', POOL, 'put', 'existed_%d' % f, dummyfile])
        rados(ctx, mon, ['-p', POOL, 'rm', 'existed_%d' % f])

    # delay recovery, and make the pg log very long (to prevent backfill)
    manager.raw_cluster_cmd(
            'tell', 'osd.1',
            'injectargs',
            '--osd-recovery-delay-start 1000 --osd-min-pg-log-entries 100000000'
            )

    manager.kill_osd(0)
    manager.mark_down_osd(0)
    
    for f in range(1, 10):
        rados(ctx, mon, ['-p', POOL, 'put', 'new_%d' % f, dummyfile])
        rados(ctx, mon, ['-p', POOL, 'put', 'existed_%d' % f, dummyfile])
        rados(ctx, mon, ['-p', POOL, 'put', 'existing_%d' % f, dummyfile])

    # bring osd.0 back up, let it peer, but don't replicate the new
    # objects...
    log.info('osd.0 command_args is %s' % 'foo')
    log.info(ctx.daemons.get_daemon('osd', 0).command_args)
    ctx.daemons.get_daemon('osd', 0).command_kwargs['args'].extend([
            '--osd-recovery-delay-start', '1000'
            ])
    manager.revive_osd(0)
    manager.mark_in_osd(0)
    manager.wait_till_osd_is_up(0)

    manager.flush_pg_stats([1, 0])
    manager.wait_till_active()

    # take out osd.1 and the only copy of those objects.
    manager.kill_osd(1)
    manager.mark_down_osd(1)
    manager.mark_out_osd(1)
    manager.raw_cluster_cmd('osd', 'lost', '1', '--yes-i-really-mean-it')

    # bring up osd.2 so that things would otherwise, in theory, recovery fully
    manager.revive_osd(2)
    manager.mark_in_osd(2)
    manager.wait_till_osd_is_up(2)

    manager.flush_pg_stats([0, 2])
    manager.wait_till_active()
    manager.flush_pg_stats([0, 2])

    # verify that there are unfound objects
    unfound = manager.get_num_unfound_objects()
    log.info("there are %d unfound objects" % unfound)
    assert unfound

    testdir = teuthology.get_testdir(ctx)
    procs = []
    if config.get('parallel_bench', True):
        procs.append(mon.run(
            args=[
                "/bin/sh", "-c",
                " ".join(['adjust-ulimits',
                          'ceph-coverage',
                          '{tdir}/archive/coverage',
                          'rados',
                          '--no-log-to-stderr',
                          '--name', 'client.admin',
                          '-b', str(4<<10),
                          '-p' , POOL,
                          '-t', '20',
                          'bench', '240', 'write',
                      ]).format(tdir=testdir),
            ],
            logger=log.getChild('radosbench.{id}'.format(id='client.admin')),
            stdin=run.PIPE,
            wait=False
        ))
    time.sleep(10)

    # mark stuff lost
    pgs = manager.get_pg_stats()
    for pg in pgs:
        if pg['stat_sum']['num_objects_unfound'] > 0:
            primary = 'osd.%d' % pg['acting'][0]

            # verify that i can list them direct from the osd
            log.info('listing missing/lost in %s state %s', pg['pgid'],
                     pg['state']);
            m = manager.list_pg_unfound(pg['pgid'])
            #log.info('%s' % m)
            assert m['num_unfound'] == pg['stat_sum']['num_objects_unfound']
            num_unfound=0
            for o in m['objects']:
                if len(o['locations']) == 0:
                    num_unfound += 1
            assert m['num_unfound'] == num_unfound

            log.info("reverting unfound in %s on %s", pg['pgid'], primary)
            manager.raw_cluster_cmd('pg', pg['pgid'],
                                    'mark_unfound_lost', 'revert')
        else:
            log.info("no unfound in %s", pg['pgid'])

    manager.raw_cluster_cmd('tell', 'osd.0', 'debug', 'kick_recovery_wq', '5')
    manager.raw_cluster_cmd('tell', 'osd.2', 'debug', 'kick_recovery_wq', '5')
    manager.flush_pg_stats([0, 2])
    manager.wait_for_recovery()

    # verify result
    for f in range(1, 10):
        err = rados(ctx, mon, ['-p', POOL, 'get', 'new_%d' % f, '-'])
        assert err
        err = rados(ctx, mon, ['-p', POOL, 'get', 'existed_%d' % f, '-'])
        assert err
        err = rados(ctx, mon, ['-p', POOL, 'get', 'existing_%d' % f, '-'])
        assert not err

    # see if osd.1 can cope
    manager.mark_in_osd(1)
    manager.revive_osd(1)
    manager.wait_till_osd_is_up(1)
    manager.wait_for_clean()
    run.wait(procs)
Added docstrings, and improved some of the comments on several tasks. 2013-10-12 08:28:27 +00:00			`"""`
			`Lost_unfound`
			`"""`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`import logging`
Fixes #11013, use time.sleep instead of manager.sleep which isn't there. Signed-off-by: Andrew Schoen <aschoen@redhat.com> 2015-03-03 22:38:10 +00:00			`import time`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`import ceph_manager`
			`from teuthology import misc as teuthology`
tasks/unfound.py: run rados bench in parallel Signed-off-by: Samuel Just <sjust@redhat.com> 2015-12-22 18:06:22 +00:00			`from teuthology.orchestra import run`
Update module references Signed-off-by: Zack Cerza <zack.cerza@inktank.com> 2014-08-07 14:24:59 +00:00			`from util.rados import rados`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00
			`log = logging.getLogger(__name__)`

			`def task(ctx, config):`
			`"""`
			`Test handling of lost objects.`
Added docstrings, and improved some of the comments on several tasks. 2013-10-12 08:28:27 +00:00
			`A pretty rigid cluseter is brought up andtested by this task`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`"""`
Tasks are failing since using "data" pool no longer part of default install Create a pool specifically for each task Fixes: 8930 Signed-off-by: David Zafman <david.zafman@inktank.com> 2014-08-01 16:36:10 +00:00			`POOL = 'unfound_pool'`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`if config is None:`
			`config = {}`
			`assert isinstance(config, dict), \`
lost_unfound: typo 2012-01-11 00:21:00 +00:00			`'lost_unfound task only accepts a dict for configuration'`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`first_mon = teuthology.get_first_mon(ctx, config)`
Revert "Lines formerly of the form '(remote,) = ctx.cluster.only(role).remotes.keys()'" This reverts commit d693b3f8950ffd1f2492a4db0f8234fee31f00f0. 2014-03-27 16:35:28 +00:00			`(mon,) = ctx.cluster.only(first_mon).remotes.iterkeys()`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00
			`manager = ceph_manager.CephManager(`
			`mon,`
			`ctx=ctx,`
			`logger=log.getChild('ceph_manager'),`
			`)`

fix misc checks that wait for N osds to be up These all cut&pasted broken code, blah! 2012-04-19 19:43:54 +00:00			`while len(manager.get_osd_status()['up']) < 3:`
Fixes #11013, use time.sleep instead of manager.sleep which isn't there. Signed-off-by: Andrew Schoen <aschoen@redhat.com> 2015-03-03 22:38:10 +00:00			`time.sleep(10)`
[ec_]lost_unfound: don't flush_pg_stats at the beginning The upgrade tests restart the daemons right before that part, and the restart marks the osds down causing the flush_pg_stats to fail. It's not necessary anymore anyway. Signed-off-by: Samuel Just <sjust@redhat.com> 2016-06-17 15:16:56 +00:00
wait_till_clean -> wait_for_clean and wait_for_recovery Clean now also means the correct number of replicas, whereas recovered means we have done all the work we can do given the replicas/osds we have. For example, degraded and clean are now mutually exclusive. Also move away from 'till'. 2012-02-18 05:53:25 +00:00			`manager.wait_for_clean()`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00
Tasks are failing since using "data" pool no longer part of default install Create a pool specifically for each task Fixes: 8930 Signed-off-by: David Zafman <david.zafman@inktank.com> 2014-08-01 16:36:10 +00:00			`manager.create_pool(POOL)`

add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`# something that is always there`
			`dummyfile = '/etc/fstab'`

			`# take an osd out until the very end`
			`manager.kill_osd(2)`
			`manager.mark_down_osd(2)`
			`manager.mark_out_osd(2)`

			`# kludge to make sure they get a map`
Tasks are failing since using "data" pool no longer part of default install Create a pool specifically for each task Fixes: 8930 Signed-off-by: David Zafman <david.zafman@inktank.com> 2014-08-01 16:36:10 +00:00			`rados(ctx, mon, ['-p', POOL, 'put', 'dummy', dummyfile])`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00
qa/tasks: use new reliable flush_pg_stats helper The helper gets a sequence number from the osd (or osds), and then polls the mon until that seq is reflected there. This is overkill in some cases, since many tests only require that the stats be reflected on the mgr (not the mon), but waiting for it to also reach the mon is sufficient! Signed-off-by: Sage Weil <sage@redhat.com> 2017-05-18 22:16:55 +00:00			`manager.flush_pg_stats([0, 1])`
wait_till_clean -> wait_for_clean and wait_for_recovery Clean now also means the correct number of replicas, whereas recovered means we have done all the work we can do given the replicas/osds we have. For example, degraded and clean are now mutually exclusive. Also move away from 'till'. 2012-02-18 05:53:25 +00:00			`manager.wait_for_recovery()`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00
			`# create old objects`
			`for f in range(1, 10):`
Tasks are failing since using "data" pool no longer part of default install Create a pool specifically for each task Fixes: 8930 Signed-off-by: David Zafman <david.zafman@inktank.com> 2014-08-01 16:36:10 +00:00			`rados(ctx, mon, ['-p', POOL, 'put', 'existing_%d' % f, dummyfile])`
			`rados(ctx, mon, ['-p', POOL, 'put', 'existed_%d' % f, dummyfile])`
			`rados(ctx, mon, ['-p', POOL, 'rm', 'existed_%d' % f])`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00
lost_unfound: make test work with backfill If we backfill, we fail to peer instead of having every object show up as 'unfound'. Avoid that by preventing log trimming, so that we always do log recovery for this test. 2012-01-12 23:08:11 +00:00			`# delay recovery, and make the pg log very long (to prevent backfill)`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`manager.raw_cluster_cmd(`
			`'tell', 'osd.1',`
lost_unfound: make test work with backfill If we backfill, we fail to peer instead of having every object show up as 'unfound'. Avoid that by preventing log trimming, so that we always do log recovery for this test. 2012-01-12 23:08:11 +00:00			`'injectargs',`
			`'--osd-recovery-delay-start 1000 --osd-min-pg-log-entries 100000000'`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`)`

			`manager.kill_osd(0)`
			`manager.mark_down_osd(0)`

			`for f in range(1, 10):`
Tasks are failing since using "data" pool no longer part of default install Create a pool specifically for each task Fixes: 8930 Signed-off-by: David Zafman <david.zafman@inktank.com> 2014-08-01 16:36:10 +00:00			`rados(ctx, mon, ['-p', POOL, 'put', 'new_%d' % f, dummyfile])`
			`rados(ctx, mon, ['-p', POOL, 'put', 'existed_%d' % f, dummyfile])`
			`rados(ctx, mon, ['-p', POOL, 'put', 'existing_%d' % f, dummyfile])`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00
			`# bring osd.0 back up, let it peer, but don't replicate the new`
			`# objects...`
			`log.info('osd.0 command_args is %s' % 'foo')`
			`log.info(ctx.daemons.get_daemon('osd', 0).command_args)`
			`ctx.daemons.get_daemon('osd', 0).command_kwargs['args'].extend([`
			`'--osd-recovery-delay-start', '1000'`
			`])`
			`manager.revive_osd(0)`
lost_unfound: mark osds in when we revive them so that we test what we meant to. It also lets us actually go clean at the very end. 2012-02-20 03:40:45 +00:00			`manager.mark_in_osd(0)`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`manager.wait_till_osd_is_up(0)`

qa/tasks: use new reliable flush_pg_stats helper The helper gets a sequence number from the osd (or osds), and then polls the mon until that seq is reflected there. This is overkill in some cases, since many tests only require that the stats be reflected on the mgr (not the mon), but waiting for it to also reach the mon is sufficient! Signed-off-by: Sage Weil <sage@redhat.com> 2017-05-18 22:16:55 +00:00			`manager.flush_pg_stats([1, 0])`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`manager.wait_till_active()`

			`# take out osd.1 and the only copy of those objects.`
			`manager.kill_osd(1)`
			`manager.mark_down_osd(1)`
			`manager.mark_out_osd(1)`
			`manager.raw_cluster_cmd('osd', 'lost', '1', '--yes-i-really-mean-it')`

			`# bring up osd.2 so that things would otherwise, in theory, recovery fully`
			`manager.revive_osd(2)`
lost_unfound: mark osds in when we revive them so that we test what we meant to. It also lets us actually go clean at the very end. 2012-02-20 03:40:45 +00:00			`manager.mark_in_osd(2)`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`manager.wait_till_osd_is_up(2)`

qa/tasks: use new reliable flush_pg_stats helper The helper gets a sequence number from the osd (or osds), and then polls the mon until that seq is reflected there. This is overkill in some cases, since many tests only require that the stats be reflected on the mgr (not the mon), but waiting for it to also reach the mon is sufficient! Signed-off-by: Sage Weil <sage@redhat.com> 2017-05-18 22:16:55 +00:00			`manager.flush_pg_stats([0, 2])`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`manager.wait_till_active()`
qa/tasks: use new reliable flush_pg_stats helper The helper gets a sequence number from the osd (or osds), and then polls the mon until that seq is reflected there. This is overkill in some cases, since many tests only require that the stats be reflected on the mgr (not the mon), but waiting for it to also reach the mon is sufficient! Signed-off-by: Sage Weil <sage@redhat.com> 2017-05-18 22:16:55 +00:00			`manager.flush_pg_stats([0, 2])`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00
			`# verify that there are unfound objects`
			`unfound = manager.get_num_unfound_objects()`
			`log.info("there are %d unfound objects" % unfound)`
			`assert unfound`

tasks/unfound.py: run rados bench in parallel Signed-off-by: Samuel Just <sjust@redhat.com> 2015-12-22 18:06:22 +00:00			`testdir = teuthology.get_testdir(ctx)`
			`procs = []`
			`if config.get('parallel_bench', True):`
			`procs.append(mon.run(`
			`args=[`
			`"/bin/sh", "-c",`
			`" ".join(['adjust-ulimits',`
			`'ceph-coverage',`
			`'{tdir}/archive/coverage',`
			`'rados',`
			`'--no-log-to-stderr',`
			`'--name', 'client.admin',`
			`'-b', str(4<<10),`
			`'-p' , POOL,`
			`'-t', '20',`
			`'bench', '240', 'write',`
			`]).format(tdir=testdir),`
			`],`
			`logger=log.getChild('radosbench.{id}'.format(id='client.admin')),`
			`stdin=run.PIPE,`
			`wait=False`
			`))`
			`time.sleep(10)`

add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`# mark stuff lost`
			`pgs = manager.get_pg_stats()`
			`for pg in pgs:`
			`if pg['stat_sum']['num_objects_unfound'] > 0:`
			`primary = 'osd.%d' % pg['acting'][0]`
lost_unfound: list missing/unfound for each pg and verify the unfound counts This also tests the pg list_missing functionality. 2012-02-24 19:11:59 +00:00
			`# verify that i can list them direct from the osd`
github.com/NewDreamNetwork -> github.com/ceph 2012-03-02 18:55:19 +00:00			`log.info('listing missing/lost in %s state %s', pg['pgid'],`
			`pg['state']);`
osd/PrimaryLogPG: s/list_missing/list_unfound/ Also: - Do not print offset until specified - Count missing objects correctly (used to be primary's local missing) Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn> 2018-08-23 06:29:01 +00:00			`m = manager.list_pg_unfound(pg['pgid'])`
lost_unfound: list missing/unfound for each pg and verify the unfound counts This also tests the pg list_missing functionality. 2012-02-24 19:11:59 +00:00			`#log.info('%s' % m)`
			`assert m['num_unfound'] == pg['stat_sum']['num_objects_unfound']`
			`num_unfound=0`
			`for o in m['objects']:`
			`if len(o['locations']) == 0:`
			`num_unfound += 1`
			`assert m['num_unfound'] == num_unfound`

add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`log.info("reverting unfound in %s on %s", pg['pgid'], primary)`
lost_unfound: new mark_unfound_lost syntax 2012-02-24 04:07:24 +00:00			`manager.raw_cluster_cmd('pg', pg['pgid'],`
			`'mark_unfound_lost', 'revert')`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`else:`
			`log.info("no unfound in %s", pg['pgid'])`

			`manager.raw_cluster_cmd('tell', 'osd.0', 'debug', 'kick_recovery_wq', '5')`
			`manager.raw_cluster_cmd('tell', 'osd.2', 'debug', 'kick_recovery_wq', '5')`
qa/tasks: use new reliable flush_pg_stats helper The helper gets a sequence number from the osd (or osds), and then polls the mon until that seq is reflected there. This is overkill in some cases, since many tests only require that the stats be reflected on the mgr (not the mon), but waiting for it to also reach the mon is sufficient! Signed-off-by: Sage Weil <sage@redhat.com> 2017-05-18 22:16:55 +00:00			`manager.flush_pg_stats([0, 2])`
wait_till_clean -> wait_for_clean and wait_for_recovery Clean now also means the correct number of replicas, whereas recovered means we have done all the work we can do given the replicas/osds we have. For example, degraded and clean are now mutually exclusive. Also move away from 'till'. 2012-02-18 05:53:25 +00:00			`manager.wait_for_recovery()`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00
			`# verify result`
			`for f in range(1, 10):`
Tasks are failing since using "data" pool no longer part of default install Create a pool specifically for each task Fixes: 8930 Signed-off-by: David Zafman <david.zafman@inktank.com> 2014-08-01 16:36:10 +00:00			`err = rados(ctx, mon, ['-p', POOL, 'get', 'new_%d' % f, '-'])`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`assert err`
Tasks are failing since using "data" pool no longer part of default install Create a pool specifically for each task Fixes: 8930 Signed-off-by: David Zafman <david.zafman@inktank.com> 2014-08-01 16:36:10 +00:00			`err = rados(ctx, mon, ['-p', POOL, 'get', 'existed_%d' % f, '-'])`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`assert err`
Tasks are failing since using "data" pool no longer part of default install Create a pool specifically for each task Fixes: 8930 Signed-off-by: David Zafman <david.zafman@inktank.com> 2014-08-01 16:36:10 +00:00			`err = rados(ctx, mon, ['-p', POOL, 'get', 'existing_%d' % f, '-'])`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`assert not err`

			`# see if osd.1 can cope`
lost_unfound: mark osds in when we revive them so that we test what we meant to. It also lets us actually go clean at the very end. 2012-02-20 03:40:45 +00:00			`manager.mark_in_osd(1)`
qa: fix the potential delay of pg state change If start osd process first and then mark it in, the pg state may remain all active+clean when doing wait_for_clean() check, which may fail the next osd_scrub_pgs() process. So faster pg state change by marking osd in first. Signed-off-by: huangjun <huangjun@xsky.com> 2017-08-25 09:07:37 +00:00			`manager.revive_osd(1)`
add lost_unfound task Also some misc useful bits to ceph_manager. 2011-10-17 22:32:22 +00:00			`manager.wait_till_osd_is_up(1)`
wait_till_clean -> wait_for_clean and wait_for_recovery Clean now also means the correct number of replicas, whereas recovered means we have done all the work we can do given the replicas/osds we have. For example, degraded and clean are now mutually exclusive. Also move away from 'till'. 2012-02-18 05:53:25 +00:00			`manager.wait_for_clean()`
tasks/unfound.py: run rados bench in parallel Signed-off-by: Samuel Just <sjust@redhat.com> 2015-12-22 18:06:22 +00:00			`run.wait(procs)`