Add a function called check_daemon_versions()
Made minor changes based on pull request comments
Changed the way versions are checked along with some other minor modifications
Made minor modification to the error message
Modifications to include the hostname in the error message
Fixes: https://tracker.ceph.com/issues/46264
Signed-off-by: Tyler <tyler_sheehan@student.uml.edu>
Remove time since health errors aren't preserved if not set every time
Rename health error to DAEMON_OLD_VERSION
Reorganize code so that we can get versions from other daemon types
Change to map of version (key), a list of daemons (value)
Get actual daemon names from get_versions() requests
Fix message output based on how many daemons and how many versions
Add a config option to turn off this feature
Give HEALTH_ERR if more than one older version exists
Fix minor indentation problem with health detail
Signed-off-by: David Zafman <dzafman@redhat.com>
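The version grouping described in the DAEMON_OLD_VERSION change above can be sketched roughly as follows. This is an illustrative Python sketch, not the monitor's implementation: the function names, the `enabled` flag, the message wording, and the rule that the numerically highest version counts as current are all assumptions. The use of HEALTH_WARN for a single old version is also an assumption; the commit only states that more than one older version gives HEALTH_ERR.

```python
from collections import defaultdict


def parse_version(version):
    """Turn '15.2.5' into (15, 2, 5) so versions compare numerically."""
    return tuple(int(part) for part in version.split("."))


def check_daemon_versions(daemon_versions, enabled=True):
    """Return (severity, summary, detail) or None if nothing is outdated.

    daemon_versions maps a daemon name such as 'osd.1' to a version string.
    'enabled' stands in for the config option that turns the check off.
    """
    if not enabled or not daemon_versions:
        return None

    # Map of version (key) to a list of daemons (value).
    by_version = defaultdict(list)
    for daemon, version in daemon_versions.items():
        by_version[version].append(daemon)

    # Assumption: the highest version seen is the current one.
    newest = max(by_version, key=parse_version)
    old = {v: sorted(d) for v, d in by_version.items() if v != newest}
    if not old:
        return None

    # Assumption: one old version warns, more than one is an error.
    severity = "HEALTH_ERR" if len(old) > 1 else "HEALTH_WARN"
    count = sum(len(daemons) for daemons in old.values())
    summary = f"{count} daemon(s) running {len(old)} old version(s)"
    detail = [
        f"{', '.join(old[v])} running version {v}"
        for v in sorted(old, key=parse_version)
    ]
    return severity, summary, detail
```

For example, `check_daemon_versions({"osd.0": "15.2.5", "osd.1": "15.2.4"})` reports `osd.1` under a single old version, while adding a daemon on a third version raises the severity.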
Because not every node in the teuthology config necessarily
has a cephadm-related role, we should not execute
cephadm on a random node.
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
root privileges need to be verified before creating the log dir
introduced by 81a7df0
Fixes: https://tracker.ceph.com/issues/48166
Signed-off-by: Michael Fritch <mfritch@suse.com>
mgr/cephadm: only deploy 3 mons by default when there are 4 hosts
Reviewed-by: Joshua Schmid <jschmid@suse.de>
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
Reviewed-by: Michael Fritch <mfritch@suse.com>
silences the warning when building with -DNDEBUG:
segment_cleaner.h:399:10: warning: unused variable ‘ret’ [-Wunused-variable]
399 | auto ret = space_tracker->allocate(
| ^~~
Signed-off-by: Kefu Chai <kchai@redhat.com>
Update the documentation reference as the content
of the HACKING.rst file has been moved to the Ceph
developer documentation.
Signed-off-by: Tatjana Dehler <tdehler@suse.com>
in the hope of addressing the following failure:
ERROR: Cannot install -r requirements.txt (line 3) and kubernetes because these package versions have conflicting dependencies.
The conflict is caused by:
kubernetes 12.0.0 depends on urllib3>=1.24.2
requests 2.24.0 depends on urllib3!=1.25.0, !=1.25.1, <1.26 and >=1.21.1
Signed-off-by: Kefu Chai <kchai@redhat.com>
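The two urllib3 constraints quoted in the pip error above can be checked by hand. The sketch below is only an illustration: the helper names are made up, and the version parsing assumes plain dotted numeric versions. It evaluates a few candidate urllib3 versions against both requirement sets.

```python
def parse(version):
    """Turn '1.25.11' into (1, 25, 11) for numeric comparison."""
    return tuple(int(part) for part in version.split("."))


def ok_for_kubernetes(version):
    # kubernetes 12.0.0 depends on urllib3>=1.24.2
    return parse(version) >= (1, 24, 2)


def ok_for_requests(version):
    # requests 2.24.0 depends on urllib3!=1.25.0, !=1.25.1, <1.26 and >=1.21.1
    v = parse(version)
    excluded = {(1, 25, 0), (1, 25, 1)}
    return (1, 21, 1) <= v < (1, 26) and v not in excluded


def ok_for_both(version):
    return ok_for_kubernetes(version) and ok_for_requests(version)


if __name__ == "__main__":
    for candidate in ["1.24.1", "1.24.2", "1.25.1", "1.25.11", "1.26.0"]:
        print(candidate, ok_for_both(candidate))
```

A pinned urllib3 version that fails either check would lead pip's resolver to report a conflict like the one quoted.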
The 'op_r' just counts ops with the CEPH_OSD_FLAG_READ flag set, which
includes some opcodes that do not actually read data, like CEPH_OSD_OP_STAT.
Signed-off-by: Xiubo Li <xiubli@redhat.com>
Previously, if the same image was mapped twice and one then
wanted to re-attach, it could try to use the wrong device. Also,
list-mapped could produce an incorrect listing after re-attach.
Signed-off-by: Mykola Golub <mgolub@suse.com>
The proc fs contains both /proc/[pid] and /proc/[tid]
subdirectories. The /proc/[tid] subdirectories are not visible
when listing /proc. But when checking the pid of a reattached process,
get_mapped_info directly tries to open /proc/[pid]/cmdline, and
it may actually be a tid belonging to some other rbd-nbd
process. Try to avoid this by checking /proc/[pid]/comm first --
for pid we expect to find "rbd-nbd" here.
Signed-off-by: Mykola Golub <mgolub@suse.com>
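The comm check described in the commit above can be sketched as follows. This is a Python illustration of the idea only; rbd-nbd itself is not written in Python, and the function name and the `proc_root` parameter (added so the check can be pointed at a test directory) are hypothetical.

```python
import os


def looks_like_rbd_nbd(pid, proc_root="/proc", expected_comm="rbd-nbd"):
    """Return True if <proc_root>/<pid>/comm names the expected command.

    A missing or unreadable entry is treated as "not the process".
    """
    comm_path = os.path.join(proc_root, str(pid), "comm")
    try:
        with open(comm_path) as f:
            # comm holds the command name followed by a newline
            return f.read().strip() == expected_comm
    except OSError:
        return False
```

Only when this returns True would a caller go on to read /proc/[pid]/cmdline for the mapping details.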
For `detach`, failing to find the process is fatal, while unmap
will still try to send a disconnect to the device.
Signed-off-by: Mykola Golub <mgolub@suse.com>
Introduce a "scheduler" directory under the rados:perf tree to allow the perf
suite to specify tests with the default scheduler (WPQ) and also with
the dmClock scheduler. One specification also overrides the number of
shards (1) and the number of threads per shard (16) to test with, apart from
the default settings. This allows testing and performance benchmarking
with the new proposal to use one shard and multiple threads per shard with
the dmClock scheduler.
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
To preserve the old bluestore behavior it should compare the
current object with the end using bluestore keys, not oids.
Fixes: https://tracker.ceph.com/issues/48153
Signed-off-by: Mykola Golub <mgolub@suse.com>
When tests are launched with kernel client using vstart_runner.py,
config is None and, therefore, the call "config.get()" leads to a crash.
Assigning None to self.rbytes is important, since leaving it undefined
would lead to a crash: the code executed later assumes that self.rbytes
is defined.
Fixes: https://tracker.ceph.com/issues/48147
Signed-off-by: Rishabh Dave <ridave@redhat.com>
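The guard described in the commit above follows a common pattern, sketched here under assumptions: the class name, the "rbytes" config key, and the mount_options() helper are hypothetical and are not taken from the actual qa code. The point is that the attribute is always defined, even when no config is passed.

```python
class KernelMountSketch:
    """Hypothetical stand-in for a mount class; not the real qa code."""

    def __init__(self, config=None):
        if config is None:
            # No config available (as when launched via vstart_runner.py):
            # calling config.get() would raise AttributeError, so skip it
            # but still define the attribute for later code.
            self.rbytes = None
        else:
            # Assumed key name; returns None when the key is absent.
            self.rbytes = config.get("rbytes")

    def mount_options(self):
        """Later code can rely on self.rbytes existing."""
        opts = []
        if self.rbytes is not None:
            opts.append("rbytes" if self.rbytes else "norbytes")
        return opts
```

With this shape, `KernelMountSketch().mount_options()` returns an empty list instead of raising.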
* refs/pull/37942/head:
ceph-volume: fix lvm help test
ceph-volume: add a unit tests to lvm batch
ceph-volume: fix lvm batch auto with full SSDs
Reviewed-by: Guillaume Abrioux <gabrioux@redhat.com>
Reviewed-by: Jan Fajerski <jfajerski@suse.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
An object may take more than one push op to recover; the later push ops also need to
set their version accordingly.
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
PGRecovery::start_recovery_ops() should wait for all inflight recovery ops, whether they were
started by BackgroundRecovery or not. Otherwise, there may be circumstances in which BackgroundRecovery
keeps recursively invoking its do_recovery when start_recovery_ops reports recovery done while there are
still missing objects.
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
Add test case for permitted hours to make sure scrub doesn't start
Remove permitted hours in extended sleep test
Fixes: https://tracker.ceph.com/issues/48077
Signed-off-by: David Zafman <dzafman@redhat.com>
This commit updates the "ceph df" output
so that it is current as of October 2020.
-Add correctly formatted `ceph df` output.
-Add explanation of "DIRTY" column.
-(DATA) remains to be defined (1 instance)
-(OMAP) remains to be defined (1 instance)
-USED remains to be defined (1 instance)
-Update prompts in "Checking OSD Status"
Fixes: https://tracker.ceph.com/issues/47523
Signed-off-by: Zac Dover <zac.dover@gmail.com>