Commit Graph

88760 Commits

Author SHA1 Message Date
Nathan Cutler
e0042dd617 build/ops: unify command substitution in install-deps.sh
The $() form is preferable to `` because folks (like me) might be using
` as a keyboard shortcut to GNU Screen, causing havoc to ensue whenever
copy-pasting the ` character.

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2018-08-01 12:36:31 +02:00
Nathan Cutler
f170775770 build/ops: streamline processing of WITH_SEASTAR env var
Quoting relevant portion of "man test":

    STRING equivalent to -n STRING

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2018-08-01 12:36:09 +02:00
Ricardo Dias
77cdb2dccf
Merge pull request #23224 from votdev/rest_client_timeout
mgr/dashboard: Set timeout in RestClient calls

Reviewed-by: Lenz Grimmer <lgrimmer@suse.com>
Reviewed-by: Sebastian Wagner <swagner@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
2018-08-01 10:02:09 +01:00
Ricardo Dias
47a50eeba5
Merge pull request #21881 from sebastian-philipp/dashboard-pool-patch
mgr/dashboard: Add Pool update endpoint

Reviewed-by: Stephan Müller <smueller@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
2018-08-01 10:00:10 +01:00
Kefu Chai
09121bb95f
Merge pull request #23284 from tchaikov/wip-seastar-config
crimson/common: write configs synchronously on shard.0

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2018-08-01 14:58:43 +08:00
Sage Weil
c909c0aa35 Merge PR #22825 into master
* refs/pull/22825/head:
	common: FreeBSD does not have /etc/os-release

Reviewed-by: Kefu Chai <kchai@redhat.com>
2018-07-31 20:16:46 -05:00
Sage Weil
0837c9d816 Merge PR #22998 into master
* refs/pull/22998/head:
	filestore: add pgid in filestore pg dir split log message

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-07-31 20:16:41 -05:00
Sage Weil
569d475da3 Merge PR #23134 into master
* refs/pull/23134/head:
	common: check completion condition before waiting

Reviewed-by: Gregory Farnum <gfarnum@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2018-07-31 20:16:35 -05:00
Sage Weil
97a697e7f7 Merge PR #23223 into master
* refs/pull/23223/head:
	osd/PG: kill dead functions and related options
	iosd/osd_type: kill unused input ec_pool for iterate_mayberw_back_to
	common: kill dead options
	osd/PG: do not initialize up/acting twice
	osd/PG: clear missing_loc properly if last location is gone

Reviewed-by: Sage Weil <sage@redhat.com>
2018-07-31 20:16:30 -05:00
Kefu Chai
1cbd929806
Merge pull request #22990 from tchaikov/wip-cmake-link-static-libstdc++
cmake: fix "WITH_STATIC_LIBSTDCXX"

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2018-08-01 09:06:37 +08:00
Sage Weil
34646c6a65 Merge PR #22692 into master
* refs/pull/22692/head:
	doc/mgr/devicehealth: document devicehealth module
	doc/rados/operations/health-checks: document DEVICE_HEALTH* messages
	mgr/devicehealth: fix style for returns
	mgr/devicehealth: use constants for health warnings
	mgr/devicehealth: deal with as many daemons as we can until limit
	mgr/devicehealth: warn if too many daemons are expected to fail soon
	mgr/devicehealth: set primary-affinity 0 for failing devices
	msg/devicehealth: fix config options
	mgr/devicehealth: only fetch osdmap once from check_health
	mgr/devicehealth: revise health messages
	mgr/devicehealth: add 'device check-health' command and run periodically
	mgr/devicehealth: fix new options
	mgr/devicehealth: add helpers to life_expectancy_response()
	mgr/devicehealth: simplify setting defaults
	common/blkdev remove debug statements

Reviewed-by: John Spray <john.spray@redhat.com>
2018-07-31 17:23:48 -05:00
Patrick Donnelly
a2089173e3
Merge PR #23157 into master
* refs/pull/23157/head:
	Provided API to change umask
2018-07-31 14:50:50 -07:00
Sage Weil
0aba0f4bcd Merge PR #23354 into master
* refs/pull/23354/head:
	src/osd/PG.cc: remove redundant call to trim_log()

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2018-07-31 16:24:22 -05:00
Sage Weil
f09a87f902 doc/mgr/devicehealth: document devicehealth module
Signed-off-by: Sage Weil <sage@redhat.com>
2018-07-31 14:08:53 -05:00
Sage Weil
7ab8675fdf doc/rados/operations/health-checks: document DEVICE_HEALTH* messages
Signed-off-by: Sage Weil <sage@redhat.com>
2018-07-31 14:08:53 -05:00
Sage Weil
ccdfcc7e72 mgr/devicehealth: fix style for returns
Signed-off-by: Sage Weil <sage@redhat.com>
2018-07-31 14:08:53 -05:00
Sage Weil
1f8662a708 mgr/devicehealth: use constants for health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2018-07-31 14:08:53 -05:00
Sage Weil
b23295dbb9 mgr/devicehealth: deal with as many daemons as we can until limit
Process as many OSDs as we can until we hit the min_in_ratio.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-07-31 14:08:53 -05:00
Sage Weil
4cda89c9e3 mgr/devicehealth: warn if too many daemons are expected to fail soon
Refuse to mark out *all* OSDs.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-07-31 14:08:53 -05:00
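
The two mgr/devicehealth commits above ("deal with as many daemons as we can until limit" and "warn if too many daemons are expected to fail soon") describe a selection rule: mark failing OSDs out one by one, stop once the in-ratio would drop below min_in_ratio, and never mark out every OSD. The following is a minimal Python sketch of that rule, not the module's actual code. The function name, its parameters, and the exact form of the ratio check are assumptions made for illustration; only the min_in_ratio name and the "refuse to mark out all OSDs" rule come from the commit messages.

```python
def select_osds_to_mark_out(failing_osds, in_osds, total_osds, min_in_ratio):
    """Pick the failing OSDs that may be marked out (hypothetical helper).

    failing_osds: OSD ids predicted to fail soon, in processing order
    in_osds:      set of OSD ids currently marked 'in'
    total_osds:   total number of OSDs in the cluster
    min_in_ratio: lowest allowed fraction of OSDs that must stay 'in'
    """
    marked = []
    num_in = len(in_osds)
    for osd in failing_osds:
        if osd not in in_osds:
            continue  # already 'out', nothing to do
        remaining = num_in - 1
        if remaining <= 0:
            break  # refuse to mark out *all* OSDs
        if total_osds <= 0 or remaining / total_osds < min_in_ratio:
            break  # marking this one out would violate min_in_ratio
        marked.append(osd)
        num_in = remaining
    return marked
```

In this sketch the failing OSDs that are left over once the limit is hit are not marked out; per the second commit subject, that is the situation in which a warning is raised instead.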
Sage Weil
1c9ce2fc56 mgr/devicehealth: set primary-affinity 0 for failing devices
Signed-off-by: Sage Weil <sage@redhat.com>
2018-07-31 14:08:53 -05:00
Sage Weil
cba41b6f7c msg/devicehealth: fix config options
Signed-off-by: Sage Weil <sage@redhat.com>
2018-07-31 14:08:53 -05:00
Sage Weil
abdee9f679 mgr/devicehealth: only fetch osdmap once from check_health
Signed-off-by: Sage Weil <sage@redhat.com>
2018-07-31 14:08:53 -05:00
Sage Weil
c688c81afd mgr/devicehealth: revise health messages
Signed-off-by: Sage Weil <sage@redhat.com>
2018-07-31 14:08:53 -05:00
Sage Weil
8deec7445f mgr/devicehealth: add 'device check-health' command and run periodically
Signed-off-by: Sage Weil <sage@redhat.com>
2018-07-31 14:08:53 -05:00
Sage Weil
b9d547f012 mgr/devicehealth: fix new options
Signed-off-by: Sage Weil <sage@redhat.com>
2018-07-31 14:08:53 -05:00
Yaarit Hatuka
e1552de24b mgr/devicehealth: add helpers to life_expectancy_response()
- if mark_out_threshold is met we write to log.warn instead of raising a
  health warning.
- check that OSD is 'in' before calling mark_out().
- raise a health warning in case OSD is marked 'out' but still has PGs
  attached to it.
- cast thresholds default values to string.
- add SCSI multipath support to health warning message.
- change health warning message.

Signed-off-by: Yaarit Hatuka <yaarithatuka@gmail.com>
2018-07-31 14:08:53 -05:00
Sage Weil
2b86590a66 mgr/devicehealth: simplify setting defaults
Signed-off-by: Sage Weil <sage@redhat.com>
2018-07-31 14:08:53 -05:00
Yaarit Hatuka
8e542033a1 common/blkdev remove debug statements
Signed-off-by: Yaarit Hatuka <yaarithatuka@gmail.com>
2018-07-31 14:08:53 -05:00
Sage Weil
34698a2c62 Merge PR #23334 into master
* refs/pull/23334/head:
	pybind/rados/rados: do not pass prval from stack

Reviewed-by: John Spray <john.spray@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-07-31 14:08:37 -05:00
David Zafman
9d06ab3da9
Merge pull request #23217 from dzafman/wip-25085
osd: Allow repair of an object with a bad data_digest in object_info on all replicas

Reviewed-by: Sage Weil <sage@redhat.com>
2018-07-31 15:07:22 -04:00
Neha Ojha
283b0bde4a src/osd/PG.cc: remove redundant call to trim_log()
This change is motivated by the failure tracked in
https://tracker.ceph.com/issues/25198. The failure highlights a case where a
call to trim_log() after the PG has recovered races with the previous op
on a replica OSD. Since the previous operation has not completed, the
last_complete value for that OSD is not valid when we try to trim the
log. It is also worth noting that the race is due to MOSDPGTrim going through
the strict queue as a peering message, whereas regular ops go through the
non-strict queue.

During the investigation of this bug, we noticed that, with
https://tracker.ceph.com/issues/23979, we allow pg log trimming to
happen on the primary and replicas whenever we cross the upper bound of
the pg log. This also ensures that pg log trimming happens while processing
any new op.

Therefore, the function trim_log(), which earlier served the purpose of
trimming logs on the primary and replicas just before the PG went into
the Recovered state, is no longer required. It acted as a last line of
defense to trim logs when we no longer needed them. But this call
seems redundant now, because we limit the pg log length at all times.

Signed-off-by: Neha Ojha <nojha@redhat.com>
2018-07-31 11:43:02 -07:00
Sage Weil
8e36f18cde pybind/rados/rados: do not pass prval from stack
The prval is a pointer to an int to write the final completion code of
the rados op.  This can't be on the stack since we immediately leave the
current scope after preparing the op (looong before we do the rados op).

We keep the tuple return value to avoid breaking users of this API
(devicehealth module, gnocchi at a minimum).

Fixes: http://tracker.ceph.com/issues/25175
Signed-off-by: Sage Weil <sage@redhat.com>
2018-07-31 09:41:05 -05:00
Alfredo Deza
96e7576400
Merge pull request #23348 from ceph/wip-rm24957
ceph-volume: adds test for `ceph-volume lvm list /dev/sda`

Reviewed-by: Alfredo Deza <adeza@redhat.com>
2018-07-31 09:56:05 -04:00
Andrew Schoen
ef10886f1e ceph-volume: adds a unit test for lvm list /dev/sda
This test is to prove that the issue from
http://tracker.ceph.com/issues/24957 was fixed
by http://tracker.ceph.com/issues/24784

When running lvm list against a raw device it should handle
gracefully the situation where there are multiple PVs with the
name of the given device.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
2018-07-31 08:50:28 -05:00
Andrew Schoen
37ed1be08b ceph-volume: move pvolumes fixture into conftest.py
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
2018-07-31 08:50:27 -05:00
Kefu Chai
cec5a23f69
Merge pull request #23336 from noahdesu/vstart-dashboard-no-rbd
vstart: disable dashboard when rbd not built

Reviewed-by: Kefu Chai <kchai@redhat.com>
2018-07-31 11:05:02 +08:00
Patrick Donnelly
55e60ab17d
Merge PR #23297 into master
* refs/pull/23297/head:
	ceph_volume_client: add delay for MDSMap to be distributed

Reviewed-by: Ramana Raja <rraja@redhat.com>
2018-07-30 16:11:21 -07:00
Patrick Donnelly
957bdb4abe
Merge PR #23308 into master
* refs/pull/23308/head:
	doc: s/Ceph FS/CephFS

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2018-07-30 16:06:39 -07:00
Noah Watkins
5b9dd4c8a2 vstart: disable dashboard when rbd not built
dashboard doesn't load correctly without the rbd module, which means
vstart commands that interact with dashboard fail and vstart exits.

Signed-off-by: Noah Watkins <nwatkins@redhat.com>
2018-07-30 14:50:28 -07:00
Sage Weil
1ebafdb65f
Merge pull request #23292 from yuriw/wip-yuriw-25140-master
qa/tests: added 1st draft of mimic-x suite
2018-07-30 14:55:41 -05:00
Sage Weil
c6dd193f45
Merge pull request #23302 from yuriw/wip-yuriw-crontab-master
qa/tests: added mimic-x to the schedule
2018-07-30 14:55:27 -05:00
Alfredo Deza
81df5d18c3
Merge pull request #23321 from cernceph/dvanders_enable
ceph-volume: enable the ceph-osd during lvm activation

Reviewed-by: Alfredo Deza <adeza@redhat.com>
2018-07-30 12:54:24 -04:00
Andrew Schoen
4a043de4b7
Merge pull request #23332 from alfredodeza/wip-rm25171
ceph-volume add a __release__ string, to help version-conditional calls

Reviewed-by: Andrew Schoen <aschoen@redhat.com>
2018-07-30 16:14:01 +00:00
Yuri Weinstein
baa4d0ea78
Merge pull request #23305 from smithfarm/wip-cleanup-upgrade
qa/upgrade: cleanup for nautilus

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2018-07-30 09:01:03 -07:00
Yuri Weinstein
e6f21c1aa3 qa/tests: added 1st draft of mimic-x suite
Fixes: https://tracker.ceph.com/issues/25140
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2018-07-30 08:41:18 -07:00
Alfredo Deza
5bd0c27f9d ceph-volume add a __release__ string, to help version-conditional calls
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2018-07-30 11:39:43 -04:00
Kefu Chai
df2196dbf8
Merge pull request #23276 from tchaikov/wip-config-diff-lock
common/config: fix the lock in ConfigProxy::diff()

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-07-30 22:24:54 +08:00
Kefu Chai
1bb7be365e
Merge pull request #23251 from neha-ojha/wip-25112
osd,mon: increase mon_max_pg_per_osd to 250

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Nathan Cutler <ncutler@suse.com>
2018-07-30 22:23:18 +08:00
Kefu Chai
b88596d93c
Merge pull request #23249 from liewegas/wip-mon-cx-nautilus
osd/OSDMap: fix CEPHX_V2 osd requirement to nautilus, not mimic

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2018-07-30 22:22:40 +08:00
Kefu Chai
53951a84b6
Merge pull request #23229 from rjfd/wip-dashboard-query-params-bug
mgr/dashboard: fix query parameters in task annotated endpoints

Reviewed-by: Tiago Melo <tmelo@suse.com>
2018-07-30 22:21:43 +08:00