Commit Graph

75177 Commits

Author SHA1 Message Date
John Spray
397d3b9350 mgr/dashboard: update for new style health checks
Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
a14782e179 mon: demote cluster map prints to DEBUG level
The PaxosService subclasses should be writing out
informative log messages, and not relying on
a stream of map summary prints to communicate
changes.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
9ef298eabe mon: prettify health check log messages
Add a "Cluster is now healthy" message so that, after a
series of "health check cleared" messages, it is clear
that they were the last ones.

Convert certain health check messages into
well-formed sentences.

Don't print severity in the log string (it's already
expressed in the severity of the log entry).

Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
d260368e5b mgr: fix spurious PG health messages on mgr restart
Previously, the mgr would send MMonMgrReport indicating
a very unhappy PGMap to the mon right after startup.

This is a change to hold off on sending that report until
all the OSDs have reported in, or until some time has passed.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
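
The hold-off described in d260368e5b can be sketched roughly as below. This is only an illustration of the "all OSDs have reported, or some time has passed" condition, not the mgr's actual code; `ReportGate`, its method names, and the injected clock are hypothetical.

```python
import time


class ReportGate:
    """Hypothetical helper: decides when a first PG report may be sent."""

    def __init__(self, expected_osds, grace_seconds, now=time.monotonic):
        self.expected = set(expected_osds)  # OSD ids we are waiting for
        self.reported = set()               # OSD ids seen so far
        self.grace = grace_seconds          # fallback timeout (assumed value)
        self.now = now                      # injectable clock, eases testing
        self.started = now()

    def osd_reported(self, osd_id):
        """Record that an OSD has sent its stats."""
        self.reported.add(osd_id)

    def ready(self):
        """True once every expected OSD has reported or the grace expired."""
        all_in = self.expected <= self.reported
        timed_out = self.now() - self.started >= self.grace
        return all_in or timed_out
```

A caller would poll `ready()` before sending the report and skip the send while it returns `False`.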
John Spray
0c8a1d3940 mon: don't consider a starting mgr to be an error
The .available flag is there to tell MgrClients whether
to try and connect -- it isn't the right condition
for a health complaint.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:02 -04:00
John Spray
a2f02ac073 mon: pass new style health to mgr
Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
f70316ecbc mon: prefix periodic health reminder with 'overall'
...so we can whitelist it.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
09957d15c7 mon/PGMap: rename a few health checks
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
8d1d6a0650 mgr/DaemonServer: debug log health checks
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
a8e07b33f7 mon/MgrStatMonitor: show health check count on receipt
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
76bdda85ed messages/MMonMgrReport: show health check count
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
78b33655ad mon/PGMap: only warn about too few pgs after >0 pools exist
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
3b01be0d4b mon/PGMap: some stuck warnings are err, some warn
inactive and stale -> error
degraded, unclean, undersized -> warning

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
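
The severity split in 3b01be0d4b amounts to a small lookup table. The sketch below is illustrative only: the state-to-severity mapping comes from the commit message, while `STUCK_SEVERITY` and `worst_severity` are hypothetical names, not identifiers from PGMap.

```python
# Ordering used to pick the most severe status.
_RANK = {"HEALTH_OK": 0, "HEALTH_WARN": 1, "HEALTH_ERR": 2}

# Mapping taken from the commit message: inactive and stale are errors,
# degraded, unclean and undersized are warnings.
STUCK_SEVERITY = {
    "inactive": "HEALTH_ERR",
    "stale": "HEALTH_ERR",
    "degraded": "HEALTH_WARN",
    "unclean": "HEALTH_WARN",
    "undersized": "HEALTH_WARN",
}


def worst_severity(stuck_states):
    """Return the most severe status implied by the given stuck states."""
    worst = "HEALTH_OK"
    for state in stuck_states:
        severity = STUCK_SEVERITY[state]
        if _RANK[severity] > _RANK[worst]:
            worst = severity
    return worst
```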
Sage Weil
63f97ddcf6 qa/suites/rados: whitelist health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
e12afb714e mon/PGMap: do not warn about recovering, peering, stale
Wait for stuck before complaining.  These aren't scary in and of
themselves.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
b25ddafa16 qa/tasks/mon_clock_skew_check: vastly simplify
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
db0b2f6ede mon: shorten 'stuck' threshold from 5m -> 1m
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
bfe4c08429 osd/OSDMap: add per-osd flag OSD_FLAGS check
rename OSD_FLAGS to OSDMAP_FLAGS

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
a23b7d7ee8 osd/OSDMap: rename a few health checks
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
6a9924270b mon: move osd health checks into OSDMap method
...with one check moving into HealthMonitor where it
belongs.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
83aaa75991 qa/tasks/ceph: stop logging health on shutdown
Don't log health during actual teardown or we'll see
various scary messages unrelated to our test run.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
b9cdb9fa7b mon/MgrMonitor: do not issue MGR_DOWN on new cluster
It is normal for the initial cluster to lack a mgr.  Wait for some
grace period before complaining about a missing mgr.

Default to 30m.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
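
The grace period in b9cdb9fa7b could look roughly like the following for a cluster that has not yet had a mgr. Only the 30-minute default is taken from the commit message; the function name, its parameters, and the use of cluster age as the reference point are assumptions made for illustration.

```python
DEFAULT_GRACE_SECONDS = 30 * 60  # "Default to 30m." per the commit message


def should_warn_mgr_down(has_mgr, cluster_age_seconds,
                         grace_seconds=DEFAULT_GRACE_SECONDS):
    """Hypothetical check for a new cluster that has never had a mgr."""
    if has_mgr:
        return False  # a mgr is present, nothing to complain about
    # No mgr yet: stay quiet until the grace period has elapsed.
    return cluster_age_seconds >= grace_seconds
```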
Sage Weil
57a8747f1d mon/Monitor: periodically log new-style health warnings to log
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
69ebbebb95 mon/MDSMonitor: implement new-style cephfs health checks
Our detail elements are still strings, so we keep the bit that collapses
the metadata into a string and appends it to the string.

Each MDS-generated item becomes a detail record.

Health checks are consolidated either by MDS_ or FS_, counting the
number of mds servers or file systems affected.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
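
One way to picture the consolidation in 69ebbebb95: each MDS-generated item becomes a detail string, and items sharing a check code are merged into one check that counts the distinct daemons or file systems affected. The sketch below assumes a simple `(code, entity, message)` tuple as input; that shape, the function name, and the example codes are hypothetical and are not the MDSMonitor data structures.

```python
from collections import defaultdict


def consolidate(items):
    """Group (code, entity, message) items into one check per code.

    `entity` is the affected MDS or file system name (assumed input shape).
    """
    details = defaultdict(list)
    entities = defaultdict(set)
    for code, entity, message in items:
        details[code].append(f"{entity}: {message}")  # one detail per item
        entities[code].add(entity)
    return {
        code: {"count": len(entities[code]), "detail": details[code]}
        for code in details
    }


# Example with made-up check codes:
checks = consolidate([
    ("MDS_EXAMPLE", "mds.a", "first issue"),
    ("MDS_EXAMPLE", "mds.b", "second issue"),
    ("FS_EXAMPLE", "cephfs", "third issue"),
])
```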
Sage Weil
ad7a4c803f mon,mgr: pass new-style health checks from mgr's PGMap
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:01 -04:00
Sage Weil
1bbd3c48e9 mon/PGMap: implement new-style health checks
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Sage Weil
24a1636302 mon/OSDMonitor: implement new health checks
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Sage Weil
8e815abe4e mon: new health check framework
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Sage Weil
0b59a7f8ad mon: HealthMonitor -> OldHealthMonitor
This will go away post-luminous.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Sage Weil
e170405fd8 mon: remove Formatter arg to QuorumService::get_health()
This is used to dump extra weirdness to the health detail structured
output, but we are about to remove all of that in luminous.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Sage Weil
6068e1856d mon/Monitor: move time sync status from 'ceph health' to 'time-sync-status' command
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Sage Weil
349fb86ee2 mon: move health_status_t to health.h
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Sage Weil
32d1fd3f7e mon: move pool quota health warnings into PGMap
This puts them on the mgr in a context where they can respond to both
osdmap and pgmap updates, and removes one more dependency on pgservice
in osdmon.

We keep the pool quota FULL warning since that is a function of OSDMap
state (the pool flag... not actual usage).

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Yuri Weinstein
b932d7f883 Merge pull request #15630 from Abhishekvrshny/fix-rgw-multipart-leaks
rgw: fix leaks with incomplete multiparts

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-07-12 08:56:04 -07:00
Yuri Weinstein
a20dd80142 Merge pull request #15669 from rzarzynski/wip-rgw-fix-rgwbucket-error-handling
rgw: fix error handling in the link() method of RGWBucket.

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-07-12 08:55:28 -07:00
Yuri Weinstein
ed075b05ff Merge pull request #15834 from prallabh/wip-20377
rgw: Do not fetch bucket stats by default upon bucket listing

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-07-12 08:52:24 -07:00
Yuri Weinstein
4d030daab7 Merge pull request #15887 from rzarzynski/wip-rgw-20394
rgw: /info claims we do support Swift's accounts ACLs.

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-07-12 08:49:39 -07:00
Yuri Weinstein
dba56d62d4 Merge pull request #16154 from mikulely/obj-placement-pool
rgw: drop unused rgw_pool parameter, local variables and member variable

Reviewed-by: Jos Collin <jcollin@redhat.com>
2017-07-12 08:48:49 -07:00
Yuri Weinstein
bcd437435f Merge pull request #14961 from yehudasa/wip-19739
rgw: verify md5 in post obj

Reviewed-by: Orit Wasserman <owasserm@redhat.com>
2017-07-12 08:45:41 -07:00
Yuri Weinstein
39ad2b8ac3 Merge pull request #15281 from jimifm/wip-rgw_rados-master
rgw/rgw_rados: Remove duplicate calls in RGWRados::finalize()

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-07-12 08:44:18 -07:00
Sage Weil
808bd2ca93 Merge pull request #16287 from liewegas/wip-try-decode
mon/MgrStatMonitor: do not crash on luminous dev version upgrades

Reviewed-by: "Yan, Zheng" <zyan@redhat.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
2017-07-12 09:34:20 -05:00
Casey Bodley
86d985f455 Merge pull request #16197 from shashalu/fix-RGWRadosRemoveOmapKeysCR
rgw:multisite: fix RGWRadosRemoveOmapKeysCR and change cn to intrusive_ptr

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-07-12 10:11:14 -04:00
Sage Weil
36698ad399 Merge pull request #15843 from ovh/bp-drop-ping-fields
messages/MOSDPing.h: drop unused fields

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2017-07-12 08:33:32 -05:00
Casey Bodley
850f4bb998 Merge pull request #15929 from cbodley/wip-vstart-rgw-compression
vstart: add --rgw_compression to set rgw compression plugin

Reviewed-by: Matt Benjamin <mbenjami@redhat.com>
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
2017-07-12 08:40:10 -04:00
Sage Weil
5ec27fbcbf mon/MgrStatMonitor: do not crash on luminous dev version upgrades
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 07:30:53 -04:00
Jos Collin
82e85a6d86 Merge pull request #16254 from TsaiJin/wip-rbd-nbd-version
tools/rbd_nbd: add --version show support

Reviewed-by: Jos Collin <jcollin@redhat.com>
2017-07-12 08:56:44 +00:00
lu.shasha
0faae7e890 rgw:multisite: change raw 'RGWAioCompletionNotifier*' to intrusive_ptr for rgw_cr_rados
Signed-off-by: Shasha Lu <lu.shasha@eisoo.com>
2017-07-12 13:50:02 +08:00
Nathan Cutler
7550bde5da Merge pull request #16074 from smithfarm/wip-obsolete-libcephfs1
build/ops: rpm: obsolete libcephfs1

Reviewed-by: Ken Dreyer <kdreyer@redhat.com>
Reviewed-by: Boris Ranto <branto@redhat.com>
2017-07-12 05:15:02 +02:00
Alfredo Deza
7b4553291b Merge pull request #16224 from liewegas/wip-docs-prune
doc: update intro, quick start docs

Reviewed-by: Alfredo Deza <adeza@redhat.com>
2017-07-11 16:44:30 -04:00
Casey Bodley
dfa1389615 vstart: add --rgw_compression to set rgw compression plugin
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-07-11 15:33:56 -04:00