John Spray
397d3b9350
mgr/dashboard: update for new style health checks
...
Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
a14782e179
mon: demote cluster map prints to DEBUG level
...
The PaxosService subclasses should be writing out
informative log messages, and not relying on
a stream of map summary prints to communicate
changes.
Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
9ef298eabe
mon: prettify health check log messages
...
Add a "Cluster is now healthy" to give clarity
after a series of "health check cleared" that
they were the last ones.
Convert certain health check messages into
well formed sentences.
Don't print severity in the log string (it's already
expressed in the severity of the log entry.
Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
d260368e5b
mgr: fix spurious PG health messages on mgr restart
...
Previously, the mgr would send MMonMgrReport indicating
a very unhappy PGMap to the mon right after startup.
This is a change to hold off on sending that report until
all the OSDs have reported in, or until some time has passed.
Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
0c8a1d3940
mon: don't consider a starting mgr to be an error
...
The .available flag is there to tell MgrClients whether
to try and connect -- it isn't the right condition
for a health complaint.
Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:02 -04:00
John Spray
a2f02ac073
mon: pass new style health to mgr
...
Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
f70316ecbc
mon: prefix periodic health reminder with 'overall'
...
...so we can whitelist it.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
09957d15c7
mon/PGMap: rename a few health checks
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
8d1d6a0650
mgr/DaemonServer: debug log health checks
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
a8e07b33f7
mon/MgrStatMonitor: show health check count on receipt
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
76bdda85ed
messages/MMonMgrReport: show health check count
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
78b33655ad
mon/PGMap: only warn about too few pgs after >0 pools exist
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
3b01be0d4b
mon/PGMap: some stuck warnings are err, some warn
...
inactive and stale -> error
degraded, unclean, undersized -> warning
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
63f97ddcf6
qa/suites/rados: whitelist health warnings
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
e12afb714e
mon/PGMap: do not warn about recovering, peering, stale
...
Wait for stuck before complaining. These aren't scary in and of
themselves.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
b25ddafa16
qa/tasks/mon_clock_skew_check: vastly simplify
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
db0b2f6ede
mon: shorten 'stuck' threshold from 5m -> 1m
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
bfe4c08429
osd/OSDMap: add per-osd flag OSD_FLAGS check
...
rename OSD_FLAGS to OSDMAP_FLAGS
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
a23b7d7ee8
osd/OSDMap: rename a few health checks
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
6a9924270b
mon: move osd health checks into OSDMap method
...
...with one check moving into HealthMonitor where it
belongs.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
83aaa75991
qa/tasks/ceph: stop logging health on shutdown
...
Don't log health during actual teardown or we'll see
various scary messages unrelated to our test run.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
b9cdb9fa7b
mon/MgrMonitor: do not issue MGR_DOWN on new cluster
...
It is normal for the initial cluster to lack a mgr. Wait for some
grace period before complaining about a missing mgr.
Default to 30m.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
57a8747f1d
mon/Monitor: periodically log new-style health warnings to log
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
69ebbebb95
mon/MDSMonitor: implement new-style cephfs health checks
...
Our detail elements are still strings, so we keep the bit that collapses
the metadata into a string and appends it to the string.
Each MDS-generated item becomes a detail record.
Health checks are consolidated either by MDS_ or FS_, counting the
number of mds servers or file systems affected.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
ad7a4c803f
mon,mgr: pass new-style health checks from mgr's PGMap
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:01 -04:00
Sage Weil
1bbd3c48e9
mon/PGMap: implement new-style health checks
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Sage Weil
24a1636302
mon/OSDMonitor: implement new health checks
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Sage Weil
8e815abe4e
mon: new health check framework
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Sage Weil
0b59a7f8ad
mon: HealthMonitor -> OldHealthMonitor
...
This will go away post-luminous.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Sage Weil
e170405fd8
mon: remove Formatter arg to QuorumService::get_health()
...
This is used to dump extra weirdness to the health detail structured
output, but we are about to remove all of that in luminous.
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Sage Weil
6068e1856d
mon/Monitor: move time sync status from 'ceph health' to 'time-sync-status' command
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Sage Weil
349fb86ee2
mon: move health_status_t to health.h
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Sage Weil
32d1fd3f7e
mon: move pool quota health warnings into PGMap
...
This puts them on the mgr in a context where they can respond to both
osdmap and pgmap updates, and removes one more dependency on pgservice
in osdmon.
We keep the pool quota FULL warning since that is a function of OSDMap
state (the pool flag... not actual usage).
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:51:30 -04:00
Yuri Weinstein
b932d7f883
Merge pull request #15630 from Abhishekvrshny/fix-rgw-multipart-leaks
...
rgw: fix leaks with incomplete multiparts
Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-07-12 08:56:04 -07:00
Yuri Weinstein
a20dd80142
Merge pull request #15669 from rzarzynski/wip-rgw-fix-rgwbucket-error-handling
...
rgw: fix error handling in the link() method of RGWBucket.
Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-07-12 08:55:28 -07:00
Yuri Weinstein
ed075b05ff
Merge pull request #15834 from prallabh/wip-20377
...
rgw: Do not fetch bucket stats by default upon bucket listing
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2017-07-12 08:52:24 -07:00
Yuri Weinstein
4d030daab7
Merge pull request #15887 from rzarzynski/wip-rgw-20394
...
rgw: /info claims we do support Swift's accounts ACLs.
Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-07-12 08:49:39 -07:00
Yuri Weinstein
dba56d62d4
Merge pull request #16154 from mikulely/obj-placement-pool
...
rgw: drop unused rgw_pool parameter, local variables and member variable
Reviewed-by: Jos Collin <jcollin@redhat.com>
2017-07-12 08:48:49 -07:00
Yuri Weinstein
bcd437435f
Merge pull request #14961 from yehudasa/wip-19739
...
rgw: verify md5 in post obj
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
2017-07-12 08:45:41 -07:00
Yuri Weinstein
39ad2b8ac3
Merge pull request #15281 from jimifm/wip-rgw_rados-master
...
rgw/rgw_rados: Remove duplicate calls in RGWRados::finalize()
Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-07-12 08:44:18 -07:00
Sage Weil
808bd2ca93
Merge pull request #16287 from liewegas/wip-try-decode
...
mon/MgrStatMonitor: do not crash on luminous dev version upgrades
Reviewed-by: Yan, Zheng" <zyan@redhat.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
2017-07-12 09:34:20 -05:00
Casey Bodley
86d985f455
Merge pull request #16197 from shashalu/fix-RGWRadosRemoveOmapKeysCR
...
rgw:multisite: fix RGWRadosRemoveOmapKeysCR and change cn to intrusive_ptr
Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-07-12 10:11:14 -04:00
Sage Weil
36698ad399
Merge pull request #15843 from ovh/bp-drop-ping-fields
...
messages/MOSDPing.h: drop unused fields
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2017-07-12 08:33:32 -05:00
Casey Bodley
850f4bb998
Merge pull request #15929 from cbodley/wip-vstart-rgw-compression
...
vstart: add --rgw_compression to set rgw compression plugin
Reviewed-by: Matt Benjamin <mbenjami@redhat.com>
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
2017-07-12 08:40:10 -04:00
Sage Weil
5ec27fbcbf
mon/MgrStatMonitor: do not crash on luninous dev version upgrades
...
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 07:30:53 -04:00
Jos Collin
82e85a6d86
Merge pull request #16254 from TsaiJin/wip-rbd-nbd-version
...
tools/rbd_nbd: add --version show support
Reviewed-by: Jos Collin <jcollin@redhat.com>
2017-07-12 08:56:44 +00:00
lu.shasha
0faae7e890
rgw:multisite: change raw 'RGWAioCompletionNotifier*' to intrusive_ptr for rgw_cr_rados
...
Signed-off-by: Shasha Lu <lu.shasha@eisoo.com>
2017-07-12 13:50:02 +08:00
Nathan Cutler
7550bde5da
Merge pull request #16074 from smithfarm/wip-obsolete-libcephfs1
...
build/ops: rpm: obsolete libcephfs1
Reviewed-by: Ken Dreyer <kdreyer@redhat.com>
Reviewed-by: Boris Ranto <branto@redhat.com>
2017-07-12 05:15:02 +02:00
Alfredo Deza
7b4553291b
Merge pull request #16224 from liewegas/wip-docs-prune
...
doc: update intro, quick start docs
Reviewed-by: Alfredo Deza <adeza@redhat.com>
2017-07-11 16:44:30 -04:00
Casey Bodley
dfa1389615
vstart: add --rgw_compression to set rgw compression plugin
...
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2017-07-11 15:33:56 -04:00