Commit Graph

75484 Commits

Author SHA1 Message Date
Kefu Chai
9cb7e07742 Merge pull request #16311 from Songweibin/wip-update-info-affi
.mailmap, .organizationmap: Update Song Weibin information and affiliation

Reviewed-by: Abhishek Lekshmanan <abhishek@suse.com>
2017-07-13 19:19:55 +08:00
Kefu Chai
c64a883013 Merge pull request #16303 from bstillwell/releases-doc-update-201707
doc/releases: Update releases from Feb 2017 to July 2017

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-07-13 19:14:30 +08:00
Nathan Cutler
0f97cd0133 doc: add v10.2.8 changelog
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-07-13 12:18:17 +02:00
Kefu Chai
4a3cdb00c6 qa/workunits/ceph-helpers: enable experimental features for osd
it matches the settings in vstart.sh, also it would be handy for those
who are still developing on btrfs, which is now marked as an experimental
features now.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-07-13 18:08:44 +08:00
Kefu Chai
a2335091d6 qa/workunits/ceph-helpers: test wait_for_health_ok differently
0 OSDs is not an error anymore in the new health checking implemented by
OSDMap::check_health(). this case was treated as an error before, see
OSDMonitor::get_health(). but an osdmap without any OSD is fine, i
think. but an osdmap with 3 OSDs, but all of them are down and out, this
is an error. and we do report this as an error. so, let's update the
test instead.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-07-13 17:49:44 +08:00
Kefu Chai
ffda75baca Merge pull request #16277 from smithfarm/wip-rn-script
tools: ceph-release-notes: handle an edge case

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-07-13 16:48:52 +08:00
Nathan Cutler
df3fe15cd2 doc: release notes: note MDS regression in 10.2.8
See the discussion in https://github.com/ceph/ceph/pull/16192

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-07-13 10:47:45 +02:00
Mykola Golub
0c169bbdbc Merge pull request #16264 from dillaman/wip-20571
rbd-mirror: ignore permission errors on rbd_mirroring object

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
2017-07-13 10:57:42 +03:00
Yan, Zheng
e4844706b0 qa/cephfs: don't use int() to convert string of float point number
Fixes: http://tracker.ceph.com/issues/20582
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2017-07-13 15:55:22 +08:00
fang.yuxiang
37369944f1 rgw: fix err when copy object in bucket with specified placement rule
Fixes: http://tracker.ceph.com/issues/20378

Signed-off-by: fang yuxiang <fang.yuxiang@eisoo.com>
2017-07-13 14:30:05 +08:00
songweibin
20bacee659 .mailmap, .organizationmap: Update Song Weibin information and affiliation
Signed-off-by: songweibin <song.weibin@zte.com.cn>
2017-07-13 11:42:17 +08:00
Matt Benjamin
d78fffa416 rgw_file: add service map registration
Add static service map registration for librgw/NFS.  In this
verision registration is unconditional (e.g., unit tests would
register) and, in addition, since there is no API change, we
don't know anything about the upper-layer client.

Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
2017-07-12 20:30:00 -04:00
Sage Weil
4fcfb8ca9b qa/suites/rados/singleton/all/reg11184: whitelist health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 18:39:24 -04:00
Sage Weil
43622b17fb mon/MDSMonitor: fix segv when multiple MDSs raise same alert
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 17:44:02 -04:00
Bryan Stillwell
246a8c083d Update releases from Feb 2017 to July 2017
Pull in the latest releases from the past 5 months and fix some of the
links so they jump to the correct sections in the release notes.

Signed-off-by: Bryan Stillwell <bstillwell@godaddy.com>
2017-07-12 15:38:59 -06:00
Sage Weil
1af3f3e3b3 Merge pull request #16301 from dmick/master
mgr: increase debug level for ticks 0 -> 10
2017-07-12 14:55:56 -05:00
Dan Mick
617c96014b Mgr: increase debug level for ticks 0 -> 10
Signed-off-by: Dan Mick <dmick@redhat.com>
2017-07-12 12:51:43 -07:00
Sage Weil
b844dfb342 mon/MonCommands: mark 'pg set_*_ratio' deprecated
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 15:09:00 -04:00
Sage Weil
98a6c7afbb Merge pull request #16263 from liupan1111/wip-fix-fio
test/fio: remove experimental option for bluestore & rocksdb.

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2017-07-12 13:21:17 -05:00
Sage Weil
8859627fa5 Merge pull request #15643 from liewegas/wip-health
mon: revamp health check/warning system
2017-07-12 12:10:47 -05:00
Sage Weil
5e2e7088af mon/PGMap: adjust scrub checks to avoid overflow for future stamps
Avoid an overflow (and false warning) when scrub stamps are in the future.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
7e48efb061 qa/workunits/cephtool/test.sh: adjust full tests to avoid races
OSDs may report fullness in any order.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
fd39700d55 qa/tasks/ceph: wait for osds to come up before creating pool
Avoid health warnings.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
25717f7e84 qa/tasks/ceph_test_case.py: update health check helpers
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
bf6c075b7e qa/suites/fs: whitelist health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
8d711a5659 qa/suites/rgw/thrash: whitelist
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
3d268d6e83 qa/suites/rbd: whitelist health messages
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
93de19adcf qa: whitelist health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
7302b49723 qa/workunits/cephtool/test.sh: adjust for new health error codes
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
Sage Weil
9f91df6bed mon/MgrMonitor: clear last_beacon after mon election
The last_beacon map is local to an election interval; if there is a new
election completed we should reset it or else we may kill an apparently
laggy mgr that hasn't been able to get a beacon processed due to the mon
quorum changing, or had its beacon processed on a different leader.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
067215152c mon: clean up osd out messages
Cleaner prose for the auto-out case, and add
a cluster log message for OSDs that go out
at the behest of the administrator.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
a89ce8e3c4 osd: don't log per-PG backfill messages at INFO level
This behaviour led to way too many messages going to
the cluster log when an OSD is marked in.  Retain
the messages at debug level.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
86365823fe mon: simplify PG health checks
Instead of a distinct health check for each possible
PG state, group the states into categories for availability,
degraded, damage, and report on that.

That way, while a PG/pool is suffering from one of those
bad PG states, health conditions don't keep toggling on and
off as we transition from one unavailable state to another
unavailable state.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
397d3b9350 mgr/dashboard: update for new style health checks
Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
a14782e179 mon: demote cluster map prints to DEBUG level
The PaxosService subclasses should be writing out
informative log messages, and not relying on
a stream of map summary prints to communicate
changes.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
9ef298eabe mon: prettify health check log messages
Add a "Cluster is now healthy" to give clarity
after a series of "health check cleared" that
they were the last ones.

Convert certain health check messages into
well formed sentences.

Don't print severity in the log string (it's already
expressed in the severity of the log entry.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
d260368e5b mgr: fix spurious PG health messages on mgr restart
Previously, the mgr would send MMonMgrReport indicating
a very unhappy PGMap to the mon right after startup.

This is a change to hold off on sending that report until
all the OSDs have reported in, or until some time has passed.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:03 -04:00
John Spray
0c8a1d3940 mon: don't consider a starting mgr to be an error
The .available flag is there to tell MgrClients whether
to try and connect -- it isn't the right condition
for a health complaint.

Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:02 -04:00
John Spray
a2f02ac073 mon: pass new style health to mgr
Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
f70316ecbc mon: prefix periodic health reminder with 'overall'
...so we can whitelist it.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
09957d15c7 mon/PGMap: rename a few health checks
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
8d1d6a0650 mgr/DaemonServer: debug log health checks
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
a8e07b33f7 mon/MgrStatMonitor: show health check count on receipt
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
76bdda85ed messages/MMonMgrReport: show health check count
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
78b33655ad mon/PGMap: only warn about too few pgs after >0 pools exist
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
3b01be0d4b mon/PGMap: some stuck warnings are err, some warn
inactive and stale -> error
degraded, unclean, undersized -> warning

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
63f97ddcf6 qa/suites/rados: whitelist health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
e12afb714e mon/PGMap: do not warn about recovering, peering, stale
Wait for stuck before complaining.  These aren't scary in and of
themselves.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
b25ddafa16 qa/tasks/mon_clock_skew_check: vastly simplify
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00
Sage Weil
db0b2f6ede mon: shorten 'stuck' threshold from 5m -> 1m
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-12 12:52:02 -04:00