Commit Graph

1051 Commits

Author SHA1 Message Date
David Zafman
444aa9f9fe osd, mon: New pool recovery priority range -10 to 10
Use OSD_POOL_PRIORITY_MAX and OSD_POOL_PRIORITY_MIN constants
Scale legacy priorities if exceeds maximum

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-04-25 13:53:27 -07:00
Changcheng Liu
c0df98fc7e doc: fix parameter to set pg autoscale mode
osd_pool_default_pg_autoscale_mode is the right parameter to
set placement-group autoscale mode.

Signed-off-by: Changcheng Liu <changcheng.liu@intel.com>
2019-04-08 10:40:47 +08:00
Vangelis Tasoulas
24131fc59a
doc: Update documentation for the MANY_OBJECTS_PER_PG warning
The current documentation for the MANY_OBJECTS_PER_PG warning
states that The threshold can be raised to silence the health
warning by adjusting the mon_pg_warn_max_object_skew config
option on the monitors. It seems that this is not true (at least)
since the luminous times, and this option should be adjusted on
the managers.

I encountered this problem and I spend quite sometime injecting
the mon_pg_warn_max_object_skew to the monitors, added the option
ceph.conf and restarted the monitors several times but the warning
was not going away. I had to download the code to see what's
happening and I found out this:

$ git grep -A 3 mon_pg_warn_max_object_skew src/common/options.cc
src/common/options.cc:1480:    Option("mon_pg_warn_max_object_skew", Option::TYPE_FLOAT, Option::LEVEL_ADVANCED)
src/common/options.cc-1481-    .set_default(10.0)
src/common/options.cc-1482-    .set_description("max skew few average in objects per pg")
src/common/options.cc-1483-    .add_service("mgr"),

After I restarted the ceph-mgr service, the warning went away.

Signed-off-by: Vangelis Tasoulas <vangelis@tasoulas.net>
2019-04-05 19:53:35 +02:00
Vanush "Misha" Paturyan
3d935c3c53 doc/rados/configuration/mon-lookup-dns: fix typo
Signed-off-by: Vanush "Misha" Paturyan <ektich@gmail.com>
2019-04-04 12:37:54 +01:00
Sage Weil
242ef7824d doc/rados/operations: document BLUEFS_SPILLOVER
Signed-off-by: Sage Weil <sage@redhat.com>
2019-04-02 11:13:31 -05:00
Jason Dillaman
41d3fdc554
Merge pull request #27074 from LenzGr/master-documentation
doc: Updated dashboard iSCSI configuration, added labels

Reviewed-by: Ricardo Marques <rimarques@suse.com>
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2019-04-01 09:20:24 -04:00
Lenz Grimmer
71e46887d4 doc: Updated dashboard iSCSI configuration, added labels
Added note about the requirement for the latest ceph-iscsi version
3 to the dashboard documentation. Added some doc references
and replaced some URLs in the iSCSI docs with reST labels instead.

Signed-off-by: Lenz Grimmer <lgrimmer@suse.com>
2019-03-31 13:32:15 -05:00
Casey Bodley
9e949fcd5c
Merge pull request #27243 from theanalyst/doc-scheduler
config-ref: add a note on current scheduler settings.

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>
2019-03-28 14:23:11 -04:00
Abhishek Lekshmanan
909b8ef4bc docs: rgw: add a x-ref to rados dmclock docs
Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>
2019-03-28 17:04:31 +01:00
David Zafman
769cdc8294 doc: Document new pg state and changes to auto repair behavior
Fixes: http://tracker.ceph.com/issues/38616

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-03-25 16:03:36 -07:00
David Zafman
9fd4b062f1 doc: Fix the pg states and auto repair config options
Fixes: http://tracker.ceph.com/issues/38896

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-03-22 19:58:00 -07:00
Kefu Chai
32df73f9f2
Merge pull request #26940 from xiexingguo/wip-monc-add-con
mon/MonClient: weight-based mon selection

Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-03-22 15:43:04 +08:00
Danny Al-Gaaf
c9441c2916 doc: fix LRC documentation
Recovery from a failure in jerasure need only k reads and
not k+m-1.

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
2019-03-22 01:37:56 +01:00
Kefu Chai
ee5eab81e1
Merge pull request #26934 from sebastian-philipp/doc-rados-mon_command
doc/rados/api/python: Add documentation for mon_command

Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-03-20 18:27:47 +08:00
xie xingguo
1ba6b267db doc/mon-lookup-dns: update "mon weight" related changes
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2019-03-20 12:17:00 +08:00
Sebastian Wagner
315bc1a9ee doc/rados/api/python: Add documentation for mon_command
Co-authored-by: Nathan Cutler <ncutler@suse.com>
Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
2019-03-18 11:55:56 +01:00
Sage Weil
dac96a4c0e doc/releases/nautilus: add reference to msgr2 config update section
Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-18 03:14:24 -05:00
Sage Weil
c529925e2f doc/releases/nautilus: final upgrade note updates
Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-17 05:29:27 -05:00
Kefu Chai
9f2f403553 doc/rados/operations: add clay to erasure-code-profile
so it's more visible.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2019-03-12 13:19:05 +08:00
Sage Weil
937f28e6a6 doc/releases/nautilus: add msgr2 refs
Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-11 10:31:49 -05:00
Sage Weil
60fab64f4c doc/rados/configuration/msgr2: some documentation about msgr2
This doesn't integrate very well into network-config.rst, mostly because
that document is horribly out of data and I don't know where to start.
:(

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-09 19:10:21 -06:00
Xie Xingguo
46189eaa64
Merge pull request #26705 from dzafman/wip-23999
Improve docs osd_recovery_priority, osd_recovery_op_priority and related

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
2019-03-02 09:43:33 +08:00
David Zafman
f7bec341da common, doc: Improve docs osd_recovery_priority, osd_recovery_op_priority and related
Add option desciptions for osd_recovery_priority and osd_recovery_op_priority

Fixes: https://tracker.ceph.com/issues/23999

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-03-01 13:55:35 -08:00
David Zafman
b1efd43096 doc: Update some of the priority item descriptions
Signed-off-by: David Zafman <dzafman@redhat.com>
2019-02-28 12:34:53 -08:00
David Zafman
992c690772 doc: Minor fixes to monitoring-osd-pg.rst
Signed-off-by: David Zafman <dzafman@redhat.com>
2019-02-28 12:34:53 -08:00
David Zafman
dee162039c doc: Remove osd disk thread items that no longer exist
Caused by: 35a4b5072f

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-02-28 12:34:53 -08:00
Changcheng Liu
0da1f3540c doc: change ruleset to id in crush map file
ruleset is not used after merging below patch
   commit f9a095deb1
       crush: s/ruleset/id/ in decompiled output
       Moving away from the 'ruleset' terminology.

Signed-off-by: Changcheng Liu <changcheng.liu@intel.com>
2019-02-27 11:47:44 +08:00
David Zafman
ce975581a6
Merge pull request #26522 from ashishkumsingh/wip-doc-38310
doc: Fix incorrect mention of 'osd_deep_mon_scrub_interval'

Reviewed-by: David Zafman <dzafman@redhat.com>
2019-02-25 08:20:49 -08:00
Xie Xingguo
be26cb08e5
Merge pull request #26560 from xiexingguo/wip-pool-force-recovery
mgr: add per pool force-recovery/backfill commands

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: David Zafman <dzafman@redhat.com>
2019-02-23 15:30:14 +08:00
xie xingguo
fa98f09674 mgr: add per pool force-recovery/backfill commands
For those with multiple storage pools sharing the same devices,
I think it would make much more sense to offer per-pool
commands to bring pools with high priority, e.g., because they
are hosting data of more importance than others, back to normal
quickly.

Fixes: http://tracker.ceph.com/issues/38456
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2019-02-23 12:18:24 +08:00
Kai Wagner
1bf96668a5 doc: Remove value 'mon_osd_max_split_count' from the docs
Signed-off-by: Kai Wagner <kwagner@suse.com>
2019-02-22 11:16:34 +01:00
Jason Dillaman
3322798a36 doc/librados: explicitly mention that the C++ API is not stable
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2019-02-21 08:28:43 -05:00
Ashish Singh
7108e6a3c7 doc: Fix incorrect mention of 'osd_deep_mon_scrub_interval'
Fixed the incorrect mention of 'osd_deep_mon_scrub_interval' in health-checks.rst.
Changed it to 'osd_deep_scrub_interval'.

Fixes: https://tracker.ceph.com/issues/38310

Signed-off-by: Ashish Singh <assingh@redhat.com>
2019-02-21 12:10:41 +05:30
Alfredo Deza
7324744712
Merge pull request #26471 from tnaganawa/patch-3
ceph-deploy would not support --cluster option anymore

Reviewed-by: Alfredo Deza <adeza@redhat.com>
2019-02-20 12:23:38 -05:00
Sage Weil
618c0ea5da Merge PR #26417 into master
* refs/pull/26417/head:
	qa, doc: update ec fast read setting
	qa/suites: remove mon kv backend options

Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-02-20 07:03:56 -06:00
Tatsuya Naganawa
e9dd52eb74
ceph-deploy would not support --cluster option anymore
Since ceph-deploy would not support --cluster option anymore, section in this doc could be removed
Signed-off-by: Tatsuya Naganawa <tatsuyan201101@gmail.com>
2019-02-18 02:25:05 +09:00
Drunkard Zhang
016da6d892 doc: typo fixes, s/Requered/Required/
Signed-off-by: Drunkard Zhang <gongfan193@gmail.com>
2019-02-13 23:04:40 +08:00
Josh Durgin
682def2e32 qa, doc: update ec fast read setting
31dd620883 renamed it but did not update the suites or docs

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2019-02-08 17:11:34 -05:00
Neha Ojha
4378557c04
Merge pull request #25575 from ThoreKr/doc-pool-stats-master
doc: Add mention of ceph osd pool stats

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2019-02-06 15:10:11 -08:00
James McClune
4b077dd2e5 doc: fixed typo in add-or-rm-mons.rst
Redundant use of the word "monitor"

Signed-off-by: James McClune <jmcclune@mcclunetechnologies.net>
2019-02-01 17:12:24 -05:00
David Zafman
3e6ff119e2
Merge pull request #25112 from dzafman/wip-scrub-warning
scrub warning check incorrectly uses mon scrub interval

Reviewed-by: Gregory Farnum <gfarnum@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-01-28 10:46:18 -08:00
Kefu Chai
fa6a69288c
Merge pull request #26099 from jdurgin/wip-remove-ms-async-affinity
common/options: remove unused ms async affinity options

Reviewed-by: Sage Weil <sage@redhat.com>
2019-01-27 21:41:10 +08:00
Ilya Dryomov
ae88c63d04 doc/rados/configuration: refresh osdmap section
"osd map cache size" and "osd map message max" were reduced in commit
855955e58e ("osd: reduce size of osdmap cache, messages").

"osd map cache bl size" and "osd map cache bl inc size" were removed
six years ago.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2019-01-24 21:16:30 +01:00
David Zafman
6a9895b97a mon: Fix scrub health warning handling and change config to a ratio
Make this mon_warn code clearer since it involves 2 values
Code used mon scrub interval instead of pg scrub interval
Rename config values to include _pg_ and ratio to make it more clear
Fix scrub warniing handling use per-pool intervals when specified

Fixes: http://tracker.ceph.com/issues/37264

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-01-23 16:49:33 -08:00
Josh Durgin
62ac23bfdc common/options: remove unused ms async affinity options
These were never implemented. They can be added back if they are
implemented and shown to help performance.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2019-01-23 11:17:41 -05:00
Sage Weil
b5e5ee6f40 Merge PR #25849 into master
* refs/pull/25849/head:
	qa/suites/rados/upgrade: one mon per node, and enable-msgr2 at end
	qa/rados/thrash-old-clients: avoid msgr2
	mon: make bootstrap rank check more robust
	mon: clean up probe debug output a bit
	msg/async: use v1 for v1 <-> [v2,v1] peers
	msg/async/AsyncMessenger: drop single-use _send_to
	mon/HealthMonitor: raise MON_MSGR2_NOT_ENABLED if mons not bound to msgr2
	doc/rados/operations/health-checks: document MON_* health warnings
	mon/MonMapMonitor: add 'mon enable-msgr2' command
	mon: respawn if rank addr changes
	mon/MonMap: calc_addr_mons() after setting rank addrvec

Reviewed-by: Ricardo Dias <rdias@suse.com>
2019-01-17 11:04:30 -06:00
Sage Weil
6ba8db68cd mon/HealthMonitor: raise MON_MSGR2_NOT_ENABLED if mons not bound to msgr2
If the ms_bind_msgr2 option is enabled, and all mons are nautilus,
raise a health alert if any mons aren't bound to msgr2 addresses.

Whitelist tests that mon_bind_addrvec=false or mon_bind_msgr2=false.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-01-15 10:42:29 -06:00
Sage Weil
57c4795c00 doc/rados/operations/health-checks: document MON_* health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2019-01-15 10:42:29 -06:00
James McClune
c44d31167a doc: fixed typo in CRUSH map docs
Fixes: https://tracker.ceph.com/issues/24924
Signed-off-by: James McClune <jmcclune@mcclunetechnologies.net>
2019-01-14 22:16:20 -05:00
Sage Weil
94620be57c Merge PR #25273 into master
* refs/pull/25273/head:
	doc/rados/operations/health-checks: Add LARGE_OMAP_OBJECTS

Reviewed-by: Sage Weil <sage@redhat.com>
2019-01-12 05:56:41 -06:00