302 Commits

Author SHA1 Message Date
Neha Ojha
981babc8fa doc/rados/configuration/osd-config-ref.rst: document osd_delete_sleep
Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-06-27 17:21:58 -07:00
Sage Weil
35c0d75888 osd: add hdd and ssd variants for osd_recovery_max_active
Semi-arbitrarily set the SSD max to 10 (instead of 3).  This should be
tuned based on some real data.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-06-20 16:24:51 -05:00
xie xingguo
c0f87e0f91 osd: give recovery ops initialized by client op a higher priority
to use strict priority ordering.

The new "mclock_opclass/mclock_client" queue basically prioritizes
operations based on the class they belong to. The priority property
of an operation, if lower than a specific value (64, by default),
will get ignored and hence all operations from the same class will
be treated fairly in a FIFO fashion (but still limited by the total
IOPS or bandwidth available for the corresponding class).

To reduce the performance impact, a more general strategy would be
to enforce limits on the IOPS or bandwidth of the background
recovery (or backfill) operation class. However, that would also
block client operations that are waiting on
degraded objects which need to be recovered first.

We hereby grant recovery operations of this kind a higher priority
to force them to use strict priority ordering, which should still
be of significance once we switch to the new "mclock_opclass/mclock_client"
queue.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2019-06-11 15:46:57 +08:00
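
Editor's sketch (not part of the commit): the cutoff behaviour described in the message above can be modelled roughly as follows. This is a minimal illustration, assuming a simplified queue in which operations at or above the cutoff (64, the default quoted in the message) are served in strict priority order and everything else is served FIFO within its class. The `SketchQueue` class, its method names, and the class-iteration order are hypothetical, and the per-class IOPS/bandwidth limits of the real mclock queues are deliberately left out.

```python
import heapq
from collections import deque
from itertools import count

CUTOFF = 64  # default cutoff quoted in the commit message


class SketchQueue:
    """Toy model: strict ordering at/above the cutoff, per-class FIFO below it."""

    def __init__(self, cutoff=CUTOFF):
        self.cutoff = cutoff
        self._strict = []      # heap of (-priority, seq, op): highest priority first
        self._by_class = {}    # op class -> FIFO deque; priority is ignored here
        self._seq = count()    # tie-breaker keeps equal priorities in arrival order

    def enqueue(self, op_class, priority, op):
        if priority >= self.cutoff:
            # High-priority ops bypass the class queues entirely.
            heapq.heappush(self._strict, (-priority, next(self._seq), op))
        else:
            # Below the cutoff, the priority value plays no further role.
            self._by_class.setdefault(op_class, deque()).append(op)

    def dequeue(self):
        # Strict-priority ops are always drained first.
        if self._strict:
            return heapq.heappop(self._strict)[2]
        # Assumption: classes are visited in insertion order; a real scheduler
        # would pick the class according to its reservation/weight/limit tags.
        for fifo in self._by_class.values():
            if fifo:
                return fifo.popleft()
        return None
```

In this model, a recovery op that a client op is waiting on would be enqueued with a priority of at least 64, so it is dequeued ahead of any backlog in the class queues, which is the effect the commit message aims for.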
Vangelis Tasoulas
24131fc59a doc: Update documentation for the MANY_OBJECTS_PER_PG warning
The current documentation for the MANY_OBJECTS_PER_PG warning
states that the threshold can be raised to silence the health
warning by adjusting the mon_pg_warn_max_object_skew config
option on the monitors. This appears not to have been true since
(at least) Luminous; the option should be adjusted on
the managers.

I encountered this problem and spent quite some time injecting
mon_pg_warn_max_object_skew into the monitors, adding the option to
ceph.conf, and restarting the monitors several times, but the warning
did not go away. I had to download the code to see what was
happening, and found this:

$ git grep -A 3 mon_pg_warn_max_object_skew src/common/options.cc
src/common/options.cc:1480:    Option("mon_pg_warn_max_object_skew", Option::TYPE_FLOAT, Option::LEVEL_ADVANCED)
src/common/options.cc-1481-    .set_default(10.0)
src/common/options.cc-1482-    .set_description("max skew few average in objects per pg")
src/common/options.cc-1483-    .add_service("mgr"),

After I restarted the ceph-mgr service, the warning went away.

Signed-off-by: Vangelis Tasoulas <vangelis@tasoulas.net>
2019-04-05 19:53:35 +02:00
Vanush "Misha" Paturyan
3d935c3c53 doc/rados/configuration/mon-lookup-dns: fix typo
Signed-off-by: Vanush "Misha" Paturyan <ektich@gmail.com>
2019-04-04 12:37:54 +01:00
Jason Dillaman
41d3fdc554 Merge pull request #27074 from LenzGr/master-documentation
doc: Updated dashboard iSCSI configuration, added labels

Reviewed-by: Ricardo Marques <rimarques@suse.com>
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2019-04-01 09:20:24 -04:00
Lenz Grimmer
71e46887d4 doc: Updated dashboard iSCSI configuration, added labels
Added note about the requirement for the latest ceph-iscsi version
3 to the dashboard documentation. Added some doc references
and replaced some URLs in the iSCSI docs with reST labels instead.

Signed-off-by: Lenz Grimmer <lgrimmer@suse.com>
2019-03-31 13:32:15 -05:00
Casey Bodley
9e949fcd5c Merge pull request #27243 from theanalyst/doc-scheduler
config-ref: add a note on current scheduler settings.

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>
2019-03-28 14:23:11 -04:00
Abhishek Lekshmanan
909b8ef4bc docs: rgw: add a x-ref to rados dmclock docs
Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>
2019-03-28 17:04:31 +01:00
David Zafman
769cdc8294 doc: Document new pg state and changes to auto repair behavior
Fixes: http://tracker.ceph.com/issues/38616

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-03-25 16:03:36 -07:00
David Zafman
9fd4b062f1 doc: Fix the pg states and auto repair config options
Fixes: http://tracker.ceph.com/issues/38896

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-03-22 19:58:00 -07:00
Kefu Chai
32df73f9f2 Merge pull request #26940 from xiexingguo/wip-monc-add-con
mon/MonClient: weight-based mon selection

Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-03-22 15:43:04 +08:00
xie xingguo
1ba6b267db doc/mon-lookup-dns: update "mon weight" related changes
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2019-03-20 12:17:00 +08:00
Sage Weil
dac96a4c0e doc/releases/nautilus: add reference to msgr2 config update section
Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-18 03:14:24 -05:00
Sage Weil
937f28e6a6 doc/releases/nautilus: add msgr2 refs
Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-11 10:31:49 -05:00
Sage Weil
60fab64f4c doc/rados/configuration/msgr2: some documentation about msgr2
This doesn't integrate very well into network-config.rst, mostly because
that document is horribly out of date and I don't know where to start.
:(

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-09 19:10:21 -06:00
Xie Xingguo
46189eaa64 Merge pull request #26705 from dzafman/wip-23999
Improve docs osd_recovery_priority, osd_recovery_op_priority and related

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
2019-03-02 09:43:33 +08:00
David Zafman
f7bec341da common, doc: Improve docs osd_recovery_priority, osd_recovery_op_priority and related
Add option descriptions for osd_recovery_priority and osd_recovery_op_priority

Fixes: https://tracker.ceph.com/issues/23999

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-03-01 13:55:35 -08:00
David Zafman
b1efd43096 doc: Update some of the priority item descriptions
Signed-off-by: David Zafman <dzafman@redhat.com>
2019-02-28 12:34:53 -08:00
David Zafman
dee162039c doc: Remove osd disk thread items that no longer exist
Caused by: 35a4b5072f

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-02-28 12:34:53 -08:00
Kai Wagner
1bf96668a5 doc: Remove value 'mon_osd_max_split_count' from the docs
Signed-off-by: Kai Wagner <kwagner@suse.com>
2019-02-22 11:16:34 +01:00
Sage Weil
618c0ea5da Merge PR #26417 into master
* refs/pull/26417/head:
	qa, doc: update ec fast read setting
	qa/suites: remove mon kv backend options

Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-02-20 07:03:56 -06:00
Drunkard Zhang
016da6d892 doc: typo fixes, s/Requered/Required/
Signed-off-by: Drunkard Zhang <gongfan193@gmail.com>
2019-02-13 23:04:40 +08:00
Josh Durgin
682def2e32 qa, doc: update ec fast read setting
31dd620883 renamed it but did not update the suites or docs

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2019-02-08 17:11:34 -05:00
Kefu Chai
fa6a69288c Merge pull request #26099 from jdurgin/wip-remove-ms-async-affinity
common/options: remove unused ms async affinity options

Reviewed-by: Sage Weil <sage@redhat.com>
2019-01-27 21:41:10 +08:00
Ilya Dryomov
ae88c63d04 doc/rados/configuration: refresh osdmap section
"osd map cache size" and "osd map message max" were reduced in commit
855955e58e ("osd: reduce size of osdmap cache, messages").

"osd map cache bl size" and "osd map cache bl inc size" were removed
six years ago.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2019-01-24 21:16:30 +01:00
Josh Durgin
62ac23bfdc common/options: remove unused ms async affinity options
These were never implemented. They can be added back if they are
implemented and shown to help performance.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2019-01-23 11:17:41 -05:00
Neha Ojha
3659a8c5f6 doc/rados/configuration: add osd scrub {begin,end} week day
Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-01-11 12:55:51 -08:00
Sage Weil
251f667ef8 Merge PR #25009 into master
* refs/pull/25009/head:
	librbd: stringify locker name with get_legacy_str()
	osdc/Objecter: fix list_watchers addr rendering to match legacy
	test/crimson: disable unittest_seastar_messenger test
	msg/msg_types: encode entity_addr_t TYPE_ANY as TYPE_LEGACY for pre-nautilus
	client: make blacklist detection handle TYPE_ANY entries
	mon/OSDMonitor: maintain compat output for 'blacklist ls'
	client: maintain compat for {inst,addr}_str in status dump
	qa/tasks/ceph_manager: compare osd flush seq #'s as ints
	qa/suites/fs: make use of simple.yaml where appropriate
	qa/msgr: move msgr factet into generic re-usable dir
	crimson: fix monmap build for seastar
	doc/start/ceph.conf: trim the sample ceph.conf file
	doc/rados/operations: only describe --public-{addr,network} method for adding mons
	PendingReleaseNotes: deprecate 'mon addr'
	doc: fix some 'mon addr' references
	doc/rados/configuration: fix some 'mon addr' references
	doc/rados/configuration/network-config-ref: revise network docs somewhat
	doc/rados/configuration/network-config-ref: remove totally obsolete section
	qa/suites/rados: replace mon_seesaw.py task with a small bash script
	qa/suites/fs/upgrade: don't bind to v2 addrs
	qa/tasks/mon_thrash: avoid 'mon addr' in mon section
	mon/MonClient: disable ms_bind_msgr2 if NAUTILUS feature not set
	osd/OSDMap: maintain compat addr fields
	msg/msg_types: add get_legacy_str()
	mds/MDSMap.h: maintain compat addr field
	mon/MgrMap: maintain compat active_addr field
	mon/MonClient: reconnect to mon if it's addrvec appears to have changed
	qa/tasks/ceph.conf.template: increase mon_mgr_mkfs_grace
	msg/async/ProtocolV2: fill in IP for all peer_addrs
	msg/async: print all addrs on debug lines
	mon/MonMap: no noname- mon name prefix when for_mkfs
	ceph-monstore-tool: print initial monmap
	msg/async/ProtocolV2: advertise ourselves as a v2 addr when using v2 protocol
	msg/async: assert existing protocol matches current protocol
	msg/async: add missing modelines
	mon/MonMap: add missing modeline
	vstart.sh: put mon addrs in mon_host, not 'mon addr'
	msg/async: better debug around conn map lookups and updates
	mon/MonClient: dump initial monmap at debug level 10
	qa/standalone/osd/osd-fast-mark-down: use v1 addr w/ simplemessenger
	qa/tasks/ceph: set initial monmap features with using addrvec addrs
	monmaptool: add --enable-all-features option
	qa/tasks/ceph: only use monmaptool --addv if addr has [,:v]
	qa/tasks/ceph_manager: make get_mon_status use mon addr
	qa/tasks/ceph: keep mon addrs in ctx namespace
	mon/OSDMonitor: log all osd addrs on boot
	msg/simple: behave when v2 and v1 addrs are present at target
	mon/MonClient: warn if global_id changes
	msg/Connection: add warning/note on get_peer_global_id
	mds/MDSDaemon: clean up handle_mds_map debug output a bit
	qa/suites/rados/upgrade: debug mds
	mds/MDSRank: improve is_stale_message to handle addrvecs
	msg/async: make loopback detect when sending to one of our many addrs
	qa/suites/rados/upgrade: no aggressive pg num changes
	mon/OSDMonitor: require nautilus mons for require_osd_release=nautilus
	mon/OSDMonitor: require mimic mons for require_osd_release=mimic
	qa/suites/rados/thrash-old-clients: use legacy addr syntax in ceph.conf
	msg/async: preserve peer features when replacing a connection
	qa/tasks/ceph.py: move methods from teuthology.git into ceph.py directly; support mon bind * options
	mon/MonMap: adjust build_initial behavior for mkfs vs probe
	mon/MonMap: improve ambiguous addr behavior
	qa/suites/rados/upgrade: spread mons a bit
	qa/rados/thrash-old-clients: keep mons on separate hosts
	qa/standalone/mon/misc.sh: tweak test to be more robust
	qa/tasks/mon_seesaw: expect v1/v2 prefix in addr
	osd/OSDMap: fix is_blacklisted() check to assume type ANY
	mon/OSDMonitor: use ANY addr type for blacklisting
	mon/msg_types: TYPE_V1ORV2 -> TYPE_ANY
	qa/workunits/cephtool: fix blacklist test
	qa/suites/upgrade: install old version with only v1 addrs
	common/options: by default, bind to both msgr v1 and v2 addresses
	vstart.sh: add --msgr1, --msgr2, --msgr21 options
	msg/async/ProtocolV2: be flexible with server identity check
	msg/msg_types: fix entity_addrvec_t::parse() with null end arg
	qa/suites/rados/basic/msgr: no msgr2 addrs in initial monmaps
	qa/tasks/ceph: add 'mon_bind_addrvec' and 'mon_bind_msgr2' options
	monmaptool: add --addv argument to pass in addrvec directly
	qa/suites/rados/basic/msgr: do not use msgr2 with simplemessenger
	qa/suites/rados/basic/msgr: async is not experimental
	messages/MOSDBoot: fix compat with pre-nautilus
	mon/MonMap: allow v1 or v2 to be explicitly specified along with part
	msg/msg_types: allow parsing of IPs without assuming v1 vs v2
	msg/msg_types: default parse to v2 addrs
	msg: standarize on v1: and v2: prefixes for *all* entity_addr_t's
	vstart.sh: use msgr2 by default
	mon/MonMap: remove get_addr() methods
	ceph-mon: adjust startup/bind/join sequence to use addrs
	mon: use MonMap::get_addrs() (instead of get_addr())
	mon/MonClient: change pending_cons to addrvec-based map
	mon/MonMap: fix set_addr() caller, kill wrapper
	mon/MonMap: remove addr-based add()
	monmaptool: fix --add to do either legacy or msgr2+legacy
	monmaptool: clean up iterator use a bit
	mon/MonMap: handle ambiguous mon addrs by trying both legacy and msgr
	mon/MonMap: take addrvec for set_initial_members
	mon/MonMap: use addrvecs for test instances
	mon: pass addrvec via MMonJoin
	mon/MonmapMonitor: fix 'mon add' to populate addrvec
	mon/MonMap: addr -> addrvec
	msg/async/ProtocolV2: only update socket_addr if we learned our addr
	osd: go active even if mon only accepted our v1 addr
	test/msgr: add test for msgr2 protocol
	msg/async/ProtocolV2: share socket_addr and all addrs during handshake
	msg/async: print socket_addr for the connection
	msg/async: msgr2 protocol placeholder
	msg/async: move ProtocolV1 class to its own source file
	msg/async: keep listen addr in ServerSocket, pass to new connections
	msg/async/AsyncMessenger: fix set_addr_unknowns

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2019-01-04 13:42:09 -06:00
Sage Weil
1675cd5096 doc/rados/configuration: fix some 'mon addr' references
Signed-off-by: Sage Weil <sage@redhat.com>
2019-01-03 11:17:31 -06:00
Sage Weil
369dadea44 doc/rados/configuration/network-config-ref: revise network docs somewhat
These still could use a larger rewrite, but this is a bit better.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-01-03 11:17:31 -06:00
Sage Weil
bef7c62fe8 doc/rados/configuration/network-config-ref: remove totally obsolete section
Signed-off-by: Sage Weil <sage@redhat.com>
2019-01-03 11:17:31 -06:00
Jos Collin
a0b3562b24 doc: Fixed double spaces
Signed-off-by: Jos Collin <jcollin@redhat.com>
2018-12-26 11:24:48 +05:30
Jos Collin
c1ddc49b2d doc: Fixed Create a Cluster url
Fixes: http://tracker.ceph.com/issues/37764
Signed-off-by: Jos Collin <jcollin@redhat.com>
2018-12-26 11:24:11 +05:30
Mark Nelson
9e2d1a52ab doc/rados: Add bluestore memory autotuning docs.
Signed-off-by: Mark Nelson <mnelson@redhat.com>
2018-11-13 13:19:24 -06:00
Kefu Chai
0e5bdd9d56 librados: remove rados_tmap_*
We switched from tmap to omap long ago, but the server-side
implementation is kept around in case an ancient client is still
using these tmap APIs.

tmap_update() is also kept, because librbd uses it for v1 image
backward compatibility.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-11-07 14:05:25 -08:00
James McClune
dafdd0f29f doc: updated crush map tunables link
Updated crush map tunables link in rados/configuration/mon-config-ref.rst
Fixes: https://tracker.ceph.com/issues/36339

Signed-off-by: James McClune <jmcclune@mcclunetechnologies.net>
2018-10-07 22:13:32 -04:00
Kefu Chai
78116ce147 bluestore/NVMe: use PCIe selector as the path name
Because the latest SPDK deprecated and removed spdk_pci_get_device(),
we can no longer use the PCI device's serial number as its identifier or as
the name of the file representing the device. This change uses the
PCI device's selector instead, which SPDK also uses as
part of the transport ID.

Tested-by: Tone Zhang <tone.zhang@arm.com>
Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-09-26 17:47:38 +08:00
Kefu Chai
5ee1fd2c32 doc: fix typos
Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-09-21 12:41:42 +08:00
James McClune
f38426a766 doc: fixed hit set type link
Fixed reference link for hit set type value. Restructured wording in description.
Fixes: https://tracker.ceph.com/issues/34539

Signed-off-by: James McClune <jmcclune@mcclunetechnologies.net>
2018-09-09 21:41:08 -04:00
Kefu Chai
9bef451aa7 Merge pull request #23719 from bstillwell/config-doc-fixes
doc: Fixed spelling errors in configuration section

Reviewed-by: Kefu Chai <kchai@redhat.com>
2018-08-24 14:15:58 +08:00
Bryan Stillwell
2e98517d36 doc: Fixed spelling errors in configuration section
Correct a number of spelling mistakes and word omissions in the cluster
configuration section of the docs.

Signed-off-by: Bryan Stillwell <bstillwell@godaddy.com>
2018-08-23 16:17:08 -06:00
Bryan Stillwell
41ae1d2ecd doc: Spelling fixes in BlueStore config reference
Signed-off-by: Bryan Stillwell <bstillwell@godaddy.com>
2018-08-23 15:05:05 -06:00
Alfredo Deza
35a48728ae doc/rados update journal size defaults to 5GB
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2018-08-10 13:14:30 -04:00
Alfredo Deza
20e556b624 doc/bluestore detail the different sizing strategies for block.db based on cluster workload
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2018-08-06 15:57:56 -04:00
Alfredo Deza
2534df34b1 doc/bluestore expand on configuration with fast and slow devices
Signed-off-by: Alfredo Deza <adeza@redhat.com>
2018-08-06 15:57:56 -04:00
Sage Weil
6090545cc7 Merge PR #22433 into master
* refs/pull/22433/head:
	common/config: Add description to (near)full ratio settings

Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2018-08-06 08:57:03 -05:00
Nathan Cutler
73124c8df8 doc: cleanup: prune Argonaut-specific verbiage
Also drop all release-specific upgrading instructions (they only go up to
Firefly, anyway - none of the current releases are covered).

Note that all of this verbiage I am removing here can still be accessed
on docs.ceph.com via e.g. http://docs.ceph.com/docs/firefly/

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2018-07-17 12:20:01 +02:00
Sage Weil
6564adc878 Merge remote-tracking branch 'private/wip-cx'
Conflicts:
	src/include/ceph_features.h
	src/msg/async/AsyncConnection.cc
	src/msg/async/AsyncConnection.h

- SERVER_NAUTLIUS feature recently added to default
- msgr2 bool
- ceph::shared_ptr -> std::shared_ptr
2018-07-09 08:07:29 -05:00
David Disseldorp
fbc3330102 doc/bluestore: fix minor typos in compression section
Signed-off-by: David Disseldorp <ddiss@suse.de>
2018-07-05 13:41:35 +02:00