1187 Commits

Author SHA1 Message Date
Michael Prokop
d98b3e36a9 doc: fix typos
s/amount of times/number of times/
s/assosciated/associated/
s/availabe/available/
s/Commiting/Committing/
s/Containter/Container/
s/dependant/dependent/
s/developement/development/
s/filesytem/filesystem/
s/guarenteed/guaranteed/
s/hiearchy/hierarchy/
s/intance/instance/
s/Interger/Integer/
s/mutiple/multiple/
s/nubmer/number/
s/occured/occurred/
s/overriden/overridden/
s/reseted/reset/
s/sytem/system/
s/unkown/unknown/

Signed-off-by: Michael Prokop <mika@grml.org>
2019-09-26 09:17:07 +02:00
Sage Weil
6e46b1c0e5 osd/OSDMap: health alert for non-power-of-two pg_num
Fixes: https://tracker.ceph.com/issues/41647
Signed-off-by: Sage Weil <sage@redhat.com>
2019-09-24 09:26:33 -05:00
Sage Weil
7b988e9fe1 doc: remove all pg_num arguments to 'osd pool create'
Also, update the discussion about pg_num and pool creation, with a reference
to the autoscaler.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-09-22 16:58:33 -05:00
Anthony D'Atri
be4582c26a Change osd op queue cut off default to high
Discussion: https://www.mail-archive.com/ceph-users@ceph.io/msg00166.html

Signed-off-by: Anthony D'Atri <anthony.datri@gmail.com>
2019-09-18 06:50:27 -07:00
Patrick Donnelly
a779766be9
Merge PR #30014 into master
* refs/pull/30014/head:
	doc: update CephFS overview in introductory page
	doc: filesystem to file system

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Sidharth Anupkrishnan <sanupkri@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Jeff Layton <jlayton@redhat.com>
2019-09-11 09:59:51 -07:00
Danny Abukalam
6deba986b3 doc: Update ceph-deploy docs from dumpling to nautilus
Signed-off-by: Danny Abukalam <danny@softiron.co.uk>
2019-09-10 18:53:48 +01:00
Patrick Donnelly
e7a7cf429e
doc: filesystem to file system
"Filesystem" is not a word (although fairly common in use).

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-09-10 08:43:28 -07:00
Francois Deppierraz
858b88f195 doc: Fix a typo in balancer documentation
Signed-off-by: Francois Deppierraz <francois@ctrlaltdel.ch>
2019-09-06 15:15:13 +02:00
David Zafman
336b6b66ca
Merge pull request #28755 from dzafman/wip-network
feature: Health warnings on long network ping times, add "dump_osd_network" to get a report

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2019-09-05 07:54:43 -07:00
David Zafman
71015b94ab doc: Document network performance monitoring
Signed-off-by: David Zafman <dzafman@redhat.com>
2019-09-04 16:01:11 -07:00
David Zafman
5f83a6158b osd doc mon mgr: To milliseconds for config value, user input and threshold out
Signed-off-by: David Zafman <dzafman@redhat.com>
2019-09-04 17:13:32 +00:00
James McClune
8d8cbabff4 doc: updated ceph monitor config options
Executed ceph-conf --dump-all on a freshly installed v14.2.2 (nautilus)
cluster. Compared the global defaults to the keys/values specified in
mon-config-ref.rst. Checked options.cc to make sure the obsolete keys
are no longer used.

Fixes: https://tracker.ceph.com/issues/41516
Signed-off-by: James McClune <jmcclune@mcclunetechnologies.net>
2019-08-29 23:54:10 -04:00
Nathan Cutler
cde6489691
Merge pull request #29889 from smithfarm/wip-clay-typo-fwdport
doc/rados: Correcting some typos in the clay code documentation

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-08-28 09:32:15 +02:00
David Zafman
048f809626 osd mgr: Add osd_mon_heartbeat_stat_stale option to time out ping info
after 1 hour

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-08-26 15:25:34 +00:00
David Zafman
f4a0be2e87 doc: Add documentation and release notes
Signed-off-by: David Zafman <dzafman@redhat.com>
2019-08-26 15:25:34 +00:00
David Zafman
66d44e7f91 osd mon: Track heartbeat ping times and report health warning
Fixes: http://tracker.ceph.com/issues/40640

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-08-26 15:25:32 +00:00
Myna V
45a5834b35 doc/rados: Correcting some typos in the clay code documentation
Signed-off-by: Myna <mynaramana@gmail.com>

(cherry picked from commit a20ba26721)

Note: This documentation fix was merged to nautilus via
https://github.com/ceph/ceph/pull/29191 without being merged to master, first.
This commit forward-ports the fix to master.
2019-08-26 11:05:23 +02:00
James McClune
820ada1e58 doc: default values for mon_health_to_clog_* were flipped
On a freshly installed nautilus cluster (i.e. 14.2.2), the default values are:

mon_health_to_clog_interval = 3600
mon_health_to_clog_tick_interval = 60.000000

Fixes: https://tracker.ceph.com/issues/41403
Signed-off-by: James McClune <jmcclune@mcclunetechnologies.net>
2019-08-25 11:35:41 -04:00
Kefu Chai
fc55a51a87
Merge pull request #29579 from liewegas/wip-big-vs-bluestore
osd: scrub error on big objects; make bluestore refuse to start on big objects

Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-08-16 20:24:43 +08:00
Sage Weil
2a1b58b5ac doc/rados/operations/monitoring: document muting health alerts
I think someday the docs for how health alerts work (here) and the
enumeration of all actual alerts should be restructured.  For now this
is the simplest place to fit this!

Signed-off-by: Sage Weil <sage@redhat.com>
2019-08-14 20:40:08 -05:00
Sage Weil
95b8e9fa0d doc/rados/operations/health-checks: document MON_DISK_{LOW,CRIT,BIG}
Signed-off-by: Sage Weil <sage@redhat.com>
2019-08-14 20:40:08 -05:00
Sage Weil
dd5e985614 doc/rados/operations/health-checks: document OSD_NO_DOWN_OUT_INTERVAL
Signed-off-by: Sage Weil <sage@redhat.com>
2019-08-14 20:40:08 -05:00
Sage Weil
0eba993fad doc/rados/operations/health-checks: document AUTH_BAD_CAPS
Signed-off-by: Sage Weil <sage@redhat.com>
2019-08-14 20:40:08 -05:00
Sage Weil
7e9ba0a1c1 doc/rados/operations/health-checks: document PG_SLOW_SNAP_TRIMMING
The mitigation steps are weak, but it's not clear what concrete guidance to
provide.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-08-14 20:40:08 -05:00
Sage Weil
078ef210d5 doc/rados/operations/health-checks: document MGR_DOWN
Signed-off-by: Sage Weil <sage@redhat.com>
2019-08-14 20:40:08 -05:00
Sage Weil
1b6745efb4 doc/rados/operations/health-alerts: document BLUESTORE_NO_COMPRESSION
Signed-off-by: Sage Weil <sage@redhat.com>
2019-08-14 20:40:08 -05:00
David Zafman
5928fe8ca0 osd/PG: scrub error when objects are larger than osd_max_object_size
Signed-off-by: David Zafman <dzafman@redhat.com>
2019-08-14 20:25:12 -05:00
Anthony D'Atri
51fb48b0f7
doc: operations: correct 'comma-delimited'
CIDR blocks are comma-separated, not comma-delimited.

Signed-off-by: Anthony D'Atri <anthony.datri@gmail.com>
2019-08-13 12:50:39 -07:00
Josh Durgin
3f18ed55aa
Merge pull request #28227 from sseshasa/monCachePriority
mon/OSDMonitor: Use generic priority cache tuner for mon caches

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2019-08-09 14:23:39 -07:00
Sage Weil
f011c13547 Merge PR #29292 into master
* refs/pull/29292/head:
	os/bluestore: warn on no per-pool omap
	os/bluestore: fsck: warning (not error) by default on no per-pool omap
	os/bluestore: fsck: int64_t for error count
	os/bluestore: default size of 1 TB for testing
	os/bluestore: behave if we *do* set PGMETA and PERPOOL flags
	os/bluestore: do not set both PGMETA_OMAP and PERPOOL_OMAP
	os/bluestore: fsck: only generate 1 error per omap_head
	os/bluestore: make fsck repair convert to per-pool omap
	os/bluestore: teach fsck to tolerate per-pool omap
	os/bluestore: ondisk format change to 3 for per-pool omap
	mon/PGMap: add data/omap breakouts for 'df detail' view
	osd/osd_types: separate get_{user,allocated}_bytes() into data and omap variants
	mon/PGMap: fix stored_raw calculation
	mon/PGMap: add in actual omap usage into per-pool stats
	osd: report per-pool omap support via store_statfs_t
	os/bluestore: set per_pool_omap key on mkfs
	osd/osd_types: count per-pool omap capable OSDs
	os/bluestore: report omap_allocated per-pool
	os/bluestore: add pool prefix to omap keys
	kv/KeyValueDB: take key_prefix for estimate_prefix_size()
	os/bluestore: fix manual omap key manipulation to use Onode::get_omap_key()
	os/bluestore: make omap key helpers Onode methods
	os/bluestore: add Onode::get_omap_prefix() helper
	os/bluestore: change _do_omap_clear() args

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2019-08-09 10:40:45 -05:00
Sage Weil
b8501164ef os/bluestore: warn on no per-pool omap
Signed-off-by: Sage Weil <sage@redhat.com>
2019-08-09 08:21:18 -05:00
Neha Ojha
c9d2833b25
Merge pull request #29425 from aclamk/wip-bluestore-monitor-allocations
[bluestore][tools] Inspect allocations in bluestore

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Igor Fedotov <ifedotov@suse.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-08-07 11:37:34 -07:00
Adam Kupczyk
713f9b4d09 doc/rados/operations/health-checks: document BlueStore fragmentation and BlueFS space available features
Signed-off-by: Adam Kupczyk <akupczyk@redhat.com>
2019-08-07 19:18:21 +02:00
Sridhar Seshasayee
3b96417e18 mon/OSDMonitor: Use generic priority cache tuner for mon caches
Use priority cache manager to tune inc, full and rocksdb caches.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
2019-08-06 20:22:16 +05:30
Jan Fajerski
f0746bbbec
Merge pull request #27859 from jan--f/update-bluestore-cache-doc
doc: update bluestore cache settings and clarify data fraction
2019-08-06 13:32:58 +02:00
Sage Weil
c6eafb7343 Merge PR #29337 into master
* refs/pull/29337/head:
	mon: enable telemetry module by default
	mgr/telemetry: force re-opt-in if the report contents change
	mgr/telemetry: less noise in the log
	mgr/telemetry: wake up serve on config change
	mgr/telemetry: track telemetry report revisions

Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-08-05 13:08:53 -05:00
Abhishek L
067db76a4c
Merge pull request #29364 from capri1989/doc-pg-num
doc: pg_num should always be a power of two

Reviewed-By: Jan Fajerski <jfajerski@suse.com>
Reviewed-By: Sage Weil <sage@redhat.com>
Reviewed-By: Abhishek Lekshmanan <abhishek@suse.com>
2019-08-01 14:18:53 +02:00
Sage Weil
143e1f0469 mgr/telemetry: force re-opt-in if the report contents change
Signed-off-by: Sage Weil <sage@redhat.com>
2019-07-31 20:33:19 -05:00
Lars Marowsky-Bree
3a7e171df6 doc: Address further comments on choosing pg_num
Signed-off-by: Lars Marowsky-Bree <lmb@suse.com>
2019-07-31 15:14:04 +02:00
Lars Marowsky-Bree
bd20d692bb doc: adjust examples to use 2^n pg_num
The examples used pg_num set to 300 or 1000. This misled some users.

Signed-off-by: Lars Marowsky-Bree <lmb@suse.com>
2019-07-31 15:03:48 +02:00
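
Illustration for bd20d692bb: a minimal Python sketch of the power-of-two rule that these doc commits recommend for pg_num. Both function names are hypothetical, and this is not how Ceph itself validates or autoscales pg_num; it only shows why 300 and 1000 were poor example values.

```python
def is_power_of_two(n: int) -> bool:
    """Return True when n is a positive power of two (1, 2, 4, 8, ...)."""
    # A power of two has exactly one bit set, so n & (n - 1) clears it to zero.
    return n > 0 and (n & (n - 1)) == 0


def nearest_power_of_two(n: int) -> int:
    """Return the power of two closest to n; ties round up (hypothetical helper)."""
    if n < 1:
        raise ValueError("n must be positive")
    lower = 1 << (n.bit_length() - 1)  # largest power of two <= n
    upper = lower << 1                 # smallest power of two > n
    return lower if (n - lower) < (upper - n) else upper
```

With the old example values, nearest_power_of_two(300) gives 256 and nearest_power_of_two(1000) gives 1024.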
Kai Wagner
3660e17e4b doc: pg_num should always be a power of two
Signed-off-by: Kai Wagner <kwagner@suse.com>
2019-07-30 08:35:16 +02:00
Jan Fajerski
9d8336a7f4 doc: update bluestore cache settings and clarify data fraction
Fixes: http://tracker.ceph.com/issues/39522

Signed-off-by: Jan Fajerski <jfajerski@suse.com>
2019-07-29 13:58:32 +02:00
Kefu Chai
3ed3de6c96 doc/rados/operations: crush_rule is a name
like
```
ceph osd pool set <pool-name> crush_rule <rule-name>
```
where `<rule-name>` is a string instead of a number.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2019-07-27 21:35:39 +08:00
Sage Weil
0c0093e450 Merge PR #29034 into master
* refs/pull/29034/head:
	doc/mgr/crash: document missing commands, options
	qa/suites/rados/singleton/all/test-crash: whitelist RECENT_CRASH
	qa/suites/rados/mgr/tasks/insights: whitelist RECENT_CRASH
	qa/tasks/mgr/test_insights: crash module now rejects bad crash reports
	mgr/telemetry: fix remote into crash do_ls()
	mgr/crash: don't make these methods static
	mgr/BaseMgrModule: handle unicode health detail strings
	mgr/crash: verify timestamp is valid
	qa/suites/mgr: whitelist RECENT_CRASH
	mgr/crash: remove unused var
	mgr/crash: remove unused import 'six'
	qa/workunits/rados/test_crash: health check
	mgr/crash: improve validation on post
	mgr/crash: automatically prune old crashes after a year
	mgr/crash: raise RECENT_CRASH warning for recent (new) crashes
	mgr/crash: add 'crash ls-new'
	mgr/crash: add option and serve infra
	mgr/crash: keep copy of crashes in memory
	mgr/pg_autoscaler: adjust style to match built-in tables
	mgr/crash: make 'crash ls' a nice table with a NEW column
	mgr/crash: nicely format 'crash info' output
	mgr/crash: add 'crash archive <id>', 'crash archive-all' commands

Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-07-24 17:20:21 -05:00
Kefu Chai
e6aee61076
Merge pull request #28772 from neha-ojha/wip-40528-2
osd: add hdd, ssd and hybrid variants for osd_snap_trim_sleep

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: David Zafman <dzafman@redhat.com>
2019-07-24 09:24:14 +08:00
Sage Weil
c885ee7f0c mgr/crash: raise RECENT_CRASH warning for recent (new) crashes
Signed-off-by: Sage Weil <sage@redhat.com>
2019-07-19 09:43:04 -05:00
Patrick Donnelly
a256c42d47
doc/cephfs: improve add/remove MDS section
Include hardware details and update language for modern tools.

Fixes: http://tracker.ceph.com/issues/39620
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-07-12 17:11:12 -07:00
Nathan Cutler
31f29d796a doc: cover more cache modes in rados/operations/cache-tiering.rst
Fixes: http://tracker.ceph.com/issues/14153
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2019-07-09 17:06:05 +02:00
Kefu Chai
00a67b07b1
Merge pull request #28753 from tchaikov/wip-doc-conf
doc/rados/configuration: update to be in sync with ConfUtils changes

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
2019-07-04 13:23:24 +08:00
Xie Xingguo
873db7c2ef
Merge pull request #28805 from Rjerk/fix-doc
doc/rados/configuration: fix typos in osd-config-ref.rst

Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-07-02 11:55:55 +08:00
Lan Liu
2d71227e43 doc/rados/configuration: fix typos in osd-config-ref.rst
Signed-off-by: Lan Liu <liulan@umcloud.com>
2019-07-02 10:22:03 +08:00
Kefu Chai
eceed56b95 doc/rados/configuration: update to be in sync with ConfUtils changes
Signed-off-by: Kefu Chai <kchai@redhat.com>
2019-07-02 10:13:02 +08:00
Kefu Chai
60403c0a21
Merge pull request #28750 from neha-ojha/wip-40488
doc/rados/operations/erasure-code.rst: allow recovery below min_size

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-07-02 10:00:03 +08:00
Gregory Farnum
0d9ea078f1 doc: update erasure-code.rst with review comments.
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2019-06-28 14:52:47 -07:00
Gregory Farnum
4f246cf74d doc: update erasure code recovery description
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
2019-06-28 14:52:30 -07:00
Sage Weil
b4757f32a7 Merge PR #28491 into master
* refs/pull/28491/head:
	doc/rados/operations: add safe-to-destroy check to OSD replacement workflow

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2019-06-28 13:44:18 -05:00
Neha Ojha
733df09fe5 common/options.cc, doc: osd_snap_trim_sleep overrides other variants
A value > 0 for osd_snap_trim_sleep will override the backend-specific
variants of osd_snap_trim_sleep.

Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-06-28 09:53:01 -07:00
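
Illustration for 733df09fe5: a small Python sketch of the override rule described in the commit message. The function name, the dictionary of variants, and the numeric values in the usage note are hypothetical; only the rule itself (a generic value > 0 wins over the hdd, ssd and hybrid variants) comes from the commit.

```python
from typing import Mapping


def effective_snap_trim_sleep(
    generic: float, variants: Mapping[str, float], backend: str
) -> float:
    """Pick the snap trim sleep that applies to one OSD.

    generic  -- value of osd_snap_trim_sleep
    variants -- per-backend values keyed by "hdd", "ssd" or "hybrid"
    backend  -- which of those keys describes this OSD (assumed known)
    """
    if generic > 0:
        # A positive generic value overrides every backend-specific variant.
        return generic
    return variants[backend]
```

For example, with variants {"hdd": 5.0, "ssd": 0.0, "hybrid": 2.0} (made-up numbers), a generic value of 0 yields 5.0 for an hdd OSD, while a generic value of 1.5 yields 1.5 for every backend.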
Neha Ojha
bad43f856d
Merge pull request #28775 from neha-ojha/wip-doc-delete-sleep
doc/rados/configuration/osd-config-ref.rst: document osd_delete_sleep

Reviewed-by: Sage Weil <sage@redhat.com>
2019-06-28 09:11:00 -07:00
Neha Ojha
981babc8fa doc/rados/configuration/osd-config-ref.rst: document osd_delete_sleep
Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-06-27 17:21:58 -07:00
Neha Ojha
accf95e9de doc/rados/configuration/osd-config-ref.rst: document snap trim sleep
Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-06-27 12:46:41 -07:00
Sage Weil
0a28dd0348 doc/rados/operations: add safe-to-destroy check to OSD replacement workflow
Signed-off-by: Sage Weil <sage@redhat.com>
2019-06-27 10:42:17 -05:00
Neha Ojha
bfccdecb94 doc/rados/operations/erasure-code.rst: allow recovery below min_size
https://github.com/ceph/ceph/pull/17619 allows recovery below min_size
for EC pools starting in Octopus. Also mention that this wasn't allowed
earlier.

Fixes: https://tracker.ceph.com/issues/40488
Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-06-25 15:48:48 -07:00
Kefu Chai
1b0eb21220
Merge pull request #28677 from liewegas/wip-recovery-nvme
osd: add hdd and ssd variants for osd_recovery_max_active

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Mark Nelson <mnelson@redhat.com>
2019-06-23 01:56:42 +08:00
Sage Weil
35c0d75888 osd: add hdd and ssd variants for osd_recovery_max_active
Semi-arbitrarily set the SSD max to 10 (instead of 3).  This should be
tuned based on some real data.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-06-20 16:24:51 -05:00
David Zafman
fa698e18e1 mon: Improve health status for backfill_toofull and recovery_toofull
Treat backfill_toofull as a warning condition because it can resolve itself.
Includes test case for PG_BACKFILL_FULL
Includes test case for recovery_toofull / PG_RECOVERY_FULL

Fixes: https://tracker.ceph.com/issues/39555

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-06-20 02:22:01 +00:00
Kefu Chai
ce26c99bed
Merge pull request #28418 from xiexingguo/wip-kick-recovery-priority
osd: give recovery ops initialized by client op a higher priority

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2019-06-18 23:29:42 +08:00
Hannes von Haugwitz
65c6425489 doc: update mondb recovery script
- be specific about stopped OSDs
- add missing '--no-mon-config' option
- fix indent of here script delimiting identifier
- use $host variable in for loop

Signed-off-by: Hannes von Haugwitz <hannes@vonhaugwitz.com>
2019-06-12 17:16:25 +02:00
xie xingguo
c0f87e0f91 osd: give recovery ops initialized by client op a higher priority
to use strict priority ordering.

The new "mclock_opclass/mclock_client" queue basically prioritizes
operations based on the class they belong to. The priority property
of an operation, if lower than a specific value (64, by default),
will get ignored and hence all operations from the same class will
be treated fairly in a FIFO fashion (but still limited by the total
IOPS or bandwidth available for the corresponding class).

To reduce the performance impact, a more general strategy would be
enforcing some limitations on the IOPS or bandwidth for the background
recovery (or backfill) operation class. However, this way we'll end up
blocking client operations too if they are currently blocked by some
degraded objects which need to be recovered first.

We hereby grant recovery operations of this kind a higher priority
to force them to use strict priority ordering, which should still
be of significance once we switch to the new "mclock_opclass/mclock_client"
queue.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2019-06-11 15:46:57 +08:00
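
Illustration for c0f87e0f91: a toy Python model of the cutoff behaviour the commit message describes. Operations at or above the cutoff (64 by default, per the message) are served in strict priority order; below it, the priority is ignored and each class is plain FIFO. The class name SketchOpQueue and its methods are hypothetical, and choosing the first non-empty class is only a placeholder for the real mClock scheduling between classes.

```python
import heapq
from collections import deque
from itertools import count

CUTOFF = 64  # default cutoff quoted in the commit message


class SketchOpQueue:
    """Toy model: strict-priority lane plus per-class FIFO lanes."""

    def __init__(self, cutoff=CUTOFF):
        self.cutoff = cutoff
        self._strict = []      # heap of (-priority, seq, op): highest priority first
        self._by_class = {}    # op class -> FIFO deque; priority is ignored here
        self._seq = count()    # tie-breaker that keeps equal priorities in FIFO order

    def enqueue(self, op_class, priority, op):
        if priority >= self.cutoff:
            heapq.heappush(self._strict, (-priority, next(self._seq), op))
        else:
            self._by_class.setdefault(op_class, deque()).append(op)

    def dequeue(self):
        if self._strict:
            return heapq.heappop(self._strict)[2]
        # Placeholder policy: first non-empty class. A real mClock queue would
        # choose the class from its reservation, weight and limit settings.
        for queue in self._by_class.values():
            if queue:
                return queue.popleft()
        return None
```

In this model, a recovery operation that a client is waiting on would be enqueued with a priority at or above the cutoff, so it bypasses the per-class lanes instead of being throttled together with background recovery.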
Xie Xingguo
302d7bcdd8
Merge pull request #27735 from xiexingguo/wip-device-class-noout
osd: revamp {noup,nodown,noin,noout} related commands

Reviewed-by: Sage Weil <sage@redhat.com>
2019-06-05 14:17:06 +08:00
Jason Dillaman
d23bdb7931
Merge pull request #28296 from mcv21/doc-profile-rbd
doc: note explicitly that "profile rbd" allows blacklisting

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2019-05-31 08:29:21 -04:00
Kefu Chai
f6b022bdbe
Merge pull request #27806 from ashitakasam/add-osd-alarm
osd: Better error message when OSD count is less than osd_pool_default_size

Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-05-30 21:28:54 +08:00
xie xingguo
a3b0dc29b9 doc: refresh {noup,nodown,noin,noout} changes
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
2019-05-30 10:52:38 +08:00
Matthew Vernon
6812582222 doc: note explicitly that "profile rbd" allows blacklisting
The Luminous release notes tell users to ensure that rbd clients have
the ability to blacklist other client users; this is provided by
"profile rbd", which this change now documents explicitly in the user
management documentation.

Signed-off-by: Matthew Vernon <mv3@sanger.ac.uk>
2019-05-29 14:36:48 +01:00
zjh
94237d3693 osd: Better error message when OSD count is less than osd_pool_default_size
Fixes: http://tracker.ceph.com/issues/38617

Signed-off-by: zjh <jhzeng93@foxmail.com>
2019-04-28 20:09:13 +08:00
David Zafman
39cc14bdc1
Merge pull request #27503 from dzafman/wip-39099
osd: Give recovery for inactive PGs a higher priority

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-04-25 15:06:56 -07:00
David Zafman
444aa9f9fe osd, mon: New pool recovery priority range -10 to 10
Use OSD_POOL_PRIORITY_MAX and OSD_POOL_PRIORITY_MIN constants
Scale legacy priorities if exceeds maximum

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-04-25 13:53:27 -07:00
Sage Weil
61d6d051de Merge PR #27472 into master
* refs/pull/27472/head:
	doc/rados/operations/devices: document device failure prediction

Reviewed-by: Rick Chen <rick.chen@prophetstor.com>
Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
2019-04-24 08:37:49 -05:00
Sage Weil
a3a4af3454 Merge PR #27656 into master
* refs/pull/27656/head:
	doc/dev/erasure-coded-pool: update
	doc/rados/operations/erasure-code*: update default ec profile references
	common/options: change default erasure-code-profile to k=2 m=2

Reviewed-by: Neha Ojha <nojha@redhat.com>
2019-04-24 08:14:55 -05:00
Sage Weil
67fadc711a doc/rados/operations/devices: document device failure prediction
Signed-off-by: Sage Weil <sage@redhat.com>
2019-04-23 07:10:53 -05:00
Sage Weil
69c7a4d24e doc/rados/operations/erasure-code*: update default ec profile references
Signed-off-by: Sage Weil <sage@redhat.com>
2019-04-22 11:20:55 -05:00
Anthony D'Atri
8c2b2fdd27
doc: operations: reweight-by-utilization typo
Add a missing backquote delimiter.

Signed-off-by: Anthony D'Atri <anthony.datri@gmail.com>
2019-04-19 15:15:12 -07:00
Anthony D'Atri
963bfab07d
doc: operations: improve reweight-by-utilization
Add the missing `max_change`, `max_osds`, and `--no-increasing` parameters to `reweight-by-utilization` and `test-reweight-by-utilization`.  Minor adjustments to wording.

Signed-off-by: Anthony D'Atri <anthony.datri@gmail.com>
2019-04-17 14:48:33 -07:00
Sage Weil
c2190c1ff8 Merge PR #27519 into master
* refs/pull/27519/head:
	doc/rados/operations/health-checks: document new bluestore warnings
	os/bluestore: alert on fm/bdev size mismatch
	os/bluestore: introduce legacy statfs alert

Reviewed-by: Sage Weil <sage@redhat.com>
2019-04-16 14:31:49 -05:00
Sage Weil
872590fe83 Merge PR #27563 into master
* refs/pull/27563/head:
	mon/OSDMonitor: respect crush node flags for can_mark_*()
	osd/OSDMap: add get_crush_node_flags(int osd)
	mon/OSDMonitor: make 'osd {add,rm}-{noin,noout,...}' support crush nodes
	osd/OSDMap: raise OSD_FLAGS health alert for crush node flags, too
	osd/OSDMap: add flags for crush nodes

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
2019-04-16 14:30:41 -05:00
Sage Weil
b29495954b doc/rados/operations/health-checks: document new bluestore warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2019-04-15 17:42:48 +03:00
Sage Weil
9b979a5c5d doc/release/nautilus: prescribe minimum hammer tunables and straw2 on upgrade
Signed-off-by: Sage Weil <sage@redhat.com>
2019-04-12 17:26:56 -05:00
Sage Weil
9aa9893b8f osd/OSDMap: raise OSD_FLAGS health alert for crush node flags, too
Signed-off-by: Sage Weil <sage@redhat.com>
2019-04-12 11:10:35 -05:00
Changcheng Liu
c0df98fc7e doc: fix parameter to set pg autoscale mode
osd_pool_default_pg_autoscale_mode is the right parameter to
set placement-group autoscale mode.

Signed-off-by: Changcheng Liu <changcheng.liu@intel.com>
2019-04-08 10:40:47 +08:00
Vangelis Tasoulas
24131fc59a
doc: Update documentation for the MANY_OBJECTS_PER_PG warning
The current documentation for the MANY_OBJECTS_PER_PG warning
states that the threshold can be raised to silence the health
warning by adjusting the mon_pg_warn_max_object_skew config
option on the monitors. This appears to have been untrue since
at least Luminous: the option must be adjusted on the managers.

I encountered this problem and spent quite some time injecting
mon_pg_warn_max_object_skew into the monitors, adding the option to
ceph.conf, and restarting the monitors several times, but the warning
did not go away. I had to download the code to see what was
happening, and found this:

$ git grep -A 3 mon_pg_warn_max_object_skew src/common/options.cc
src/common/options.cc:1480:    Option("mon_pg_warn_max_object_skew", Option::TYPE_FLOAT, Option::LEVEL_ADVANCED)
src/common/options.cc-1481-    .set_default(10.0)
src/common/options.cc-1482-    .set_description("max skew few average in objects per pg")
src/common/options.cc-1483-    .add_service("mgr"),

After I restarted the ceph-mgr service, the warning went away.

Signed-off-by: Vangelis Tasoulas <vangelis@tasoulas.net>
2019-04-05 19:53:35 +02:00
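
Illustration for 24131fc59a: a hedged Python sketch of what an objects-per-PG skew check might look like. The function name and the exact formula are assumptions inferred from the option description ("skew from average in objects per pg"); only the default of 10.0 is taken from the options.cc excerpt in the commit message. It is not a copy of the ceph-mgr implementation.

```python
def many_objects_per_pg(
    pool_objects: int,
    pool_pg_num: int,
    total_objects: int,
    total_pg_num: int,
    max_skew: float = 10.0,  # default shown for mon_pg_warn_max_object_skew
) -> bool:
    """Return True when a pool's objects-per-PG exceeds max_skew times the cluster average."""
    if max_skew <= 0 or pool_pg_num <= 0 or total_pg_num <= 0:
        return False  # assumed: a non-positive threshold disables the check
    cluster_avg = total_objects / total_pg_num
    if cluster_avg == 0:
        return False
    pool_avg = pool_objects / pool_pg_num
    return pool_avg / cluster_avg > max_skew
```

Under this assumed formula, raising max_skew silences the warning for a skewed pool, which matches the commit's point that the value has to change in the daemon that actually evaluates the check.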
Vanush "Misha" Paturyan
3d935c3c53 doc/rados/configuration/mon-lookup-dns: fix typo
Signed-off-by: Vanush "Misha" Paturyan <ektich@gmail.com>
2019-04-04 12:37:54 +01:00
Sage Weil
242ef7824d doc/rados/operations: document BLUEFS_SPILLOVER
Signed-off-by: Sage Weil <sage@redhat.com>
2019-04-02 11:13:31 -05:00
Jason Dillaman
41d3fdc554
Merge pull request #27074 from LenzGr/master-documentation
doc: Updated dashboard iSCSI configuration, added labels

Reviewed-by: Ricardo Marques <rimarques@suse.com>
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
2019-04-01 09:20:24 -04:00
Lenz Grimmer
71e46887d4 doc: Updated dashboard iSCSI configuration, added labels
Added note about the requirement for the latest ceph-iscsi version
3 to the dashboard documentation. Added some doc references
and replaced some URLs in the iSCSI docs with reST labels instead.

Signed-off-by: Lenz Grimmer <lgrimmer@suse.com>
2019-03-31 13:32:15 -05:00
Casey Bodley
9e949fcd5c
Merge pull request #27243 from theanalyst/doc-scheduler
config-ref: add a note on current scheduler settings.

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>
2019-03-28 14:23:11 -04:00
Abhishek Lekshmanan
909b8ef4bc docs: rgw: add a x-ref to rados dmclock docs
Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>
2019-03-28 17:04:31 +01:00
David Zafman
769cdc8294 doc: Document new pg state and changes to auto repair behavior
Fixes: http://tracker.ceph.com/issues/38616

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-03-25 16:03:36 -07:00
David Zafman
9fd4b062f1 doc: Fix the pg states and auto repair config options
Fixes: http://tracker.ceph.com/issues/38896

Signed-off-by: David Zafman <dzafman@redhat.com>
2019-03-22 19:58:00 -07:00
Kefu Chai
32df73f9f2
Merge pull request #26940 from xiexingguo/wip-monc-add-con
mon/MonClient: weight-based mon selection

Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-03-22 15:43:04 +08:00
Danny Al-Gaaf
c9441c2916 doc: fix LRC documentation
Recovery from a failure in jerasure needs only k reads,
not k+m-1.

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
2019-03-22 01:37:56 +01:00
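
Illustration for c9441c2916: a short Python sketch of the arithmetic behind the correction. For a k+m Reed-Solomon style code, any k surviving chunks are enough to rebuild a lost one, so a single failure costs k reads rather than k+m-1. The function names and the k=8, m=4 values used in the usage note are illustrative assumptions, not taken from the commit.

```python
def reads_for_single_failure(k: int, m: int) -> int:
    """Chunks that must be read to rebuild one lost chunk of a k+m code."""
    if k < 1 or m < 1:
        raise ValueError("k and m must be positive")
    return k  # any k of the k+m-1 surviving chunks suffice


def previously_documented_reads(k: int, m: int) -> int:
    """The figure the old text implied: every surviving chunk (k+m-1)."""
    if k < 1 or m < 1:
        raise ValueError("k and m must be positive")
    return k + m - 1
```

With k=8 and m=4, the corrected figure is 8 reads, compared with 11 under the old wording.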
Kefu Chai
ee5eab81e1
Merge pull request #26934 from sebastian-philipp/doc-rados-mon_command
doc/rados/api/python: Add documentation for mon_command

Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-03-20 18:27:47 +08:00