Commit Graph

1097 Commits

Author SHA1 Message Date
Sebastian Wagner
cee38745ea
Merge pull request #26404 from sebastian-philipp/orchestrator-error-handling
mgr/orchestrator: Add error handling to interface

Reviewed-by: Jeff Layton <jlayton@redhat.com>
Reviewed-by: Juan Miguel Olmo Martínez <jolmomar@redhat.com>
Reviewed-by: Tim Serong <tserong@suse.com>
2019-03-21 12:30:13 +01:00
Sage Weil
4f41800386 ms/async/ProtocolV2: add ms_die_on_bug and assert rxbuf/txbuf don't get big
Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-15 06:42:29 -05:00
Sage Weil
96b837830c Merge PR #26920 into master
* refs/pull/26920/head:
	qa/tasks/mgr/test_module_selftest: fix localized value test
	mgr/BaseMgrStandbyModule: parse prefix properly

Reviewed-by: Volker Theile <vtheile@suse.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
2019-03-13 08:16:20 -05:00
Sage Weil
ebdd003bf4 qa/tasks/mgr/test_module_selftest: fix localized value test
When mgr/selftest/testkey = foo and mgr/selftest/x/testkey is not set,
then get_localized() should return foo.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-13 07:11:47 -05:00
Patrick Donnelly
4f3df2cc82
Merge PR #26893 into master
* refs/pull/26893/head:
	qa: unmount clients prior to marking fs down

Reviewed-by: Zheng Yan <zyan@redhat.com>
2019-03-12 10:47:53 -07:00
Volker Theile
bc9643657a mgr: Fix broken get_localized_module_option function
Fixes: https://tracker.ceph.com/issues/38560

Signed-off-by: Volker Theile <vtheile@suse.com>
2019-03-11 17:25:18 +01:00
Patrick Donnelly
e7e4eea3a6
Merge PR #26818 into master
* refs/pull/26818/head:
	qa/cephfs: relax min_caps_per_client check

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2019-03-11 09:21:43 -07:00
Patrick Donnelly
9aaf6118a4
qa: unmount clients prior to marking fs down
Evicted RHEL7.5 clients may hang.

Fixes: http://tracker.ceph.com/issues/38677
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-03-11 09:15:37 -07:00
Yan, Zheng
8e81bd74c5 qa/cephfs: relax min_caps_per_client check
new kernel client proactively release caps. caps count can go below
mds_min_caps_per_client

Fixes: http://tracker.ceph.com/issues/38270
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
2019-03-07 21:32:20 +08:00
Sebastian Wagner
61d53ed70e mgr/orchestrator: Add error handling to interface
Also:

* Small test_orchestrator refactorization
* Improved Docstring in MgrModule.remote
* Added `raise_if_exception` that raises Exceptions
* Added `OrchestratorError` and `OrchestratorValidationError`
* `_orchestrator_wait` no longer raises anything
* `volumes` model also calls `raise_if_exception`

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
2019-03-07 14:13:44 +01:00
Sebastian Wagner
7ba6bece41
Merge pull request #26633 from jtlayton/wip-nfs-scale
mgr/orchestrator: Allow the orchestrator to scale the NFS server count

Reviewed-by: Sebastian Wagner <sebastian.wagner@suse.com>
2019-03-06 19:08:48 +01:00
Jeff Layton
a256735d4c mgr/orchestrator: allow scaling the NFS server count up and down
Add a new 'ceph orchestrator nfs update' command that will take the
NFS clustername and a new count as arguments. That will get translated
to a StatelessServiceSpec and passed to update_stateless_service.

Also, add the necessary stubs to the test_orchestrator and the CLI
QA test.

Signed-off-by: Jeff Layton <jlayton@redhat.com>
2019-03-06 07:15:14 -05:00
Patrick Donnelly
06e7b12d24
Merge PR #26699 into master
* refs/pull/26699/head:
	qa: unmount clients before deleting fs

Reviewed-by: Zheng Yan <zyan@redhat.com>
2019-03-01 06:46:46 -08:00
Patrick Donnelly
1f5892f820
Merge PR #26638 into master
* refs/pull/26638/head:
	doc: update documentation for standby-replay
	qa: update discontinous map test to use mds freezing
	mon: add freeze MDS command
	qa: update testing for standby-replay
	mon: add setting for fs to enable standby-replay
	ceph-mds: obsolete hot-standby option
	fs: obsolete standby_for config options
	messages/MMDSBeacon: use inline init
	mds: avoid unnecessary copy of entity_addrvec_t
	mds: use inline init for mds_info_t
	mds: use rank from MDSMap always
	mds: remove obsolete comment
	qa: use SIGTERM when stopping vstart service

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Zheng Yan <zyan@redhat.com>
2019-02-28 20:37:13 -08:00
Patrick Donnelly
a20a61f476
qa: unmount clients before deleting fs
Client unmount during test cleanup will hang if the file system was deleted.

Fixes: http://tracker.ceph.com/issues/38518
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-02-28 09:02:33 -08:00
Patrick Donnelly
4fa4eda9ee
qa: update discontinous map test to use mds freezing
and stop using standby_for_*.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-02-27 21:39:17 -08:00
Patrick Donnelly
8cbdad9f9b
qa: update testing for standby-replay
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-02-27 21:39:12 -08:00
Patrick Donnelly
2ea41574a2
qa: use SIGTERM when stopping vstart service
Otherwise MDS will not send down:dne to mons when restarting.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-02-27 21:38:24 -08:00
Volker Theile
c16d8f2964 mgr/dashboard: Configure all mgr modules in UI
Fixes: https://tracker.ceph.com/issues/37934

Signed-off-by: Volker Theile <vtheile@suse.com>
2019-02-26 14:12:07 +01:00
Lenz Grimmer
55dd5e11ce
Merge pull request #26539 from tspmelo/wip-nfs-status
mgr/dashboard: Add NFS status endpoint 

Reviewed-by: Stephan Müller <smueller@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
2019-02-25 13:12:22 +01:00
Patrick Donnelly
0989abe3fe
Merge PR #26468 into master
* refs/pull/26468/head:
	qa: config recall settings to test cache drop
	qa: check cache dump works without timeout
	mds: add 2nd order recall throttle
	mds: drive log flush and cache trim during recall
	mds: avoid gather assertion when subs exist
	mds: output full details for recall threshold

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2019-02-21 10:26:25 -08:00
Tiago Melo
824726393b mgr/dashboard: Add NFS status endpoint
Currently each time NFS page is opened and NFS Ganesha is not configured
an error notification is thrown and no extra information is given.
Now the user will be redirected to an information page.

Removed the orchestrator information since it no longer applies.

Fixes: http://tracker.ceph.com/issues/38399

Signed-off-by: Tiago Melo <tmelo@suse.com>
2019-02-21 11:14:25 +00:00
Sage Weil
d198831c44 Merge PR #26262 into master
* refs/pull/26262/head:
	mgr/orch/ssh: add qa test
	qa: install/exclude ceph-mgr-ssh package
	debian: package ssh orchestrator
	spec.ceph: package ssh orchestrator
	ssh/orch: add ssh orchestrator
2019-02-20 13:05:11 -06:00
Noah Watkins
06422208ec mgr/orch/ssh: add qa test
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
2019-02-20 09:45:48 -08:00
Lenz Grimmer
34e7f09017
Merge pull request #26450 from s0nea/wip-dashboard-readonly-config-options-2
mgr/dashboard: disallow editing read-only config options (part 2)

Reviewed-by: Volker Theile <vtheile@suse.com>
2019-02-19 12:50:30 +01:00
Lenz Grimmer
84d077a02a
Merge pull request #25233 from p-na/wip-pna-qos-pr
mgr/dashboard: Add support for managing RBD QoS

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Ricardo Dias <rdias@suse.com>
Reviewed-by: Stephan Müller <smueller@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
2019-02-19 12:43:40 +01:00
Patrick Nawracay
cfbefbf169 mgr/dashboard: Add support for managing RBD QoS
Fixes: http://tracker.ceph.com/issues/36191

Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2019-02-19 11:08:59 +01:00
Patrick Donnelly
b28dcd9d75
qa: config recall settings to test cache drop
If we use the defaults, the MDS/client will recall/release everything quickly.
We want it to take time to see things like the timeout get hit.

Fixes: http://tracker.ceph.com/issues/38348
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-02-18 16:28:37 -08:00
Patrick Donnelly
b150b22d2c
qa: check cache dump works without timeout
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-02-18 16:28:37 -08:00
Patrick Nawracay
1419346886 mgr/dashboard: Implement OSD purge
Removes the `remove` functionality from the UI.

Fixes: http://tracker.ceph.com/issues/35811

Signed-off-by: Patrick Nawracay <pnawracay@suse.com>
2019-02-18 07:18:09 +00:00
Tatjana Dehler
35c8a05a2c mgr/dashboard: check config option updatability also for bulk create
Fixes: https://tracker.ceph.com/issues/34528
Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2019-02-15 17:00:43 +01:00
Tatjana Dehler
a2d5f983ad mgr/dashboard: add missing test case for 60bda9ae42
Fixes: https://tracker.ceph.com/issues/34528
Signed-off-by: Tatjana Dehler <tdehler@suse.com>
2019-02-15 16:26:30 +01:00
Lenz Grimmer
c1f6e0e867
Merge pull request #25989 from votdev/feature_36488
mgr/dashboard: Add UI to configure the telemetry mgr plugin

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Patrick Nawracay <pnawracay@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
2019-02-14 09:43:15 +01:00
Sebastian Wagner
b8fe1554d3
Merge pull request #26346 from noahdesu/mgr-orch-cache-refresh
mgr/orch: refresh option for inventory query

Reviewed-by: Sebastian Wagner <sebastian.wagner@suse.com>
2019-02-13 11:13:52 +01:00
Volker Theile
b9177e43c0 mgr/dashboard: Add UI to configure the telemetry mgr plugin
Fixes: tracker.ceph.com/issues/36488

Signed-off-by: Volker Theile <vtheile@suse.com>
2019-02-12 16:36:56 +01:00
Noah Watkins
96c8a48010 pybind/orch: add refresh option to inventory query
allow a user of the orchestrator interface to express that the inventory
query should not read from any cached inventory state.

Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
2019-02-11 15:27:21 -08:00
Sage Weil
0a1ac6418d Merge PR #26282 into master
* refs/pull/26282/head:
	mon: include progress events in 'ceph status' output
	mon/MgrStatMonitor: persist progress events
	mgr: send progress events to mon
	mgr/progress: revise message syntax a bit
	mgr/progress: report events into mgr
	mgr: allow progress events to be reported from modules
	mgr/progress: make persist period/interval configurable
	mgr/progress: prune completed events

Reviewed-by: Tim Serong <tserong@suse.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
2019-02-11 17:26:19 -06:00
Sage Weil
059844f9b9 Merge PR #26336 into master
* refs/pull/26336/head:
	qa/tasks/keystone.py: no need for notcmalloc in example
	qa/suites/rgw/tempest/tasks/rgw_tempest: no need for notcmalloc

Reviewed-by: Alfredo Deza <adeza@redhat.com>
Reviewed-by: Yehuda Sadeh <yehuda@redhat.com>
2019-02-11 15:47:29 -06:00
Sage Weil
1d305f1264 mgr/progress: revise message syntax a bit
"osd.0", not "OSD 0"

Signed-off-by: Sage Weil <sage@redhat.com>
2019-02-08 13:50:27 -06:00
Sage Weil
9c729a78be qa/tasks/keystone.py: no need for notcmalloc in example
Signed-off-by: Sage Weil <sage@redhat.com>
2019-02-08 09:12:01 -06:00
Sebastian Wagner
1139c127d6 mgr/orchestrator: add tests to host, mgr and mon mgmt
Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
2019-02-07 12:23:51 +01:00
Lenz Grimmer
2275a41d09
Merge pull request #25918 from rjfd/wip-dashboard-nfs-backend
mgr/dashboard: NFS Ganesha management REST API

Reviewed-by: Jeff Layton <jlayton@redhat.com>
Reviewed-by: Tiago Melo <tmelo@suse.com>
2019-02-06 16:41:49 +01:00
Sebastian Wagner
d71a97b7fa mgr/orchestrator: make use of @CLICommand
Also Modified some command to allow multiple hosts
Also: Added more tests

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
2019-02-05 11:27:47 +01:00
Sebastian Wagner
d361f77a78
Merge pull request #26171 from sebastian-philipp/orchestrator-merge-create-add
mgr/orchestrator: Unify `osd create` and `osd add`

Reviewed-by: Jeff Layton <jlayton@redhat.com>
Reviewed-by: Juan Miguel Olmo Martínez <jolmomar@redhat.com>
2019-02-01 16:42:36 +01:00
Sebastian Wagner
9e926e9927 mgr/orchestrator: Unify osd create and osd add
Also:

* Added some more tests
* Better validation of drive Groups
* Simplified `TestWriteCompletion`

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
2019-02-01 10:10:59 +01:00
Patrick Donnelly
0d26266ccb
Merge PR #26038 into master
* refs/pull/26038/head:
	mds: simplify recall warnings
	mds: add extra details for cache drop output
	qa: test mds_max_caps_per_client conf
	mds: limit maximum number of caps held by session
	mds: adapt drop cache for incremental recall
	mds: recall caps incrementally
	mds: adapt drop cache for incremental trim
	mds: add throttle for trimming MDCache
	mds: cleanup SessionMap init
	mds: cleanup Session init

Reviewed-by: Zheng Yan <zyan@redhat.com>
2019-01-31 12:08:26 -08:00
Patrick Donnelly
c0b3a11484
mds: simplify recall warnings
Instead of a timeout and complicated decisions about whether the client is
releasing caps in an expeditious fashion, just use a DecayCounter that tracks
the number of caps we've recalled. This counter is decremented whenever the
client releases caps. If the counter passes a threshold, then we raise the
warning.

Similar reworking is done for the steady-state recall of client caps. Another
release DecayCounter is added so we can tell when the client is not releasing
any more caps.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2019-01-31 12:07:54 -08:00
Ricardo Dias
eabb054b1a
mgr/dashboard: ganesha: add support for sec_label_xattr CEPH fsal attribute
Signed-off-by: Ricardo Dias <rdias@suse.com>
2019-01-31 11:10:10 +00:00
Ricardo Dias
acd9fc679e
mgr/dashboard: ganesha: add 'security_label' support
Signed-off-by: Ricardo Dias <rdias@suse.com>
2019-01-31 11:10:10 +00:00
Ricardo Dias
e429afda53
qa/tasks/mgr/dashboard: NFS Ganesha tests
Signed-off-by: Ricardo Dias <rdias@suse.com>
2019-01-31 11:10:09 +00:00