Commit Graph

123778 Commits

Author SHA1 Message Date
Patrick Donnelly
84ae38594d
qa: test nowsync option in kernel client workflows
Fixes: https://tracker.ceph.com/issues/49341
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-06-04 19:15:12 -07:00
Patrick Donnelly
88f74dbfa6
qa: deep merge top level overrides for fuse/kclient
This allows for array/dict configs like mntopts to accumulate changes
from multiple yaml fragments.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-06-04 19:15:12 -07:00
Kefu Chai
9370ac9fa8
Merge pull request #41581 from tchaikov/wip-options-mgr-mon
common/options: extract mgr and mon options out

Reviewed-by: Neha Ojha <nojha@redhat.com>
2021-06-05 10:06:07 +08:00
Kefu Chai
e8f1477da1
Merge pull request #40073 from jmolmo/delete_service_causes_osd_removal
mgr/cephadm: Warn about OSDs to remove manually when deleting an OSD service

Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
Reviewed-by: Adam King <adking@redhat.com>
2021-06-05 08:44:42 +08:00
Patrick Donnelly
8f8ba3d783
Merge PR #41697 into master
* refs/pull/41697/head:
	script: add a few more volume mounts for sepia
	script: drop ceph-fuse from docker debugging
	script: enable centos debuginfo repo for debugging
	script: update repo url for multi-arch builds
	script: fetch autobuild.asc key via HTTPS

Reviewed-by: Kefu Chai <kchai@redhat.com>
2021-06-04 13:07:42 -07:00
Kefu Chai
68dd82e2c2
Merge pull request #41690 from tchaikov/wip-test-alloc_aging
test/objectstore/unittest_alloc_aging: init cct

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
2021-06-05 01:57:03 +08:00
Kefu Chai
6d0c175ae7
Merge pull request #41698 from tchaikov/wip-qa-rook
qa/suites/orch/rook/smoke: stop testing on ubuntu 18.04

Reviewed-by: Sage Weil <sage@redhat.com>
2021-06-05 01:23:35 +08:00
Kefu Chai
104f054cee qa/suites/orch/rook/smoke: stop testing on ubuntu 18.04
even rook does not really install ceph packages in the host directly, it
uses the ceph container image. but teuthology insists on checking the
existence of debian packages by querying shaman server when it sees a
teuthology facet file which includes:

os_type: ubuntu
os_version: "18.04"

but since we've stopped building ubuntu/bionic packages, teuthology
just complains when we are scheduling test suites which are composed
from facets in qa/suites/orch/rook/smoke.

in this change, the ubuntu_18.04.yaml is dropped because ubuntu/bionic
does not really increase the test coverage of ceph. it helps to test
the rook and container runtime though.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2021-06-05 01:11:14 +08:00
Patrick Donnelly
51b60e990b
script: add a few more volume mounts for sepia
We now have a few Ceph file systems with various possible mount points
depending which lab machine you're using.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-06-04 09:33:54 -07:00
Patrick Donnelly
9d71fc1bee
script: drop ceph-fuse from docker debugging
Install this on the fly as necessary...

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-06-04 09:33:30 -07:00
Patrick Donnelly
efbfd678a2
script: enable centos debuginfo repo for debugging
So we can fetch e.g. the sqlite debuginfo packages.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-06-04 09:32:52 -07:00
Patrick Donnelly
472bc091ad
script: update repo url for multi-arch builds
Brad suggested this change based on his commit [1]. Thank you!

[1] 267cce9e83

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-06-04 09:31:19 -07:00
Patrick Donnelly
6223ab14cc
script: fetch autobuild.asc key via HTTPS
Rather than relying the key being avaiable on the LRC /ceph file system.
(Someone appears to have deleted it recently.)

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2021-06-04 09:30:04 -07:00
Kefu Chai
523fcb711f
Merge pull request #41679 from AmnonHanuhov/wip-get_rid_of_pending_q
crimson/net: Use out_q instead of pending_q

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
2021-06-04 20:13:54 +08:00
Amnon Hanuhov
a59c0a9296 crimson/net: Use out_q instead of pending_q
pending_q contains the same messages as in out_q and it is only used
for creating a bytestream out of these messages. We can just use out_q for that.

Signed-off-by: Amnon Hanuhov <ahanukov@redhat.com>
2021-06-04 14:08:56 +03:00
Kefu Chai
08ad13bca5
Merge pull request #41631 from tchaikov/wip-keyring-decode
auth/KeyRing: always decode keying as plaintext

Reviewed-by: Willem Jan Withagen <wjw@digiware.nl>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2021-06-04 17:15:06 +08:00
Kefu Chai
04fd0002a3
Merge pull request #41587 from cfsnyder/bugfix_47738
mgr/DaemonServer.cc: prevent mgr crashes caused by integer underflow that is triggered by large increases to pg_num/pgp_num

Reviewed-by: Kefu Chai <kchai@redhat.com>
2021-06-04 17:00:48 +08:00
Kefu Chai
fb114d994c
Merge pull request #41592 from tchaikov/wip-ceph-default-confffile
ceph.in: use rados.Rados.DEFAULT_CONF_FILES

Reviewed-by: Neha Ojha <nojha@redhat.com>
2021-06-04 16:59:24 +08:00
Kefu Chai
c0f619cf58
Merge pull request #41594 from tchaikov/wip/test/librados/list
test/librados/list: print reason why test fails

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
2021-06-04 16:58:59 +08:00
Kefu Chai
8a0c13f437
Merge pull request #36941 from hoamer/patch-1
doc/mgr/administrator: add a more precise description for creating key

Reviewed-by: Kefu Chai <kchai@redhat.com>
2021-06-04 16:57:41 +08:00
hoamer
78bb36cb62 doc/mgr/administrator: add a more precise description for creating key
added a more precise description to handle filename when creating key for mgr

Signed-off-by: hoamer <kontakt@sebastian-neugebauer.de>
2021-06-04 16:56:56 +08:00
Kefu Chai
8b651b3522 test/objectstore/unittest_alloc_aging: init cct
* initialize the cct use by test, otherwise g_ceph_context is
  not set at all.
* instead of using g_ceph_context, use static member variable cct.
  less dependency to the global instance.
* setup and teardown the cct for test suite, because global_init()
  initialize g_ceph_context, which cannot be set multiple times.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2021-06-04 14:07:29 +08:00
Kefu Chai
f3ec742fe2 test/objectstore: s/TearDownTestCase/TearDownTestSuite/
TearDownTestCase is deprecated by GTest. let's use the new API instead.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2021-06-04 14:07:23 +08:00
Kefu Chai
dba26fc7a8
Merge pull request #41652 from tchaikov/wip-qa-asock-or
qa/tasks/admin_socket: support "foo || bar" as command

Reviewed-by: Samuel Just <sjust@redhat.com>
2021-06-04 13:50:38 +08:00
Kefu Chai
c4d5121305
Merge pull request #41686 from t-msn/update-trace-doc
doc/dev: update how to use lttng/blkin trace

Reviewed-by: Kefu Chai <kchai@redhat.com>
2021-06-04 12:30:23 +08:00
Misono Tomohiro
53a68e27bb doc/dev: update how to use lttng/blkin trace
Update doc to reflect current status.

Signed-off-by: Misono Tomohiro <misono.tomohiro@jp.fujitsu.com>
2021-06-04 11:41:29 +09:00
Sage Weil
9b9d90bddf Merge PR #41553 into master
* refs/pull/41553/head:
	ceph-volume: replace __ with _ in device_id

Reviewed-by: Kefu Chai <kchai@redhat.com>
2021-06-03 22:04:55 -04:00
Sage Weil
ce033d629c Merge PR #41636 into master
* refs/pull/41636/head:
	mgr/cephadm/inventory: do not try to resolve current mgr host
	pybind/mgr/mgr_module: make get_mgr_ip() return mgr's IP from mgrmap
	mgr/restful: use get_mgr_ip() instead of hostname

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2021-06-03 22:04:32 -04:00
Kefu Chai
b5d9418c11
Merge pull request #41674 from tchaikov/wip-vstart-without-restful
vstart.sh: add an option named --without-restful

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2021-06-04 09:44:58 +08:00
Kefu Chai
25480e2452
Merge pull request #41670 from tchaikov/wip-op-tracking-spin-off-0
crimson, common: improve const-correctness of Operation::dump()s.

Reviewed-by: Samuel Just <sjust@redhat.com>
2021-06-04 07:50:44 +08:00
Kefu Chai
295afe68a2
Merge pull request #41672 from tchaikov/wip-crimson-test-handle-fut
test/crimson/seastore: always handle returned future<>

Reviewed-by: Samuel Just <sjust@redhat.com>
2021-06-04 07:50:21 +08:00
Patrick Donnelly
e3c4ae389e
Merge PR #41654 into master
* refs/pull/41654/head:
	mds: do not infinitely recursively print a metric

Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Jeff Layton <jlayton@redhat.com>
2021-06-03 13:34:54 -07:00
Patrick Donnelly
943915f8df
Merge PR #41639 into master
* refs/pull/41639/head:
	mds/scrub: write root inode backtrace at creation

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2021-06-03 13:33:58 -07:00
Patrick Donnelly
a12db7941b
Merge PR #41499 into master
* refs/pull/41499/head:
	qa/tasks/mds_thrash: fix thrash iteration never skip

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2021-06-03 13:33:27 -07:00
Patrick Donnelly
a52712f955
Merge PR #41443 into master
* refs/pull/41443/head:
	test: update log-ignorelist for fs:mirror test

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2021-06-03 13:23:17 -07:00
Patrick Donnelly
4e1f812461
Merge PR #39910 into master
* refs/pull/39910/head:
	test: Add test for mgr hang when osd is full
	mgr: Set client_check_pool_perm to false
	mds: Add full caps to avoid osd full check

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2021-06-03 13:22:23 -07:00
Dan Mick
2544ee6d7f
Merge pull request #41559 from dmick/wip-grafana-container
monitoring/grafana/build/Makefile: revamp for arm64 builds, pushes to docker and quay, jenkins
2021-06-03 11:32:24 -07:00
Sage Weil
45e806676b mgr/cephadm/inventory: do not try to resolve current mgr host
The CNI configuration may set up a private network for the container, which
is mapped to the hostname in /etc/hosts.  For example, my test box sets
up 10.88.0.0/24 because I was using crio + kubeadm on this host earlier
(at least I think that's why):

$ sudo podman run --rm --name test123 --entrypoint /bin/bash -it quay.ceph.io/ceph-ci/ceph:master -c "cat /etc/hosts"
127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
::1         localhost localhost.localdomain localhost6 localhost6.localdomain6
10.88.0.8	f9e91bf2478f test123

In any case, we should never trust a lookup of our own hostname from inside
a container!

This isn't quite sufficient, though: if this is a single-host cluster, then
we fall back to using get_mgr_ip(). That value may be distorted by the
public_network option on the mgr, but we don't have any other good
options here, and single-node clusters are unlikely to have complex
network configs.

Refactor a bit to avoid the try/except nesting.

Signed-off-by: Sage Weil <sage@newdream.net>
2021-06-03 12:11:31 -04:00
Sage Weil
72d72fc5db pybind/mgr/mgr_module: make get_mgr_ip() return mgr's IP from mgrmap
The previous approach was convoluted: we tried to do a DNS lookup on the
hostname, which would fail if /etc/hosts had an entry.  Which, with podman,
it does.  And the IP it has will vary in all sorts of weird ways.  For
example, CNI on my host means that I get a dynamic address in 10.88.0.0/24.

Avoid all of that nonsense and use the IP that is in the mgrmap.  There
may be multiple IPs (v2 + v1, or maybe even IPv4 + v6 in the future); in
that case, use the first one.

Signed-off-by: Sage Weil <sage@newdream.net>
2021-06-03 12:11:31 -04:00
Sage Weil
df79946174 mgr/restful: use get_mgr_ip() instead of hostname
Now we match dashboard!

Signed-off-by: Sage Weil <sage@newdream.net>
2021-06-03 12:11:31 -04:00
Neha Ojha
11252f6117
Merge pull request #41308 from sseshasa/wip-osd-benchmark-for-mclock
osd: Run osd bench test to override default max osd capacity for mclock

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2021-06-03 08:39:22 -07:00
Casey Bodley
38ae161670
Merge pull request #41316 from cbodley/wip-50785
rgw: parse tenant name out of rgwx-bucket-instance

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>
2021-06-03 11:05:00 -04:00
Kefu Chai
a53592e484
Merge pull request #41677 from tchaikov/wip-oom
ceph.spec.in: increase the mem_per_job to 3GiB

Reviewed-by: David Galloway <dgallowa@redhat.com>
2021-06-03 22:40:26 +08:00
Casey Bodley
0f44459bc9
Merge pull request #41668 from pleiadesian/patch-bucket-chown
rgw: require bucket name in bucket chown

Reviewed-by: Or Friedmann <ofriedma@redhat.com>
Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
2021-06-03 10:28:35 -04:00
Casey Bodley
416d60c20d
Merge pull request #41462 from yehudasa/wip-50920
rgw: auth v4 client: don't convert '+' to space

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2021-06-03 10:16:30 -04:00
Kefu Chai
7577fec7fd cmake: increase the MAX_{LINK,COMPILE}_MEM
based on recent observation, quite a few C++ source file take
around more than 3.0GiB to compile. for instance,
test_mock_HttpClient.cc could take up to 6270MiB memory to compile.

so increase MAX_{LINK,COMPILE}_MEM accordingly.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2021-06-03 21:03:03 +08:00
Kefu Chai
38be14bc0f ceph.spec.in: increase the mem_per_job to 3GiB
to lower the number of jobs, we are experiencing build failures on
a builder with 48c96t, 193 free mem. the failures were caused by
OOM killer which kills the c++ compiler

[498376.128969] oom-kill:constraint=CONSTRAINT_NONE,nodemask=(null),cpuset=/,mems_allowed=0,global_oom,task_memcg=/system.slice/jenkins.service,task=cc1plus,pid=1387895,uid=1110
[498376.145288] Out of memory: Killed process 1387895 (cc1plus) total-vm:3323312kB, anon-rss:3164568kB, file-rss:0kB, shmem-rss:0kB, UID:1110
[498376.315185] oom_reaper: reaped process 1387895 (cc1plus), now anon-rss:0kB, file-rss:0kB, shmem-rss:0kB
[498377.882072] cc1plus invoked oom-killer: gfp_mask=0x6200ca(GFP_HIGHUSER_MOVABLE), order=0, oom_score_adj=0

before this change, we use the total memory to calculate the number
of jobs, and assume that each job takes at most 2.5GiB mem. in the
case above, the # of job is 96.

after this change, we use the free memory, and increse the mem per job
to 3.0GiB. in the case above, the # of job would be 85.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2021-06-03 21:02:33 +08:00
Kefu Chai
e9ac37d424
Merge pull request #41669 from tchaikov/wip-crimson-asok-dump-metrics
crimson/admin: s/perf dump_seastar/dump_metrics/

Reviewed-by: Amnon Hanuhov <ahanukov@redhat.com>
2021-06-03 19:45:23 +08:00
Kefu Chai
284884218a vstart.sh: use here document to display multi-line message
for better readability

Signed-off-by: Kefu Chai <kchai@redhat.com>
2021-06-03 19:43:10 +08:00
Kefu Chai
edd05d764d vstart.sh: add an option named --without-restful"
so we don't need to wait for restful module to be loaded if not working
on this mgr module.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2021-06-03 19:43:10 +08:00