Commit Graph

142637 Commits

Author SHA1 Message Date
Laura Flores
cb10c0d821 osd: clear out unneeded pending pg-upmap-primary mappings
If the score did not improve, we should clear out any
pending pg-upmap-primary mappings so they don't execute
in situations where the same incremental is used to balance
multiple pools (i.e. in the balancer mgr module).

Signed-off-by: Laura Flores <lflores@ibm.com>
2024-01-28 13:15:49 -06:00
Laura Flores
33b5f3ddbb qa/suites/rados/thrash/d-balancer: add read and upmap-read modes to balancer testing
Here, we turn the balancer mgr module to read and upmap read mode
during various thrashing scenarios.

Signed-off-by: Laura Flores <lflores@ibm.com>
2024-01-28 13:15:49 -06:00
Laura Flores
f3c74a7be4 mgr/balancer: add pg_upmap_primaries to balancer status detail
Followup to 8a5553597c.
Streamlines some of the logic so pg upmap activity is properly
initalized, and updated in offline mode as well as online.

Signed-off-by: Laura Flores <lflores@ibm.com>
2024-01-28 13:15:49 -06:00
Laura Flores
61e721c9f1 PendingReleaseNotes: add note about read balancer mgr module integration
Signed-off-by: Laura Flores <lflores@ibm.com>
2024-01-28 13:15:49 -06:00
Laura Flores
e2ce8ed1ff mgr: add read balancer support inside the balancer module
Read balancing may now be managed automatically via the balancer
manager module. Users may choose between two new modes: ``upmap-read``, which
offers upmap and read optimization simultaneously, or ``read``, which may be used
to only optimize reads. Existing balancer commands have also been added to
contain more information about read balancing.

Run the following commands to test the new automatic behavior:
`ceph balancer on` (on by default)
`ceph balancer mode <read|upmap-read>`
`ceph balancer status`

Run the following commands to test the new supervised behavior:
`ceph balancer off`
`ceph balancer mode <read|upmap-read>`
`ceph balancer eval` | `ceph balancer eval <pool-name>`
`ceph balancer eval-verbose` | `ceph balancer eval-verbose <pool-name>`
`ceph balancer optimize <plan-name>`
`ceph balancer show <plan-name>`
`ceph balancer eval <plan-name>`
`ceph balancer execute <plan-name>`

In the balancer module, there is also a new "self_test" function which tests
the module's basic functionality. This test can be triggered with the following
commands:
`ceph mgr module enable selftest`
`ceph mgr self-test module balancer`

Related Trello: https://trello.com/c/sWoKctzL/859-add-read-balancer-support-inside-the-balancer-module
Signed-off-by: Laura Flores <lflores@ibm.com>
2024-01-28 13:15:38 -06:00
Laura Flores
15bd38eece mgr: add CephReleases class to sustainably compare releases
Changes how the upmap balancer compares min_mon_release
to account for release names eventually wrapping around the alphabet.

Signed-off-by: Laura Flores <lflores@ibm.com>
2024-01-26 22:41:23 +00:00
Laura Flores
702cb64e87
Merge pull request #55331 from ceph/revert-55096-sjust/for-review/wip-crush-msr
Revert "crush: add multistep retry rules"
2024-01-26 16:15:46 -06:00
Samuel Just
a5ce9c3863 Revert "crush: add multistep retry rules"
This PR was merged by accident before it was ready.
Let's revert for now and open a new PR.

Signed-off-by: Samuel Just <sjust@redhat.com>
2024-01-26 20:32:05 +00:00
Yuri Weinstein
37d5d931b0
Merge pull request #55096 from athanatos/sjust/for-review/wip-crush-msr
crush: add multistep retry rules

Reviewed-by: Laura Flores <lflores@redhat.com>
2024-01-26 11:57:53 -08:00
Casey Bodley
1112689da4
Merge pull request #55303 from cbodley/wip-63130-debug
cmake/arrow: don't treat warnings as errors

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
2024-01-26 14:18:48 +00:00
Ilya Dryomov
2b11aa38ea
Merge pull request #55234 from ajarr/wip-64063
rbd-nbd: use netlink interface by default

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2024-01-26 12:37:52 +01:00
Kefu Chai
d813ce1923
Merge pull request #55121 from zhscn/fix-ambiguous-error
common: fix ambiguous error when using gcc 13

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Kefu Chai <tchaikov@gmail.com>
2024-01-26 14:42:06 +08:00
Yingxin
3e190e5614
Merge pull request #54896 from cyx1231st/wip-crimson-save-conn-foreign-copy
crimson/osd: drop a foreign-copy to shard-0 for every pg operation

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
2024-01-26 13:47:37 +08:00
Casey Bodley
ecb4eb14e5
Merge pull request #52496 from adamemerson/wip-rgw-surface-neorados
rgw: Surface neorados

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
2024-01-26 02:43:44 +00:00
Laura Flores
fc4ff1796e
Merge pull request #55308 from ljflores/wip-dependabot 2024-01-25 17:56:29 -06:00
zdover23
77fbe9ead3
Merge pull request #55307 from zdover23/wip-doc-2024-01-25-radosgw-admin-usage
doc/radosgw: edit "Usage" admin.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2024-01-26 09:24:13 +10:00
Zac Dover
d8df6f61e8 doc/radosgw: edit "Usage" admin.rst
Edit "Usage" in doc/radosgw/admin.rst.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
2024-01-26 09:12:59 +10:00
Casey Bodley
93d158711e
Merge pull request #55315 from cbodley/wip-moncommand-dencoder
mon: zero-initialize MonCommand::flags

Reviewed-by: Kefu Chai <tchaikov@gmail.com>
2024-01-25 17:12:07 +00:00
Ramana Raja
fcbf7367d2 rbd-nbd: map using netlink interface by default
Mapping rbd images to nbd devices using ioctl interface is not
robust. It was discovered that the device size or the md5 checksum
of the nbd device was incorrect immediately after mapping using
ioctl method. When using the nbd netlink interface to map RBD images
the issue was not encountered. Switch to using nbd netlink interface
for mapping.

Fixes: https://tracker.ceph.com/issues/64063
Signed-off-by: Ramana Raja <rraja@redhat.com>
2024-01-25 11:00:59 -05:00
Casey Bodley
12b7352047 mon: zero-initialize MonCommand::flags
causing failures in check-generated.sh:

2 MonCommand
/tmp/typ-qmnDZ1ahR /tmp/typ-lIgJTZiUe differ: byte 100, line 6
**** MonCommand test 1 dump_json check failed ****
   ceph-dencoder type MonCommand select_test 1 dump_json > /tmp/typ-qmnDZ1ahR
   ceph-dencoder type MonCommand select_test 1 encode decode dump_json > /tmp/typ-lIgJTZiUe
6c6
<     "flags": 94661933599904
---
>     "flags": 94342467308704

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2024-01-25 09:45:10 -05:00
Ilya Dryomov
f7b52fc712
Merge pull request #55287 from ajarr/wip-64139
rbd-nbd: fix resize of images mapped using netlink

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2024-01-25 13:04:26 +01:00
Nizamudeen A
a70dc48d48
Merge pull request #55270 from afreen23/fix-cap-inconsistency-multisite
mgr/dashboard: Fix inconsistency in capitalisation of "Multi-site"

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: rosinL <NOT@FOUND>
2024-01-25 15:40:43 +05:30
Redouane Kachach
463716e64d
Merge pull request #55182 from rkachach/fix_issue_64029
mgr/rook: adding some basic rook e2e testing
2024-01-25 10:23:43 +01:00
Samuel Just
000188f2ba
Merge pull request #55266 from athanatos/sjust/wip-63996
crimson: retain map references in OSDSingletonState::store_maps

Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
2024-01-24 21:05:09 -08:00
Samuel Just
db3addf01c crimson/osd/shard_services: retain map references in OSDSingletonState::store_maps
Introduced: 3f11cd94
Fixes: https://tracker.ceph.com/issues/63996
Signed-off-by: Samuel Just <sjust@redhat.com>
2024-01-25 03:02:06 +00:00
Samuel Just
cb02911686 crimson/osd/shard_service.cc: convert to newer logging machinery
Signed-off-by: Samuel Just <sjust@redhat.com>
2024-01-25 03:02:06 +00:00
Samuel Just
6ad175f0b9 crimson/osd/osd.cc: migrate logging to new style
Signed-off-by: Samuel Just <sjust@redhat.com>
2024-01-25 03:02:04 +00:00
Samuel Just
3eebce4aab
Merge pull request #55288 from athanatos/sjust/wip-64140
Revert "crimson/os/alienstore/alien_log: _flush concurrently"

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
2024-01-24 17:23:47 -08:00
Adam Emerson
0e584be902 rgw: Thread site config into RADOS driver
Make this available so multisite sync doesn't have to rely on the zone
service.

Signed-off-by: Adam Emerson <aemerson@redhat.com>
2024-01-24 16:59:58 -05:00
Adam C. Emerson
dc262061fe rgw: Asio frontend shares io_context with the rest of RGW
Pull the `io_context` and threads out of `AsioFrontend`, pass in a
reference to `io_context_pool` so it can be shut down at `AsioFrontend::join`

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
2024-01-24 16:59:56 -05:00
Yuri Weinstein
5ddd51e819
Merge pull request #54987 from batrick/i63822
pybind/mgr/devicehealth: skip legacy objects that cannot be loaded

Reviewed-by: Nitzan Mordechai <nmordech@redhat.com>
Reviewed-by: Yaarit Hatuka <yaarithatuka@gmail.com>
2024-01-24 13:31:31 -08:00
Yuri Weinstein
e538587e18
Merge pull request #54491 from jianwei1216/fix_osd_pg_stat_report_interval_max_cmain
fix: resolve inconsistent judgment of osd_pg_stat_report_interval_max

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Matan Breizman <Matan.Brz@gmail.com>
2024-01-24 13:30:50 -08:00
Yuri Weinstein
6bba2bc173
Merge pull request #53250 from YiteGu/add-perfcount-for-allocator
os/bluestore: add perfcount for bluestore/bluefs allocator

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
2024-01-24 13:30:07 -08:00
Yuri Weinstein
2fac6737f8
Merge pull request #52530 from amathuria/wip-amat-fix-59531
osd: Add memstore to unsupported objstores for QoS

Reviewed-by: Sridhar Seshasayee <sseshasa@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
2024-01-24 13:28:16 -08:00
Adam C. Emerson
7c4eee61c4 rgw: SAL drivers take boost::asio::io_context
`RadosDriver` needs it, and since SAL generally uses `optional_yield`,
other stores are going to need it to implement that sensibly.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
2024-01-24 15:51:46 -05:00
Adam C. Emerson
82a40dc151 ceph/async: io_context_pool constructor/start takes init function
Add overloads of the constructor and `start` that take an init
function, for setting up the thread before dropping into
`io_context::run`.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
2024-01-24 15:51:46 -05:00
Adam C. Emerson
c7789f10fe common/async: io_context_pool constructor takes int64_t
Just because this is what Ceph's config uses and it saves a narrowing
conversion. If we want to set a max value on the thread count, we
should do it in config.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
2024-01-24 15:51:46 -05:00
Adam Emerson
809e1304b0 include/inline_memory.h: Add missing includes to fix build
Signed-off-by: Adam Emerson <aemerson@redhat.com>
2024-01-24 15:51:46 -05:00
Laura Flores
2b694195bf .github: fix length of dependabot commit message prefix
Followup to commit 265f727481.

Fixes this error mesage under GitHub Insights (https://github.com/ceph/ceph/network/updates):

```
The property '#/updates/1/commit-message/prefix' was not of a maximum string length of 15
```

Signed-off-by: Laura Flores <lflores@ibm.com>
2024-01-24 20:37:04 +00:00
Ramana Raja
1712b95c78 rbd-nbd: log errors during netlink_resize() using derr
When using rbd CLI to map the images to NBD devices via netlink,
any errors that arose during image resizing in netlink_resize()
were not logged. Switching the error logging from using cerr to
derr helps log the errors from netlink_resize().

Signed-off-by: Ramana Raja <rraja@redhat.com>
2024-01-24 15:33:59 -05:00
Ramana Raja
1eebb7ba79 rbd_nbd: fix resize of images mapped using netlink
Include device identifier or cookie in the message sent to the kernel
to resize images mapped to NBD devices using netlink. Otherwise,
netlink_resize() fails and the size of the device isn't updated.

Fixes: https://tracker.ceph.com/issues/64139
Signed-off-by: Ramana Raja <rraja@redhat.com>
2024-01-24 15:33:50 -05:00
Laura Flores
cd2edcf9d9
Merge pull request #49462 from rzarzynski/wip-bug-53789
osdc: fix the ENOCONN normalization in Objecter::_linger_reconnect()
2024-01-24 14:00:03 -06:00
Casey Bodley
5ea5521383
Merge pull request #55219 from samarahu/rgw_asio_frontend_asserts
rgw/asio: Add asserts to rgw_asio_frontend.cc

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
2024-01-24 18:39:14 +00:00
Redouane Kachach
310eb4f9b7
mgr/rook: increase minikube mem to 6GB to avoid stability issues
Signed-off-by: Redouane Kachach <rkachach@redhat.com>
2024-01-24 19:03:56 +01:00
Gregory Farnum
e5b0db4b2f
Merge pull request #55192 from dparmar18/fix_docstrings_ceph_test_case
qa: typo fixes in ceph_test_case docstrings
2024-01-24 09:37:34 -08:00
Casey Bodley
d3256c4841
Merge pull request #53320 from jzhu116-bloomberg/wip-62710
rgw/multisite: maintain endpoints connectable status and retry the requests to them when appropriate

Reviewed-by: Mark Kogan <mkogan@ibm.com>
2024-01-24 16:01:43 +00:00
Adam King
bf084a5fd1
Merge pull request #54941 from samsungceph/vstart_network_v2
vstart: Pick only CIDR-formatted routes when cephadm enabled

Reviewed-by: Adam King <adking@redhat.com>
2024-01-24 10:25:23 -05:00
Casey Bodley
5c1a57a1af cmake/arrow: don't treat warnings as errors
with arrow's default BUILD_WARNING_LEVEL, -Werror is added to cflags and
debug builds fail on a warning about _FORTIFY_SOURCE

Fixes: https://tracker.ceph.com/issues/63130

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2024-01-24 09:47:05 -05:00
Casey Bodley
f4758e560f
Merge pull request #53668 from mdw-at-linuxbox/wip-master-update-kmip-1
Update libkmip to pull in some portability changes.

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2024-01-24 13:30:25 +00:00
Redouane Kachach
1b3454914e
mgr/rook: adding some basic rook e2e testing
Fixes: https://tracker.ceph.com/issues/64029

Signed-off-by: Redouane Kachach <rkachach@redhat.com>
2024-01-24 11:00:54 +01:00