RepoMirrors/ceph

mirror of https://github.com/ceph/ceph synced 2025-02-21 01:47:25 +00:00

Author	SHA1	Message	Date
Sage Weil	0f65e5cffa	qa/standalone: split osd/ into 2 directories The whole osd/ directory takes 3 hours to run. Of that, about half is osd-backfill*: 2021-04-05T20:38:55.932 INFO:tasks.workunit:Running workunit osd/osd-backfill-prio.sh... 2021-04-05T20:47:27.184 INFO:tasks.workunit:Running workunit osd/osd-backfill-recovery-log.sh... 2021-04-05T20:55:59.497 INFO:tasks.workunit:Running workunit osd/osd-backfill-space.sh... 2021-04-05T21:48:47.549 INFO:tasks.workunit:Running workunit osd/osd-backfill-stats.sh... 2021-04-05T22:17:09.197 INFO:tasks.workunit:Running workunit osd/osd-bench.sh... Signed-off-by: Sage Weil <sage@newdream.net>	2021-04-12 09:59:17 -05:00
David Zafman	4814648155	test: osd-recovery-prio.sh replace sleep with wait for both PGs recovering fixes: https://tracker.ceph.com/issues/48842 Signed-off-by: David Zafman <dzafman@redhat.com>	2021-01-11 17:30:00 -08:00
David Zafman	ef47a3e708	test: set mon_allow_pool_size_one for consistency with original test intention Signed-off-by: David Zafman <dzafman@redhat.com>	2020-11-03 21:49:00 +00:00
David Zafman	3ba7ebd3e2	test: Avoid races by waiting for PGs go clean before query Fixes: https://tracker.ceph.com/issues/46405 Signed-off-by: David Zafman <dzafman@redhat.com>	2020-10-01 19:43:57 +00:00
David Zafman	b20a277f05	test: Inconsequential change to get object names as desired Signed-off-by: David Zafman <dzafman@redhat.com>	2020-09-29 18:01:24 +00:00
Kefu Chai	4f6443737e	Merge pull request #30838 from ifed01/wip-ifed-single-alloc os/bluestore: use single allocator for shared bluestore/bluefs device Reviewed-by: Sage Weil <sage@redhat.com>	2020-08-03 18:00:16 +08:00
Igor Fedotov	9a8f1ae492	os/bluestore: fix bluefs migrate/expand to match single allocator. Signed-off-by: Igor Fedotov <ifedotov@suse.com>	2020-07-31 15:36:47 +03:00
Dan van der Ster	b550112dba	qa/standalone/osd: add bad-inc-map.sh Test that the osd doesn't crash when it gets a bad incremental osdmap. Related-to: https://tracker.ceph.com/issues/46443 Signed-off-by: Dan van der Ster <daniel.vanderster@cern.ch>	2020-07-28 23:15:42 +02:00
David Zafman	661996d434	mgr: Warn when too many reads are repaired on an OSD Include test case Configurable by setting mon_osd_warn_num_repaired (default 10) Ignore new health warning with random eio injection test Fixes: https://tracker.ceph.com/issues/41564 Signed-off-by: David Zafman <dzafman@redhat.com>	2020-06-16 17:45:27 -07:00
David Zafman	92f970cbed	test: osd-backfill-stats.sh use nobackfill to avoid races in remaining test Fixes: https://tracker.ceph.com/issues/44314 Signed-off-by: David Zafman <dzafman@redhat.com>	2020-06-05 17:48:10 -07:00
Sage Weil	04e0b9c2f8	Merge PR #34126 into master * refs/pull/34126/head: qa/*/osd-backfill-recovery-log.sh: flush_pg_stats before checking log length Reviewed-by: Sage Weil <sage@redhat.com>	2020-03-23 13:55:16 -05:00
Neha	cfebec1b12	qa/*/osd-backfill-recovery-log.sh: flush_pg_stats before checking log length It is possible for the pg dump to not be the latest when we check for newprimary in _common_test(). This is because mgr_stats_period is 5 seconds, and we may not have fetched the latest stats just yet. This causes the test to look at the same stats before and after wait_for_clean. Fixes: https://tracker.ceph.com/issues/43807 (2) Signed-off-by: Neha Ojha <nojha@redhat.com>	2020-03-23 15:37:12 +00:00
Kefu Chai	b0dca75a59	Merge pull request #34056 from xiexingguo/wip-44662 qa/*/osd-markdown.sh: propagate map to osd before testing its reaction Reviewed-by: Neha Ojha <nojha@redhat.com>	2020-03-21 14:27:51 +08:00
xie xingguo	afdff0cd3f	qa/*/osd-markdown.sh: propagate map to osd before testing its reaction Mon might fail to share the newest map with any of up osds, e.g., due to an injected broken pipe. Since we don't have any client activities during the osd-markdown tests, osds might be unaware of the map changes made through CLI. Make sure osds have pulled the newest map down before we can test its reaction correctly. Fixes: https://tracker.ceph.com/issues/44662 Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>	2020-03-19 18:17:28 +08:00
Neha	6edd1cb686	qa/standalone/osd/osd-backfill-stats.sh: get_latest_osdmap to propagate map change Fixes: https://tracker.ceph.com/issues/44518 Signed-off-by: Neha Ojha <nojha@redhat.com>	2020-03-18 22:57:41 +00:00
Deepika Upadhyay	21508bd9dd	mon/OSDMonitor: add flag `--yes-i-really-mean-it` for setting pool size 1 Adds option `mon_allow_pool_size_one` which will be disabled by default to ensure pools are not configured without replicas. If the user still wants to use pool size 1, they will have to change the value of `mon_allow_pool_size_one` to true and then have to pass flag `--yes-i-really-mean-it` to cli command: Example: `ceph osd pool test set size 1 --yes-i-really-mean-it` Fixes: https://tracker.ceph.com/issues/44025 Signed-off-by: Deepika Upadhyay <dupadhya@redhat.com>	2020-03-09 23:27:36 +05:30
xie xingguo	023524a26d	osd/PeeringState: restart peering on any previous down acting member coming back One of our customers wants to verify the data safety of Ceph during scaling the cluster up, and the test case looks like: - keep checking the status of a speficied pg, who's up is [1, 2, 3] - add more osds: up [1, 2, 3] -> up [1, 4, 5], acting = [1, 2, 3], backfill_targets = [4, 5], pg is remapped - stop osd.2: up [1, 4, 5], acting = [1, 3], backfill_targets = [4, 5], pg is undersized - restart osd.2, acting will stay unchanged as 2 belongs to neither current up nor acting set, hence leaving the corresponding pg pinning undersized for a long time until all backfill targets completes It does not pose any critical problem -- we'll end up getting that pg back into active + clean, except that the long live DEGRADED warnings keep bothering our customer who cares about data safety more than any thing else. The right way to achieve the above goal is for: boost::statechart::result PeeringState::Active::react(const MNotifyRec& notevt) to check whether the newly booted node could be validly chosen for the acting set and request a new temp mapping. The new temp mapping would then trigger a real interval change that will get rid of the DEGRADED warning. Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn> Signed-off-by: Yan Jun <yan.jun8@zte.com.cn>	2020-02-21 17:52:52 +08:00
Sage Weil	f10cc22c60	Merge PR #32961 into master * refs/pull/32961/head: qa/standalone/osd/osd-bench: debug bluestore Reviewed-by: Neha Ojha <nojha@redhat.com>	2020-01-30 10:42:17 -06:00
Sage Weil	b99e506a3f	qa/standalone/osd/osd-bench: debug bluestore Looking for https://tracker.ceph.com/issues/43888 Signed-off-by: Sage Weil <sage@redhat.com>	2020-01-29 07:43:41 -06:00
David Zafman	e18519ad09	test: Update pg log test for new trimming behavior Fixes: https://tracker.ceph.com/issues/43864 Signed-off-by: David Zafman <dzafman@redhat.com>	2020-01-28 15:23:45 -08:00
Neha	b20817795a	qa/standalone/osd/osd-backfill-recovery-log.sh: fix TEST_backfill_log_2 Fixes: https://tracker.ceph.com/issues/43807 Signed-off-by: Neha Ojha <nojha@redhat.com>	2020-01-24 22:42:04 +00:00
Neha	994698277b	qa/standalone/osd/osd-backfill-recovery-log.sh: fix TEST_backfill_log_1 Fixes: https://tracker.ceph.com/issues/43807 Signed-off-by: Neha Ojha <nojha@redhat.com>	2020-01-24 22:20:21 +00:00
David Zafman	9f7aabbe9f	test: Fix wait_for_state() to wait for a PG to get into a state To avoid confusion fix function names in osd-backfill-space.sh for how they actually work. Fixes: https://tracker.ceph.com/issues/43592 Signed-off-by: David Zafman <dzafman@redhat.com>	2020-01-13 18:39:38 -08:00
David Zafman	676d882649	test: Improve races by using kill_daemons which waits for OSDs terminate osd-backfill-space.sh: More sleep time to make sure the backfill gets started Signed-off-by: David Zafman <dzafman@redhat.com>	2019-12-06 19:44:06 -08:00
David Zafman	43f6218993	test: Use activate_osd() when restarting OSDs Signed-off-by: David Zafman <dzafman@redhat.com>	2019-12-05 15:13:31 -08:00
Sage Weil	8994a65242	qa/standalone/osd/divergent-priors: add reproducer for bug 41816 Reproducer for https://tracker.ceph.com/issues/41816 Signed-off-by: Sage Weil <sage@redhat.com>	2019-09-21 10:09:15 -05:00
David Zafman	b98950e707	osd: Rename dump_reservations to dump_recovery_reservations Signed-off-by: David Zafman <dzafman@redhat.com>	2019-09-10 13:32:29 -07:00
David Zafman	fa698e18e1	mon: Improve health status for backfill_toofull and recovery_toofull Treat backfull_toofull as a warning condition because it can resolve itself. Includes test case for PG_BACKFILL_FULL Includes test case for recovery_toofull / PG_RECOVERY_FULL Fixes: https://tracker.ceph.com/issues/39555 Signed-off-by: David Zafman <dzafman@redhat.com>	2019-06-20 02:22:01 +00:00
David Zafman	7959159e83	test: Adding standalone test of log copy handling Signed-off-by: David Zafman <dzafman@redhat.com>	2019-05-10 15:31:51 -07:00
sjust@redhat.com	252d5c20cf	osd/: move stat updates and publishing to PeeringState Signed-off-by: Samuel Just <sjust@redhat.com>	2019-05-01 11:22:24 -07:00
David Zafman	66b041fa4a	Merge pull request #27769 from dzafman/wip-39333 osd-backfill-space.sh test failed in TEST_backfill_multi_partial() Reviewed-by: Neha Ojha <nojha@redhat.com>	2019-04-26 11:55:04 -07:00
David Zafman	9931023457	test: osd-backfill-spsace.sh doesn't matter which PG wins the race Fixes: http://tracker.ceph.com/issues/39333 Signed-off-by: David Zafman <dzafman@redhat.com>	2019-04-26 10:11:00 -07:00
David Zafman	39cc14bdc1	Merge pull request #27503 from dzafman/wip-39099 osd: Give recovery for inactive PGs a higher priority Reviewed-by: Sage Weil <sage@redhat.com> Reviewed-by: Neha Ojha <nojha@redhat.com>	2019-04-25 15:06:56 -07:00
David Zafman	444aa9f9fe	osd, mon: New pool recovery priority range -10 to 10 Use OSD_POOL_PRIORITY_MAX and OSD_POOL_PRIORITY_MIN constants Scale legacy priorities if exceeds maximum Signed-off-by: David Zafman <dzafman@redhat.com>	2019-04-25 13:53:27 -07:00
David Zafman	3a234164d0	Merge pull request #27279 from dzafman/wip-divergent Improvements to standalone tests Reviewed-by: Kefu Chai <kchai@redhat.com> Reviewed-by: Neha Ojha <nojha@redhat.com>	2019-04-24 10:58:11 -07:00
David Zafman	7e77898001	test: Divergent testing of _merge_object_divergent_entries() cases Case 1: A more recent update exists Case 2: The first entry in the divergent sequence is a create Case 3 NOT TESTED - Ohject currently missing Case 4: We can rollback all of the entries Case 5: We cannot rollback at least 1 of the entries Support starting OSDs even when "noup" is set (don't wait for up). Move create_ec_pool() to ceph-helpers.sh Fixes: https://tracker.ceph.com/issues/39162 Signed-off-by: David Zafman <dzafman@redhat.com>	2019-04-22 18:50:24 -07:00
Sage Weil	755e8c4ef2	Merge PR #27595 into master * refs/pull/27595/head: osd: add 'ceph osd stop <osd.nnn>' command Reviewed-by: Sage Weil <sage@redhat.com>	2019-04-20 08:52:01 -05:00
xie xingguo	5dbae13ce0	osd: add 'ceph osd stop <osd.nnn>' command stop command can be used to force stopping a specified osd daemon, e.g., you don't have to pre-figure out where it located. Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>	2019-04-18 13:55:02 +08:00
Sage Weil	dc97651cbd	Merge PR #27499 into master * refs/pull/27499/head: qa/standalone/osd/osd-markdown: fix dup command disabling Reviewed-by: Neha Ojha <nojha@redhat.com>	2019-04-12 06:54:58 -05:00
Sage Weil	f7216d0b2c	qa/standalone/osd/osd-markdown: fix dup command disabling The ceph cli tool checks for the presence of the variable, not its value. Fixes: http://tracker.ceph.com/issues/38359 Signed-off-by: Sage Weil <sage@redhat.com>	2019-04-10 16:44:38 -05:00
David Zafman	69fa515c95	test: Make most tests use default objectstore bluestore Change run_osd() to default objectstore bluestore Use run_osd_filestore() to use the non-default objectstore Fix inject_eio to handle any objectstore if config prefixed with type Remaining tests using filestore: osd-pool-create.sh TEST_pool_create_rep_expected_num_objects Test filestore directory creation qa/standalone/osd/osd-dup.sh TEST_filestore_to_bluestore Obvious qa/standalone/osd/osd-rep-recov-eio.sh TEST_rep_read_unfound Requires data digest in object info qa/standalone/scrub/osd-scrub-repair.sh multiple tests Erasure code pools append mode for filestore is tested qa/standalone/special/ceph_objectstore_tool.py Test code verifies COT by directly examining filestore contents Fixes: https://tracker.ceph.com/issues/39162 Signed-off-by: David Zafman <dzafman@redhat.com>	2019-04-10 08:55:04 -07:00
xie xingguo	6a8aedc107	qa: add new test case for pulling error Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>	2019-04-04 11:04:43 +08:00
David Zafman	11f072fee1	Add checking of num_shards_repaired in osd stats Signed-off-by: David Zafman <dzafman@redhat.com>	2019-04-04 11:04:42 +08:00
Sage Weil	420edba243	Merge PR #27169 into master * refs/pull/27169/head: common/config: parse --default-$option as a default value Reviewed-by: Sébastien Han <seb@redhat.com> Reviewed-by: Neha Ojha <nojha@redhat.com>	2019-03-27 09:48:33 -05:00
Sage Weil	fdd2000631	common/config: parse --default-$option as a default value Sometimes it is useful to specify an alternative default value for an option via the command line such that it has a lower priority than the mon config database, config file, the rest of the command line, or the environment. Signed-off-by: Sage Weil <sage@redhat.com>	2019-03-26 11:00:27 -05:00
David Zafman	d2ca3d2feb	osd: Track num_objects_repaired in pg stats 2(3) Leave repair pg state on until recovery finishes or a new scrub starts Fixes: http://tracker.ceph.com/issues/38616 Signed-off-by: David Zafman <dzafman@redhat.com>	2019-03-25 16:03:36 -07:00
Sage Weil	be1187575b	Merge PR #27021 into master * refs/pull/27021/head: msg: remove XioMessenger qa/suites/rados/thrash-old-clients: add nautilus qa/suites/rados/thrash-old-clients: add mimic v1 variant qa/suites/rados/thrash-old-clients: add mimic qa/suites/rados/thrash-old-clients: collapse msgr and client choice qa: remove simplemessenger tests ceph_test_msgr: remove simple msg: remove SimpleMessenger Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn> Reviewed-by: Matt Benjamin <mbenjami@redhat.com> Reviewed-by: Kefu Chai <kchai@redhat.com>	2019-03-22 04:42:30 -05:00
Sage Weil	28b4392a71	qa: remove simplemessenger tests Signed-off-by: Sage Weil <sage@redhat.com>	2019-03-20 06:10:25 -05:00
Sage Weil	fb915c4805	osd/PG: invalidate PG if merging with unexpected version If the source or target PG version is 0'0, we may silently take the max of the source and target and still leave the PG complete. This specifically can happen with an empty PG, as seen with bug 38655. In theory we could encounter one of the PGs with some other last_update that doesn't match what we expect. If that ever happens, make sure the result is incomplete so that backfill can clean up. Additionally check that the pool metadata for the last merge matches the PGs at all. This could mismatch if we have an osdmap gap and are forced to do some merge without merge info at all... in which case we should definitely invalidate: there should be newer copies of the PG(s), and we have no idea whether the PGs we are merging are what we want. If this is some disaster recovery situation, an operator is always free to use ceph-objectstore-tool to re-mark a PG complete (at their own peril!). Fixes: http://tracker.ceph.com/issues/38655 Signed-off-by: Sage Weil <sage@redhat.com>	2019-03-12 10:08:46 -05:00
Sage Weil	f978b27d2b	qa/standalone/osd/pg-split-merge.sh: reproduce pg merge problem with empty pgs This reproduces http://tracker.ceph.com/issues/38655 Signed-off-by: Sage Weil <sage@redhat.com>	2019-03-11 17:10:28 -05:00

1 2 3

128 Commits