RepoMirrors/ceph

mirror of https://github.com/ceph/ceph synced 2024-12-27 22:13:28 +00:00

Author	SHA1	Message	Date
Neha Ojha	7c8b627eaa	qa/*/osd-scrub-repair.sh: don't fail if PG is in active+clean+wait a0b453ad335671bd92f165115d6ee984d2412448 added the wait state, which can make PGs stay in active+clean+wait for a while instead of going into active+clean directly. As far as TEST_auto_repair_bluestore_failed is concerned, we only care about the repair state being cleared. Fixes: https://tracker.ceph.com/issues/45075 Signed-off-by: Neha Ojha <nojha@redhat.com>	2020-04-23 20:24:28 +00:00
Neha Ojha	4f82ebf41b	qa/standalone/scrub/osd-scrub-repair.sh: fix race in TEST_auto_repair_bluestore_failed We need to flush_pg_stats before checking for active+clean. Fixed: https://tracker.ceph.com/issues/45075 Signed-off-by: Neha Ojha <nojha@redhat.com>	2020-04-20 18:29:51 +00:00
Neha Ojha	61ad12e6ad	Merge pull request #34541 from neha-ojha/wip-balancer-on mgr: turn on balancer in upmap mode by default Reviewed-by: Josh Durgin <jdurgin@redhat.com>	2020-04-15 15:03:28 -07:00
Kefu Chai	eff9d0fc9a	Merge pull request #19076 from jecluis/wip-mon-fix-osdmap-lec-trim mon/OSDMonitor: allow trimming maps even if osds are down Reviewed-by: Kefu Chai <kchai@redhat.com>	2020-04-15 08:02:51 +08:00
Neha Ojha	ec85af5b19	qa/standalone/mon/osd-pool-df.sh: flush_pg_stats explicitly Signed-off-by: Neha Ojha <nojha@redhat.com>	2020-04-14 19:09:45 +00:00
Neha Ojha	321faa9c6b	qa/standalone/mon/osd-pool-df.sh: fix test to check for the right values Though the test passed, we weren't checking for the correct values: .../qa/standalone/mon/osd-pool-df.sh:62: TEST_ceph_df: ceph df -f json .../qa/standalone/mon/osd-pool-df.sh:62: TEST_ceph_df: jq .stats.total_avail_bytes ../qa/standalone/mon/osd-pool-df.sh:62: TEST_ceph_df: local global_avail=0 .../qa/standalone/mon/osd-pool-df.sh:63: TEST_ceph_df: ceph df -f json .../qa/standalone/mon/osd-pool-df.sh:63: TEST_ceph_df: jq '.pools \| map(select(.name == "$rep_poolname"))[0].stats.max_avail' ../qa/standalone/mon/osd-pool-df.sh:63: TEST_ceph_df: local rep_avail=null .../qa/standalone/mon/osd-pool-df.sh:64: TEST_ceph_df: ceph df -f json .../qa/standalone/mon/osd-pool-df.sh:64: TEST_ceph_df: jq '.pools \| map(select(.name == "$ec_poolname"))[0].stats.max_avail' ../qa/standalone/mon/osd-pool-df.sh:64: TEST_ceph_df: local ec_avail=null ../qa/standalone/mon/osd-pool-df.sh:66: TEST_ceph_df: echo '0 >= null3' ../qa/standalone/mon/osd-pool-df.sh:66: TEST_ceph_df: bc 1 ../qa/standalone/mon/osd-pool-df.sh:67: TEST_ceph_df: echo '0 >= null1.5' ../qa/standalone/mon/osd-pool-df.sh:67: TEST_ceph_df: bc 1 Signed-off-by: Neha Ojha <nojha@redhat.com>	2020-04-14 00:05:02 +00:00
Neha Ojha	480afa61b6	qa/standalone/mgr/balancer.sh: adapt test Now that the balancer is on by default the test needs these changes. Signed-off-by: Neha Ojha <nojha@redhat.com>	2020-04-14 00:05:02 +00:00
Sage Weil	731e508bbe	qa/standalone/mon/msgr-v2-transition: remove test v2 was introduced in nautilus, and we don't support mimic -> pacific upgrades (only mimic -> octopus). This test can be removed! Signed-off-by: Sage Weil <sage@redhat.com>	2020-04-08 08:10:32 -05:00
Sage Weil	279c437994	qa/standalone/mon/misc: update TEST_mon_features Signed-off-by: Sage Weil <sage@redhat.com>	2020-04-08 08:10:32 -05:00
Kefu Chai	b1738cd1ef	qa/standalone/scrub: s/$(pgid)/${pgid}/ to address the test failures like ``` 2020-04-07T15:44:58.693 INFO:tasks.workunit.client.0.smithi049.stderr:/home/ubuntu/cephtest/clone.client.0/qa/standalone/scrub/osd-scrub-repair.sh:498: TEST_auto_repair_bluestore_failed: ceph pg dump pgs 2020-04-07T15:44:58.694 INFO:tasks.workunit.client.0.smithi049.stderr://home/ubuntu/cephtest/clone.client.0/qa/standalone/scrub/osd-scrub-repair.sh:498: TEST_auto_repair_bluestore_failed: pgid 2020-04-07T15:44:58.694 INFO:tasks.workunit.client.0.smithi049.stderr:/home/ubuntu/cephtest/clone.client.0/qa/standalone/scrub/osd-scrub-repair.sh: line 498: pgid: command not found ``` Signed-off-by: Kefu Chai <kchai@redhat.com>	2020-04-08 00:54:46 +08:00
Sage Weil	04e0b9c2f8	Merge PR #34126 into master * refs/pull/34126/head: qa/*/osd-backfill-recovery-log.sh: flush_pg_stats before checking log length Reviewed-by: Sage Weil <sage@redhat.com>	2020-03-23 13:55:16 -05:00
Neha	cfebec1b12	qa/*/osd-backfill-recovery-log.sh: flush_pg_stats before checking log length It is possible for the pg dump to not be the latest when we check for newprimary in _common_test(). This is because mgr_stats_period is 5 seconds, and we may not have fetched the latest stats just yet. This causes the test to look at the same stats before and after wait_for_clean. Fixes: https://tracker.ceph.com/issues/43807 (2) Signed-off-by: Neha Ojha <nojha@redhat.com>	2020-03-23 15:37:12 +00:00
Joao Eduardo Luis	3d682c21f6	qa/standalone: exercise osdmon's last epoch clean Signed-off-by: Joao Eduardo Luis <joao@suse.de>	2020-03-23 14:58:59 +00:00
Kefu Chai	b0dca75a59	Merge pull request #34056 from xiexingguo/wip-44662 qa/*/osd-markdown.sh: propagate map to osd before testing its reaction Reviewed-by: Neha Ojha <nojha@redhat.com>	2020-03-21 14:27:51 +08:00
xie xingguo	afdff0cd3f	qa/*/osd-markdown.sh: propagate map to osd before testing its reaction Mon might fail to share the newest map with any of up osds, e.g., due to an injected broken pipe. Since we don't have any client activities during the osd-markdown tests, osds might be unaware of the map changes made through CLI. Make sure osds have pulled the newest map down before we can test its reaction correctly. Fixes: https://tracker.ceph.com/issues/44662 Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>	2020-03-19 18:17:28 +08:00
Neha	6edd1cb686	qa/standalone/osd/osd-backfill-stats.sh: get_latest_osdmap to propagate map change Fixes: https://tracker.ceph.com/issues/44518 Signed-off-by: Neha Ojha <nojha@redhat.com>	2020-03-18 22:57:41 +00:00
Sage Weil	603383605f	Merge PR #33885 into master * refs/pull/33885/head: Merge pull request #33848 from mchangir/octopus-tests-remove-suprious-whitespace Merge PR #33746 into octopus Merge PR #33830 into octopus Merge PR #33732 into octopus Merge PR #33620 into octopus Merge pull request #33876 from tchaikov/octopus-cephadm-mypy cephadm: add "assert foo is not None" for mypy check Merge pull request #33067 from tspmelo/wip-rbd-delete-with-snapshot cephadm: add grafana adopt Merge PR #33771 into octopus Merge PR #33850 into octopus Merge PR #33853 into octopus Merge PR #33857 into octopus Merge PR #32990 into octopus Merge PR #33713 into octopus Merge PR #33838 into octopus qa/tasks/cephadm: no default mon\|mgr\|crash service specs qa/suites/rados/cephadm/upgrade: upgrade start point that supports the no-spec option Merge PR #33832 into octopus cephadm: bootstrap: wait for mgr to restart after enabling a module mgr: add 'mgr_status' tell command Merge pull request #33839 from rhcs-dashboard/44538-fix-rgw-grafana-get-put-latencies Merge pull request #33743 from votdev/issue_43869_fix_qa_test cephadm: create initial mon and mgr service specs too cephadm: no need to pregenerate a crash key for the bootstrap host mgr/cephadm: do not complain when we don't have enough hosts mgr/cephadm: remove orphan daemons mgr/cephadm: report size=0 for fabricated ServiceDescription mgr/cephadm: safety check to prevent removing all mon\|mgr daemons mgr/cephadm: prevent scaling mon\|mgr below count=1 mgr/cephadm: do not remove daemons from remove_service Merge pull request #33805 from tchaikov/wip-44500 spec: Podman (temporarily) requires apparmor-abstractions on suse mgr/cephadm: Make sure we don't co-locate the same daemon monitoring: fix RGW grafana chart 'Average GET/PUT Latencies' tests: remove spurious whitespace mgr/cephadm: fix service list filtering Merge PR #33825 into octopus Merge PR #33811 into octopus Revert "Merge pull request #33673 from cbodley/wip-denc-enum" mgr/cephadm: fix upgrade order Merge PR #33801 into octopus Merge PR #33822 into octopus cephadm: bootstrap: tolerate error return from -h Merge PR #33809 into octopus Merge PR #32678 into octopus cephadm: use `sh` instead of `bash` during enter ceph.in: only shut down rados on clean exit common/ceph_timer: Pass reference to waited time on stack common/ceph_timer: Add test common/ceph_timer: Use unique_function, allowing noncopyable events common/ceph_timer: Couple cleanups common/ceph_timer: Fix namespaces common/ceph_timer: Add missing includes common/ceph_timer.h: Don't indent contents of a namespace mgr/dashboard: Crush rule modal mgr/dashboard: Preserve rule selection on pool type change mgr/dashboard: Crush rule is only send during replicated pool creation mgr/dashboard: Explicit returns in pool form mgr/dashboard: Removes fork join in pool form mgr/dashboard: Hide ECP actions during ec pool edit mgr/dashboard: Pool form erasure/replicated boolean mgr/dashboard: Change pool info API endpoint mgr/dashboard: Moves ECP info endpoint to UI-API mgr/cephadm: add _remove_osds_bg back to main loop mgr/cephadm/osd: update removal report immediately qa/tasks/ceph_manager: use StringIO for capturing COT output qa/standalone/scrub/osd-scrub-repair: force osdmap prop to osds qa/standalone/scrub/osd-scrub-test: wait longer for update qa/tasks/ceph_manager: capture stderr for COT qa/suites/rados/ceph: drop opensuse for now mon/MonClient: send logs to mon on separate schedule than pings mgr/dashboard: Fix missing ImageSpec usage mgr/dashboard: Allow removing RBD with snapshots mgr/dashboard: Refactor and cleanup tasks.mgr.dashboard.test_user mgr/dashboard: support multiple DriveGroups when creating OSDs mon/MonClient: send logs to mon even if we have no keelalive2 cephadm: flag dashboard user to change password Reviewed-by: Sebastian Wagner <swagner@suse.com>	2020-03-11 17:38:59 -05:00
Neha Ojha	6117a0d4db	Merge pull request #33281 from ideepika/wip-set-osd-pool-size-extra-param-check mon/OSDMonitor: add flag `--yes-i-really-mean-it` for setting pool size 1 Reviewed-by: Greg Farnum <gfarnum@redhat.com> Reviewed-by: Kefu Chai <kchai@redhat.com> Reviewed-by: Neha Ojha <nojha@redhat.com>	2020-03-09 19:14:50 -07:00
Sage Weil	3212932ba1	Merge PR #33809 into octopus * refs/pull/33809/head: qa/standalone/scrub/osd-scrub-repair: force osdmap prop to osds qa/standalone/scrub/osd-scrub-test: wait longer for update Reviewed-by: David Zafman <dzafman@redhat.com>	2020-03-09 15:28:19 -05:00
Deepika Upadhyay	21508bd9dd	mon/OSDMonitor: add flag `--yes-i-really-mean-it` for setting pool size 1 Adds option `mon_allow_pool_size_one` which will be disabled by default to ensure pools are not configured without replicas. If the user still wants to use pool size 1, they will have to change the value of `mon_allow_pool_size_one` to true and then have to pass flag `--yes-i-really-mean-it` to cli command: Example: `ceph osd pool test set size 1 --yes-i-really-mean-it` Fixes: https://tracker.ceph.com/issues/44025 Signed-off-by: Deepika Upadhyay <dupadhya@redhat.com>	2020-03-09 23:27:36 +05:30
Sage Weil	0447ed0ff9	qa/standalone/scrub/osd-scrub-repair: force osdmap prop to osds flush_pg_stats isn't sufficient to ensure that OSDs have the latest OSDMap. Signed-off-by: Sage Weil <sage@redhat.com>	2020-03-08 14:52:10 -05:00
Sage Weil	ac9befd450	qa/standalone/scrub/osd-scrub-test: wait longer for update Fixes: https://tracker.ceph.com/issues/43865 Signed-off-by: Sage Weil <sage@redhat.com>	2020-03-08 14:45:00 -05:00
David Zafman	e509b7c7d0	test: Add flush_pg_stats to avoid race with getting num_shards_repaired Fixes: https://tracker.ceph.com/issues/44439 Signed-off-by: David Zafman <dzafman@redhat.com>	2020-03-06 04:25:37 +00:00
Kefu Chai	c6088bdd26	Merge pull request #33593 from dzafman/wip-cot-fix test: Fix failing ceph_objectstore_tool.py test Reviewed-by: Neha Ojha <nojha@redhat.com> Reviewed-by: Kefu Chai <kchai@redhat.com>	2020-03-02 18:58:19 +08:00
Kefu Chai	7b0e18c09e	Merge pull request #33566 from dzafman/wip-44296 test: Expect being off by up to 2 and make sure all PGs are active+clean Reviewed-by: Neha Ojha <nojha@redhat.com> Reviewed-by: Kefu Chai <kchai@redhat.com>	2020-02-28 11:42:47 +08:00
David Zafman	08f7e7980f	test: Fix failing ceph_objectstore_tool.py test The -N option to vstart.sh was removed, use -k Old hinfo_key binary happen to be utf-8 decodable, now it throws an exception trying to decode it. Use new option to ceph-objectstore-tool to treat stdout as a terminal and convert binary data to base64. Signed-off-by: David Zafman <dzafman@redhat.com>	2020-02-27 18:14:36 -08:00
David Zafman	49d9c7d664	test: Expect being off by up to 2 and make sure all PGs are active+clean Fixes: https://tracker.ceph.com/issues/44296 Signed-off-by: David Zafman <dzafman@redhat.com>	2020-02-27 18:12:25 -08:00
David Zafman	587cd64207	Merge pull request #32342 from dzafman/wip-43126 mon: Improvements to slow heartbeat health messages Reviewed-by: Sage Weil <sage@redhat.com>	2020-02-25 17:42:00 -08:00
Sage Weil	4d42b4c5a0	common/TextTable: default to 2 spaces separating columns This is what other projects and libraries default to, and it is more legible. Signed-off-by: Sage Weil <sage@redhat.com>	2020-02-23 15:46:30 -06:00
Sage Weil	5afec0fbfb	Merge PR #33091 into master * refs/pull/33091/head: qa/suites/rados: disable device scraping qa/standalone/ceph-helpers: disable device monitoring qa/tasks/ceph.py: add pre-mgr-commands option for ceph task mgr/devicehealth: set default monitoring to 'on' Reviewed-by: Sage Weil <sage@redhat.com>	2020-02-22 12:05:55 -06:00
xie xingguo	023524a26d	osd/PeeringState: restart peering on any previous down acting member coming back One of our customers wants to verify the data safety of Ceph during scaling the cluster up, and the test case looks like: - keep checking the status of a speficied pg, who's up is [1, 2, 3] - add more osds: up [1, 2, 3] -> up [1, 4, 5], acting = [1, 2, 3], backfill_targets = [4, 5], pg is remapped - stop osd.2: up [1, 4, 5], acting = [1, 3], backfill_targets = [4, 5], pg is undersized - restart osd.2, acting will stay unchanged as 2 belongs to neither current up nor acting set, hence leaving the corresponding pg pinning undersized for a long time until all backfill targets completes It does not pose any critical problem -- we'll end up getting that pg back into active + clean, except that the long live DEGRADED warnings keep bothering our customer who cares about data safety more than any thing else. The right way to achieve the above goal is for: boost::statechart::result PeeringState::Active::react(const MNotifyRec& notevt) to check whether the newly booted node could be validly chosen for the acting set and request a new temp mapping. The new temp mapping would then trigger a real interval change that will get rid of the DEGRADED warning. Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn> Signed-off-by: Yan Jun <yan.jun8@zte.com.cn>	2020-02-21 17:52:52 +08:00
Sage Weil	455cdcf89a	qa/standalone/ceph-helpers: disable device monitoring Signed-off-by: Sage Weil <sage@redhat.com>	2020-02-19 15:31:26 -06:00
Sage Weil	f10cc22c60	Merge PR #32961 into master * refs/pull/32961/head: qa/standalone/osd/osd-bench: debug bluestore Reviewed-by: Neha Ojha <nojha@redhat.com>	2020-01-30 10:42:17 -06:00
Sage Weil	b99e506a3f	qa/standalone/osd/osd-bench: debug bluestore Looking for https://tracker.ceph.com/issues/43888 Signed-off-by: Sage Weil <sage@redhat.com>	2020-01-29 07:43:41 -06:00
David Zafman	e18519ad09	test: Update pg log test for new trimming behavior Fixes: https://tracker.ceph.com/issues/43864 Signed-off-by: David Zafman <dzafman@redhat.com>	2020-01-28 15:23:45 -08:00
Neha	b20817795a	qa/standalone/osd/osd-backfill-recovery-log.sh: fix TEST_backfill_log_2 Fixes: https://tracker.ceph.com/issues/43807 Signed-off-by: Neha Ojha <nojha@redhat.com>	2020-01-24 22:42:04 +00:00
Neha	994698277b	qa/standalone/osd/osd-backfill-recovery-log.sh: fix TEST_backfill_log_1 Fixes: https://tracker.ceph.com/issues/43807 Signed-off-by: Neha Ojha <nojha@redhat.com>	2020-01-24 22:20:21 +00:00
Sage Weil	76ea774c10	qa/standalone/misc/ok-to-stop: improve test Make sure PGs peer (simply flushing state to mon isn't enough). Fixes: https://tracker.ceph.com/issues/43721 Signed-off-by: Sage Weil <sage@redhat.com>	2020-01-20 13:24:30 -06:00
Sage Weil	78ec6aec90	qa/standalone/ceph-helpers: add wait_for_peered Signed-off-by: Sage Weil <sage@redhat.com>	2020-01-20 13:23:56 -06:00
Sage Weil	c5710bc8fb	Merge PR #32628 into master * refs/pull/32628/head: test: Fix wait_for_state() to wait for a PG to get into a state Reviewed-by: Neha Ojha <nojha@redhat.com>	2020-01-18 14:39:19 -06:00
Sage Weil	65fbc620b6	qa/standalone/mon/osd-create-pool: fix utf-8 grep LANG This needs en_US.UTF-8... en_US does not work. Fixes: https://tracker.ceph.com/issues/43422 Signed-off-by: Sage Weil <sage@redhat.com>	2020-01-17 14:19:53 -06:00
David Zafman	886475b5fe	mon: Improvements to slow heartbeat health messages Include crush parentage for each osd Fixes: https://tracker.ceph.com/issues/43126 Signed-off-by: David Zafman <dzafman@redhat.com>	2020-01-14 18:06:44 +00:00
David Zafman	9f7aabbe9f	test: Fix wait_for_state() to wait for a PG to get into a state To avoid confusion fix function names in osd-backfill-space.sh for how they actually work. Fixes: https://tracker.ceph.com/issues/43592 Signed-off-by: David Zafman <dzafman@redhat.com>	2020-01-13 18:39:38 -08:00
David Zafman	c65d5c8d14	test: Sort pool list because the order isn't guaranteed from "balancer pool ls" Signed-off-by: David Zafman <dzafman@redhat.com>	2020-01-06 21:35:19 -08:00
David Zafman	b0a1b758d0	mgr: Change default upmap_max_deviation to 5 Fixes: https://tracker.ceph.com/issues/43312 Signed-off-by: David Zafman <dzafman@redhat.com>	2020-01-06 21:35:19 -08:00
David Zafman	8e46bbbf36	test: Fix test case for pool based balancing instead of rule batched Signed-off-by: David Zafman <dzafman@redhat.com>	2020-01-06 21:35:19 -08:00
Sage Weil	acd4f5bc43	qa/standalone: python -> python3 Signed-off-by: Sage Weil <sage@redhat.com>	2019-12-20 13:33:21 -06:00
Sage Weil	e47526e152	qa/standalone/special/ceph_objectstore_tool: python3 Signed-off-by: Sage Weil <sage@redhat.com>	2019-12-20 13:32:53 -06:00
Thomas Bechtold	0127cd1e88	qa: Enable flake8 tox and fix failures There were a couple of problems found by flake8 in the qa/ directory (most of them fixed now). Enabling flake8 during the usual check runs hopefully avoids adding new issues in the future. Signed-off-by: Thomas Bechtold <tbechtold@suse.com>	2019-12-12 10:21:01 +01:00
Sage Weil	137fa64e12	qa: rename ceph-daemon tests -> cephadm Also move the workunit to a better location. Signed-off-by: Sage Weil <sage@redhat.com>	2019-12-11 19:14:09 -06:00

1 2 3 4 5 ...

496 Commits