Commit Graph

206 Commits

Author SHA1 Message Date
Sage Weil
0adc99511f qa/suites: add upgrade/nautilus-x-singleton and symlink from rados/upgrade
Signed-off-by: Sage Weil <sage@redhat.com>
2019-04-02 16:35:25 -05:00
Sage Weil
7fbc0f6e2c qa/suites: move rados/upgrade/mimic-x-singleton to upgrade/ and symlink
Signed-off-by: Sage Weil <sage@redhat.com>
2019-04-02 16:33:38 -05:00
Sage Weil
f3499b10b3 qa/suites/upgrade/nautilus-x: misc updates
- start with msgr2 enabled (defaults)
- run nautilus branch for workunits
- drop msgr2 enable step at teh end
- add octopus placeholder (although it is empty for now)

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-22 16:38:27 -05:00
Kefu Chai
1231a306ec qa/suites: do not test luminous-x upgrade path
in this change,

* suites/rados/upgrade: luminous-x-singleton => mimic-x-singleton
* suites/upgrade: luminous-x => nautilus-x

we support upgrade from n to n+2 release. otherwise monitor refuses to
do so:

mon.a@-1(probing) e1 current monmap has min_mon_release 15 (luminous)
which is >2 releases older than me 15 (octopus), stopping.

Fixes: https://tracker.ceph.com/issues/38845
Signed-off-by: Kefu Chai <kchai@redhat.com>
2019-03-22 22:39:42 +08:00
Kefu Chai
6f1fd44bbc
Merge pull request #27011 from liewegas/wip-redirect-mimic
qa/suites/upgrade/mimic-x/parallel: enable all classes

Reviewed-by: Myoungwon Oh <omwmw@sk.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
2019-03-18 01:22:44 +08:00
Sage Weil
b71a766d19 qa/suitse/upgrade/luminous-x/stress-split-erasure-code: disable c-o-t tests
The stress-split thrasher already had this off, but the ec variant did
not.  We don't support ceph-objectstore-tool exports/imports between major
versions.

Fixes: http://tracker.ceph.com/issues/38294
Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-17 04:59:36 -05:00
Sage Weil
bfd0581c5f qa/suites/upgrade/mimic-x/parallel: enable all classes
This fixes the redirect tiering tests, which need the 'cas' class.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-16 15:27:29 -05:00
Sage Weil
a376a151ea qa/suites/upgrade/mimic-x/parallel: run master rados/test.sh
We rename ceph_test_rados_api_tier to add _pp, so the mimic version doesn't
work.  And in any case, at this stage the client host has master installed.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-03-06 16:50:12 -06:00
Sage Weil
d198831c44 Merge PR #26262 into master
* refs/pull/26262/head:
	mgr/orch/ssh: add qa test
	qa: install/exclude ceph-mgr-ssh package
	debian: package ssh orchestrator
	spec.ceph: package ssh orchestrator
	ssh/orch: add ssh orchestrator
2019-02-20 13:05:11 -06:00
Noah Watkins
a2e8960079 qa: install/exclude ceph-mgr-ssh package
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
2019-02-20 09:45:48 -08:00
Mykola Golub
04cc884df0 qa/suites/upgrade/luminous-x: force clone v1 format for final rbd python test
Fixes: https://tracker.ceph.com/issues/38346
Signed-off-by: Mykola Golub <mgolub@suse.com>
2019-02-18 16:23:13 +00:00
Sage Weil
19be155c3e qa/suites/upgrade/mimic-x/stress-split: drop pglog_hardlimit test
Signed-off-by: Sage Weil <sage@redhat.com>
2019-02-15 16:06:52 -06:00
Sage Weil
20beafab36 qa/suites/upgrade/mimic-x/stress-split: update for msgr2
Signed-off-by: Sage Weil <sage@redhat.com>
2019-02-15 16:06:52 -06:00
Sage Weil
6207c0b3f4 qa/suites/upgrade/mimic-x/parallel: update for msgr v2
Signed-off-by: Sage Weil <sage@redhat.com>
2019-02-15 16:06:52 -06:00
Kefu Chai
9e561e9fb4
Merge pull request #26380 from tchaikov/wip-split-dashboard-package
deb,rpm,qa: split dashboard package

Reviewed-by: Tim Serong <tserong@suse.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2019-02-14 16:59:58 +08:00
Sage Weil
b20862c846 qa/suites/upgrade/luminous-x/parallel: enable all classes
Otherwise it's annoying because the class list changes between luminous and nautilus,
and we don't want to futz around with changing this setting during the upgrade.

The problematic classes are 'cas' (added) and 'sdk' (not enabled by default but
included by the cls/ workunit.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-02-13 14:35:18 -06:00
Sage Weil
525f3f7e34 qa/suites/upgrade/luminous-x/parallel/5-final-workload/rados_mon_thrash: use x branch
The luminous version is (1) not what we want and (2) will fail because
ceph_test_rados_api_tier no longer exists in master.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-02-13 14:35:18 -06:00
Sage Weil
28546e065d qa/suites/upgade/luminous-x: pglog_hardlimit succeeds now on luminous due to backport
Signed-off-by: Sage Weil <sage@redhat.com>
2019-02-13 11:41:21 -06:00
Sage Weil
5947146423 qa/suites/upgrade/luminous-x/stress-split: mons on separate hosts, enable msgr2
Signed-off-by: Sage Weil <sage@redhat.com>
2019-02-13 11:41:21 -06:00
Sage Weil
d260ba9f84 qa/suites/upgrade/luminous-x/parallel: mon per host, msgr2
- mon per host
- enable msgr2 at the end
- include the nautilus.yaml

Signed-off-by: Sage Weil <sage@redhat.com>
2019-02-13 11:41:21 -06:00
Kefu Chai
3d128d040e qa: exclude new packages when installing old versions
ceph-mgr-dashboard is introduced in nautilus, so exclude it when
installing old versions.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2019-02-13 21:37:30 +08:00
Sage Weil
d93abd8c53 qa/suites/upgrade/luminous-x: whitelist 'slow request'
The luminous warnings say slow request.

Signed-off-by: Sage Weil <sage@redhat.com>
2019-02-13 07:20:51 -06:00
Sage Weil
5cbdf32430 qa/suites: exclude new packages when installing old versions
Signed-off-by: Sage Weil <sage@redhat.com>
2019-02-08 12:59:22 -06:00
Matt Benjamin
d4c4082cca
Merge pull request #25774 from yehudasa/wip-rgw-tiering-3
rgw: rados tiering
2019-01-22 12:42:21 -05:00
Yehuda Sadeh
83f7af473b qa/ragweed: change the way we determine which ragweed branch to use
- if force-branch, use that
 - otherwise:
    - read default-branch from client config
    - use suite branch or ceph branch if suite branch is not defined
    - if this branch is one of official releases (or master), prefix
      it with 'ceph-'

try to clone branch specified above, if failed (branch doesn't exist probably)
and not force-branch, use default-branch.
Also add an option to override ragweed repo.

Switched all force-branch from ragweed qa suite to default-branch.

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
2019-01-09 11:02:57 -08:00
Neha Ojha
c0da26505f qa/suites/upgrade/*-x/stress-split: set pglog_hardlimit flag
Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-01-07 09:42:54 -08:00
Neha Ojha
24c3e2d669 qa/suites/upgrade/luminous-x: add pg log settings
Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-01-07 09:42:54 -08:00
Yuri Weinstein
45af678d3d qa/tests: added pg log settings to mimic-x
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-01-07 09:42:22 -08:00
Sage Weil
251f667ef8 Merge PR #25009 into master
* refs/pull/25009/head:
	librbd: stringify locker name with get_legacy_str()
	osdc/Objecter: fix list_watchers addr rendering to match legacy
	test/crimson: disable unittest_seastar_messenger test
	msg/msg_types: encode entity_addr_t TYPE_ANY as TYPE_LEGACY for pre-nautilus
	client: make blacklist detection handle TYPE_ANY entries
	mon/OSDMonitor: maintain compat output for 'blacklist ls'
	client: maintain compat for {inst,addr}_str in status dump
	qa/tasks/ceph_manager: compare osd flush seq #'s as ints
	qa/suites/fs: make use of simple.yaml where appropriate
	qa/msgr: move msgr factet into generic re-usable dir
	crimson: fix monmap build for seastar
	doc/start/ceph.conf: trim the sample ceph.conf file
	doc/rados/operations: only describe --public-{addr,network} method for adding mons
	PendingReleaseNotes: deprecate 'mon addr'
	doc: fix some 'mon addr' references
	doc/rados/configuration: fix some 'mon addr' references
	doc/rados/configuration/network-config-ref: revise network docs somewhat
	doc/rados/configuration/network-config-ref: remove totally obsolete section
	qa/suites/rados: replace mon_seesaw.py task with a small bash script
	qa/suites/fs/upgrade: don't bind to v2 addrs
	qa/tasks/mon_thrash: avoid 'mon addr' in mon section
	mon/MonClient: disable ms_bind_msgr2 if NAUTILUS feature not set
	osd/OSDMap: maintain compat addr fields
	msg/msg_types: add get_legacy_str()
	mds/MDSMap.h: maintain compat addr field
	mon/MgrMap: maintain compat active_addr field
	mon/MonClient: reconnect to mon if it's addrvec appears to have changed
	qa/tasks/ceph.conf.template: increase mon_mgr_mkfs_grace
	msg/async/ProtocolV2: fill in IP for all peer_addrs
	msg/async: print all addrs on debug lines
	mon/MonMap: no noname- mon name prefix when for_mkfs
	ceph-monstore-tool: print initial monmap
	msg/async/ProtocolV2: advertise ourselves as a v2 addr when using v2 protocol
	msg/async: assert existing protocol matches current protocol
	msg/async: add missing modelines
	mon/MonMap: add missing modeline
	vstart.sh: put mon addrs in mon_host, not 'mon addr'
	msg/async: better debug around conn map lookups and updates
	mon/MonClient: dump initial monmap at debug level 10
	qa/standalone/osd/osd-fast-mark-down: use v1 addr w/ simplemessenger
	qa/tasks/ceph: set initial monmap features with using addrvec addrs
	monmaptool: add --enable-all-features option
	qa/tasks/ceph: only use monmaptool --addv if addr has [,:v]
	qa/tasks/ceph_manager: make get_mon_status use mon addr
	qa/tasks/ceph: keep mon addrs in ctx namespace
	mon/OSDMonitor: log all osd addrs on boot
	msg/simple: behave when v2 and v1 addrs are present at target
	mon/MonClient: warn if global_id changes
	msg/Connection: add warning/note on get_peer_global_id
	mds/MDSDaemon: clean up handle_mds_map debug output a bit
	qa/suites/rados/upgrade: debug mds
	mds/MDSRank: improve is_stale_message to handle addrvecs
	msg/async: make loopback detect when sending to one of our many addrs
	qa/suites/rados/upgrade: no aggressive pg num changes
	mon/OSDMonitor: require nautilus mons for require_osd_release=nautilus
	mon/OSDMonitor: require mimic mons for require_osd_release=mimic
	qa/suites/rados/thrash-old-clients: use legacy addr syntax in ceph.conf
	msg/async: preserve peer features when replacing a connection
	qa/tasks/ceph.py: move methods from teuthology.git into ceph.py directly; support mon bind * options
	mon/MonMap: adjust build_initial behavior for mkfs vs probe
	mon/MonMap: improve ambiguous addr behavior
	qa/suites/rados/upgrade: spread mons a bit
	qa/rados/thrash-old-clients: keep mons on separate hosts
	qa/standalone/mon/misc.sh: tweak test to be more robust
	qa/tasks/mon_seesaw: expect v1/v2 prefix in addr
	osd/OSDMap: fix is_blacklisted() check to assume type ANY
	mon/OSDMonitor: use ANY addr type for blacklisting
	mon/msg_types: TYPE_V1ORV2 -> TYPE_ANY
	qa/workunits/cephtool: fix blacklist test
	qa/suites/upgrade: install old version with only v1 addrs
	common/options: by default, bind to both msgr v1 and v2 addresses
	vstart.sh: add --msgr1, --msgr2, --msgr21 options
	msg/async/ProtocolV2: be flexible with server identity check
	msg/msg_types: fix entity_addrvec_t::parse() with null end arg
	qa/suites/rados/basic/msgr: no msgr2 addrs in initial monmaps
	qa/tasks/ceph: add 'mon_bind_addrvec' and 'mon_bind_msgr2' options
	monmaptool: add --addv argument to pass in addrvec directly
	qa/suites/rados/basic/msgr: do not use msgr2 with simplemessenger
	qa/suites/rados/basic/msgr: async is not experimental
	messages/MOSDBoot: fix compat with pre-nautilus
	mon/MonMap: allow v1 or v2 to be explicitly specified along with part
	msg/msg_types: allow parsing of IPs without assuming v1 vs v2
	msg/msg_types: default parse to v2 addrs
	msg: standarize on v1: and v2: prefixes for *all* entity_addr_t's
	vstart.sh: use msgr2 by default
	mon/MonMap: remove get_addr() methods
	ceph-mon: adjust startup/bind/join sequence to use addrs
	mon: use MonMap::get_addrs() (instead of get_addr())
	mon/MonClient: change pending_cons to addrvec-based map
	mon/MonMap: fix set_addr() caller, kill wrapper
	mon/MonMap: remove addr-based add()
	monmaptool: fix --add to do either legacy or msgr2+legacy
	monmaptool: clean up iterator use a bit
	mon/MonMap: handle ambiguous mon addrs by trying both legacy and msgr
	mon/MonMap: take addrvec for set_initial_members
	mon/MonMap: use addrvecs for test instances
	mon: pass addrvec via MMonJoin
	mon/MonmapMonitor: fix 'mon add' to populate addrvec
	mon/MonMap: addr -> addrvec
	msg/async/ProtocolV2: only update socket_addr if we learned our addr
	osd: go active even if mon only accepted our v1 addr
	test/msgr: add test for msgr2 protocol
	msg/async/ProtocolV2: share socket_addr and all addrs during handshake
	msg/async: print socket_addr for the connection
	msg/async: msgr2 protocol placeholder
	msg/async: move ProtocolV1 class to its own source file
	msg/async: keep listen addr in ServerSocket, pass to new connections
	msg/async/AsyncMessenger: fix set_addr_unknowns

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2019-01-04 13:42:09 -06:00
Neha Ojha
a74129d26d qa/suites/upgrade/mimic-x: fix rhel runs
The following fragment was required for rhel on ovh
overrides:
    ansible.cephlab:
      skip_tags: entitlements,packages,repos

Since, this suite runs on smithi in our nightlies, we should not need
this.

Signed-off-by: Neha Ojha <nojha@redhat.com>
2019-01-03 13:39:30 -08:00
Sage Weil
0692d06979 qa/suites/upgrade: install old version with only v1 addrs
v1+v2 support is new in nautilus.

Signed-off-by: Sage Weil <sage@redhat.com>
2018-12-21 15:31:32 -06:00
Sage Weil
09a8e5bce0 qa/suites/upgrade/mimic-x: add missing .qa
Signed-off-by: Sage Weil <sage@redhat.com>
2018-12-18 08:17:31 -06:00
Sage Weil
5612b6714c qa/suites/upgrade: fix wrt librados3
Signed-off-by: Sage Weil <sage@redhat.com>
2018-12-17 13:25:34 -06:00
Sage Weil
9ee3ce1ecd Merge PR #25345 into master
* refs/pull/25345/head:
	qa/suites: fix bluestore links
	qa/objectstore: bluestore -> bluestore-{bitmap,stupid}

Reviewed-by: Gregory Farnum <gfarnum@redhat.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2018-12-06 08:42:04 -06:00
Sage Weil
269910fc8b qa/suites: fix bluestore links
Signed-off-by: Sage Weil <sage@redhat.com>
2018-12-05 10:30:14 -06:00
Kefu Chai
ae36a61446 qa/suites: add librados2 to "extra_packages" for upgrade tests
we use the playbook of "testnodes.yml" defined by ceph-cm-ansible for
initializing test nodes, and the role of "testnode" is used by
testnodes.yml. "testnode" requires "qemu-system-x86" or "qemu-kvm"
package to be installed. the qemu in turn depends on librbd1 and
librados2.

before librados3 was introduced, this worked perfectly. because in ceph
repo, qa/packages/packages.yaml defines the default set of packages the
"install" tasks should install. and in that yaml file, librados2 was
listed. so the package management system will overwrite the librados2
installed by ansible playbook with the version specified by the
"install" task, as apt/yum thinks this is what user requires explicitly,
so it's fine to install a different version of librados2.

after librados3 was introduced, librados2 was removed from
qa/packages/packages.yaml. because, by default, we need to install
librados3 instead of librados2 for ready a nautilus cluster. but the
problem is, the packge list also applies to "install" tasks installing
releases before nautilus, where we still need to replace the librados2
installed by ansible.

so, to address this issue, "librados2" is added to "extra_packages" of
the "install" tasks of tests installing old releases to install
librados2 explicitly instead of as a dependency of other ceph packages
like librbd1.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-11-15 11:56:24 +08:00
Kefu Chai
f10fc004bc qa: add librados3 to exclude_packages for ugprade tests
Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-11-10 07:17:24 -08:00
Kefu Chai
b4a5f76243 qa: add librados3 to exclude list of pre-nautilus install tasks
Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-11-02 00:15:31 +08:00
Casey Bodley
d897b92878 osd: remove statelog from osd_class_load_list config
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2018-09-19 10:32:55 -04:00
Sage Weil
1ebafdb65f
Merge pull request #23292 from yuriw/wip-yuriw-25140-master
qa/tests: added 1st draft of mimic-x suite
2018-07-30 14:55:41 -05:00
Yuri Weinstein
baa4d0ea78
Merge pull request #23305 from smithfarm/wip-cleanup-upgrade
qa/upgrade: cleanup for nautilus

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
2018-07-30 09:01:03 -07:00
Yuri Weinstein
e6f21c1aa3 qa/tests: added 1st draft of mimic-x suite
Fixes: https://tracker.ceph.com/issues/25140
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2018-07-30 08:41:18 -07:00
Nathan Cutler
17d9b5be4d qa/upgrade: cleanup for nautilus
Drop unused suites, which ATM means all of them except upgrade/luminous-x
which recently got a cleanup in https://github.com/ceph/ceph/pull/23162

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2018-07-29 19:56:53 +02:00
Nathan Cutler
4432b16673 tests: upgrade/luminous-x: fix order of final-workload directory
This is just a cosmetic fix.

Signed-off-by: Nathan Cutler <ncutler@suse.com>
2018-07-21 20:30:38 +02:00
Nathan Cutler
2555524f1f tests: upgrade/luminous-x: whitelist REQUEST_SLOW for rados_mon_thrash
Fixes: http://tracker.ceph.com/issues/25051
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2018-07-21 11:31:20 +02:00
Patrick Donnelly
b39f9d06dc
qa: fix symlinks indirectly pointing at qa to .qa
Building on the previous commit.

Command used:

$ find suites/ -type l -and -not -name .qa -execdir ~/fix.sh {} \;

fix.sh:
    #!/bin/bash

    link="$(readlink "$1")"

    echo $link
    dirlink="$(dirname "$link")"
    baselink="$(basename "$link")"

    while true; do
        echo $dirlink
        if [ "$dirlink" -ef ~/ceph/qa ]; then
            ln -nsf ".qa/$baselink" "$1"
            exit
        else
            baselink="$(basename "$dirlink")/$baselink"
            dirlink="$(dirname "$dirlink")"
            if [ "$dirlink" -ef . ]; then
                break
            fi
        fi
    done

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-06-26 11:48:38 -07:00
Patrick Donnelly
716db6e2fd
qa: add .qa helper link
This utilizes the recent feature in teuthology [1] to skip hidden files in
suites when building the job matrix.

Idea of this change is to enable referring to the top-level qa directory in a
position-independent way such that copies of a suite to another location do not
break any symlinks.

[1] https://github.com/ceph/teuthology/pull/1185

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2018-06-26 11:33:48 -07:00
Yuri Weinstein
37ac8df555 qa/tests: changed disto symlink to point to new way using supporeted OS'es
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2018-06-12 13:49:55 -07:00
Sage Weil
2a6f1633b1
Merge pull request #22158 from liewegas/wip-24222
mon/MgrMonitor: change 'unresponsive' message to info level

Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: John Spray <john.spray@redhat.com>
2018-05-23 06:42:48 -05:00
Sage Weil
4ddbbe561c mon/MgrMonitor: change 'unresponsive' message to info level
We generate a MGR_DOWN health warning at the appropriate points; having
this at WRN level just triggers failed teuthology runs but doesn't much
value for the user.

Clear out teuthology whitelisting for this message.

Fixes: http://tracker.ceph.com/issues/24222
Signed-off-by: Sage Weil <sage@redhat.com>
2018-05-22 16:55:03 -05:00
David Zafman
918921ab2f test: Need to escape parens in log-whitelist for grep
Signed-off-by: David Zafman <dzafman@redhat.com>
2018-05-21 09:47:59 -07:00
Sage Weil
59fe5dc5ce qa/suites/upgrade/luminous-x: whitelist SLOW_OPS
Signed-off-by: Sage Weil <sage@redhat.com>
2018-05-18 14:01:19 -05:00
Sage Weil
5a5e74e89c qa/suites/upgrade/luminous-x: more cinder volumes
Signed-off-by: Sage Weil <sage@redhat.com>
2018-05-18 14:01:01 -05:00
Sage Weil
6ae51aed13 Merge remote-tracking branch 'gh/mimic' 2018-05-10 12:18:29 -05:00
Casey Bodley
7da0fe2832
Merge pull request #21680 from cbodley/wip-rm-replica-log
rgw: remove all traces of cls replica_log

Reviewed-by: Orit Wasserman <owasserm@redhat.com>
2018-05-10 10:26:55 -04:00
Sage Weil
dd702b9fad qa/suites/upgarde/luminous-x/stress-split: 8 osds (not 6)
Signed-off-by: Sage Weil <sage@redhat.com>
2018-05-04 08:42:29 -05:00
Casey Bodley
82423b4a6e qa: add ragweed to luminous-x upgrade suite
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2018-05-03 15:20:23 -04:00
Sage Weil
1522c9fd1b qa/suites/upgrade/stress-split/4-workloads/radosbench: use less disk space
Signed-off-by: Sage Weil <sage@redhat.com>
2018-05-01 06:58:52 -05:00
Casey Bodley
f9ee48caa2 rgw: remove all traces of cls replica_log
replica log was for the old radosgw sync agent, which was replaced with
multisite v2 in jewel. no sense in continuing to maintain and test it

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2018-04-26 11:40:11 -04:00
Nathan Cutler
bc9b2e6c89 tests: cleanup: drop upgrade/jewel-x/point-to-point-x
This subsuite is only for testing upgrades from one jewel
point release to another. In luminous and master it serves no
useful purpose.

Fixes: http://tracker.ceph.com/issues/22888
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2018-02-01 23:53:29 +01:00
Kefu Chai
e7097593a7 qa/standalone: remove osd-map-max-advance related tests
this setting was removed in 8967b73

Fixes: http://tracker.ceph.com/issues/22596
Signed-off-by: Kefu Chai <kchai@redhat.com>
2018-01-06 19:40:15 +08:00
Vasu Kulkarni
67ff2de694 qa/tests: update upgrade tests from jewel->luminous to jewel->luminous->master
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-12-07 19:11:42 -08:00
Kefu Chai
3aaf5d24f1 qa: disable mon-health-to-clog in upgrade test
otherwise the cluster will complain that the quorum size is smaller than
that of monmap.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-12-01 01:15:59 +08:00
Nathan Cutler
afc5a7d320 qa: point-to-point-x: upgrade client.1 to -x along with cluster nodes
The client.1 rgw in workload_x had not been upgraded to -x.

Fixes: http://tracker.ceph.com/issues/21499
Signed-off-by: Nathan Cutler <ncutler@suse.com>
2017-09-22 10:07:28 +02:00
Vasu Kulkarni
71ea4987e0 Add kraken to luminous upgrade case
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:57 -07:00
Vasu Kulkarni
526d4aef1c Add kernel tar/untar and systemd task
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:57 -07:00
Vasu Kulkarni
9c85c76a90 Jewel to luminous upgrade scenario using ceph-deploy
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
2017-09-07 12:22:56 -07:00
Josh Durgin
317a8bae5e Merge pull request #17160 from ceph/wip-luminous-x-master
qa: add upgrade/luminous-x suite

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-09-01 14:19:48 -07:00
Yuri Weinstein
aff342627c Initial check in
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2017-09-01 08:56:19 -07:00
Sage Weil
bf296018ff qa/suites/upgrade/jewel-x/parallel: tolerate laggy mgr
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-24 10:30:01 -04:00
Sage Weil
d5d5d7d1d2 qa/suites/upgrade/jewel-x/stress-split: tolerate sloppy past_intervals
This is harmless in general, esp during upgrade.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-24 10:23:22 -04:00
Yuri Weinstein
304b492187 Initial check in luminous-x suite
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
2017-08-23 14:53:55 -07:00
Sage Weil
5455f599b3 qa/suites/upgrade/jewel-x/parallel: tolerate OBJECT_MISPLACED
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-23 14:24:00 -04:00
Sage Weil
2504ab1675 qa/suites/upgarde/jewel-x/parallel: tolerate mgr warning
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-23 14:22:34 -04:00
Kefu Chai
17d6e96a6f Merge pull request #16967 from liewegas/wip-upgrade-health
mon: fix legacy health checks in 'ceph status' during upgrade; fix jewel-x upgrade combo

Reviewed-by: Kefu Chai <kchai@redhat.com>
2017-08-16 19:44:36 +08:00
Sage Weil
dd2fb6c40b Merge pull request #16944 from liewegas/wip-kraken-x
mon/Elector: force election epoch bump on start

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
2017-08-10 11:12:43 -05:00
Sage Weil
c46bdf5efd Revert "qa/suites/upgrade/jewel-x/parallel: thrash layout"
This reverts commit 435777dbff.

This test combination is not yet stable.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-10 09:51:29 -04:00
Sage Weil
a0b9f37dbc qa/suites/upgrade/jewel-x/parallel: no loadgenbig
When we do the thrashing this leads to ENOSPC on smithi.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-10 09:44:17 -04:00
Sage Weil
435777dbff qa/suites/upgrade/jewel-x/parallel: thrash layout
We can't kill and restart osds because that will interfere with
the upgrade process.  We can, however, thrash the layout by
tweaking osd weights and so on.  This will exercise osd recovery
paths during the upgrade that aren't normally exercised (outside
of stress-split..which doesn't upgrade individual osds while they
are non-clean).

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-09 22:07:48 -04:00
Sage Weil
b61be07d45 qa/suites/upgrade/kraken-x/stress-split: more whitelisting
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-09 13:58:55 -04:00
Sage Weil
bbd5fe354c qa/suites/upgarde/jewel-x/point-to-point-x: disable app warnings
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-09 09:18:54 -04:00
Sage Weil
bf29142b08 qa/suites/upgrade/kraken-x/stress-split*: whitelist
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-07 21:36:58 -04:00
Sage Weil
2234a0ed11 qa/suites/upgrade/kraken-x/parallel: whitelist
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-07 21:36:58 -04:00
Sage Weil
3e7d157871 qa/suites/upgrade/jewel-x/parallel: fix POOL_APP_NOT_ENABLED disable
This code runs on the mgr.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-07 15:12:10 -04:00
Sage Weil
ed2d984ad1 qa/suites/upgarde/jewel-x/parallel: more whitelisting
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-06 10:04:14 -04:00
Sage Weil
58f15d2b98 qa/suites/upgrade/jewel-x/parallel: more whitelisting
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-06 09:56:55 -04:00
Sage Weil
622e950e43 qa/suites/upgrade/*-x/parallel: whitelist more stuff
Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-06 09:56:55 -04:00
Sage Weil
2d260443f0 qa/suites/upgrade/*/parallel: disable POOL_APP_NOT_ENABLED
There is some other random workload running (that creates pools)
while we upgrade and wait for healthy.  Just disable the warning
for these tests.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-06 09:56:55 -04:00
Sage Weil
f4c2863999 qa/suites/upgrade/jewel-x/parallel: whitelist OSD_DOWN
We restart OSDs during the upgrade.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-08-06 09:56:55 -04:00
Patrick Donnelly
d4ed085238
Merge PR #16713 into master
* refs/remotes/upstream/pull/16713/head:
	qa: ignore failed MDS message during upgrade
2017-08-02 19:41:42 -07:00
Kefu Chai
d12c51ca91 qa/suites: escape the parenthesis of the whitelist text
so we can avoid the warnings like

grep: Unmatched ( or \(

because we pass the whitelisted string to `egrep -v "$1"` directly.

Signed-off-by: Kefu Chai <kchai@redhat.com>
2017-08-01 21:54:44 +08:00
Patrick Donnelly
5e5ff5c086
qa: ignore failed MDS message during upgrade
The cluster is expected to become degraded during reboot.

Fixes: http://tracker.ceph.com/issues/20731
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2017-07-31 14:45:07 -07:00
Sage Weil
e398fd4ee4 qa/suites: more whitelisting
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-27 09:31:24 -04:00
Sage Weil
29549e6834 Merge pull request #13723 from ovh/bp-forced-recovery
osd/PG: make prioritized recovery possible

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2017-07-24 09:01:03 -05:00
John Spray
343e1a4281 qa: update whitelist for "wrongly marked me down"
Signed-off-by: John Spray <john.spray@redhat.com>
2017-07-24 14:54:46 +01:00
Sage Weil
27e8d75f61 Merge pull request #16429 from liewegas/wip-jewel-x
qa/suites/upgrade/jewel-x: misc fixes for new health checks
2017-07-20 10:47:05 -05:00
Piotr Dałek
b0134cc7a8 qa: add force/cancel recovery/backfill to QA testing
This randomly issues pg force-recovery/force-backfill and
pg cancel-force-recovery/cancel-force-backfill during QA
testing. Disabled for upgrades from hammer, jewel and kraken.

Signed-off-by: Piotr Dałek <piotr.dalek@corp.ovh.com>
2017-07-20 09:35:55 +02:00
Jason Dillaman
fa90be842e test: enable pool applications for new pools
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
2017-07-19 13:13:01 -04:00
Sage Weil
7102de8761 qa/suites/upgrade/jewel-x/point-to-point: move set-require-min-compat-client
Do it after workload completes and all jewel clients go away.

Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-18 12:32:17 -04:00
Sage Weil
e2fdfc0b10 qa/suites/upgrade/jewel-x: link to thrashosds yaml
Signed-off-by: Sage Weil <sage@redhat.com>
2017-07-18 12:29:01 -04:00