=============== Release Notes =============== v12.1.0 Luminous (RC) ===================== This is the first release candidate for Luminous, the next long term stable release. Ceph Luminous (v12.2.0) will be the foundation for the next long-term stable release series. There have been major changes since Kraken (v11.2.z) and Jewel (v10.2.z), and the upgrade process is non-trivial. Please read these release notes carefully. Major Changes from Kraken ------------------------- - *General*: * Ceph now has a simple, built-in web-based dashboard for monitoring cluster status. FIXME DOCS. - *RADOS*: * *BlueStore*: - The new *BlueStore* backend for *ceph-osd* is now stable and the new default for newly created OSDs. BlueStore manages data stored by each OSD by directly managing the physical HDDs or SSDs without the use of an intervening file system like XFS. This provides greater performance and features. FIXME DOCS - BlueStore supports *full data and metadata checksums* of all data stored by Ceph. - BlueStore supports inline compression using zlib, snappy, or LZ4. (Ceph also supports zstd for RGW compression but zstd is not recommended for BlueStore for performance reasons.) FIXME DOCS * *Erasure coded* pools now have full support for *overwrites*, allowing them to be used with RBD and CephFS. `Read more about EC overwrites`_. * *ceph-mgr*: - There is a new daemon, *ceph-mgr*, which is a required part of any Ceph deployment. Although IO can continue when *ceph-mgr* is down, metrics will not refresh and some metrics-related calls (e.g., ``ceph df``) may block. We recommend deploying several instances of *ceph-mgr* for reliability. See the notes on `Upgrading`_ below. - The *ceph-mgr* daemon includes a REST-based management API. The API is still experimental and somewhat limited but will form the basis for API-based management of Ceph going forward. FIXME DOCS * The overall *scalability* of the cluster has improved. We have successfully tested clusters with up to 10,000 OSDs. * Each OSD can now have a *device class* associated with it (e.g., `hdd` or `ssd`), allowing CRUSH rules to trivially map data to a subset of devices in the system. Manually writing CRUSH rules or manual editing of the CRUSH is normally not required. FIXME DOCS * You can now *optimize CRUSH weights* can now be optimized to maintain a *near-perfect distribution of data* across OSDs. FIXME DOCS * There is also a new `upmap` exception mechanism that allows individual PGs to be moved around to achieve a *perfect distribution* (this requires luminous clients). FIXME DOCS * Each OSD now adjusts its default configuration based on whether the backing device is an HDD or SSD. Manual tuning generally not required. * The prototype *mclock QoS queueing algorithm* is now available. FIXME DOCS * There is now a *backoff* mechanism that prevents OSDs from being overloaded by requests to objects or PGs that are not currently able to process IO. * There is a *simplified OSD replacement process* that is more robust. FIXME DOCS * You can query the supported features and (apparent) releases of all connected daemons and clients with ``ceph features``. FIXME DOCS * You can configure the oldest Ceph client version you wish to allow to connect to the cluster via ``ceph osd set-require-min-compat-client`` and Ceph will prevent you from enabling features that will break compatibility with those clients. FIXME DOCS * Several `sleep` settings, include ``osd_recovery_sleep``, ``osd_snap_trim_sleep``, and ``osd_scrub_sleep`` have been reimplemented to work efficiently. (These are used in some cases to work around issues throttling background work.) - *RGW*: * RGW *metadata search* backed by ElasticSearch now supports end user requests service via RGW itself, and also supports custom metadata fields. A query language a set of RESTful APIs were created for users to be able to search objects by their metadata. New APIs that allow control of custom metadata fields were also added. * RGW now supports *dynamic bucket index sharding*. As the number of objects in a bucket grows, RGW will automatically reshard the bucket index in response. No user intervention or bucket size capacity planning is required. * RGW introduces *server side encryption* of uploaded objects with three options for the management of encryption keys: automatic encryption (only recommended for test setups), customer provided keys similar to Amazon SSE-C specification, and through the use of an external key management service (Openstack Barbican) similar to Amazon SSE-KMS specification. * RGW now has preliminary AWS-like bucket policy API support. For now, policy is a means to express a range of new authorization concepts. In the future it will be the foundation for additional auth capabilities such as STS and group policy. * RGW has consolidated the several metadata index pools via the use of rados namespaces. - *RBD*: * RBD now has full, stable support for *erasure coded pools* via the new ``--data-pool`` option to ``rbd create``. * RBD mirroring's rbd-mirror daemon is now highly available. We recommend deploying several instances of rbd-mirror for reliability. * The default 'rbd' pool is no longer created automatically during cluster creation. Additionally, the name of the default pool used by the rbd CLI when no pool is specified can be overridden via a new ``rbd default pool = `` configuration option. * Initial support for deferred image deletion via new ``rbd trash`` CLI commands. Images, even ones actively in-use by clones, can be moved to the trash and deleted at a later time. * New pool-level ``rbd mirror pool promote`` and ``rbd mirror pool demote`` commands to batch promote/demote all mirrored images within a pool. * Mirroring now optionally supports a configurable replication delay via the ``rbd mirroring replay delay = `` configuration option. * Improved discard handling when the object map feature is enabled. * rbd CLI ``import`` and ``copy`` commands now detect sparse and preserve sparse regions. * Images and Snapshots will now include a creation timestamp - *CephFS*: * *Multiple active MDS daemons* is now considered stable. The number of active MDS servers may be adjusted up or down on an active CephFS file system. * CephFS *directory fragmentation* is now stable and enabled by default on new filesystems. To enable it on existing filesystems use "ceph fs set allow_dirfrags". Large or very busy directories are sharded and (potentially) distributed across multiple MDS daemons automatically. * Directory subtrees can be explicitly pinned to specific MDS daemons in cases where the automatic load balancing is not desired or effective. - *Miscellaneous*: * Release packages are now being built for *Debian Stretch*. The distributions we build for now includes: - CentOS 7 (x86_64 and aarch64) - Debian 8 Jessie (x86_64) - Debian 9 Stretch (x86_64) - Ubuntu 16.04 Xenial (x86_64 and aarch64) - Ubuntu 14.04 Trusty (x86_64) Note that QA is limited to CentOS and Ubuntu (xenial and trusty). * *CLI changes*: - The ``ceph -s`` or ``ceph status`` command has a fresh look. - ``ceph {osd,mds,mon} versions`` summarizes versions of running daemons. - ``ceph {osd,mds,mon} count-metadata `` similarly tabulates any other daemon metadata visible via the ``ceph {osd,mds,mon} metadata`` commands. - ``ceph features`` summarizes features and releases of connected clients and daemons. - ``ceph osd require-osd-release `` replaces the old ``require_RELEASE_osds`` flags. - ``ceph osd pg-upmap``, ``ceph osd rm-pg-upmap``, ``ceph osd pg-upmap-items``, ``ceph osd rm-pg-upmap-items`` can explicitly manage `upmap` items (FIXME DOCS). - ``ceph osd getcrushmap`` returns a crush map version number on stderr, and ``ceph osd setcrushmap [version]`` will only inject an updated crush map if the version matches. This allows crush maps to be updated offline and then reinjected into the cluster without fear of clobbering racing changes (e.g., by newly added osds or changes by other administrators). - ``ceph osd create`` has been replaced by ``ceph osd new``. This should be hidden from most users by user-facing tools like `ceph-disk`. - ``ceph osd destroy`` will mark an OSD destroyed and remove its cephx and lockbox keys. However, the OSD id and CRUSH map entry will remain in place, allowing the id to be reused by a replacement device with minimal data rebalancing. - ``ceph osd purge`` will remove all traces of an OSD from the cluster, including its cephx encryption keys, dm-crypt lockbox keys, OSD id, and crush map entry. - ``ceph osd ls-tree `` will output a list of OSD ids under the given CRUSH name (like a host or rack name). This is useful for applying changes to entire subtrees. For example, ``ceph osd down `ceph osd ls-tree rack1```. - ``ceph osd {add,rm}-{noout,noin,nodown,noup}`` allow the `noout`, `nodown`, `noin`, and `noup` flags to be applied to specific OSDs. - ``ceph log last [n]`` will output the last *n* lines of the cluster log. - ``ceph mgr dump`` will dump the MgrMap, including the currently active ceph-mgr daemon and any standbys. - ``ceph mgr module ls`` will list active ceph-mgr modules. - ``ceph mgr module {enable,disable} `` will enable or disable the named mgr module. The module must be present in the configured `mgr_module_path` on the host(s) where `ceph-mgr` is running. - ``ceph osd crush swap-bucket `` will swap the contents of two CRUSH buckets in the hierarchy while preserving the buckets' ids. This allows an entire subtree of devices to be replaced (e.g., to replace an entire host of FileStore OSDs with newly-imaged BlueStore OSDs) without disrupting the distribution of data across neighboring devices. - ``ceph osd set-require-min-compat-client `` configures the oldest client release the cluster is required to support. Other changes, like CRUSH tunables, will fail with an error if they would violate this setting. Changing this setting also fails if clients older than the specified release are currently connected to the cluster. - ``ceph config-key dump`` dumps config-key entries and their contents. (The existing ``ceph config-key list`` only dumps the key names, not the values.) - ``ceph osd set-{full,nearfull,backfillfull}-ratio`` sets the cluster-wide ratio for various full thresholds (when the cluster refuses IO, when the cluster warns about being close to full, when an OSD will defer rebalancing a PG to itself, respectively). - ``ceph osd reweightn`` will specify the `reweight` values for multiple OSDs in a single command. This is equivalent to a series of ``ceph osd reweight`` commands. - ``ceph osd crush class {create,rm,ls,rename}`` manage the new CRUSH *device class* feature. ``ceph crush set-device-class [...]`` will set the class for particular devices. - ``ceph osd crush rule create-replicated`` replaces the old ``ceph osd crush rule create-simple`` command to create a CRUSH rule for a replicated pool. Notably it takes a `class` argument for the *device class* the rule should target (e.g., `ssd` or `hdd`). - ``ceph mon feature ls`` will list monitor features recorded in the MonMap. ``ceph mon feature set`` will set an optional feature (none of these exist yet). - ``ceph tell help`` will now return a usage summary. .. _Read more about EC overwrites: ../rados/operations/erasure-code/#erasure-coding-with-overwrites Major Changes from Jewel ------------------------ - *RADOS*: * We now default to the AsyncMessenger (``ms type = async``) instead of the legacy SimpleMessenger.  The most noticeable difference is that we now use a fixed sized thread pool for network connections (instead of two threads per socket with SimpleMessenger). * Some OSD failures are now detected almost immediately, whereas previously the heartbeat timeout (which defaults to 20 seconds) had to expire.  This prevents IO from blocking for an extended period for failures where the host remains up but the ceph-osd process is no longer running. * The size of encoded OSDMaps has been reduced. * The OSDs now quiesce scrubbing when recovery or rebalancing is in progress. - *RGW*: * RGW now supports the S3 multipart object copy-part API. * It is possible now to reshard an existing bucket offline. Offline bucket resharding currently requires that all IO (especially writes) to the specific bucket is quiesced. (For automatic online resharding, see the new feature in Luminous above.) * RGW now supports data compression for objects. * Civetweb version has been upgraded to 1.8 * The Swift static website API is now supported (S3 support has been added previously). * S3 bucket lifecycle API has been added. Note that currently it only supports object expiration. * Support for custom search filters has been added to the LDAP auth implementation. * Support for NFS version 3 has been added to the RGW NFS gateway. * A Python binding has been created for librgw. - *RBD*: * The rbd-mirror daemon now supports replicating dynamic image feature updates and image metadata key/value pairs from the primary image to the non-primary image. * The number of image snapshots can be optionally restricted to a configurable maximum. * The rbd Python API now supports asynchronous IO operations. - *CephFS*: * libcephfs function definitions have been changed to enable proper uid/gid control. The library version has been increased to reflect the interface change. * Standby replay MDS daemons now consume less memory on workloads doing deletions. * Scrub now repairs backtrace, and populates `damage ls` with discovered errors. * A new `pg_files` subcommand to `cephfs-data-scan` can identify files affected by a damaged or lost RADOS PG. * The false-positive "failing to respond to cache pressure" warnings have been fixed. Upgrade from Jewel or Kraken ---------------------------- .. _Upgrading: #. Ensure that the ``sortbitwise`` flag is enabled:: # ceph osd set sortbitwise #. Make sure your cluster is stable and healthy (no down or recoverying OSDs). (Optional, but recommended.) #. Do not create any new erasure-code pools while upgrading the monitors. #. Set the ``noout`` flag for the duration of the upgrade. (Optional but recommended.):: # ceph osd set noout #. Upgrade monitors by installing the new packages and restarting the monitor daemons. Note that, unlike prior releases, the ceph-mon daemons *must* be upgraded first.:: # systemctl restart ceph-mon.target Verify the monitor upgrade is complete once all monitors are up by looking for the ``luminous`` feature string in the mon map. For example:: # ceph mon feature ls should include `luminous` under persistent features:: on current monmap (epoch NNN) persistent: [kraken,luminous] required: [kraken,luminous] #. Add or restart ``ceph-mgr`` daemons. If you are upgrading from kraken, upgrade packages and restart ceph-mgr daemons with:: # systemctl ceph-mgr.target If you are upgrading from kraken, you may already have ceph-mgr daemons deployed. If not, or if you are upgrading from jewel, you can deploy new daemons with tools like ceph-deploy or ceph-ansible. For example,:: # ceph-deploy mgr create HOST Verify the ceph-mgr daemons are running by checking ``ceph -s``:: # ceph -s ... services: mon: 3 daemons, quorum foo,bar,baz mgr: foo(active), standbys: bar, baz ... #. Upgrade all OSDs by installing the new packages and restarting the ceph-osd daemons on all hosts.:: # systemctl ceph-osd.target You can monitor the progress of the OSD upgrades with the new ``ceph osd versions`` command.:: # ceph osd versions { "ceph version 12.2.0 (...) luminous (stable)": 12, "ceph version 10.2.6 (...)": 3, } #. Upgrade all CephFS daemons by upgrading packages and restarting daemons on all hosts.:: # systemctl restart ceph-mds.target #. Upgrade all radosgw daemons by upgrading packages and restarting daemons on all hosts.:: # systemctl restart radosgw.target #. Complete the upgrade by disallowing pre-luminous OSDs:: # ceph osd require-osd-release luminous If you set ``noout`` at the beginning, be sure to clear it with:: # ceph osd unset noout #. Verify the cluster is healthy with ``ceph health``. Upgrading from pre-Jewel releases (like Hammer) ----------------------------------------------- You *must* first upgrade to Jewel (10.2.z) before attempting an upgrade to Luminous. Upgrade compatibility notes, Kraken to Luminous ----------------------------------------------- * We no longer test the FileStore ceph-osd backend in combination with ``btrfs``. We recommend against using btrfs. If you are using btrfs-based OSDs and want to upgrade to luminous you will need to add the follwing to your ceph.conf:: enable experimental unrecoverable data corrupting features = btrfs The code is mature and unlikely to change, but we are only continuing to test the Jewel stable branch against btrfs. We recommend moving these OSDs to FileStore with XFS or BlueStore. * The ``ruleset-*`` properties for the erasure code profiles have been renamed to ``crush-*`` to (1) move away from the obsolete 'ruleset' term and to be more clear about their purpose. There is also a new optional ``crush-device-class`` property to specify a CRUSH device class to use for the erasure coded pool. Existing erasure code profiles will be converted automatically when upgrade completes (when the ``ceph osd require-osd-release luminous`` command is run) but any provisioning tools that create erasure coded pools may need to be updated. * When assigning a network to the public network and not to the cluster network the network specification of the public network will be used for the cluster network as well. In older versions this would lead to cluster services being bound to 0.0.0.0:, thus making the cluster service even more publicly available than the public services. When only specifying a cluster network it will still result in the public services binding to 0.0.0.0. * In previous versions, if a client sent an op to the wrong OSD, the OSD would reply with ENXIO. The rationale here is that the client or OSD is clearly buggy and we want to surface the error as clearly as possible. We now only send the ENXIO reply if the osd_enxio_on_misdirected_op option is enabled (it's off by default). This means that a VM using librbd that previously would have gotten an EIO and gone read-only will now see a blocked/hung IO instead. * The "journaler allow split entries" config setting has been removed. - *librados*: * Some variants of the omap_get_keys and omap_get_vals librados functions have been deprecated in favor of omap_get_vals2 and omap_get_keys2. The new methods include an output argument indicating whether there are additional keys left to fetch. Previously this had to be inferred from the requested key count vs the number of keys returned, but this breaks with new OSD-side limits on the number of keys or bytes that can be returned by a single omap request. These limits were introduced by kraken but are effectively disabled by default (by setting a very large limit of 1 GB) because users of the newly deprecated interface cannot tell whether they should fetch more keys or not. In the case of the standalone calls in the C++ interface (IoCtx::get_omap_{keys,vals}), librados has been updated to loop on the client side to provide a correct result via multiple calls to the OSD. In the case of the methods used for building multi-operation transactions, however, client-side looping is not practical, and the methods have been deprecated. Note that use of either the IoCtx methods on older librados versions or the deprecated methods on any version of librados will lead to incomplete results if/when the new OSD limits are enabled. * The original librados rados_objects_list_open (C) and objects_begin (C++) object listing API, deprecated in Hammer, has finally been removed. Users of this interface must update their software to use either the rados_nobjects_list_open (C) and nobjects_begin (C++) API or the new rados_object_list_begin (C) and object_list_begin (C++) API before updating the client-side librados library to Luminous. Object enumeration (via any API) with the latest librados version and pre-Hammer OSDs is no longer supported. Note that no in-tree Ceph services rely on object enumeration via the deprecated APIs, so only external librados users might be affected. The newest (and recommended) rados_object_list_begin (C) and object_list_begin (C++) API is only usable on clusters with the SORTBITWISE flag enabled (Jewel and later). (Note that this flag is required to be set before upgrading beyond Jewel.) - *CephFS*: * When configuring ceph-fuse mounts in /etc/fstab, a new syntax is available that uses "ceph.=" in the options column, instead of putting configuration in the device column. The old style syntax still works. See the documentation page "Mount CephFS in your file systems table" for details. * CephFS clients without the 'p' flag in their authentication capability string will no longer be able to set quotas or any layout fields. This flag previously only restricted modification of the pool and namespace fields in layouts. * CephFS will generate a health warning if you have fewer standby daemons than it thinks you wanted. By default this will be 1 if you ever had a standby, and 0 if you did not. You can customize this using ``ceph fs set standby_count_wanted ``. Setting it to zero will effectively disable the health check. * The "ceph mds tell ..." command has been removed. It is superceded by "ceph tell mds. ..." Notable Changes since Kraken ---------------------------- build consolidated list before final release Notable Changes since v12.0.3 ----------------------------- * bluestore: ceph-disk: add --filestore argument, default to --bluestore (`pr#15437 `_, Loic Dachary, Sage Weil) * bluestore,core: os/bluestore: fix warning (`pr#15435 `_, Sage Weil) * bluestore,core: os/bluestore: improve mempool usage (`pr#15402 `_, Sage Weil) * bluestore,core: os/bluestore: write "mkfs_done" into disk only if we pass fsck() tests (`pr#15238 `_, xie xingguo) * bluestore,core: os: remove experimental status for BlueStore (`pr#15177 `_, Sage Weil) * bluestore: os/bluestore: better debug output on unsharing blobs (`issue#20227 `_, `pr#15746 `_, Sage Weil) * bluestore: os/bluestore/BlockDevice: support pmem device as bluestore backend (`pr#15102 `_, Jianpeng Ma) * bluestore: os/bluestore/BlueFS: Rebuild memcopy for bufferlist::page_aligned_app… (`pr#15728 `_, Jianpeng Ma, Sage Weil) * bluestore: os/bluestore/BlueFS: .slow should be compared with dirname (`pr#15595 `_, zhanglei) * bluestore: os/bluestore/BlueStore: no device no symlink. (`pr#15721 `_, Jianpeng Ma) * bluestore: os/bluestore: cleanup bluestore_types (`pr#15680 `_, xie xingguo) * bluestore: os/bluestore: configure rocksdb cache via bluestore_cache_kv_ratio (`pr#15580 `_, Sage Weil) * bluestore: os/bluestore: fix a typo about bleustore (`pr#15357 `_, Dongsheng Yang) * bluestore: os/bluestore: fix BitMapAllocator assert on out-of-bound hint value (`pr#15289 `_, Igor Fedotov) * bluestore: os/bluestore: fix buffers pinned by indefinitely deferred writes (`pr#15398 `_, Sage Weil) * bluestore: os/bluestore: fix false assert in IOContext::aio_wake (`pr#15268 `_, Igor Fedotov) * bluestore: os/bluestore: fix false asserts in Cache::trim_all() (`pr#15470 `_, xie xingguo) * bluestore: os/bluestore: fix fsck deferred_replay (`pr#15295 `_, Sage Weil) * bluestore: os/bluestore: fix possible out of order shard(offset == 0); add sanity check (`pr#15658 `_, xie xingguo) * bluestore: os/bluestore: fix potential access violation (`pr#15657 `_, xie xingguo) * bluestore: os/bluestore: fix unsharing blob dirty_range args (`issue#20227 `_, `pr#15766 `_, Sage Weil) * bluestore: os/bluestore: handle rounding error in cache ratios (`pr#15672 `_, Sage Weil) * bluestore: os/bluestore: initialize finishers properly (`pr#15666 `_, xie xingguo) * bluestore: os/bluestore/KernelDevice: fix comments (`pr#15264 `_, xie xingguo) * bluestore: os/bluestore/KernelDevice: helpful warning when aio limit exhausted (`pr#15116 `_, Sage Weil) * bluestore: os/bluestore: move sharedblob to new collection in same shard (`issue#20358 `_, `pr#15783 `_, Sage Weil) * bluestore: os/bluestore: refactor BlueStore::_do_write; kill dead ExtentMap::find_lextent() method (`pr#15750 `_, xie xingguo) * bluestore: os/bluestore: remove unused variables (`pr#15718 `_, zhanglei) * bluestore: os/bluestore: stop calculating bound if we must reshard; narrow shard combination condition (`pr#15631 `_, xie xingguo) * bluestore: os/bluestore: target_bytes should scale with meta/data ratios. (`pr#15708 `_, Mark Nelson) * bluestore,performance: os/bluestore: avoid overloading extents during reshard; atomic deferred_batch_ops (`pr#15502 `_, xie xingguo) * bluestore,performance: os/bluestore: batch throttle (`pr#15284 `_, Jianpeng Ma) * bluestore,performance: os/bluestore: keep statfs replica in RAM to avoid expensive KV retrieval (`pr#15309 `_, Igor Fedotov) * bluestore,performance: os/bluestore/KernelDevice: fix sync write vs flush (`pr#15034 `_, Sage Weil) * bluestore,performance: os/bluestore: move cache_trim into MempoolThread (`pr#15380 `_, xie xingguo) * bluestore,performance: os/bluestore: put bluefs in the middle of the shared device (`pr#14873 `_, Sage Weil) * bluestore,performance: os/bluestore: separate kv_sync_thread into two parts (`pr#14035 `_, Jianpeng Ma, Igor Fedotov, Sage Weil) * bluestore,performance: os/bluestore: try to unshare blobs for EC overwrite workload (`pr#14239 `_, Sage Weil) * bluestore,tests: qa/objectstore/bluestore*: fsck on mount (`pr#15785 `_, Sage Weil) * bluestore,tests: test/unittest_bluefs: When fsync ret is less than 0, fsync can not be… (`pr#15365 `_, shiqi) * bluestore: wrap blob id when it reaches maximum value of int16_t (`issue#19555 `_, `pr#15654 `_, Xiaoyan Li) * build/ops: 12.0.3 (`pr#15600 `_, Jenkins Build Slave User) * build/ops: build: move bash_completion.d/ceph to ceph-common (`pr#15148 `_, Leo Zhang) * build/ops: build: remove ceph-disk-udev entirely (`pr#15259 `_, Leo Zhang) * build/ops: build: revert -Wvla from #15342 (`pr#15469 `_, Willem Jan Withagen) * build/ops: build: Use .S suffix for ppc64le assembly files (`issue#20106 `_, `pr#15373 `_, Andrew Solomon) * build/ops: ceph-detect-init: detect init system by poking the system (`issue#19884 `_, `pr#15043 `_, Kefu Chai) * build/ops,common: build: Adds C++ warning flag for C Variable-Length Arrays. (`pr#15342 `_, Jesse Williamson) * build/ops,common: common/blkdev.cc: propagate get_device_by_fd to different OSes (`pr#15547 `_, Willem Jan Withagen) * build/ops: conditionalize rgw Beast frontend so it isn't built on s390x architecture (`issue#20048 `_, `pr#15225 `_, Willem Jan Withagen, Nathan Cutler, Kefu Chai, Tim Serong, Casey Bodley) * build/ops,core: common/freebsd_errno.cc: fix missing ; (`pr#15741 `_, Willem Jan Withagen) * build/ops,core: erasure-code: update ec_isa version + add missing AVX512 ISA-L sources (`pr#15636 `_, Ganesh Mahalingam, Tushar Gohad) * build/ops,core,tests: osd/dmclock/testing: reorganize testing, building now optional (`pr#15375 `_, J. Eric Ivancich) * build/ops: debian: ceph-mgr: fix package description (`pr#15513 `_, Fabian Grünbichler) * build/ops: debian: sync logrotate packaging with downstream (`issue#19938 `_, `pr#15567 `_, Fabian Grünbichler) * build/ops: do_cmake.sh: enable ccache if installed (`pr#15274 `_, Sage Weil) * build/ops: drop libfcgi build dependency (`pr#15285 `_, Nathan Cutler) * build/ops: .gitignore: exclude rpm files (`pr#15745 `_, Leo Zhang) * build/ops: install-deps.sh: workaround setuptools' dependency on six (`pr#15406 `_, Kefu Chai) * build/ops: miscellaneous cleanups and fixes (run-make-check.sh, ceph.spec.in) (`issue#20091 `_, `issue#20127 `_, `pr#15399 `_, Nathan Cutler) * build/ops,rbd,tests: test/librbd: decouple ceph_test_librbd_api from libceph-common (`issue#20175 `_, `pr#15611 `_, Kefu Chai) * build/ops: rpm: apply epoch only if %epoch macro is defined (`pr#15286 `_, Nathan Cutler) * build/ops: rpm: make librbd1 %post scriptlet depend on coreutils (`issue#20052 `_, `pr#15231 `_, Giacomo Comes, Nathan Cutler) * build/ops: rpm: move _epoch_prefix below Epoch definition (`pr#15417 `_, Nathan Cutler) * build/ops: rpm: move RDMA and python-prettytables build dependencies to distro-conditional section (`pr#15200 `_, Nathan Cutler) * build/ops: rpm: package COPYING, move sample ceph.conf to ceph-common (`pr#15596 `_, Nathan Cutler) * build/ops: selinux: Allow read on var_run_t (`issue#16674 `_, `pr#15523 `_, Boris Ranto) * build/ops: selinux: Do parallel relabel on package install (`issue#20077 `_, `pr#14871 `_, Boris Ranto) * build/ops: selinux: Install ceph-base before ceph-selinux (`issue#20184 `_, `pr#15490 `_, Boris Ranto) * build/ops: Set subman cron attributes in spec file (`issue#20074 `_, `pr#15270 `_, Thomas Serlin) * build/ops: The Clangtastic Mr. Clocks (`pr#15186 `_, Adam C. Emerson) * build/ops: yasm-wrapper: filter -pthread (`pr#15249 `_, Alessandro Barbieri) * cephfs: #17980: MDS client blacklisting and blacklist on eviction (`issue#17980 `_, `issue#9754 `_, `pr#14610 `_, John Spray) * cephfs: ceph: simplify CInode::maybe_export_pin() (`pr#15106 `_, "Yan, Zheng") * cephfs: client: avoid returning negative space available (`issue#20178 `_, `pr#15481 `_, John Spray) * cephfs: client: call the lru_remove() twice,when trim cache (`pr#15662 `_, huanwen ren) * cephfs: client: check for luminous MDS before sending FLUSH_MDLOG (`pr#15805 `_, John Spray) * cephfs: client: fix Dentry::dump (`pr#15779 `_, huanwen ren) * cephfs: client: fix display ino in the ldout (`pr#15314 `_, huanwen ren) * cephfs: client/inode: fix the dump type of Inode::dump() (`pr#15198 `_, huanwen ren) * cephfs,common,rbd: blkin: librbd trace hooks (`pr#15053 `_, Victor Araujo, Jason Dillaman) * cephfs: mds/MDBalancer: remove useless check_targets and hit_targets logic from MDS balancer (`issue#20131 `_, `pr#15407 `_, Zhi Zhang) * cephfs,mgr: pybind/mgr/fsstatus: use mds_mem.dn as dentry counter (`pr#15255 `_, Zhi Zhang) * cephfs: mon/FSCommand: fix indentation (`pr#15423 `_, Sage Weil) * cephfs: mon/MDSMonitor: respect mds_standby_for_rank config (`pr#15129 `_, "Yan, Zheng") * cephfs: osdc/Journaler: avoid executing on_safe contexts prematurely (`issue#20055 `_, `pr#15240 `_, "Yan, Zheng") * cephfs: osdc/Journaler: fix memory leak in Journaler::_issue_read() (`issue#20338 `_, `pr#15776 `_, "Yan, Zheng") * cephfs: osdc/Objecter: fix inflight_ops update (`pr#15768 `_, "Yan, Zheng") * cephfs: qa/cephfs: disable mds_bal_frag for TestStrays.test_purge_queue_op_rate (`issue#19892 `_, `pr#15105 `_, "Yan, Zheng") * cephfs: qa/tasks/cephfs: use getattr to guarantee inode is in client cache (`issue#19912 `_, `pr#15062 `_, "Yan, Zheng") * cephfs: qa: update log whitelists for kcephfs suite (`pr#14922 `_, "Yan, Zheng") * cephfs,tests: ceph-object-corpus: mark MMDSSlaveRequest incompat change (`pr#15730 `_, Sage Weil) * cephfs,tests: qa: fix float parse error in test_fragment (`pr#15122 `_, Patrick Donnelly) * cephfs,tests: qa: misc cephfs test improvements (`issue#20131 `_, `pr#15411 `_, John Spray) * cephfs,tests: qa: silence upgrade test failure (`issue#19934 `_, `pr#15126 `_, Patrick Donnelly) * cephfs,tests: qa: simplify TestJournalRepair (`pr#15096 `_, John Spray) * cephfs: tools/cephfs: remove `apply` mode of cephfs-journal-tool (`pr#15715 `_, John Spray) * cleanup: src: put-to operator function - const input cleanup (`issue#3977 `_, `pr#15364 `_, Jos Collin) * cmake: Add -finstrument-functions flag to OSD code (`pr#15055 `_, Mohamad Gebai) * cmake: build boost as an external project (`pr#15376 `_, Kefu Chai) * cmake: check the existence of gperf before using it (`pr#15164 `_, Kefu Chai) * cmake: do not add dependencies to INTERFACE library on cmake < 3.3 (`pr#15813 `_, Kefu Chai) * cmake: do not link libcommon against some libs (`pr#15340 `_, Willem Jan Withagen) * cmake: exclude \*.css while generating ctags (`pr#15663 `_, Leo Zhang) * cmake: fix boost components for WITH_SYSTEM_BOOST (`pr#15160 `_, Bassam Tabbara) * cmake: improved build speed by 5x when using ccache (`pr#15147 `_, Bassam Tabbara) * cmake: link against fcgi only if enabled (`pr#15425 `_, Yao Zongyou) * cmake: misc fixes for build on i386 (`pr#15516 `_, James Page) * cmake: rgw: do not link against boost in a wholesale (`pr#15347 `_, Nathan Cutler, Kefu Chai) * cmake: workaound ccache issue with .S assembly files (`pr#15142 `_, Bassam Tabbara) * common: add ceph::size() (`pr#15181 `_, Kefu Chai) * common: ceph_osd: remove client message cap limit (`pr#14944 `_, Haomai Wang) * common: cls: optimize header file dependency (`pr#15165 `_, Brad Hubbard, Xiaowei Chen) * common: cmdparse: more constness (`pr#15023 `_, Kefu Chai) * common: common/ceph_context: 'config diff get' option added (`pr#10736 `_, Daniel Oliveira) * common: common/ceph_context: fix leak of registered commands on exit (`pr#15302 `_, xie xingguo) * common: common/freebsd_errno.cc: fixed again a stupid typo (`pr#15742 `_, Willem Jan Withagen) * common: common/iso_8601.cc: Make return expression Clang compatible (`pr#15336 `_, Willem Jan Withagen) * common: common/LogEntry: include EntityName in log entries (`pr#15395 `_, Sage Weil) * common: common,osdc: remove atomic_t completely (`pr#15562 `_, Kefu Chai) * common: common/perf_counters: add average time for PERFCOUNTER_TIME (`pr#15478 `_, xie xingguo) * common: common/perf_counters: make schema more friendly and update docs (`pr#14933 `_, Sage Weil) * common: common,test: migrate atomic_t to std::atomic (`pr#14866 `_, Jesse Williamson) * common: config_opt: use bool instead of int for the default value of filestore_debug_omap_check (`pr#15651 `_, Leo Zhang) * common,core: ceph_test_rados_api_misc: fix LibRadosMiscConnectFailure.ConnectFailure retry (`issue#19901 `_, `pr#15522 `_, Sage Weil) * common: core/common: Fix ENODATA for FreeBSD with compat.h (`issue#19883 `_, `pr#15685 `_, Willem Jan Withagen) * common,core: osd/OSDMap: make osd_state 32 bits wide (`pr#15390 `_, Sage Weil) * common,core: osd/OSDMap: replace require_*_osds flags with a single require_osd_release field (`pr#15068 `_, Sage Weil) * common,core: osd/OSDMap: replace string-based min_compat_client with a CEPH_RELEASE_* uint8_t (`pr#15351 `_, Sage Weil) * common: crc32c: include acconfig.h to fix ceph_crc32c_aarch64() (`pr#15515 `_, Kefu Chai) * common: crush/CrushWrapper: fix has_incompat_choose_args (`pr#15218 `_, Sage Weil) * common: crush/CrushWrapper: fix has_incompat_choose_args() (`pr#15244 `_, Sage Weil) * common: denc: add encode/decode for basic_sstring (`pr#15135 `_, Kefu Chai, Casey Bodley) * common: fix typo in option of rados_mon_op_timeout's comment (`pr#15681 `_, Leo Zhang) * common: get_process_name: use getprogname on bsd systems (`pr#15338 `_, Mykola Golub) * common: Improved CRC calculation for zero buffers (`pr#11966 `_, Adam Kupczyk) * common: include/lru.h: add const to member functions (`pr#15408 `_, yonghengdexin735) * common: include/rados: Fix typo in rados_ioctx_cct() doc (`pr#15220 `_, Jos Collin) * common: include: Redo some includes for FreeBSD (`issue#19883 `_, `pr#15337 `_, Willem Jan Withagen) * common: initialize _hash in LogEntryKey() (`pr#15615 `_, Jos Collin) * common: int_types.h: remove hacks to workaround old systems (`pr#15069 `_, Kefu Chai) * common: librados,libradosstriper,test: migrate atomic_t to std::atomic (baragon) (`pr#14658 `_, Jesse Williamson) * common: libradosstriper: Add example code (`pr#15350 `_, Logan Blyth) * common: mempool: improve dump; fix buffer accounting bugs (`pr#15403 `_, Sage Weil) * common: mgr/PyFormatter: implement dump_format_va (`pr#15634 `_, Sage Weil) * common,mon: messenger,client,compressor: migrate atomic_t to std::atomic (`pr#14657 `_, Jesse Williamson) * common,mon: mon,crush: add 'osd crush swap-bucket' command (`pr#15072 `_, Sage Weil) * common: msg/async: add assert of ms_async_op_threads > 0 (`pr#15629 `_, linbing) * common,performance: buffer: allow buffers to be accounted in arbitrary mempools (`pr#15352 `_, Sage Weil) * common,performance: crc32c: Add ppc64le fast zero optimized assembly. (`pr#15100 `_, Andrew Solomon) * common,performance: inline_memory: optimized mem_is_zero for non-x64 (`pr#15307 `_, Piotr Dałek) * common,performance: kv/rocksdb: supports SliceParts interface (`pr#15058 `_, Haomai Wang) * common,performance: osd/OSDMap: make pg_temp more efficient (`pr#15291 `_, Sage Weil) * common,rdma: msg/async/rdma: automatically set RDMAV_HUGEPAGES_SAFE according to conf (`pr#15755 `_, DanielBar-On) * common: Remove redundant includes - 2 (`issue#19883 `_, `pr#15169 `_, Jos Collin) * common: Remove redundant includes - 3 (`issue#19883 `_, `pr#15204 `_, Jos Collin) * common: Remove redundant includes - 4 (`issue#19883 `_, `pr#15251 `_, Jos Collin) * common: Remove redundant includes - 5 (`issue#19883 `_, `pr#15267 `_, Jos Collin) * common: Remove redundant includes - 6 (`issue#19883 `_, `pr#15299 `_, Jos Collin) * common: Remove redundant includes (`issue#19883 `_, `pr#15042 `_, Brad Hubbard) * common: Remove redundant includes (`issue#19883 `_, `pr#15086 `_, Jos Collin) * common: src/common/ceph_string: stringify new osd states (`pr#15751 `_, xie xingguo) * common,tests: ceph_test_rados_api_list: more fix LibRadosListNP.ListObjectsError (`issue#19963 `_, `pr#15138 `_, Sage Weil) * common,tests: test: Make screencandy optional for FreeBSD (`pr#15444 `_, Willem Jan Withagen) * common: xio: migrate atomic_t to std::atomic<> (`pr#15230 `_, Jesse Williamson) * core: ceph-dencoder: Silence coverity CID 1412579 (`pr#15744 `_, Brad Hubbard) * core: ceph-disk: do not setup_statedir on trigger (`issue#19941 `_, `pr#15410 `_, Loic Dachary) * core: ceph-objectstore-tool: do not populate snapmapper with missing clones (`issue#19943 `_, `pr#15787 `_, Sage Weil) * core: compressor: add LZ4 support (`pr#15434 `_, Haomai Wang) * core: compressor: optimize header file dependency (`pr#15187 `_, Brad Hubbard, Xiaowei Chen) * core: crush, mon: make jewel the lower bound for client/crush compat for new clusters (`pr#15370 `_, Sage Weil) * core: erasure-code: optimize header file dependency (`pr#15172 `_, Brad Hubbard, Xiaowei Chen) * core: erasure-code: Remove duplicate of isa-l files (`pr#15372 `_, Ganesh Mahalingam) * core: filestore: migrate atomic_t to std::atomic<> (`pr#15228 `_, Jesse Williamson) * core: include/types.h, introduce host_to_ceph_errno (`pr#15496 `_, Willem Jan Withagen) * core: Install Pecan for FreeBSD (`pr#15610 `_, Willem Jan Withagen) * core: introduce (and fix) code to pass errno to other OSes (`pr#15495 `_, Willem Jan Withagen) * core: messages/MOSDPing: initialize MOSDPing padding (`issue#20323 `_, `pr#15714 `_, Sage Weil) * core,mgr: mgr/DaemonServer: stop spamming log with pg stats (`pr#15487 `_, Sage Weil) * core,mgr,mon: mon/PGMap: fix osd_epoch update when removing osd_stat (`issue#20208 `_, `pr#15573 `_, Sage Weil) * core,mgr,tests: qa/suites/rados/rest: test restful mgr module (`pr#15604 `_, Sage Weil) * core: mon,mgr: fix "ceph osd df", add some tools to find untested commands (`issue#20256 `_, `pr#15675 `_, Greg Farnum) * core,mon: mon/LogMonitor: 'log last' command (`pr#15497 `_, Sage Weil) * core,mon: mon/MgrStatMonitor: keep mgrstat version ahead of pgmon (`issue#20219 `_, `pr#15584 `_, Sage Weil) * core,mon: mon,osd: add crush_version to OSDMap, and allow crush map updates to gate on crush_version (`pr#15533 `_, Sage Weil) * core,mon: mon/OSDMonitor: cancel mapping job from update_from_paxos (`issue#20067 `_, `pr#15320 `_, Sage Weil) * core,mon: mon/OSDMonitor: use up set instead of acting set in reweight_by_utilization (`pr#13802 `_, Mingxin Liu) * core,mon: mon/PGMap: call blocked requests ERR not WARN (`pr#15501 `_, Sage Weil) * core: mon/OSDMonitor: batch noup/noin osds support (`pr#15725 `_, xie xingguo) * core: mon/OSDMonitor: batch OSDs nodown/noout support (`pr#15381 `_, xie xingguo) * core: mon/OSDMonitor: change info in 'osd failed' messages (`pr#15321 `_, Sage Weil) * core: mon,osd/OSDMap: a couple pg-upmap fixes (`pr#15319 `_, Sage Weil) * core: msg/async: avoid requeue racing with handle_write (`issue#20093 `_, `pr#15324 `_, Haomai Wang) * core: msg/async: fix deleted_conn is out of sync with conns (`issue#20230 `_, `pr#15645 `_, Haomai Wang) * core: objclass-sdk: use namespace ceph for bufferlist (`pr#15581 `_, Neha Ojha) * core: osd/ECTransaction: only read partial stripes when below *original* object size (`issue#19882 `_, `pr#15712 `_, Sage Weil) * core: osd,librados: add manifest, redirect (`pr#15325 `_, Sage Weil) * core: osd, messages/MOSDPing: bunch of fixes related to ping inflation (`pr#15727 `_, Piotr Dałek) * core: osd/OSD: bump up current version; conditionally encoding manifest into oi (`pr#15687 `_, xie xingguo) * core: osd/OSDMap: Change *pg_to_* to return void (`pr#15684 `_, Brad Hubbard) * core: osd/OSDMap: improve upmap calculation (`issue#19818 `_, `pr#14902 `_, Sage Weil) * core: osd/PG: drop pre-firefly compat_mode for choose_*_acting (`pr#15057 `_, Sage Weil) * core: osd/pglog: remove loop through empty collection (`pr#15121 `_, J. Eric Ivancich) * core: osd/PG: make non-empty PastIntervals non-fatal (`issue#20167 `_, `pr#15639 `_, Sage Weil) * core: osd/PrimaryLogPG: clear oi from trim_object() (`issue#19947 `_, `pr#15519 `_, Sage Weil) * core: osd/PrimaryLogPG: fix oi reset during trim_object (`issue#19947 `_, `pr#15696 `_, Sage Weil) * core: osd/PrimaryLogPG: record prior_version for DELETE events (`issue#20274 `_, `pr#15649 `_, Sage Weil) * core: os/filestore: when print log, use __func__ instead of hard code function name (`pr#15261 `_, mychoxin) * core: os/filestore: zfs add get_name() (`pr#15650 `_, Yanhu Cao) * core,performance: msg/async: reduce write_lock contention (`pr#15092 `_, Haomai Wang) * core,performance: osd/OSD.h: requeue the scrub job with higher priority to shorten the blocking time of related requests (`pr#15552 `_, Jin Cai) * core: qa: do not restrict valgrind runs to centos (`issue#18126 `_, `pr#15389 `_, Greg Farnum) * core,rgw: qa: Removed all 'default_idle_timeout' due to chnage in rwg task (`pr#15420 `_, Yuri Weinstein) * core,rgw,tests: qa/rgw_snaps: move default_idle_timeout config under the client (`issue#20128 `_, `pr#15400 `_, Yehuda Sadeh) * core: src/ceph.in: Use env(CEPH_DEV) to suppress noise from ceph (`pr#14746 `_, Willem Jan Withagen) * core,tests: ceph-disk: sensible default for block.db (`pr#15576 `_, Loic Dachary) * core,tests: ceph_test_rados_api_*: wait for snap trim on ENOENT during cleanup (`issue#19948 `_, `pr#15638 `_, Sage Weil) * core,tests: qa/suites/rados/\*/at-end: wait for healthy before scrubbing (`pr#15245 `_, Sage Weil) * core,tests: qa/suites/rados/singleton-nomsg/health-warnings: behave on ext4 (`issue#20043 `_, `pr#15207 `_, Sage Weil) * core,tests: qa/suites/rados: temporarily remove scrub_test from basic/ until post-luminous (`issue#19935 `_, `pr#15202 `_, Sage Weil) * core,tests: qa/suites/upgrade/kraken-x: enable experimental for bluestore (`pr#15359 `_, Sage Weil) * core,tests: qa/tasks/ceph_manager: get osds all in after thrashing (`pr#15784 `_, Sage Weil) * core,tests: qa/tasks/ceph: osd_scrub_pgs: reissue scrub requests in loop (`issue#20326 `_, `pr#15747 `_, Sage Weil) * core,tests: qa/tasks/ceph.py: tolerate active+clean+something (`pr#15717 `_, Sage Weil) * core,tests: qa/workunits/cephtool/test.sh: fix osd full health detail grep (`issue#20187 `_, `pr#15494 `_, Sage Weil) * core,tests: qa/workunits/rados/test_health_warning: misc fixes (`issue#19990 `_, `pr#15201 `_, Sage Weil) * core,tests: Revert "qa: do not restrict valgrind runs to centos" (`issue#20360 `_, `pr#15791 `_, Sage Weil) * core,tests: test/osd/TestRados.cc: run set-redirect test after finishing setup (`issue#20114 `_, `pr#15385 `_, Myoungwon Oh) * core,tools: osdmaptool: require --upmap-save before modifying input osdmap (`pr#15247 `_, Sage Weil) * crush: add missing tunable in tests (`pr#15412 `_, Loic Dachary) * crush: detect and (usually) fix ruleset != rule id (`pr#13683 `_, Sage Weil) * crush: encode can override weights with weight set (`issue#19836 `_, `pr#15002 `_, Loic Dachary) * crush: optimize header file dependency (`pr#9307 `_, Xiaowei Chen) * crush: update choose_args when items are added/removed (`pr#15311 `_, Loic Dachary) * doc: 12.1.0/release notes 2 (`pr#15627 `_, Abhishek Lekshmanan) * doc: add descriptions for mon/mgr options (`pr#15032 `_, Kefu Chai) * doc: add FreeBSD manual install (`pr#14941 `_, Willem Jan Withagen) * doc: add new cn ceph mirror to doc and mirroring (`pr#15089 `_, Shengjing Zhu) * doc: add rados xattr commands to manpage (`pr#15362 `_, Andreas Gerstmayr) * doc: add README to dmclock subdir to inform developers it's a git subtree (`pr#15386 `_, J. Eric Ivancich) * doc: AUTHORS: update with release manager, backport team (`pr#15391 `_, Sage Weil) * doc: Change the default values of some OSD options (`issue#20199 `_, `pr#15566 `_, Bara Ancincova) * doc: describe CephFS max_file_size (`pr#15287 `_, Ken Dreyer) * doc: describe mark_events logging available via the OSD's OpTracker (`pr#15095 `_, Greg Farnum) * doc: dev improve the s3tests doc to reflect current scripts (`pr#15180 `_, Abhishek Lekshmanan) * doc: doc/cephfs: mention RADOS object size limit (`pr#15550 `_, John Spray) * doc: doc/release-notes: update which jewel version does sortbitwise warning (`pr#15209 `_, Sage Weil) * doc: doc/release-note: update release-note (`pr#15748 `_, liuchang0812) * doc: doc/rgw: remove fastcgi page and sample configs (`pr#15133 `_, Casey Bodley) * doc: doc/rgw: remove Federated Configuration, clean up multisite (`issue#19504 `_, `issue#18082 `_, `pr#15132 `_, Casey Bodley) * doc: Documentation Fixes for http://tracker.ceph.com/issues/19879 (`issue#20057 `_, `issue#19879 `_, `pr#15606 `_, Sameer Tiwari) * doc: document perf historgrams (`pr#15150 `_, Piotr Dałek) * doc: Document RGW quota cache options (`issue#18747 `_, `pr#13395 `_, Daniel Gryniewicz) * doc: document the setup of restful and dashboard plugins (`issue#20239 `_, `pr#15707 `_, Kefu Chai) * doc: fix broken link in erasure-code.rst (`issue#19972 `_, `pr#15143 `_, MinSheng Lin) * doc: fix factual inaccuracy in doc/architecture.rst (`pr#15235 `_, Nathan Cutler, Sage Weil) * doc: fixing an error in 12.0.3 release notes (`pr#15195 `_, Abhishek Lekshmanan) * doc: fix syntax on code snippets in cephfs/multimds (`pr#15499 `_, John Spray) * doc: freshen mgr docs (`pr#15690 `_, John Spray) * doc: kill some broken links (`pr#15203 `_, liuchang0812) * doc: mailmap for v12.0.2 (`pr#14753 `_, Abhishek Lekshmanan) * doc: mailmap: Leo Zhang infomation and affiliation (`pr#15145 `_, Leo Zhang) * doc: mention certain conf vars should be in global (`pr#15119 `_, Ali Maredia) * doc: Merge pull request from stiwari/wip-19879 (`issue#19879 `_, `pr#15609 `_, Sameer Tiwari) * doc: minor fixes in radosgw/ (`pr#15103 `_, Drunkard Zhang) * doc: PendingReleaseNotes: notes on whiteouts vs pgnls (`pr#15575 `_, Sage Weil) * doc: PendingReleaseNotes: warning about 'osd rm ...' and #19119 (`issue#19119 `_, `pr#13731 `_, Sage Weil) * doc: release-notes clarify about rgw encryption (`pr#14800 `_, Abhishek Lekshmanan) * doc: release notes for v12.0.3 (dev) (`pr#15090 `_, Abhishek Lekshmanan) * doc: remove some non-existent and fix the default value according to … (`pr#15664 `_, Leo Zhang) * docs document "osd recovery max single start" setting (`issue#17396 `_, `pr#15275 `_, Ken Dreyer) * doc: tools/cephfs: fix cephfs-journal-tool --help (`pr#15614 `_, John Spray) * doc: typo fixes on hyperlink/words (`pr#15144 `_, Drunkard Zhang) * doc: update sample explaning "%" operator in test suites (`pr#15511 `_, Kefu Chai) * doc: Update some RGW documentation (`pr#15175 `_, Jens Rosenboom) * doc: update the usage of 'ceph-deploy purge' (`pr#15080 `_, Yu Shengzuo) * doc: use do_cmake.sh instead of `cmake ..` (`pr#15110 `_, Kefu Chai) * librbd: add create timestamp metadata for image (`pr#15757 `_, runsisi) * librbd: discard related IO should skip op if object non-existent (`issue#19962 `_, `pr#15239 `_, Mykola Golub) * librbd: do not raise an error if trash list returns -ENOENT (`pr#15085 `_, runsisi) * librbd: filter expected error codes from is_exclusive_lock_owner (`issue#20182 `_, `pr#15483 `_, Jason Dillaman) * librbd: fix issues with image removal state machine (`pr#15734 `_, Jason Dillaman) * librbd: fix valgrind errors and ensure tests detect future leaks (`pr#15415 `_, Jason Dillaman) * librbd: optimize copy-up to add hints only once to object op (`issue#19875 `_, `pr#15037 `_, Mykola Golub) * librbd: potential read IO hang when image is flattened (`issue#19832 `_, `pr#15234 `_, Jason Dillaman) * librbd: reacquire lock should update lock owner client id (`issue#19929 `_, `pr#15093 `_, Jason Dillaman) * librbd: reduce potential of erroneous blacklisting on image close (`issue#19970 `_, `pr#15162 `_, Jason Dillaman) * librbd: remove unused rbd_image_options_t ostream operator (`pr#15443 `_, Mykola Golub) * mds: change the type of data_pools (`pr#15278 `_, Vicente Cheng) * mds: check export pin during replay (`issue#20039 `_, `pr#15205 `_, Patrick Donnelly) * mds: Client syncfs is slow (waits for next MDS tick) (`issue#20129 `_, `pr#15544 `_, dongdong tao) * mds: explicitly output error msg for dump cache asok command (`pr#15592 `_, Zhi Zhang) * mds: fix CDir::merge() for mds_debug_auth_pins (`issue#19946 `_, `pr#15130 `_, "Yan, Zheng") * mds: fix client ID truncation (`pr#15258 `_, Henry Chang) * mds: fix hangs involving re-entrant calls to journaler (`issue#20165 `_, `pr#15430 `_, John Spray) * mds: improvements for stray reintegration (`pr#15548 `_, "Yan, Zheng") * mds: limit client writable range increment (`issue#19955 `_, `pr#15131 `_, "Yan, Zheng") * mds: mds perf item 'l_mdl_expos' always behind journaler (`pr#15621 `_, redickwang) * mds: miscellaneous multimds fixes part2 (`pr#15125 `_, "Yan, Zheng") * mds: miscellaneous multimds fixes (`pr#14550 `_, "Yan, Zheng") * mds: multimds flock fixes (`pr#15440 `_, "Yan, Zheng") * mds: Pass empty string to clear mantle balancer (`issue#20076 `_, `pr#15282 `_, Zhi Zhang) * mds: properly create aux subtrees for pinned directory (`issue#20083 `_, `pr#15300 `_, "Yan, Zheng") * mds: save projected path into inode_t::stray_prior_path (`issue#20340 `_, `pr#15800 `_, "Yan, Zheng") * mgr: ceph-create-keys: update client.admin if it already exists (`issue#19940 `_, `pr#15112 `_, John Spray) * mgr: ceph: introduce "tell x help" subcommand (`issue#19885 `_, `pr#15111 `_, liuchang0812) * mgr: ceph-mgr: Implement new pecan-based rest api (`pr#14457 `_, Boris Ranto) * mgr: cleanup, stop clients sending in perf counters (`pr#15578 `_, John Spray) * mgr: dashboard code cleanup (`pr#15577 `_, John Spray) * mgr: dashboard GUI module (`pr#14946 `_, John Spray, Dan Mick) * mgr: fix MgrStandby eating messages (`pr#15716 `_, John Spray) * mgr: load modules in separate python sub-interpreters (`pr#14971 `_, Tim Serong) * mgr: Mark session connections down on shutdown (`issue#19900 `_, `pr#15192 `_, Brad Hubbard) * mgr: mgr/DaemonServer.cc: log daemon type string as well as id (`pr#15560 `_, Dan Mick) * mgr: mgr/dashboard: fix type error in get_rate function (`issue#20276 `_, `pr#15668 `_, liuchang0812) * mgr: mgr/dashboard: load log lines on startup, split out audit log (`pr#15709 `_, John Spray) * mgr: mgr/MgrStandby: prevent use-after-free on just-shut-down Mgr (`issue#19595 `_, `pr#15297 `_, Sage Weil) * mgr: mgr/MgrStandby: respawn when deactivated (`issue#19595 `_, `issue#19549 `_, `pr#15557 `_, Sage Weil) * mgr: mgr,osd: ceph-mgr --help, unify usage text of other daemons (`pr#15176 `_, Tim Serong) * mgr,mon: mon,mgr: extricate PGmap from monitor (`issue#20067 `_, `issue#20174 `_, `issue#20050 `_, `pr#15073 `_, Kefu Chai, Sage Weil, Greg Farnum) * mgr,mon: mon/MgrMonitor: add 'mgr dump [epoch]' command (`pr#15158 `_, Sage Weil) * mgr,mon: mon,mgr: print pgmap reports to debug (not cluster) log (`pr#15740 `_, Sage Weil) * mgr: optimize DaemonStateIndex::cull() a little bit (`pr#14967 `_, Kefu Chai) * mgr: print a more helpful error message for when users lack mgr ceph caps (`issue#20296 `_, `pr#15697 `_, Greg Farnum) * mgr: pybind/mgr/dashboard: monkeypatch os.exit to stop cherrypy from taking down mgr (`issue#20216 `_, `pr#15588 `_, Sage Weil) * mgr: pybind/mgr: Delete `rest` module (`pr#15429 `_, John Spray) * mgr: pybind/mgr/restful: improve cert handling; work with vstart (`pr#15405 `_, Sage Weil) * mgr: raise python exception on failure in send_command() (`pr#15704 `_, Kefu Chai) * mgr: remove default cert; disable both restful and dashboard by default (`pr#15601 `_, Boris Ranto, Sage Weil) * mgr,tests: qa/upgrade/jewel-x/point-to-point: add a mgr during final upgrade (`pr#15637 `_, Sage Weil) * mon: add crush type down health warnings (`pr#14914 `_, Neha Ojha) * mon: Add override for FsNewHandler::handle() (`pr#15331 `_, yonghengdexin735) * mon: cleanups (`pr#15272 `_, Kefu Chai) * mon: delete useless function definition (`pr#15188 `_, shiqi) * mon: DIVIDE_BY_ZERO in PGMapDigest::dump_pool_stats_full() (`pr#15622 `_, Jos Collin) * mon: don't prefix mgr summary with epoch number (`pr#15512 `_, John Spray) * mon: fix accesing pending_fsmap from peon (`issue#20040 `_, `pr#15213 `_, John Spray) * mon: fix a few bugs with the osd health reporting (`pr#15179 `_, Sage Weil) * mon: Fixed typo in function comment blocks and in other comments (`pr#15304 `_, linbing) * mon: Fixed typo in @post of _active() (`pr#15191 `_, Linbing) * mon: fix mon_keyvaluedb application (`pr#15059 `_, Sage Weil) * mon: Incorrect expression in PGMap::get_health() (`pr#15648 `_, Jos Collin) * mon: it's no need to get pg action_primary osd twice in pg scrub (`pr#15313 `_, linbing) * mon: Log errors at startup (`issue#14088 `_, `pr#15723 `_, Ziye Yang) * mon: mon/MgrMonitor: send digests only if is_active() (`pr#15109 `_, Kefu Chai) * mon: mon/MonClient: cancel pending commands on shutdown (`issue#20051 `_, `pr#15227 `_, Kefu Chai, Sage Weil) * mon: {mon,osd,mds} {versions,count-metadata} (`pr#15436 `_, Sage Weil) * mon: mon/PGMap: show %used in formatted output (`issue#20123 `_, `pr#15387 `_, Joao Eduardo Luis) * mon: Removed unnecessary function declaration in MDSMonitor.h (`pr#15374 `_, yonghengdexin735) * mon: replace osds with `osd destroy` and `osd new` (`pr#14074 `_, Joao Eduardo Luis, Sage Weil) * mon: revise "ceph status" output (`pr#15396 `_, John Spray) * mon: show io status quickly if no update in a long period (`pr#14176 `_, Mingxin Liu) * mon: track features from connect clients, and use it to gate set-require-min-compat-client (`pr#15371 `_, Sage Weil) * mon: trim the creating_pgs after updating it with pgmap (`issue#20067 `_, `pr#15318 `_, Kefu Chai) * msg: do not enable client-side binding by default (`issue#20049 `_, `pr#15392 `_, Jason Dillaman) * msg: don't set msgr addr when disabing client bind (`pr#15243 `_, Haomai Wang) * msgr: msg/async: Lower down the AsyncMessenger's standby warning from debug (`pr#15242 `_, Pan Liu) * msgr: msg/async/rdma: check if fin message completed (`pr#15624 `_, Alexander Mikheev, Adir Lev) * msgr: msg/async/rdma: handle buffers after close msg (`pr#15749 `_, DanielBar-On, Alexander Mikheev, Adir Lev) * msgr: msg/async: remove false alert "assert" (`pr#15288 `_, Haomai Wang) * osd: don't leak pgrefs or reservations in SnapTrimmer (`issue#19931 `_, `pr#15214 `_, Greg Farnum) * osd: fix argument-dependent lookup of swap() (`pr#15124 `_, Casey Bodley) * osd: fix past_intervals base case by adding epoch_pool_created to pg_history_t (`issue#19877 `_, `pr#14989 `_, Sage Weil) * osd: hdd vs ssd defaults for osd op thread pool (`pr#15422 `_, Sage Weil) * osd: Implement asynchronous recovery sleep (`pr#15212 `_, Neha Ojha) * osd: Move scrub sleep timer to osdservice (`issue#19986 `_, `pr#15217 `_, Brad Hubbard) * osd: Object level shard errors are tracked and used if no auth available (`issue#20089 `_, `pr#15397 `_, David Zafman) * osd: osd/OSDMap.cc: check if osd is out in subtree_type_is_down (`issue#19989 `_, `pr#15250 `_, Neha Ojha) * osd: 'osd tree in|out|up|down' to filter tree results (`pr#15294 `_, Sage Weil) * osd: reduce buffer pinning from EC entries (`pr#15120 `_, Sage Weil) * osd: reduce map cache size (`pr#15292 `_, Sage Weil) * osd: reduce rados_max_object_size from 100 GB -> 128 MB (`pr#15520 `_, Sage Weil) * osd: rename osd -> osd_pglog; include pglog-related bufferlists (`pr#15531 `_, Sage Weil) * osd: Return early on shutdown (`issue#19900 `_, `pr#15345 `_, Brad Hubbard) * osd: take PGRef for recovery sleep wakeup event (`issue#20226 `_, `pr#15582 `_, Sage Weil) * osd: when osd in not in failure_pending, we don't need to get osd inst from osdmap. (`pr#15558 `_, linbing) * osd: When scrub finds an attr error mark shard inconsistent (`issue#20089 `_, `pr#15368 `_, David Zafman) * performance: common/config_opts.h: Lower HDD throttle cost. (`pr#15485 `_, Mark Nelson) * performance: denc: add need_contiguous to denc_traits (`pr#15224 `_, Kefu Chai) * pybind: pybind/ceph_argparse: fix empty string check (`issue#20135 `_, `pr#15500 `_, Sage Weil) * pybind: pybind/ceph_daemon.py: fix Termsize.update (`pr#15253 `_, Kefu Chai) * pybind: pybind/rados: avoid call free() on invalid pointer (`pr#15159 `_, Mingxin Liu) * pybind,rbd: pybind/rbd: OSError should be picklable (`issue#20223 `_, `pr#15574 `_, Jason Dillaman) * pybind: support mon target in pybind (`pr#15409 `_, liuchang0812) * qa: test/osd/osd-dup.sh: use wait_for_clean (`pr#15722 `_, Dan Mick) * rbd-mirror: coordinate image syncs with leader (`issue#18789 `_, `pr#14745 `_, Mykola Golub) * rbd-mirror: lock loss during sync should wait for in-flight copies (`pr#15532 `_, Jason Dillaman) * rbd-mirror: permit release of local image exclusive lock after force promotion (`issue#18963 `_, `pr#15140 `_, Jason Dillaman) * rbd: properly decode features when using image name optional (`issue#20185 `_, `pr#15492 `_, Jason Dillaman) * rbd: pybind/rbd: fix crash if more than 1024 images in trash bin (`pr#15134 `_, runsisi) * rbd: rbd/bench: fix write gaps when doing sequential writes with io-threads > 1 (`pr#15206 `_, Igor Fedotov) * rbd: removed hardcoded default pool (`pr#15518 `_, Jason Dillaman) * rbd,tests: qa: krbd discard/zeroout tests (`pr#15388 `_, Ilya Dryomov) * rbd,tests: qa/suites/krbd: unmap subsuite needs straw buckets (`pr#15290 `_, Ilya Dryomov) * rbd,tests: qa: update krbd_data_pool.sh to match the new rados ls behavior (`pr#15594 `_, Ilya Dryomov) * rbd,tests: test/librbd: unit tests cleanup (`pr#15113 `_, Mykola Golub) * rdma: msg/async/rdma: Add DSCP support (`pr#15484 `_, Sarit Zubakov) * rdma: msg/async: Revert RDMA-CM (`pr#15262 `_, Amir Vadai) * rgw: Adding code to create tenanted user for s3 bucket policy tests. (`pr#15028 `_, Pritha Srivastava) * rgw: add "rgw_verify_ssl" config (`pr#15301 `_, Shasha Lu) * rgw: add the Vim's modeline into rgw_orphan.cc. (`pr#15431 `_, Radoslaw Zarzynski) * rgw: bucket index check in radosgw-admin removes valid index. (`issue#18470 `_, `pr#12851 `_, Zhang Shaowen) * rgw: datalog trim and mdlog trim handles the result returned by osd incorrectly. (`issue#20190 `_, `pr#15507 `_, Zhang Shaowen) * rgw: display more info when using radosgw-admin bucket stats (`pr#15256 `_, fang.yuxiang) * rgw: drop asio/{yield,coroutine}.hpp replacements (`pr#15413 `_, Kefu Chai) * rgw: drop using std ns in header files and other cleanups (`pr#15137 `_, Abhishek Lekshmanan) * rgw: dynamic resharding (`pr#15493 `_, Yehuda Sadeh, Orit Wasserman) * rgw: fix 'gc list --include-all' command infinite loop the first items (`issue#19978 `_, `pr#12774 `_, Shasha Lu, fang yuxiang) * rgw: fix lc list failure when shards not be all created (`issue#19898 `_, `pr#15025 `_, Jiaying Ren) * rgw: fix radosgw-admin retcode (`pr#15257 `_, Shasha Lu) * rgw: fix test_multi.py default config file path (`pr#15306 `_, Jiaying Ren) * rgw: fix X-Object-Meta-Static-Large-Object in SLO download (`issue#19951 `_, `pr#15045 `_, Shasha Lu) * rgw: metadata search part 2 (`pr#14351 `_, Yehuda Sadeh) * rgw: migrate atomic_t to std::atomic<> (`pr#15001 `_, Jesse Williamson) * rgw: optimize data sync. Add zones_trace in log to avoid needless sync. (`issue#19219 `_, `pr#13851 `_, Zhang Shaowen) * rgw: optimize generating torrent file. Object data won't stay in memory now. (`pr#15153 `_, Zhang Shaowen) * rgw: pass authentication domain to civetweb (`issue#17657 `_, `pr#12861 `_, Abhishek Lekshmanan) * rgw: polymorphic error codes (`pr#10690 `_, Pritha Srivastava, Marcus Watts) * rgw: remove fastcgi from default rgw frontends (`pr#15098 `_, Casey Bodley) * rgw: rename s3_code to err_code for swift (`pr#12300 `_, Guo Zhandong) * rgw: return the version id in get object and object metadata request. (`issue#19370 `_, `pr#14117 `_, Zhang Shaowen) * rgw: rgw-admin: fix bucket limit check argparse, div(0) (`pr#15316 `_, Matt Benjamin) * rgw: rgw_common: use string::npos for the results of str.find (`pr#14341 `_, Abhishek Lekshmanan) * rgw: rgw_file: add lock protection for readdir against gc (`issue#20121 `_, `pr#15329 `_, Gui Hecheng) * rgw: rgw_file cleanup names (`pr#15568 `_, Gui Hecheng) * rgw: rgw_file: fix flags set on unsuccessful unlink (`pr#15222 `_, Gui Hecheng) * rgw: rgw_file: prevent conflict of mkdir between restarts (`issue#20275 `_, `pr#15655 `_, Gui Hecheng) * rgw: rgw_file: release rgw_fh lock and ref on ENOTEMPTY (`issue#20061 `_, `pr#15246 `_, Matt Benjamin) * rgw: rgw_file: removed extra rele() on fs in rgw_umount() (`pr#15152 `_, Gui Hecheng) * rgw: rgw_file: remove hidden uxattr objects from buckets on delete (`issue#20045 `_, `pr#15210 `_, Matt Benjamin) * rgw: rgw_file: remove post-unlink lookup check (`issue#20047 `_, `pr#15216 `_, Matt Benjamin) * rgw: rgw_file: replace raw fs->fh_lru.unref with predefined fs->unref (`pr#15541 `_, Gui Hecheng) * rgw: rgw_file: store bucket uxattrs on the bucket (`issue#20082 `_, `pr#15293 `_, Matt Benjamin) * rgw: rgw_file: v3: fix write-timer action (`issue#19932 `_, `pr#15097 `_, Matt Benjamin) * rgw: rgw_rados: create sync module instances only if run_sync_thread is set (`issue#19830 `_, `pr#14994 `_, Abhishek Lekshmanan) * rgw: rgw/rgw_swift_auth.cc: using string::back() instead as the C++11 recommend (`pr#14827 `_, liuyuhong) * rgw: segment fault when shard id out of range (`issue#19732 `_, `pr#14389 `_, redickwang) * rgw: set object accounted size correctly (`issue#20071 `_, `pr#14950 `_, fang yuxiang) * rgw: set placement rule properly (`pr#15221 `_, fang.yuxiang) * rgw: support certain archaic and antiquated distributions (`pr#15498 `_, Adam C. Emerson) * rgw,tests: qa/rgw: add multisite suite to configure and run multisite tests (`pr#14688 `_, Casey Bodley) * rgw,tests: qa/rgw: remove apache/fastcgi and radosgw-agent tests (`pr#15184 `_, Casey Bodley) * rgw: Turn off fcgi as a frontend (`issue#16784 `_, `pr#15070 `_, Thomas Serlin) * rgw: use get_data_extra_pool() when get extra pool (`issue#20064 `_, `pr#15219 `_, fang yuxiang) * rgw: use pre-defined calls to replace raw flag operation (`pr#15107 `_, Gui Hecheng) * tests: Add integration tests for admin socket output (`pr#15223 `_, Brad Hubbard) * tests: ceph-disk: add setting for external py-modules for tox-testing (`pr#15433 `_, Willem Jan Withagen) * tests: Check make_writeable() return value (`pr#15266 `_, zhanglei) * tests: config_opts: drop unused opts (`pr#15031 `_, Kefu Chai) * tests: qa: add task for dnsmasq configuration (`pr#15071 `_, Casey Bodley) * tests: qa: split test_tiering into smaller pieces (`pr#15146 `_, Kefu Chai) * tests: qa/suites/rados: fix ec thrashing (`pr#15087 `_, Sage Weil) * tests: qa/suites/rados/singleton-nomsgr: fix syntax (`pr#15276 `_, Sage Weil) * tests: qa/suites/rados/thrash: make sure osds have map before legacy scrub (`pr#15117 `_, Sage Weil) * tests: qa/suites/rados/upgrade: restart mds (`pr#15517 `_, Sage Weil) * tests: qa/tasks/ceph_manager: 'ceph $service tell ...' is obsolete (`pr#15252 `_, Sage Weil) * tests: qa/tasks/rebuild_mondb: grant "mgr:allow \*" to client.admin (`issue#19439 `_, `pr#14284 `_, Kefu Chai) * tests: qa/tasks/repair_test: unset flags we set (`pr#15296 `_, Sage Weil) * tests: qa/workunits/ceph-helpers.sh: use syntax understood by jq 1.3 (`pr#15530 `_, Kefu Chai) * tests: Rename FileJournal object to distinguish (`pr#15279 `_, Jos Collin) * tests: test/crush: silence warnings from -Walloc-size-larger-than= and -Wstringop-overflow= (`pr#15173 `_, Jos Collin) * tests: test: migrate atomic_t to std::atomic (`pr#14655 `_, Jesse Williamson) * tests: test/msgr: silence warnings from -Wsign-compare (`pr#15356 `_, Jos Collin) * tests: test/msgr: silence warnings from -Wsign-compare (`pr#15570 `_, Jos Collin) * tests: test/objectstore: Check apply_transaction() return values (`pr#15171 `_, zhanglei) * tests: test/old: Removed commented code (`pr#15366 `_, Jos Collin) * tests: test/osdc: fix comparison error and silence warning from -Wunused-value (`pr#15353 `_, Willem Jan Withagen) * tests: test: osd/TestOSDMap.cc: fix Clang complain about promotion (`pr#15525 `_, Willem Jan Withagen) * tests: test: test_denc.cc: silence warning from -Wsign-compare (`pr#15355 `_, Jos Collin) * tests: test: Test fix for SnapSet change (`pr#15161 `_, David Zafman) * tests: test/unittest_bluefs: check whether mounted success (`pr#14988 `_, shiqi) * tools: ceph-disk: command invocation needs all fields separate (`pr#15733 `_, Willem Jan Withagen) * tools: ceph.in: adjust usage width according to user's tty (`pr#15190 `_, Kefu Chai) * tools: ceph.in: assert(state==connected) before help_for_target() (`pr#15156 `_, Kefu Chai) * tools: ceph.in: drop the compatiiblity to handle non json commands (`pr#15508 `_, Kefu Chai) * tools: ceph.in: print return code when json_command failed (`pr#15378 `_, liuchang0812) * tools: ceph-rest-api: be more tolerant on network failure (`issue#20115 `_, `pr#15706 `_, Kefu Chai) * tools: fio_ceph_objectstore: Print db_statistics when rocksdb_perf is enabled (`pr#15796 `_, Xiaoyan Li) * tools: tools/ceph_kvstore_tool: add "bluestore-kv" to usage (`pr#15326 `_, xie xingguo) * tools: tools/crushtool: replicated-rule API support (`pr#15011 `_, xie xingguo) * tools: vstart: "debug_ms=1" for mgr by default (`pr#15127 `_, Kefu Chai) * tools: vstart: print "start osd.$id" instead of "start osd$id" (`pr#15427 `_, Kefu Chai) v12.0.3 Luminous (dev) ====================== This is the fourth development checkpoint release of Luminous, the next long term stable release. This release introduces several improvements in bluestore, monitor, rbd & rgw. Major changes from v12.0.2 -------------------------- Notable Changes --------------- * bluestore,core: osd/OSDMap: should update input param if osd dne (`pr#14863 `_, Kefu Chai) * bluestore: include/intarith: templatize ctz/clz/cbits helpers (`pr#14862 `_, Kefu Chai) * bluestore: os/bluestore: align reclaim size to bluefs_alloc_size (`pr#14744 `_, Haomai Wang) * bluestore: os/bluestore: assert blob map returns success (`pr#14473 `_, shiqi) * bluestore: os/bluestore: fix deferred write race (`issue#19880 `_, `pr#15004 `_, Sage Weil) * bluestore: os/bluestore: fix typo(s/trasnaction/transaction/) (`pr#14890 `_, xie xingguo) * bluestore: os/bluestore: fix use after free race with aio_wait (`pr#14956 `_, Sage Weil) * bluestore: os/bluestore: pre-calculate number of ghost buffers to evict (`pr#15029 `_, xie xingguo) * bluestore: os/bluestore: Record l_bluestore_state_kv_queued_lat for sync_submit_… (`pr#14448 `_, Jianpeng Ma) * bluestore: os/bluestore: Remove ExtentFreeListManager. (`pr#14772 `_, Jianpeng Ma) * bluestore: os/bluestore: remove unused condition variable (`pr#14973 `_, Igor Fedotov) * bluestore: os/bluestore: rename/fix throttle options (`pr#14717 `_, Sage Weil) * bluestore: os/bluestore: roundoff bluefs allocs to bluefs_alloc_size (`pr#14876 `_, Ramesh Chander) * bluestore: os/bluestore: use correct bound encode size for unused (`pr#14731 `_, Haomai Wang) * bluestore,performance: common/config_opts.h: compaction readahead for bluestore/rocksdb (`pr#14932 `_, Mark Nelson) * bluestore,performance: os/bluestore/BlueFS: add bluefs_sync_write option (`pr#14510 `_, Sage Weil) * bluestore,performance: os/bluestore: do not balance bluefs on every kv_sync_thread iteration (`pr#14557 `_, Sage Weil) * bluestore,performance: os/bluestore: eliminate some excessive stuff (`pr#14675 `_, Igor Fedotov) * bluestore,performance: os/bluestore: get rid off excessive lock at BitMapAllocator (`pr#14749 `_, Igor Fedotov) * bluestore,performance: os/blueStore: In osd_tp_thread, call _txc_finalize_kv. (`pr#14709 `_, Jianpeng Ma) * bluestore,performance: os/bluestore: make bluestore_max_blob_size parameter hdd/ssd case dependant (`pr#14434 `_, Igor Fedotov) * bluestore,performance: os/bluestore: rewrite deferred write handling (`issue#16644 `_, `pr#14491 `_, Sage Weil) * bluestore,performance: os/bluestore: use denc for varint encoding (`pr#14911 `_, Piotr Dałek) * bluestore,performance: os/fs/aio: use small_vector for aio_t; clean up header location (`pr#14853 `_, Sage Weil) * bluestore,tests: unittest_alloc: add test_alloc_big (`issue#16662 `_, `pr#14844 `_, Sage Weil) * bluestore,tools: ceph-kvstore-tool: allow 'bluestore-kv' as kvdb type; add escaping, compaction (`pr#14718 `_, Sage Weil) * build/ops: alpine: add alpine linux dev support (`pr#9853 `_, John Coyle) * build/ops: arch: use __get_cpuid instead of do_cpuid (`issue#7869 `_, `pr#14857 `_, Jos Collin) * build/ops: CMakeLists.txt: don't do crypto/isa-l if not Intel (`pr#14721 `_, Dan Mick) * build/ops: compressor/zlib: fix plugin for non-Intel arches (`pr#14947 `_, Dan Mick) * build/ops: debian/rpm: move radosgw-admin to ceph-common (`issue#19577 `_, `pr#14940 `_, Ali Maredia) * build/ops: dmclock: error: ‘function’ in namespace ‘std’ does not name a template type (`pr#14909 `_, Jos Collin) * build/ops: dmclock: initial commit of dmclock QoS library (`pr#14330 `_, J. Eric Ivancich) * build/ops: init-ceph: add ceph libraries path to environment (`pr#14693 `_, Mohamad Gebai) * build/ops: init-ceph: should have a space before "]" (`pr#14796 `_, Kefu Chai) * build/ops: merge v12.0.2 release tag (`pr#15091 `_, Jenkins Build Slave User) * build/ops,mgr: debian/ceph-base.dirs: create bootstrap-mgr dirs (`pr#14838 `_, Sage Weil) * build/ops,mon: mon/ConfigKeyService: add 'config-key dump' to show keys and vals (`pr#14858 `_, Dan Mick) * build/ops,performance,rbd: byteorder: use gcc intrinsics for byteswap (`pr#15012 `_, Kefu Chai) * build/ops: rocksdb: sync with upstream (`pr#14818 `_, Nathan Cutler, Kefu Chai) * build/ops: rpm: fix python-Sphinx package name for SUSE (`pr#15015 `_, Nathan Cutler, Jan Matejek) * build/ops: rpm: gperftools-devel >= 2.4 (`issue#13522 `_, `pr#14870 `_, Nathan Cutler) * build/ops: rpm: package crypto on x86_64 only (`pr#14779 `_, Nathan Cutler) * build/ops: debian: package crypto plugin only on amd64 (`pr#14820 `_, Kefu Chai) * build/ops: src/init-ceph.in: allow one((re)?start|stop) as commands (`pr#14560 `_, Willem Jan Withagen) * build/ops: yasm-wrapper: strip -E (stops ccache trashing source files) (`pr#14633 `_, Tim Serong) * cephfs: ceph-fuse: use user space permission check by default (`issue#19820 `_, `pr#14907 `_, "Yan, Zheng") * cephfs: client: client_quota no longer optional (`pr#14978 `_, Dan van der Ster) * cephfs: client: fix UserPerm::gid_in_group() (`issue#19903 `_, `pr#15039 `_, "Yan, Zheng") * cephfs: client: getattr before returning quota/layout xattrs (`issue#17939 `_, `pr#14018 `_, John Spray) * cephfs: fs/ceph-fuse: normalize file open flags on the wire (`pr#14822 `_, Jan Fajerski) * cephfs: mds/Server.cc: Don't evict a slow client if... (`issue#17855 `_, `pr#12935 `_, Michal Jarzabek) * cephfs: osdc/Filer: truncate large file party by party (`issue#19755 `_, `pr#14769 `_, "Yan, Zheng") * cephfs: osdc: remove journaler_allow_split_entries option (`issue#19691 `_, `pr#14636 `_, John Spray) * cephfs,performance: client: make seeky readdir more efficiency (`issue#19306 `_, `pr#14317 `_, "Yan, Zheng") * cephfs: qa/cephfs: Fix for test_data_scan (`issue#19893 `_, `pr#15094 `_, Douglas Fuller) * cephfs: qa/suites/fs: reserve more space for mds in full tests (`issue#19891 `_, `pr#15026 `_, "Yan, Zheng") * cephfs,tests: qa: silence spurious insufficient standby health warnings (`pr#15035 `_, Patrick Donnelly) * cephfs,tests: qa: Tidy up fs/ suite (`pr#14575 `_, John Spray) * cleanup: dmclock: include missing header. (`pr#14923 `_, Jos Collin) * cleanup: kill clang warnings (`pr#14549 `_, Kefu Chai) * cleanup: test: c_write_operations.cc: silence warning from -Wsign-compare (`pr#14889 `_, Jos Collin) * cmake: add "container" to required boost components (`pr#14850 `_, Kefu Chai) * cmake: align cmake names of library packages (`issue#19853 `_, `pr#14951 `_, Nathan Cutler) * cmake: Allow tests to build without NSS (`pr#13315 `_, Daniel Gryniewicz) * cmake: do not compile crush twice (`pr#14725 `_, Kefu Chai) * cmake: enable cross-compilation of boost (`issue#18938 `_, `pr#14881 `_, Kefu Chai) * cmake: fix the linked lib reference of unittest_rgw_crypto (`pr#14869 `_, Willem Jan Withagen) * cmake: kill duplicated cmake commands (`pr#14948 `_, liuchang0812) * cmake: pass -d0 to b2 if not CMAKE_VERBOSE_MAKEFILE (`pr#14651 `_, Kefu Chai) * cmake: should not compile crc32c_ppc.c on intel arch. (`pr#14423 `_, Kefu Chai) * common: Better handling for missing/inaccessible ceph.conf files (`issue#19658 `_, `pr#14757 `_, Dan Mick) * common: cls/log/cls_log.cc: reduce logging noise (`issue#19835 `_, `pr#14879 `_, Willem Jan Withagen) * common: common/admin_socket: add config for admin socket permission bits (`pr#11684 `_, runsisi) * common: common/ceph_context.cc: Use CEPH_DEV to reduce logfile noise (`pr#10384 `_, Willem Jan Withagen) * common: common/config: Add /usr/local/etc/ceph to default paths (`pr#14797 `_, Willem Jan Withagen) * common: common/config_opts: Set the HDD throttle cost to 1.5M (`pr#14808 `_, Mark Nelson) * common: common/Finisher: fix uninitialized variable warning (`pr#14958 `_, Piotr Dałek) * common: common/interval_set: return int64_t for size() (`pr#12898 `_, Xinze Chi) * common: crypto: cleanup NSPR in main thread (`pr#14801 `_, Kefu Chai) * common: fix building against libcryptopp (`pr#14949 `_, Shengjing Zhu) * common: Fix unused variable references warnings (`pr#14790 `_, Willem Jan Withagen) * common: msg/async: return right away in NetHandler::set_priority() if not supported (`pr#14795 `_, Kefu Chai) * common: osdc/Objecter: fix pool dne corner case (`issue#19552 `_, `pr#14901 `_, Sage Weil) * common: osdc/Objecter: handle command target that goes down (`issue#19452 `_, `pr#14302 `_, Sage Weil) * common: osd/OSDMap: fix feature commit comment (`pr#15056 `_, Sage Weil) * common,performance: common/Finisher: batch handle perfcounter && only send signal when waiter existed. (`pr#14363 `_, Jianpeng Ma) * common: Remove redundant includes (`issue#19883 `_, `pr#15003 `_, Brad Hubbard) * common: Remove redundant includes (`issue#19883 `_, `pr#15019 `_, Brad Hubbard) * common: src/common: change last_work_queue to next_work_queue. (`pr#14738 `_, Pan Liu) * core: ceph-disk: ceph-disk on FreeBSD should not use mpath-code (`pr#14837 `_, Willem Jan Withagen) * core: ceph-disk: implement prepare --no-locking (`pr#14728 `_, Dan van der Ster, Loic Dachary) * core: ceph-disk: separate ceph-osd --check-needs-* logs (`issue#19888 `_, `pr#15016 `_, Loic Dachary) * core: erasure-code: sync jerasure/gf-complete submodules (`pr#14424 `_, Loic Dachary) * core: introduce DirectMessenger (`pr#14755 `_, Casey Bodley, Matt Benjamin) * core: messages: remove compat cruft (`pr#14475 `_, Sage Weil) * core,mon: common/Finisher: fix uninitialized variable warning (`issue#19874 `_, `pr#14979 `_, Sage Weil) * core: mon,osd: add require_min_compat_client setting to enforce and clarify client compatibility (`pr#14959 `_, Sage Weil) * core: mon/OSDMonitor: skip prime_pg_temp if mapping is prior to osdmap (`pr#14826 `_, Kefu Chai) * core: mon/PGMonitor: fix wrongly report "pg stuck in inactive" (`pr#14391 `_, Mingxin Liu) * core: osd,librados: cmpext support (`pr#14715 `_, Zhengyong Wang, David Disseldorp, Mike Christie) * core: osd/OSDMap: bump encoding version for require_min_compat_client (`pr#15046 `_, "Yan, Zheng") * core: osd/PG.cc: Optimistic estimation on PG.last_active (`pr#14799 `_, Xiaoxi Chen) * core: osd/PG.cc: unify the call of checking whether lock is held (`pr#15013 `_, Jin Cai) * core: osd/PG: fix possible overflow on unfound objects (`pr#12669 `_, huangjun) * core: osd/PrimaryLogPG: do not call on_shutdown() if (pg.deleting) (`issue#19902 `_, `pr#15040 `_, Kefu Chai) * core: osd/PrimayLogPG: update modified range to include the whole object size for write_full op (`pr#15021 `_, runsisi) * core: osd/ReplicatedBackend: remove MOSDSubOp cruft from repop_applied (`pr#14358 `_, Jianpeng Ma) * core: os/filestore/FileJournal: Fix typo in the comment (`pr#14493 `_, Zhou Zhengping) * core: os/filestore: fix infinit loops in fiemap() (`pr#14367 `_, Ning Yao) * core,performance: osd, os: reduce fiemap burden (`pr#14640 `_, Piotr Dałek) * core,performance: os/filestore: use new sleep strategy when io_submit gets EAGAIN. (`pr#14860 `_, Pan Liu) * core,performance: os/kstore: Added rocksdb bloom filter settings (`pr#13053 `_, Ted-Chang) * core,tests: ceph_test_rados_api_watch_notify: make LibRadosWatchNotify.Watch3Timeout tolerate thrashing (`issue#19433 `_, `pr#14899 `_, Sage Weil) * core,tools: ceph: perfcounter priorities and daemonperf updates to use them (`pr#14793 `_, Sage Weil, Dan Mick) * core,tools: kv: move 'bluestore-kv' hackery out of KeyValueDB into ceph-kvstore-tool (`issue#19778 `_, `pr#14895 `_, Sage Weil) * crush: builder: legacy has chooseleaf_stable = 0 (`pr#14695 `_, Loic Dachary) * crush: crush_init_workspace starts with struct crush_work (`pr#14696 `_, Loic Dachary) * crush: update documentation for negative choose step (`pr#14970 `_, Loic Dachary) * doc: AUTHORS: update tech leads (`pr#14350 `_, Patrick Donnelly) * doc: correct and improve add user capability section (`pr#14055 `_, Chu, Hua-Rong) * doc: Correcting the remove bucket example and adding bucket link/unlink examples (`pr#12460 `_, Uday Mullangi) * doc: dev add a note about ccache (`pr#14478 `_, Abhishek Lekshmanan) * doc: doc/dev: add some info about FreeBSD (`pr#14503 `_, Willem Jan Withagen) * doc: fio: update README.md so only the fio ceph engine is built (`pr#15081 `_, Kefu Chai) * doc: fix link that pointed to a nonexistent file (`pr#14740 `_, Peter Maloney) * doc: Indicate how to add multiple admin capbabilies (`pr#13956 `_, Chu, Hua-Rong) * doc: mailmap: add Alibaba into organization map (`pr#14900 `_, James Liu) * doc: mailmap: update organization info (`pr#14747 `_, liuchang0812) * doc: mailmap: V12.0.1 credits (`pr#14479 `_, M Ranga Swami Reddy) * doc: mailmap: Weibing Zhang mailmap affiliation (`pr#15076 `_, Weibing Zhang) * doc: mailmap: ztczll affiliation (`pr#15079 `_, zhanglei) * doc: mention teuthology-worker security group (`pr#14748 `_, Nathan Cutler) * doc: peoplemap: add pdonnell alias (`pr#14352 `_, Patrick Donnelly) * doc: remove deprecated subcommand in man/8/ceph.rst (`pr#14928 `_, Drunkard Zhang) * doc: Re-word the warnings about using git subtrees. (`pr#14999 `_, J. Eric Ivancich) * doc: rgw: Clean up create subuser parameters (`pr#14335 `_, hrchu) * doc: rgw: correct get usage parameter default value (`pr#14372 `_, hrchu) * doc: rgw: Get user usage needs to specify user (`pr#14804 `_, hrchu) * doc: rgw: note rgw_enable_usage_log option in adminops guide (`pr#14803 `_, hrchu) * doc: rgw: Rewrite Java swift examples (`pr#14268 `_, Chu, Hua-Rong) * doc: rgw: Rewrite the key management (`pr#14384 `_, hrchu) * doc: style fix for doc/cephfs/client-config-ref.rst (`pr#14840 `_, Drunkard Zhang) * doc: two minor fixes (`pr#14494 `_, Drunkard Zhang) * doc: update the support status of swift static website (`pr#13824 `_, Jing Wenjun) * doc: v12.0.2 (dev) release notes (`pr#14625 `_, Abhishek Lekshmanan) * librados: fix rados_pool_list when buf is null (`pr#14859 `_, Sage Weil) * librbd: cleanup logging code under librbd/io (`pr#14975 `_, runsisi) * librbd: do not instantiate templates while building tests (`issue#18938 `_, `pr#14891 `_, Kefu Chai) * librbd: minor fixes for image trash move (`pr#14834 `_, runsisi) * librbd: remove redundant check for image id emptiness (`pr#14830 `_, runsisi) * librbd: silence -Wunused-variable warning (`pr#14953 `_, Kefu Chai) * mds: add perf counters for file system operations (`pr#14938 `_, Michael Sevilla) * mds: change_attr++ and set ctime for set_vxattr (`issue#19583 `_, `pr#14726 `_, Patrick Donnelly) * mds: fix mantle script to not fail for last rank (`issue#19589 `_, `pr#14704 `_, Patrick Donnelly) * mds: fix use-after-free in Locker::file_update_finish() (`issue#19828 `_, `pr#14991 `_, "Yan, Zheng") * mds: issue new caps when sending reply to client (`issue#19635 `_, `pr#14743 `_, "Yan, Zheng") * mds: relocate PTRWAITER put near get (`pr#14921 `_, Patrick Donnelly) * mds: remove boost::pool usage and use tcmalloc directly (`issue#18425 `_, `pr#12792 `_, Zhi Zhang) * mds: remove "mds log" config option (`issue#18816 `_, `pr#14652 `_, John Spray) * mds: support export pinning on directories (`issue#17834 `_, `pr#14598 `_, "Yan, Zheng", Patrick Donnelly) * mds: use debug_mds for most subsys (`issue#19734 `_, `pr#15052 `_, Patrick Donnelly) * mgr: add machinery for python modules to send MCommands to daemons (`pr#14920 `_, John Spray) * mgr: add mgr allow * to client.admin (`pr#14864 `_, huanwen ren) * mgr: do shutdown using finisher so we can do it in the right order (`issue#19743 `_, `pr#14835 `_, Kefu Chai) * mgr: do the shutdown in the right order (`issue#19813 `_, `pr#14952 `_, Kefu Chai) * mgr: fix crash on set_config from python module with insufficient caps (`issue#19629 `_, `pr#14706 `_, Tim Serong) * mgr: fix metadata handling from old MDS daemons (`pr#14161 `_, John Spray) * mgr: fix python module teardown & add tests (`issue#19407 `_, `issue#19412 `_, `issue#19258 `_, `pr#14232 `_, John Spray) * mgr: fix session leak (`issue#19591 `_, `pr#14720 `_, Sage Weil) * mgr: Misc. bug fixes (`issue#18994 `_, `pr#14883 `_, John Spray) * mgr: mkdir bootstrap-mgr (`pr#14824 `_, huanwen ren) * mgr,mon: mon/MgrMonitor: only propose if we updated (`pr#14645 `_, Sage Weil) * mgr,mon: mon,mgr: trim osdmap without the help of pgmap (`pr#14504 `_, Kefu Chai) * mgr: pybind/mgr/rest: completely terminate cherrypy in shutdown (`pr#14995 `_, Tim Serong) * mgr: redirect python stdout,stderr to ceph log (`pr#14189 `_, Kefu Chai, Tim Serong, Dan Mick) * mgr: release allocated PyString (`pr#14716 `_, Kefu Chai) * mgr: remove non-existent MDS daemons from FSMap (`issue#17453 `_, `pr#14937 `_, Spandan Kumar Sahu) * mgr,tests: qa/suites: move mgr tests into rados suite (`pr#14687 `_, John Spray) * mgr: various cleanups (`pr#14802 `_, Kefu Chai) * mon: check is_shutdown() in timer callbacks (`issue#19825 `_, `pr#14919 `_, Kefu Chai) * mon: do not prime_pg_temp creating pgs; clean up pg create conditions (`issue#19826 `_, `pr#14913 `_, Sage Weil) * mon: don't call propose_pending in prepare_update() (`issue#19738 `_, `pr#14711 `_, John Spray) * mon: logclient: use the seq id of the 1st log entry when resetting session (`issue#19427 `_, `pr#14927 `_, Kefu Chai) * mon: mon/OSDMonitor: check get()'s return value instead of bl's length (`pr#14805 `_, Kefu Chai) * mon: mon/OSDMonitor: cleanup pending_created_pgs after done with it (`pr#14898 `_, Kefu Chai) * mon: mon/OSDMonitor: do not alter the "created" epoch of a pg (`issue#19787 `_, `pr#14849 `_, Kefu Chai) * mon: mon/OSDMonitor: fix output func name in can_mark_out (`pr#14758 `_, xie xingguo) * mon: mon/OSDMonitor: increase last_epoch_clean's lower bound if possible (`pr#14855 `_, Kefu Chai) * mon: mon/OSDMonitor: tolerate upgrade from post-kraken dev cluster (`pr#14442 `_, Sage Weil) * mon: mon/OSDMonitor: update creating_pgs using pending_creatings (`issue#19814 `_, `pr#14897 `_, Kefu Chai) * mon: mon/OSDMonitor: update pg_creatings even the new acting set is empty (`issue#19744 `_, `pr#14730 `_, Kefu Chai) * mon: mon/PGMap: add up_primary pg number field for pg-dump cmd (`pr#13451 `_, xie xingguo) * mon: mon/PGMap.cc: fix "osd_epochs" section of dump_basic (`pr#14996 `_, xie xingguo) * mon: mon/PGMonitor: clean up min/max span warning (`pr#14611 `_, Sage Weil) * mon: move 'pg map' to OSDMonitor (`pr#14559 `_, Sage Weil) * mon: osd/PGMonitor: always update pgmap with latest osdmap (`issue#19398 `_, `pr#14777 `_, Kefu Chai) * mon: show inactive % in ceph status (`pr#14810 `_, Sage Weil) * msg: Increase loglevels on some messages (`pr#14707 `_, Willem Jan Withagen) * msg: msg/async/net_handler: errno should be stored before calling next function (`pr#14985 `_, Zhou Zhengping) * msg: msg/simple: use my addr when setting sock priority (`issue#19801 `_, `pr#14878 `_, Kefu Chai) * msg: src/msg/async/AsyncConnect.cc: Use of sizeof() on a Pointer Type (`pr#14773 `_, Svyatoslav) * msg: src/msg/simple/Pipe.cc: Fix the inclusion of '}' (`pr#14843 `_, Willem Jan Withagen) * osd: check fsid is normal before osd mkfs (`pr#13898 `_, song baisen) * osd: clean nonused work queue (`pr#14990 `_, Wei Jin) * osd: eliminate snapdir objects and move clone snaps vector into SnapSet (`pr#13610 `_, Sage Weil) * osd: fix occasional MOSDMap leak (`issue#18293 `_, `pr#14558 `_, Sage Weil) * osd: fix typo in comment (`pr#13061 `_, Gu Zhongyan) * osd: Implement asynchronous scrub sleep (`issue#19497 `_, `pr#14886 `_, Brad Hubbard) * osd: Implement peering state timing (`pr#14627 `_, Brad Hubbard) * osd: objclass sdk (`pr#14723 `_, Neha Ojha) * osd: osdc/Objecter: more constness (`pr#14819 `_, Kefu Chai) * osd: osdc: silence warning from `-Wsign-compare` (`pr#14729 `_, Jos Collin) * osd: pglog trimming fixes (`pr#12882 `_, Zhiqiang Wang) * osd: pglog: with config, don't assert in the presence of stale diverg… (`issue#17916 `_, `pr#14648 `_, Greg Farnum) * osd: put osdmap in mempool (`pr#14780 `_, Sage Weil) * osd: renamed the new vector name in OSDMap::build_simple_crush_map_from_conf (`pr#14583 `_, Jos Collin) * osd: ReplicatedBackend::prep_push() remove redundant variable assignments (`pr#14817 `_, Jin Cai) * osd: sched_scrub() lock pg only if all scrubbing conditions are fulfilled (`pr#14968 `_, Jin Cai) * osd: simplify past_intervals representation (`pr#14444 `_, Samuel Just, Sage Weil) * osd: stop mgrc earlier in shutdown() (`issue#19638 `_, `pr#14904 `_, Kefu Chai) * osd: stop MgrClient callbacks on shutdown (`issue#19638 `_, `pr#14896 `_, Sage Weil) * osd: strip pglog op name (`pr#14764 `_, liuchang0812) * osd: support dumping long ops (`pr#13019 `_, Zhiqiang Wang) * osd: switch filestore to default to rocksdb (`pr#14814 `_, Neha Ojha) * osd: tag fast dispatch messages with min_epoch (`pr#13681 `_, Sage Weil) * osd: use append(bufferlist &) to avoid unnecessary copy (`pr#12272 `_, Yunchuan Wen) * osd: zipkin tracing (`pr#14305 `_, Sage Weil, Marios-Evaggelos Kogias, Victor Araujo, Casey Bodley, Andrew Shewmaker, Chendi.Xue) * performance: buffer, osd: add missing crc cache miss perf counter (`pr#14957 `_, Piotr Dałek) * performance: osd/PG.cc: loop invariant code motion (`pr#12720 `_, Li Wang) * pybind: better error msg (`pr#14497 `_, Kefu Chai) * pybind: fix open flags calculation (`issue#19890 `_, `pr#15018 `_, "Yan, Zheng") * qa: qa/added overrides (`pr#14917 `_, Yuri Weinstein) * qa: qa/suite: replace reference to fs/xfs.yaml (`pr#14756 `_, Yehuda Sadeh) * qa: qa/suites/rados/singleton-bluestore: concat settings (`pr#14884 `_, Kefu Chai) * rbd: cls_rbd: default initialize snapshot namespace for legacy clients (`issue#19413 `_, `pr#14903 `_, Jason Dillaman) * rbd: common/bit_vector: utilize deep-copy during data decode (`issue#19863 `_, `pr#15017 `_, Jason Dillaman) * rbd: import needs to sanity check auto-generated image name (`issue#19128 `_, `pr#14754 `_, Mykola Golub) * rbd: info command should indicate if parent is in trash (`pr#14875 `_, Jason Dillaman) * rbd-mirror: ensure missing images are re-synced when detected (`issue#19811 `_, `pr#14945 `_, Jason Dillaman) * rbd-mirror: failover and failback of unmodified image results in split-brain (`issue#19858 `_, `pr#14963 `_, Jason Dillaman) * rbd-mirror: image deletions should be handled by assigned instance (`pr#14832 `_, Jason Dillaman) * rbd-mirror: remove tracking of image names from pool watcher (`pr#14712 `_, Jason Dillaman) * rbd-mirror: resolve admin socket path names collision (`issue#19907 `_, `pr#15048 `_, Mykola Golub) * rbd-nbd: relax size check for newer kernel versions (`issue#19871 `_, `pr#14976 `_, Mykola Golub) * rbd: rbd/bench: add notes of default values, it's easy to use (`pr#14762 `_, Zheng Yin) * rbd: rbd, librbd: migrate atomic_t to std::atomic (`pr#14656 `_, Jesse Williamson) * rbd: rbd-mirror A/A: proxy InstanceReplayer APIs via InstanceWatcher RPC (`issue#18787 `_, `pr#13978 `_, Mykola Golub) * rbd: recognize exclusive option (`pr#14785 `_, Ilya Dryomov) * rbd: removed spurious error message from mirror pool commands (`pr#14935 `_, Jason Dillaman) * rbd: stop indefinite thread waiting in krbd udev handling (`issue#17195 `_, `pr#14051 `_, Spandan Kumar Sahu) * rbd,tests: qa: krbd write-after-checksum tests (`pr#14836 `_, Ilya Dryomov) * rbd,tests: qa/workunits/rbd: increased trash deferment period (`pr#14846 `_, Jason Dillaman) * rbd,tests: qa/workunits: switch to OpenStack Ocata release for RBD testing (`pr#14465 `_, Jason Dillaman) * rbd,tests: test/librbd/test_librbd.cc: set \*features even if RBD_FEATURES is unset (`issue#19865 `_, `pr#14965 `_, Dan Mick) * rbd,tests: test/librbd/test_notify.py: don't disable feature in slave (`issue#19716 `_, `pr#14751 `_, Mykola Golub) * rbd,tests: test/rbd_mirror: race in TestMockLeaderWatcher.AcquireError (`issue#19405 `_, `pr#14741 `_, Mykola Golub) * rbd,tests: test: remove hard-coded image name from RBD metadata test (`issue#19798 `_, `pr#14848 `_, Jason Dillaman) * rdma: msg/async/rdma: add inqueue rx chunks perf counter (`pr#14782 `_, Haomai Wang) * rgw: add bucket size limit check to radosgw-admin (`issue#17925 `_, `pr#11796 `_, Matt Benjamin) * rgw: add 'state==SyncState::IncrementalSync' condition when add item … (`pr#14552 `_, Shasha Lu) * rgw: add support container and object levels of swift bulkupload (`pr#14775 `_, Jing Wenjun) * rgw: add support for FormPost of Swift API (`issue#17273 `_, `pr#11179 `_, Radoslaw Zarzynski, Orit Wasserman) * rgw: civetweb don't go past the array index while calling mg_start (`issue#19749 `_, `pr#14750 `_, Abhishek Lekshmanan, Jesse Williamson) * rgw: clean unuse code in cls_statelog_check_state (`pr#10260 `_, weiqiaomiao) * rgw: cleanup: fix variable name in RGWRados::create_pool() declaration (`pr#14547 `_, Nathan Cutler) * rgw: cleanup lc continuation (`pr#14906 `_, Jiaying Ren) * rgw: cls/rgw: list_plain_entries() stops before bi_log entries (`issue#19876 `_, `pr#14981 `_, Casey Bodley) * rgw: custom user data header (`issue#19644 `_, `pr#14592 `_, Pavan Rallabhandi) * rgw: deduplicate variants of rgw_make_bucket_entry_name(). (`pr#14299 `_, Radoslaw Zarzynski) * rgw: don't do unneccesary write if buffer with zero length (`pr#14925 `_, fang yuxiang) * rgw: dont spawn error_repo until lease is acquired (`issue#19446 `_, `pr#14714 `_, Casey Bodley) * rgw: drop unused param "bucket" from select_bucket_placement (`pr#14390 `_, Shasha Lu) * rgw: drop unused port var (`pr#14412 `_, Jiaying Ren) * rgw: fix broken /crossdomain.xml, /info and /healthcheck of Swift API. (`issue#19520 `_, `pr#14373 `_, Radoslaw Zarzynski) * rgw: fix forward request for bulkupload to be applied in multisite (`issue#19645 `_, `pr#14601 `_, Jing Wenjun) * rgw: fix handling of --remote in radosgw-admin period commands (`issue#19554 `_, `pr#14407 `_, Casey Bodley) * rgw: fix RadosGW hang during multi-chunk upload of AWSv4. (`issue#19754 `_, `pr#14770 `_, Radoslaw Zarzynski) * rgw: migrate atomic_t to std::atomic<> (ebirah) (`pr#14839 `_, Jesse Williamson) * rgw: radosgw-admin: warn that 'realm rename' does not update other clusters (`issue#19746 `_, `pr#14722 `_, Casey Bodley) * rgw: rgw_file: cleanup virtual keyword on derived functions (`pr#14908 `_, Gui Hecheng) * rgw: rgw_ldap: log the ldap err in case of bind failure (`pr#14781 `_, Abhishek Lekshmanan) * rgw: rgw multisite: automated mdlog trimming (`pr#13111 `_, Casey Bodley) * rgw: rgw multisite: fixes for meta sync across periods (`issue#18639 `_, `pr#13070 `_, Casey Bodley) * rgw: rgw multisite: remove the redundant post in OPT_ZONEGROUP_MODIFY (`pr#14359 `_, Jing Wenjun) * rgw: RGWPeriodPusher spawns http thread before cr thread (`issue#19834 `_, `pr#14936 `_, Casey Bodley) * rgw: rgw_rados drop deprecated global var (`pr#14411 `_, Jiaying Ren) * rgw: should delete in_stream_req if conn->get_obj(...) return not zero value (`pr#9950 `_, weiqiaomiao) * rgw: swift: ability to update swift read and write acls separately. (`issue#19289 `_, `pr#14499 `_, Marcus Watts) * rgw: swift: disable revocation thread if sleep == 0 (`issue#19499 `_, `issue#9493 `_, `pr#14501 `_, Marcus Watts) * rgw,tests: qa/rgw: add cluster name to path when s3tests scans rgw log (`pr#14845 `_, Casey Bodley) * rgw,tests: qa/rgw: don't scan radosgw logs for encryption keys on jewel upgrade test (`pr#14697 `_, Casey Bodley) * rgw,tests: qa/rgw: fix assertions in radosgw_admin task (`pr#14842 `_, Casey Bodley) * rgw,tests: test/rgw: fixes for test_multi_period_incremental_sync() (`pr#13067 `_, Casey Bodley) * rgw,tests: test/rgw: fix for empty lists as default arguments (`pr#14816 `_, Casey Bodley) * rgw: update Beast for streaming reads in asio frontend (`pr#14273 `_, Casey Bodley) * rgw: using the same bucket num_shards as master zg when create bucket in secondary zg (`issue#19745 `_, `pr#14388 `_, Shasha Lu) * rgw: when create_bucket use the same num_shards with info.num_shards (`issue#19745 `_, `pr#15010 `_, Shasha Lu) * tests: ceph_test_rados_api_tier: tolerate ENOENT from 'pg scrub' (`pr#14807 `_, Sage Weil) * tests: cephtool/test.sh error on full tests (`issue#19698 `_, `pr#14647 `_, Willem Jan Withagen, David Zafman) * tests: Don't dump core when using EXPECT_DEATH (`pr#14821 `_, Kefu Chai, Brad Hubbard) * tests: fio_ceph_objectstore: fixes improper write request data lifetime (`pr#14338 `_, Adam Kupczyk) * tests: fix broken links in upgrade/hammer-jewel-x/stress-split (`issue#19793 `_, `pr#14831 `_, Nathan Cutler) * tests: include/denc: support ENCODE_DUMP (`pr#14962 `_, Sage Weil) * tests: libradosstriper: do not assign garbage to returned value (`pr#15009 `_, Kefu Chai) * tests: qa/erasure-code: override min_size to 2 (`issue#19770 `_, `pr#14872 `_, Kefu Chai) * tests: qa/suites/jewel-x/point-to-point: don't scane for keys on second s3tests either (`pr#14788 `_, Sage Weil) * tests: qa/suites: Reduce fs combination tests for smoke, use bluestore (`pr#14854 `_, Vasu Kulkarni) * tests: qa/suites: Revert "qa/suites: add mon-reweight-min-pgs-per-osd = 4" (`pr#14584 `_, Kefu Chai) * tests: qa/suites/upgrade/jewel-x: add mgr.x role (`pr#14689 `_, Sage Weil) * tests: qa/suites/upgrade/kraken-x: misc fixes (`pr#14887 `_, Sage Weil) * tests: qa/tasks/ceph_manager: always fix pgp_num when done with thrashosd task (`issue#19771 `_, `pr#14931 `_, Kefu Chai) * tests: qa/tasks: few fixes to get ceph-deploy 1node to working state (`pr#14400 `_, Vasu Kulkarni) * tests: rados: move cephtool.yaml to new singleton/bluestore subsuite (`issue#19797 `_, `pr#14847 `_, Nathan Cutler) * tests: set -x in suites/iozone.sh workunit (`issue#19740 `_, `pr#14713 `_, Nathan Cutler) * tests: test/compressor: disable isal tests if not available (`pr#14929 `_, Kefu Chai) * tests: test: c_read_operations.cc: silence warning from -Wsign-compare (`pr#14888 `_, Jos Collin) * tests: test/fio_ceph_objectstore: fix fio plugin build failure by engine_data (`pr#15044 `_, lisali) * tests: test: fixing assert that creates warning: comparison between signed and unsigned integer expressions (`pr#14794 `_, Jos Collin) * tests: test/fsx: Remove the dead code associated with aio backend (`pr#14905 `_, Zhou Zhengping) * tests: test/objectstore/: Check put_ref return value (`pr#15007 `_, zhanglei) * tests: test/osd/osd-dup.sh: lower wb fd throttle limits (`pr#14984 `_, Dan Mick) * tests: test: use 7130 for crush-classes.sh (`pr#14783 `_, Loic Dachary) * tests: test: warning: comparison between signed and unsigned integer expressions (`pr#14705 `_, Jos Collin) * tests,tools: test: kill warnings (`pr#14892 `_, Kefu Chai) * tools: change compare_exchange_weak to compare_exchange_strong (`pr#15030 `_, Jesse Williamson) * tools: rados: check for negative return value of rados_create_with_context() as its comment put (`pr#10893 `_, zhang.zezhu) * tools: rados: out json 'df' values as numbers, not strings (`issue#15546 `_, `pr#14644 `_, Sage Weil) v12.0.2 Luminous (dev) ====================== This is the third development checkpoint release of Luminous, the next long term stable release. Major changes from v12.0.1 -------------------------- Notable Changes --------------- * bluestore: bluestore/NVMEDEVICE: update SPDK to version 17.03 (`pr#14585 `_, optimistyzy) * bluestore: bluestore, NVMeDevice: use task' own lock for (random) read (`pr#14094 `_, optimistyzy) * bluestore: common/config: set rocksdb_cache_size to OPT_U64 (`pr#13995 `_, liuhongtong) * bluestore: os/bluestore: avoid nullptr in bluestore_extent_ref_map_t::bound_encode (`pr#14073 `_, Sage Weil) * bluestore: os/bluestore: clean up flush logic (`pr#14162 `_, Jianpeng Ma) * bluestore: os/bluestore: clean up Invalid return value judgment (`pr#14219 `_, shiqi) * bluestore: os/bluestore: fix bug for calc extent_avg in reshard function (`pr#13931 `_, wangzhengyong) * bluestore: os/bluestore: fix bug in _open_alloc() (`pr#13577 `_, yonghengdexin735) * bluestore: os/bluestore: fix deferred writes vs collection split race (`issue#19379 `_, `pr#14157 `_, Sage Weil) * bluestore: os/bluestore: fix narrow osr->flush() race (`pr#14489 `_, Sage Weil) * bluestore: os/bluestore: fix perf counters (`pr#13965 `_, Sage Weil) * bluestore: os/bluestore: fsck: verify blob.unused field (`pr#14316 `_, Sage Weil) * bluestore: os/bluestore: implement collect_metadata (`pr#14115 `_, Sage Weil) * bluestore: os/bluestore: make live changes for BlueStore throttle config work like initial config (`pr#14225 `_, J. Eric Ivancich) * bluestore: os/bluestore: misc fixes (`pr#14333 `_, Sage Weil) * bluestore: os/bluestore/NVMEDevice: Add multiple thread support for SPDK I/O thread (`pr#14420 `_, Ziye Yang) * bluestore: os/bluestore/NVMEDevice: fix the compilation issue for collect_metadata (`pr#14455 `_, optimistyzy) * bluestore,performance: kv/RocksDBStore: implement rm_range_keys operator interface and test (`pr#13855 `_, Haomai Wang) * bluestore,performance: os/bluestore: avoid the VTABLE-related burden in BitMapAllocator's hotspot (`pr#14348 `_, Radoslaw Zarzynski) * bluestore,performance: os/bluestore/BlueFS: optimize get_allocated (`pr#14121 `_, Jianpeng Ma) * bluestore,performance: os/bluestore: memory and dereference clean-up in the BitAllocator (`pr#13811 `_, Radoslaw Zarzynski) * bluestore,performance: os/bluestore: optimize blob usage when doing appends/overwrites (`pr#13337 `_, Igor Fedotov) * bluestore,performance: os/bluestore: refactor small write handling to reuse blob more effect… (`pr#14399 `_, Igor Fedotov) * bluestore,performance: os/bluestore: tune deferred_batch_ops separately for hdd and ssd (`pr#14435 `_, Sage Weil) * bluestore,performance: os/bluestore: unify throttling model (`issue#19542 `_, `pr#14306 `_, Sage Weil) * bluestore,tests: test/objectstore/store_test_fixture.cc: Exclude bluestore code if required. (`pr#14085 `_, Willem Jan Withagen) * bluestore,tests: test/store_test: fix bluestore test cases disablement (`pr#14228 `_, Igor Fedotov) * bluestore,tests: test/unittest_bluefs: check whether add_block_device success (`pr#14013 `_, shiqi) * bluestore,tools: ceph-bluestore-tool: rename from bluefs-tool; improve usage (`pr#14258 `_, Sage Weil) * build/ops: add 12.0.1 release tag in master (`pr#14690 `_, Jenkins Build Slave User) * build/ops: arch: fix build on PowerPC with FreeBSD (`pr#14378 `_, Andrew Solomon) * build/ops: arch: fix cmake's ARM CRC intrinsics test to handle duplicitous gcc 4.8.5 (`issue#19386 `_, `pr#14132 `_, Dan Mick) * build/ops: build: Add Virtuozzo Linux support (`pr#14301 `_, Andrey Parfenov) * build/ops,core: build: let FreeBSD build ceph-fuse (`pr#14282 `_, Willem Jan Withagen) * build/ops,core: os: allow offline conversion of filestore -> bluestore (or anything else) (`pr#14210 `_, Sage Weil) * build/ops: debian: package ceph.logroate properly (`issue#19390 `_, `pr#14600 `_, Kefu Chai) * build/ops: debian/rules, ceph.spec.in: invoke cmake with -DBOOST_J (`pr#14114 `_, Dan Mick) * build/ops: gitignore: Ignore rejects by patch (`pr#14405 `_, Willem Jan Withagen) * build/ops,performance: crc32c: Add crc32c function optimized for ppc architecture (`pr#13909 `_, Andrew Solomon) * build/ops: rocksdb: sync with upstream (`pr#14456 `_, Kefu Chai) * build/ops: script: drop the -x arg for credits script (`pr#14296 `_, Abhishek Lekshmanan) * build/ops: systemd/ceph-disk: make it possible to customize timeout (`issue#18740 `_, `pr#13197 `_, Alexey Sheplyakov) * build/ops: systemd: remove ceph-create-keys from presets (`pr#14226 `_, Sébastien Han) * build/ops: Tested-by: Andrew Solomon (`pr#14289 `_, Kefu Chai) * build/ops,tests: test/osd/CMakeLists.txt: osd-dup.sh require BlueStore/AIO (`pr#14387 `_, Willem Jan Withagen) * build/ops: vstart: do not start mgr if not start_all (`pr#13974 `_, Kefu Chai) * cephfs: client/Client.cc: after reset session from MDS - reconnect (`issue#18757 `_, `pr#13522 `_, Henrik Korkuc) * cephfs: client: drop cap snaps when auth mds session gets closed (`issue#19022 `_, `pr#13579 `_, "Yan, Zheng") * cephfs: client: refine fsync/close writeback error handling (`pr#14589 `_, John Spray) * cephfs: client: specify inode in get_caps log message (`pr#13966 `_, John Spray) * cephfs,core: Add test for is_hacky_ecoverwrites in cephfs pool checks (`pr#13466 `_, John Spray) * cephfs: fix mount point break off problem after mds switch occured (`issue#19437 `_, `pr#14267 `_, Guan yunfei) * cephfs: fix write_buf's _len overflow problem (`issue#19033 `_, `pr#13587 `_, Yang Honggang) * cephfs: Mitigation for #16842, validate sessions after load (`issue#16842 `_, `pr#14164 `_, John Spray) * cephfs: Permit recovering metadata into a new RADOS pool (`issue#15069 `_, `issue#15068 `_, `pr#10636 `_, Douglas Fuller) * cephfs: qa: whitelist new fullness messages in fs tests (`issue#19253 `_, `pr#13915 `_, John Spray) * cephfs,tests: qa: fix test_standby_for_invalid_fscid with vstart_runner (`pr#14272 `_, John Spray) * cephfs,tests: qa, mds: add checks for fragmentation, and enable it by default (`issue#16523 `_, `pr#13862 `_, john Spray, John Spray) * cephfs,tests: qa: re-enable ENOSPC tests for kclient (`issue#19550 `_, `pr#14396 `_, John Spray) * cephfs,tests: qa/vstart_runner: amend ps invocation (`pr#14254 `_, Ilya Dryomov) * cephfs,tests: qa: whitelist another fullness log message (`issue#19253 `_, `pr#14221 `_, John Spray) * cephfs: tools/cephfs: set dir_layout when injecting inodes (`issue#19406 `_, `pr#14234 `_, John Spray) * cleanup: misc: Warning Elimination (`pr#14439 `_, Adam C. Emerson) * cmake: Add simple recursive ctags target for Ceph source only (`pr#14334 `_, Kefu Chai, Dan Mick) * cmake: build tracepoint libraries for vstart target (`pr#14354 `_, Mohamad Gebai) * cmake: do not try to add submodule to exclude list if .git is not around (`pr#14495 `_, Kefu Chai) * cmake: support optional argument for overriding default ctag excludes (`pr#14379 `_, Kefu Chai) * common: common/ceph_context: fewer warnings about experimental features (`pr#14170 `_, Sage Weil) * common: common/config: eliminate config_t::set_val unsafe option (`issue#19106 `_, `pr#13687 `_, liuchang0812) * common: common/perf_counters: fix race condition with atomic variables (`pr#14227 `_, J. Eric Ivancich) * common,core,tests: Wip cppcheck errors (`pr#14446 `_, Brad Hubbard) * common: do not print error when asok is closed (`pr#14022 `_, Patrick Donnelly) * common: Fix heap buffer overflow in do_request (`issue#19393 `_, `pr#14173 `_, Brad Hubbard) * common: fix segfault in public IPv6 addr picking (`issue#19371 `_, `pr#14124 `_, Fabian Grünbichler) * common: Implements simple_spin_t in terms of std::atomic_flag. (`pr#14370 `_, Jesse Williamson) * common: include/denc: remove nullptr runtime magic boundedness check (`pr#13889 `_, Sage Weil) * common: Make spinlock delay more conventional (`pr#14248 `_, Brad Hubbard) * common: messages: fix return type name of MOSDMap (`pr#14382 `_, Leo Zhang) * common: msg/async/rdma: Add fork safe on RDMA (`pr#13740 `_, Sarit Zubakov) * common: msg/async/rdma: Update fix broken compilation (`pr#13940 `_, Sarit Zubakov) * common: msg/SimpleMessenger: error out misplace in set_socket_options (`pr#13961 `_, wangzhengyong) * common: .organizationmap: Updated authors (`pr#14360 `_, Jos Collin) * common: rados: more info added to pool deletion error (`issue#19400 `_, `pr#14235 `_, Vedant Nanda) * common,rbd,rgw: common/escape: do not escape / in json (`pr#14130 `_, Sage Weil) * common,rdma: msg/async/rdma: Introduce RDMAConnMgr + Debug prints (`pr#14201 `_, Amir Vadai) * common,rdma: msg/async/rdma: Move resource handling to Device (`pr#14088 `_, Sarit Zubakov, Amir Vadai) * common,rdma: msg/async/rdma: RDMA-CM Initialize device on first connect (`pr#14179 `_, Amir Vadai) * common: remove useless parameter (`pr#14096 `_, baiyanchun) * common: support s390 and unknown architectures in spin-wait loop (`issue#19492 `_, `pr#14337 `_, Nathan Cutler) * common: Tested-by: Andrew Solomon (`pr#14310 `_, Kefu Chai) * common: tracing: fix segv (`issue#18576 `_, `pr#14304 `_, Anjaneya Chagam) * core: ceph-disk: Adding retry loop in get_partition_dev() (`pr#14275 `_, Erwan Velu) * core: ceph-disk/ceph_disk/main.py: fix calling of the bsdrc init scripts (`pr#14476 `_, Willem Jan Withagen) * core: ceph-disk: enable directory backed OSD at boot time (`issue#19628 `_, `pr#14546 `_, Loic Dachary) * core: ceph-disk,osd: add support for crush device classes (`issue#19513 `_, `pr#14436 `_, Loic Dachary) * core: ceph-disk: Populate mount options when running "list" (`issue#17331 `_, `pr#14293 `_, Brad Hubbard) * core: ceph-disk: Write 10M to all partitions before zapping (`issue#18962 `_, `pr#13766 `_, Wido den Hollander) * core: client/SyntheticClient.cc: Fix warning in random_walk (`issue#19445 `_, `pr#14308 `_, Brad Hubbard) * core: cls/timeindex: clean up cls_timeindex_client.h|cc (`pr#13987 `_, Shinobu Kinjo) * core: common/TrackedOp: allow dumping historic ops sorted by duration (`pr#14050 `_, Piotr Dałek) * core: Give requested scrub work a higher priority (`issue#15789 `_, `pr#14488 `_, David Zafman) * core: include/denc: add {encode,decode}_nohead for denc_traits (`issue#18938 `_, `pr#14099 `_, Kefu Chai) * core,mgr,mon: mon,mgr: tag some commands for ceph-mgr (`pr#13617 `_, Sage Weil) * core: misc: SCA fixes (`pr#14426 `_, Danny Al-Gaaf) * core,mon: mon,osd: decouple creating pgs from pgmap (`pr#13999 `_, Kefu Chai) * core: mon/OSDMonitor: osd crush set-device-class (`issue#19307 `_, `pr#14039 `_, Loic Dachary) * core: msg/async/AsyncConnection: keepalive objecter ping connection to avoid timeout (`pr#14009 `_, Haomai Wang) * core: osdc/Objecter: respect epoch barrier in _op_submit() (`issue#19396 `_, `pr#14190 `_, Ilya Dryomov) * core: osd,mon: misc full fixes and cleanups (`pr#13968 `_, David Zafman) * core: osd/OSDMap: hide require_*_osd and sortbitwise flags (`pr#14440 `_, Sage Weil) * core: osd/PG: only correct filestore collection bits on load (`issue#19541 `_, `pr#14397 `_, Sage Weil) * core: osd/PG: some minor cleanups (`pr#14133 `_, runsisi) * core: osd/PrimaryLogPG: do not expect FULL_TRY ops to get resent (`issue#19430 `_, `pr#14255 `_, Sage Weil) * core: osd/PrimaryLogPG: set return value if sparse read failed (`pr#14093 `_, huangjun) * core: osd/ReplicatedBackend: clear pull source once we are done with it (`issue#19076 `_, `pr#13879 `_, Samuel Just) * core: os/filestore: handle error returned from write_fd() (`pr#10146 `_, yonghengdexin735) * core: os/fs/FS.cc: remove the redundant code. (`pr#14362 `_, Jianpeng Ma) * core: os/kstore: some error handling (`pr#13960 `_, wangzhengyong) * core,performance: mon,osd: explicitly remap some pgs (`pr#13984 `_, Sage Weil) * core: Prefix /proc/ with FreeBSD emulation (`pr#14290 `_, Willem Jan Withagen) * core: Revert "msg/async/rdma: Debug prints for ibv_*" (`pr#14245 `_, Kefu Chai) * core: test, osd: fix some coverity issues (`pr#13293 `_, liuchang0812) * core,tests: ceph-disk/tests/test_main.py: FreeBSD does not do multipath (`pr#13847 `_, Willem Jan Withagen) * core,tests: ceph_test_librados_api_misc: fix stupid LibRadosMiscConnectFailure.ConnectFailure test (`issue#15368 `_, `pr#14261 `_, Sage Weil) * core,tests: os: allow 'osd objectstore = random' to pick either filestore or bluestore (`pr#13754 `_, Sage Weil) * core,tests,tools: Fixes: http://tracker.ceph.com/issues/18533 (`pr#13423 `_, Samuel Just, David Zafman) * crush: bucket: crush_add_uniform_bucket_item should check for uniformity (`pr#14208 `_, Sahid Orentino Ferdjaoui) * crush: builder: clean the arguments of crush_reweight* methods (`pr#14110 `_, Sahid Orentino Ferdjaoui) * crush: builder: creating crush map with optimal configurations (`pr#14209 `_, Sahid Orentino Ferdjaoui) * crush: only encode class info if SERVER_LUMINOUS (`issue#19361 `_, `pr#14131 `_, Sage Weil) * crush: weight_set and id remapping (`issue#15653 `_, `pr#14486 `_, Loic Dachary) * doc: add 12.0.1 release notes (`pr#14106 `_, Abhishek Lekshmanan) * doc: add changelog for v10.2.7 (`pr#14441 `_, Abhishek Lekshmanan) * doc: add RGW ldap auth documentation (`pr#14339 `_, Harald Klein) * doc: add some undocumented options to rbd-nbd (`pr#14134 `_, wangzhengyong) * doc: change osd_op_thread_timeout default value to 15 (`pr#14199 `_, Andreas Gerstmayr) * doc: correct arguments for ceph tell osd.N bench (`pr#14462 `_, Patrick Dinnen) * doc: correct the quota section (`issue#19397 `_, `pr#14122 `_, Chu, Hua-Rong) * doc: dev guide: how to run s3-tests locally against vstart (`pr#14508 `_, Nathan Cutler, Abhishek Lekshmanan) * doc: doc/radosgw/s3/cpp.rst: update usage of libs3 APIs to make the examples work (`pr#10851 `_, Weibing Zhang) * doc: doc/rgw: instructions for changing multisite master zone (`pr#14089 `_, Casey Bodley) * doc: docs: Clarify the relationship of min_size to EC pool recovery (`pr#14419 `_, Brad Hubbard) * doc: how to specify filesystem for cephfs clients (`pr#14087 `_, John Spray) * doc: mailmap: shiqi affiliation (`pr#14361 `_, shiqi) * doc: mention --show-mappings in crushtool manpage (`issue#19649 `_, `pr#14599 `_, Nathan Cutler, Loic Dachary) * doc: PendingReleaseNotes: recent cephfs changes (`pr#14196 `_, John Spray) * doc: radosgw-admin: new 'global quota' commands update period config (`issue#19409 `_, `pr#14252 `_, Casey Bodley) * doc: README.FreeBSD: Update the status (`pr#14406 `_, Willem Jan Withagen) * doc: README.md: fix build instructions inconsistent. (`pr#14555 `_, Yao Zongyou) * doc: README.md: use github heading syntax to mark the headings (`pr#14591 `_, Kefu Chai) * doc: release notes for v10.2.7 Jewel (`pr#14295 `_, Abhishek Lekshmanan) * doc: rgw: remove mention of megabytes for quotas (`pr#14413 `_, Hans van den Bogert) * doc: rgw server-side encryption and barbican (`pr#13483 `_, Adam Kupczyk, Casey Bodley) * doc: script: build-doc/serve-doc fixes (`pr#14438 `_, Abhishek Lekshmanan) * doc: script: ceph-release-notes: use https instead of http (`pr#14103 `_, Kefu Chai) * doc: typo in hit_set_search_last_n (`pr#14108 `_, Sven Seeberg) * doc: update packages mentioned by build-doc and related doc (`pr#14649 `_, Yu Shengzuo) * doc: wip-doc-multisite ports downstream multisite document upstream. (`pr#14259 `_, John Wilkins) * librados: set the flag CEPH_OSD_FLAG_FULL_TRY of Op in the right place. (`pr#14193 `_, Pan Liu) * librbd: changed the return type of ImageRequestWQ::discard() (`issue#18511 `_, `pr#14032 `_, Jos Collin) * librbd: corrected resize RPC message backwards compatibility (`issue#19636 `_, `pr#14615 `_, Jason Dillaman) * librbd: deferred image deletion (`issue#18481 `_, `pr#13105 `_, Ricardo Dias) * librbd: fix rbd_metadata_list and rbd_metadata_get (`issue#19588 `_, `pr#14471 `_, Mykola Golub) * librbd: is_exclusive_lock_owner API should ping OSD (`issue#19287 `_, `pr#14003 `_, Jason Dillaman) * librbd: Notifier::notify API improvement (`pr#14072 `_, Mykola Golub) * librbd: pass an uint64_t to clip_io() as the third param (`issue#18938 `_, `pr#14159 `_, Kefu Chai) * librbd: potential use of uninitialised value in ImageWatcher (`pr#14091 `_, Mykola Golub) * librbd: random unit test failures due to shut down race (`issue#19389 `_, `pr#14166 `_, Jason Dillaman) * mds: bump client_reply debug to match client_req (`pr#14036 `_, Patrick Donnelly) * mds: don't assert on read errors in RecoveryQueue (`issue#19282 `_, `pr#14017 `_, John Spray) * mds: drop partial entry and adjust write_pos when opening PurgeQueue (`issue#19450 `_, `pr#14447 `_, "Yan, Zheng") * mds: fix mgrc shutdown (`issue#19566 `_, `pr#14505 `_, John Spray) * mds: fix stray creation/removal notification (`issue#19630 `_, `pr#14554 `_, "Yan, Zheng") * mds: ignore ENOENT on writing backtrace (`issue#19401 `_, `pr#14207 `_, John Spray) * mds: include advisory `path` field in damage (`issue#18509 `_, `pr#14104 `_, John Spray) * mds: make C_MDSInternalNoop::complete() delete 'this' (`issue#19501 `_, `pr#14347 `_, "Yan, Zheng") * mds: miscellaneous multimds fixes (`issue#19022 `_, `pr#13698 `_, "Yan, Zheng") * mds: pretty json from `tell` commands (`pr#14105 `_, John Spray) * mds: remove legacy "mds tell" command (`issue#19288 `_, `pr#14015 `_, John Spray) * mds: Return error message instead of asserting (`pr#14469 `_, Brad Hubbard) * mds: set ceph-mds name uncond for external tools (`issue#19291 `_, `pr#14021 `_, Patrick Donnelly) * mds: shut down finisher before objecter (`issue#19204 `_, `pr#13859 `_, John Spray) * mds: use same inode count in health check as in trim (`issue#19395 `_, `pr#14197 `_, John Spray) * mds: warn if insufficient standbys exist (`issue#17604 `_, `pr#12074 `_, Patrick Donnelly) * mgr: always free allocated MgrPyModule (`issue#19590 `_, `pr#14507 `_, Kefu Chai) * mgr: ceph-mgr: rotate logs on sighup (`issue#19568 `_, `pr#14437 `_, Dan van der Ster) * mgr: fix bugs in init, beacons (`issue#19516 `_, `issue#19502 `_, `pr#14374 `_, Sage Weil) * mgr: fix crash on missing 'ceph_version' in daemon metadata (fixes #18764) (`issue#18764 `_, `pr#14129 `_, Tim Serong) * mgr: fix several init/re-init bugs (`issue#19491 `_, `pr#14328 `_, Sage Weil) * mgr: mgr/MgrClient: fix reconnect event leak (`issue#19580 `_, `pr#14431 `_, Sage Weil) * mgr,mon: mon,mgr: move reweight-by-* to mgr (`pr#14404 `_, Kefu Chai) * mgr: move 'osd perf' and 'osd blocked-by' to mgr (`pr#14303 `_, Sage Weil) * mgr: move "osd pool stats" to mgr (`pr#14365 `_, Kefu Chai) * mgr: optimization some judgment and adjust the debug remove value in register_new_pgs (`pr#14046 `_, song baisen) * mgr: pass through cluster log to plugins (`pr#13690 `_, John Spray) * mgr: pybind/mgr/rest: don't set timezone to Chicago (`pr#14184 `_, Tim Serong) * mgr: remove unused function declarations (`pr#14366 `_, Wei Jin) * mgr: rm nonused main function (`pr#14313 `_, Wei Jin) * mgr: shutdown py_modules in Mgr::shutdown() (`issue#19258 `_, `pr#14078 `_, Kefu Chai) * mon: add mon_debug_no_require_luminous (`pr#14490 `_, Sage Weil) * mon: avoid segfault in wait_auth_rotating (`issue#19566 `_, `pr#14430 `_, John Spray) * mon: common/config_opts.h: kill mon_pg_create_interval (`pr#13800 `_, xie xingguo) * mon: emit cluster log messages on MDS health changes (`issue#19551 `_, `pr#14398 `_, John Spray) * mon: fix hiding mdsmonitor informative strings (`issue#16709 `_, `pr#13904 `_, John Spray) * mon: fix synchronise pgmap with others (`pr#14418 `_, song baisen, z09440) * mon: mon/MonClient: make get_mon_log_message() atomic (`issue#19427 `_, `pr#14422 `_, Kefu Chai) * mon: mon/Monitor.h: add const to member function (`pr#10412 `_, Michal Jarzabek) * mon: mon/OSDMonitor: check last_osd_report only when the whole cluster is lu… (`pr#14294 `_, Kefu Chai) * mon: mon/OSDMonitor: Clean up: delete extra S signature for plural (`pr#14174 `_, Shinobu Kinjo) * mon: mon/OSDMonitor: spinlock -> std::mutex (`pr#14269 `_, Sage Weil) * mon: mon/OSDMonitor: transit creating_pgs from pgmap when upgrading (`issue#19584 `_, `pr#14551 `_, Kefu Chai) * mon: mon/OSDMonitor: update creating epoch if target osd changed (`issue#19515 `_, `pr#14386 `_, Kefu Chai) * mon: mon/PGMap: make si units more readable in PGMap summary (`pr#14185 `_, liuhong) * mon: osd crush set crushmap need sanity check (`issue#19302 `_, `pr#14029 `_, Loic Dachary) * mon: Tidy up removal of debug mon features (`pr#14467 `_, Brad Hubbard) * mon: update mgrmap when active goes offline (`issue#19407 `_, `pr#14220 `_, John Spray) * msg/async: Postpone bind if network stack is not ready (`pr#14414 `_, Amir Vadai, Haomai Wang) * msg: src/msg/async: Update fix broken compilation for Posix (`pr#14336 `_, Sarit Zubakov) * NVMEDevice: remove unnessary dpdk header file (`pr#14650 `_, optimistyzy) * osd: add "heap \*" admin command (`issue#15475 `_, `pr#13073 `_, Jesse Williamson) * osd: add override in headers files (`pr#13962 `_, liuchang0812) * osd: Cleanup-Updated OSDMap.cc with C++11 style range-for loops (`pr#14381 `_, Jos Collin) * osd: combine unstable stats with info.stats when publish stats to osd (`pr#14060 `_, Mingxin Liu) * osd: fix comments about pg refs and lock (`pr#14279 `_, tang.jin) * osd: fix some osd beacon bugs (`pr#14274 `_, Sage Weil) * osd: kill all remaining MOSDSubOp users (`pr#13401 `_, Sage Weil) * osd: make ec overwrites ready to use (`pr#14496 `_, Josh Durgin) * osd: new op for calculating an extent checksum (`pr#14256 `_, Jason Dillaman) * osd: osd/PrimaryLogPG: nullptr not NULL (`pr#13973 `_, Shinobu Kinjo) * osd: pg-remap -> pg-upmap (`pr#14556 `_, Sage Weil) * osd: print pg_info_t::purged_snaps as array, not string (`issue#18584 `_, `pr#14217 `_, liuchang0812) * qa: krbd_data_pool.sh: account for rbd_info metadata object (`pr#14631 `_, Ilya Dryomov) * qa/objectstore: test bluestore with aggressive compression (`pr#14623 `_, Sage Weil) * qa/tasks: assert on pg status with a timeout (`issue#19594 `_, `pr#14608 `_, Kefu Chai) * qa/workunits/cephtool/test.sh: Be more liberal in testing health-output. (`pr#14614 `_, Willem Jan Withagen) * rbd: correct issues with image importing (`pr#14401 `_, Jason Dillaman) * rbd: error out if import image format failed (`pr#13957 `_, wangzhengyong) * rbd: import-diff should discard any zeroed extents (`pr#14445 `_, Jason Dillaman) * rbd: import real thin-provision image (`issue#15648 `_, `pr#12883 `_, yaoning, Ning Yao) * rbd-mirror: pool watcher should track mirror uuid (`pr#14240 `_, Jason Dillaman) * rbd-mirror: separate ImageReplayer handling from Replayer (`issue#18785 `_, `pr#13803 `_, Mykola Golub) * rbd-nbd: clean up the doc and help information (`pr#14146 `_, Pan Liu) * rbd-nbd: remove debug messages from do_unmap (`pr#14253 `_, Pan Liu) * rbd-nbd: s/cpp_error/cpp_strerror/ to fix FTBFS (`pr#14223 `_, Kefu Chai) * rbd-nbd: support signal handle for SIGHUP, SIGINT and SIGTERM. (`issue#19349 `_, `pr#14079 `_, Pan Liu) * rbd: pybind/rbd: add image metadata methods (`issue#19451 `_, `pr#14463 `_, Mykola Golub) * rbd,tests: qa/workunits: corrected issues with RBD cli test (`pr#14460 `_, Jason Dillaman) * rbd,tests: qa/workunits/rbd: diff.sh failed removing nonexistent file (`pr#14482 `_, Mykola Golub) * rbd,tests: test/librados_test_stub: fixed cls_cxx_map_get_keys/vals return value (`issue#19597 `_, `pr#14484 `_, Jason Dillaman) * rbd,tests: test/rbd_mirror: race in TestMockInstanceWatcher on destroy (`pr#14453 `_, Mykola Golub) * rbd: use min() explicitly (`issue#18938 `_, `pr#14202 `_, Kefu Chai) * rbd: validate pool and snap name optionals (`issue#14535 `_, `pr#13836 `_, Gaurav Kumar Garg) * rbd: warning, ‘devno’ may be used uninitialized in this function (`pr#14271 `_, Jos Collin) * rdma: msg/async/rdma: Debug prints for ibv_* (`pr#14249 `_, Amir Vadai) * rdma: msg/async/rdma: Device::last_poll_dev must be positive (`pr#14250 `_, Amir Vadai) * rdma: msg/async/rdma: Fix small memory leaks detected by valgrind (`pr#14288 `_, Amir Vadai) * rdma: msg/async/rdma: Make poll_blocking() poll for async events in additio… (`pr#14320 `_, Amir Vadai) * rdma: msg/async/rdma: Make port number an attribute of the Connection not o… (`pr#14297 `_, Amir Vadai) * rdma: msg/async/rdma: RDMA-CM, get_device() by ibv_context (`pr#14410 `_, Amir Vadai) * rdma: msg/async/rdma: RDMA-CM, Pass specific ConnMgr info in constructor (`pr#14409 `_, Amir Vadai) * rgw: Added code to correctly account for bytes sent/ received during a 'PUT' operation. (`pr#14042 `_, Pritha Srivastava) * rgw: add --num-zonegroups option for multi test (`pr#14216 `_, lvshuhua) * rgw: add pool namespace to cache's key so that system obj can have unique key (`issue#19372 `_, `pr#14125 `_, Zhang Shaowen) * rgw: add support for multipart upload expiration. (`issue#19088 `_, `pr#13622 `_, Zhang Shaowen) * rgw: add support for the BulkUpload of Swift API (`pr#12243 `_, Radoslaw Zarzynski) * rgw: add the remove-x-delete feature to cancel swift object expiration (`issue#19074 `_, `pr#13621 `_, Jing Wenjun) * rgw: Add --zonegroup-new-name in usage (`pr#12084 `_, Hans van den Bogert) * rgw: allow larger payload for period commit (`issue#19505 `_, `pr#14355 `_, Casey Bodley) * rgw: allow system users to read SLO parts (`issue#19027 `_, `pr#13561 `_, Casey Bodley) * rgw: avoid listing user buckets for rgw_delete_user (`pr#13991 `_, liuchang0812) * rgw: avoid using null pointer in rgw_file.cc (`pr#14474 `_, lihongjie) * rgw: bucket stats display bucket index type (`pr#14466 `_, fang yuxiang) * rgw: clean up the unneeded rgw::io::ChunkingFilter::has_content_length. (`pr#13504 `_, Radoslaw Zarzynski) * rgw: cls/rgw: Clean up the "magic string" usage in the cls layer for RGW. (`pr#12536 `_, Ira Cooper) * rgw: cls/user: cls_user_bucket backward compatibility (`issue#19367 `_, `pr#14128 `_, Yehuda Sadeh) * rgw: cls_user don't clobber existing bucket stats when creating bucket (`issue#16357 `_, `pr#10121 `_, Abhishek Lekshmanan) * rgw: continuation of the auth rework (`pr#12893 `_, Radoslaw Zarzynski, Matt Benjamin) * rgw: delete non-empty buckets in slave zonegroup works not well (`issue#19313 `_, `pr#14043 `_, Zhang Shaowen) * rgw: don't read all user input for a few param requests (`pr#13815 `_, Abhishek Lekshmanan) * rgw: enable to update acl of bucket created in slave zonegroup (`issue#16888 `_, `pr#14082 `_, Guo Zhandong) * rgw: error more verbosely in RGWRados::create_pool (`pr#14642 `_, Matt Benjamin) * rgw: fix build of conflict after auth rework (`pr#14203 `_, Casey Bodley) * rgw: fix configurable write obj window size (`pr#13934 `_, hechuang) * rgw: fix for EINVAL errors on forwarded bucket put_acl requests (`pr#14376 `_, Casey Bodley) * rgw: fix for null version_id in fetch_remote_obj() (`pr#14375 `_, Casey Bodley) * rgw: fix memory leak in delete_obj_aio (`pr#13998 `_, wangzhengyong) * rgw: fix memory leak in RGWGetObjLayout (`pr#14014 `_, liuchang0812) * rgw: fix response header of Swift API (`issue#19443 `_, `pr#14280 `_, tone-zhang) * rgw: fix slave zonegroup cannot enable the bucket versioning (`issue#18003 `_, `pr#12175 `_, lvshuhua) * rgw: fix uninitialized fields (`pr#14120 `_, wangzhengyong) * rgw: LCWorker's worktime is not the same as config rgw_lifecycle_work_time. (`issue#18087 `_, `pr#11963 `_, Zhang Shaowen) * rgw: multisite enabled over multiple clusters (`pr#12535 `_, Ali Maredia) * rgw: multisite: fixes for zonegroup redirect (`issue#19488 `_, `pr#14319 `_, Casey Bodley) * rgw: radosgw-admin: use zone id when creating a zone (`issue#19498 `_, `pr#14340 `_, Orit Wasserman) * rgw: Removed Unwanted headers (`pr#14183 `_, Jos Collin) * rgw: remove duplicate flush formatter (`pr#12437 `_, Guo Zhandong) * rgw: remove extra RGWMPObj in rgw_multi.h (`pr#14619 `_, Casey Bodley) * rgw: rgw_file: fix missing unlock in unlink (`issue#19435 `_, `pr#14262 `_, Gui Hecheng) * rgw: rgw_file: fix non-posix errcode EINVAL to ENAMETOOLONG (`pr#13764 `_, Gui Hecheng) * rgw: rgw_file: fix readdir after dirent-change (`issue#19634 `_, `pr#14561 `_, Matt Benjamin) * rgw: rgw_file: pre-compute unix attrs in write_finish() (`issue#19653 `_, `pr#14609 `_, Matt Benjamin) * rgw: rgw_file: support readdir cb type hints (plus fixes) (`issue#19623 `_, `issue#19625 `_, `issue#19624 `_, `pr#14458 `_, Matt Benjamin) * rgw: rgw_lc: drop a bunch of unused headers (`pr#14342 `_, Abhishek Lekshmanan) * rgw: rgw/lifecycle: do not send lifecycle rules when GetLifeCycle failed (`issue#19363 `_, `pr#14160 `_, liuchang0812) * rgw: rgw_op: remove unused variable iter (`pr#14276 `_, Weibing Zhang) * rgw: s3 server-side encryption (SSE-C, SSE-KMS) (`pr#11049 `_, Adam Kupczyk, Casey Bodley, Radoslaw Zarzynski) * rgw: stat requests skip compression, manifest handling, etc (`pr#14109 `_, Casey Bodley) * rgw: switch from "timegm()" to "internal_timegm()" for better portability (`issue#12863 `_, `pr#14327 `_, Rishabh Kumar) * rgw: switch to std::array in RGWBulkUploadOp due to C++11 and FreeBSD. (`pr#14314 `_, Radoslaw Zarzynski) * rgw,tests: qa/rgw: add configuration for server-side encryption tests (`pr#13597 `_, Casey Bodley) * rgw,tests: test/rgw: test_bucket_delete_notempty in test_multi.py (`pr#14090 `_, Casey Bodley) * rgw: update is_truncated in function rgw_read_user_buckets (`issue#19365 `_, `pr#14343 `_, liuchang0812) * rgw: verified f23 (`pr#14553 `_, Gui Hecheng) * rgw: version id doesn't work in fetch_remote_obj (`pr#14010 `_, Zhang Shaowen) * rgw: warning, output may be truncated before the last format character (`pr#14194 `_, Jos Collin) * src/ceph-disk/ceph_disk/main.py: Make 'ceph-disk list' work on FreeBSD (`pr#14483 `_, Willem Jan Withagen) * test/osd/osd-dup.sh: warn on low open file limit (`pr#14637 `_, Piotr Dałek) * test: rbd master/slave notify test should test active features (`issue#19692 `_, `pr#14638 `_, Jason Dillaman) * test/rgw: refactor test_multi.py for use in qa suite (`pr#14433 `_, Casey Bodley) * tests: add MGR=1 so 'pg dump' won't be blocked (`pr#14266 `_, Kefu Chai) * tests: ceph_objectstore_tool.py: kill all daemons (`pr#14428 `_, Kefu Chai) * tests: qa: Add reboot case for systemd test (`pr#14229 `_, Vasu Kulkarni) * tests: qa/suites: drop 'fs' facet, and add 'objectstore' facet where missing (`pr#14198 `_, Sage Weil) * tests: qa/tasks: use sudo to check ceph health for systemd test (`pr#14464 `_, Vasu Kulkarni) * tests: qa/tasks/workunit.py: use "overrides" as the default settings of workunit (`issue#19429 `_, `pr#14281 `_, Kefu Chai) * tests: qa/workunits/ceph-helpers: display rejected string (`pr#14468 `_, Kefu Chai) * tests: tasks/workunit.py: when cloning, use --depth=1 (`pr#14214 `_, Dan Mick) * tests: test: add explicit braces to avoid ambiguous ‘else’ and to silence warnings (`pr#14472 `_, Jos Collin) * tests: test: objectstore: chain_xattr: fix wrong memset usage to fill buf (`pr#14277 `_, Weibing Zhang) * tests: Thrasher: handle "OSD has the store locked" gracefully (`issue#19556 `_, `pr#14415 `_, Nathan Cutler) * tests: workunit: request branch when cloning (`pr#14260 `_, Kefu Chai, Dan Mick) * tools: ceph-disk: Add fix subcommand (`pr#13310 `_, Boris Ranto) * tools: ceph-release-notes: prefixes and pep8 compliance (`pr#14156 `_, Nathan Cutler) * tools: ceph-release-notes: strip trailing punctuation (`pr#14385 `_, Nathan Cutler) * tools: stop.sh: boilerplate error (don't stop mon when stopping mgr) (`pr#14461 `_, Dan Mick) * tools: warning, ‘%.16x’ directive output truncated writing 16 bytes into a region of size 9. (`pr#14292 `_, Jos Collin) v12.0.1 Luminous (dev) ====================== This is the second development checkpoint release of Luminous, the next long term stable release. Major changes from 12.0.0 ------------------------- * The original librados rados_objects_list_open (C) and objects_begin (C++) object listing API, deprecated in Hammer, has finally been removed. Users of this interface must update their software to use either the rados_nobjects_list_open (C) and nobjects_begin (C++) API or the new rados_object_list_begin (C) and object_list_begin (C++) API before updating the client-side librados library to Luminous. Object enumeration (via any API) with the latest librados version and pre-Hammer OSDs is no longer supported. Note that no in-tree Ceph services rely on object enumeration via the deprecated APIs, so only external librados users might be affected. The newest (and recommended) rados_object_list_begin (C) and object_list_begin (C++) API is only usable on clusters with the SORTBITWISE flag enabled (Jewel and later). (Note that this flag is required to be set before upgrading beyond Jewel.) * CephFS clients without the 'p' flag in their authentication capability string will no longer be able to set quotas or any layout fields. This flag previously only restricted modification of the pool and namespace fields in layouts. * The rados copy-get-classic operation has been removed since it has not been used by the OSD since before hammer. It is unlikely any librados user is using this operation explicitly since there is also the more modern copy-get. * The RGW api for getting object torrent has changed its params from 'get_torrent' to 'torrent' so that it can be compatible with Amazon S3. Now the request for object torrent is like 'GET /ObjectName?torrent'. Other Notable Changes --------------------- * auth: 'ceph auth import -i' overwrites caps, if caps are not specified (`issue#18932 `_, `pr#13468 `_, Vikhyat Umrao) * BitAllocator: delete useless codes (`pr#13619 `_, Jie Wang) * Bluestore, NVMEDevice: add the spdk core mask check (`pr#14068 `_, optimistyzy) * Bluestore, NVMeDevice: fix the core id for rte_remote_launch (`pr#13896 `_, optimistyzy) * bluestore,NVMEDevice: minor error for get slave core (`pr#14012 `_, Ziye Yang) * bluestore, NVMEDEVICE: Specify the max io completion in conf (`pr#13799 `_, optimistyzy) * brag: count the number of mds in fsmap not in mdsmap (`issue#19192 `_, `pr#13798 `_, Peng Zhang) * build/ops: add psmisc dependency to ceph-base (deb and rpm) (`issue#19129 `_, `pr#13744 `_, Nathan Cutler) * build/ops: make package groups comply with openSUSE guidelines (`issue#19184 `_, `pr#13781 `_, Nathan Cutler) * buildpackages: remove because it does not belong (`issue#18846 `_, `pr#13297 `_, Loic Dachary) * build: remove ceph-qa-suite directory (`pr#13880 `_, Casey Bodley) * ceph_common.sh: fix syntax error (`issue#17826 `_, `pr#13419 `_, Dan Mick) * ceph-create-keys: Add connection timeouts. (`pr#11995 `_, Owen Synge) * ceph-dencoder: enable bluestore types (`pr#13595 `_, Willem Jan Withagen, Kefu Chai) * ceph-detect-init: Add docker detection (`pr#13218 `_, Guillaume Abrioux) * ceph-detect-init: Adds Oracle Linux Server and Oracle VM Server detect (`pr#13917 `_, Nikita Gerasimov) * ceph-disk: ability to use a different cluster name with dmcrypt (`issue#17821 `_, `pr#11786 `_, Sébastien Han, Erwan Velu) * ceph-disk: dmcrypt activate must use the same cluster as prepare (`issue#17821 `_, `pr#13573 `_, Loic Dachary) * ceph-disk: Fix getting wrong group name when --setgroup in bluestore (`issue#18955 `_, `pr#13457 `_, craigchi) * ceph-disk: FreeBSD changes to get it working and passing tests (`pr#12086 `_, Willem Jan Withagen) * ceph-disk: Reporting /sys directory in get_partition_dev() (`pr#14080 `_, Erwan Velu) * ceph-disk: Revert "Revert "change get_dmcrypt_key test to support different cluster name"" (`pr#13600 `_, Loic Dachary) * ceph-disk/tests: Certain partition types do not work on FreeBSD (`pr#13560 `_, Willem Jan Withagen) * ceph: do not throw TypeError on connection failure (`pr#13268 `_, Kefu Chai) * ceph_fuse: fix daemonization when pid file is non-empty (`pr#13532 `_, "Yan, Zheng") * ceph.in: add help for locally-handled commands (`pr#13288 `_, Dan Mick) * ceph.in: Fix couple of minor issues on the messages (`pr#12797 `_, Dave Chen) * ceph-osd: --flush-journal: sporadic segfaults on exit (`issue#18820 `_, `pr#13311 `_, Alexey Sheplyakov) * ceph.spec.in, debian/control: Add bc to build dependencies (`issue#18876 `_, `pr#13338 `_, Kyr Shatskyy) * ceph_test_objectstore: tolerate fsck EOPNOTSUPP too (`pr#13325 `_, Sage Weil) * ceph_test_rados_api_misc: avoid livelock from PoolCreationRace (`pr#13565 `_, Sage Weil) * cleanup: rbd: fix a typo in comment (`pr#14049 `_, Dongsheng Yang) * cleanup: resolve compiler warnings (`pr#13236 `_, Adam C. Emerson) * client: priority to verify the correctness of the "flag" (`pr#12897 `_, huanwen ren) * client: remove dead log code (`pr#13093 `_, Patrick Donnelly) * client: wait for lastest osdmap when handling set file/dir layout (`issue#18914 `_, `pr#13580 `_, "Yan, Zheng") * cmake: link consumers of libclient with libcommon (`issue#18838 `_, `pr#13394 `_, Kefu Chai) * CMakeLists.txt: suppress unneeded warning about jemalloc (`pr#13377 `_, Willem Jan Withagen) * common: add override in common and misc (`issue#18922 `_, `pr#13443 `_, liuchang0812) * common: add override in header file (`pr#13774 `_, liuchang0812) * common: add override in msg subsystem (`pr#13771 `_, liuchang0812) * common/auth: add override in headers (`pr#13692 `_, liuchang0812) * common/config_opts.h: Remove deprecated osd_compact_leveldb_on_mount option (`issue#19318 `_, `pr#14059 `_, Vikhyat Umrao) * common/config_opts: improve rdma buffer size to 128k (`pr#13510 `_, Haomai Wang) * common/EventTrace: fix compiler warning. (`pr#13659 `_, Jianpeng Ma) * common: Fix clang compilation (`pr#13335 `_, Bartłomiej Święcki) * common: fix lockdep vs recursive mutexes (`pr#9940 `_, Adam Kupczyk) * common: fix that $host always expands to localhost instead of actual hostname (`issue#11081 `_, `pr#12998 `_, liuchang0812) * common: include/ceph_features.h uses uint64_t, which is in sys/types.h (`pr#13339 `_, Willem Jan Withagen) * common/MemoryModel: Bump int to long and drop mallinfo (`pr#13453 `_, Xiaoxi Chen) * common, osd, tools: Add histograms to performance counters (`pr#12829 `_, Bartłomiej Święcki) * common/perf_counters.: Remove unnecessary judgment (`pr#10407 `_, zhang.zezhu) * common: possible lockdep false alarm for ThreadPool lock (`issue#18819 `_, `pr#13258 `_, Mykola Golub) * common: remove \n on clog messages (`pr#13794 `_, Sage Weil) * common/TracepointProvider: add assert if dlopen error. (`pr#13430 `_, Jianpeng Ma) * common/TrackedOp: make TrackedOp::reset_desc() safe (`issue#19110 `_, `pr#13702 `_, Sage Weil) * common: use ref to avoid unnecessary memory copy (`issue#19107 `_, `pr#13689 `_, liuchang0812) * compressor: fix Mutex::Locker used is not correct (`pr#13935 `_, hechuang) * compressor/zstd: add zstd to embedded ceph (`pr#13159 `_, Bassam Tabbara) * crc32c: optimize aarch64 crc32c implementation (`pr#12977 `_, wei xiao) * crush: add devices class that rules can use as a filter (`issue#18943 `_, `pr#13444 `_, Loic Dachary) * crush: add --dump to crushtool (`pr#13726 `_, Loic Dachary) * crush: allow uniform buckets with no items (`pr#13521 `_, Loic Dachary) * crush: document tunables and rule step set_ (`pr#13722 `_, Loic Dachary) * crush: do is_out test only if we do not collide (`pr#13326 `_, xie xingguo) * crush: fix dprintk compilation (`pr#13424 `_, Loic Dachary) * debian: Add missing tp files in deb packaging (`pr#13526 `_, Ganesh Mahalingam) * debian/control: add ceph-base-dbg (`pr#13796 `_, Sage Weil) * debian: drop boost build dependencies (`pr#13524 `_, Kefu Chai) * doc: add changelog for v0.94.10 (`pr#13572 `_, Abhishek Lekshmanan) * doc: add changelog for v10.2.6 Jewel release (`pr#13839 `_, Abhishek Lekshmanan) * doc: add optional argument for build-doc (`pr#14058 `_, Kefu Chai) * doc: clarify that "ms bind ipv6" disables IPv4 (`pr#13317 `_, Ken Dreyer) * doc: crush: API documentation fixes (`pr#13589 `_, Loic Dachary) * doc: crush typo in algorithm description (`pr#13661 `_, Loic Dachary) * doc: deletes duplicated word and clarifies an example (`pr#13746 `_, Tahia Khan) * doc: document `osd-agent-{max,low}-ops` options (`pr#13648 `_, Kefu Chai) * doc: fix a typo (`pr#13930 `_, Drunkard Zhang) * doc: fixes a broken hyperlink to RADOS paper in architecture (`pr#13682 `_, Tahia Khan) * doc: Fixes a typo. (`pr#13985 `_, Edwin F. Boza) * doc: Fixes parameter name in rbd configuration on openstack havana/icehouse (`issue#17978 `_, `pr#13403 `_, Michael Eischer) * doc: fixes to silence sphinx-build (`pr#13997 `_, Kefu Chai) * doc: fix the links to http://ceph.com/docs (`issue#19090 `_, `pr#13976 `_, Kefu Chai) * doc: Fix typo and grammar in RGW config reference (`pr#13356 `_, Ruben Kerkhof) * doc: fix typos in radosgw-admin usage (`pr#13936 `_, Enming Zhang) * doc: hammer 0.94.10 release notes (`pr#13152 `_, Nathan Cutler) * doc: Have install put manpages in the FreeBSD correct location (`pr#13301 `_, Willem Jan Withagen) * doc: improve firewalld instructions (`pr#13360 `_, Ken Dreyer) * doc: instructions and guidance for multimds (`issue#19135 `_, `pr#13830 `_, John Spray) * doc: instructions for provisioning OpenStack VMs ad hoc (`pr#13368 `_, Nathan Cutler) * doc: Jewel 10.2.6 release notes (`pr#13835 `_, Abhishek Lekshmanan) * doc: kernel client os-recommendations update (`pr#13369 `_, John Spray, Ilya Dryomov) * doc: mention ENXIO change in the 10.2.6 release notes (`pr#13878 `_, Nathan Cutler) * doc: minor changes in fuse client config reference (`pr#13065 `_, Barbora Ančincová) * doc: misc minor fixes (`pr#13713 `_, Drunkard Zhang) * doc: Modify Configuring Cinder section (`issue#18840 `_, `pr#13400 `_, Shinobu Kinjo) * doc/release-notes: avoid 'production-ready' in describing kraken (`pr#13675 `_, Sage Weil) * doc: releases update the luminous, hammer, jewel release dates (`pr#13584 `_, Abhishek Lekshmanan) * doc: remove duplicated references (`pr#13396 `_, Kefu Chai) * doc: remove mentions about mon_osd_min_down_reports (`issue#19016 `_, `pr#13558 `_, Barbora Ančincová) * doc: rgw: make a note abt system users vs normal users (`issue#18889 `_, `pr#13461 `_, Abhishek Lekshmanan) * docs: Fix problems with example code (`pr#14007 `_, Brad Hubbard) * doc: Update adminops.rst (`pr#13893 `_, Chu, Hua-Rong) * doc: Update CephFS disaster recovery documentation (`pr#12370 `_, Wido den Hollander) * doc: update links to point to ceph/qa instead of ceph-qa-suite (`pr#13397 `_, Jan Fajerski, Nathan Cutler) * doc: Update sample.ceph.conf (`pr#13751 `_, Saumay Agrawal) * doc: update to new ceph fs commands (`pr#13346 `_, Patrick Donnelly) * doc: v12.0.0 release notes (`pr#13281 `_, Abhishek Lekshmanan) * do_freebsd.sh: Remove ENODATA requirement (`pr#13626 `_, Willem Jan Withagen) * fix upgrate from hammer when zone doesn't have zoneparams (`issue#19231 `_, `pr#13900 `_, Orit Wasserman) * include/denc, kv: silence gcc warnings (`pr#13458 `_, Kefu Chai) * include/mempool.h: fix Clangs complaint about types (`pr#13523 `_, Willem Jan Withagen) * init-ceph: fix ceph user args (`pr#13467 `_, Sage Weil) * init-ceph: Make init-ceph work under FreeBSD for init-system (`pr#13209 `_, Willem Jan Withagen) * init-ceph: print trailing n in "status" output (`pr#13351 `_, Kefu Chai) * librados: add override for librados (`issue#18922 `_, `pr#13442 `_, liuchang0812) * librados: add override in headers (`pr#13775 `_, liuchang0812) * librados: do not expose non-public symbols (`pr#13265 `_, Kefu Chai) * librados, osd: clang fixes (`pr#13768 `_, Kefu Chai) * librados: remove legacy object listing API, clean up newer api (`pr#13149 `_, Sage Weil) * librados: replace the var name from onack to complete (`pr#13857 `_, Pan Liu) * librados: use cursor for nobjects listing (`pr#13323 `_, Yehuda Sadeh, Sage Weil) * librbd: add override keyword in header files (`issue#19012 `_, `pr#13536 `_, liuchang0812) * librbd: add SnapshotNamespace to ImageCtx (`pr#12970 `_, Victor Denisov) * librbd: add writesame API (`pr#12645 `_, Mingxin Liu, Gui Hecheng) * librbd: asynchronous clone state machine (`pr#12041 `_, Dongsheng Yang) * librbd: asynchronous image removal state machine (`pr#12102 `_, Dongsheng Yang, Venky Shankar) * librbd: eliminate compiler warnings (`pr#13729 `_, Jason Dillaman) * librbd: fix clang compilation error (`issue#19260 `_, `pr#13926 `_, Mykola Golub) * librbd: fix segfault on EOPNOTSUPP returned while fetching snapshot timestamp (`issue#18839 `_, `pr#13287 `_, Gui Hecheng) * librbd: fix valid coverity warnings (`pr#14023 `_, Jason Dillaman) * librbd: image create validates that pool supports overwrites (`issue#19081 `_, `pr#13986 `_, Jason Dillaman) * librbd: image-extent cache needs to clip out-of-bounds read buffers (`pr#13679 `_, Jason Dillaman) * librbd: Include WorkQueue.h since we use it (`issue#18862 `_, `pr#13322 `_, Boris Ranto) * librbd: rbd ack cleanup (`pr#13791 `_, runsisi) * librbd: relax "is parent mirrored" check when enabling mirroring for pool (`issue#19130 `_, `pr#13752 `_, Mykola Golub) * librbd: scatter/gather support for the C API (`issue#13025 `_, `pr#13447 `_, Jason Dillaman) * librbd: simplify image open/close semantics (`pr#13701 `_, Jason Dillaman) * librbd: use 'override' keyword instead of 'virtual' (`issue#18922 `_, `pr#13437 `_, liuchang0812) * mailmap: fix Zhao Chao affiliation. (`pr#13413 `_, Zhao Chao) * mailmap: Liu Yang affiliation (`pr#13427 `_, LiuYang) * mailmap updates (`pr#13309 `_, Loic Dachary) * mailmap: Willem Jan Withagen affiliation (`pr#13034 `_, Willem Jan Withagen) * make-srpm: Pass first parameter to make-dist for building SRPM (`pr#13480 `_, Wido den Hollander) * man/8/ceph-disk: fix formatting (`pr#13969 `_, Kefu Chai) * mds: #11950: Persistent purge queue (`issue#11950 `_, `pr#12786 `_, John Spray) * mds: #18600: Clear out tasks that don't make sense from multimds suite (`issue#18600 `_, `pr#13089 `_, John Spray) * mds: Add multimds:thrash sub-suite and fix bugs in thrasher for multimds (`issue#18690 `_, `issue#10792 `_, `pr#13262 `_, Patrick Donnelly) * mds: add override in headers (`pr#13691 `_, liuchang0812) * mds: add override in mds subsystem (`issue#18922 `_, `pr#13438 `_, liuchang0812) * mds: automate MDS object count tracking (`pr#13591 `_, Patrick Donnelly) * mds: extend 'p' auth cap to cover all vxattr stuff (`issue#19075 `_, `pr#13628 `_, John Spray) * mds: fix handling very fast delete ops (`issue#19245 `_, `pr#13899 `_, John Spray) * mds: fix IO error handling in SessionMap (`pr#13464 `_, John Spray) * mds: ignore fs full check for CEPH_MDS_OP_SETFILELOCK (`issue#18953 `_, `pr#13455 `_, "Yan, Zheng") * mds/MDLog.cc Fix perf counter type for jlat (`pr#13449 `_, Xiaoxi Chen) * mds: misc multimds fixes (`issue#18717 `_, `issue#18754 `_, `pr#13227 `_, "Yan, Zheng") * mds: print rank as int (`issue#19201 `_, `pr#13816 `_, Patrick Donnelly) * mds: remove some redundant object counters (`pr#13704 `_, Patrick Donnelly) * mds: replace C_VoidFn in MDSDaemon with lambdas (`pr#13465 `_, John Spray) * mds/StrayManager: avoid reusing deleted inode in StrayManager::_purge_stray_logged (`issue#18877 `_, `pr#13347 `_, Zhi Zhang) * mds: try to avoid false positive heartbeat timeouts (`issue#19118 `_, `pr#13807 `_, John Spray) * messages/MOSDOp: fix pg_t decoding for version <7 decoding (`issue#19005 `_, `pr#13537 `_, Sage Weil) * mgr: add a get_version to the python interface (`pr#13669 `_, John Spray) * mgr: add override in headers (`pr#13772 `_, liuchang0812) * mgr: add override in mgr subsystem (`issue#18922 `_, `pr#13436 `_, liuchang0812) * mgr/MgrClient: use unique_ptr for MgrClient::session (`issue#19097 `_, `pr#13685 `_, Kefu Chai) * mgr: use unique_ptr for MgrStandby::active_mgr (`pr#13667 `_, John Spray) * misc: SCA and Coverity Fixes (`pr#13208 `_, Danny Al-Gaaf) * mon: add override in headers (`pr#13693 `_, liuchang0812) * mon: add override in mon subsystem (`issue#18922 `_, `pr#13440 `_, liuchang0812) * mon: add warn info for osds were removed from osdmap but still kept in crushmap (`pr#12273 `_, song baisen) * mon: avoid start election twice when quorum enter (`pr#10150 `_, song baisen) * mon: crush straw_calc_version value is 0 or 1 not 0 to 2. (`pr#13554 `_, song baisen) * mon: detect existing fs and duplicate name earlier (`issue#18964 `_, `pr#13471 `_, Patrick Donnelly) * mon: drop useless assignment statements (`pr#13958 `_, wangzhengyong) * mon: enable luminous monmap feature on full quorum (`pr#13379 `_, Joao Eduardo Luis) * mon: fix force_pg_create pg stuck in creating bug (`issue#18298 `_, `pr#12539 `_, Sage Weil) * mon: in output of "ceph osd df tree", display "-", not "0", for pg amount of a bucket (`pr#13015 `_, Chuanhong Hong) * mon/MDSMonitor: remove create_new_fs from header (`pr#14019 `_, Henrik Korkuc) * mon/MonClient: discard stray messages from non-acitve conns (`issue#19015 `_, `pr#13656 `_, Kefu Chai) * mon/MonClient: don't return zero global_id (`issue#19134 `_, `pr#13853 `_, "Yan, Zheng", Kefu Chai) * mon/MonClient: hunt monitors in parallel (`issue#16091 `_, `pr#11128 `_, Steven Dieffenbach, Kefu Chai) * mon/MonClient: persist global_id across re-connecting (`issue#18968 `_, `pr#13550 `_, Kefu Chai) * mon/MonClient: random all ranks then pick first_n (`pr#13479 `_, Mingxin Liu) * mon,osd: luminous feature bits, require flags, upgrade gates (`pr#13278 `_, Sage Weil) * mon: OSDMonitor add check only concern our self cluster command (`pr#10309 `_, song baisen) * mon/OSDMonitor: fix dividing by zero in OSDUtilizationDumper (`pr#13531 `_, Mingxin Liu) * mon/OSDMonitor: make mapping job behave if mon_osd_prime_pg_temp = false (`issue#19020 `_, `pr#13574 `_, Sage Weil) * mon/OSDMonitor: remove trivial PGMap dependency for 'osd primary-temp' command (`pr#13616 `_, Sage Weil) * mon/OSDMonitor: some cleanup for reweight-by-pg (`pr#13462 `_, Haodong Tang) * mon,osd: new mechanism for managing full and nearfull OSDs for luminous (`pr#13615 `_, Sage Weil) * mon/PGMap: factor mon_osd_full_ratio into MAX AVAIL calc (`issue#18522 `_, `pr#12923 `_, Sage Weil) * mon: PGMonitor add check only concern our self cluster command (`pr#9976 `_, song baisen) * mon/PGMonitor: rm nonused function (`pr#14033 `_, Wei Jin) * mon: refactor MDSMonitor command handling (`pr#13581 `_, John Spray) * mon: remove the redudant jugement in paxosservice is_writeable function (`pr#10240 `_, song baisen) * mon: remove unnecessary function declaration (`pr#13762 `_, liuchang0812) * mon: restructure prime_pg_temp around a full pg mapping calculated on multiple CPUs (`pr#13207 `_, Sage Weil) * mon: smooth io/recovery stats over longer period (`pr#13249 `_, Sage Weil) * mon: Update OSDMon.cc comments (`pr#13750 `_, Saumay Agrawal) * msg/async: avoid lossy connection sending ack message (`pr#13700 `_, Haomai Wang) * msg/async: cleanup code. (`pr#13304 `_, Jianpeng Ma) * msg/async: fix crash that writing char to nonblock-fd gets EAGAIN in EventCenter::wakeup (`pr#13822 `_, liuchang0812) * msg/AsyncMessenger: remove unused method (`pr#10125 `_, Michal Jarzabek) * msg/async/rdma: add log to show correct destruct queuepair (`pr#13412 `_, Haomai Wang) * msg/async/rdma: add perf counters to RDMA backend (`pr#13484 `_, Haomai Wang) * msg/async/rdma: check if exp verbs avail (`pr#13391 `_, Oren Duer, Adir Lev) * msg/async/rdma: check ulimit (`pr#13655 `_, Sarit Zubakov, Adir Lev) * msg/async/rdma: cleanup (`pr#13509 `_, Haomai Wang) * msg/async/rdma: compile with rdma as default (`pr#13901 `_, DanielBar-On) * msg/async/rdma: destroy QueuePair if needed (`pr#13810 `_, Haomai Wang) * msg/async/rdma: don't need to delete event when tcp connection isn't … (`pr#13528 `_, Haomai Wang) * msg/async/rdma: Fix broken compilation (`pr#13603 `_, Sarit Zubakov) * msg/async/rdma: fix outstanding queuepair when destruct RDMAStack (`pr#13905 `_, Haomai Wang) * msg/async/rdma: fix potential racing connection usage (`pr#13738 `_, Haomai Wang) * msg/async/rdma: Introduce Device.{cc,h} (`pr#14001 `_, Amir Vadai) * msg/async/rdma: make Infiniband can be forkable (`pr#13525 `_, Haomai Wang) * msg/async/rdma: move active_queue_pairs perf counter dec to polling (`pr#13716 `_, DanielBar-On) * msg/async/rdma: Print error only on ENOMEM (`pr#13538 `_, Sarit Zubakov) * msg/async/rdma: refactor tx handle flow to get rid of locks (`pr#13680 `_, Haomai Wang) * msg/async/rdma: rename chunk_size to buffer_size (`pr#13666 `_, Adir Lev) * msg/async/rdma: update destructor message (`pr#13539 `_, Sarit Zubakov) * msg/async/rdma: zero wqe inline (`pr#13392 `_, Adir Lev) * msg/async/rdm: fix leak when existing failure in ip network (`pr#13435 `_, Haomai Wang) * msg/async: remove useless close function (`pr#13286 `_, liuchang0812) * msg/async: set thread name for msgr worker (`pr#13699 `_, Haomai Wang) * msg/async/Stack.cc: use of pthread_setname_np() needs compat.h (`pr#13825 `_, Willem Jan Withagen) * msg/async: support IPv6 QoS. (`issue#18887 `_, `issue#18928 `_, `pr#13418 `_, Robin H. Johnson) * msg: end parameter in entity_addr_t::parse is optional (`pr#13650 `_, Mykola Golub) * msg: Fix calls to Messenger::create with new parameter (`pr#13329 `_, Sarit Zubakov) * msg, messages: coverity fixes (`pr#13473 `_, Kefu Chai) * msg: no need to pass supported features to Messenger::Policy ctor (`pr#13785 `_, Sage Weil) * msg/simple: fix missing unlock when already bind (`pr#13267 `_, Haomai Wang) * msg/simple/Pipe: support IPv6 QoS. (`issue#18887 `_, `pr#13370 `_, Robin H. Johnson) * msg/simple/Pipe:the returned value for do_recv unequal to zero (`pr#10272 `_, zhang.zezhu) * os/aio: remove the redundant memset(struct iocb) (`pr#13662 `_, Jianpeng Ma) * os/blestore/NVMEDevice: fix the I/O logic for read (`pr#13971 `_, optimistyzy) * os/bluestore: add bluestore_prefer_wal_size option (`pr#13217 `_, Sage Weil) * os/bluestore: add flush_store_cache cmd (`pr#13428 `_, xie xingguo) * os/bluestore: add more perf_counters to BlueStore (`pr#13274 `_, Igor Fedotov) * os/bluestore: add new garbage collector (`pr#12144 `_, Igor Fedotov) * os/bluestore: add "_" prefix for internal methods (`pr#13409 `_, xie xingguo) * os/bluestore/Allocator: drop unused return value in release function (`pr#13913 `_, wangzhengyong) * os/bluestore/BitAllocator: fix bug of checking required blocks (`pr#13470 `_, wangzhengyong) * os/bluestore/BitMapAllocator: rm unused variable (`pr#13599 `_, Jie Wang) * os/bluestore/BlueStore.cc:remove unuse code in _open_bdev() (`pr#13553 `_, yonghengdexin735) * os/bluestore: cleanup, got rid of table reference of 1<`_, Adam Kupczyk) * os/bluestore: default 16KB min_alloc_size on ssd (`pr#14076 `_, Sage Weil) * os/bluestore: do not use nullptr to calc the size of bluestore_pextent_t (`pr#14030 `_, Kefu Chai) * os/bluestore: enable SSE-assisted CRC32 calculations in RocksDB (`pr#13741 `_, Radoslaw Zarzynski) * os/bluestore: fix a bug in small write handling on sharded extents (`pr#13728 `_, Igor Fedotov) * os/bluestore: fix bug in aio_read() (`pr#13511 `_, tangwenjun) * os/bluestore: fix bug in _open_super_meta() (`pr#13559 `_, Taeksang Kim) * os/bluestore: fix bugs in bluefs and bdev flush (`issue#19250 `_, `issue#19251 `_, `pr#13911 `_, Sage Weil) * os/bluestore: fix coredump in register_ctrlr() (`pr#13556 `_, tangwenjun) * os/bluestore: fix deferred writes; improve flush (`pr#13888 `_, Sage Weil) * os/bluestore: fix wal-queue bytes-counter to keep pace with others (`pr#13382 `_, xie xingguo) * os/bluestore: leverage the type knowledge in BitMapAreaLeaf. (`pr#13736 `_, Radoslaw Zarzynski) * os/bluestore: "noid" is not always necessary in clone op (`pr#13769 `_, wangzhengyong) * os/bluestore: partial reshard support (`pr#13162 `_, Sage Weil) * os/bluestore: remove CephContext* from BmapEntry. (`pr#13651 `_, Radoslaw Zarzynski) * os/bluestore: remove unneeded indirection in BitMapZone. (`pr#13743 `_, Radoslaw Zarzynski) * os/bluestore: remove unused local variable "pos" (`pr#13715 `_, wangzhengyong) * os/bluestore rm unused variable in aio_read() (`pr#13530 `_, tangwenjun) * os/bluestore: silence gcc warning (`pr#14028 `_, Kefu Chai) * os/bluestore: some cleanup (`pr#13390 `_, liuchang0812) * os/bluestore,test/ceph_test_objectstore: silence gcc warnings (`pr#13924 `_, Kefu Chai) * os/bluestore: the exhausted check in BitMapZone can be lock-less. (`pr#13653 `_, Radoslaw Zarzynski) * osd: adding PerfCounters for backoff throttle (`pr#13017 `_, Chuanhong Wang) * osd: add is_split check before _start_split (`pr#13307 `_, song baisen) * osd: add override in osd subsystem (`issue#18922 `_, `pr#13439 `_, liuchang0812) * osd: add snap trim reservation and re-implement osd_snap_trim_sleep (`pr#13594 `_, Samuel Just) * osdc: clean up osd_command/start_mon_command interfaces (`pr#13727 `_, John Spray) * osdc: fix osdc_osd_seesion perf counter. (`pr#13478 `_, Xiaoxi Chen) * osd: change a few messages at level 0 and 1; change default level to 1/5 (`pr#13407 `_, Sage Weil) * osd: clarify REQUIRE_LUMINOUS error message (`pr#13363 `_, Josh Durgin) * osdc/Objecter: fix bugs in explicit naming of op spg_t (`pr#13534 `_, Sage Weil) * osdc/Objecter: fix possible OSDSession leak on wrong connection (`pr#13365 `_, xie xingguo) * osdc/Objecter: resend RWORDERED ops on full (`issue#19133 `_, `pr#13759 `_, Sage Weil) * osd: constify OpRequest::get_req(); fix a few cases of operator<< vs mutated message races (`pr#13545 `_, Sage Weil) * osd: correct epoch setting of osd boot msg (`pr#12623 `_, Mingxin Liu) * osd: correct the func name in execute_ctx() log messages (`pr#13582 `_, Gu Zhongyan) * osd: debug con in ms_handle_connect (`pr#13540 `_, Sage Weil) * osd/: don't leak context for Blessed*Context or RecoveryQueueAsync (`issue#18809 `_, `pr#13342 `_, Samuel Just) * osd: don't share osdmap with objecter when preboot (`issue#15025 `_, `pr#13946 `_, Mingxin Liu) * osd: don't use ORDERSNAP for flush; always request/send ondisk ack (`issue#18961 `_, `pr#13570 `_, Samuel Just) * OSD: drop parameter t from merge_log() (`pr#13923 `_, xie xingguo) * osd: drop support for listing objects at a given snap (`pr#13398 `_, Sage Weil) * osd/ECBackend: drop duplicated pending_commit field from << operator (`pr#13665 `_, xie xingguo) * osd: fall back to failsafe threshold if osdmap doesn't set [near]full (`pr#14004 `_, Sage Weil) * osd: faster dispatch (`pr#13343 `_, Sage Weil) * osd: fix a signed/unsigned warning in PG (`pr#13922 `_, Greg Farnum) * osd: fix func name in log produced by handle_pg_peering_evt() (`pr#13801 `_, xie xingguo) * osd: fix stat sum update of recovery pushing (`pr#13328 `_, Zhiqiang Wang) * osd: fix the setting of soid in sub_op_push (`pr#13353 `_, Zhiqiang Wang) * osd: have clients resend ops on pg split (`pr#13235 `_, Sage Weil) * osd: kill sortbitwise (`pr#13321 `_, Sage Weil) * osd,osdc: pg and osd-based backoff (`pr#12342 `_, Sage Weil) * osd: osd discussion docs (`pr#13344 `_, Greg Farnum) * osd/osd_internal_types: wake snaptrimmer on put_read lock, too (`issue#19131 `_, `pr#13755 `_, Sage Weil) * osd/OSDMap: don't set weight to IN when OSD is destroyed (`issue#19119 `_, `pr#13730 `_, Ilya Dryomov) * osd/PG: conditionally retry on receiving pg-notify when Primary is Incomplete (`pr#13942 `_, xie xingguo) * osd/PGLog: fix index for parent and child log on split (`issue#18975 `_, `pr#13493 `_, Sage Weil) * osd/PG: restrict want_acting to up+acting on recovery completion (`issue#18929 `_, `pr#13420 `_, Sage Weil) * osd/Pool: Disallow enabling 'hashpspool' option to a pool without '--yes-i-really-mean-it' (`issue#18468 `_, `pr#13406 `_, Vikhyat Umrao) * osd/PrimaryLogPG: remove duplicated code. (`pr#13894 `_, Jianpeng Ma) * osd: remove copy-get-classic (`pr#13547 `_, Sage Weil) * osd: remove sortbitwise thrashing (`pr#13296 `_, Sage Weil) * osd: replace object_info_t::operator=() with decode() (`pr#13938 `_, tang.jin) * osd: small clear up and optimize on _recover_now and should_share_map function (`pr#13476 `_, song baisen) * osd: the osd should not share map with others when it is in stopping state (`pr#13668 `_, song baisen) * osd: various changes for preventing internal ENOSPC condition (`issue#16878 `_, `pr#13425 `_, David Zafman) * osd: we know the definite epoch of marking down (`pr#13121 `_, Mingxin Liu) * os/filestore: avoid unnecessary copy in filestore::_do_transaction (`pr#12578 `_, Yunchuan Wen) * os/filestore: debug which omap keys are set (`issue#19067 `_, `pr#13671 `_, Sage Weil) * os/filestore/FileJournal: bufferlist rebuild (`pr#13980 `_, Jianpeng Ma) * os/filestore/FileStore.cc: remove unneeded loop (`pr#12177 `_, Li Wang) * os/filestore/HashIndex: be loud about splits (`issue#18235 `_, `pr#12421 `_, Dan van der Ster) * os/filestore: use existing variable for same func. (`pr#13742 `_, Pan Liu) * os/FuseStore: include header in src/os/FuseStore.h for gcc 7.x (`pr#13454 `_, Jos Collin) * PendingReleaseNotes: note the fuse fstab format change (`pr#13259 `_, John Spray) * pybind/ceph_daemon: use small chunk for recv (`pr#13804 `_, Xiaoxi Chen) * pybind: cephfs should be built without librados / python-rados (`pr#13431 `_, Kefu Chai) * pybind: fix docstring for librbd Python binding (`pr#13977 `_, runsisi) * qa: add supported distros for ceph-ansible (`pr#13711 `_, Tamil Muthamizhan) * qa: add workunit to test krbd data-pool support (`pr#13482 `_, Ilya Dryomov) * qa: fix race in Mount.open_background (`issue#18661 `_, `pr#13137 `_, John Spray) * qa: handle SSHException in logrotate (`pr#13359 `_, John Spray) * qa/rados/upgrade/jewel-x-singleton: run luminous.yaml at the end (`pr#13378 `_, Sage Weil) * qa/suites/{ceph-ansible,rest}: OpenStack volumes (`pr#13672 `_, Zack Cerza) * qa/suites/ceph-deploy: Drop OpenStack volume count (`pr#13706 `_, Zack Cerza) * qa/suites: fix upgrade tests vs cluster full thrashing (`pr#13852 `_, Sage Weil) * qa/suites/fs: Add openstack volume configuration (`pr#13640 `_, Zack Cerza) * qa/suites/kcephfs: Openstack volume configuration (`pr#13634 `_, Zack Cerza) * qa/suites/{knfs,hadoop,samba}: OpenStack volume configuration (`pr#13637 `_, Zack Cerza) * qa/suites/krbd: Add openstack volume configuration (`pr#13631 `_, Zack Cerza) * qa/suites/rados/objectstore: enable experimental features for testing bluestore (`pr#13456 `_, Kefu Chai, Dan Mick) * qa/suites/rgw: Add openstack volume configuration (`pr#13611 `_, Zack Cerza) * qa/suites/rgw/thrash: add osd thrashing tests (`pr#13445 `_, Sage Weil) * qa/suites/upgrade: add tiering test to hammer-jewel-x (`issue#19185 `_, `pr#13805 `_, Kefu Chai) * qa/suites/upgrade/kraken-x: do not thrash cluster full during upgrade (`issue#19232 `_, `pr#13892 `_, Dan Mick) * qa/suites/upgrade/kraken-x (`pr#13517 `_, Sage Weil, Yuri Weinstein) * qa/suite/upgrade/jewel-x: various fixes (`pr#13734 `_, Sage Weil) * qa/tasks/ceph.py: debug which pgs aren't scrubbing (`pr#13649 `_, Sage Weil) * qa/tasks: force umount during kclient teardown (`issue#18663 `_, `pr#13099 `_, John Spray) * qa/tasks/rgw.py: start Apache before RadosGW. (`pr#13846 `_, Radoslaw Zarzynski) * qa/tasks/workunit: use ceph.git as an alternative of ceph-ci.git for cloning workunit (`pr#13663 `_, Kefu Chai) * qa/tasks/workunit: use the suite repo for cloning workunit (`pr#13452 `_, Kefu Chai) * qa/tasks/workunit: use the suite repo for cloning workunit (`pr#13625 `_, Kefu Chai) * qa/workunits/rados/test.sh: print test name when it fails (`pr#13264 `_, Kefu Chai) * qa/workunits/rbd: resolve potential rbd-mirror race conditions (`issue#18935 `_, `pr#13421 `_, Jason Dillaman) * qa/workunits/rbd: tweak rbd-mirror config to spead up testing (`pr#13228 `_, Mykola Golub) * rados: allow "rados purge" to delete objects when osd is full (`pr#13814 `_, Pan Liu) * rados: we need to get the latest osdmap when pool does not exists (`pr#13289 `_, song baisen) * rbd: demote/promote all mirrored images in a pool (`issue#18748 `_, `pr#13758 `_, Jason Dillaman) * rbd: fix typo in Kernel.cc (`issue#19273 `_, `pr#13983 `_, Gaurav Kumar Garg) * rbd: introduce v2 format for rbd export/import (`issue#13186 `_, `pr#10487 `_, Dongsheng Yang) * rbdmap: consider /etc/ceph/rbdmap when unmapping images (`issue#18884 `_, `pr#13361 `_, David Disseldorp) * rbd-mirror A/A: leader should track up/down rbd-mirror instances (`issue#18784 `_, `pr#13571 `_, Mykola Golub) * rbd-mirror: deleting a snapshot during sync can result in read errors (`issue#18990 `_, `pr#13568 `_, Jason Dillaman) * rbd-mirror: InstanceWatcher watch/notify stub for leader/follower RPC (`issue#18783 `_, `pr#13312 `_, Mykola Golub) * rbd-mirror: replace remote pool polling with add/remove notifications (`issue#15029 `_, `pr#12364 `_, Jason Dillaman) * rbd-mirror: track images via global image id (`pr#13416 `_, Jason Dillaman) * rbd-nbd: check /sys/block/nbdX/size to ensure kernel mapped correctly (`issue#18335 `_, `pr#13229 `_, Mykola Golub) * rbd-nbd: create admin socket only for map command (`issue#17951 `_, `pr#12433 `_, Pan Liu) * rbd-nbd: don't ignore --read-only option in BLKROSET ioctl (`pr#13944 `_, Pan Liu) * rbd-nbd: fix a typo "moudle" (`pr#13652 `_, Pan Liu) * rbd-nbd: fix typo in comment (`pr#14034 `_, Pan Liu) * rbd-nbd: no need to check image format any more (`pr#13389 `_, Mykola Golub) * rbd-nbd: update size only when NBD_SET_SIZE successful (`pr#14005 `_, Pan Liu) * rbd-nbd: warn when kernel parameters are ignored (`issue#19108 `_, `pr#13694 `_, Pan Liu) * rbd: prevent adding multiple mirror peers to a single pool (`issue#19256 `_, `pr#13919 `_, Jason Dillaman) * rbd: spell out image features unsupported by the kernel (`issue#19095 `_, `pr#13812 `_, Ilya Dryomov) * rbd: use concurrent writes for imports (`issue#19034 `_, `pr#13782 `_, Venky Shankar) * rbd: When Ceph cluster becomes full, should allow user to remove rbd … (`pr#12627 `_, Pan Liu) * ReplicatedBackend: don't queue Context outside of ObjectStore with obc (`issue#18927 `_, `pr#13569 `_, Samuel Just) * Revert "dummy: reduce run time, run user.yaml playbook" (`issue#18259 `_, `pr#12506 `_, Nathan Cutler) * Revert "qa/tasks/workunit: use the suite repo for cloning workunit" (`pr#13495 `_, Sage Weil) * Reviewed by: Yan, Zheng (`issue#18830 `_, `pr#13272 `_, John Spray) * rgw: AbortMultipart request returns NoSuchUpload error if the meta obj doesn't exist (`pr#12793 `_, Zhang Shaowen) * rgw: add apis to support ragweed (`pr#13645 `_, Yehuda Sadeh) * rgw: add override in header files mostly (`pr#13586 `_, liuchang0812) * rgw: add override in rgw subsystem (`issue#18922 `_, `pr#13441 `_, liuchang0812) * rgw: add radosclient finisher to perf counter (`issue#19011 `_, `pr#13535 `_, lvshuhua) * rgw: add support for noncurrentversion expiration in s3 lifecycle. (`issue#18916 `_, `pr#13385 `_, Zhang Shaowen) * rgw-admin: remove deprecated regionmap commands (`issue#18725 `_, `pr#13963 `_, Casey Bodley) * rgw: change default chunk size to 4MB (`issue#18621 `_, `issue#18622 `_, `issue#18623 `_, `pr#13035 `_, Yehuda Sadeh) * rgw: change loglevel to 20 for 'System already converted' message (`issue#18919 `_, `pr#13399 `_, Vikhyat Umrao) * rgw: change loglevel to 5 in user's quota sync. (`issue#18921 `_, `pr#13408 `_, Zhang Shaowen) * rgw: clean redundant code (`pr#13302 `_, Yankun Li) * rgw: cleanup lifecycle managament (`pr#13820 `_, Jiaying Ren) * rgw: clean up the redundant assignment in last_entry_in_listing (`pr#13387 `_, Jing Wenjun) * rgw: cleanup unused var in rgw/rgw_rest_s3.cc (`pr#13434 `_, Jiaying Ren) * rgw: clear old zone short ids on period update (`issue#15618 `_, `pr#13949 `_, Casey Bodley) * rgw: correct the debug info when unlink instance failed. (`pr#13761 `_, Zhang Shaowen) * rgw: Correct the return codes for the health check feature (`issue#19025 `_, `pr#13557 `_, Pavan Rallabhandi) * rgw: data sync includes instance in rgw_obj_index_key (`pr#13948 `_, Casey Bodley) * rgw: don't init rgw_obj from rgw_obj_key when it's incorrect to do so (`issue#19096 `_, `pr#13676 `_, Yehuda Sadeh) * rgw: don't log the env_map twice (`pr#13481 `_, Abhishek Lekshmanan) * rgw: don't return skew time error in pre-signed url (`issue#18828 `_, `pr#13354 `_, liuchang0812) * rgw: error_code in error log is not right when data sync fails. (`issue#18437 `_, `pr#12810 `_, Zhang Shaowen) * rgw_file: avoid stranding invalid-name bucket handles in fhcache (`issue#19036 `_, `pr#13590 `_, Matt Benjamin) * rgw_file: ensure valid_s3_object_name for directories, too (`issue#19066 `_, `pr#13614 `_, Matt Benjamin) * rgw_file: fix double unref on rgw_fh for rename (`pr#13988 `_, Gui Hecheng) * rgw_file: fix fs_inst progression (`issue#19214 `_, `pr#13832 `_, Matt Benjamin) * rgw_file: fix non-negative return code for open operation (`pr#14045 `_, Gui Hecheng) * rgw_file: fix reversed return value of getattr (`pr#13895 `_, Gui Hecheng) * rgw_file: implement reliable has-children check (unlink dir) (`issue#19270 `_, `pr#13953 `_, Matt Benjamin) * rgw_file: posix style atime,ctime,mtime (`pr#13765 `_, Gui Hecheng) * rgw_file: RGWFileHandle dtor must also cond-unlink from FHCache (`issue#19112 `_, `pr#13712 `_, Matt Benjamin) * rgw: fix break inside of yield in RGWFetchAllMetaCR (`issue#17655 `_, `pr#11586 `_, Casey Bodley) * rgw: fix disabling Swift's object versioning through empty X-Versions-Location (`issue#18852 `_, `pr#13303 `_, Jing Wenjun) * rgw: fix error code of inexistence of versions location in swift api (`issue#18880 `_, `pr#13350 `_, Jing Wenjun) * rgw:Fixes typo in rgw_admin.cc (`issue#19026 `_, `pr#13576 `_, Ronak Jain) * rgw: fix for broken yields in RGWMetaSyncShardCR (`issue#18076 `_, `pr#12223 `_, Casey Bodley) * rgw: fix init_bucket_for_sync retcode (`pr#13684 `_, Shasha Lu) * rgw: fix transition from full to incremental meta sync (`pr#13920 `_, Casey Bodley) * rgw: fix typo in comment (`pr#13578 `_, liuchang0812) * rgw: get torrent request's parameter is not the same as amazon s3. (`issue#19136 `_, `pr#13760 `_, Zhang Shaowen) * rgw: handle error return value in build_linked_oids_index (`pr#13955 `_, wangzhengyong) * rgw: http_client clarify the debug msg function call (`pr#13688 `_, Abhishek Lekshmanan) * rgw: if user.email is empty, dont try to delete (`issue#18980 `_, `pr#13783 `_, Casey Bodley) * rgw: improve handling of illformed Swift's container ACLs. (`issue#18796 `_, `pr#13248 `_, Radoslaw Zarzynski) * rgw: Let the object stat command be shown in the usage (`issue#19013 `_, `pr#13291 `_, Pavan Rallabhandi) * rgw: librgw refcnt (`pr#13405 `_, Matt Benjamin) * rgw: make sending Content-Length in 204 and 304 responses controllable (`issue#16602 `_, `pr#10156 `_, Radoslaw Zarzynski) * rgw: make sync thread name clear (`issue#18860 `_, `pr#13324 `_, lvshuhua) * rgw: print is_admin as int instead of __u8 (`pr#12264 `_, Casey Bodley) * rgw: put object's acl can't work well on the latest object (`issue#18649 `_, `pr#13078 `_, Zhang Shaowen) * rgw: remove redundant codes in rgw_cache.h (`pr#13902 `_, lihongjie) * rgw: remove useless --tier_type in radosgw-admin. (`pr#13856 `_, Zhang Shaowen) * rgw: rest_admin/user avoid double checking input args (`pr#13460 `_, Abhishek Lekshmanan) * rgw/rgw_main.cc: fix parenteses and function result (`pr#12295 `_, Willem Jan Withagen) * rgw: set dumpable flag after setuid post ff0e521 (`issue#19089 `_, `pr#13657 `_, Brad Hubbard) * rgw: set FCGI_INCLUDE_DIR for cephd_rgw_base (`issue#18918 `_, `pr#13393 `_, David Disseldorp) * rgw structures rework (`issue#17996 `_, `issue#19249 `_, `pr#11485 `_, Yehuda Sadeh) * rgw: use rgw_zone_root_pool for region_map like is done in hammer (`issue#19195 `_, `pr#13928 `_, Orit Wasserman) * rgw: use separate http_manager for read_sync_status (`issue#19236 `_, `pr#13660 `_, Shasha Lu) * rgw: wip dir orphan (`issue#18992 `_, `issue#18989 `_, `issue#19018 `_, `issue#18991 `_, `pr#13529 `_, Matt Benjamin) * rgw: wip parentref (`issue#19060 `_, `issue#19059 `_, `pr#13607 `_, Matt Benjamin) * rocksdb: sync with upstream (`issue#18464 `_, `pr#13306 `_, Kefu Chai) * rpm: build ceph-resource-agents by default (`issue#17613 `_, `pr#13515 `_, Nathan Cutler) * rpm: disable dwz to speed up valgrind (`issue#19099 `_, `pr#13748 `_, Kefu Chai) * rpm: drop boost build dependencies (`pr#13519 `_, Nathan Cutler) * script/find_dups_in_pg_log: scrip to find dup requests due to short pg logs (`pr#13417 `_, Sage Weil) * script/sepia_bt.sh: get sha1,release from t.log if it's not in core (`pr#13620 `_, Kefu Chai) * script/sepia_bt.sh: no need to pass version and sha1 anymore (`pr#13380 `_, Kefu Chai) * script/sepia_bt.sh: support xenial (`pr#13292 `_, Kefu Chai) * selinux: Allow ceph daemons to read net stats (`issue#19254 `_, `pr#13945 `_, Boris Ranto) * Signed-off-by: Kefu Chai (`pr#14006 `_, shiqi) * systemd: Start OSDs after MONs (`issue#18516 `_, `pr#13097 `_, Boris Ranto) * tasks/cephfs: tear down on mount() failure (`pr#13282 `_, John Spray) * test: add override in test submodule (`pr#13773 `_, liuchang0812) * test/ceph-test-objectstore: Don't always include BlueStore code (`pr#13516 `_, Willem Jan Withagen) * tested by-had, using fault injection (`issue#19019 `_, `pr#13546 `_, Casey Bodley) * test/encoding: fix readable.sh bugs; fix ceph-object-corpus (`pr#13678 `_, Sage Weil) * test: Fix mismatched sign comparison in histogram test (`pr#13362 `_, Adam C. Emerson) * test: fix rbd unit test cases w/ striping feature (`issue#18888 `_, `pr#13196 `_, Venky Shankar) * test: fix test_pidfile (`pr#13646 `_, yaoning) * test/libcephfs: avoid buffer overflow when testing ceph_getdents() (`issue#18941 `_, `pr#13429 `_, "Yan, Zheng") * test/librbd/CMakeLists.txt: ceph_test_librbd_fsx requires linux includes/libs (`pr#13630 `_, Willem Jan Withagen) * test/librbd: move tests using non-public api to internal (`pr#13806 `_, Venky Shankar) * test: Need to exclude the fsx executable also on FreeBSD (`pr#13686 `_, Willem Jan Withagen) * test/rgw: add bucket acl and versioning tests to test_multi.py (`pr#12449 `_, Casey Bodley) * tests: ceph_test_rados_api_watch_notify: test timeout using rados_wat… (`issue#19312 `_, `pr#14061 `_, Kefu Chai) * tests: drop buildpackages.py (`issue#18846 `_, `pr#13319 `_, Nathan Cutler) * tests: drop obsolete Perl scripts (`pr#13951 `_, Nathan Cutler) * test: sed on FreeBSD requires "-i extension", so use gsed (`pr#13903 `_, Willem Jan Withagen) * tests: fix regression in qa/tasks/ceph_master.py (`issue#16263 `_, `pr#13279 `_, Nathan Cutler, Kefu Chai) * tests: osd-scrub-repair.sh disable scrub backoff in test (`pr#13334 `_, Kefu Chai) * tests: re-enable cephfs python tests on kclient (`issue#17193 `_, `issue#18161 `_, `pr#13200 `_, Nathan Cutler) * test/store_test: add deferred test case setup to support explicit min… (`issue#18857 `_, `pr#13415 `_, Igor Fedotov) * tests: update SUSE yaml facets in qa/distros/all (`issue#18856 `_, `pr#13313 `_, Nathan Cutler) * test: support blacklisting within librados_test_stub (`pr#13737 `_, Jason Dillaman) * test: Thrasher: do not update pools_to_fix_pgp_num if nothing happens (`pr#13518 `_, Kefu Chai) * test: Thrasher: update pgp_num of all expanded pools if not yet (`pr#13367 `_, Kefu Chai) * test: unittest_hostname compile error on freebsd (`pr#13739 `_, liuchang0812) * tools: add override in tool submodule (`pr#13776 `_, liuchang0812) * tools/rados: Check return value of connect (`issue#19319 `_, `pr#14057 `_, Brad Hubbard) * tools/rados: remove useless function declaration (`pr#12566 `_, liuchang0812) * tools/scripts:"FreeBSD getopt is not compatible, use the one from packages" (`pr#13260 `_, Willem Jan Withagen) * tracing: don't include oid when tracing at dequeue_op() (`pr#13410 `_, Yehuda Sadeh) * upstart: start radosgw-all according to runlevel (`issue#18313 `_, `pr#12586 `_, Ken Dreyer) * verified by hand f23 (`issue#19111 `_, `pr#13703 `_, Matt Benjamin) * vstart: add --help documentation for rgw_num (`pr#13817 `_, Ali Maredia) * vstart: clean up usage a bit (`pr#13138 `_, Sage Weil) * vstart: do not add host for mgr.* section if not $overwrite_conf (`pr#13767 `_, Kefu Chai) * vstart: don't configure rgw_dns_name (`pr#13411 `_, Yehuda Sadeh) * vstart: don't create cluster by default (`pr#13891 `_, Yehuda Sadeh) v12.0.0 Luminous (dev) ====================== This is the first development checkpoint release of Luminous series, the next long term release. We're off to a good start to release Luminous in the spring of '17. Notable Changes --------------- * bluestore: avoid unnecessary copy with coll_t (`pr#12576 `_, Yunchuan Wen) * bluestore: fixed compilation error when enable spdk (`pr#12672 `_, Pan Liu) * bluestore: os/bluestore: add a debug option to bypass block device writes for bl… (`pr#12464 `_, Igor Fedotov) * bluestore: os/bluestore: Add bluestore pextent vector to mempool (`pr#12946 `_, Igor Fedotvo, Igor Fedotov) * bluestore: os/bluestore: add perf variable for throttle info in bluestore (`pr#12583 `_, Pan Liu) * bluestore: os/bluestore: allow multiple SPDK BlueStore OSD instances (`issue#16966 `_, `pr#12604 `_, Orlando Moreno) * bluestore: os/bluestore/BitmapFreelistManager: readability improvements (`pr#12719 `_, xie xingguo) * bluestore: os/bluestore/BlueFS: fix reclaim_blocks (`issue#18368 `_, `pr#12725 `_, Sage Weil) * bluestore: os/bluestore: conditionally load crr option (`pr#12877 `_, xie xingguo) * bluestore: os/bluestore: fix Allocator::allocate() int truncation (`issue#18595 `_, `pr#13010 `_, Sage Weil) * bluestore: os/bluestore: fix min_alloc_size at mkfs time (`pr#13192 `_, Sage Weil) * bluestore: os/bluestore: fix NVMEDevice::open failure if serial number ends with a … (`pr#12956 `_, Hongtong Liu) * bluestore: os/bluestore: fix OnodeSizeTracking testing (`pr#12684 `_, xie xingguo) * bluestore: os/bluestore: fix potential assert in cache _trim method. (`pr#13234 `_, Igor Fedotov) * bluestore: os/bluestore: fix reclaim_blocks and clean up Allocator interface (`issue#18573 `_, `pr#12963 `_, Sage Weil) * bluestore: os/bluestore: include logical object offset in crc error (`pr#13074 `_, Sage Weil) * bluestore: os/bluestore/KernelDevice: fix debug message (`pr#13135 `_, Sage Weil) * bluestore: os/bluestore/KernelDevice: kill zeros (`pr#12856 `_, xie xingguo) * bluestore: os/bluestore: kill BufferSpace.empty() (`pr#12871 `_, xie xingguo) * bluestore: os/bluestore: kill orphan declaration of do_write_check_depth() (`pr#12853 `_, xie xingguo) * bluestore: os/bluestore: miscellaneous fixes to BitAllocator (`pr#12696 `_, xie xingguo) * bluestore: os/bluestore: nullptr in OmapIteratorImpl::valid (`pr#12900 `_, Xinze Chi) * bluestore: os/bluestore/NVMEDevice.cc: fix the random read issue. (`pr#13055 `_, optimistyzy) * bluestore: os/bluestore/NVMEdevice: fix the unrelease segs issue (`pr#12862 `_, optimistyzy) * bluestore: os/bluestore: preallocate object[extent_shard] key to avoid reallocate (`pr#12644 `_, xie xingguo) * bluestore: os/bluestore: put strings in mempool (`pr#12651 `_, Allen Samuels, Sage Weil) * bluestore: os/bluestore: refactor ExtentMap::update to avoid preceeding db updat… (`pr#12394 `_, Igor Fedotov) * bluestore: os/bluestore: remove a never read value (`pr#12618 `_, liuchang0812) * bluestore: os/bluestore: remove intermediate key var to avoid string copy (`pr#12643 `_, xie xingguo) * bluestore: os/bluestore: shrink buffer_map key into uint32_t (`pr#12850 `_, xie xingguo) * bluestore: os/bluestore: _txc_release_alloc when do wal cleaning (`pr#12692 `_, Xinze Chi) * bluestore: os: extend ObjectStore interface to dump store's performance counters (`pr#13203 `_, Igor Fedotov) * bluestore: rocksdb: add option: writable_file_max_buffer_size = 0 (`pr#12562 `_, Jianpeng Ma) * build/ops: add sanity checks to run-make-check.sh (`pr#12683 `_, Nathan Cutler) * build/ops: build/cmake: provide asan, tsan, ubsan builds (`pr#12615 `_, Matt Benjamin) * build/ops: builds with dpdk v16.07 (`pr#12707 `_, Kefu Chai) * build/ops: ceph-detect-init: adding Arch Linux support (`pr#12787 `_, Jamin W. Collins) * build/ops,cleanup: auth: Let's not use the deprecated cephx option (`pr#12721 `_, Dave Chen) * build/ops: CMakeLists.txt: boost_python.so requires libpython.*.so on FreeBSD (`pr#12763 `_, Willem Jan Withagen) * build/ops: common/module.c: do not use strerror_r the GNU way. (`pr#12363 `_, Willem Jan Withagen) * build/ops,core: ceph-disk: use correct user in check_journal_req (`issue#18538 `_, `pr#12947 `_, Samuel Matzek) * build/ops: mailmap: Update OVH contributors (`pr#13063 `_, Bartłomiej Święcki) * build/ops: packaging: install libceph-common.so* not libceph-common.so.* (`issue#18692 `_, `pr#13148 `_, Kefu Chai) * build/ops: Reviewd-by: Loic Dachary (`issue#18635 `_, `pr#13057 `_, Wido den Hollander) * build/ops,rgw: rgw_file: radosgw-admin can be built under FreeBSD (`pr#12191 `_, Willem Jan Withagen) * build/ops: rocksdb: build with ppc64 (`pr#12908 `_, Kefu Chai) * build/ops: script: adding contributor credits script (`pr#13251 `_, Patrick McGarry) * build/ops: script/sepia_bt.sh: download packages from shaman not gitbuilder (`pr#12799 `_, Kefu Chai) * build/ops: spdk: upgrade spdk to v16.12 (`pr#12734 `_, Pan Liu) * build/ops: src/CMakeLists.txt: disable -Werror on rocksdb (`pr#12560 `_, Willem Jan Withagen) * build/ops: src/CMakeLists.txt: Move parse_secret_objs setting within definition block (`pr#12785 `_, Willem Jan Withagen) * build/ops: test/fio_ceph_objectstore: fix fio plugin build failure caused by rec… (`pr#12655 `_, Igor Fedotov) * build/ops: tool: add some ceph relate processes to ps-ceph.pl (`pr#12406 `_, songbaisen) * build/ops: tracing: Fix error in including all files in osd_tp (`pr#12501 `_, Ganesh Mahalingam) * cephfs: ceph_fuse: pid_file default to empty (`issue#18309 `_, `pr#12628 `_, Nathan Cutler) * cephfs,cleanup: client: fix potential buffer overflow (`pr#12515 `_, Yunchuan Wen) * cephfs,cleanup: client: simplify remove_cap interface (`pr#12161 `_, John Spray) * cephfs,cleanup: libcephfs: cleanups (`pr#12830 `_, huanwen ren) * cephfs,cleanup: qa: unpin knfs from ubuntu (`issue#16397 `_, `pr#13088 `_, John Spray) * cephfs,cleanup: Rewrite mount.fuse.ceph (to python) and move ceph-fuse options to fs_mntops (`pr#11448 `_, Edgaras Lukosevicius) * cephfs: client/Client.cc: prevent segfaulting (`issue#9935 `_, `pr#12550 `_, Michal Jarzabek) * cephfs: client: don't request lookup parent if ino is root (`pr#12478 `_, huanwen ren) * cephfs: client: fix clang warn of "argument is an uninitialized value" (`pr#12580 `_, liuchang0812) * cephfs: client: fix Client::handle_cap_flushsnap_ack() crash (`issue#18460 `_, `pr#12859 `_, Yan, Zheng) * cephfs: client: fix the cross-quota rename boundary check conditions (`pr#12489 `_, Greg Farnum) * cephfs: client: populate metadata during mount (`issue#18361 `_, `pr#12915 `_, John Spray) * cephfs: client: remove request from session->requests when handling forward (`issue#18675 `_, `pr#13124 `_, "Yan, Zheng") * cephfs,common: include/fs_types: fix unsigned integer overflow (`pr#12440 `_, runsisi) * cephfs,common,rbd: osdc: cache should ignore error bhs during trim (`issue#18436 `_, `pr#12966 `_, Jason Dillaman) * cephfs,core,cleanup,common: librados,osdc: kill ack vs commit distinction (`pr#12607 `_, Sage Weil) * cephfs: libcephfs.cc: fix memory leak (`pr#12557 `_, Michal Jarzabek) * cephfs: libcephfs: fix cct refcount constructing from rados (`pr#12831 `_, John Spray) * cephfs: mon/MDSMonitor.cc:refuse fs new on pools with obj (`issue#11124 `_, `pr#12825 `_, Michal Jarzabek) * cephfs: mount: do not print "unknown" option to kclient (`issue#18159 `_, `pr#12465 `_, John Spray) * cephfs: qa: update remaining ceph.com to download.ceph.com (`issue#18574 `_, `pr#12964 `_, John Spray) * cephfs: tasks/cephfs: fix kernel force umount (`issue#18396 `_, `pr#12833 `_, Yan, Zheng) * cleanup,bluestore: os/bluestore: avoid unnecessary memory copy, use variable reference in BlockDevice::Open (`pr#12942 `_, liuchang0812) * cleanup,bluestore: os/bluestore: cleanup BitAllocator (`pr#12661 `_, xie xingguo) * cleanup,bluestore: os/bluestore: remove no use parameter in bluestore_blob_t::map_bl (`pr#13013 `_, wangzhengyong) * cleanup,common: auth: Enhancement for the supported auth methods (`pr#12937 `_, Dave Chen) * cleanup,common: bufferlist: cleanup semantical wrong for bufferlist::append (`pr#12247 `_, Yankun Li) * cleanup,common: common/BackTrace: add operator<< (`pr#9028 `_, Kefu Chai) * cleanup: common/config: fix return type of string::find and use string::npos (`pr#9924 `_, Yan Jun) * cleanup: common/config_opts.h: remove obsolete configuration option (`pr#12659 `_, Li Wang) * cleanup,common: global: we need to handle the init_on_startup return value when global_init. (`pr#13018 `_, song baisen) * cleanup,common: msg/async: assert if compiled code doesn't support the configured ms_… (`pr#12559 `_, Avner BenHanoch) * cleanup,common: msg/async/rdma: clean line endings (`pr#12688 `_, Adir Lev) * cleanup,common: msg/async/rdma: Remove compilation warning (`pr#13142 `_, Sarit Zubakov) * cleanup,common: osd/OSDMap: get_previous_up_osd_before() may run into endless loop (`pr#12976 `_, Mingxin Liu) * cleanup,common: osd/osd_types: clean up OSDOp printers (`pr#12980 `_, Sage Weil) * cleanup: include/mempool: fix typo in comments (`pr#12772 `_, huangjun) * cleanup: mds,mon: Clean issues detected by cppcheck (`pr#13199 `_, Ilya Shipitsin) * cleanup: misc: fix code typos in header files (`pr#12716 `_, Xianxia Xiao) * cleanup: mon/PGMonitor: fix description for ceph pg ls (`pr#12807 `_, runsisi) * cleanup: Move code from .h into .cc (`pr#12737 `_, Amir Vadai) * cleanup: msg/async: avoid atomic variable overhead (`pr#12809 `_, Wei Jin) * cleanup: msg/async: cleanups (`pr#12832 `_, Wei Jin) * cleanup: msg/async/rdma: fix log line spacing (`pr#13263 `_, Adir Lev) * cleanup: msg/async: rm nonused thread variable in posixworker (`pr#12777 `_, Wei Jin) * cleanup: msg/Messenger.cc: add std::move (`pr#9760 `_, Michal Jarzabek) * cleanup: msg: Revert the change from assert(0)-> ceph_abort() where is not applicable (`pr#12930 `_, Dave Chen) * cleanup: msg/simple: Remove dead code in pipe.cc (`issue#12684 `_, `pr#12601 `_, Rishabh Kumar) * cleanup: os/bluestore/BlueStore.cc: remove unused variable (`pr#12703 `_, Li Wang) * cleanup: osdc/ObjectCacher: use state instead of get_state() (`pr#12544 `_, huangjun) * cmake: add RGW and MDS to libcephd (`pr#12345 `_, Bassam Tabbara) * cmake: cleanup the use of udev and blkid in target_link_lib() (`pr#12811 `_, Willem Jan Withagen) * cmake: disable mallinfo for jemalloc (`pr#12469 `_, Bassam Tabbara) * cmake: explictly call find_package(PythonInterp) first to fix build err (`pr#12385 `_, Yixun Lan) * cmake: Fix broken async/rdma compilation since move to libceph-common (`pr#13122 `_, Oren Duer) * cmake: fix broken RDMA compilation after merge PR #12878 (`pr#13186 `_, Oren Duer) * cmake: fix hard coded boost python lib (`pr#12480 `_, John Coyle) * cmake: fix rpath on shared libraries and binaries targets (`pr#12927 `_, Ricardo Dias) * cmake: link ceph-{mgr,mon,mds,osd} against libcommon statically (`pr#12878 `_, Kefu Chai) * cmake: remove Findpciaccess.cmake (`pr#12776 `_, optimistyzy) * cmake: search for Keyutils in default paths (`pr#12769 `_, Pascal Bach) * cmake: search for nspr include files for both suffixes: nspr4 and nspr (`issue#18535 `_, `pr#12939 `_, John Lin) * cmake: simplify find_package jemalloc (`pr#12468 `_, Bassam Tabbara) * cmake: support for external rocksdb (`pr#12467 `_, Bassam Tabbara) * cmake: turn libcommon into a shared library (`pr#12840 `_, Kefu Chai) * common/admin-socket: fix potential buffer overflow (`pr#12518 `_, Yunchuan Wen) * common: common/BackTrace: demangle on FreeBSD also (`pr#12992 `_, Kefu Chai) * common: common/buffer: close pipe fd if set nonblocking fails. (`pr#12828 `_, donglinpeng) * common: common/ceph_context: Show clear message if all features are enabled (`pr#12676 `_, Dave Chen) * common: common/xmlformatter: turn on underscored and add unittest (`pr#12916 `_, liuchang0812) * common: compressor/zstd: add zstd compression plugin (`pr#13075 `_, Kefu Chai, Sage Weil) * common: config: Improve warning for unobserved value (`issue#18424 `_, `pr#12855 `_, Brad Hubbard) * common/config_opts.h: FreeBSD timing changed due to no SO_REUSEADDR (`pr#12594 `_, Willem Jan Withagen) * common: delete unused conf "filestore_debug_disable_sharded_check" (`pr#13051 `_, Chuanhong Wang) * common: get rid of "warning: ignoring return value of ‘strerror_r’" (`pr#12775 `_, xie xingguo) * common: include/denc: improvements (`pr#12626 `_, Adam C. Emerson) * common: make attempts of auth rotating configurable (`pr#12563 `_, xie xingguo) * common: misc fixes detected by crypto shutdown assert (`pr#12925 `_, Sage Weil) * common: msg/async/rdma: reduce number of rdma rx/tx buffers (`pr#13190 `_, Adir Lev) * common: msg/async/rdma: Support for RoCE v2 and SL (`pr#12556 `_, Oren Duer) * common: msg/simple: call clear_pipe in wait() shutdown path (`issue#15784 `_, `pr#12633 `_, Sage Weil) * common: tracing: Fix handle leak in TracepointProvider (`pr#12652 `_, Brad Hubbard) * core,bluestore: NVMEDevice: fix bug in data_buf_next_sge (`pr#12812 `_, optimistyzy) * core,bluestore: os/bluestore: fix statfs to not include DB partition in free space (`issue#18599 `_, `pr#13140 `_, Sage Weil) * core,cephfs,cleanup: cleanup: use std::make_shared to replace new (`pr#12276 `_, Yunchuan Wen) * core,cleanup: msg/async/AsyncConnection: socket's fd can be zero, avoid false assert (`pr#13080 `_, Haomai Wang) * core,cleanup: osd/ECBackend: cleanup for unnecessary copy with pg_stat_t (`pr#12564 `_, Yunchuan Wen) * core,cleanup: osd/ECBackend: only need check missing_loc when doing recovery (`pr#12526 `_, huangjun) * core,cleanup: osd/ECBackend: remove unused variable "ReadCB" (`pr#12543 `_, huangjun) * core,cleanup: osd,librados: remove clone_range and associated multi-object cruft (`pr#13008 `_, Samuel Just) * core,cleanup: osd/OpRequest: dump both name and addr for the client op (`pr#12691 `_, runsisi) * core,cleanup: osd/OSDMap: Uncomment code to enable private default constructors (`pr#12597 `_, Brad Hubbard) * core,cleanup: osd/osd_type: Fix logging output (`pr#12778 `_, Brad Hubbard) * core,cleanup: osd/osd_types: Move comment to more relevant position (`pr#12779 `_, Brad Hubbard) * core,cleanup: osd/osd_types: print notify-ack op properly (`pr#12585 `_, Sage Weil) * core,cleanup: osd/PG: add new have_unfound() function in MissingLoc (`pr#12668 `_, huangjun) * core,cleanup: osd/PG: check the connection first in fulfill_log (`pr#12579 `_, huangjun) * core,cleanup: osd/PG: simplify the logic of backfill_targets checking (`pr#12519 `_, huangjun) * core,cleanup: osd/PrimaryLogPG: optimal pick_newest_available (`pr#12695 `_, huangjun) * core,cleanup: osd/PrimaryLogPG: unify the access to primary pg (`pr#12527 `_, huangjun) * core,cleanup: os/filestore/JournalingObjectStore cleanup (`pr#12528 `_, Li Wang) * core,common: common/pick_address.cc: Copy public_netw to cluster_netw if cluster empty (`pr#12929 `_, Willem Jan Withagen) * core,common: mempool: put bloom_filter in mempool (`pr#13009 `_, Sage Weil) * core: global: start removing g_ceph_context (`pr#12149 `_, Adam C. Emerson) * core: messages/MOSDPGTrim: add the missed HEAD_VERSION AND COMPAT_VERSION (`issue#18266 `_, `pr#12517 `_, huangjun) * core: messages/MOSDSubOp: Make encode_payload can be reentrant (`pr#12654 `_, Haomai Wang) * core: mon/OSDMonitor: fixup sortbitwise flag warning (`pr#12682 `_, huanwen ren) * core: mon/OSDMonitor: make 'osd crush move ...' work on osds (`issue#18587 `_, `pr#12981 `_, Sage Weil) * core: mon/OSDMonitor: make snaps on tier pool should not be allowed (`pr#9348 `_, Mingxin Liu) * core: mon/OSDMonitor: set last_force_op_resend on overlay pool too (`issue#18366 `_, `pr#12712 `_, Sage Weil) * core: mon/OSDMonitor: should propose osdmap update when cluster addr changed (`pr#11065 `_, Mingxin Liu) * core: msg: allow different ms type for cluster network and public network (`pr#12023 `_, Haomai Wang) * core: msg/async/dpdk: fix compile errors (`pr#12698 `_, Haomai Wang) * core: msg/async/rdma: fix ceph_clock_now calls (`pr#12711 `_, Haomai Wang) * core: msg: client bind (`pr#12901 `_, Zengran Zhang, Haomai Wang) * core: osdc/Objecter: resend pg commands on interval change (`issue#18358 `_, `pr#12869 `_, Samuel Just) * core: osd/PG: publish PG stats when backfill-related states change (`issue#18369 `_, `pr#12727 `_, Sage Weil) * core: osd/PrimaryLogPG::failed_push: update missing as well (`issue#18165 `_, `pr#12888 `_, Samuel Just) * core: osd/PrimaryLogPG::try_lock_for_read: give up if missing (`issue#18583 `_, `pr#13087 `_, Samuel Just) * core: osd/ReplicatedBackend: take read locks for clone sources during recovery (`issue#17831 `_, `pr#12844 `_, Samuel Just) * core: os/filestore: fix clang static check warn "use-after-free“ (`pr#12581 `_, liuchang0812) * core: os/filestore: version will be uninitialized varible if store_version doesn't exist (`pr#12582 `_, liuchang0812) * core,performance: common/TrackedOp: various cleanups and optimizations (`pr#12537 `_, Sage Weil) * core,performance: osd/ECBackend: Send write message to peers first, then do local write (`pr#12522 `_, huangjun) * core,performance: osd/pg: bound the portion of the log we request in GetLog::GetLog() (`pr#12233 `_, Jie Wang) * core: PrimaryLogPG: don't update digests for objects with mismatched names (`issue#18409 `_, `pr#12788 `_, Samuel Just) * core: Revert "PrimaryLogPG::failed_push: update missing as well" (`issue#18624 `_, `pr#13090 `_, David Zafman) * core: test/pybind/test_rados.py: tolerate TimedOut in test_ping_monitor (`issue#18529 `_, `pr#12934 `_, Samuel Just) * core,tests: ceph_test_rados_api_misc: Fix trivial memory leak (`pr#12680 `_, Brad Hubbard) * core,tests: c_write_operations.cc: Fix trivial memory leak (`pr#12663 `_, Brad Hubbard) * core,tests: qa/suites/rados/basic: set low omap limit for rgw workload (`pr#13071 `_, Sage Weil) * core,tests: qa/workunits/rest: use unique pool names for cephfs test (`pr#13188 `_, Sage Weil) * core,tests: test/librados/cmd.cc: Fix trivial memory leaks (`pr#12671 `_, Brad Hubbard) * core,tests: test/librados/c_read_operations.cc: Fix trivial memory leak (`pr#12656 `_, Brad Hubbard) * core,tests: test/librados/c_read_operations.cc: Fix valgrind errors (`issue#18354 `_, `pr#12657 `_, Brad Hubbard) * core,tests: test/librados: Silence Coverity memory leak warnings (`pr#12442 `_, Brad Hubbard, Samuel Just) * core,tests: test/librados/snapshots.cc: Fix memory leak (`pr#12690 `_, Brad Hubbard) * core,tests: test/librados/tier.cc: Fix valgrind errors (`issue#18360 `_, `pr#12705 `_, Brad Hubbard) * core,tests: test_rados_watch_notify: Fix trivial memory leaks (`pr#12713 `_, Brad Hubbard) * crush: API documentation (`pr#13205 `_, Loic Dachary) * crush: verify weights is influenced by the number of replicas (`issue#15653 `_, `pr#13083 `_, Adam C. Emerson, Loic Dachary) * crush: when osd_location_hook does not exist, we should exit error. (`pr#12961 `_, song baisen) * doc: add 0.94.10 and hammer EOL to releases.rst (`pr#13069 `_, Nathan Cutler) * doc: add verbiage to rbdmap manpage (`issue#18262 `_, `pr#12509 `_, Nathan Cutler) * doc: build/install-deps.sh: Add sphinx package for building docs on FreeBSD (`pr#13223 `_, Willem Jan Withagen) * doc/cephfs/troubleshooting: fix broken bullet list (`pr#12894 `_, Dan Mick) * doc: clarify "ceph quorum" syntax (`issue#17802 `_, `pr#11787 `_, Nathan Cutler) * doc: clarify SubmittingPatches.rst (`pr#12988 `_, Nathan Cutler) * doc: clarify the path restriction mds cap example (`pr#12993 `_, John Spray) * doc: correct S3 lifecycle support explain (`issue#18459 `_, `pr#12827 `_, liuchang0812) * doc: doc/dev: update log_based_pg.rst, fix some display problem (`pr#12730 `_, liuchang0812) * doc: Doc:Fixes Python Swift client commands (`issue#17746 `_, `pr#12887 `_, Ronak Jain) * doc: doc/release-notes: final kraken notes (`pr#12968 `_, Sage Weil) * doc: document repair/scrub features (`issue#15786 `_, `pr#9032 `_, Kefu Chai, David Zafman) * doc: Document that osd_heartbeat_grace applies to MON and OSD (`pr#13098 `_, Wido den Hollander) * doc: explain about logging levels (`pr#12920 `_, liuchang0812) * doc: fix document about rados mon (`pr#12662 `_, liuchang0812) * doc: Fixes radosgw-admin ex: in swift auth section (`issue#16687 `_, `pr#12646 `_, SirishaGuduru) * doc: fix the librados c api can not compile problem (`pr#9396 `_, song baisen) * doc: mailmap: Michal Koutny affiliation (`pr#13036 `_, Nathan Cutler) * doc: mailmap updates for v11.1.0 (`pr#12335 `_, Abhishek Lekshmanan) * doc: minor change to a cloud testing paragraph (`pr#13277 `_, Jan Fajerski) * doc: min_size advice is not helpful (`pr#12936 `_, Brad Hubbard) * doc: organizationmap: add Xianxia Xiao to Kylin Cloud team (`pr#12718 `_, Yunchuan Wen) * doc: README.FreeBSD: update current status (`pr#12096 `_, Willem Jan Withagen) * doc: release notes for v11.1.1 (`pr#12642 `_, Abhishek Lekshmanan) * doc: Remove "splitting" state (`pr#12636 `_, Brad Hubbard) * doc: typo fix in s3_compliance (`pr#12598 `_, LiuYang) * doc: Update disk thread section to reflect that scrubbing is no longe… (`pr#12621 `_, Nick Fisk) * doc: Update keystone.rst (`pr#12717 `_, Chu, Hua-Rong) * librados: asynchronous selfmanaged_snap_create/selfmanaged_snap_remove APIs (`issue#16180 `_, `pr#12050 `_, Jason Dillaman) * librados: fix compile errors from simplified aio completions (`pr#12849 `_, xie xingguo) * librbd: added rbd_flatten_with_progress to API (`issue#15824 `_, `pr#12905 `_, Ricardo Dias) * librbd: allow to open an image without opening the parent image (`issue#18325 `_, `pr#12885 `_, Ricardo Dias) * librbd: avoid possible recursive lock when racing acquire lock (`issue#17447 `_, `pr#12991 `_, Jason Dillaman) * librbd: create fewer empty objects during copyup (`issue#15028 `_, `pr#12326 `_, Douglas Fuller, Venky Shankar) * librbd: delay mirror registration when creating clones (`issue#17993 `_, `pr#12839 `_, Jason Dillaman) * librbd: don't continue to remove an image w/ incompatible features (`issue#18315 `_, `pr#12638 `_, Dongsheng Yang) * librbd: fixed initializer list ordering (`pr#13042 `_, Jason Dillaman) * librbd: initialize diff parent overlap to zero (`pr#13077 `_, Gu Zhongyan) * librbd: introduce new constants for tracking max block name prefix (`issue#18653 `_, `pr#13141 `_, Jason Dillaman) * librbd: managed lock refactoring (`pr#12922 `_, Mykola Golub) * librbd: metadata_set API operation should not change global config setting (`issue#18465 `_, `pr#12843 `_, Mykola Golub) * librbd: new API method to force break a peer's exclusive lock (`issue#18429 `_, `issue#16988 `_, `issue#18327 `_, `pr#12639 `_, Jason Dillaman) * librbd: permit removal of image being bootstrapped by rbd-mirror (`issue#16555 `_, `pr#12549 `_, Mykola Golub) * librbd: possible deadlock with flush if refresh in-progress (`issue#18419 `_, `pr#12838 `_, Jason Dillaman) * librbd: prevent self-blacklisting during break lock (`issue#18666 `_, `pr#13110 `_, Jason Dillaman) * librbd: race initializing exclusive lock and configuring IO path (`pr#13086 `_, Jason Dillaman) * librbd: refactor exclusive lock support into generic managed lock (`issue#17016 `_, `pr#12846 `_, Ricardo Dias, Jason Dillaman) * librbd: remove unnecessary dependencies of ManagedLock (`pr#12982 `_, Jason Dillaman) * librbd: resolve static analyser warnings (`pr#12863 `_, Jason Dillaman) * librbd: support for shared locking in ManagedLock (`pr#12886 `_, Ricardo Dias) * librbd: support to list snapshot timestamp (`issue#808 `_, `pr#12817 `_, Pan Liu) * librbd: warning message for mirroring pool option (`issue#18125 `_, `pr#12319 `_, Gaurav Kumar Garg) * log: use one write system call per message (`pr#11955 `_, Patrick Donnelly) * mds: add authority check for delay dirfrag split (`issue#18487 `_, `pr#12994 `_, "Yan, Zheng") * mds: check for errors decoding backtraces (`issue#18311 `_, `pr#12588 `_, John Spray) * mds: don't modify inode that is not projected (`issue#16768 `_, `pr#13052 `_, "Yan, Zheng") * mds: finish clientreplay requests before requesting active state (`issue#18461 `_, `pr#12852 `_, Yan, Zheng) * mds: fix incorrect assertion in Server::_dir_is_nonempty() (`issue#18578 `_, `pr#12973 `_, Yan, Zheng) * mds: fix null pointer dereference in Locker::handle_client_caps (`issue#18306 `_, `pr#12808 `_, Yan, Zheng) * mds: miscellaneous fixes (`issue#18646 `_, `pr#12974 `_, Yan, Zheng, "Yan, Zheng") * mds: misc multimds fixes part2 (`pr#12794 `_, Yan, Zheng) * mds: misc multimds fixes (`pr#12274 `_, Yan, Zheng) * mds: propagate error encountered during opening inode by number (`issue#18179 `_, `pr#12749 `_, Yan, Zheng) * mds: skip fragment space check for replayed request (`issue#18660 `_, `pr#13095 `_, "Yan, Zheng") * mgr: Make stats period configurable (`issue#17449 `_, `pr#12732 `_, liuchang0812) * mon: clean up some osdmon/pgmon interactions (`pr#12403 `_, Sage Weil) * mon: don't set last_osd_report when the pg stats msg is ignored (`pr#12975 `_, Zhiqiang Wang) * mon: fix a few nits (`pr#12670 `_, Sage Weil) * mon: fix wrongly delete routed pgstats op (`issue#18458 `_, `pr#12784 `_, Mingxin Liu) * mon: no delay for single message MSG_ALIVE and MSG_PGTEMP (`pr#12107 `_, yaoning) * mon: optracker's initiated_at timestamp should not be NULL (`pr#12826 `_, Mingxin Liu) * mon/OSDMonitor: fix process osd failure (`pr#12938 `_, Mingxin Liu) * mon: post-jewel cleanups (`pr#13150 `_, Kefu Chai) * msg/async: Fixed compilation error when enable -DWITH_DPDK (`pr#12660 `_, Pan Liu) * msg/async: fixed coredump when enable dpdk. (`pr#12854 `_, Pan Liu) * msg/async: fixed the error "Cause: Cannot create lock on '/var/run/.rte_c… (`pr#12860 `_, Pan Liu) * msg/async: fix file description leak in NetHandler (`pr#13271 `_, liuchang0812) * msg/async: NVMEDevice.cc: cleanup the logic in data_buf_next_sge (`pr#13056 `_, optimistyzy) * msg/async/rdma: Fix memory leak of OSD (`pr#13101 `_, Sarit Zubakov) * msg/async/rdma: fix RoCE v2 deafult value (`pr#12648 `_, Adir Lev, Oren Duer) * msg/RDMA: Fix broken compilation due to new argument in net.connect() (`pr#13096 `_, Amir Vadai) * os/bluestore: NVMEDevice: optimize sector_size usage (`pr#12780 `_, optimistyzy) * osd: add asock command to dump the scrub queue (`issue#17861 `_, `pr#12728 `_, liuchang0812) * osd: allow client throttler to be adjusted on-fly, without restart (`issue#18791 `_, `pr#13213 `_, Piotr Dałek) * osd: Calculate degraded and misplaced more accurately (`issue#18619 `_, `pr#13031 `_, David Zafman) * osd: cleanup: use string & to avoid unnecessary copy (`pr#12336 `_, Yunchuan Wen) * osd: condition object_info_t encoding on required (not up) features (`issue#18644 `_, `pr#13114 `_, Sage Weil) * osd: correct comment of perfcounter cached_crc in code (`pr#13256 `_, lvshuhua) * osd: do not send ENXIO on misdirected op by default (`issue#18751 `_, `pr#13206 `_, Sage Weil) * osd: _exit() intead of exit() for failure injection (`issue#18372 `_, `pr#12726 `_, Sage Weil) * osd: extend OMAP_GETKEYS and GETVALS to include a 'more' output field (`pr#12950 `_, Sage Weil) * osd: fix coverity warning for uninitialized members (`pr#12724 `_, Li Wang) * osd: Fix useless MAX(0, unsigned) to prevent out of wack misplaced (`issue#18718 `_, `pr#13164 `_, David Zafman) * osd: improve error message when FileStore op fails due to EPERM (`issue#18037 `_, `pr#12181 `_, Nathan Cutler) * osd: never send rados ack (only commit) (`pr#12451 `_, Sage Weil) * osd: osd/OSDMap: require OSD features only of OSDs (`issue#18831 `_, `pr#13275 `_, Ilya Dryomov) * osd: Return correct osd_objectstore in OSD metadata (`issue#18638 `_, `pr#13072 `_, Wido den Hollander) * osd: use separate waitlist for scrub (`pr#13136 `_, Sage Weil) * performance,bluestore: os/bluestore/bluestore_types: drop std::bitset for blob unused (`pr#12569 `_, Sage Weil) * performance,bluestore: os/bluestore: do not cache shard keys (`pr#12634 `_, Sage Weil) * performance,bluestore: os/bluestore: generate same onode extent-shard keys in a more efficient way (`pr#12681 `_, xie xingguo) * performance,bluestore: os/bluestore: kvdb histogram (`pr#12620 `_, Varada Kari) * performance,bluestore: os/bluestore: prevent lock for almost "flush" calls (`pr#12524 `_, Haomai Wang) * performance,bluestore: os/bluestore: replace Blob ref_map with reference counting (`pr#12904 `_, Igor Fedotov) * performance,bluestore: os/bluestore: set cache meta ratio to .9 (`pr#12635 `_, Sage Weil) * performance,bluestore: os/bluestore: use aio for reads (`pr#13066 `_, Sage Weil) * performance,bluestore: os/bluestore: use Best-Effort policy when evicting onode from cache (`pr#12876 `_, xie xingguo) * performance,bluestore: os/bluestore: various onode changes to reduce its in-memory footprint. (`pr#12700 `_, Igor Fedotov) * performance,bluestore: Tested-by: Chagam, Anjaneya (`pr#13032 `_, Sage Weil) * performance,cephfs: mds/server: skip unwanted dn in handle_client_readdir (`pr#12870 `_, Xiaoxi Chen) * performance,common,bluestore: common/Finisher: Using queue(list) instead queue(context*). (`pr#8942 `_, Jianpeng Ma) * performance: osd, messenger, librados: lttng oid tracing (`pr#12492 `_, Anjaneya Chagam) * performance: osd/ReplicatedBackend: do not set omap header if it is empty (`pr#12612 `_, fang yuxiang) * pybind: conditional compile the linux specific constant (`pr#12198 `_, Kefu Chai) * qa: Add openstack requirements to smoke suite (`pr#12913 `_, Zack Cerza) * qa: src/test/test_denc.cc: Fix errors in buffer overflow (`pr#12653 `_, Willem Jan Withagen) * rbd: add error prompt when input command 'snap set limit' is incomplete (`pr#12945 `_, Tang Jin) * rbd: additional validation for 'bench' optional parameters (`pr#12697 `_, Yunchuan Wen) * rbd: bench-write should return error if io-size >= 4G (`issue#18422 `_, `pr#12864 `_, Gaurav Kumar Garg) * rbd: cleanup: fix the typo in namespace comment (`pr#12858 `_, Dongsheng Yang) * rbd: correct coverity warnings (`pr#12954 `_, Jason Dillaman) * rbd: destination pool should be source pool if it is not specified (`issue#18326 `_, `pr#13189 `_, Gaurav Kumar Garg) * rbd: enable rbd on FreeBSD (without KRBD) (`pr#12798 `_, Willem Jan Withagen) * rbd: journal: don't hold future lock during assignment (`issue#18618 `_, `pr#13033 `_, Jason Dillaman) * rbd: journal: stop processing removal after error (`issue#18738 `_, `pr#13193 `_, Jason Dillaman) * rbd: rbd-mirror: add support for active/passive daemon instances (`issue#17018 `_, `issue#17019 `_, `issue#17020 `_, `pr#12948 `_, Mykola Golub) * rbd: rbd-mirror: avoid processing new events after stop requested (`issue#18441 `_, `pr#12837 `_, Jason Dillaman) * rbd: rbd-mirror: check remote image mirroring state when bootstrapping (`issue#18447 `_, `pr#12820 `_, Mykola Golub) * rbd: rbd-mirror: delayed replication support (`issue#15371 `_, `pr#11879 `_, Mykola Golub) * rbd: remove direct linking to static boost libraries (`pr#12962 `_, Jason Dillaman) * rbd: remove unused condition within group action handler (`pr#12723 `_, Gaurav Kumar Garg) * rgw: Added a globbing method for AWS Policies. (`pr#12445 `_, Pritha Srivastava) * rgw: Added code for REST APIs for AWS Roles. (`pr#12104 `_, Pritha Srivastava) * rgw: add lifecycle validation according to S3. (`issue#18394 `_, `pr#12750 `_, Zhang Shaowen) * rgw: be aware abount tenants on cls_user_bucket -> rgw_bucket conversion (`issue#18364 `_, `issue#16355 `_, `pr#13220 `_, Radoslaw Zarzynski) * rgw: clear master_zonegroup when reseting RGWPeriodMap (`issue#17239 `_, `pr#12658 `_, Orit Wasserman) * rgw: complete versioning enablement after sending it to meta master (`issue#18003 `_, `pr#12444 `_, Orit Wasserman) * rgw: fix handling RGWUserInfo::system in RGWHandler_REST_SWIFT. (`issue#18476 `_, `pr#12865 `_, Radoslaw Zarzynski) * rgw: fix off-by-one in RGWDataChangesLog::get_info (`issue#18488 `_, `pr#12884 `_, Casey Bodley) * rgw: fix period update crash (`issue#18631 `_, `pr#13054 `_, Orit Wasserman) * rgw: fix use of marker in List::list_objects() (`issue#18331 `_, `pr#13147 `_, Yehuda Sadeh) * rgw: fix versioned bucket data sync fail when upload is busy (`issue#18208 `_, `pr#12357 `_, lvshuhua) * rgw: ldap: simple_bind() should set ldap version option on tldap (`pr#12616 `_, Weibing Zhang) * rgw: librgw shut (`issue#18585 `_, `pr#12972 `_, Matt Benjamin) * rgw: orphan: fix error messages (`pr#12782 `_, Weibing Zhang) * rgw: remove invalid read size4 (`issue#18071 `_, `pr#12767 `_, Matt Benjamin) * rgw: rgw_file: add timed namespace invalidation (`issue#18651 `_, `pr#13038 `_, Matt Benjamin) * rgw: rgw_file: fix RGWLibFS::setattr for directory objects (`issue#18808 `_, `pr#13252 `_, Matt Benjamin) * rgw: rgw_file: interned RGWFileHandle objects need parent refs (`issue#18650 `_, `pr#13084 `_, Matt Benjamin) * rgw: rgw_file: split last argv on ws, if provided (`pr#12965 `_, Matt Benjamin) * rgw: RGWMetaSyncShardCR drops stack refs on destruction (`issue#18412 `_, `issue#18300 `_, `pr#12605 `_, Casey Bodley) * rgw: rgw multisite: fix ref counting of completions (`issue#18414 `_, `issue#18407 `_, `pr#12841 `_, Casey Bodley) * rgw: rgw realm set fixes (`issue#18333 `_, `pr#12731 `_, Orit Wasserman) * rgw: swift: fix anonymous user's error code of getting object (`issue#18806 `_, `pr#13242 `_, Jing Wenjun) * rgw: swift: the http referer acl in swift API should be shown (`issue#18665 `_, `pr#13003 `_, Jing Wenjun) * rgw: swift: The http referer should be parsed to compare in swift API (`issue#18685 `_, `pr#13005 `_, Jing Wenjun) * rgw: sync status compares the current master period (`issue#18064 `_, `pr#12907 `_, Abhishek Lekshmanan) * rgw: the swift container acl should support field .ref (`issue#18484 `_, `pr#12874 `_, Jing Wenjun) * rgw: Use decoded URI when verifying TempURL (`issue#18590 `_, `pr#13007 `_, Michal Koutný) * rgw: Wip rgw openssl 7 (`issue#11239 `_, `issue#16535 `_, `pr#11776 `_, Yehuda Sadeh, Marcus Watts) * tests,bluestore: ceph_test_objectstore: match clone_range src and dst offset (`pr#13211 `_, Sage Weil) * tests,cephfs: Improve vstart_runner to (optionally) create its own cluster (`pr#12800 `_, John Spray) * tests: ceph-helpers.sh reduce get_timeout_delays() verbosity (`pr#13257 `_, Kefu Chai) * tests: ceph_test_rados_api_watch_notify: move global variables into test class (`issue#18395 `_, `pr#12751 `_, Kefu Chai) * tests,cleanup,rbd: test: correct language mode in file headers (`pr#12924 `_, Jason Dillaman) * tests,cleanup: test: clean up unused variable (`pr#12873 `_, liuchang0812) * tests,cleanup: test: update test_rados_tool.sh, use POOL and OBJ var (`pr#12706 `_, liuchang0812) * tests: drop rbd_cli_tests.pl and RbdLib.pm (`issue#14825 `_, `pr#12821 `_, Nathan Cutler) * tests: drop unused rbd_functional_tests.pl script (`issue#14825 `_, `pr#12818 `_, Nathan Cutler) * tests: fix NULL references to be acceptable by Clang (`pr#12880 `_, Willem Jan Withagen) * tests: fix template specialization of PromoteRequest class (`pr#12815 `_, Ricardo Dias) * tests: ignore bogus ceph-objectstore-tool error in ceph_manager (`issue#16263 `_, `pr#13194 `_, Nathan Cutler) * tests: nosetests: use /usr/bin/env to find nosetests (`pr#12091 `_, Willem Jan Withagen) * tests: qa: Cleaned up distros to use `latest` versions (`pr#12804 `_, Yuri Weinstein) * tests: qa/distros: make centos_latest 7.3 (`pr#12944 `_, Sage Weil) * tests: qa: do not mention ceph branch explicitly (`pr#13225 `_, Tamil Muthamizhan) * tests: qa: fixed distros links (`pr#12770 `_, Yuri Weinstein) * tests: qa: specify client for fs workunit (`pr#12914 `_, Tamil Muthamizhan) * tests: qa/suite: Added a smoke suite for ceph-ansible (`pr#12610 `_, Tamil Muthamizhan) * tests: qa/suites/ceph-ansible: removing fs workunit (`pr#12928 `_, Tamil Muthamizhan) * tests: qa/tasks/ceph-deploy: create-keys explicitly (`pr#12867 `_, Vasu Kulkarni) * tests: qa/tasks/ceph-deploy: use the new create option during instantiation (`pr#12892 `_, Vasu Kulkarni) * tests: qa/test_rados_tool.sh: POSIX dd only accepts 'k' as multiplier (`pr#12699 `_, Willem Jan Withagen) * tests: qa: Using centos 7.2 for `latest` version (`pr#12806 `_, Yuri Weinstein) * tests: qa/workunits/ceph-helpers: wait_for_clean() races with pg creation (`pr#12866 `_, David Zafman) * tests,rbd: qa: integrate OpenStack 'gate-tempest-dsvm-full-devstack-plugin-ceph' (`issue#18594 `_, `pr#13158 `_, Jason Dillaman) * tests,rbd: qa/tasks/qemu: update default image url after ceph.com redesign (`issue#18542 `_, `pr#12953 `_, Jason Dillaman) * tests,rbd: test/librbd: add break_lock test (`pr#12842 `_, Mykola Golub) * tests: rbd/test_lock_fence.sh: fix rbdrw.py relative path (`issue#18388 `_, `pr#12747 `_, Nathan Cutler) * tests,rbd: test: use librados API to retrieve config params (`issue#18617 `_, `pr#13076 `_, Jason Dillaman) * tests: remove temporary file (`pr#12919 `_, Kefu Chai) * tests,rgw: test/rgw: add test for versioned object sync (`pr#12474 `_, Casey Bodley) * tests: subst repo and branch in git.ceph.com URL in qa/tasks/cram.py and qa/tasks/qemu.py (`issue#18440 `_, `pr#12816 `_, Nathan Cutler) * tests: test: librados_test_stub: tmap_update: return -ENOENT when removing nonexisent key (`pr#12667 `_, Mykola Golub) * tests: test: replace hard-code binary names with varibles (`pr#12675 `_, liuchang0812) * tests: Thrasher: eliminate a race between kill_osd and __init__ (`issue#18799 `_, `pr#13237 `_, Nathan Cutler) * tools: ceph-disk: convert none str to str before printing it (`issue#18371 `_, `pr#12760 `_, Kefu Chai) * tools: ceph-objectstore-tool: Handle object names that are also valid json (`pr#12848 `_, David Zafman) * tools: ceph-release-notes: ignore low-numbered PRs (`issue#18695 `_, `pr#13151 `_, Nathan Cutler) * tools,common: osdmaptool: show all the pg map to osds info (`pr#9419 `_, song baisen) * tools: script: ceph-release-notes check orig. issue only for backports (`pr#12979 `_, Abhishek Lekshmanan) * tools: support hammer in rbd_recover_tool (`pr#12413 `_, Bartłomiej Święcki) * tools: tools/rados: add a parameter "--offset" to rados put command (`pr#12674 `_, liuchang0812) v11.2.0 Kraken ============== This is the first release of the Kraken series. It is a stable release that will be maintained with bugfixes and backports until the next stable release, Luminous, is completed in the Spring of 2017. Major Changes from Jewel ------------------------ - *RADOS*: * The new *BlueStore* backend now has a stable disk format and is passing our failure and stress testing. Although the backend is still flagged as experimental, we encourage users to try it out for non-production clusters and non-critical data sets. * RADOS now has experimental support for *overwrites on erasure-coded* pools. Because the disk format and implementation are not yet finalized, there is a special pool option that must be enabled to test the new feature.  Enabling this option on a cluster will permanently bar that cluster from being upgraded to future versions. * We now default to the AsyncMessenger (``ms type = async``) instead of the legacy SimpleMessenger.  The most noticeable difference is that we now use a fixed sized thread pool for network connections (instead of two threads per socket with SimpleMessenger). * Some OSD failures are now detected almost immediately, whereas previously the heartbeat timeout (which defaults to 20 seconds) had to expire.  This prevents IO from blocking for an extended period for failures where the host remains up but the ceph-osd process is no longer running. * There is a new ``ceph-mgr`` daemon.  It is currently collocated with the monitors by default, and is not yet used for much, but the basic infrastructure is now in place. * The size of encoded OSDMaps has been reduced. * The OSDs now quiesce scrubbing when recovery or rebalancing is in progress. - *RGW*: * RGW now supports a new zone type that can be used for metadata indexing via ElasticSearch. * RGW now supports the S3 multipart object copy-part API. * It is possible now to reshard an existing bucket. Note that bucket resharding currently requires that all IO (especially writes) to the specific bucket is quiesced. * RGW now supports data compression for objects. * Civetweb version has been upgraded to 1.8 * The Swift static website API is now supported (S3 support has been added previously). * S3 bucket lifecycle API has been added. Note that currently it only supports object expiration. * Support for custom search filters has been added to the LDAP auth implementation. * Support for NFS version 3 has been added to the RGW NFS gateway. * A Python binding has been created for librgw. - *RBD*: * RBD now supports images stored in an *erasure-coded* RADOS pool using the new (experimental) overwrite support. Images must be created using the new rbd CLI "--data-pool " option to specify the EC pool where the backing data objects are stored. Attempting to create an image directly on an EC pool will not be successful since the image's backing metadata is only supported on a replicated pool. * The rbd-mirror daemon now supports replicating dynamic image feature updates and image metadata key/value pairs from the primary image to the non-primary image. * The number of image snapshots can be optionally restricted to a configurable maximum. * The rbd Python API now supports asynchronous IO operations. - *CephFS*: * libcephfs function definitions have been changed to enable proper uid/gid control. The library version has been increased to reflect the interface change. * Standby replay MDS daemons now consume less memory on workloads doing deletions. * Scrub now repairs backtrace, and populates `damage ls` with discovered errors. * A new `pg_files` subcommand to `cephfs-data-scan` can identify files affected by a damaged or lost RADOS PG. * The false-positive "failing to respond to cache pressure" warnings have been fixed. Upgrading from Kraken release candidate 11.1.0 ---------------------------------------------- * The new *BlueStore* backend had an on-disk format change after 11.1.0. Any BlueStore OSDs created with 11.1.0 will need to be destroyed and recreated. Upgrading from Jewel -------------------- * All clusters must first be upgraded to Jewel 10.2.z before upgrading to Kraken 11.2.z (or, eventually, Luminous 12.2.z). * The ``sortbitwise`` flag must be set on the Jewel cluster before upgrading to Kraken. The latest Jewel (10.2.8+) releases issue a health warning if the flag is not set, so this is probably already set. If it is not, Kraken OSDs will refuse to start and will print and error message in their log. * You may upgrade OSDs, Monitors, and MDSs in any order. RGW daemons should be upgraded last. * When upgrading, new ceph-mgr daemon instances will be created automatically alongside any monitors. This will be true for Jewel to Kraken and Jewel to Luminous upgrades, but likely not be true for future upgrades beyond Luminous. You are, of course, free to create new ceph-mgr daemon instances and destroy the auto-created ones if you do not with them to be colocated with the ceph-mon daemons. BlueStore --------- BlueStore is a new backend for managing data stored by each OSD on the directly hard disk or SSD. Unlike the existing FileStore implementation, which makes use of an XFS file system to store objects as files, BlueStore manages the underlying block device directly. Implements its own file system-like on-disk structure the is designed specifically for Ceph OSD workloads. Key features of BlueStore include: * Checksums on all data written to disk, with checksum verifications on all reads, enabled by default. * Inline compression support, which can be enabled on a per-pool or per-object basis via pool properties or client hints, respectively. * Efficient journaling. Unlike FileStore, which writes *all* data to its journal device, BlueStore only journals metadata and (in some cases) small writes, reducing the size and throughput requirements for its journal. As with FileStore, the journal can be colocated on the same device as other data or allocated on a smaller, high-performance device (e.g., an SSD or NVMe device). BlueStore journals are only 512 MB by default. The BlueStore on-disk format is expected to continue to evolve. However, we will provide support in the OSD to migrate to the new format on upgrade. .. note: BlueStore is still marked "experimental" in Kraken. We recommend its use for proof-of-concept and test environments, or other cases where data loss can be tolerated. Although it is stable in our testing environment, the code is new and bugs are inevitable. We hope that with user feedback from Kraken deployments we will have sufficient confidence to mark it stable (and the default) in the next major release (Luminous). In order to enable BlueStore, add the following to ceph.conf:: enable experimental unrecoverable data corrupting features = bluestore To create a BlueStore OSD, pass the --bluestore option to ceph-disk or ceph-deploy during OSD creation. Upgrade notes ------------- * The OSDs now avoid starting new scrubs while recovery is in progress. To revert to the old behavior (and do not let recovery activity affect the scrub scheduling) you can set the following option:: osd scrub during recovery = true * The list of monitor hosts/addresses for building the monmap can now be obtained from DNS SRV records. The service name used in when querying the DNS is defined in the "mon_dns_srv_name" config option, which defaults to "ceph-mon". * The 'osd class load list' config option is a list of object class names that the OSD is permitted to load (or '*' for all classes). By default it contains all existing in-tree classes for backwards compatibility. * The 'osd class default list' config option is a list of object class names (or '*' for all classes) that clients may invoke having only the '*', 'x', 'class-read', or 'class-write' capabilities. By default it contains all existing in-tree classes for backwards compatibility. Invoking classes not listed in 'osd class default list' requires a capability naming the class (e.g. 'allow class foo'). * The 'rgw rest getusage op compat' config option allows you to dump (or not dump) the description of user stats in the S3 GetUsage API. This option defaults to false. If the value is true, the reponse data for GetUsage looks like:: "stats": { "TotalBytes": 516, "TotalBytesRounded": 1024, "TotalEntries": 1 } If the value is false, the reponse for GetUsage looks as it did before:: { 516, 1024, 1 } * The 'osd out ...' and 'osd in ...' commands now preserve the OSD weight. That is, after marking an OSD out and then in, the weight will be the same as before (instead of being reset to 1.0). Previously the mons would only preserve the weight if the mon automatically marked and OSD out and then in, but not when an admin did so explicitly. * The 'ceph osd perf' command will display 'commit_latency(ms)' and 'apply_latency(ms)'. Previously, the names of these two columns are 'fs_commit_latency(ms)' and 'fs_apply_latency(ms)'. We remove the prefix 'fs_', because they are not filestore specific. * Monitors will no longer allow pools to be removed by default. The setting mon_allow_pool_delete has to be set to true (defaults to false) before they allow pools to be removed. This is a additional safeguard against pools being removed by accident. * If you have manually specified the monitor user rocksdb via the ``mon keyvaluedb = rocksdb`` option, you will need to manually add a file to the mon data directory to preserve this option:: echo rocksdb > /var/lib/ceph/mon/ceph-`hostname`/kv_backend New monitors will now use rocksdb by default, but if that file is not present, existing monitors will use leveldb. The ``mon keyvaluedb`` option now only affects the backend chosen when a monitor is created. * The 'osd crush initial weight' option allows you to specify a CRUSH weight for a newly added OSD. Previously a value of 0 (the default) meant that we should use the size of the OSD's store to weight the new OSD. Now, a value of 0 means it should have a weight of 0, and a negative value (the new default) means we should automatically weight the OSD based on its size. If your configuration file explicitly specifies a value of 0 for this option you will need to change it to a negative value (e.g., -1) to preserve the current behavior. * The `osd crush location` config option is no longer supported. Please update your ceph.conf to use the `crush location` option instead. * The static libraries are no longer included by the debian development packages (lib*-dev) as it is not required per debian packaging policy. The shared (.so) versions are packaged as before. * The libtool pseudo-libraries (.la files) are no longer included by the debian development packages (lib*-dev) as they are not required per https://wiki.debian.org/ReleaseGoals/LAFileRemoval and https://www.debian.org/doc/manuals/maint-guide/advanced.en.html. * The jerasure and shec plugins can now detect SIMD instruction at runtime and no longer need to be explicitly configured for different processors. The following plugins are now deprecated: jerasure_generic, jerasure_sse3, jerasure_sse4, jerasure_neon, shec_generic, shec_sse3, shec_sse4, and shec_neon. If you use any of these plugins directly you will see a warning in the mon log file. Please switch to using just 'jerasure' or 'shec'. * The librados omap get_keys and get_vals operations include a start key and a limit on the number of keys to return. The OSD now imposes a configurable limit on the number of keys and number of total bytes it will respond with, which means that a librados user might get fewer keys than they asked for. This is necessary to prevent careless users from requesting an unreasonable amount of data from the cluster in a single operation. The new limits are configured with `osd_max_omap_entries_per_request`, defaulting to 131,072, and 'osd_max_omap_bytes_per_request', defaulting to 4MB. * Calculation of recovery priorities has been updated. This could lead to unintuitive recovery prioritization during cluster upgrade. In case of such recovery, OSDs in old version would operate on different priority ranges than new ones. Once upgraded, cluster will operate on consistent values. Notable Changes --------------- * bluestore: add counter to trace blob splitting (`pr#11718 `_, xie xingguo) * bluestore: a few more cleanups (`pr#11780 `_, xie xingguo) * bluestore: avoid polluting shard info if need resharding (`pr#11439 `_, xie xingguo) * bluestore: avoid unnecessary call to init_csum() (`pr#12015 `_, xie xingguo) * bluestore: ceph-disk: adjust bluestore default device sizes (`pr#12530 `_, Sage Weil) * bluestore: ceph_test_objectstore: smaller device (`pr#11591 `_, Sage Weil) * bluestore: clean up Allocator::dump (`issue#18054 `_, `pr#12282 `_, Sage Weil) * bluestore: clear extent map on object removal (`pr#11603 `_, Sage Weil) * bluestore: compressor/ZLibCompressor: fix broken isal-l (`pr#11445 `_, Igor Fedotov) * bluestore: dedup if space overlap truly exists (`pr#11986 `_, xie xingguo) * bluestore: dedup omap_head, reuse nid instead (`pr#12275 `_, xie xingguo) * bluestore: deep fsck (`pr#11724 `_, Sage Weil) * bluestore: default bluestore_clone_cow=true (`pr#11540 `_, Sage Weil) * bluestore: drop inline_dirty from struct ExtentMap (`pr#11377 `_, xie xingguo) * bluestore: drop member "space" from Onode (`pr#12185 `_, xie xingguo) * bluestore: fix alloc release timing on sync submits (`pr#11983 `_, Sage Weil) * bluestore: fix bufferspace stats leak due to blob splitting (`pr#12039 `_, xie xingguo) * bluestore: fix collection_list end bound off-by-one (`pr#11771 `_, Sage Weil) * bluestore: fix compiler warnings (`pr#11905 `_, xie xingguo) * bluestore: fixes and cleanups (`pr#11761 `_, xie xingguo) * bluestore: fix escaping of chars > 0x80 (`pr#11502 `_, Sage Weil) * bluestore: fix extent shard span check (`pr#11725 `_, Sage Weil) * bluestore: fix has_aios (`pr#11317 `_, Sage Weil) * bluestore: Fix invalid compression statfs caused by clone op (`pr#11351 `_, Igor Fedotov) * bluestore: fix lack of resharding (`pr#11597 `_, Igor Fedotov) * bluestore: fix latency calculation (`pr#12040 `_, Pan Liu) * bluestore: fix onode vs extent key suffix (`pr#11452 `_, Sage Weil) * bluestore: fix potential memory leak (`pr#11893 `_, xie xingguo) * bluestore: fix race condtion during blob spliting (`pr#11422 `_, xiexingguo, xie xingguo) * bluestore: fix remove_collection to properly detect collection e… (`pr#11398 `_, Igor Fedotov) * bluestore: fix _split_collections race with osr_reap (`pr#11748 `_, Sage Weil) * bluestore: fix up compression tests and debug output (`pr#11350 `_, Sage Weil) * bluestore: fix writes that span existing shard boundaries (`pr#11451 `_, Sage Weil) * bluestore: flush before enumerating omap values (`issue#18140 `_, `pr#12328 `_, Sage Weil) * bluestore: formatting nits (`pr#11514 `_, xie xingguo) * bluestore: fsck: fix omap_head check (`pr#11726 `_, Sage Weil) * bluestore: GC infra refactor, more UTs and GC range calculation fixes (`pr#11482 `_, Igor Fedotov) * bluestore: KernelDevice: fix race in aio_thread vs aio_wait (`issue#17824 `_, `pr#12204 `_, Sage Weil) * bluestore: kv: dump rocksdb stats (`pr#12287 `_, Varada Kari, Jianpeng Ma, Sage Weil) * bluestore: kv/rocksdb: enable rocksdb write path breakdown (`pr#11696 `_, Haodong Tang) * bluestore: kv/RocksDBStore: rename option (`pr#11769 `_, Sage Weil) * bluestore: less code redundancy (`pr#11740 `_, xie xingguo) * bluestore: make 2q cache kin/kout size tunable (`pr#11599 `_, Haodong Tang) * bluestore: mark ops that can't tolerate ENOENT (`pr#12114 `_, Sage Weil) * bluestore: mempool: changes for bitmap allocator (`pr#11922 `_, Ramesh Chander) * bluestore: misc. fixes and cleanups (`pr#11964 `_, xie xingguo) * bluestore: move bluefs into its own mempool (`pr#11834 `_, Sage Weil) * bluestore: no garbage collection for uncompressed blobs (`pr#11539 `_, Roushan Ali, Sage Weil) * bluestore: optional debug mode to identify aio stalls (`pr#11818 `_, Sage Weil) * bluestore: os/bluestore: a few cleanups (`pr#11483 `_, Sage Weil) * bluestore: os/bluestore: avoid resharding if the last shard size fall below shar… (`pr#12447 `_, Igor Fedotov) * bluestore: os/bluestore: bitmap allocator dump functionality (`pr#12298 `_, Ramesh Chander) * bluestore: os/bluestore: bluestore_sync_submit_transaction = false (`pr#12367 `_, Sage Weil) * bluestore: os/bluestore: cleanup around Blob::ref_map (`pr#11896 `_, Igor Fedotov) * bluestore: os/bluestore: clear omap flag if parent has none (`pr#12351 `_, xie xingguo) * bluestore: os/bluestore: don't implicitly create the source object for clone (`pr#12353 `_, xie xingguo) * bluestore: os/bluestore: drop old bluestore preconditioning; replace with wal preextension of file size (`pr#12265 `_, Sage Weil) * bluestore: os/bluestore: fix global commit latency (`pr#12356 `_, xie xingguo) * bluestore: os/bluestore: fix ondisk encoding for blobs (`pr#12488 `_, Varada Kari, Sage Weil) * bluestore: os/bluestore: fix potential csum_order overflow (`pr#12333 `_, xie xingguo) * bluestore: os/bluestore: fix target_buffer value overflow in Cache::trim() (`pr#12507 `_, Igor Fedotov) * bluestore: os/bluestore: include modified objects in flush list even if onode unchanged (`pr#12541 `_, Sage Weil) * bluestore: os/bluestore: kill dead gc-related counters (`pr#12065 `_, xie xingguo) * bluestore: os/bluestore: kill overlay related options (`pr#11557 `_, xie xingguo) * bluestore: os/bluestore: misc coverity fixes/cleanups (`pr#12202 `_, Sage Weil) * bluestore: os/bluestore: preserve source collection cache during split (`pr#12574 `_, Sage Weil) * bluestore: os/bluestore: remove 'extents' from shard_info (`pr#12629 `_, Sage Weil) * bluestore: os/bluestore: simplified allocator interfaces to single apis (`pr#12355 `_, Ramesh Chander) * bluestore: os/bluestore: simplify allocator release flow (`pr#12343 `_, Sage Weil) * bluestore: os/bluestore: simplify can_split_at() (`pr#11607 `_, xie xingguo) * bluestore: os/bluestore: use iterator for erase() method directly (`pr#11490 `_, xie xingguo) * bluestore: os/kstore: rmcoll fix to satisfy store_test (`pr#11533 `_, Igor Fedotov) * bluestore: os: make filestore_blackhole -> objectstore_blackhole (`pr#11788 `_, Sage Weil) * bluestore: os: move_ranges_destroy_src (`pr#11237 `_, Manali Kulkarni, Sage Weil) * bluestore: readability improvements and doxygen fix (`pr#11895 `_, xie xingguo) * bluestore: reap collection after all pending ios done (`pr#11797 `_, Haomai Wang) * bluestore: reap ioc when stopping aio_thread. (`pr#11811 `_, Haodong Tang) * bluestore: refactor _do_write(); move initializaiton of csum out of loop (`pr#11823 `_, xie xingguo) * bluestore: remove duplicated namespace of tx state (`pr#11845 `_, xie xingguo) * bluestore: remove garbage collector staff (`pr#12042 `_, Igor Fedotov) * bluestore: set next object as ghobject_t::get_max() when start.hobj.i… (`pr#11495 `_, Xinze Chi, Haomai Wang) * bluestore: simplify blob status checking for small writes (`pr#11366 `_, xie xingguo) * bluestore: some more cleanups (`pr#11910 `_, xie xingguo) * bluestore: spdk: a few fixes (`pr#11882 `_, Yehuda Sadeh) * bluestore: speed up omap-key generation for same onode (`pr#11807 `_, xie xingguo) * bluestore: traverse buffer_map in reverse order when spliting BufferSpace (`pr#11468 `_, xie xingguo) * bluestore: update cache logger after 'trim_cache' operation (`pr#11695 `_, Haodong Tang) * bluestore: use bitmap allocator for bluefs (`pr#12285 `_, Sage Weil) * bluestore: use std::unordered_map for SharedBlob lookup (`pr#11394 `_, Sage Weil) * build/ops: AArch64: Detect crc32 extension support from assembler (`issue#17516 `_, `pr#11391 `_, Alexander Graf) * build/ops: boost: embedded (`pr#11817 `_, Sage Weil, Matt Benjamin) * build/ops: build: dump env during build (`issue#18084 `_, `pr#12284 `_, Sage Weil) * build/ops: ceph-detect-init: FreeBSD introduction of bsdrc (`pr#11906 `_, Willem Jan Withagen, Kefu Chai) * build/ops: ceph-disk: enable --runtime ceph-osd systemd units (`issue#17889 `_, `pr#12241 `_, Loic Dachary) * build/ops: ceph.spec: add pybind rgwfile (`pr#11847 `_, Haomai Wang) * build/ops,cleanup,bluestore: os/bluestore: remove build warning in a better way (`pr#11920 `_, Igor Fedotov) * build/ops: CMakeLists: add vstart-base target (`pr#12476 `_, Sage Weil) * build/ops: CMakeLists.txt: enable LTTNG by default (`pr#11500 `_, Sage Weil) * build/ops: common/buffer.cc: raw_pipe depends on splice(2) (`pr#11967 `_, Willem Jan Withagen) * build/ops,common: common/str_list.h: fix clang warning about std::move (`pr#12570 `_, Willem Jan Withagen) * build/ops,core: xio: fix build (`pr#11768 `_, Matt Benjamin) * build/ops: deb: add python dependencies where needed (`issue#17579 `_, `pr#11507 `_, Nathan Cutler, Kefu Chai) * build/ops: deb: add python-rgw packages (`pr#11832 `_, Sage Weil) * build/ops: debian: apply dh_python to python-rgw also (`pr#12260 `_, Kefu Chai) * build/ops: deb: update python-rgw dependencies to librgw2 (`pr#11885 `_, Casey Bodley) * build/ops: do_freebsd.sh: Build with SYSTEM Boost on FreeBSD (`pr#11942 `_, Willem Jan Withagen) * build/ops: do_freebsd.sh: Do not use LTTNG on FreeBSD (`pr#11551 `_, Willem Jan Withagen) * build/ops: do_freebsd.sh: Set options for debug building. (`pr#11443 `_, Willem Jan Withagen) * build/ops: FreeBSD: do_freebsd.sh (`pr#12090 `_, Willem Jan Withagen) * build/ops: FreeBSD:test/encoding/readable.sh": fix nproc and ls -v calls (`pr#11522 `_, Willem Jan Withagen) * build/ops: FreeBSD: update require packages (`pr#11512 `_, Willem Jan Withagen) * build/ops: git-archive-all.sh: use an actually unique tmp dir (`pr#12011 `_, Dan Mick) * build/ops: include/enc: make clang happy (`pr#11638 `_, Kefu Chai, Sage Weil) * build/ops: install-deps.sh: allow building on SLES systems (`pr#11708 `_, Nitin A Kamble) * build/ops: install-deps.sh: JQ is needed in one script (`pr#12080 `_, Willem Jan Withagen) * build/ops: Log: Replace namespace log with logging (`pr#11650 `_, Willem Jan Withagen) * build/ops: Merging before make check because it clearly breaks the build and the build part is done (`pr#11924 `_, Sage Weil) * build/ops: ok, w/upstream acks, merging--jenkins build did succeed (this is a build-only change) (`pr#12008 `_, Matt Benjamin) * build/ops: qa: Add ceph-ansible installer. (`issue#16770 `_, `pr#10402 `_, Warren Usui) * build/ops: rocksdb: do not build with --march=native (`pr#11677 `_, Kefu Chai) * build/ops: rocksdb: update to latest (`pr#12100 `_, Kefu Chai) * build/ops: rpm: Remove trailing whitespace in usermod command (SUSE) (`pr#10707 `_, Tim Serong) * build/ops: scripts/release-notes: allow title guesses from gh tags & description update (`pr#11399 `_, Abhishek Lekshmanan) * build/ops: systemd: Fix startup of ceph-mgr on Debian 8 (`pr#12555 `_, Mark Korenberg) * build/ops: tracing/objectstore.tp: add missing move_ranges_... tp (`pr#11484 `_, Sage Weil) * build/ops: upstart: fix ceph-crush-location default (`issue#6698 `_, `pr#803 `_, Jason Dillaman) * build/ops: upstart: start ceph-all after static-network-up (`issue#17689 `_, `pr#11631 `_, Billy Olsen) * cephfs: add gid to asok status (`pr#11487 `_, Patrick Donnelly) * cephfs: API cleanup for libcephfs interfaces (`issue#17911 `_, `pr#12106 `_, Jeff Layton) * cephfs: ceph-fuse: start up log on parent process before shutdown (`issue#18157 `_, `pr#12347 `_, Greg Farnum) * cephfs: ceph_fuse: use sizeof get the buf length (`pr#11176 `_, LeoZhang) * cephfs,cleanup: ceph-fuse: start up log on parent process before shutdown (`issue#18157 `_, `pr#12358 `_, Kefu Chai) * cephfs: client: add pid to metadata (`issue#17276 `_, `pr#11359 `_, Patrick Donnelly) * cephfs: client: Client.cc: remove duplicated op type checking against CEPH_MD… (`pr#11608 `_, Weibing Zhang) * cephfs: client: don't take extra target inode reference in ll_link (`pr#11440 `_, Jeff Layton) * cephfs: client: fix mutex name typos (`pr#12401 `_, Yunchuan Wen) * cephfs: client: get caller's uid/gid on every libcephfs operation (`issue#17591 `_, `pr#11526 `_, Yan, Zheng) * cephfs: client: get gid from MonClient (`pr#11486 `_, Patrick Donnelly) * cephfs: client: improve failure messages/debugging (`pr#12110 `_, Patrick Donnelly) * cephfs: client/mds: Clear setuid bits when writing or truncating (`issue#18131 `_, `pr#12412 `_, Jeff Layton) * cephfs: client: put CapSnap not ptr in cap_snaps map (`pr#12111 `_, Patrick Donnelly) * cephfs: client: remove redundant initialization (`pr#12028 `_, Patrick Donnelly) * cephfs: client: remove unnecessary bufferptr[] for writev (`pr#11836 `_, Patrick Donnelly) * cephfs: client: remove unneeded layout on MClientCaps (`pr#11790 `_, John Spray) * cephfs: client: set metadata["root"] from mount method when it's called with … (`pr#12505 `_, Jeff Layton) * cephfs: client: trim_caps() do not dereference cap if it's removed (`pr#12145 `_, Kefu Chai) * cephfs: client: use unique_ptr (`pr#11837 `_, Patrick Donnelly) * cephfs: common/ceph_string: add ceph string constants for CEPH_SESSION_FORCE_RO (`pr#11516 `_, Zhi Zhang) * cephfs: Fix #17562 (backtrace check fails when scrubbing directory created by fsstress) (`issue#17562 `_, `pr#11517 `_, Yan, Zheng) * cephfs: fix missing ll_get for ll_walk (`issue#18086 `_, `pr#12061 `_, Gui Hecheng) * cephfs: get new fsmap after marking clusters down (`issue#7271 `_, `issue#17894 `_, `pr#1262 `_, Patrick Donnelly) * cephfs: Have ceph clear setuid/setgid bits on chown (`issue#18131 `_, `pr#12331 `_, Jeff Layton) * cephfs: libcephfs: add ceph_fsetattr&&ceph_lchmod&&ceph_lutime (`pr#11191 `_, huanwen ren) * cephfs: libcephfs: add readlink function in cephfs.pyx (`pr#12384 `_, huanwen ren) * cephfs: libcephfs and test suite fixes (`issue#18013 `_, `issue#17982 `_, `pr#12228 `_, Jeff Layton) * cephfs: libcephfs client API overhaul and update (`pr#11647 `_, Jeff Layton) * cephfs: lua: use simpler lua_next traversal structure (`pr#11958 `_, Patrick Donnelly) * cephfs: mds/Beacon: move C_MDS_BeaconSender class to .cc (`pr#10940 `_, Michal Jarzabek) * cephfs: mds/CDir.cc: remove unneeded use of count (`pr#11613 `_, Michal Jarzabek) * cephfs: mds/CInode.h: remove unneeded use of count (`pr#11371 `_, Michal Jarzabek) * cephfs: mds/DamageTable.cc: move shared ptrs (`pr#11435 `_, Michal Jarzabek) * cephfs: mds/DamageTable.cc: remove unneeded use of count (`pr#11625 `_, Michal Jarzabek) * cephfs: mds/DamageTable: move classes to .cc file (`pr#11450 `_, Michal Jarzabek) * cephfs: mds/flock: add const to member functions (`pr#11692 `_, Michal Jarzabek) * cephfs: mds/FSMap.cc: remove unneeded use of count (`pr#11402 `_, Michal Jarzabek) * cephfs: mds/FSMapUser.h: remove copy ctr and assign op (`pr#11509 `_, Michal Jarzabek) * cephfs: mds/InfoTable.h: add override to virtual functs (`pr#11496 `_, Michal Jarzabek) * cephfs: mds/InoTable.h: add override to virtual functs (`pr#11604 `_, Michal Jarzabek) * cephfs: mds/Mantle.h: include correct header files (`pr#11886 `_, Michal Jarzabek) * cephfs: mds/Mantle: pass parameters by const ref (`pr#11713 `_, Michal Jarzabek) * cephfs: mds/MDCache.h: remove unneeded call to clear func (`pr#11954 `_, Michal Jarzabek) * cephfs: mds/MDCache.h: remove unused functions (`pr#11908 `_, Michal Jarzabek) * cephfs: mds/MDLog: add const to member functions (`pr#11663 `_, Michal Jarzabek) * cephfs: mds/MDSMap.h: add const to member functions (`pr#11511 `_, Michal Jarzabek) * cephfs: mds/MDSRank: add const to member functions (`pr#11752 `_, Michal Jarzabek) * cephfs: mds/MDSRank.h: add override to virtual function (`pr#11727 `_, Michal Jarzabek) * cephfs: mds/MDSRank.h: make destructor protected (`pr#11651 `_, Michal Jarzabek) * cephfs: mds/MDSTableClient.h: add const to member funct (`pr#11681 `_, Michal Jarzabek) * cephfs: mds/Migrator.cc: remove unneeded use of count (`pr#11523 `_, Michal Jarzabek) * cephfs: mds/Migrator.h: add const to member functions (`pr#11819 `_, Michal Jarzabek) * cephfs: mds/Migrator.h: remove unneeded use of count (`pr#11833 `_, Michal Jarzabek) * cephfs: mds/Mutation.h: add const to member functions (`pr#11670 `_, Michal Jarzabek) * cephfs: mds/Mutation.h: simplify constructors (`pr#11455 `_, Michal Jarzabek) * cephfs: MDS: reduce usage of context wrapper (`pr#11560 `_, Yan, Zheng) * cephfs: mds/ScrubHeader.h: pass string by const reference (`pr#11904 `_, Michal Jarzabek) * cephfs: mds/server: merge the snapshot request judgment (`pr#11150 `_, huanwen ren) * cephfs: mds/SessionMap: add const to member functions (`pr#11541 `_, Michal Jarzabek) * cephfs: mds/SessionMap.cc: avoid copying and add const (`pr#11297 `_, Michal Jarzabek) * cephfs: mds/SessionMap.cc:put classes in unnamed namespace (`pr#11316 `_, Michal Jarzabek) * cephfs: mds/SessionMap.cc: remove unneeded use of count (`pr#11338 `_, Michal Jarzabek) * cephfs: mds/SessionMap.h: remove unneeded function (`pr#11565 `_, Michal Jarzabek) * cephfs: mds/SessionMap.h: remove unneeded use of count (`pr#11358 `_, Michal Jarzabek) * cephfs: mds/SnapRealm: remove unneeded use of count (`pr#11609 `_, Michal Jarzabek) * cephfs: mds/SnapServer.h: add override to virtual functs (`pr#11380 `_, Michal Jarzabek) * cephfs: mds/SnapServer.h: add override to virtual functs (`pr#11583 `_, Michal Jarzabek) * cephfs: mon/MDSMonitor: fix iterating over mutated map (`issue#18166 `_, `pr#12395 `_, John Spray) * cephfs: multimds: fix state check in Migrator::find_stale_export_freeze() (`pr#12098 `_, Yan, Zheng) * cephfs: osdc: After write try merge bh. (`issue#17270 `_, `pr#11545 `_, Jianpeng Ma) * cephfs: Partial organization of mds/ header sections (`pr#11959 `_, Patrick Donnelly) * cephfs: Port/bootstrap (`pr#827 `_, Yan, Zheng) * cephfs: Revert "osdc: After write try merge bh." (`issue#17270 `_, `pr#11262 `_, John Spray) * cephfs: Small pile of random cephfs fixes and cleanup (`pr#11421 `_, Jeff Layton) * cephfs: src/mds: fix MDSMap upgrade decoding (`issue#17837 `_, `pr#12097 `_, John Spray) * cephfs: systemd: add ceph-fuse service file (`pr#11542 `_, Patrick Donnelly) * cephfs: test fragment size limit (`issue#16164 `_, `pr#1069 `_, Patrick Donnelly) * cephfs: test readahead is working (`issue#16024 `_, `pr#1046 `_, Patrick Donnelly) * cephfs: test: temporarily remove fork()ing flock tests (`issue#16556 `_, `pr#11211 `_, John Spray) * cephfs: tool/cephfs: displaying "list" in journal event mode (`pr#11236 `_, huanwen ren) * cephfs: tools/cephfs: add pg_files command (`issue#17249 `_, `pr#11026 `_, John Spray) * cephfs: tools/cephfs: add scan_links command which fixes linkages errors (`pr#11446 `_, Yan, Zheng) * cephfs: update tests to enable multimds when needed (`pr#933 `_, Greg Farnum) * cleanup: build: The Light Clangtastic (`pr#11921 `_, Adam C. Emerson) * cleanup,common: common/blkdev: use realpath instead of readlink to resolve the recurs… (`pr#12462 `_, Xinze Chi) * cleanup,common: common/throttle: simplify Throttle::_wait() (`pr#11165 `_, xie xingguo) * cleanup,common: src/common: remove nonused config option (`pr#12311 `_, Wei Jin) * cleanup: coverity fix: fixing few coverity issue (`pr#9624 `_, Gaurav Kumar Garg) * cleanup: deprecate readdir_r() with readdir() (`pr#11805 `_, Kefu Chai) * cleanup: erasure-code: fix gf-complete warning (`pr#12150 `_, Kefu Chai) * cleanup: fix typos (`pr#12502 `_, xianxiaxiao) * cleanup: mds/FSMap.cc: prevent unneeded copy of map entry (`pr#11798 `_, Michal Jarzabek) * cleanup: mds/FSMap.h: add const and reference (`pr#11802 `_, Michal Jarzabek) * cleanup: mds/FSMap: pass shared_ptr by const ref (`pr#11383 `_, Michal Jarzabek) * cleanup: mds/SnapServer: add const to member function (`pr#11688 `_, Michal Jarzabek) * cleanup: mon/MonCap.h: add std::move for std::string (`pr#10722 `_, Michal Jarzabek) * cleanup: mon/OSDMonitor: only show interesting flags in health warning (`issue#18175 `_, `pr#12365 `_, Sage Weil) * cleanup: msg/async: assert(0) -> ceph_abort() (`pr#12339 `_, Li Wang) * cleanup: msg/AsyncMessenger: remove unneeded include (`pr#9846 `_, Michal Jarzabek) * cleanup: msg/async/rdma: fix disconnect log line (`pr#12254 `_, Adir Lev) * cleanup: msg/async: remove unused member variable (`pr#12387 `_, Kefu Chai) * cleanup: msg: fix format specifier for unsigned value id (`pr#11145 `_, Weibing Zhang) * cleanup: msg/Pipe: move DelayedDelivery class to cc file (`pr#10447 `_, Michal Jarzabek) * cleanup: msg/test: fix the guided compile-command to ceph_test_msgr (`pr#10490 `_, Yan Jun) * cleanup: osd/PGBackend: build_push_op segment fault (`pr#9357 `_, Zengran Zhang) * cleanup: osd/PG.h: change PGRecoveryStats struct to class (`pr#11178 `_, Michal Jarzabek) * cleanup: osd/PG.h: remove unneeded forward declaration (`pr#12135 `_, Li Wang) * cleanup: osd/ReplicatedPG: remove unneeded use of count (`pr#11251 `_, Michal Jarzabek) * cleanup: os/filestore: clean filestore perfcounters (`pr#11524 `_, Wei Jin) * cleanup: os/fs/FS.cc: condition on WITH_AIO for FreeBSD (`pr#11913 `_, Willem Jan Withagen) * cleanup,rbd: cls_rbd: silence compiler warnings (`pr#11363 `_, xiexingguo) * cleanup,rbd: journal: avoid logging an error when a watch is blacklisted (`issue#18243 `_, `pr#12473 `_, Jason Dillaman) * cleanup,rbd: journal: prevent repetitive error messages after being blacklisted (`issue#18243 `_, `pr#12497 `_, Jason Dillaman) * cleanup,rbd: librbd/ImageCtx: no need for virtual dtor (`pr#12220 `_, Sage Weil) * cleanup,rbd: rbd-mirror: configuration overrides for hard coded timers (`pr#11840 `_, Dongsheng Yang) * cleanup,rbd: rbd-mirror: set SEQUENTIAL and NOCACHE advise flags on image sync (`issue#17127 `_, `pr#12280 `_, Mykola Golub) * cleanup: remove unneeded forward declaration (`pr#12257 `_, Li Wang, Yunchuan Wen) * cleanup: remove unused declaration (`pr#12466 `_, Li Wang, Yunchuan Wen) * cleanup,rgw: rgw multisite: move lease up to RunBucketSync instead of child crs (`pr#11598 `_, Casey Bodley) * cleanup,rgw: rgw/rest: don't print empty x-amz-request-id (`pr#10674 `_, Marcus Watts) * cleanup,rgw: verified: f23 (`pr#12103 `_, Radoslaw Zarzynski) * cleanup: src/common/perf_counters.h: fix wrong word (`pr#11690 `_, zhang.zezhu) * cleanup: Wip ctypos (`pr#12495 `_, xianxiaxiao) * cleanup: xio: provide dout_prefix for XioConnection (`pr#9444 `_, Avner BenHanoch) * cleanup: yasm-wrapper: translate "-isystem $1" to "-i $1" (`pr#12093 `_, Kefu Chai) * cmake: add -Wno-unknown-pragmas to CMAKE_CXX_FLAGS (`pr#12128 `_, Kefu Chai) * cmake: check WITH_RADOSGW for fcgi and expat dependencies (`pr#11481 `_, David Disseldorp) * cmake: compile C code with c99 (`pr#12369 `_, Kefu Chai) * cmake: detect keyutils if WITH_LIBCEPHFS OR WITH_RBD (`pr#12359 `_, Kefu Chai) * cmake: do not link erasure tests again libosd (`pr#11738 `_, Kefu Chai) * cmake: find gperftools package for tcmalloc_minimal too (`pr#11403 `_, Bassam Tabbara) * cmake: fix boost build on ubuntu 16.10 yakkety (`pr#12143 `_, Bassam Tabbara) * cmake: Fix for cross compiling (`pr#11404 `_, Bassam Tabbara) * cmake: fix git version string, cleanup (`pr#11661 `_, Sage Weil) * cmake: librbd cleanup (`pr#11842 `_, Kefu Chai) * cmake: link tests against static librados (`issue#17260 `_, `pr#11575 `_, Kefu Chai) * cmake: pass CMAKE_BUILD_TYPE down to rocksdb (`pr#11767 `_, Kefu Chai) * cmake: remove include/Makefile.am (`pr#11666 `_, Kefu Chai) * cmake: replace civetweb symlink w/file copy (`pr#11900 `_, Matt Benjamin) * cmake: should link against ${ALLOC_LIBS} (`pr#11978 `_, Kefu Chai) * cmake: src/test/CMakeLists.txt: Exclude test on HAVE_BLKID (`pr#12301 `_, Willem Jan Withagen) * cmake: Support for embedding Ceph Daemons (`pr#11764 `_, Bassam Tabbara) * cmake: use external project for rocksdb (`pr#11385 `_, Bassam Tabbara) * common: Add throttle_get_started perf counter (`pr#12163 `_, Bartłomiej Święcki) * common: assert(0) -> ceph_abort() (`pr#12031 `_, Sage Weil) * common: auth: fix NULL pointer access when trying to delete CryptoAESKeyHandler instance (`pr#11614 `_, runsisi) * common,bluestore: compressor: fixes and tests; disable zlib isal (it's broken) (`pr#11349 `_, Sage Weil) * common,bluestore: mempool: mempool infrastructure, bluestore changes to use it (`pr#11331 `_, Allen Samuels, Sage Weil) * common: buffer: add advance(unsigned) back (`issue#17809 `_, `pr#11993 `_, Kefu Chai) * common: buffer: add copy(unsigned, ptr) back (`issue#17809 `_, `pr#12246 `_, Kefu Chai) * common: client/Client.cc: fix/silence "logically dead code" CID-Error (`pr#291 `_, Yehuda Sadeh) * common: common/strtol.cc: Get error testing also to work on FreeBSD (`pr#12034 `_, Willem Jan Withagen) * common: fix clang compilation error (`pr#12565 `_, Mykola Golub) * common: FreeBSD/EventKqueue.{h,cc} Added code to restore events on (thread)fork (`pr#11430 `_, Willem Jan Withagen) * common: log/LogClient: fill seq & who for syslog and graylog (`issue#16609 `_, `pr#10196 `_, Xiaoxi Chen) * common: make l_finisher_complete_lat more accurate (`pr#11637 `_, Pan Liu) * common: msg/simple/Accepter.cc: replace shutdown() with selfpipe event in poll() (FreeBSD) (`pr#10720 `_, Willem Jan Withagen) * common: osdc/Objecter: fix relock race (`issue#17942 `_, `pr#12234 `_, Sage Weil) * common: osdc/Objecter: handle race between calc_target and handle_osd_map (`issue#17942 `_, `pr#12055 `_, Sage Weil) * common: osd/osdmap: fix divide by zero error (`pr#12521 `_, Yunchuan Wen) * common: release g_ceph_context before returns (`issue#17762 `_, `pr#11733 `_, Kefu Chai) * common: Remove the runtime dependency on lsb_release (`issue#17425 `_, `pr#11365 `_, Brad Hubbard) * common: test/fio: fix global CephContext life cycle (`pr#12245 `_, Igor Fedotov) * core: auth: tolerate missing MGR keys during upgrade (`pr#11401 `_, Sage Weil) * core,bluestore: os/bluestore: fix warning and uninit variable (`pr#12032 `_, Sage Weil) * core,bluestore: os: fix offsets for move_ranges operation (`pr#11595 `_, Sage Weil) * core,bluestore: os: remove move_ranges_destroy_src (`pr#11791 `_, Sage Weil) * core: ceph-disk: allow using a regular file as a journal (`issue#17662 `_, `pr#11619 `_, Jayashree Candadai, Loic Dachary) * core: ceph-disk: resolve race conditions (`issue#17889 `_, `issue#17813 `_, `pr#12136 `_, Loic Dachary) * core,cephfs: osdc/ObjectCacher: wake up dirty stat waiters after removing buffers (`issue#17275 `_, `pr#11593 `_, Yan, Zheng) * core: ceph.in: allow 'flags' to not be present in cmddescs (`issue#18297 `_, `pr#12540 `_, Dan Mick) * core,cleanup: ceph-disk: do not create bluestore wal/db partitions by default (`issue#18291 `_, `pr#12531 `_, Loic Dachary) * core,cleanup,common: common/TrackedOp: remove unused 'now' in _dump() (`pr#12007 `_, John Spray) * core,cleanup: FileStore: Only verify split when it has been really done and done correctly (`pr#11731 `_, Li Wang) * core,cleanup: kv: remove snapshot iterator (`pr#12049 `_, Sage Weil) * core,cleanup: mon/MonClient.h: remove repeated searching of map (`pr#10601 `_, Michal Jarzabek) * core,cleanup: msg: Fix typos in socket creation error message (`pr#11907 `_, Brad Hubbard) * core,cleanup: osd/command tell: check pgid at the right time (`pr#11547 `_, Javeme) * core,cleanup: osd/OSDMap.cc: fix duplicated assignment for new_blacklist_entries (`pr#11799 `_, Ker Liu) * core,cleanup: osd/PG.cc: prevent repeated searching of map/set (`pr#11203 `_, Michal Jarzabek) * core,cleanup: osd/ReplicatedPG: remove redundant check for balance/localize read (`pr#10209 `_, runsisi) * core,cleanup: osd/ReplicatedPG: remove unneeded use of count (`pr#11242 `_, Michal Jarzabek) * core,cleanup: os/filestore: handle EINTR returned by io_getevents() (`pr#11890 `_, Pan Liu) * core,cleanup: os/ObjectStore: remove legacy tbl support (`pr#11770 `_, Jianpeng Ma) * core,cleanup: scan build fixes (`pr#12148 `_, Kefu Chai) * core,cleanup: src: rename ReplicatedPG to PrimaryLogPG (`pr#12487 `_, Samuel Just) * core,cleanup: Wip scrub misc (`pr#11397 `_, David Zafman) * core,common: buffer: put buffers in buffer_{data,meta} mempools (`pr#11839 `_, Sage Weil) * core,common: msg: add entity_addr_t types; add new entity_addrvec_t type (`pr#9825 `_, Zhao Junwang, Sage Weil) * core,common: msg/simple/Pipe: handle addr decode error (`issue#18072 `_, `pr#12221 `_, Sage Weil) * core: compress: Fix compilation failure from missing header (`pr#12108 `_, Adam C. Emerson) * core: denc: don't pass null instances into encoder fns (`issue#17636 `_, `pr#11577 `_, John Spray) * core: erasure-code: synchronize with upstream gf-complete (`issue#18092 `_, `pr#12382 `_, Loic Dachary) * core: FreeBSD/OSD.cc: add client_messenger to the avoid_ports set. (`pr#12463 `_, Willem Jan Withagen) * core: include/object: pass "snapid_t&" to bound_encode() (`pr#11552 `_, Kefu Chai) * core: kv/RocksDBStore: Don't update rocksdb perf_context if rocksdb_perf di… (`pr#12064 `_, Jianpeng Ma) * core: librados-dev: install inline_memory.h (`issue#17654 `_, `pr#11730 `_, Josh Durgin) * core: messages/MForward: reencode forwarded message if target has differing features (`pr#11610 `_, Sage Weil) * core,mgr: messages: fix out of range assertion (`pr#11345 `_, John Spray) * core: mon,ceph-disk: add lockbox permissions to bootstrap-osd (`issue#17849 `_, `pr#11996 `_, Loic Dachary) * core: mon: make it more clearly to debug for paxos state (`pr#12438 `_, song baisen) * core: mon/OSDMonitor: encode full osdmaps with features all OSDs can understand (`pr#11284 `_, Sage Weil) * core: mon/OSDMonitor: encode OSDMap::Incremental with same features as OSDMap (`pr#11596 `_, Sage Weil) * core: mon/OSDMonitor: newly created osd should not be wrongly marked in (`pr#11795 `_, runsisi) * core: mon/OSDMonitor: remove duplicate jewel/kraken flag warning (`pr#11775 `_, Josh Durgin) * core: mon/PGMap: PGs can be stuck more than one thing (`issue#17515 `_, `pr#11339 `_, Sage Weil) * core: mon: print the num_pools and num_objects in 'ceph -s -f json/json-p… (`issue#17703 `_, `pr#11654 `_, huangjun) * core: msg/async/AsyncConnection: dispatch write handler on keepalive2 (`issue#17664 `_, `pr#11601 `_, Ilya Dryomov) * core: msg/async: DPDKStack as AsyncMessenger backend (`pr#10748 `_, Haomai Wang) * core: msg/async/rdma: change log level: 0 -> 1 (`pr#12334 `_, Avner BenHanoch) * core: msg/async/rdma: don't use more buffers than what device capabilities … (`pr#12263 `_, Avner BenHanoch) * core: msg/async/rdma: ensure CephContext existed (`pr#12068 `_, Haomai Wang) * core: msg/async/rdma: event polling thread can block on event (`pr#12270 `_, Haomai Wang) * core: msg/async/rdma: fixup memory free (`pr#12236 `_, gongchuang) * core: msg/async/rdma: set correct value to memory manager (`pr#12299 `_, Adir Lev) * core: msg/async: set nonce before starting the workers (`pr#12390 `_, Kefu Chai) * core: msg: make loopback Connection feature accurate all the time (`pr#11183 `_, Sage Weil) * core: msg: seed random engine used for ms_type="random" (`pr#11880 `_, Casey Bodley) * core: msg/simple/Pipe: avoid returning 0 on poll timeout (`issue#18184 `_, `pr#12375 `_, Sage Weil) * core: msg/simple/Pipe::stop_and_wait: unlock pipe_lock for stop_fast_dispatching() (`issue#18042 `_, `pr#12307 `_, Samuel Just) * core: msg/simple: save the errno in case being changed by subsequent codes (`pr#10297 `_, Yan Jun) * core: osd/ECTransaction: only write out the hinfo if not delete (`issue#17983 `_, `pr#12141 `_, Samuel Just) * core: OSDMonitor: only reject MOSDBoot based on up_from if inst matches (`issue#17899 `_, `pr#12003 `_, Samuel Just) * core: osd,mon: require sortbitwise flag to upgrade beyond jewel (`pr#11772 `_, Sage Weil) * core: osd/osd_types: fix the osd_stat_t::decode() (`pr#12235 `_, Kefu Chai) * core: osd/PG: add "down" pg state (distinct from down+peering) (`pr#12289 `_, Sage Weil) * core: osd/PGLog::proc_replica_log,merge_log: fix bound for last_update (`issue#18127 `_, `pr#12340 `_, Samuel Just) * core: osd/ReplicatedPG: do_update_log_missing: take the pg lock in the callback (`issue#17789 `_, `pr#11754 `_, Samuel Just) * core: osd/ReplicatedPG::record_write_error: don't leak orig_reply on cancel (`issue#18180 `_, `pr#12450 `_, Samuel Just) * core: os/filestore: avoid to get the wrong hardlink number. (`pr#11841 `_, huangjun) * core: os/filestore/chain_xattr.h:uses ENODATA, so include compat.h (`pr#12279 `_, Willem Jan Withagen) * core: os/filestore: Fix erroneous WARNING: max attr too small (`issue#17420 `_, `pr#11246 `_, Brad Hubbard) * core: os/FileStore: fix fiemap issue in xfs when #extents > 1364 (`pr#11554 `_, Ning Yao) * core: os/filestore: fix journal logger (`pr#12099 `_, Wei Jin) * core: os/filestore: fix potential result code overwriting (`pr#11491 `_, xie xingguo) * core: os/filestore/HashIndex: fix list_by_hash_* termination on reaching end (`issue#17859 `_, `pr#11898 `_, Sage Weil) * core: os/ObjectStore: properly clear object map when replaying OP_REMOVE (`issue#17177 `_, `pr#11388 `_, Yan, Zheng) * core,performance: msg/async: ibverbs/rdma support (`pr#11531 `_, Haomai Wang, Zhi Wang) * core,performance: osd/OSDMap.cc: remove unneeded use of count (`pr#11221 `_, Michal Jarzabek) * core,performance: osd/PrimaryLogPG: don't truncate if we don't have to for WRITEFULL (`pr#12534 `_, Samuel Just) * core,performance: os/fs/FS: optimize aio::pwritev which make caller provide length. (`pr#9062 `_, Jianpeng Ma) * core,pybind,common: python-rados: implement new aio_execute (`pr#12140 `_, Iain Buclaw) * core,rbd,bluestore,rgw,performance,cephfs: fast denc encoding (`pr#11027 `_, Sage Weil) * core: remove spurious executable permissions on source code files (`pr#1061 `_, Samuel Just) * core: ReplicatedPG::failed_push: release read lock on failure (`issue#17857 `_, `pr#11914 `_, Kefu Chai) * core: rocksdb: update to latest, and make it the default for the mons (`pr#11354 `_, Sage Weil) * core: set dumpable flag after setuid (`issue#17650 `_, `pr#11582 `_, Patrick Donnelly) * core: systemd/ceph-disk: reduce ceph-disk flock contention (`issue#18049 `_, `issue#13160 `_, `pr#12200 `_, David Disseldorp) * core: tchaikov (`issue#17713 `_, `pr#11382 `_, Haomai Wang) * core,tests: ceph_test_rados_api_tier: dump hitset that we fail to decode (`issue#17945 `_, `pr#12057 `_, Sage Weil) * core,tests: common osd: Improve scrub analysis, list-inconsistent-obj output and osd-scrub-repair test (`issue#18114 `_, `pr#9613 `_, Kefu Chai, David Zafman) * core,tests: test,cmake: turn unit.h into unit.cc to speed up compilation (`pr#12194 `_, Kefu Chai) * core,tests: test/rados/list.cc: Memory leak in ceph_test_rados_api_list (`issue#18250 `_, `pr#12479 `_, Brad Hubbard) * core,tests: workunits/ceph-helpers.sh: Fixes for FreeBSD (`pr#12085 `_, Willem Jan Withagen) * core,tools: Added append functionality to rados tool. (`pr#11036 `_, Tomy Cheru) * core,tools: Tested-by: Huawen Ren (`issue#17400 `_, `pr#11276 `_, Kefu Chai) * core,tools: vstart: decrease pool size if <3 OSDs (`pr#11528 `_, John Spray) * crush: make counting of choose_tries consistent (`issue#17229 `_, `pr#10993 `_, Vicente Cheng) * crush: remove the crush_lock (`pr#11830 `_, Adam C. Emerson) * crush: Silence coverity warnings for test/crush/crush.cc (`pr#12436 `_, Brad Hubbard) * doc: Add doc about osd scrub {during recovery|chunk {min|max}| sleep} (`pr#12176 `_, Paweł Sadowski) * doc: Add docs about looking up Monitors through DNS (`issue#14527 `_, `pr#10852 `_, Wido den Hollander) * doc: add docs for raw compression (`pr#12244 `_, Casey Bodley) * doc: Add documentation about mon_allow_pool_delete before pool remove (`pr#11943 `_, Wido den Hollander) * doc: add infernalis EOL date (`pr#11925 `_, Ken Dreyer) * doc: adding changelog for v10.2.4 (`pr#12346 `_, Abhishek Lekshmanan) * doc: Add MON docs about pool flags and pool removal config settings (`pr#10853 `_, Wido den Hollander) * doc: add python-rgw doc (`pr#11859 `_, Kefu Chai) * doc: change the osd_max_backfills default to 1 (`issue#17701 `_, `pr#11658 `_, huangjun) * doc: clarify file deletion from OSD restricted pool behaviour (`issue#17937 `_, `pr#12054 `_, David Disseldorp) * doc: clarify mds deactivate purpose (`pr#11957 `_, Patrick Donnelly) * doc: common/Throttle: fix typo for BackoffThrottle (`pr#12129 `_, Wei Jin) * doc: correcting the object name (`pr#12354 `_, Uday Mullangi) * doc: Correcting the sample python tempurl generation script. (`issue#15258 `_, `pr#8712 `_, Diwakar Goel) * doc: Coverity and SCA fixes (`pr#7784 `_, Danny Al-Gaaf) * doc: doc/dev/osd_internals: add pgpool.rst (`pr#12500 `_, Brad Hubbard) * doc: doc/dev/perf: a few notes on perf (`pr#12168 `_, Sage Weil) * doc: doc/dev/perf: fix dittography (`pr#12317 `_, xie xingguo) * doc: doc/man: avoid file builtin to solve build error (`pr#11984 `_, Patrick Donnelly) * doc: doc/rados/configuration/ms-ref.rst: document a few async msgr options (`pr#12126 `_, Piotr Dałek) * doc: doc/rados/configuration/osd-config-ref.rst: document the fast mark down (`pr#12124 `_, Piotr Dałek) * doc: doc/release-notes: kraken release notes (draft) (`pr#12338 `_, Sage Weil) * doc: doc/releases: add links to kraken and v10.2.4 (`pr#12409 `_, Kefu Chai) * doc: doc/start/hardware-recommentdations: cosmetic (`pr#10585 `_, Zhao Junwang) * doc: Documentation syntax cleanup (`pr#11784 `_, John Spray) * doc: document osd tell bench (`issue#5431 `_, `pr#16 `_, Sage Weil) * doc: drop --journal-check from ceph-mds man page (`issue#17747 `_, `pr#11912 `_, Nathan Cutler) * doc: explain rgw_fcgi_socket_backlog in rgw/config-ref.rst (`pr#12548 `_, liuchang0812) * doc: final additions to 11.1.0-rc release notes (`pr#12448 `_, Abhishek Lekshmanan) * doc: Fix broken link for caps (`issue#17587 `_, `pr#11546 `_, Uday Mullangi) * doc: fix broken links (`issue#17587 `_, `pr#11518 `_, Uday Mullangi) * doc: fix dead link "Hardware Recommendations" (`pr#11379 `_, Leo Zhang) * doc: fix dead link of "os-recommendations" in troubleshooting-osd (`pr#11454 `_, Leo Zhang) * doc: Fixed mapping error in legacy mds command (`pr#11668 `_, Malte Fiala) * doc: Fix for worker arguments to cephfs-data-scan tool (`pr#12360 `_, Wido den Hollander) * doc: fix grammar/spelling in RGW sections (`pr#12329 `_, Ken Dreyer) * doc: Fixing the broken hyperlinks by pointing to correct documentation. (`pr#11617 `_, Uday Mullangi) * doc: fix librados example programs (`pr#11302 `_, Alexey Sheplyakov) * doc: fix mgr literal block rST syntax (`pr#11652 `_, Ken Dreyer) * doc: fix start development cluster operation in index.rst (`pr#11233 `_, Leo Zhang) * doc: fix the script for rebuild monitor db (`pr#11962 `_, Kefu Chai) * doc: fix typos (`pr#8751 `_, Li Peng) * doc: Flag deprecated mds commands and omit deprecated mon commands in help output (`pr#11434 `_, Patrick Donnelly) * doc: mailmap: change personal info (`pr#12310 `_, Wei Jin) * doc: mailmap updates sept (`pr#10955 `_, Yann Dupont) * doc: mds: fixup "mds bal mode" Description (`pr#12127 `_, huanwen ren) * doc: mention corresponding libvirt section in nova.conf (`pr#12584 `_, Marc Koderer) * doc: Modify documentation for mon_osd_down_out_interval (`pr#12408 `_, Brad Hubbard) * doc: network-protocol typos (`pr#9837 `_, Zhao Junwang) * doc: openstack glance mitaka uses show_multiple_locations (`pr#12020 `_, Sébastien Han) * doc: README.FreeBSD: update to match the bimonthly FreeBSD status report (`pr#11442 `_, Willem Jan Withagen) * doc: README: hint at where to look to diagnose test failures (`pr#11903 `_, Dan Mick) * doc: reformat SubmittingPatches with more rst syntax (`pr#11570 `_, Kefu Chai) * doc: release notes for 10.2.4 (`pr#12053 `_, Abhishek Lekshmanan) * doc: release notes for 10.2.5 (`issue#18207 `_, `pr#12410 `_, Loic Dachary) * doc: release notes for 11.0.2 (`pr#11369 `_, Abhishek Lekshmanan) * doc: Remove duplicate command for Ubuntu (`pr#12186 `_, chrone) * doc: reviewed-by: John Wilkins (`issue#17526 `_, `pr#11352 `_, Loic Dachary) * doc: reviewed-by: John Wilkins (`issue#17665 `_, `pr#11602 `_, Jason Dillaman) * doc: rgw: fix a typo in S3 java api example (`pr#11762 `_, Weibing Zhang) * doc: rm "type=rpm-md" from yum repositories (`pr#10248 `_, Ken Dreyer) * doc: Small styling fix to mirror documentation (`pr#9714 `_, Wido den Hollander) * doc: src/doc: fix class names in exports.txt (`pr#12000 `_, John Spray) * doc: standardize EPEL instructions (`pr#11653 `_, Ken Dreyer) * doc: update cinder key permissions for mitaka (`pr#12211 `_, Sébastien Han) * doc: Update crush-map.rst, fix a typo mistake (`pr#11785 `_, whu_liuchang) * doc: Update filestore xattr config documentation. (`pr#11826 `_, Bartłomiej Święcki) * doc: Update install-ceph-gateway.rst (`pr#11432 `_, Hans van den Bogert) * doc: Update keystone doc about v3 options (`pr#11392 `_, Proskurin Kirill) * doc: Update layout.rst, move commands to CODE block (`pr#11987 `_, liuchang0812) * doc: we can now run multiple MDS, so qualify warning (`issue#18040 `_, `pr#12184 `_, Nathan Cutler) * fs: add snapshot tests to mds thrashing (`pr#1073 `_, Yan, Zheng) * fs: enable ceph-fuse permission checking for all pjd suites (`pr#1187 `_, Greg Farnum) * fs: fix two frag_enable fragments (`issue#6143 `_, `pr#656 `_, Sage Weil) * fs: fix up dd testing again (`issue#10861 `_, `pr#373 `_, Greg Farnum) * fs: fuse_default_permissions = 0 for kernel build test (`pr#1109 `_, Patrick Donnelly) * fs: Mantle: A Programmable Metadata Load Balancer (`pr#10887 `_, Michael Sevilla) * fs: unify common parts of sub-suites (`issue#1737 `_, `pr#1282 `_, Patrick Donnelly) * librados: Add rados_aio_exec to the C API (`pr#11709 `_, Iain Buclaw) * librados: add timeout to watch/notify (`pr#11378 `_, Ryne Li) * librados: do not request osd ack if no completed completion is set (`pr#11204 `_, Sage Weil) * librados: For C-API, expose LIBRADOS_OPERATION_FULL_FORCE flag (`pr#9172 `_, Jianpeng Ma) * librados: improvements async IO in librados and libradosstriper (`pr#10049 `_, Sebastien Ponce) * librados: Memory leaks in object_list_begin and object_list_end (`issue#18252 `_, `pr#12482 `_, Brad Hubbard) * librados: postpone cct deletion (`pr#11659 `_, Kefu Chai) * librados: remove new setxattr overload to avoid breaking the C++ ABI (`issue#18058 `_, `pr#12206 `_, Josh Durgin) * librados: remove unused bufferlist from rados_write_op_rmxattr (`pr#12030 `_, Piotr Dałek) * librbd: add support for snapshot namespaces (`pr#11160 `_, Victor Denisov) * librbd: API changes to support separate data pool (`pr#11353 `_, Jason Dillaman) * librbd: batch object map updates during trim (`issue#17356 `_, `pr#11510 `_, Venky Shankar) * librbd: bug fixes for optional data pool support (`pr#11960 `_, Venky Shankar) * librbd: cannot access non-primary image when mirroring force disabled (`issue#16740 `_, `issue#17588 `_, `pr#11568 `_, Jason Dillaman) * librbd: cls_rbd updates for separate data pool (`issue#17422 `_, `pr#11327 `_, Jason Dillaman) * librbd: default features should be negotiated with the OSD (`issue#17010 `_, `pr#11808 `_, Mykola Golub) * librbd: diffs to clone's first snapshot should include parent diffs (`issue#18068 `_, `pr#12218 `_, Jason Dillaman) * librbd: do not create empty object map object on image creation (`issue#17752 `_, `pr#11704 `_, Jason Dillaman) * librbd: enabling/disabling rbd feature should report missing dependency (`issue#16985 `_, `pr#12238 `_, Gaurav Kumar Garg) * librbd: ensure consistency groups will gracefully fail on older OSDs (`pr#11623 `_, Jason Dillaman) * librbd: exclusive lock incorrectly initialized when switching to head revision (`issue#17618 `_, `pr#11559 `_, Jason Dillaman) * librbd: fix rollback if failed to disable mirroring for image (`pr#11260 `_, runsisi) * librbd: ignore error when object map is already locked by current client (`issue#16179 `_, `pr#12484 `_, runsisi) * librbd: ignore notify errors on missing image header (`issue#17549 `_, `pr#11395 `_, Jason Dillaman) * librbd: keep rbd_default_features setting as bitmask (`issue#18247 `_, `pr#12486 `_, Jason Dillaman) * librbd: mark request as finished after failed refresh (`issue#17973 `_, `pr#12160 `_, Venky Shankar) * librbd: minor cleanup (`pr#12078 `_, Dongsheng Yang) * librbd: new API method to force break a peer's exclusive lock (`issue#18429 `_, `issue#16988 `_, `issue#18327 `_, `pr#12889 `_, Jason Dillaman) * librbd: parse rbd_default_features config option as a string (`pr#11175 `_, Alyona Kiseleva, Alexey Sheplyakov) * librbd: possible assert failure creating image when using data pool (`pr#11641 `_, Venky Shankar) * librbd: proper check for get_data_pool compatibility (`issue#17791 `_, `pr#11755 `_, Mykola Golub) * librbd: properly order concurrent updates to the object map (`issue#16176 `_, `pr#12420 `_, Jason Dillaman) * librbd: release lock after demote (`issue#17880 `_, `pr#11940 `_, Mykola Golub) * librbd: remove consistency group rbd cli and API support (`issue#18231 `_, `pr#12475 `_, Jason Dillaman) * librbd: remove image header lock assertions (`issue#18244 `_, `pr#12472 `_, Jason Dillaman) * librbd: remove unused local variable (`pr#12388 `_, Yunchuan Wen) * librbd: silence the unused variable warning (`pr#11678 `_, Kefu Chai) * librbd: snap_get_limit compatibility check (`pr#11766 `_, Mykola Golub) * librbd: update internals to use optional separate data pool (`pr#11356 `_, Jason Dillaman) * librbd: use proper snapshot when computing diff parent overlap (`issue#18200 `_, `pr#12396 `_, Xiaoxi Chen) * log: optimize header file dependency (`pr#9768 `_, Xiaowei Chen) * mds: add debug assertion for issue #17636 (`pr#11576 `_, Yan, Zheng) * mds: add tests for mantle (programmable balancer) (`pr#1145 `_, Michael Sevilla) * mds: check if down mds is known (`issue#17670 `_, `pr#11611 `_, Patrick Donnelly) * mds: don't access mdsmap from log submit thread (`issue#18047 `_, `pr#12208 `_, Yan, Zheng) * mds: don't maintain bloom filters in standby replay (`issue#16924 `_, `pr#12133 `_, John Spray) * mds: enable rmxattr on pool_namespace attrs (`issue#17797 `_, `pr#11783 `_, John Spray) * mds: fix dropping events in standby replay (`issue#17954 `_, `pr#12077 `_, John Spray) * mds: fix EMetaBlob::fullbit xattr dump (`pr#11536 `_, Sage Weil) * mds: fix false "failing to respond to cache pressure" warning (`pr#11373 `_, Yan, Zheng) * mds: force client flush snap data before truncating objects (`issue#17193 `_, `pr#11994 `_, Yan, Zheng) * mds: handle bad standby_for_fscids in fsmap (`issue#17466 `_, `pr#11281 `_, John Spray) * mds: ignore 'session evict' when mds is replaying log (`issue#17801 `_, `pr#11813 `_, Yan, Zheng) * mds: include legacy client fsid in FSMap print (`pr#11283 `_, John Spray) * mds: more deterministic timing on frag split/join (`issue#17853 `_, `pr#12022 `_, John Spray) * mds: more unique_pointer changes (`pr#11635 `_, Patrick Donnelly) * mds: properly commit new dirfrag before splitting it (`issue#17990 `_, `pr#12125 `_, Yan, Zheng) * mds: release pool allocator memory after exceeding size limit (`issue#18225 `_, `pr#12443 `_, John Spray) * mds: remove duplicated log in handle_client_readdir (`pr#11806 `_, Zhi Zhang) * mds: remove "--journal-check" help text (`issue#17747 `_, `pr#11739 `_, Nathan Cutler) * mds: remove unused EFragment::OP_ONESHOT (`pr#11887 `_, John Spray) * mds: repair backtraces during scrub (`issue#17639 `_, `pr#11578 `_, John Spray) * mds: require MAY_SET_POOL to set pool_ns (`issue#17798 `_, `pr#11789 `_, John Spray) * mds: respawn using /proc/self/exe (`issue#17531 `_, `pr#11362 `_, Patrick Donnelly) * mds: revert "mds/Mutation: remove redundant _dump method" (`issue#17906 `_, `pr#11985 `_, Patrick Donnelly) * mds: use parse_filesystem in parse_role to handle exceptions and reuse parsing code (`issue#17518 `_, `pr#11357 `_, Patrick Donnelly) * mds: use projected path construction for access (`issue#17858 `_, `pr#12063 `_, Patrick Donnelly) * mds: use unique_ptr to simplify resource mgmt (`pr#11543 `_, Patrick Donnelly) * mgr: doc/mgr: fix mgr how long to wait to failover (`pr#11550 `_, huanwen ren) * mgr: init() return when connection daemons failed && add some err info (`pr#11424 `_, huanwen ren) * mgr: misc minor changes (`issue#17455 `_, `pr#11386 `_, xie xingguo) * mgr: PyModules.cc: remove duplicated if condition for fs_map (`pr#11639 `_, Weibing Zhang) * mgr: remove unnecessary C_StdFunction (`pr#11883 `_, John Spray) * mon: add missing space in warning message (`pr#11361 `_, Patrick Donnelly) * mon: clean legacy code (`pr#9643 `_, Wei Jin) * mon: clear duplicated logic in MDSMonitor (`pr#11209 `_, Zhi Zhang) * mon: Do not allow pools to be deleted by default (`pr#11665 `_, Wido den Hollander) * mon: fix "OSDs marked OUT wrongly after monitor failover" (`issue#17719 `_, `pr#11664 `_, Dong Wu) * mon: Forbidden copy and assignment function in monoprequest (`pr#9513 `_, song baisen) * mon: have mon-specific features & rework internal monmap structures (`pr#10907 `_, Joao Eduardo Luis) * mon: if crushtool config is empty use internal crush test (`pr#11765 `_, Bassam Tabbara) * mon: make MDSMonitor tolerant of slow mon elections (`issue#17308 `_, `pr#11167 `_, John Spray) * mon: MonmapMonitor: return success when monitor will be removed (`issue#17725 `_, `pr#11747 `_, Joao Eduardo Luis) * mon: move case CEPH_MSG_POOLOP to OSDs group (`pr#11848 `_, Javeme) * mon: osdmap's epoch should be more than 0 (`pr#9859 `_, Na Xie) * mon: OSDMonitor: fix the check error of pg creating (`issue#17169 `_, `pr#10916 `_, DesmondS) * mon: paxos add the timeout function when peon recovery (`pr#10359 `_, song baisen) * mon: preserve osd weight when marking osd out, then in (`pr#11293 `_, Sage Weil) * mon: prevent post-jewel OSDs from booting if require_jewel_osds is not set (`pr#11498 `_, Sage Weil) * mon: remove ceph-create-keys from mon startup (`issue#16036 `_, `pr#9345 `_, Owen Synge) * mon: remove the redudant jugement in LogMonitor tick function (`pr#10474 `_, song baisen) * mon: remove utime_t param in _dump (`pr#12029 `_, Patrick Donnelly) * mon: send updated monmap to its subscribers (`issue#17558 `_, `pr#11456 `_, Kefu Chai) * mon: small change on the HealthMonitor start_epoch function (`pr#10296 `_, songbaisen) * mon: support for building without leveldb + mon mkfs bug fix (`pr#11800 `_, Bassam Tabbara) * osd: add a pg _fastinfo attribute to reduce per-io metadata updates (`pr#11213 `_, Sage Weil) * osd: Add config option to disable new scrubs during recovery (`issue#17866 `_, `pr#11874 `_, Wido den Hollander) * osd: a few fast dispatch optimizations (`pr#12052 `_, Sage Weil) * osd: cleanup C_CompleteSplits::finish() (`pr#12094 `_, Jie Wang) * osd: clean up PeeringWQ::_dequeue(), remove unnecessary variable (`pr#12117 `_, Jie Wang) * osd: clean up process_peering_events (`pr#12009 `_, Jie Wang) * osdc/Objecter: resend pg commands on interval change (`issue#18358 `_, `pr#12910 `_, Samuel Just) * osd: condition OSDMap encoding on features (`pr#12166 `_, Sage Weil) * osd: default osd_scrub_during_recovery=false (`pr#12402 `_, Sage Weil) * osd: do not open pgs when the pg is not in pg_map (`issue#17806 `_, `pr#11803 `_, Xinze Chi) * osd: drop stray debug message (`pr#11296 `_, Sage Weil) * osd: EC Overwrites (`issue#17668 `_, `pr#11701 `_, Tomy Cheru, Samuel Just) * osd: enhance logging for osd network error (`pr#12458 `_, liuchang0812) * osd: fix CEPH_OSD_FLAG_RWORDERED (`pr#12603 `_, Sage Weil) * osd: fix duplicated id of incompat feature "fastinfo" (`pr#11588 `_, xie xingguo) * osd: fix ec scrub errors (`issue#17999 `_, `pr#12306 `_, Samuel Just) * osd: fixes to make rbd on ec work (`pr#12305 `_, Samuel Just) * osd: Fix map gaps again (bug 15943) (`issue#15943 `_, `pr#12571 `_, Samuel Just) * osd: fix memory leak from EC write workload (`issue#18093 `_, `pr#12256 `_, Sage Weil) * osd: fix rados write op hang (`pr#11143 `_, Yunchuan Wen) * osd: Fix read error propogation in ECBackend (`issue#17966 `_, `pr#12142 `_, Samuel Just) * osd: fix scrub boundary to not include a SnapSet (`pr#11255 `_, Samuel Just) * osd: fix signed/unsigned comparison warning (`pr#12400 `_, Greg Farnum) * osd: fix typo in PG::clear_primary_state (`pr#11513 `_, Brad Hubbard) * osd: Fix typos in PG::find_best_info (`pr#11515 `_, Brad Hubbard) * osd: fix typos in "struct OSDOp" comments (`pr#12350 `_, Chanyoung Park) * osd: Flush Journal on shutdown (`pr#11249 `_, Wido den Hollander) * osd: force watch PING to be write ordered (`issue#18310 `_, `pr#12590 `_, Samuel Just) * osd: handle EC recovery read errors (`issue#13937 `_, `pr#9304 `_, David Zafman) * osd: heartbeat peers need to be updated when a new OSD added into an existed cluster (`issue#18004 `_, `pr#12069 `_, Pan Liu) * osd: Increase priority for inactive PGs backfill (`pr#12389 `_, Bartłomiej Święcki) * osd: kill PG_STATE_SPLITTING (`pr#11824 `_, xie xingguo) * osd: mark queued flag for op (`pr#12352 `_, Yunchuan Wen) * osd: osdc: pass a string reference type to "osdmap->lookup_pg_pool_name" (`pr#12219 `_, Leo Zhang) * osd: osd/OSDMonitor: accept 'osd pool set ...' value as string (`pr#911 `_, David Zafman) * osd: PGLog: initialize writeout_from in PGLog constructor (`issue#12973 `_, `pr#558 `_, Sage Weil) * osd/PrimaryLogPG: don't update digests for objects with mismatched names (`issue#18409 `_, `pr#12803 `_, Samuel Just) * osd/PrimaryLogPG::failed_push: update missing as well (`issue#18165 `_, `pr#12911 `_, Samuel Just) * osd: print log when osd want to kill self (`pr#9288 `_, Haomai Wang) * osd: Remove extra call to reg_next_scrub() during splits (`issue#16474 `_, `pr#11206 `_, David Zafman) * osd: remove redudant call of heartbeat_check (`pr#12130 `_, Pan Liu) * osd: remove the lock heartbeat_update_lock, and change heatbeat_need_… (`pr#12461 `_, Pan Liu) * osd: remove the redundant clear method in consume_map function (`pr#10553 `_, song baisen) * osd: Remove unused '_lsb_release_' declarations (`pr#11364 `_, Brad Hubbard) * osd: replace hb_out and hb_in with a single hb_peers (`issue#18057 `_, `pr#12178 `_, Pan Liu) * osd: ReplicatedPG: don't bless C_OSD_SendMessageOnConn (`issue#13304 `_, `pr#669 `_, Jason Dillaman) * osd: set server-side limits on omap get operations (`pr#12059 `_, Sage Weil) * osd: When deep-scrub errors present upgrade regular scrubs (`pr#12268 `_, David Zafman) * performance,bluestore: kv/MemDB: making memdb code adapt to generic maps (`pr#11436 `_, Ramesh Chander) * performance,bluestore: os/bluestore: allow default to buffered write (`pr#11301 `_, Sage Weil) * performance,bluestore: os/bluestore: bluestore_cache_meta_ratio = .5 (`pr#11919 `_, Sage Weil) * performance,bluestore: os/bluestore: reduce Onode in-memory footprint (`pr#12568 `_, Igor Fedotov) * performance,bluestore: os/bluestore: refactor bluestore_sync_submit_transaction (`pr#11537 `_, Sage Weil) * performance,bluestore: os/bluestore: speed up omap-key generation for same onode(the read paths) (`pr#11894 `_, xie xingguo) * performance,bluestore: os/bluestore: speedup the performance of multi-replication flow by switc… (`pr#11844 `_, Pan Liu) * performance,cephfs: Fix long stalls when calling ceph_fsync() (`issue#17563 `_, `pr#11710 `_, Jeff Layton) * performance,cleanup: Context: std::move the callback param in FunctionContext's ctor (`pr#11892 `_, Kefu Chai) * performance,cleanup: osd/PG.h: move shared ptr instead of copying it (`pr#11154 `_, Michal Jarzabek) * performance,common: common/config_opts.h: Optimized RocksDB WAL settings. (`pr#11530 `_, Mark Nelson) * performance,common: osd/OSDMap: improve the performance of pg_to_acting_osds (`pr#12190 `_, Pan Liu) * performance: msg/async: set ms_async_send_inline to false to improve small randread iops (`pr#11521 `_, Mark Nelson) * performance,tools: rados: add hints to rados bench (`pr#12169 `_, Sage Weil) * pybind: avoid "exception 'int' object is not iterable" (`pr#11532 `_, Javeme) * pybind,cephfs: ceph_volume_client: fix recovery from partial auth update (`issue#17216 `_, `pr#11304 `_, Ramana Raja) * pybind,cephfs: ceph_volume_client: set an existing auth ID's default mon caps (`issue#17800 `_, `pr#11917 `_, Ramana Raja) * pybind: ceph-rest-api: understand the new style entity_addr_t representation (`issue#17742 `_, `pr#11686 `_, Kefu Chai) * pybind: clean up mgr stuff for flake8 (`pr#11314 `_, John Spray) * pybind: fix build failure of rgwfile binding (`pr#11825 `_, Kefu Chai) * pybind: pybind/rados: add missing "length" requires for aio_execute() (`pr#12439 `_, Kefu Chai) * pybind: pybind/rados: Add @requires for all aio methods (`pr#12327 `_, Iain Buclaw) * qa: fixed distros links (`pr#12773 `_, Yuri Weinstein) * qa: Fixed link to centos distro (`pr#12768 `_, Yuri Weinstein) * qa/suites: switch from centos 7.2 to 7.x (`pr#12632 `_, Sage Weil) * qa/tasks/peer: update task based on current peering behavior (`issue#18330 `_, `pr#12614 `_, Sage Weil) * qa/tasks/workunit: clear clone dir before retrying checkout (`issue#18336 `_, `pr#12630 `_, Sage Weil) * qa: update Ubuntu image url after ceph.com refactor (`issue#18542 `_, `pr#12960 `_, Jason Dillaman) * qa/workunits/rbd/test_lock_fence.sh fails (`issue#18388 `_, `pr#12752 `_, Nathan Cutler) * rbd: added rbd-nbd fsx test case (`pr#1049 `_, Jason Dillaman) * rbd: add fsx journal replay test case (`pr#821 `_, Jason Dillaman) * rbd: add singleton to assert no rbdmap regression (`issue#14984 `_, `pr#902 `_, Nathan Cutler) * rbd: add some missing workunits (`pr#870 `_, Josh Durgin) * rbd: add support for separate image data pool (`issue#17424 `_, `pr#11355 `_, Jason Dillaman) * rbd: expose rbd unmap options (`issue#17554 `_, `pr#11370 `_, Ilya Dryomov) * rbd: fix json formatting for image and journal status output (`issue#18261 `_, `pr#12525 `_, Mykola Golub) * rbd: fix parsing of group and image specific pools (`pr#11632 `_, Victor Denisov) * rbd: journal: do not prematurely flag object recorder as closed (`issue#17590 `_, `pr#11520 `_, Jason Dillaman) * rbd: krbd: kernel client expects ip[:port], not an entity_addr_t (`pr#11902 `_, Ilya Dryomov) * rbd: --max_part and --nbds_max options for nbd map (`issue#18186 `_, `pr#12379 `_, Pan Liu) * rbd: move nbd test workload to separate client host from OSDs (`pr#1170 `_, Jason Dillaman) * rbd: provision volumes to format as XFS (`issue#6693 `_, `pr#1028 `_, Loic Dachary) * rbd: rbd-mirror: fix sparse read optimization in image sync (`issue#18146 `_, `pr#12368 `_, Mykola Golub) * rbd: rbd-mirror HA: move librbd::image_watcher::Notifier to librbd::object_watcher (`issue#17017 `_, `pr#11290 `_, Mykola Golub) * rbd: rbd-mirror: recovering after split-brain (`issue#16991 `_, `issue#18051 `_, `pr#12212 `_, Mykola Golub) * rbd: rbd-mirror: snap protect of non-layered image results in split-brain (`issue#16962 `_, `pr#11744 `_, Mykola Golub) * rbd: rbd-nbd: disallow mapping images >2TB in size (`issue#17219 `_, `pr#11741 `_, Mykola Golub) * rbd: rbd-nbd: invalid error code for "failed to read nbd request" messages (`issue#18242 `_, `pr#12483 `_, Mykola Golub) * rbd: rbd-nbd: restart parent process logger after forking (`issue#18070 `_, `pr#12222 `_, Jason Dillaman) * rbd: rbd-nbd: support disabling auto-exclusive lock transition logic (`issue#17488 `_, `pr#11438 `_, Mykola Golub) * rbd: rbd-nbd: support partition for rbd-nbd mapped raw block device (`issue#18115 `_, `pr#12259 `_, Pan Liu) * rbd: tests with rbd_skip_partial_discard option enabled (`pr#1077 `_, Mykola Golub) * rbd,tools: rbd : make option --stripe-unit w/ B/K/M work (`pr#12407 `_, Jianpeng Ma) * rbd: updated tests to use new rbd default feature set (`pr#842 `_, Jason Dillaman) * rbd: use snap_remove implementation from internal (`pr#12035 `_, Victor Denisov) * rgw: add default zone name (`issue#7009 `_, `pr#954 `_, Orit Wasserman) * rgw: add documentation for upgrading with rgw_region_root_pool (`pr#12138 `_, Orit Wasserman) * rgw: add option to log custom HTTP headers (rgw_log_http_headers) (`pr#7639 `_, Matt Benjamin) * rgw: add recovery procedure for upgrade to older version of jewel (`issue#17820 `_, `pr#11827 `_, Orit Wasserman) * rgw: add rgw_compression_type=random for teuthology testing (`pr#11901 `_, Casey Bodley) * rgw: add sleep to let the sync agent init (`pr#1136 `_, Orit Wasserman) * rgw: add suport for creating S3 type subuser of admin rest api (`issue#16682 `_, `pr#10325 `_, snakeAngel2015) * rgw: add support for the prefix parameter in account listing of Swift API (`issue#17931 `_, `pr#12047 `_, Radoslaw Zarzynski) * rgw: allow fastcgi idle timeout to be adjusted (`pr#230 `_, Sage Weil) * rgw: also approve, passed teuthology (many false positives in several classes) (`issue#17985 `_, `pr#12224 `_, Yehuda Sadeh, Sage Weil) * rgw: Anonymous users shouldn't be able to access requester pays buckets. (`issue#17175 `_, `pr#11719 `_, Zhang Shaowen) * rgw: aws4: add presigned url bugfix in runtime (`issue#16463 `_, `pr#10160 `_, Javier M. Mellid) * rgw: bucket resharding (`issue#17550 `_, `pr#11230 `_, Yehuda Sadeh) * rgw:bugfix for deleting objects name beginning and ending with underscores of one bucket using POST method of AWS's js sdk. (`issue#17888 `_, `pr#11982 `_, root) * rgw: Class member cookie is not initialized correctly in some coroutine's constructor. (`pr#11673 `_, Zhang Shaowen) * rgw: clean up RGWShardedOmapCRManager on early return (`issue#17571 `_, `pr#11505 `_, Casey Bodley) * rgw: clear data_sync_cr if RGWDataSyncControlCR fails (`issue#17569 `_, `pr#11506 `_, Casey Bodley) * rgw: compilation of the ASIO front-end is enabled by default. (`pr#12073 `_, Radoslaw Zarzynski) * rgw: compression uses optional::emplace instead of in-place factories (`pr#12021 `_, Radoslaw Zarzynski) * rgw: conform to the standard usage of string::find (`pr#10086 `_, Yan Jun) * rgw: data_extra_pool is unique per zone (`issue#17025 `_, `pr#1119 `_, Orit Wasserman) * rgw: delete entries_index in RGWFetchAllMetaCR (`issue#17812 `_, `pr#11816 `_, Casey Bodley) * rgw: do not abort when accept a CORS request with short origin (`pr#12381 `_, LiuYang) * rgw: do not enable both tcp and uds for fastcgi (`issue#5797 `_, `pr#479 `_, Andrew Schoen) * rgw: don't error out on empty owner when setting acls (`issue#6892 `_, `pr#877 `_, Loic Dachary, Nathan Cutler) * rgw: Don't loop forever when reading data from 0 sized segment. (`issue#17692 `_, `pr#11567 `_, Marcus Watts) * rgw: dont set CURLOPT_UPLOAD for GET requests (`issue#17822 `_, `pr#12105 `_, Casey Bodley) * rgw: don't store empty chains in gc (`issue#17897 `_, `pr#11969 `_, Yehuda Sadeh) * rgw: do quota tests on ubuntu (`issue#6382 `_, `pr#635 `_, Sage Weil) * rgw: dump objects in RGWBucket::check_object_index() (`issue#14589 `_, `pr#11324 `_, Yehuda Sadeh) * rgw: dump remaining coroutines when cr deadlock is detected (`pr#11580 `_, Casey Bodley) * rgw: extract host name from host:port string (`issue#17788 `_, `pr#11751 `_, Yehuda Sadeh) * rgw: Fixed problem with PUT with x-amz-copy-source when source object is compressed. (`pr#12253 `_, Adam Kupczyk) * rgw: fixes for virtual hosting of buckets (`issue#17440 `_, `issue#15975 `_, `issue#17136 `_, `pr#11280 `_, Casey Bodley, Robin H. Johnson) * rgw: fix etag in multipart complete (`issue#17794 `_, `issue#6830 `_, `issue#16129 `_, `issue#17872 `_, `pr#1269 `_, Casey Bodley, Orit Wasserman) * rgw: fix for bucket delete racing with mdlog sync (`issue#17698 `_, `pr#11648 `_, Casey Bodley) * rgw: fix for passing temporary in InitBucketSyncStatus (`issue#17661 `_, `pr#11594 `_, Casey Bodley) * rgw: fix for unsafe change of rgw_zonegroup (`issue#17962 `_, `pr#12075 `_, Casey Bodley) * rgw: fix indentation for cache_pools (`issue#8295 `_, `pr#251 `_, Sage Weil) * rgw: fix missing master zone for a single zone zonegroup (`issue#17364 `_, `pr#11965 `_, Orit Wasserman) * rgw: fix osd crashes when execute "radosgw-admin bi list --max-entries=1" command (`issue#17745 `_, `pr#11697 `_, weiqiaomiao) * rgw: fix put_acls for objects starting and ending with underscore (`issue#17625 `_, `pr#11566 `_, Orit Wasserman) * rgw: fix RGWSimpleRadosLockCR set_description() (`pr#11961 `_, Tianshan Qu) * rgw: fix the field 'total_time' of log entry in log show opt (`issue#17598 `_, `pr#11425 `_, weiqiaomiao) * rgw: fix uncompressed object size deduction in RGWRados::copy_obj_data. (`issue#17803 `_, `pr#11794 `_, Radoslaw Zarzynski) * rgw: frontend subsystem rework (`pr#10767 `_, Radoslaw Zarzynski, Casey Bodley, Matt Benjamin) * rgw: ftw (`issue#17888 `_, `pr#12262 `_, Casey Bodley) * rgw: get_system_obj does not use result of get_system_obj_state (`issue#17580 `_, `pr#11444 `_, Casey Bodley) * rgw: get_zonegroup() uses "default" zonegroup if empty (`issue#17372 `_, `pr#11207 `_, Yehuda Sadeh) * rgw: handle empty POST condition (`issue#17635 `_, `pr#11581 `_, Yehuda Sadeh) * rgw: handle Swift auth errors in a way compatible with new Tempests. (`issue#16590 `_, `pr#10021 `_, Radoslaw Zarzynski) * rgw: json encode/decode index_type, allow modification (`issue#17755 `_, `pr#11707 `_, Yehuda Sadeh) * rgw: loses realm/period/zonegroup/zone data: period overwritten if somewhere in the cluster is still running Hammer (`issue#17371 `_, `pr#11426 `_, Orit Wasserman) * rgw: make RGWLocalAuthApplier::is_admin_of() aware about system users. (`issue#18106 `_, `pr#12283 `_, Radoslaw Zarzynski) * rgw: metadata sync info should be shown at master zone of slave zoneg… (`issue#18091 `_, `pr#12187 `_, Jing Wenjun) * rgw: minor cleanup (`pr#10057 `_, Yan Jun) * rgw: move compression config into zone placement (`pr#12113 `_, Casey Bodley) * rgw: move xfs to a seperate directory (`pr#969 `_, Orit Wasserman) * rgw: multipart upload copy (`issue#12790 `_, `pr#11269 `_, Yehuda Sadeh, Javier M. Mellid) * rgw: need to close_section in lc list op (`pr#12232 `_, weiqiaomiao) * rgw: policy acl format should be xml (`pr#946 `_, Orit Wasserman) * rgw: radosgw-admin: more on placement configuration (`issue#18078 `_, `pr#12242 `_, Casey Bodley) * rgw: region conversion respects pre-existing rgw_region_root_pool (`issue#17963 `_, `pr#12076 `_, Casey Bodley) * rgw: remove a redundant judgement when listng objects. (`pr#10849 `_, zhangshaowen) * rgw: remove circular reference in RGWAsyncRadosRequest (`issue#17793 `_, `issue#17792 `_, `pr#11815 `_, Casey Bodley) * rgw: remove suggestion to upgrade libcurl (`pr#11630 `_, Casey Bodley) * rgw: remove unused variable "ostr" in rgw_b64.h and fix the comment (`pr#11329 `_, Weibing Zhang) * rgw: Replacing '+' with "%20" in canonical uri for s3 v4 auth. (`issue#17076 `_, `pr#10919 `_, Pritha Srivastava) * rgw: revert unintentional change to civetweb (`pr#12004 `_, Bassam Tabbara) * rgw: rgw-admin: new commands to control placement (`issue#18078 `_, `pr#12230 `_, Yehuda Sadeh) * rgw: RGWBucketSyncStatusManager uses existing async_rados (`issue#18083 `_, `pr#12229 `_, Casey Bodley) * rgw: rgw_file: apply missed base64 try-catch (`issue#17663 `_, `pr#11671 `_, Matt Benjamin) * rgw: RGWHTTPArgs::get_str() - return argument string that was set. (`pr#10672 `_, Marcus Watts) * rgw: rgw multisite: fix the increamtal bucket sync init (`issue#17624 `_, `pr#11553 `_, Zengran Zhang) * rgw: rgw multisite: use a rados lock to coordinate data log trimming (`pr#10546 `_, Casey Bodley) * rgw: RGW Python bindings - use explicit array (`pr#11831 `_, Daniel Gryniewicz) * rgw: rgw_rados.cc fix shard_num format for snprintf (`pr#11493 `_, Weibing Zhang) * rgw: rgw/rgw_file.cc: Add compat.h to allow CLOCK_MONOTONE (`pr#12309 `_, Willem Jan Withagen) * rgw: RGWSimpleRadosReadCR tolerates empty reads (`issue#17568 `_, `pr#11504 `_, Casey Bodley) * rgw: [RGW] Wip rgw compression (`pr#11494 `_, Alyona Kiseleva, Adam Kupczyk, Casey Bodley) * rgw: set duration for lifecycle lease (`issue#17965 `_, `pr#12231 `_, Yehuda Sadeh) * rgw: should assign 'olh_bl" to state.attrset[RGW_ATTR_OLH_ID_TAG] instead of 'bl' (`pr#10239 `_, weiqiaomiao) * rgw: skip empty http args in method parse() to avoid extra effort (`pr#11989 `_, Guo Zhandong) * rgw: split osd's in 2 nodes (`issue#15612 `_, `pr#1019 `_, Vasu Kulkarni) * rgw: support for x-robots-tag header (`issue#17790 `_, `pr#11753 `_, Yehuda Sadeh) * rgw: sync modules, metadata search (`pr#10731 `_, Yehuda Sadeh) * rgw: Update version of civetweb to 1.8 (`pr#11343 `_, Marcus Watts) * rgw: use civetweb if no frontend was configured (`pr#958 `_, Orit Wasserman) * rgw: use explicit flag to cancel RGWCoroutinesManager::run() (`issue#17465 `_, `pr#12452 `_, Casey Bodley) * rgw: valgrind fixes for kraken (`issue#18414 `_, `issue#18407 `_, `issue#18412 `_, `issue#18300 `_, `pr#12949 `_, Casey Bodley) * rgw: verified that failed check is in osd-scrub-repair.sh (`issue#17850 `_, `pr#11881 `_, Matt Benjamin) * rgw: we don't support btrfs any more (`pr#1132 `_, Orit Wasserman) * rgw: Wip rgwfile pybind (`pr#11624 `_, Haomai Wang) * tests,bluestore: os/bluestore: add UT for an estimation of Onode in-memory size (`pr#12532 `_, Igor Fedotov) * tests,bluestore: os/test/store_test: fix legacy bluestore cache settings application (`pr#11915 `_, Igor Fedotov) * tests: ceph-disk: force debug monc = 0 (`issue#17607 `_, `pr#11534 `_, Loic Dachary) * tests: ceph_objectstore_tool.py: Don't use btrfs on FreeBSD (`pr#10507 `_, Willem Jan Withagen) * tests: ceph_test_objectstore: fix Rename test (`pr#12261 `_, Sage Weil) * tests: check hostname --fqdn sanity before running make check (`issue#18134 `_, `pr#12297 `_, Nathan Cutler) * tests,cleanup,rbd: test/librbd: in test_notify set object-map and fast-diff features by default (`pr#11821 `_, Mykola Golub) * tests,cleanup: test_bloom_filter.cc: Fix a mismatch for the random_seed parameter (`pr#11774 `_, Willem Jan Withagen) * tests,cleanup: test/osd/osd-fast-mark-down.sh: remove unnecessary teardown() calls (`pr#12101 `_, Kefu Chai) * tests,cleanup: test/osd-scrub-repair.sh: use repair() instead of "ceph pg repair" (`pr#12036 `_, Kefu Chai) * tests,cleanup: test/rados: remove unused bufferlist variable (`pr#10221 `_, Yan Jun) * tests,common: test: add perf-reset test in test/perf_counters.cc (`pr#8948 `_, wangsongbo) * tests: disable failing tests (`issue#17561 `_, `issue#17757 `_, `pr#11714 `_, Loic Dachary) * tests: disable the echo when running get_timeout_delays() (`pr#12180 `_, Kefu Chai) * tests: do not use memstore.test_temp_dir in two tests (`issue#17743 `_, `pr#12281 `_, Loic Dachary) * tests: erasure-code: add k=2, m=2 for isa & jerasure (`issue#18188 `_, `pr#12383 `_, Loic Dachary) * tests: facilitate background process debug in ceph-helpers.sh (`issue#17830 `_, `pr#12183 `_, Loic Dachary) * tests: fix ceph-helpers.sh wait_for_clean delays (`issue#17830 `_, `pr#12095 `_, Loic Dachary) * tests: fix osd-scrub-repair.sh (`pr#12072 `_, David Zafman) * tests: Fix racey test by setting noout flag (tracker 17757) (`issue#17757 `_, `pr#11715 `_, David Zafman) * tests: merge ceph-qa-suite * tests: Minor clean-ups (`pr#12048 `_, David Zafman) * tests: minor make check cleanup (`pr#12146 `_, David Zafman) * tests: no python3 tests for ceph-disk (`issue#17923 `_, `pr#12025 `_, Loic Dachary) * tests: osd-crush.sh must retry crush dump (`issue#17919 `_, `pr#12016 `_, Loic Dachary) * tests: osd-scrub-repair.sh abort if add_something fails (`pr#12172 `_, Loic Dachary) * tests: os/memstore: fix a mem leak in MemStore::Collection::create_object() (`pr#12201 `_, Kefu Chai) * tests: os/memstore, os/filestore: fix store_test's to satisfy rm_coll behavi… (`pr#11558 `_, Igor Fedotov) * tests: paxos fixes (`issue#11913 `_, `pr#457 `_, John Spray) * tests: pin flake8 to avoid behavior changes (`issue#17898 `_, `pr#11971 `_, Loic Dachary) * tests: qa: fixed script to schedule rados and other suites with --subset option (`pr#12587 `_, Yuri Weinstein) * tests: qa/tasks/admin_socket: subst in repo name (`pr#12508 `_, Sage Weil) * tests: qa/tasks/ceph_deploy: use dev option instead of dev-commit (`pr#12514 `_, Vasu Kulkarni) * tests: qa/tasks/osd_backfill.py: wait for osd.[12] to start (`issue#18303 `_, `pr#12577 `_, Sage Weil) * tests: qa/workunits/cephtool/test.sh: FreeBSD has no distro. (`pr#11702 `_, Willem Jan Withagen) * tests: qa/workunits: include extension for nose tests (`pr#12572 `_, Sage Weil) * tests: qa/workunits/rados/test_envlibrados_for_rocksdb: force librados-dev install (`pr#11941 `_, Sage Weil) * tests,rbd: qa/workunits/rbd: fix (`issue#18271 `_, `pr#12511 `_, Sage Weil) * tests,rbd: qa/workunits/rbd: removed qemu-iotest case 077 (`issue#10773 `_, `pr#12366 `_, Jason Dillaman) * tests,rbd: qa/workunits/rbd: simplify running nbd test under build env (`pr#11781 `_, Mykola Golub) * tests,rbd: qa/workunits/rbd: use image id when probing for image presence (`issue#18048 `_, `pr#12195 `_, Mykola Golub) * tests,rbd: qa/workunits/rbd: use more recent qemu-iotests that support Xenial (`issue#18149 `_, `pr#12371 `_, Jason Dillaman) * tests,rbd: rbd-mirror: fix gmock warnings in bootstrap request unit tests (`issue#18156 `_, `pr#12344 `_, Mykola Golub) * tests,rbd: rbd-mirror: improve resiliency of stress test case (`issue#17416 `_, `pr#11326 `_, Jason Dillaman) * tests,rbd: test: new librbd discard after write test case (`pr#11645 `_, Jason Dillaman) * tests,rbd: test: skip TestLibRBD.DiscardAfterWrite if skip partial discard enabled (`issue#17750 `_, `pr#11703 `_, Jason Dillaman) * tests,rbd: test: TestJournalReplay test cases need to wait for event commit (`issue#17566 `_, `pr#11480 `_, Jason Dillaman) * tests: remove TestPGLog EXPECT_DEATH tests (`issue#18030 `_, `pr#12361 `_, Loic Dachary) * tests: save 9 characters for asok paths (`issue#16014 `_, `pr#12066 `_, Loic Dachary) * tests: sync ceph-erasure-code-corpus for using 'arch' not 'uname -p' (`pr#12024 `_, Kefu Chai) * tests: test/ceph_crypto: do not read ceph.conf in global_init() (`issue#18128 `_, `pr#12318 `_, Kefu Chai) * tests: test: ceph-objectstore-tool: should import platform before using it (`pr#12038 `_, Kefu Chai) * tests: test/ceph_test_msgr: do not use Message::middle for holding transient… (`issue#17728 `_, `pr#11680 `_, Kefu Chai) * tests: test: disable osd-scrub-repair and test-erasure-eio (`issue#17830 `_, `pr#12058 `_, Loic Dachary, Dan Mick) * tests: test: disable osd-scrub-repair and test-erasure-eio (`pr#11979 `_, Dan Mick) * tests: test: Don't write to a poolid that this test might not have created (`pr#12378 `_, David Zafman) * tests: test: enable unittest_dns_resolve (`pr#12209 `_, Kefu Chai) * tests: test/encoding/readable.sh: fix shell script warning (`pr#11527 `_, Willem Jan Withagen) * tests: TestErasureCodePluginJerasure must stop the log thread (`issue#17561 `_, `pr#11721 `_, Loic Dachary) * tests: test: fix test-erasure-eio and osd-scrub-repair races (17830) (`pr#11926 `_, David Zafman) * tests: test/osd-fast-mark-down.sh: wrong assumption on first subtest (`pr#12123 `_, Piotr Dałek) * tests: test/osd/osd-fast-mark-down.sh: introduce large timeout (`issue#17918 `_, `pr#12019 `_, Piotr Dałek) * tests: test/osd-scrub-repair.sh: Use test case specific object names to help… (`pr#11449 `_, David Zafman) * tests: test/store_test: fix errors on the whole test suite run caused by the… (`pr#11427 `_, Igor Fedotov) * tests: test_subman.sh: Don't use --tmpdir (`pr#11384 `_, Willem Jan Withagen) * tests: test: test-erasure-eio.sh fix recovery testing and enable it (`pr#12170 `_, David Zafman) * tests: The default changed to disallow pool delete as of #11665; the tests assume it's allowed. (`pr#11897 `_, Sage Weil) * tests: Turn off tests again due to Jenkins failures (`pr#12217 `_, David Zafman) * tests: unittest_throttle avoid ASSERT_DEATH (`issue#18036 `_, `pr#12393 `_, Loic Dachary) * tests: update rbd/singleton/all/formatted-output.yaml to support ceph-ci (`issue#18440 `_, `pr#12823 `_, Nathan Cutler) * tests: use shorter directories for tests (`issue#16014 `_, `pr#12046 `_, Loic Dachary) * tests: vstart.sh: fix bashism in the script (`pr#11889 `_, Mykola Golub) * tests: workunits/ceph-helpers.sh: FreeBSD returns a different errorstring. (`pr#12005 `_, Willem Jan Withagen) * tools: Adding ceph-lazy tool (`pr#11055 `_, gcharot) * tools: ceph-create-keys should not try forever to do things (`issue#17753 `_, `issue#12649 `_, `issue#16255 `_, `pr#11749 `_, Alfredo Deza) * tools: ceph_detect_init: add support for Alpine (`pr#8316 `_, John Coyle) * tools: ceph-disk: fix flake8 errors (`issue#17898 `_, `pr#11973 `_, Ken Dreyer) * tools: ceph-disk: prevent unnecessary tracebacks from subprocess.check_call (`issue#16125 `_, `pr#12414 `_, Alfredo Deza) * tools: ceph-post-file: single command to upload a file to cephdrop (`pr#505 `_, Dan Mick, Travis Rhoden) * tools: cleanup phase of cephfs-data-scan (`pr#12337 `_, Vishal Kanaujia) * tools: osdmaptool: additional tests (`pr#1196 `_, Sage Weil) * tools: osdmaptool: fix divide by zero error (`pr#12561 `_, Yunchuan Wen) * tools: rados: fix segfaults when run without --pool (`issue#17684 `_, `pr#11633 `_, David Disseldorp) * tools: rados: optionally support reading omap key from file (`issue#18123 `_, `pr#12286 `_, Jason Dillaman) * tools: script/run-coverity: update (`pr#12162 `_, Sage Weil) * tools: script/sepia_bt.sh: a script to prepare for debugging on teuthology@sepia (`pr#12012 `_, Kefu Chai) * tools: src/vstart.sh: Only execute btrfs if it is available (`pr#11683 `_, Willem Jan Withagen) * tools: tools/ceph-monstore-update-crush.sh: FreeBSD getopt is not compatible… (`pr#11525 `_, Willem Jan Withagen) v11.0.2 Kraken ============== This development checkpoint release includes a lot of changes and improvements to Kraken. This is the first release introducing ceph-mgr, a new daemon which provides additional monitoring & interfaces to external monitoring/management systems. There are also many improvements to bluestore, RGW introduces sync modules, copy part for multipart uploads and metadata search via elastic search as a tech preview. Notable Changes --------------- * bluestore: os/bluestore: misc fixes (`pr#10953 `_, Sage Weil) * bluestore: os/bluestore/BlueFS: do not op_file_update deleted files (`pr#10686 `_, Sage Weil) * bluestore: bluestore/BitAllocator: Fix deadlock with musl libc (`pr#10634 `_, John Coyle) * bluestore: bluestore/BlueFS: revert direct IO for WRITER_WAL (`pr#11059 `_, Mark Nelson) * bluestore: ceph-disk: support creating block.db and block.wal with customized size for bluestore (`pr#10135 `_, Zhi Zhang) * bluestore: compressor/zlib: switch to raw deflate (`pr#11122 `_, Piotr Dałek) * bluestore: do not use freelist to track bluefs_extents (`pr#10698 `_, Sage Weil) * bluestore: initialize csum_order properly (`pr#10728 `_, xie xingguo) * bluestore: kv/rocksdb: dump transactions on error (`pr#11042 `_, Somnath Roy) * bluestore: kv: In memory keyvalue db implementation (`pr#9933 `_, Ramesh Chander) * bluestore: os/bluestore/BitAllocator: batch is_allocated bit checks (`pr#10704 `_, Ramesh Chander) * bluestore: os/bluestore/BlueFS: For logs of rocksdb & bluefs only use directio. (`pr#11012 `_, Jianpeng Ma) * bluestore: os/bluestore/BlueFS: async compaction (`pr#10717 `_, Varada Kari, Sage Weil) * bluestore: os/bluestore/BlueFS: do not hold internal lock while waiting for IO (`pr#9898 `_, Varada Kari, Sage Weil) * bluestore: os/bluestore/BlueFS: do not start racing async compaction (`pr#11010 `_, Sage Weil) * bluestore: os/bluestore/BlueFS: don't inc l_bluefs_files_written_wal if overwrite. (`pr#10143 `_, Jianpeng Ma) * bluestore: os/bluestore/BlueFS: factor unflushed log into runway calculation (`pr#10966 `_, Sage Weil) * bluestore: os/bluestore/BlueFS: fix async compaction logging bug (`pr#10964 `_, Sage Weil) * bluestore: os/bluestore/BlueFS: log dirty files at sync time (`pr#11108 `_, Sage Weil) * bluestore: os/bluestore/BlueFS: only extend extent on same bdev (`pr#11023 `_, Sage Weil) * bluestore: os/bluestore/BlueFS: prevent concurrent async compaction (`pr#11095 `_, Sage Weil) * bluestore: os/bluestore/BlueFS: release completed aios (`pr#11268 `_, Sage Weil) * bluestore: os/bluestore/BlueFS: use StupidAllocator; fix async compaction bug (`pr#11087 `_, Sage Weil) * bluestore: os/bluestore/bluefs: add file refs check (`pr#10863 `_, xie xingguo) * bluestore: os/bluestore/bluefs: use map to track dirty files (`pr#10923 `_, xie xingguo) * bluestore: os/bluestore/bluefs_types: fix extent operator<< (`pr#10685 `_, Sage Weil) * bluestore: os/bluestore/bluestore_types: uint64_t for ref_map (`pr#11267 `_, Sage Weil) * bluestore: os/bluestore: Hint based allocation in bitmap Allocator (`pr#10978 `_, Ramesh Chander) * bluestore: os/bluestore: Remove bit alloc Woverloaded-virtual warnings (`pr#10082 `_, Ramesh Chander) * bluestore: os/bluestore: a few cleanups (`pr#11192 `_, xie xingguo) * bluestore: os/bluestore: a few fixes about the global csum setting (`pr#11195 `_, xie xingguo) * bluestore: os/bluestore: add assert to compress_extent_map (`pr#11240 `_, Sage Weil) * bluestore: os/bluestore: add cache-related stats (`pr#10961 `_, xie xingguo) * bluestore: os/bluestore: add checks and kill unreachable code (`pr#11077 `_, xie xingguo) * bluestore: os/bluestore: add error injection (`pr#11151 `_, Sage Weil) * bluestore: os/bluestore: add max blob size; fix compressed min blob size logic (`pr#11239 `_, Sage Weil) * bluestore: os/bluestore: add multiple finishers to bluestore (`pr#10780 `_, Ilsoo Byun) * bluestore: os/bluestore: add perf counters for compression effectiveness and space utilization measurements (`pr#10449 `_, Igor Fedotov) * bluestore: os/bluestore: apply "small encoding" for onode_t::extents map (`pr#10018 `_, Igor Fedotov) * bluestore: os/bluestore: avoid blob_t reencode when unchanged (`pr#10768 `_, Sage Weil) * bluestore: os/bluestore: binary search specified shard (`pr#11245 `_, xie xingguo) * bluestore: os/bluestore: change algorithm of compression header from string to int (`pr#10137 `_, xie xingguo) * bluestore: os/bluestore: compaction fixes (`pr#11279 `_, Sage Weil) * bluestore: os/bluestore: drop redundant call of get_blob (`pr#11275 `_, xie xingguo) * bluestore: os/bluestore: drop unreferenced spanning blobs (`pr#11212 `_, Sage Weil) * bluestore: os/bluestore: fix a few leaks (`pr#11068 `_, Sage Weil) * bluestore: os/bluestore: fix a few memory utilization leaks and wasters (`pr#11011 `_, Sage Weil) * bluestore: os/bluestore: fix crash in decode_some() (`pr#11312 `_, Sage Weil) * bluestore: os/bluestore: fix decoding hash of bnode (`pr#10773 `_, xie xingguo) * bluestore: os/bluestore: fix fsck() won't catch stray shard sometimes (`pr#11219 `_, xie xingguo) * bluestore: os/bluestore: fix gc when blob extends past eof (`pr#11282 `_, Sage Weil) * bluestore: os/bluestore: fix improper local var variable in collection_list meth… (`pr#10680 `_, Igor Fedotov) * bluestore: os/bluestore: fix incorrect pool decoding of bnode (`pr#10117 `_, xie xingguo) * bluestore: os/bluestore: fix leak of result-checking of _fsck_check_extents (`pr#11040 `_, xie xingguo) * bluestore: os/bluestore: fix leaks in our use of rocksdb (`pr#11250 `_, Sage Weil) * bluestore: os/bluestore: fix memory leak during bit_alloc testing (`pr#9935 `_, xie xingguo) * bluestore: os/bluestore: fix offset bug in _do_write_small. (`pr#11030 `_, amoxic) * bluestore: os/bluestore: fix onode cache addition race (`pr#11300 `_, Sage Weil) * bluestore: os/bluestore: fix potential access violation (`pr#10362 `_, xie xingguo) * bluestore: os/bluestore: fix potential access violation during rename (`pr#11033 `_, xie xingguo) * bluestore: os/bluestore: fix shard_info::dump() (`pr#11061 `_, xie xingguo) * bluestore: os/bluestore: fix spanning blob leak from ~ExtentMap (`pr#11223 `_, Somnath Roy) * bluestore: os/bluestore: fix statfs tests (`pr#10910 `_, Sage Weil) * bluestore: os/bluestore: fix when block device is not a multiple of the block size (`pr#10844 `_, Sage Weil) * bluestore: os/bluestore: fix write_big counter and some more cleanups (`pr#11344 `_, xie xingguo) * bluestore: os/bluestore: fix/improve csum error message (`pr#10938 `_, Sage Weil) * bluestore: os/bluestore: garbage collect partially overlapped blobs (`pr#11232 `_, Roushan Ali) * bluestore: os/bluestore: get rid off "isa-l" type in ZLibCompressor ctor (`pr#10931 `_, xie xingguo) * bluestore: os/bluestore: gifting bluefs more carefully (`pr#10950 `_, xie xingguo) * bluestore: os/bluestore: honour allow-eio flag; use global compressor if possible (`pr#10970 `_, xie xingguo) * bluestore: os/bluestore: improve required compression threshold (`pr#10080 `_, xie xingguo) * bluestore: os/bluestore: include bluefs space in statfs result (`pr#10795 `_, Sage Weil) * bluestore: os/bluestore: introduce power 2 macros for block alignment and rounding (`pr#10128 `_, xie xingguo) * bluestore: os/bluestore: make assert conditional with macro for allocator (`pr#11014 `_, Ramesh Chander) * bluestore: os/bluestore: make cache settings process-wide (`pr#11295 `_, Sage Weil) * bluestore: os/bluestore: make clone_range copy-on-write (`pr#11106 `_, Sage Weil) * bluestore: os/bluestore: make onode keys more efficient (and sort correctly) (`pr#11009 `_, xie xingguo, Sage Weil) * bluestore: os/bluestore: make trim() of 2Q cache more fine-grained (`pr#9946 `_, xie xingguo) * bluestore: os/bluestore: make zone/span size of bitmap-allocator configurable (`pr#10040 `_, xie xingguo) * bluestore: os/bluestore: misc cleanup and test fixes (`pr#11346 `_, Igor Fedotov) * bluestore: os/bluestore: misc cleanups (`pr#10201 `_, xie xingguo) * bluestore: os/bluestore: misc cleanups (`pr#11197 `_, Haomai Wang) * bluestore: os/bluestore: misc fixes (`pr#9999 `_, xie xingguo) * bluestore: os/bluestore: misc fixes (`pr#10771 `_, xie xingguo) * bluestore: os/bluestore: misc. fixes (`pr#11129 `_, xie xingguo) * bluestore: os/bluestore: more cleanups (`pr#11235 `_, xie xingguo) * bluestore: os/bluestore: more cleanups and fixes (`pr#11210 `_, xie xingguo) * bluestore: os/bluestore: narrow condition of sanity check when get_object_key() (`pr#11149 `_, xie xingguo) * bluestore: os/bluestore: narrow lock scope for cache trim() (`pr#10410 `_, xie xingguo) * bluestore: os/bluestore: optimize intrusive sets for size. (`pr#11319 `_, Mark Nelson) * bluestore: os/bluestore: pack a few more in-memory types (`pr#11328 `_, Sage Weil) * bluestore: os/bluestore: precondition rocksdb/bluefs during mkfs (`pr#10814 `_, Sage Weil) * bluestore: os/bluestore: prevent extent merging across shard boundaries (`pr#11216 `_, Sage Weil) * bluestore: os/bluestore: print bluefs_extents in hex (`pr#10689 `_, Sage Weil) * bluestore: os/bluestore: proper handling for csum enable/disable settings (`pr#10431 `_, Igor Fedotov) * bluestore: os/bluestore: refactor dirty blob tracking along with some related fixes (`pr#10215 `_, Igor Fedotov) * bluestore: os/bluestore: remove cmake warning from extent alloc functions (`issue#16766 `_, `pr#10492 `_, Ramesh Chander) * bluestore: os/bluestore: remove deferred_csum machinery (`pr#11243 `_, Sage Weil) * bluestore: os/bluestore: remove some copy-pastes (`pr#11017 `_, Igor Fedotov) * bluestore: os/bluestore: replace store with logger in Cache (`pr#10969 `_, xie xingguo) * bluestore: os/bluestore: shard extent map (`pr#10963 `_, Sage Weil) * bluestore: os/bluestore: simplify LRUCache::trim() (`pr#10109 `_, xie xingguo) * bluestore: os/bluestore: simplify calculation of collection key range (`pr#11166 `_, xie xingguo) * bluestore: os/bluestore: sloppy reshard boundaries to avoid spanning blobs (`pr#11263 `_, Sage Weil) * bluestore: os/bluestore: still more cleanups (`pr#11274 `_, xie xingguo) * bluestore: os/bluestore: switch spanning_blob_map to std::map (`pr#11336 `_, Sage Weil) * bluestore: os/bluestore: trim cache on reads (`pr#10095 `_, Sage Weil) * bluestore: os/bluestore: try to split blobs instead of spanning them (`pr#11264 `_, Sage Weil) * bluestore: os/bluestore: upgrade compression settings to atomics (`pr#11244 `_, xie xingguo) * bluestore: os/bluestore: use small encoding for bluefs extent and fnode (`pr#10375 `_, xie xingguo) * bluestore: os/bluestore: yet another statfs test fix (`pr#10926 `_, Igor Fedotov) * bluestore: os/bluestore:Fix size calculation in bitallocator (`pr#10377 `_, Ramesh Chander) * bluestore: os/bluestore: fix error handling of posix_fallocate() (`pr#10277 `_, xie xingguo) * bluestore: os/bluestore: use BE for gifting and reclaiming from bluefs (`pr#10294 `_, xie xingguo) * bluestore: os/bluestore: get rid off blob's ref_map for non-shared objects (`pr#9988 `_, Igor Fedotov) * bluestore: kv/MemDB: fix wrong output target and add sanity checks (`pr#10358 `_, xie xingguo) * bluestore: os/bluestore: add a boundary check of cache read (`pr#10349 `_, xie xingguo) * bluestore: os/bluestore: fix bitmap allocating failure if max_alloc_size is 0 (`pr#10379 `_, xie xingguo) * bluestore: os/bluestore: misc fixes (`pr#10327 `_, xie xingguo) * bluestore: kv/MemDB: misc fixes and cleanups (`pr#10295 `_, xie xingguo) * bluestore: rocksdb: pull up to master (4.12 + a few patches) (`pr#11069 `_, Sage Weil) * bluestore: test/store_test: extend Bluestore compression test to verify compress… (`pr#11080 `_, Igor Fedotov) * bluestore: test/store_test: fix statfs results check to consider SSD min_alloc_size (`pr#11096 `_, Igor Fedotov) * bluestore: unittest_bluestore_types: a few more types for sizeof (`pr#11323 `_, Sage Weil) * bluestore: ceph_test_objectstore: test clone_range and fix a few bugs (`pr#11103 `_, Sage Weil) * bluestore: kv: fix some bugs in memdb (`pr#10550 `_, Haodong Tang) * bluestore: os/bluestore/BlueFS: disable buffered io (`pr#10766 `_, Sage Weil) * build/ops,bluestore: test/objectstore/CMakeLists.txt: fix libaio conditional (`pr#11008 `_, Sage Weil) * build/ops,cephfs: client: added def for ACCESSPERMS when undefined (`pr#9835 `_, John Coyle) * build/ops,cephfs: deb: merge ceph-fs-common into ceph-common (`issue#16808 `_, `pr#10433 `_, Nathan Cutler) * build/ops,cephfs: man/Makefile-client.am: drop legacy cephfs tool (`pr#10444 `_, Nathan Cutler) * build/ops,cephfs: test: break out librados-using cephfs test (`issue#16556 `_, `pr#10452 `_, John Spray) * build/ops,common: common/dns_resolve: use ns_name_uncompress instead of ns_name_ntop (`pr#9755 `_, John Coyle) * build/ops,common: msg/async/net_handler.cc: make it more compatible with BSDs (`pr#10029 `_, Willem Jan Withagen) * build/ops,pybind: Include Python 3 bindings into the cmake build and make packages for them (`pr#10208 `_, Oleh Prypin) * build/ops,rbd: systemd: add install section to rbdmap.service file (`pr#10942 `_, Jelle vd Kooij) * build/ops,rbd: test: fix rbd-mirror workunit test cases for cmake (`pr#10076 `_, Jason Dillaman) * build/ops,rgw: rgw-ldap: add ldap lib to rgw lib deps based on build config (`pr#9852 `_, John Coyle) * build/ops: .gitignore: Add .pyc files globally (`pr#11076 `_, Brad Hubbard) * build/ops: Allow compressor build without YASM (`pr#10937 `_, Daniel Gryniewicz) * build/ops: CMake - stop pip checking for updates (`pr#10161 `_, Daniel Gryniewicz) * build/ops: CMakeList.txt: link ceph_objectstore_tool against fuse only if WITH_FUSE (`pr#10149 `_, Willem Jan Withagen) * build/ops: Cmake: fix using CMAKE_DL_LIBS instead of dl (`pr#10317 `_, Willem Jan Withagen) * build/ops: CmakeLists.txt: use LIB_RESOLV instead of resolv. (`pr#10972 `_, Willem Jan Withagen) * build/ops: Enable builds without ceph-test subpackage (`issue#16776 `_, `pr#10872 `_, Ricardo Dias) * build/ops: Fix libatomic_ops-devel in SUSE and specfile cleanup (`issue#16645 `_, `pr#10363 `_, Nathan Cutler) * build/ops: FreeBSD: Define CLOCK_REALTIME_COARSE in compat.h (`pr#10506 `_, Willem Jan Withagen) * build/ops: Gentoo support for ceph-disk / ceph-detect-init; pip speedup (`pr#8317 `_, Robin H. Johnson) * build/ops: LTTng-UST disabled for openSUSE (`issue#16937 `_, `pr#10592 `_, Michel Normand) * build/ops: Port ceph-brag to Python 3 (+ small fixes) (`pr#10064 `_, Oleh Prypin) * build/ops: Removes remaining reference to WITH_MDS (`pr#10286 `_, J. Eric Ivancich) * build/ops: Stop hiding errors from run-tox.sh (`issue#17267 `_, `pr#11071 `_, Dan Mick) * build/ops: Wip kill warnings (`pr#10881 `_, Kefu Chai) * build/ops: autogen: Fix rocksdb error when make dist (`pr#10988 `_, tianqing) * build/ops: autotools: remove a few other remaining traces (`pr#11019 `_, Sage Weil) * build/ops: build scripts: Enable dnf for Fedora >= 22 (`pr#11105 `_, Brad Hubbard) * build/ops: build: drop dryrun of autogen.sh from run-cmake-check.sh script (`pr#11013 `_, xie xingguo) * build/ops: ceph-disk tests: Let missing python interpreters be non-fatal (`pr#11072 `_, Dan Mick) * build/ops: ceph-disk: Compatibility fixes for Python 3 (`pr#9936 `_, Anirudha Bose) * build/ops: ceph-disk: do not activate device that is not ready (`issue#15990 `_, `pr#9943 `_, Boris Ranto) * build/ops: ceph-osd-prestart.sh: check existence of OSD data directory (`issue#17091 `_, `pr#10809 `_, Nathan Cutler) * build/ops: ceph-osd-prestart.sh: drop Upstart-specific code (`issue#15984 `_, `pr#9667 `_, Nathan Cutler) * build/ops: ceph-post-file replace DSA with RSA ssh key (`issue#14267 `_, `pr#10800 `_, David Galloway) * build/ops: ceph.spec.in: don't try to package __pycache__ for SUSE (`issue#17106 `_, `pr#10805 `_, Tim Serong) * build/ops: ceph.spec.in: fix rpm package building error (`pr#10115 `_, runsisi) * build/ops: changes for Clang and yasm (`pr#10417 `_, Willem Jan Withagen) * build/ops: cmake changes (`pr#10351 `_, Kefu Chai) * build/ops: cmake changes (`pr#10059 `_, Kefu Chai) * build/ops: cmake changes (`pr#10279 `_, Kefu Chai) * build/ops: cmake changes (`issue#16804 `_, `pr#10391 `_, Kefu Chai) * build/ops: cmake changes (`pr#10361 `_, Kefu Chai) * build/ops: cmake changes (`pr#10112 `_, Kefu Chai) * build/ops: cmake changes (`pr#10489 `_, Kefu Chai) * build/ops: cmake changes (`pr#10283 `_, Kefu Chai) * build/ops: cmake changes (`issue#16504 `_, `pr#9995 `_, Kefu Chai, Sage Weil, Dan Mick) * build/ops: cmake changes (`pr#9975 `_, Kefu Chai) * build/ops: cmake changes related to LTTng-UST (`pr#10917 `_, Kefu Chai) * build/ops: common/compressor: add libcommon as a dependency for zlib and snappy p… (`pr#11083 `_, Igor Fedotov) * build/ops: compat: add abstractions for non portable pthread name funcs (`pr#9763 `_, John Coyle) * build/ops: configure.ac: Use uname instead of arch. (`pr#9766 `_, John Coyle) * build/ops: configure.ac: add _LIBS variables for boost_system and boost_iostreams (`pr#9848 `_, John Coyle) * build/ops: configure.ac: fix res_query detection (`pr#9820 `_, John Coyle) * build/ops: debian and cmake cleanups (`pr#10788 `_, Kefu Chai) * build/ops: debian: bump compat to 9 (`issue#16744 `_, `pr#10366 `_, Kefu Chai) * build/ops: debian: python related changes (`pr#10322 `_, Kefu Chai) * build/ops: debian: replace SysV rbdmap with systemd service (`pr#10435 `_, Ken Dreyer) * build/ops: debian: set libexec dir to correct value as autotools did (`pr#10096 `_, Daniel Gryniewicz) * build/ops: do_cmake.sh: set up initial plugin dir (`pr#10067 `_, Sage Weil) * build/ops: fix /etc/os-release parsing in install-deps.sh (`pr#10981 `_, Nathan Cutler) * build/ops: fix the rpm build for centos (`pr#10289 `_, Oleh Prypin, Josh Durgin) * build/ops: force Python 3 packages to build in SUSE (`issue#17106 `_, `pr#10894 `_, Dominique Leuenberger, Nathan Cutler) * build/ops: install-deps.sh based on /etc/os-release (`issue#16522 `_, `pr#10017 `_, Jan Fajerski) * build/ops: install-deps: exit non-zero when we cannot match distro (`pr#10941 `_, Gregory Meno) * build/ops: isa-l: add isa-l library as a submodule (`pr#10066 `_, Alyona Kiseleva) * build/ops: jerasure: include generic objects in neon jerasure lib (like sse3/4) (`pr#10879 `_, Dan Mick) * build/ops: logrotate: Run as root/ceph (`pr#10587 `_, Boris Ranto) * build/ops: lttng: build the tracepoint provider lib from .c files in repo (`pr#11196 `_, Kefu Chai) * build/ops: make-dist: generate ceph.spec (`issue#16501 `_, `pr#9986 `_, Sage Weil) * build/ops: make-dist: set rpm_release correctly for release builds (`pr#11334 `_, Dan Mick) * build/ops: make-srpm.sh: A simple script to make the srpm for ceph. (`pr#11064 `_, Ira Cooper) * build/ops: makefile: change librgw_file_* as check_PROGRAMS (`issue#16646 `_, `pr#10229 `_, Brad Hubbard) * build/ops: remove autotools (`pr#11007 `_, Sage Weil) * build/ops: rpm: Do not start targets on update (`pr#9968 `_, Nathan Cutler, Boris Ranto) * build/ops: rpm: ExclusiveArch for suse_version (`issue#16936 `_, `pr#10594 `_, Michel Normand) * build/ops: rpm: Fix creation of mount.ceph symbolic link for SUSE distros (`pr#10353 `_, Ricardo Dias) * build/ops: rpm: add udev BuildRequires to provide /usr/lib/udev directory (`issue#16949 `_, `pr#10608 `_, Nathan Cutler) * build/ops: rpm: build rpm with cmake (`pr#10016 `_, Kefu Chai) * build/ops: rpm: drop obsolete libs-compat and python-ceph-compat metapackages (`issue#16353 `_, `pr#9757 `_, Nathan Cutler) * build/ops: rpm: fix permissions for /etc/ceph/rbdmap (`issue#17395 `_, `pr#11217 `_, Ken Dreyer) * build/ops: rpm: fix shared library devel package names and dependencies (`issue#16345 `_, `issue#16346 `_, `pr#9744 `_, Nathan Cutler, Ken Dreyer) * build/ops: rpm: move mount.ceph from ceph-base to ceph-common and add symlink in /sbin for SUSE (`issue#16598 `_, `pr#10147 `_, Nathan Cutler) * build/ops: run-cmake-check.sh: Remove redundant calls (`pr#11116 `_, Brad Hubbard) * build/ops: script: improve ceph-release-notes regex (`pr#10729 `_, Nathan Cutler) * build/ops: src/CMakeLists.txt: remove double flag -Wno-invalid-offsetof (`pr#10443 `_, Willem Jan Withagen) * build/ops: src/CMakeLists.txt: remove unneeded libraries from ceph-dencoder target (`pr#10478 `_, Willem Jan Withagen) * build/ops: src/global/pidfile.cc: Assign elements in structures individually (`pr#10516 `_, Willem Jan Withagen) * build/ops: src/kv/CMakeLists.txt: force rocksdb/include to first include directory (`pr#11194 `_, Willem Jan Withagen) * build/ops: test/common/test_util.cc: FreeBSD does not have distro information (`pr#10547 `_, Willem Jan Withagen) * build/ops: test: make check using cmake (`pr#10116 `_, Kefu Chai, Sage Weil) * build/ops: verfied f23 (`pr#10222 `_, Kefu Chai) * build/ops: yasm-wrapper: dont echo the yasm command line (`pr#10819 `_, Casey Bodley) * build/ops: .gitignore: exclude coredumps, logfiles and temporary testresults (`pr#8150 `_, Willem Jan Withagen) * build/ops: this fixes the broken build (`pr#9992 `_, Haomai Wang) * build/ops: mrgw: search for cmake build dir. (`pr#10180 `_, Abhishek Lekshmanan) * build/ops: mrun, mstart.sh, mstop.sh: search for cmake build directory (`pr#10097 `_, Yehuda Sadeh) * build/ops: arm64 fixes(`pr#10438 `_, Dan Mick) * build/ops: Wip kill warnings (`pr#10934 `_, Kefu Chai) * build/ops: systemd: add osd id to service description (`pr#10091 `_, Ruben Kerkhof) * build/ops: fix wrong indent caused compile warning (`pr#10014 `_, Wanlong Gao) * build/ops: ceph-detect-init: fix the py3 test (`pr#10266 `_, Kefu Chai) * build/ops: ceph.spec: fix ceph-mgr version requirement (`pr#11285 `_, Sage Weil) * build/ops: make-dist/ceph.spec.in: Fix srpm build breakage. (`pr#10404 `_, Ira Cooper) * build/ops: master: remove SYSTEMD_RUN from initscript (`issue#16440 `_, `issue#7627 `_, `pr#9871 `_, Vladislav Odintsov) * build/ops: rocksdb: revert the change introduced by dc41731 (`pr#10595 `_, Kefu Chai) * build/ops: do_freebsd*.sh: rename do_freebsd-cmake.sh to do_freebsd.sh (`pr#11088 `_, Kefu Chai) * build/ops: gcc 6.1.1 complains about missing include: . 4.8.3 does not c… (`pr#10747 `_, Daniel Oliveira) * build/ops: selinux: Allow ceph to manage tmp files (`issue#17436 `_, `pr#11259 `_, Boris Ranto) * build/ops: selinux: allow read /proc//cmdline (`issue#16675 `_, `pr#10339 `_, Kefu Chai) * cephfs,common: osdc/Journaler: move C_DelayFlush class to .cc (`pr#10744 `_, Michal Jarzabek) * cephfs,core,rbd: ObjectCacher: fix bh_read_finish offset logic (`issue#16002 `_, `pr#9606 `_, Greg Farnum) * cephfs,core,rbd: osdc/ObjectCacher: move C_ReadFinish, C_RetryRead (`pr#10781 `_, Michal Jarzabek) * cephfs: Add ceph_ll_setlk and ceph_ll_getlk (`pr#9566 `_, Frank S. Filz) * cephfs: CephFS: misc. cleanups and remove legacy cephfs tool (`issue#16195 `_, `issue#16035 `_, `issue#15923 `_, `pr#10243 `_, John Spray) * cephfs: Clean up handling of "/.." in ceph client (`pr#10691 `_, Jeff Layton) * cephfs: Client: fixup param type and return value (`pr#10463 `_, gongchuang) * cephfs: Client: pass "UserPerm" struct everywhere for security checks (`issue#16367 `_, `issue#17368 `_, `pr#11218 `_, Greg Farnum) * cephfs: First pile of statx patches (`pr#10922 `_, Sage Weil, Jeff Layton) * cephfs: Fix attribute handling at lookup time (`issue#16668 `_, `pr#10386 `_, Jeff Layton) * cephfs: Inotable repair during forward scrub (`pr#10281 `_, Vishal Kanaujia) * cephfs: Server: drop locks and auth pins if wait for pending truncate (`pr#9716 `_, xie xingguo) * cephfs: Small interface cleanups for struct ceph_statx (`pr#11093 `_, Jeff Layton) * cephfs: build ceph-fuse on OSX (`pr#9371 `_, Yan, Zheng) * cephfs: ceph-fuse: link to libtcmalloc or jemalloc (`issue#16655 `_, `pr#10258 `_, Yan, Zheng) * cephfs: ceph_volume_client: store authentication metadata (`issue#15406 `_, `issue#15615 `_, `pr#9864 `_, John Spray, Ramana Raja) * cephfs: client/barrier: move C_Block_Sync class to .cc (`pr#11001 `_, Michal Jarzabek) * cephfs: client/filer: cleanup the redundant judgments of _write&&_fallocate (`pr#10062 `_, huanwen ren) * cephfs: client: add missing client_lock for get_root (`pr#10027 `_, Patrick Donnelly) * cephfs: client: discard mds map if it is identical to ours (`pr#9774 `_, xie xingguo) * cephfs: client: fast abort if underlying statsf() call failed; end scope of std::hex properly (`pr#9803 `_, xie xingguo) * cephfs: client: fix access violation (`pr#9793 `_, xie xingguo) * cephfs: client: fix readdir vs fragmentation race (`issue#17286 `_, `pr#11147 `_, Yan, Zheng) * cephfs: client: fix segment fault in Client::_invalidate_kernel_dcache(). (`issue#17253 `_, `pr#11170 `_, Yan, Zheng) * cephfs: client: fix shutdown with open inodes (`issue#16764 `_, `pr#10419 `_, John Spray) * cephfs: client: include COMPLETE and ORDERED states in cache dump (`pr#10485 `_, Greg Farnum) * cephfs: client: kill compiling warning (`pr#9994 `_, xie xingguo) * cephfs: client: misc fixes (`pr#9838 `_, xie xingguo) * cephfs: client: move Inode specific cleanup to destructor (`pr#10168 `_, Patrick Donnelly) * cephfs: client: note order of member init in cons (`pr#10169 `_, Patrick Donnelly) * cephfs: client: properly set inode number of created inode in replay request (`issue#17172 `_, `pr#10957 `_, Yan, Zheng) * cephfs: client: protect InodeRef with client_lock (`issue#17392 `_, `pr#11225 `_, Yan, Zheng) * cephfs: doc/mds: fixup mds doc (`pr#10573 `_, huanwen ren) * cephfs: fuse_ll: fix incorrect error settings of fuse_ll_mkdir() (`pr#9809 `_, xie xingguo) * cephfs: include/ceph_fs.h: guard `#define CEPH_SETATTR_*` with #ifndef (`pr#10265 `_, Kefu Chai) * cephfs: libcephfs: Fix the incorrect integer conversion in libcephfs_jni.cc (`pr#10640 `_, wenjunhuang) * cephfs: libcephfs: add unmount function in cephfs.pyx (`pr#10774 `_, huanwen ren) * cephfs: libcephfs: fix portability-related error settings (`pr#9794 `_, xie xingguo) * cephfs: libcephfs: kill compiling warning (`pr#10622 `_, xie xingguo) * cephfs: mds/CDir: remove the part of judgment for _next_dentry_on_set (`pr#10476 `_, zhang.zezhu) * cephfs: mds/CInode: fix potential fin hanging (`pr#9773 `_, xie xingguo) * cephfs: mds/MDBalancer: cleanup (`pr#10512 `_, huanwen ren) * cephfs: mds/MDCache: kill a comipler warning (`pr#11254 `_, xie xingguo) * cephfs: mds/MDSMap default metadata pool to -1 (was: output None instead of 0 when no fs present.) (`issue#16588 `_, `pr#10202 `_, Xiaoxi Chen) * cephfs: mds/MDSTable: add const to member functions (`pr#10846 `_, Michal Jarzabek) * cephfs: mds/SessionMap.h: change statement to assertion (`pr#11289 `_, Michal Jarzabek) * cephfs: mds/SnapRealm.h: add const to member functions (`pr#10878 `_, Michal Jarzabek) * cephfs: mds/server: clean up handle_client_open() (`pr#11120 `_, huanwen ren) * cephfs: mon/MDSMonitor: move C_Updated class to .cc file (`pr#10668 `_, Michal Jarzabek) * cephfs: osdc/mds: fixup pos parameter in the journaler (`pr#10200 `_, huanwen ren) * cephfs: reduce unnecessary mds log flush (`pr#10393 `_, Yan, Zheng) * cephfs: tools/cephfs: Remove cephfs-data-scan tmap_upgrade (`issue#16144 `_, `pr#10100 `_, Douglas Fuller) * cephfs: ceph_fuse: use sizeof get the buf length (`pr#11176 `_, LeoZhang) * cli: retry when the mon is not configured (`issue#16477 `_, `pr#11089 `_, Loic Dachary) * cmake: Add -pie to CMAKE_EXE_LINKER_FLAGS (`pr#10755 `_, Tim Serong) * cmake: Fix FCGI include directory (`pr#9983 `_, Tim Serong) * cmake: Fix mismatched librgw VERSION / SOVERSION (`pr#10754 `_, Tim Serong) * cmake: FreeBSD specific excludes in CMakeLists.txt (`pr#10973 `_, Willem Jan Withagen) * cmake: FreeBSD specific excludes in CMakeLists.txt files (`pr#10517 `_, Willem Jan Withagen) * cmake: Really add FCGI_INCLUDE_DIR to include_directories for rgw (`pr#10139 `_, Tim Serong) * cmake: Removed README.cmake.md, edited README.md (`pr#10028 `_, Ali Maredia) * cmake: Support tcmalloc_minimal allocator (`pr#11111 `_, Bassam Tabbara) * cmake: add dependency from ceph_smalliobenchrbd to cls libraries (`pr#10870 `_, J. Eric Ivancich) * cmake: add_subdirectory(include) (`pr#10360 `_, Kefu Chai) * cmake: ceph_test_rbd_mirror does not require librados_test_stub (`pr#10164 `_, Jason Dillaman) * cmake: cleanup Findgperftools.cmake (`pr#10670 `_, Kefu Chai) * cmake: correct ceph_test_librbd/ceph_test_rbd_mirror linkage (`issue#16882 `_, `pr#10598 `_, Jason Dillaman) * cmake: disable -fvar-tracking-assignments for ceph_dencoder.cc (`pr#10275 `_, Kefu Chai) * cmake: disable unittest_async_compressor (`pr#10394 `_, Kefu Chai) * cmake: do not link against unused objects or libraries (`pr#10837 `_, Kefu Chai) * cmake: enable ccache for rocksdb too (`pr#11100 `_, Bassam Tabbara) * cmake: exclude non-public symbols in shared libraries (`issue#16556 `_, `pr#10472 `_, Kefu Chai) * cmake: fix incorrect dependencies to librados (`pr#10145 `_, Jason Dillaman) * cmake: fix the FTBFS introduced by dc8b3ba (`pr#10282 `_, Kefu Chai) * cmake: fix the build of unittest_async_compressor (`pr#10400 `_, Kefu Chai) * cmake: fix the tracing header dependencies (`pr#10906 `_, Kefu Chai) * cmake: fix unittest_rbd_mirror failures under non-optimized builds (`pr#9990 `_, Jason Dillaman) * cmake: fix wrong path introduced by bb163e9 (`pr#10643 `_, Kefu Chai) * cmake: fixes (`pr#10092 `_, Daniel Gryniewicz) * cmake: fixes for pypi changes (`pr#10204 `_, Kefu Chai) * cmake: include(SIMDExt) in src/CMakeLists.txt (`pr#11003 `_, Kefu Chai) * cmake: install ceph_test_cls_rgw (`pr#10025 `_, Kefu Chai) * cmake: install ceph_test_rados_striper_api_* (`pr#10541 `_, Kefu Chai) * cmake: install platlib into a subdir of build-base dir (`pr#10666 `_, Kefu Chai) * cmake: make py3 a nice-to-have (`issue#17103 `_, `pr#11015 `_, Kefu Chai) * cmake: pass -DINTEL* to gf-complete cflags (`pr#10956 `_, tone.zhang, Kefu Chai) * cmake: pass cmake's compiler and flags to compile RocksDB into build (`pr#10418 `_, Willem Jan Withagen) * cmake: recompile erasure src for different variants (`pr#10772 `_, Kefu Chai) * cmake: remove WITH_MDS option (`pr#10186 `_, Ali Maredia) * cmake: remove more autotools hacks (`pr#11229 `_, Sage Weil) * cmake: remove unnecessary linked libs from libcephfs (`issue#16556 `_, `pr#10081 `_, Kefu Chai) * cmake: rework NSS and SSL (`pr#9831 `_, Matt Benjamin) * cmake: set ARM_CRC_FLAGS from the CRC test rather than ARM_NEON_FLAGS (`issue#17250 `_, `pr#11028 `_, Dan Mick) * cmake: specify distutils build path explicitly (`pr#10568 `_, Kefu Chai) * cmake: supress more warnings (`pr#10469 `_, Willem Jan Withagen) * cmake: use PERF_LOCAL_FLAGS only if defined (`issue#17104 `_, `pr#10828 `_, Michel Normand) * cmake: use stock Find* modules. (`pr#10178 `_, Kefu Chai) * cmake: work to get inital FreeBSD stuff (`pr#10352 `_, Willem Jan Withagen) * cmake: find GIT_VER variables if there is no .git dir (`pr#11499 `_, Ali Maredia) * common,bluestore: Isa-l extention for zlib compression plugin (`pr#10158 `_, Alyona Kiseleva, Dan Mick) * common,bluestore: compressor/zlib: zlib wrapper fix (`pr#11079 `_, Igor Fedotov) * common: auth/cephx: misc fixes (`pr#9679 `_, xie xingguo) * common: common/PluginRegistry: improve error output for shared library load fa… (`pr#11081 `_, Igor Fedotov) * common: common/Throttle.h: remove unneeded class (`pr#10902 `_, Michal Jarzabek) * common: common/Timer.h: delete copy constr and assign op (`pr#11046 `_, Michal Jarzabek) * common: common/WorkQueue: add std move (`pr#9729 `_, Michal Jarzabek) * common: compressor: zlib compressor plugin cleanup (`pr#9782 `_, Alyona Kiseleva) * common: erasure-code: Runtime detection of SIMD for jerasure and shec (`pr#11086 `_, Bassam Tabbara) * common: global: log which process/command sent a signal (`pr#8964 `_, song baisen) * common: include/assert: clean up ceph assertion macros (`pr#9969 `_, Sage Weil) * common: instantiate strict_si_cast not strict_si_cast (`issue#16398 `_, `pr#9934 `_, Kefu Chai) * common: lockdep: verbose even if no logging is set (`pr#10576 `_, Willem Jan Withagen) * common: messages/MOSDMap: mark as enlighten OSDMap encoder (`pr#10843 `_, Sage Weil) * common: mon/Monitor.cc:replce lock/unlock with Mutex:Lockr (`pr#9792 `_, Michal Jarzabek) * common: msg/AsyncMessenger.cc: remove code duplication (`pr#10030 `_, Michal Jarzabek) * common: msg/async: less verbose debug messages at debug_ms=1 (`pr#11205 `_, Sage Weil) * common: msg/async: remove static member variable (`issue#16686 `_, `pr#10440 `_, Kefu Chai) * common: only call crypto::init once per CephContext (`issue#17205 `_, `pr#10965 `_, Casey Bodley) * common: osdc/ObjectCacher: change iterator to const_iterator and add const to member functions (`pr#9644 `_, Michal Jarzabek) * common: preforker: prevent call to 'write' on an fd that was already closed (`pr#10949 `_, Avner BenHanoch) * common: remove basename() dependency (`pr#9845 `_, John Coyle) * common: src/common/buffer.cc fix judgment for lseek (`pr#10130 `_, zhang.zezhu) * common: unknown hash type of judgment modification (`pr#9510 `_, huanwen ren) * common: Timer.cc: replace long types with auto (`pr#11067 `_, Michal Jarzabek) * common: TrackedOp: move ShardedTrackingData to .cc (`pr#10639 `_, Michal Jarzabek) * common: config_opts: fix comment(radio -> ratio) (`pr#10783 `_, xie xingguo) * common: src/common/dns_resolve.cc: reorder the includes (`pr#10505 `_, Willem Jan Withagen) * common: global/signal_handler: use sig_str instead of sys_siglist (`pr#10633 `_, John Coyle) * core,cephfs: Revert "osd/ReplicatedPG: for sync-read it don't cacl l_osd_op_r_prep… (`issue#16908 `_, `pr#10875 `_, Samuel Just) * core,cephfs: mon/mds: add err info when load_metadata is abnormal (`pr#10176 `_, huanwen ren) * core,common: osd/OSD.cc: remove unneeded returns (`pr#11043 `_, Michal Jarzabek) * core,pybind: python-rados: extends ReadOp/WriteOp API (`pr#9944 `_, Mehdi Abaakouk) * core,pybind: python-rados: implement new aio_stat. (`pr#11006 `_, Iain Buclaw) * core,pybind: qa/workunits/rados/test_python.sh: Allow specifying Python executable (`pr#10782 `_, Oleh Prypin) * core: os/filestore/LFNIndex: remove unused variable 'subdir_path' (`pr#8959 `_, huangjun) * core: Create ceph-mgr (`pr#10328 `_, John Spray, Tim Serong) * core: FileJournal: Remove obsolete _check_disk_write_cache function (`pr#11073 `_, Brad Hubbard) * core: Lua object class support (`pr#7338 `_, Noah Watkins) * core: OSD crash with Hammer to Jewel Upgrade: void FileStore::init_temp_collections() (`issue#16672 `_, `pr#10565 `_, David Zafman) * core: OSD.cc: remove unneeded return (`pr#9701 `_, Michal Jarzabek) * core: OSD: avoid FileStore finisher deadlock in osd_lock when shutdown OSD (`pr#11052 `_, Haomai Wang) * core: ObjectCacher: fix last_write check in bh_write_adjacencies() (`issue#16610 `_, `pr#10304 `_, Yan, Zheng) * core: ReplicatedPG: call op_applied for submit_log_entries based repops (`pr#9489 `_, Samuel Just) * core: Wip 16998 (`issue#16998 `_, `pr#10688 `_, Samuel Just) * core: ceph-create-keys: add missing argument comma (`pr#11123 `_, Patrick Donnelly) * core: ceph-create-keys: fix existing-but-different case (`issue#16255 `_, `pr#10415 `_, John Spray) * core: ceph-disk: partprobe should block udev induced BLKRRPART (`issue#15176 `_, `pr#9330 `_, Marius Vollmer, Loic Dachary) * core: ceph-disk: timeout ceph-disk to avoid blocking forever (`issue#16580 `_, `pr#10262 `_, Loic Dachary) * core: ceph-objectstore-tool: add a way to split filestore directories offline (`issue#17220 `_, `pr#10776 `_, Josh Durgin) * core: ceph.in: python 3 compatibility of the ceph CLI (`pr#9702 `_, Oleh Prypin) * core: ceph_mon: use readdir() as readdir_r() is deprecated (`pr#11047 `_, Kefu Chai) * core: cephx: Fix multiple segfaults due to attempts to encrypt or decrypt (`issue#16266 `_, `pr#9703 `_, Brad Hubbard) * core: https://github.com/ceph/ceph/pull/11052 (`pr#10371 `_, Yan Jun) * core: include write error codes in the pg log (`issue#14468 `_, `pr#10170 `_, Josh Durgin) * core: kv/MemDB: fix assert triggerred by m_total_bytes underflow (`pr#10471 `_, xie xingguo) * core: kv/RocksDB: add perfcounter for submit_transaction_sync operation (`pr#9770 `_, Haodong Tang) * core: logmon: check is_leader() before doing any work on get_trim_to() (`pr#10342 `_, song baisen) * core: memstore: clone zero-fills holes from source range (`pr#11157 `_, Casey Bodley) * core: message: optimization for message priority strategy (`pr#8687 `_, yaoning) * core: messages/MForward: fix encoding features (`issue#17365 `_, `pr#11180 `_, Sage Weil) * core: mgr/MgrClient: fix ms_handle_reset (`pr#11298 `_, Sage Weil) * core: mgr/MgrMap: initialize all fields (`issue#17492 `_, `pr#11308 `_, Sage Weil) * core: mon/ConfigKeyService: pass strings by const ref (`pr#10618 `_, Michal Jarzabek) * core: mon/LogMonitor: move C_Log struct to cc file (`pr#10721 `_, Michal Jarzabek) * core: mon/MonClient.h: pass strings by const reference (`pr#10605 `_, Michal Jarzabek) * core: mon/MonDBStore: fix assert which never fires (`pr#10706 `_, xie xingguo) * core: mon/MonitorDBStore: do not use snapshot iterator; close on close (`pr#10102 `_, Sage Weil) * core: mon/OSDMonitor.cc: remove use of boost assign (`pr#11060 `_, Michal Jarzabek) * core: mon/PGMonitor: batch filter pg states; add sanity check (`pr#9394 `_, xie xingguo) * core: mon/PGMonitor: calc the %USED of pool using used/(used+avail) (`issue#16933 `_, `pr#10584 `_, Kefu Chai) * core: mon/PGMonitor: move C_Stats struct to cc file (`pr#10719 `_, Michal Jarzabek) * core: mon/PaxosService: make the return value type inconsistent (`pr#10231 `_, zhang.zezhu) * core: mon/osdmonitor: fix incorrect output of "osd df" due to osd out (`issue#16706 `_, `pr#10308 `_, xie xingguo) * core: msg/AsyncMessenger: change return type to void (`pr#10230 `_, Michal Jarzabek) * core: msg/Messenger: add const and override to function (`pr#10183 `_, Michal Jarzabek) * core: msg/async/AsyncConnection: replace Mutex with std::mutex for peformance (`issue#16714 `_, `issue#16715 `_, `pr#10340 `_, Haomai Wang) * core: msg/async/Event: ensure not refer to member variable which may destroyed (`issue#16714 `_, `pr#10369 `_, Haomai Wang) * core: msg/async/kqueue: avoid remove nonexist kqueue event (`pr#9869 `_, Haomai Wang) * core: msg/async: Support close idle connection feature (`issue#16366 `_, `pr#9783 `_, Haomai Wang) * core: msg/async: allow other async backend implementations (`pr#10264 `_, Haomai Wang) * core: msg/async: avoid set out of range ms_async_op_threads option (`pr#11200 `_, Haomai Wang) * core: msg/async: connect authorizer fix + recv_buf size (`pr#9784 `_, Ilya Dryomov) * core: msg/async: harden error logic handle (`pr#9781 `_, Haomai Wang) * core: msg/async: remove fd output in log prefix (`pr#11199 `_, Haomai Wang) * core: msg/async: remove file event lock (`issue#16554 `_, `issue#16552 `_, `pr#10090 `_, Haomai Wang) * core: msg/simple/Pipe: eliminating casts for the comparing of len and recv_max_prefetch (`pr#10273 `_, zhang.zezhu) * core: msg/simple: fix wrong condition checking of writing TAG_CLOSE on closing (`pr#10343 `_, xie xingguo) * core: msg/simple: wait dispatch_queue until all pipes closed (`issue#16472 `_, `pr#9930 `_, Haomai Wang) * core: msg: make async backend default (`pr#10746 `_, Haomai Wang) * core: msg: mark daemons down on RST + ECONNREFUSED (`pr#8558 `_, Piotr Dałek) * core: os/FuseStore: fix several FuseStore issues (`pr#10723 `_, Sage Weil) * core: os/MemStore: move BufferlistObject to .cc file (`pr#10833 `_, Michal Jarzabek) * core: os/ObjectStore: fix return code of collection_empty() method (`pr#11050 `_, xie xingguo) * core: os/RocksDBStore: use effective Get API instead of iterator api (`pr#9411 `_, Jianjian Huo, Haomai Wang, Mark Nelson) * core: os/filestore/FDCache: fix bug when filestore_fd_cache_shards = 0 (`pr#11048 `_, jimifm) * core: os/filestore/FileJournal: error out if FileJournal is not a file (`issue#17307 `_, `pr#11146 `_, Kefu Chai) * core: os/filestore: add sanity checks and cleanups for mount() process (`pr#9734 `_, xie xingguo) * core: os/filestore: disable use of splice by default (`pr#11113 `_, Haomai Wang) * core: osd/OSD.cc: remove repeated searching of map (`pr#10986 `_, Michal Jarzabek) * core: osd/OSD.cc: remove unneeded searching of maps (`pr#11039 `_, Michal Jarzabek) * core: osd/OSD.h: add const to member functions (`pr#11114 `_, Michal Jarzabek) * core: osd/OSD.h: move some members under private (`pr#11121 `_, Michal Jarzabek) * core: osd/OSD.h: remove unneeded line (`pr#8980 `_, Michal Jarzabek) * core: osd/OSDMonitor: misc. cleanups (`pr#10739 `_, xie xingguo) * core: osd/OSDMonitor: misc. fixes (`pr#10491 `_, xie xingguo) * core: osd/ReplicatedBackend: add sanity check during build_push_op() (`pr#9491 `_, Yan Jun) * core: osd/ReplicatedPG: for sync-read it don't cacl l_osd_op_r_prepare_lat. (`pr#10365 `_, Jianpeng Ma) * core: osd/ReplicatedPG: remove class redeclaration (`pr#11041 `_, Michal Jarzabek) * core: osd/ReplicatedPG: remove unused param "op" from generate_subop() (`pr#10811 `_, jimifm) * core: osd/Watch: add consts to member functions (`pr#10251 `_, Michal Jarzabek) * core: osd/osd_type: check if pool is gone during check_new_interval() (`pr#10859 `_, xie xingguo) * core: osd/osdmonitor: pool of objects and bytes beyond quota should all be warn (`pr#9085 `_, huanwen ren) * core: osdc/objecter: misc fixes (`pr#10826 `_, xie xingguo) * core: pass string by const ref and add override to virtual function (`pr#9082 `_, Michal Jarzabek) * core: qa/workunits/objectstore/test_fuse.sh: make test_fuse.sh work with filestore (`pr#11057 `_, Sage Weil) * core: rados: add option to include clones when doing flush or evict (`pr#9698 `_, Mingxin Liu) * core: subman: use replace instead of format (`issue#16961 `_, `pr#10620 `_, Loic Dachary) * core: test/common/Throttle.cc: fix race in shutdown (`pr#10094 `_, Samuel Just) * core: test: add the necessary judgment (`pr#9694 `_, huanwen ren) * core: tox.ini: remove extraneous coverage --omit option (`pr#10943 `_, Josh Durgin) * core: udev: always populate /dev/disk/by-parttypeuuid (`issue#16351 `_, `pr#9885 `_, Loic Dachary) * core: os/FuseStore: remove unneeded header file (`pr#10799 `_, Michal Jarzabek) * core: os/MemStore: move OmapIteratorImpl to cc file (`pr#10803 `_, Michal Jarzabek) * core: os/Memstore.h: add override to virtual functions (`pr#10801 `_, Michal Jarzabek) * core: os/Memstore: move PageSetObject class to .cc file (`pr#10817 `_, Michal Jarzabek) * core: os/bluestore: remove unused head file. (`pr#11186 `_, Jianpeng Ma) * core: safe_io: Improve portability by replacing loff_t type usage with off_t. (`pr#9767 `_, John Coyle) * core: src/kv/MemDB.cc: the type of the parameter of push_back() does not match the ops's value_type (`pr#10455 `_, Willem Jan Withagen) * core: msg/simple: apply prefetch policy more precisely (`pr#10344 `_, xie xingguo) * core: CompatSet.h: remove unneeded inline (`pr#10071 `_, Michal Jarzabek) * core: Objclass perm feedback (`pr#10313 `_, Noah Watkins) * core: arch/arm.c: remove unnecessary variable read for simplicity (`pr#10821 `_, Weibing Zhang) * crush: don't normalize input of crush_ln iteratively (`pr#10935 `_, Piotr Dałek) * crush: reset bucket->h.items[i] when removing tree item (`issue#16525 `_, `pr#10093 `_, Kefu Chai) * crush: CrushCompiler.cc:884 (`pr#10952 `_, xu biao) * crush: CrushCompiler: error out as long as parse fails (`issue#17306 `_, `pr#11144 `_, Kefu Chai) * doc: Add documentation about snapshots (`pr#10436 `_, Greg Farnum) * doc: Add two options to radosgw-admin.rst manpage (`issue#17281 `_, `pr#11134 `_, Thomas Serlin) * doc: Changed config parameter "rgw keystone make new tenants" in radosgw multitenancy (`issue#17293 `_, `pr#11127 `_, SirishaGuduru) * doc: Modification for "TEST S3 ACCESS" section in "INSTALL CEPH OBJECT GATEWAY" page (`pr#9089 `_, la-sguduru) * doc: Update developer docs for cmake paths (`pr#11163 `_, John Spray) * doc: add "--orphan-stale-secs" to radosgw-admin(8) (`issue#17280 `_, `pr#11097 `_, Ken Dreyer) * doc: add $pid metavar conf doc (`pr#11172 `_, Patrick Donnelly) * doc: add Backporting section to Essentials chapter (`issue#15497 `_, `pr#10457 `_, Nathan Cutler) * doc: add Prepare tenant section to Testing in the cloud chapter (`pr#10413 `_, Nathan Cutler) * doc: add Upload logs to archive server section... (`pr#10414 `_, Nathan Cutler) * doc: add client config ref (`issue#16743 `_, `pr#10434 `_, Patrick Donnelly) * doc: add graphic for cap bit field (`pr#10897 `_, Patrick Donnelly) * doc: add missing PR to hammer 0.94.8 release notes (`pr#10900 `_, Nathan Cutler) * doc: add openSUSE instructions to quick-start-preflight (`pr#10454 `_, Nathan Cutler) * doc: add rgw_enable_usage_log option in Rados Gateway admin guide (`issue#16604 `_, `pr#10159 `_, Mike Hackett) * doc: add troubleshooting steps for ceph-fuse (`pr#10374 `_, Ken Dreyer) * doc: admin/build-doc: bypass sanity check if building doc (`issue#16940 `_, `pr#10623 `_, Kefu Chai) * doc: ceph-authtool man page option is --print-key not --print (`pr#9731 `_, Brad Hubbard) * doc: ceph-deploy mon add doesn't take multiple nodes (`pr#10085 `_, Chengwei Yang) * doc: clarify rbd size units (`pr#11303 `_, Ilya Dryomov) * doc: cleanup outdated radosgw description (`pr#11248 `_, Jiaying Ren) * doc: describe libvirt client logging (`pr#10542 `_, Ken Dreyer) * doc: do not list all major versions in get-packages.rst (`pr#10899 `_, Nathan Cutler) * doc: doc/cephfs: explain the various health messages (`pr#10244 `_, John Spray) * doc: doc/dev: Fix missing code section due to no lexer for "none" (`pr#9083 `_, Brad Hubbard) * doc: doc/radosgw: fix description of response elements 'Part' (`pr#10641 `_, weiqiaomiao) * doc: doc/radosgw: rename config.rst to config-fcgi.rst (`pr#10381 `_, Nathan Cutler) * doc: extend the CephFS troubleshooting guide (`pr#10458 `_, Greg Farnum) * doc: fix broken link in SHEC erasure code plugin (`issue#16996 `_, `pr#10675 `_, Albert Tu) * doc: fix description for rsize and rasize (`pr#11101 `_, Andreas Gerstmayr) * doc: fix rados/configuration/osd-config-ref.rst (`pr#10619 `_, Chengwei Yang) * doc: fix singleton example in Developer Guide (`pr#10830 `_, Nathan Cutler) * doc: fix some nits in release notes and releases table (`pr#10903 `_, Nathan Cutler) * doc: fix standby replay config (`issue#16664 `_, `pr#10268 `_, Patrick Donnelly) * doc: fix wrong osdkeepalive name in mount.ceph manpage (`pr#10840 `_, Zhi Zhang) * doc: fix/add changelog for 10.2.2, 0.94.7, 0.94.8 (`pr#10895 `_, Sage Weil) * doc: format 2 now is the default image format (`pr#10705 `_, Chengwei Yang) * doc: lgtm (build verified f23) (`pr#9745 `_, weiqiaomiao) * doc: mailmap updates for upcoming 11.0.0 (`pr#9301 `_, Yann Dupont) * doc: manual instructions to set up mds daemon (`pr#11115 `_, Peter Maloney) * doc: missing "make vstart" in quick_guide.rst (`pr#11226 `_, Leo Zhang) * doc: more details for pool deletion (`pr#10190 `_, Ken Dreyer) * doc: peering.rst, fix typo (`pr#10131 `_, Brad Hubbard) * doc: perf_counters.rst fix trivial typo (`pr#10292 `_, Brad Hubbard) * doc: rbdmap: specify bash shell interpreter (`issue#16608 `_, `pr#10733 `_, Jason Dillaman) * doc: release-notes.rst: draft 0.94.8 release notes (`pr#10730 `_, Nathan Cutler) * doc: remove btrfs contradiction (`pr#9758 `_, Nathan Cutler) * doc: remove i386 from minimal hardware recommendations (`pr#10276 `_, Kefu Chai) * doc: remove old references to inktank premium support (`pr#11182 `_, Alfredo Deza) * doc: remove the description of deleted options (`issue#17041 `_, `pr#10741 `_, MinSheng Lin) * doc: rgw, doc: fix formatting around Keystone-related options. (`pr#10331 `_, Radoslaw Zarzynski) * doc: rgw/doc: fix indent (`pr#10676 `_, Yan Jun) * doc: rm SysV instructions, add systemd (`pr#10184 `_, Ken Dreyer) * doc: silence sphinx warnings (`pr#10621 `_, Kefu Chai) * doc: small standby doc edits (`pr#10479 `_, Patrick Donnelly) * doc: update CephFS "early adopters" info (`pr#10068 `_, John Spray) * doc: update canonical tarballs URL (`pr#9695 `_, Ken Dreyer) * doc: update rbd glance configuration notes (`pr#10629 `_, Jason Dillaman) * doc: update s3 static webiste feature support status (`pr#10223 `_, Jiaying Ren) * doc: changelog: add v10.2.3 (`pr#11238 `_, Abhishek Lekshmanan) * doc: install: Use https:// for download.ceph.com (`pr#10709 `_, Colin Walters) * doc: release-notes: v0.94.9 (`pr#10927 `_, Sage Weil) * doc: release-notes: v10.2.3 jewel (`pr#11234 `_, Abhishek Lekshmanan) * doc: Add UK mirror and update copyright (`pr#10531 `_, Patrick McGarry) * doc: README.md: replace package build instructions with tarball instructions (`pr#10829 `_, Sage Weil) * doc: Removed reference about pool ownership based on BZ#1368528 (`pr#11063 `_, Bara Ancincova) * librados: use bufferlist instead of buffer::list in public header (`pr#10632 `_, Ryne Li) * librados: Rados-stripper: Flexible string matching for not found attributes (`pr#10577 `_, Willem Jan Withagen) * librados: librados examples: link and include from current source tree by default. (`issue#15100 `_, `pr#8189 `_, Jesse Williamson) * librbd: API methods to directly acquire and release the exclusive lock (`issue#15632 `_, `pr#9592 `_, Mykola Golub) * librbd: add consistency groups operations with images (`pr#10034 `_, Victor Denisov) * librbd: add explicit shrink check while resizing images (`pr#9878 `_, Vaibhav Bhembre) * librbd: asynchronous v2 image creation (`issue#15321 `_, `pr#9585 `_, Venky Shankar) * librbd: backward/forward compatibility for update_features (`issue#17330 `_, `pr#11155 `_, Jason Dillaman) * librbd: block name prefix might overflow fixed size C-string (`issue#17310 `_, `pr#11148 `_, Jason Dillaman) * librbd: cache was not switching to writeback after first flush (`issue#16654 `_, `pr#10762 `_, Jason Dillaman) * librbd: corrected use-after-free in ImageWatcher (`issue#17289 `_, `pr#11112 `_, Jason Dillaman) * librbd: deadlock when replaying journal during image open (`issue#17188 `_, `pr#10945 `_, Jason Dillaman) * librbd: delay acquiring lock if image watch has failed (`issue#16923 `_, `pr#10574 `_, Jason Dillaman) * librbd: discard hangs when 'rbd_skip_partial_discard' is enabled (`issue#16386 `_, `pr#10060 `_, Mykola Golub) * librbd: extract group module from librbd/internal (`pr#11070 `_, Victor Denisov) * librbd: failed assertion after shrinking a clone image twice (`issue#16561 `_, `pr#10072 `_, Jason Dillaman) * librbd: fix missing return statement if failed to get mirror image state (`pr#10136 `_, runsisi) * librbd: fix possible inconsistent state when disabling mirroring fails (`issue#16984 `_, `pr#10711 `_, Jason Dillaman) * librbd: ignore partial refresh error when acquiring exclusive lock (`issue#17227 `_, `pr#11044 `_, Jason Dillaman) * librbd: initial hooks for client-side, image-extent cache in IO path (`pr#9121 `_, Jason Dillaman) * librbd: interlock image refresh and exclusive lock operations (`issue#16773 `_, `issue#17015 `_, `pr#10770 `_, Jason Dillaman) * librbd: memory leak in MirroringWatcher::notify_image_updated (`pr#11306 `_, Mykola Golub) * librbd: optimize away unnecessary object map updates (`issue#16707 `_, `issue#16689 `_, `pr#10332 `_, Jason Dillaman) * librbd: optionally unregister "laggy" journal clients (`issue#14738 `_, `pr#10378 `_, Mykola Golub) * librbd: permit disabling journaling if in corrupt state (`issue#16740 `_, `pr#10712 `_, Jason Dillaman) * librbd: possible deadlock if cluster connection closed after image (`issue#17254 `_, `pr#11037 `_, Jason Dillaman) * librbd: potential deadlock closing image with in-flight readahead (`issue#17198 `_, `pr#11152 `_, Jason Dillaman) * librbd: potential double-unwatch of watch handle upon error (`issue#17210 `_, `pr#10974 `_, Jason Dillaman) * librbd: potential seg fault when blacklisting an image client (`issue#17251 `_, `pr#11034 `_, Jason Dillaman) * librbd: prevent creation of clone from non-primary mirrored image (`issue#16449 `_, `pr#10123 `_, Mykola Golub) * librbd: prevent creation of v2 image ids that are too large (`issue#16887 `_, `pr#10581 `_, Jason Dillaman) * mds: Add path filtering for dump cache (`issue#11171 `_, `pr#9925 `_, Douglas Fuller) * mds: Kill C_SaferCond in evict_sessions() (`issue#16288 `_, `pr#9971 `_, Douglas Fuller) * mds: Return "committing" rather than "committed" member in get_committing (`pr#10250 `_, Greg Farnum) * mds: Set mds_snap_max_uid to 4294967294 (`pr#11016 `_, Wido den Hollander) * mds: add assertion in handle_slave_rename_prep (`issue#16807 `_, `pr#10429 `_, John Spray) * mds: add assertions for standby_daemons invariant (`issue#16592 `_, `pr#10316 `_, Patrick Donnelly) * mds: add health warning for oversized cache (`issue#16570 `_, `pr#10245 `_, John Spray) * mds: add maximum fragment size constraint (`issue#16164 `_, `pr#9789 `_, Patrick Donnelly) * mds: add perf counters for MDLog replay and SessionMap (`pr#10539 `_, John Spray) * mds: catch duplicates in DamageTable (`issue#17173 `_, `pr#11137 `_, John Spray) * mds: fix Session::check_access() (`issue#16358 `_, `pr#9769 `_, Yan, Zheng) * mds: fix daemon selection when starting ranks (`pr#10540 `_, John Spray) * mds: fix shutting down mds timed-out due to deadlock (`issue#16396 `_, `pr#9884 `_, Zhi Zhang) * mds: fix up _dispatch ref-counting semantics (`pr#10533 `_, Greg Farnum) * mds: fixup dump Formatter' type error; add path_ino and is_primary in the CDentry::dump() (`pr#10119 `_, huanwen ren) * mds: handle blacklisting during journal recovery (`issue#17236 `_, `pr#11138 `_, John Spray) * mds: log path with CDir damage messages (`issue#16973 `_, `pr#10996 `_, John Spray) * mds: move Finisher to unlocked shutdown (`issue#16042 `_, `pr#10142 `_, Patrick Donnelly) * mds: populate DamageTable from scrub and log more quietly (`issue#16016 `_, `pr#11136 `_, John Spray) * mds: remove fail-safe queueing replay request (`issue#17271 `_, `pr#11078 `_, Yan, Zheng) * mds: remove max_mds config option (`issue#17105 `_, `pr#10914 `_, Patrick Donnelly) * mds: remove unused MDSDaemon::objecter (`pr#10566 `_, Patrick Donnelly) * mds: snap failover fixes (`pr#9955 `_, Yan, Zheng) * mds: trim null dentries proactively (`issue#16919 `_, `pr#10606 `_, John Spray) * mds: unuse Class and cleanup (`pr#10399 `_, huanwen ren) * mds: use reference to avoid copy (`pr#10191 `_, Patrick Donnelly) * mds: MDCache.h: remove unneeded access specifier (`pr#10901 `_, Michal Jarzabek) * mds: MDSDaemon: move C_MDS_Tick class to .cc file (`pr#11220 `_, Michal Jarzabek) * mgr: implement con reset handling (`pr#11299 `_, Sage Weil) * mgr: squash compiler warnings (`pr#11307 `_, John Spray) * mon: MonClient may hang on pinging an unresponsive monitor (`pr#9259 `_, xie xingguo) * mon: Monitor: validate prefix on handle_command() (`issue#16297 `_, `pr#9700 `_, You Ji) * mon: OSDMonitor: Missing nearfull flag set (`pr#11082 `_, Igor Podoski) * mon: change osdmap flags set and unset messages (`issue#15983 `_, `pr#9252 `_, Vikhyat Umrao) * mon: clear list in better way (`pr#9718 `_, song baisen) * mon: do not recalculate 'to_remove' when it's known (`pr#9717 `_, song baisen) * mon: misc cleanups (`pr#10591 `_, xie xingguo) * mon: remove the redundant cancel_probe_timeout function (`pr#10261 `_, song baisen) * mon: remove the redundant is_active judge in PaxosService (`pr#9749 `_, song baisen) * mon: tear down standby replays on MDS rank stop (`issue#16909 `_, `pr#10628 `_, John Spray) * mon: use clearer code structure (`pr#10192 `_, Patrick Donnelly) * mon: validate states transmitted in beacons (`issue#16592 `_, `pr#10428 `_, John Spray) * mon: wait 10m (not 5m) before marking down OSDs out (`pr#11184 `_, Sage Weil) * mon: write fsid use the right return value (`pr#10197 `_, song baisen) * mon: Elector:move C_ElectionExpire class to cc file (`pr#10416 `_, Michal Jarzabek) * mon: HealthMonitor: add override to virtual functs (`pr#10549 `_, Michal Jarzabek) * mon: HealthMonitor: remove unneeded include (`pr#10563 `_, Michal Jarzabek) * mon: MonClient.h: delete copy constr and assing op (`pr#10599 `_, Michal Jarzabek) * mon: MonClient: move C_CancelMonCommand to cc file (`pr#10392 `_, Michal Jarzabek) * mon: MonClient: move C_Tick struct to cc file (`pr#10383 `_, Michal Jarzabek) * mon: Monitor.h: add override to virtual functions (`pr#10515 `_, Michal Jarzabek) * mon: Monitor: move C_Scrub, C_ScrubTimeout to .cc (`pr#10513 `_, Michal Jarzabek) * mon: OSDMonitor.cc: remove unneeded casts (`pr#10575 `_, Michal Jarzabek) * mon: Paxos: move classes to .cc file (`pr#11215 `_, Michal Jarzabek) * mon: PaxosService: move classes to cc file (`pr#10529 `_, Michal Jarzabek) * mon: remove the redundant list swap in paxos commit_proposal (`pr#10011 `_, song baisen) * msgr: set close on exec flag (`issue#16390 `_, `pr#9772 `_, Kefu Chai) * msgr: Accepter.h: add override to virtual function (`pr#10422 `_, Michal Jarzabek) * msgr: Accepter: move include to cc file (`pr#10441 `_, Michal Jarzabek) * msgr: AsyncConnection: add const to mem functions (`pr#10302 `_, Michal Jarzabek) * msgr: AsyncMessenger.cc: remove unneeded cast (`pr#10141 `_, Michal Jarzabek) * msgr: AsyncMessenger: add const to function (`pr#10114 `_, Michal Jarzabek) * msgr: AsyncMessenger: move C_handle_reap class to cc (`pr#10113 `_, Michal Jarzabek) * msgr: AsyncMessenger: move C_processor_accept class (`pr#9991 `_, Michal Jarzabek) * msgr: AsyncMessenger: remove unneeded include file (`pr#10195 `_, Michal Jarzabek) * msgr: AsyncMessenger: remove unused function (`pr#10163 `_, Michal Jarzabek) * msgr: EventKqueue.h: add override to virtual func (`pr#10318 `_, Michal Jarzabek) * msgr: EventPoll.h: add override to virtual functions (`pr#10314 `_, Michal Jarzabek) * msgr: EventSelect.h: add override to virtual funct (`pr#10321 `_, Michal Jarzabek) * msgr: EventSelect: move includes to cc file (`pr#10333 `_, Michal Jarzabek) * msgr: FastStrategy.h: add override to virtual funct (`pr#10482 `_, Michal Jarzabek) * msgr: Message.h: add const to member function (`pr#10354 `_, Michal Jarzabek) * msgr: Message.h: remove code duplication (`pr#10356 `_, Michal Jarzabek) * msgr: QueueStrategy: add override to virtual functs (`pr#10503 `_, Michal Jarzabek) * msgr: Stack.h: delete copy constr and assign op (`pr#11107 `_, Michal Jarzabek) * msgr: async/Event.h: add const to member function (`pr#10224 `_, Michal Jarzabek) * msgr: async: remove unused code. (`pr#11247 `_, Jianpeng Ma) * osd: bail out if transaction size overflows (`issue#16982 `_, `pr#10753 `_, Kefu Chai) * osd: cleanup options and other redundancies (`pr#10450 `_, xie xingguo) * osd: drop unused variables/methods (`pr#10559 `_, xie xingguo) * osd: fix the mem leak of RepGather (`issue#16801 `_, `pr#10423 `_, Kefu Chai) * osd: fixups to explicitly persistenting missing sets (`pr#10405 `_, Samuel Just) * osd: increment stats on recovery pull also (`issue#16277 `_, `pr#10152 `_, Kefu Chai) * osd: limit omap data in push op (`issue#16128 `_, `pr#9894 `_, Wanlong Gao) * osd: minor performance improvements (`pr#10470 `_, xie xingguo) * osd: minor performance improvements and fixes (`pr#10526 `_, xie xingguo) * osd: misc fixes and cleanups (`pr#10610 `_, xie xingguo) * osd: miscellaneous fixes (`pr#10572 `_, xie xingguo) * osd: more cleanups (`pr#10548 `_, xie xingguo) * osd: object class loading and execution permissions (`pr#9972 `_, Noah Watkins) * osd: pass shared_ptr by const reference (`pr#11266 `_, Michal Jarzabek) * osd: persist the missing set explicitly (`pr#10334 `_, Samuel Just) * osd: remove dispatch queue check since we don't queue hb message to this (`pr#9947 `_, Haomai Wang) * osd: remove duplicated function (`pr#9117 `_, Wei Jin) * osd: replace ceph:atomic_t with std::atomic in osd module. (`pr#9138 `_, Xiaowei Chen) * osd: should not look up an empty pg (`issue#17380 `_, `pr#11208 `_, Kefu Chai, Loic Dachary) * osd: small cleanups (`pr#9980 `_, Wanlong Gao) * osd: subscribe for old osdmaps when pause flag is set (`issue#17023 `_, `pr#10725 `_, Kefu Chai) * osd:preserve allocation hint attribute during recovery (`pr#9452 `_, yaoning) * osd: osd-fast-mark-down.sh: fix typo in variable assignments (`pr#11224 `_, Willem Jan Withagen) * osd: OSD.cc: initialise variable at definition (`pr#11099 `_, Michal Jarzabek) * osd: OSD.cc: remove unneeded searching of map (`pr#11000 `_, Michal Jarzabek) * osd: OSD.h: make some members private (`pr#11085 `_, Michal Jarzabek) * osd: PG.cc: remove unneeded use of count (`pr#11228 `_, Michal Jarzabek) * osd: PGBackend.h: move structs to .cc file (`pr#10975 `_, Michal Jarzabek) * osd: ReplicatedBackend: move classes to cc file (`pr#10967 `_, Michal Jarzabek) * osd: ReplicatedPG.h: add override to virtual funct (`pr#11271 `_, Michal Jarzabek) * osd: ReplicatedPG: move classes to .cc file (`pr#10971 `_, Michal Jarzabek) * osd: ReplicatedPG:move C_OSD_OnApplied class to cc (`pr#11288 `_, Michal Jarzabek) * osd: Watch.h: remove unneeded forward declaration (`pr#10269 `_, Michal Jarzabek) * osd: osdc/ObjectCacher.h: add const to member functions (`pr#9569 `_, Michal Jarzabek) * osd: osdc/ObjectCacher.h: add const to member functions (`pr#9652 `_, Michal Jarzabek) * osd: osdc/Objecter: move RequestStateHook class to .cc (`pr#10734 `_, Michal Jarzabek) * pybind: Port Python-based tests and remaining Python bindings to Python 3 (`pr#10177 `_, Oleh Prypin) * pybind: Rework cephfs/setup.py for PyPI (`pr#10315 `_, Anirudha Bose) * pybind: Rework rbd/setup.py for PyPI (`issue#16940 `_, `pr#10376 `_, Anirudha Bose) * pybind: global/signal_handler: dump cmdline instead of arg[0] (`pr#10345 `_, Kefu Chai) * pybind: make rados ready for PyPI (`pr#9833 `_, Anirudha Bose) * pybind: pybind/ceph_argparse: handle non ascii unicode args (`issue#12287 `_, `pr#8943 `_, Kefu Chai) * pybind: Python 3 compatibility for workunits (`pr#10815 `_, Anirudha Bose) * rbd: Allow user to remove snapshot with --force to auto flatten children (`pr#10087 `_, Dongsheng Yang) * rbd: Reviewed-off-by: Ilya Dryomov (`issue#16171 `_, `pr#10481 `_, Jason Dillaman) * rbd: Reviewed-off-by: Ilya Dryomov (`issue#17030 `_, `pr#10735 `_, Jason Dillaman) * rbd: bench io-size should not be larger than image size (`issue#16967 `_, `pr#10708 `_, Jason Dillaman) * rbd: cleanup - Proxied operations shouldn't result in error messages if replayed (`issue#16130 `_, `pr#9724 `_, Vikhyat Umrao) * rbd: cls_rbd: clean up status from rbd-mirror if image removed (`pr#11142 `_, Huan Zhang) * rbd: cls_rbd: set omap values in batch during image creation (`pr#9981 `_, Dongsheng Yang) * rbd: inherit the parent image features when cloning an image (`issue#15388 `_, `pr#9334 `_, Dongsheng Yang) * rbd: journal: ensure in-flight ops are complete destroying journaler (`issue#17446 `_, `pr#11257 `_, Mykola Golub, Jason Dillaman) * rbd: journal: increase concurrency/parallelism of journal recorder (`issue#15259 `_, `pr#10445 `_, Ricardo Dias) * rbd: journal: move JournalTrimmer::C_RemoveSet struct (`pr#10912 `_, Michal Jarzabek) * rbd: qa/workunits/rbd: before removing image make sure it is not bootstrapped (`issue#16555 `_, `pr#10155 `_, Mykola Golub) * rbd: qa/workunits/rbd: check status also in pool dir after asok commands (`pr#11291 `_, Mykola Golub) * rbd: qa/workunits/rbd: set image-meta on primary image and wait it is replicated (`pr#11294 `_, Mykola Golub) * rbd: qa/workunits/rbd: small fixup and improvements for rbd-mirror tests (`pr#10483 `_, Mykola Golub) * rbd: qa/workunits/rbd: wait for image deleted before checking health (`pr#10545 `_, Mykola Golub) * rbd: qa/workunits: support filtering cls_rbd unit test cases (`issue#16529 `_, `pr#10714 `_, Jason Dillaman) * rbd: rbd-mirror: 'wait_for_scheduled_deletion' callback might deadlock (`issue#16491 `_, `pr#9964 `_, Jason Dillaman) * rbd: rbd-mirror: Add sparse read for sync image (`issue#16780 `_, `pr#11005 `_, tianqing) * rbd: rbd-mirror: add additional test scenarios (`pr#10488 `_, lande1234) * rbd: rbd-mirror: concurrent access of event might result in heap corruption (`issue#17283 `_, `pr#11104 `_, Jason Dillaman) * rbd: rbd-mirror: force-promoted image will remain R/O until rbd-mirror daemon restarted (`issue#16974 `_, `pr#11090 `_, Jason Dillaman) * rbd: rbd-mirror: gracefully fail if object map is unavailable (`issue#16558 `_, `pr#10065 `_, Jason Dillaman) * rbd: rbd-mirror: gracefully handle being blacklisted (`issue#16349 `_, `pr#9970 `_, Jason Dillaman) * rbd: rbd-mirror: image deleter should use pool id + global image uuid for key (`issue#16538 `_, `issue#16227 `_, `pr#10484 `_, Jason Dillaman) * rbd: rbd-mirror: improve split-brain detection logic (`issue#16855 `_, `pr#10703 `_, Jason Dillaman) * rbd: rbd-mirror: include local pool id in resync throttle unique key (`issue#16536 `_, `pr#10254 `_, Jason Dillaman) * rbd: rbd-mirror: non-primary image is recording journal events during image sync (`pr#10462 `_, Jason Dillaman) * rbd: rbd-mirror: potential IO stall when using asok flush request (`issue#16708 `_, `pr#10432 `_, Jason Dillaman) * rbd: rbd-mirror: potential assertion failure during error-induced shutdown (`issue#16956 `_, `pr#10613 `_, Jason Dillaman) * rbd: rbd-mirror: potential race condition during failure shutdown (`issue#16980 `_, `pr#10667 `_, Jason Dillaman) * rbd: rbd-mirror: quiesce in-flight event commits before shut down (`issue#17355 `_, `pr#11185 `_, Jason Dillaman) * rbd: rbd-mirror: reduce memory footprint during journal replay (`issue#16223 `_, `pr#10341 `_, Jason Dillaman) * rbd: rbd-mirror: remove ceph_test_rbd_mirror_image_replay test case (`issue#16539 `_, `pr#10083 `_, Mykola Golub) * rbd: rbd-mirror: replaying state should include flush action (`issue#16970 `_, `pr#10627 `_, Jason Dillaman) * rbd: rbd-mirror: replicate dynamic feature updates (`issue#16213 `_, `pr#10980 `_, Mykola Golub) * rbd: rbd-mirror: replicate image metadata settings (`issue#16212 `_, `pr#11168 `_, Mykola Golub) * rbd: rbd-mirror: snap rename does not properly replicate to peers (`issue#16622 `_, `pr#10249 `_, Jason Dillaman) * rbd: rbd-nbd does not properly handle resize notifications (`issue#15715 `_, `pr#9291 `_, Mykola Golub) * rbd: rbd-nbd: fix kernel deadlock during teuthology testing (`issue#16921 `_, `pr#10985 `_, Jason Dillaman) * rbd: recognize lock_on_read option (`pr#11313 `_, Ilya Dryomov) * rbd: return error if we specified a wrong image name for rbd du (`issue#16987 `_, `pr#11031 `_, Dongsheng Yang) * rbd: test/librbd/fsx: enable exclusive-lock feature in krbd mode (`pr#10984 `_, Ilya Dryomov) * rbd: test/rbd: fix possible mock journal race conditions (`issue#17317 `_, `pr#11153 `_, Jason Dillaman) * rbd: test: cmake related fixups for rbd tests (`pr#10124 `_, Mykola Golub) * rbd: test: run-rbd-tests test cmake fixup (`pr#10134 `_, Mykola Golub) * rbd: test: use wrapper that respects RBD_FEATURES when creating rbd image (`issue#16717 `_, `pr#10348 `_, Mykola Golub) * rbd: udev: add krbd readahead placeholder (`pr#10841 `_, Nick Fisk) * rbd: rbd_mirror/ImageSynceThrottler: move struct to .cc (`pr#10928 `_, Michal Jarzabek) * rgw: (build verified, provably unused/not aliased) (`pr#9993 `_, weiqiaomiao) * rgw: Add documentation for the Multi-tenancy feature (`pr#9570 `_, Pete Zaitcev) * rgw: Clean up lifecycle thread (`pr#10480 `_, Daniel Gryniewicz) * rgw: Do not archive metadata by default (`issue#17256 `_, `pr#11051 `_, Pavan Rallabhandi) * rgw: Fix Host->bucket fallback logic inversion (`issue#15975 `_, `issue#17136 `_, `pr#10873 `_, Robin H. Johnson) * rgw: Fix for using port 443 with pre-signed urls. (`issue#16548 `_, `pr#10088 `_, Pritha Srivastava) * rgw: Fix incorrect content length and range for zero sized objects during range requests (`issue#16388 `_, `pr#10207 `_, Pavan Rallabhandi) * rgw: Got rid of recursive mutex. (`pr#10562 `_, Adam Kupczyk) * rgw: RGW : setting socket backlog for via ceph.conf (`issue#16406 `_, `pr#9891 `_, Feng Guo) * rgw: RGWMetaSyncCR holds refs to stacks instead of crs (`issue#16666 `_, `pr#10301 `_, Casey Bodley) * rgw: Reviewed by: Pritha Srivastava (`issue#16188 `_, `pr#9584 `_, Albert Tu) * rgw: Rgw lifecycle testing (`pr#11131 `_, Daniel Gryniewicz) * rgw: Rgw nfs 28 (`pr#10611 `_, Matt Benjamin) * rgw: add configurables for {data,meta} sync error injection (`pr#10388 `_, Yehuda Sadeh) * rgw: add deadlock detection to RGWCoroutinesManager::run() (`pr#10032 `_, Casey Bodley) * rgw: add lc_pool when decode or encode struct RGWZoneParams (`pr#10439 `_, weiqiaomiao) * rgw: add missing master_zone when running with old default region config (`issue#16627 `_, `pr#10205 `_, Orit Wasserman) * rgw: add pg_ver to tombstone_cache (`pr#9851 `_, Casey Bodley) * rgw: add reinit/rebind logic (ldap) (`pr#10532 `_, Matt Benjamin) * rgw: add return value checking to avoid possible subsequent `parse` exception (`pr#10241 `_, Yan Jun) * rgw: add suport for Swift-at-root dependent features of Swift API (`issue#16673 `_, `pr#10280 `_, Pritha Srivastava, Radoslaw Zarzynski) * rgw: add support for Static Website of Swift API (`pr#9844 `_, Radoslaw Zarzynski) * rgw: add tenant support to multisite sync (`issue#16469 `_, `pr#10075 `_, Casey Bodley) * rgw: back off bucket sync on failures, don't store marker (`issue#16742 `_, `pr#10355 `_, Yehuda Sadeh) * rgw: better error message when user has no bucket created yet (`issue#16444 `_, `pr#10162 `_, Gaurav Kumar Garg) * rgw: clean-up in the authentication infrastructure (`pr#10212 `_, Radoslaw Zarzynski) * rgw: clear realm watch on failed watch_restart (`issue#16817 `_, `pr#10446 `_, Casey Bodley) * rgw: collect skips a specific coroutine stack (`issue#16665 `_, `pr#10274 `_, Yehuda Sadeh) * rgw: cosmetic changes only--build verified, f23 (`pr#9931 `_, Yan Jun) * rgw: delete region map after upgrade to zonegroup map (`issue#17051 `_, `pr#10831 `_, Casey Bodley) * rgw: do not try to encode or decode time_t and fix compiling warnings (`pr#10751 `_, Kefu Chai) * rgw: don't fail if lost race when setting acls (`issue#16930 `_, `pr#11286 `_, Yehuda Sadeh) * rgw: drop create_bucket in fwd_request log message (`pr#10214 `_, Abhishek Lekshmanan) * rgw: eradicate dynamic memory allocation in RGWPostObj. (`pr#11054 `_, Radoslaw Zarzynski) * rgw: file setattr (`pr#8618 `_, Matt Benjamin) * rgw: finish error_repo cr in stop_spawned_services() (`issue#16530 `_, `pr#10031 `_, Yehuda Sadeh) * rgw: fix RGWAccessControlPolicy_SWIFT::create return value check error (`issue#17090 `_, `pr#10727 `_, weiqiaomiao) * rgw: fix compilation (`pr#10252 `_, Josh Durgin) * rgw: fix decoding of creation_time and last_update. (`issue#17167 `_, `pr#11132 `_, Orit Wasserman) * rgw: fix error_repo segfault in data sync (`issue#16603 `_, `pr#10157 `_, Casey Bodley) * rgw: fix failed to create bucket if a non-master zonegroup has a single zone (`pr#10991 `_, weiqiaomiao) * rgw: fix flush_read_list() error msg (`pr#10749 `_, Jiaying Ren) * rgw: fix for issue 16494 (`issue#16494 `_, `pr#10077 `_, Yehuda Sadeh) * rgw: fix for s3tests failure when ldap auth is not applied (`pr#10669 `_, Casey Bodley) * rgw: fix get object instance returned NoSuchKey error (`issue#17111 `_, `pr#10820 `_, Yang Honggang) * rgw: fix is_admin handling in RGWLDAPAuthEngine and introduce acct_privilege_t (`pr#10687 `_, Radoslaw Zarzynski) * rgw: fix issue 16435 (`issue#16435 `_, `pr#10193 `_, Yehuda Sadeh) * rgw: fix multi-delete query param parsing. (`issue#16618 `_, `pr#10187 `_, Robin H. Johnson) * rgw: fix period update --commit return error (`issue#17110 `_, `pr#10836 `_, weiqiaomiao) * rgw: fix radosgw daemon core when reopen logs (`issue#17036 `_, `pr#10737 `_, weiqiaomiao) * rgw: fix regression with handling double underscore (`issue#16856 `_, `pr#10939 `_, Orit Wasserman) * rgw: fix rgw_bucket_dir_entry decode v (`pr#10918 `_, Tianshan Qu) * rgw: fix the error return variable in log message and cleanups (`pr#10138 `_, Yan Jun) * rgw: fix the missing return value (`pr#10122 `_, Yan Jun) * rgw: fix upgrade from old multisite to new multisite configuration (`issue#16751 `_, `pr#10368 `_, Orit Wasserman) * rgw: fix wrong variable definition in cls_version_check func (`pr#10233 `_, weiqiaomiao) * rgw: fix wrong variable definition in rgw_cls_lc_set_entry function (`pr#10408 `_, weiqiaomiao) * rgw: for the create_bucket api, if the input creation_time is zero, we should set it to 'now" (`issue#16597 `_, `pr#10118 `_, weiqiaomiao) * rgw: kill a compile warning for rgw_sync (`pr#10425 `_, Casey Bodley, Abhishek Lekshmanan) * rgw: lgtm (`pr#9941 `_, weiqiaomiao) * rgw: lgtm (build verified, f23) (`pr#9754 `_, John Coyle) * rgw: lgtm, build verified f23 (`pr#10035 `_, Yan Jun) * rgw: lgtm--build verified, f23 (`pr#10002 `_, Yan Jun) * rgw: lgtm--build verified, f23 (`pr#9985 `_, Yan Jun) * rgw: lgtm--should backport (`pr#9979 `_, Yan Jun) * rgw: log mp upload failures due to parts mismatch (`pr#10424 `_, Abhishek Lekshmanan) * rgw: merge setting flags operation together and cleanups (`pr#10203 `_, Yan Jun) * rgw: miscellaneous cleanups (`pr#10299 `_, Yan Jun) * rgw: multiple fixes for Swift's object expiration (`issue#16705 `_, `issue#16684 `_, `pr#10330 `_, Radoslaw Zarzynski) * rgw: need to 'open_object_section' before dump stats in 'RGWGetUsage_… (`issue#17499 `_, `pr#11325 `_, weiqiaomiao) * rgw: obsolete 'radosgw-admin period prepare' command (`issue#17387 `_, `pr#11278 `_, Gaurav Kumar Garg) * rgw: radosgw-admin: add "--orphan-stale-secs" to --help (`issue#17280 `_, `pr#11098 `_, Ken Dreyer) * rgw: radosgw-admin: zone[group] modify can change realm id (`issue#16839 `_, `pr#10477 `_, Casey Bodley) * rgw: raise log levels for common radosgw-admin errors (`issue#16935 `_, `pr#10602 `_, Shilpa Jagannath) * rgw: register the correct handler for cls_user_complete_stats (`issue#16624 `_, `pr#10151 `_, Orit Wasserman) * rgw: remove bucket index objects when deleting the bucket (`issue#16412 `_, `pr#10120 `_, Orit Wasserman) * rgw: remove possible duplicate setting (`pr#10110 `_, Yan Jun) * rgw: remove the field ret from class RGWPutLC (`pr#10726 `_, weiqiaomiao) * rgw: remove unused bufferlist variable (`pr#10194 `_, Yan Jun) * rgw: remove unused realm from radosgw-admin zone modify (`issue#16632 `_, `pr#10211 `_, Orit Wasserman) * rgw: remove unused variables (`pr#10589 `_, Yan Jun) * rgw: return "NoSuchLifecycleConfiguration" if lifecycle config does not exist (`pr#10442 `_, weiqiaomiao) * rgw: revert a commit that broke s3 signature validation (`issue#17279 `_, `pr#11102 `_, Casey Bodley) * rgw: rgw file: remove busy-wait in RGWLibFS::gc() (`pr#10638 `_, Matt Benjamin) * rgw: rgw ldap: protect rgw::from_base64 from non-base64 input (`pr#10777 `_, Matt Benjamin) * rgw: rgw ldap: enforce simple_bind w/LDAPv3 (`pr#10593 `_, Matt Benjamin) * rgw: rgw multisite: RGWCoroutinesManager::run returns status of last cr (`issue#17047 `_, `pr#10778 `_, Casey Bodley) * rgw: rgw multisite: RGWDataSyncCR fails on errors from RGWListBucketIndexesCR (`issue#17073 `_, `pr#10779 `_, Casey Bodley) * rgw: rgw multisite: fix for assertion in RGWMetaSyncCR (`issue#17044 `_, `pr#10743 `_, Casey Bodley) * rgw: rgw multisite: fixes for period puller (`issue#16939 `_, `pr#10596 `_, Casey Bodley) * rgw: rgw multisite: trim data logs as peer zones catch up (`pr#10372 `_, Casey Bodley) * rgw: rgw nfs v3 completions (`pr#10745 `_, Matt Benjamin) * rgw: rgw-admin: allow unsetting user's email (`issue#13286 `_, `pr#11340 `_, Yehuda Sadeh, Weijun Duan) * rgw: rgw/admin: fix some return values and indents (`pr#9170 `_, Yan Jun) * rgw: rgw/rados: remove confused error printout (`pr#9351 `_, Yan Jun) * rgw: rgw/rgw_common.cc: modify the end check in RGWHTTPArgs::sys_get (`pr#9136 `_, zhao kun) * rgw: rgw/rgw_lc.cc: fix sleep time according to the error message (`pr#10930 `_, Weibing Zhang) * rgw: rgw/rgw_main: fix unnecessary variables defined (`pr#10475 `_, zhang.zezhu) * rgw: rgw/swift: remove redundant assignment operation (`pr#11292 `_, Yan Jun) * rgw: rgw_file: pre-assign times (`issue#17367 `_, `pr#11181 `_, Matt Benjamin) * rgw: rgw_file: fix rename cases and unify unlink (`pr#10271 `_, Matt Benjamin) * rgw: rgw_file: fix set_attrs operation (`pr#11159 `_, Matt Benjamin) * rgw: rgw_file: refuse partial, out-of-order writes (`pr#10284 `_, Matt Benjamin) * rgw: rgw_file: restore local definition of RGWLibFS gc interval (`pr#10756 `_, Matt Benjamin) * rgw: rgw_file: unlock() must precede out label (`pr#10635 `_, Matt Benjamin) * rgw: right parenthesis is missing in radosgw-admin help message on caps (`pr#10947 `_, Weibing Zhang) * rgw: set correct instance on the object (`issue#17443 `_, `pr#11270 `_, Yehuda Sadeh) * rgw: store oldest mdlog period in rados (`issue#16894 `_, `pr#10558 `_, Casey Bodley) * rgw: test/multi.py add a destructive attr to tests (`pr#10401 `_, Abhishek Lekshmanan) * rgw: test/rgw: add --gateways-per-zone to test_multi.py (`pr#10742 `_, Casey Bodley) * rgw: test_multi.py avoid creating mds (`pr#10174 `_, Abhishek Lekshmanan) * rgw: test_rgw_bencode: null terminate strings before checking (`issue#16861 `_, `pr#10510 `_, Yehuda Sadeh) * rgw: use endpoints from master zone instead of zonegroup (`issue#16834 `_, `pr#10456 `_, Casey Bodley) * rgw: use the standard usage of string.find (`pr#10226 `_, Yan Jun) * rgw: verfied: f23, subset of s3tests (`pr#10448 `_, Pritha Srivastava) * rgw: verified (`pr#10000 `_, weiqiaomiao) * rgw: verified non-regression (MS AD) (`pr#10597 `_, Pritha Srivastava) * rgw: verified: autobuild (`issue#16928 `_, `pr#10579 `_, Robin H. Johnson) * rgw: verified: MS AD (`pr#10307 `_, Pritha Srivastava) * rgw: verified: f23 (`pr#10882 `_, Michal Jarzabek) * rgw: verified: f23 (`pr#10858 `_, Weibing Zhang) * rgw: verified: f23 (`pr#10822 `_, Yan Jun) * rgw: verified: f23 (`pr#10929 `_, Weibing Zhang) * rgw: wip: rgw multisite: preserve zone's extra pool (`issue#16712 `_, `pr#10397 `_, Abhishek Lekshmanan) * rgw: work around curl_multi_wait bug with non-blocking reads (`issue#15915 `_, `issue#16695 `_, `pr#10998 `_, Casey Bodley) * rgw:add a s3 API of make torrent for a object (`pr#10396 `_, zhouruisong) * rgw:add a s3 API of make torrent for a object (`pr#9589 `_, zhouruisong) * rgw:bucket check remove _multipart_ prefix (`pr#6501 `_, Weijun Duan) * rgw:clean unuse bufferlist (`pr#10232 `_, weiqiaomiao) * rgw:fix rgw boot failed after upgrade to master latest version (`pr#10409 `_, weiqiaomiao) * rgw:lifecycle feature [rebased] (`pr#9737 `_, Ji Chen, Daniel Gryniewicz) * rgw: rgw/rgw_rados.h: remove unneeded class C_Tick (`pr#10954 `_, Michal Jarzabek) * rgw: ext_mime_map_init add string describing for error number (`pr#9807 `_, Yan Jun) * tests: Add test for global static non-POD segfault (`pr#10486 `_, Brad Hubbard) * tests: populate /dev/disk/by-partuuid for scsi_debug (`issue#17100 `_, `pr#10824 `_, Loic Dachary) * tests: use a fixture for memstore clone testing (`pr#11190 `_, Kefu Chai) * tests: run-\*make-check.sh: Make DRY_RUN actually mean a dry run (`pr#11074 `_, Brad Hubbard) * tests: run-cmake-check.sh: Actually run the tests (`pr#11075 `_, Brad Hubbard) * tests: run-cmake-check.sh: Init submodules (`pr#11091 `_, Brad Hubbard) * tests: run-make-check.sh: Make DRY_RUN actually do a dry run (`pr#11092 `_, Brad Hubbard) * tests: run-make-check.sh: pass args to do_cmake.sh (`pr#10701 `_, John Coyle) * tests: unittest_chain_xattr: account for existing xattrs (`issue#16025 `_, `pr#11109 `_, Dan Mick) * tests: src/test/cli/* tests: POSIX Convert grep -P to grep -E (`pr#10319 `_, Willem Jan Withagen) * test: ceph_test_msgr: fix circular locking dependency (`issue#16955 `_, `pr#10612 `_, Kefu Chai) * test: cli/crushtool: fix the test of compile-decompile-recompile.t (`issue#17306 `_, `pr#11173 `_, Kefu Chai) * test: libcephfs: fix gcc sys/fcntl.h warnings (`pr#10126 `_, John Coyle) * test: librados: rados_connect() should succeed (`issue#17087 `_, `pr#10806 `_, Kefu Chai) * test: mds: add fs dump in test_ceph_argparse.py (`pr#10347 `_, huanwen ren) * test: simple_dispatcher.cc: remove unused variable (`pr#9932 `_, Michal Jarzabek) * test: store_test: tidy-up SyntheticWorkloadState class (`pr#10775 `_, xie xingguo) * test: More portable use of mmap(MAP_ANON) (`pr#10557 `_, Willem Jan Withagen) * test: Removeall merged after print_function commit needs a fix (`pr#10535 `_, David Zafman) * test: ceph-disk.sh do not kill all daemons (`issue#16729 `_, `pr#10346 `_, Kefu Chai) * test: cephtool/test.sh: fix expect_false() calls (`pr#10133 `_, Kefu Chai) * test: fix usage info of omapbench (`pr#10089 `_, Wanlong Gao) * test: remove ceph_test_rados_api_tmap_migrate (`issue#16144 `_, `pr#10256 `_, Kefu Chai) * test: test_{compression_plugin,async_compressor}: do not copy plugins (`pr#10153 `_, Kefu Chai) * test: test_rados_tool.sh: Make script work under ctest (`pr#10166 `_, Willem Jan Withagen) * test: qa/workunits/cephtool/test.sh: fix omission of ceph-command (`pr#10979 `_, Willem Jan Withagen) * test: qa/workunits/cephtool/test.sh: s/TMPDIR/TEMP_DIR/ (`pr#10306 `_, Kefu Chai) * test: qa/workunits/cephtool/test.sh: use absolute path for TEMP_DIR (`pr#10430 `_, Kefu Chai) * tools: New "removeall" used to remove head with snapshots (`pr#10098 `_, David Zafman) * tools: do not closed stdout ; fix overload of "<" operator (`pr#9290 `_, xie xingguo) * tools: fix the core dump when get the crushmap do not exist (`pr#10451 `_, song baisen) * tools: rebuild monstore (`issue#17179 `_, `pr#10933 `_, Kefu Chai) * tools: use TextTable for "rados df" plain output (`pr#9362 `_, xie xingguo) * tools: fio engine for objectstore (`pr#10267 `_, Casey Bodley, Igor Fedotov, Daniel Gollub) * tools: rados/client: fix typo (`pr#10493 `_, Yan Jun) * tools: rados/client: fix waiting on the condition variable more efficient. (`pr#9939 `_, Yan Jun) * tools: tools/rebuild_mondb: kill comipling warning and other fixes (`pr#11117 `_, xie xingguo) * tools: authtool: Enhance argument combinations validation (`issue#2904 `_, `pr#9704 `_, Brad Hubbard) * tools: ceph-disk: change ownership of initfile to ceph:ceph (`issue#16280 `_, `pr#9688 `_, Shylesh Kumar) * test: ceph_test_rados_api_tmap_migrate: remove test for tmap_upgrade (`pr#10234 `_, Kefu Chai) v10.2.8 Jewel ============= This point release brought a number of important bugfixes in all major components of Ceph. However, it also introduced a regression that could cause MDS damage, and a new release, v10.2.9, was published to address this. Therefore, Jewel users should *not* upgrade to this version - instead, we recommend upgrading directly to v10.2.9. For more detailed information, see :download:`the complete changelog `. OSD Removal Caveat ------------------ There was a bug introduced in Jewel (#19119) that broke the mapping behavior when an "out" OSD that still existed in the CRUSH map was removed with 'osd rm'. This could result in 'misdirected op' and other errors. The bug is now fixed, but the fix itself introduces the same risk because the behavior may vary between clients and OSDs. To avoid problems, please ensure that all OSDs are removed from the CRUSH map before deleting them. That is, be sure to do:: ceph osd crush rm osd.123 before:: ceph osd rm osd.123 Snap Trimmer Improvements ------------------------- This release greatly improves control and throttling of the snap trimmer. It introduces the "osd max trimming pgs" option (defaulting to 2), which limits how many PGs on an OSD can be trimming snapshots at a time. And it restores the safe use of the "osd snap trim sleep" option, wihch defaults to 0 but otherwise adds the given number of seconds in delay between every dispatch of trim operations to the underlying system. Other Notable Changes --------------------- * build/ops: "osd marked itself down" will not recognised if host runs mon + osd on shutdown/reboot (`issue#18516 `_, `pr#13492 `_, Boris Ranto) * build/ops: ceph-base package missing dependency for psmisc (`issue#19129 `_, `pr#13786 `_, Nathan Cutler) * build/ops: enable build of ceph-resource-agents package on rpm-based os (`issue#17613 `_, `issue#19546 `_, `pr#13606 `_, Nathan Cutler) * build/ops: rbdmap.service not included in debian packaging (jewel-only) (`issue#19547 `_, `pr#14383 `_, Ken Dreyer) * cephfs: Journaler may execute on_safe contexts prematurely (`issue#20055 `_, `pr#15468 `_, "Yan, Zheng") * cephfs: MDS assert failed when shutting down (`issue#19204 `_, `pr#14683 `_, John Spray) * cephfs: MDS goes readonly writing backtrace for a file whose data pool has been removed (`issue#19401 `_, `pr#14682 `_, John Spray) * cephfs: MDS server crashes due to inconsistent metadata (`issue#19406 `_, `pr#14676 `_, John Spray) * cephfs: No output for ceph mds rmfailed 0 --yes-i-really-mean-it command (`issue#16709 `_, `pr#14674 `_, John Spray) * cephfs: Test failure: test_data_isolated (tasks.cephfs.test_volume_client.TestVolumeClient) (`issue#18914 `_, `pr#14685 `_, "Yan, Zheng") * cephfs: Test failure: test_open_inode (`issue#18661 `_, `pr#14669 `_, John Spray) * cephfs: The mount point break off when mds switch hanppened (`issue#19437 `_, `pr#14679 `_, Guan yunfei) * cephfs: ceph-fuse does not recover after lost connection to MDS (`issue#16743 `_, `issue#18757 `_, `pr#14698 `_, Kefu Chai, Henrik Korkuc, Patrick Donnelly) * cephfs: client: fix the cross-quota rename boundary check conditions (`issue#18699 `_, `pr#14667 `_, Greg Farnum) * cephfs: mds is crushed, after I set about 400 64KB xattr kv pairs to a file (`issue#19033 `_, `pr#14684 `_, Yang Honggang) * cephfs: non-local quota changes not visible until some IO is done (`issue#17939 `_, `pr#15466 `_, John Spray, Nathan Cutler) * cephfs: normalize file open flags internally used by cephfs (`issue#18872 `_, `issue#19890 `_, `pr#15000 `_, Jan Fajerski, "Yan, Zheng") * common: monitor creation with IPv6 public network segfaults (`issue#19371 `_, `pr#14324 `_, Fabian Grünbichler) * common: radosstriper: protect aio_write API from calls with 0 bytes (`issue#14609 `_, `pr#13254 `_, Sebastien Ponce) * core: Objecter::epoch_barrier isn't respected in _op_submit() (`issue#19396 `_, `pr#14332 `_, Ilya Dryomov) * core: clear divergent_priors set off disk (`issue#17916 `_, `pr#14596 `_, Greg Farnum) * core: improve snap trimming, enable restriction of parallelism (`issue#19241 `_, `pr#14492 `_, Samuel Just, Greg Farnum) * core: os/filestore/HashIndex: be loud about splits (`issue#18235 `_, `pr#13788 `_, Dan van der Ster) * core: os/filestore: fix clang static check warn use-after-free (`issue#19311 `_, `pr#14044 `_, liuchang0812, yaoning) * core: transient jerasure unit test failures (`issue#18070 `_, `issue#17762 `_, `issue#18128 `_, `issue#17951 `_, `pr#14701 `_, Kefu Chai, Pan Liu, Loic Dachary, Jason Dillaman) * core: two instances of omap_digest mismatch (`issue#18533 `_, `pr#14204 `_, Samuel Just, David Zafman) * doc: Improvements to crushtool manpage (`issue#19649 `_, `pr#14635 `_, Loic Dachary, Nathan Cutler) * doc: PendingReleaseNotes: note about 19119 (`issue#19119 `_, `pr#13732 `_, Sage Weil) * doc: admin ops: fix the quota section (`issue#19397 `_, `pr#14654 `_, Chu, Hua-Rong) * doc: radosgw-admin: add the 'object stat' command to usage (`issue#19013 `_, `pr#13872 `_, Pavan Rallabhandi) * doc: rgw S3 create bucket should not do response in json (`issue#18889 `_, `pr#13874 `_, Abhishek Lekshmanan) * fs: Invalid error code returned by MDS is causing a kernel client WARNING (`issue#19205 `_, `pr#13831 `_, Jan Fajerski, xie xingguo) * librbd: Incomplete declaration for ContextWQ in librbd/Journal.h (`issue#18862 `_, `pr#14152 `_, Boris Ranto) * librbd: Issues with C API image metadata retrieval functions (`issue#19588 `_, `pr#14666 `_, Mykola Golub) * librbd: Possible deadlock performing a synchronous API action while refresh in-progress (`issue#18419 `_, `pr#13154 `_, Jason Dillaman) * librbd: is_exclusive_lock_owner API should ping OSD (`issue#19287 `_, `pr#14481 `_, Jason Dillaman) * librbd: remove image header lock assertions (`issue#18244 `_, `pr#13809 `_, Jason Dillaman) * mds: C_MDSInternalNoop::complete doesn't free itself (`issue#19501 `_, `pr#14677 `_, "Yan, Zheng") * mds: Too many stat ops when trying to probe a large file (`issue#19955 `_, `pr#15472 `_, "Yan, Zheng") * mds: avoid reusing deleted inode in StrayManager::_purge_stray_logged (`issue#18877 `_, `pr#14670 `_, Zhi Zhang) * mds: enable start when session ino info is corrupt (`issue#19708 `_, `issue#16842 `_, `pr#14700 `_, John Spray) * mds: fragment space check can cause replayed request fail (`issue#18660 `_, `pr#14668 `_, "Yan, Zheng") * mds: heartbeat timeout during rejoin, when working with large amount of caps/inodes (`issue#19118 `_, `pr#14672 `_, John Spray) * mds: issue new caps when sending reply to client (`issue#19635 `_, `pr#15438 `_, "Yan, Zheng") * mon: OSDMonitor: make 'osd crush move ...' work on osds (`issue#18587 `_, `pr#13261 `_, Sage Weil) * mon: fix 'sortbitwise' warning on jewel (`issue#20578 `_, `pr#15208 `_, huanwen ren, Sage Weil) * mon: make get_mon_log_message() atomic (`issue#19427 `_, `pr#14587 `_, Kefu Chai) * mon: remove bad rocksdb option (`issue#19392 `_, `pr#14236 `_, Sage Weil) * msg: IPv6 Heartbeat packets are not marked with DSCP QoS - simple messenger (`issue#18887 `_, `pr#13450 `_, Yan Jun, Robin H. Johnson) * msg: set close on exec flag (`issue#16390 `_, `pr#13585 `_, Kefu Chai) * osd: --flush-journal: sporadic segfaults on exit (`issue#18820 `_, `pr#13477 `_, Alexey Sheplyakov) * osd: Give requested scrubs a higher priority (`issue#15789 `_, `pr#14686 `_, David Zafman) * osd: Implement asynchronous scrub sleep (`issue#19986 `_, `issue#19497 `_, `pr#15529 `_, Brad Hubbard) * osd: Object level shard errors are tracked and used if no auth available (`issue#20089 `_, `pr#15416 `_, David Zafman) * osd: ReplicatedPG: try with pool's use-gmt setting if hitset archive not found (`issue#19185 `_, `pr#13827 `_, Kefu Chai) * osd: allow client throttler to be adjusted on-fly, without restart (`issue#18791 `_, `pr#13214 `_, Piotr Dałek) * osd: bypass readonly ops when osd full (`issue#19394 `_, `pr#14181 `_, Jianpeng Ma, yaoning) * osd: degraded and misplaced status output inaccurate (`issue#18619 `_, `pr#14325 `_, David Zafman) * osd: new added OSD always down when full flag is set (`issue#15025 `_, `pr#14326 `_, Mingxin Liu) * osd: pg_pool_t::encode(): be compatible with Hammer <= 0.94.6 (`issue#19508 `_, `pr#14392 `_, Alexey Sheplyakov) * osd: pre-jewel "osd rm" incrementals are misinterpreted (`issue#19119 `_, `pr#13884 `_, Ilya Dryomov) * osd: preserve allocation hint attribute during recovery (`issue#19083 `_, `pr#13647 `_, yaoning) * osd: promote throttle parameters are reversed (`issue#19773 `_, `pr#14791 `_, Mark Nelson) * osd: reindex properly on pg log split (`issue#18975 `_, `pr#14047 `_, Alexey Sheplyakov) * osd: restrict want_acting to up+acting on recovery completion (`issue#18929 `_, `pr#13541 `_, Sage Weil) * rbd-nbd: check /sys/block/nbdX/size to ensure kernel mapped correctly (`issue#18335 `_, `pr#13932 `_, Mykola Golub, Alexey Sheplyakov) * rbd: [api] temporarily restrict (rbd_)mirror_peer_add from adding multiple peers (`issue#19256 `_, `pr#14664 `_, Jason Dillaman) * rbd: qemu crash triggered by network issues (`issue#18436 `_, `pr#13244 `_, Jason Dillaman) * rbd: rbd --pool=x rename y z does not work (`issue#18326 `_, `pr#14148 `_, Gaurav Kumar Garg) * rbd: systemctl stop rbdmap unmaps all rbds and not just the ones in /etc/ceph/rbdmap (`issue#18884 `_, `issue#18262 `_, `pr#14083 `_, David Disseldorp, Nathan Cutler) * rgw: "cluster [WRN] bad locator @X on object @X...." in cluster log (`issue#18980 `_, `pr#14064 `_, Casey Bodley) * rgw: 'radosgw-admin sync status' on master zone of non-master zonegroup (`issue#18091 `_, `pr#13779 `_, Jing Wenjun) * rgw: Change loglevel to 20 for 'System already converted' message (`issue#18919 `_, `pr#13834 `_, Vikhyat Umrao) * rgw: Use decoded URI when verifying TempURL (`issue#18590 `_, `pr#13724 `_, Alexey Sheplyakov) * rgw: a few cases where rgw_obj is incorrectly initialized (`issue#19096 `_, `pr#13842 `_, Yehuda Sadeh) * rgw: add apis to support ragweed suite (`issue#19804 `_, `pr#14851 `_, Yehuda Sadeh) * rgw: add bucket size limit check to radosgw-admin (`issue#17925 `_, `pr#14787 `_, Matt Benjamin) * rgw: allow system users to read SLO parts (`issue#19027 `_, `pr#14752 `_, Casey Bodley) * rgw: don't return skew time in pre-signed url (`issue#18828 `_, `issue#18829 `_, `pr#14605 `_, liuchang0812) * rgw: failure to create s3 type subuser from admin rest api (`issue#16682 `_, `pr#14815 `_, snakeAngel2015) * rgw: fix break inside of yield in RGWFetchAllMetaCR (`issue#17655 `_, `pr#14066 `_, Casey Bodley) * rgw: fix failed to create bucket if a non-master zonegroup has a single zone (`issue#19756 `_, `pr#14766 `_, weiqiaomiao) * rgw: health check errors out incorrectly (`issue#19025 `_, `pr#13865 `_, Pavan Rallabhandi) * rgw: list_plain_entries() stops before bi_log entries (`issue#19876 `_, `pr#15383 `_, Casey Bodley) * rgw: multisite: fetch_remote_obj() gets wrong version when copying from remote (`issue#19599 `_, `pr#14607 `_, Zhang Shaowen, Casey Bodley) * rgw: multisite: some yields in RGWMetaSyncShardCR::full_sync() resume in incremental_sync() (`issue#18076 `_, `pr#13837 `_, Casey Bodley, Abhishek Lekshmanan) * rgw: only append zonegroups to rest params if not empty (`issue#20078 `_, `pr#15312 `_, Yehuda Sadeh, Karol Mroz) * rgw: pullup civet chunked (`issue#19736 `_, `pr#14776 `_, Matt Benjamin) * rgw: rgw_file: fix event expire check, don't expire directories being read (`issue#19623 `_, `issue#19270 `_, `issue#19625 `_, `issue#19624 `_, `issue#19634 `_, `issue#19435 `_, `pr#14653 `_, Gui Hecheng, Matt Benjamin) * rgw: swift: disable revocation thread under certain circumstances (`issue#19499 `_, `issue#9493 `_, `pr#14789 `_, Marcus Watts) * rgw: the swift container acl does not support field .ref (`issue#18484 `_, `pr#13833 `_, Jing Wenjun) * rgw: typo in rgw_admin.cc (`issue#19026 `_, `pr#13863 `_, Ronak Jain) * rgw: unsafe access in RGWListBucket_ObjStore_SWIFT::send_response() (`issue#19249 `_, `pr#14661 `_, Yehuda Sadeh) * rgw: upgrade to multisite v2 fails if there is a zone without zone info (`issue#19231 `_, `pr#14136 `_, Danny Al-Gaaf, Orit Wasserman) * rgw: use separate http_manager for read_sync_status (`issue#19236 `_, `pr#14195 `_, Casey Bodley, Shasha Lu) * rgw: when converting region_map we need to use rgw_zone_root_pool (`issue#19195 `_, `pr#14143 `_, Orit Wasserman) * rgw: zonegroupmap set does not work (`issue#19498 `_, `issue#18725 `_, `pr#14660 `_, Orit Wasserman, Casey Bodley) * rgw:fix memory leaks in data/md sync (`issue#20088 `_, `pr#15382 `_, weiqiaomiao) * tests: 'ceph auth import -i' overwrites caps, should alert user before overwrite (`issue#18932 `_, `pr#13544 `_, Vikhyat Umrao) * tests: New upgrade test for #19508 (`issue#19829 `_, `issue#19508 `_, `pr#14930 `_, Nathan Cutler) * tests: [ FAILED ] TestLibRBD.ImagePollIO in upgrade:client-upgrade-kraken-distro-basic-smithi (`issue#18617 `_, `pr#13107 `_, Jason Dillaman) * tests: [librados_test_stub] cls_cxx_map_get_XYZ methods don't return correct value (`issue#19597 `_, `pr#14665 `_, Jason Dillaman) * tests: additional rbd-mirror test stability improvements (`issue#18935 `_, `pr#14154 `_, Jason Dillaman) * tests: api_misc: [ FAILED ] LibRadosMiscConnectFailure.ConnectFailure (`issue#15368 `_, `pr#14763 `_, Sage Weil) * tests: buffer overflow in test LibCephFS.DirLs (`issue#18941 `_, `pr#14671 `_, "Yan, Zheng") * tests: clone workunit using the branch specified by task (`issue#19429 `_, `pr#14371 `_, Kefu Chai, Dan Mick) * tests: drop upgrade/hammer-jewel-x (`issue#20574 `_, `pr#15933 `_, Nathan Cutler) * tests: dummy suite fails in OpenStack (`issue#18259 `_, `pr#14070 `_, Nathan Cutler) * tests: eliminate race condition in Thrasher constructor (`issue#18799 `_, `pr#13608 `_, Nathan Cutler) * tests: enable quotas for pre-luminous quota tests (`issue#20412 `_, `pr#15936 `_, Patrick Donnelly) * tests: fix oversight in yaml comment (`issue#20581 `_, `pr#14449 `_, Nathan Cutler) * tests: move swift.py task from teuthology to ceph, phase one (jewel) (`issue#20392 `_, `pr#15870 `_, Nathan Cutler, Sage Weil, Warren Usui, Greg Farnum, Ali Maredia, Tommi Virtanen, Zack Cerza, Sam Lang, Yehuda Sadeh, Joe Buck, Josh Durgin) * tests: qa/Fixed upgrade sequence to 10.2.0 -> 10.2.7 -> latest -x (10.2.8) (`issue#20572 `_, `pr#16089 `_, Yuri Weinstein) * tests: qa/suites/upgrade/hammer-x: set "sortbitwise" for jewel clusters (`issue#20342 `_, `pr#15842 `_, Nathan Cutler) * tests: qa/workunits/rados/test-upgrade-\*: whitelist tests for master (part 1) (`issue#20577 `_, `pr#15360 `_, Sage Weil) * tests: qa/workunits/rados/test-upgrade-\*: whitelist tests for master (part 2) (`issue#20576 `_, `pr#15778 `_, Kefu Chai) * tests: qa/workunits/rados/test-upgrade-\*: whitelist tests the right way (`issue#20575 `_, `pr#15824 `_, Kefu Chai) * tests: rados: sleep before ceph tell osd.0 flush_pg_stats after restart (`issue#16239 `_, `issue#20489 `_, `pr#14710 `_, Kefu Chai, Nathan Cutler) * tests: run upgrade/client-upgrade on latest CentOS 7.3 (`issue#20573 `_, `pr#16088 `_, Nathan Cutler) * tests: run-rbd-unit-tests.sh assert in lockdep_will_lock, TestLibRBD.ObjectMapConsistentSnap (`issue#17447 `_, `pr#14150 `_, Jason Dillaman) * tests: systemd test backport to jewel (`issue#19717 `_, `pr#14694 `_, Vasu Kulkarni) * tests: test/librados/tmap_migrate: g_ceph_context->put() upon return (`issue#20579 `_, `pr#14809 `_, Kefu Chai) * tests: test_notify.py: rbd.InvalidArgument: error updating features for image test_notify_clone2 (`issue#19692 `_, `pr#14680 `_, Jason Dillaman) * tests: upgrade/hammer-x failing with OSD has the store locked when Thrasher runs ceph-objectstore-tool on down PG (`issue#19556 `_, `pr#14416 `_, Nathan Cutler) * tests: upgrade:hammer-x/stress-split-erasure-code-x86_64 fails in 10.2.8 integration testing (`issue#20413 `_, `pr#15904 `_, Nathan Cutler) * tools: brag fails to count "in" mds (`issue#19192 `_, `pr#14112 `_, Oleh Prypin, Peng Zhang) * tools: ceph-disk does not support cluster names different than 'ceph' (`issue#17821 `_, `pr#14765 `_, Loic Dachary) * tools: ceph-disk: Racing between partition creation and device node creation (`issue#19428 `_, `pr#14329 `_, Erwan Velu) * tools: ceph-disk: bluestore --setgroup incorrectly set with user (`issue#18955 `_, `pr#13489 `_, craigchi) * tools: ceph-disk: ceph-disk list reports mount error for OSD having mount options with SELinux context (`issue#17331 `_, `pr#14402 `_, Brad Hubbard) * tools: ceph-disk: do not setup_statedir on trigger (`issue#19941 `_, `pr#15504 `_, Loic Dachary) * tools: ceph-disk: enable directory backed OSD at boot time (`issue#19628 `_, `pr#14602 `_, Loic Dachary) * tools: rados: RadosImport::import should return an error if Rados::connect fails (`issue#19319 `_, `pr#14113 `_, Brad Hubbard) v10.2.7 Jewel ============= This point release fixes several important bugs in RBD mirroring, librbd & RGW. We recommend that all v10.2.x users upgrade. For more detailed information, see :download:`the complete changelog `. Notable Changes --------------- * librbd: possible race in ExclusiveLock handle_peer_notification (`issue#19368 `_, `pr#14233 `_, Mykola Golub) * osd: Increase priority for inactive PGs backfill (`issue#18350 `_, `pr#13232 `_, Bartłomiej Święcki) * osd: Scrub improvements and other fixes (`issue#17857 `_, `issue#18114 `_, `issue#13937 `_, `issue#18113 `_, `pr#13146 `_, Kefu Chai, David Zafman) * osd: fix OSD network address in OSD heartbeat_check log message (`issue#18657 `_, `pr#13108 `_, Vikhyat Umrao) * rbd-mirror: deleting a snapshot during sync can result in read errors (`issue#18990 `_, `pr#13596 `_, Jason Dillaman) * rgw: 'period update' does not remove short_zone_ids of deleted zones (`issue#15618 `_, `pr#14140 `_, Casey Bodley) * rgw: DUMPABLE flag is cleared by setuid preventing coredumps (`issue#19089 `_, `pr#13844 `_, Brad Hubbard) * rgw: clear data_sync_cr if RGWDataSyncControlCR fails (`issue#17569 `_, `pr#13886 `_, Casey Bodley) * rgw: fix openssl (`issue#11239 `_, `issue#19098 `_, `issue#16535 `_, `pr#14215 `_, Marcus Watts) * rgw: fix swift cannot disable object versioning with empty X-Versions-Location (`issue#18852 `_, `pr#13823 `_, Jing Wenjun) * rgw: librgw: RGWLibFS::setattr fails on directories (`issue#18808 `_, `pr#13778 `_, Matt Benjamin) * rgw: make sending Content-Length in 204 and 304 controllable (`issue#16602 `_, `pr#13503 `_, Radoslaw Zarzynski, Matt Benjamin) * rgw: multipart uploads copy part support (`issue#12790 `_, `pr#13219 `_, Yehuda Sadeh, Javier M. Mellid, Matt Benjamin) * rgw: multisite: RGWMetaSyncShardControlCR gives up on EIO (`issue#19019 `_, `pr#13867 `_, Casey Bodley) * rgw: radosgw/swift: clean up flush / newline behavior (`issue#18473 `_, `pr#14100 `_, Nathan Cutler, Marcus Watts, Matt Benjamin) * rgw: radosgw/swift: clean up flush / newline behavior. (`issue#18473 `_, `pr#13143 `_, Marcus Watts, Matt Benjamin) * rgw: rgw_fh: RGWFileHandle dtor must also cond-unlink from FHCache (`issue#19112 `_, `pr#14231 `_, Matt Benjamin) * rgw: rgw_file: avoid interning .. in FHCache table and don't ref for them (`issue#19036 `_, `pr#13848 `_, Matt Benjamin) * rgw: rgw_file: interned RGWFileHandle objects need parent refs (`issue#18650 `_, `pr#13583 `_, Matt Benjamin) * rgw: rgw_file: restore (corrected) fix for dir partial match (return of FLAG_EXACT_MATCH) (`issue#19060 `_, `issue#18992 `_, `issue#19059 `_, `pr#13858 `_, Matt Benjamin) * rgw: rgw_file: FHCache residence check should be exhaustive (`issue#19111 `_, `pr#14169 `_, Matt Benjamin) * rgw: rgw_file: ensure valid_s3_object_name for directories, too (`issue#19066 `_, `pr#13717 `_, Matt Benjamin) * rgw: rgw_file: fix marker computation (`issue#19018 `_, `issue#18989 `_, `issue#18992 `_, `issue#18991 `_, `pr#13869 `_, Matt Benjamin) * rgw: rgw_file: wip dir orphan (`issue#18992 `_, `issue#18989 `_, `issue#19018 `_, `issue#18991 `_, `pr#14205 `_, Gui Hecheng, Matt Benjamin) * rgw: rgw_file: various fixes (`pr#14206 `_, Matt Benjamin) * rgw: rgw_file: expand argv (`pr#14230 `_, Matt Benjamin) v10.2.6 Jewel ============= This point release fixes several important bugs in RBD mirroring, RGW multi-site, CephFS, and RADOS. We recommend that all v10.2.x users upgrade. For more detailed information, see :download:`the complete changelog `. OSDs No Longer Send ENXIO by Default ------------------------------------ In previous versions, if a client sent an op to the wrong OSD, the OSD would reply with ENXIO. The rationale here is that the client or OSD is clearly buggy and we want to surface the error as clearly as possible. We now only send the ENXIO reply if the osd_enxio_on_misdirected_op option is enabled (it's off by default). This means that a VM using librbd that previously would have gotten an EIO and gone read-only will now see a blocked/hung IO instead. Other Notable Changes --------------------- * build/ops: add hostname sanity check to run-{c}make-check.sh (`issue#18134 `_, `pr#12302 `_, Nathan Cutler) * build/ops: add ldap lib to rgw lib deps based on build config (`issue#17313 `_, `pr#13183 `_, Nathan Cutler) * build/ops: ceph-create-keys loops forever (`issue#17753 `_, `pr#11884 `_, Alfredo Deza) * build/ops: ceph daemons DUMPABLE flag is cleared by setuid preventing coredumps (`issue#17650 `_, `pr#11736 `_, Patrick Donnelly) * build/ops: fixed compilation error when --with-radowsgw=no (`issue#18512 `_, `pr#12729 `_, Pan Liu) * build/ops: fixed the issue when --disable-server, compilation fails. (`issue#18120 `_, `pr#12239 `_, Pan Liu) * build/ops: fix undefined crypto references with --with-xio (`issue#18133 `_, `pr#12296 `_, Nathan Cutler) * build/ops: install-deps.sh based on /etc/os-release (`issue#18466 `_, `issue#18198 `_, `pr#12405 `_, Jan Fajerski, Nitin A Kamble, Nathan Cutler) * build/ops: Remove the runtime dependency on lsb_release (`issue#17425 `_, `pr#11875 `_, John Coyle, Brad Hubbard) * build/ops: rpm: /etc/ceph/rbdmap is packaged with executable access rights (`issue#17395 `_, `pr#11855 `_, Ken Dreyer) * build/ops: selinux: Allow ceph to manage tmp files (`issue#17436 `_, `pr#13048 `_, Boris Ranto) * build/ops: systemd: Restart Mon after 10s in case of failure (`issue#18635 `_, `pr#13058 `_, Wido den Hollander) * build/ops: systemd restarts Ceph Mon to quickly after failing to start (`issue#18635 `_, `pr#13184 `_, Wido den Hollander) * ceph-disk: fix flake8 errors (`issue#17898 `_, `pr#11976 `_, Ken Dreyer) * cephfs: fuse client crash when adding a new osd (`issue#17270 `_, `pr#11860 `_, John Spray) * cli: ceph-disk: convert none str to str before printing it (`issue#18371 `_, `pr#13187 `_, Kefu Chai) * client: Fix lookup of "/.." in jewel (`issue#18408 `_, `pr#12766 `_, Jeff Layton) * client: fix stale entries in command table (`issue#17974 `_, `pr#12137 `_, John Spray) * client: populate metadata during mount (`issue#18361 `_, `pr#13085 `_, John Spray) * cli: implement functionality for adding, editing and removing omap values with binary keys (`issue#18123 `_, `pr#12755 `_, Jason Dillaman) * common: Improve linux dcache hash algorithm (`issue#17599 `_, `pr#11529 `_, Yibo Cai) * common: utime.h: fix timezone issue in round_to_* funcs. (`issue#14862 `_, `pr#11508 `_, Zhao Chao) * doc: Python Swift client commands in Quick Developer Guide don't match configuration in vstart.sh (`issue#17746 `_, `pr#13043 `_, Ronak Jain) * librbd: allow to open an image without opening parent image (`issue#18325 `_, `pr#13130 `_, Ricardo Dias) * librbd: metadata_set API operation should not change global config setting (`issue#18465 `_, `pr#13168 `_, Mykola Golub) * librbd: new API method to force break a peer's exclusive lock (`issue#15632 `_, `issue#16773 `_, `issue#17188 `_, `issue#16988 `_, `issue#17210 `_, `issue#17251 `_, `issue#18429 `_, `issue#17227 `_, `issue#18327 `_, `issue#17015 `_, `pr#12890 `_, Danny Al-Gaaf, Mykola Golub, Jason Dillaman) * librbd: properly order concurrent updates to the object map (`issue#16176 `_, `pr#12909 `_, Jason Dillaman) * librbd: restore journal access when force disabling mirroring (`issue#17588 `_, `pr#11916 `_, Mykola Golub) * mds: Cannot create deep directories when caps contain path=/somepath (`issue#17858 `_, `pr#12154 `_, Patrick Donnelly) * mds: cephfs metadata pool: deep-scrub error omap_digest != best guess omap_digest (`issue#17177 `_, `pr#12380 `_, Yan, Zheng) * mds: cephfs test failures (ceph.com/qa is broken, should be download.ceph.com/qa) (`issue#18574 `_, `pr#13023 `_, John Spray) * mds: ceph-fuse crash during snapshot tests (`issue#18460 `_, `pr#13120 `_, Yan, Zheng) * mds: ceph_volume_client: fix recovery from partial auth update (`issue#17216 `_, `pr#11656 `_, Ramana Raja) * mds: ceph_volume_client.py : Error: Can't handle arrays of non-strings (`issue#17800 `_, `pr#12325 `_, Ramana Raja) * mds: Cleanly reject session evict command when in replay (`issue#17801 `_, `pr#12153 `_, Yan, Zheng) * mds: client segfault on ceph_rmdir path / (`issue#9935 `_, `pr#13029 `_, Michal Jarzabek) * mds: Clients without pool-changing caps shouldn't be allowed to change pool_namespace (`issue#17798 `_, `pr#12155 `_, John Spray) * mds: Decode errors on backtrace will crash MDS (`issue#18311 `_, `pr#12836 `_, Nathan Cutler, John Spray) * mds: false failing to respond to cache pressure warning (`issue#17611 `_, `pr#11861 `_, Yan, Zheng) * mds: finish clientreplay requests before requesting active state (`issue#18461 `_, `pr#13113 `_, Yan, Zheng) * mds: fix incorrect assertion in Server::_dir_is_nonempty() (`issue#18578 `_, `pr#13459 `_, Yan, Zheng) * mds: fix MDSMap upgrade decoding (`issue#17837 `_, `pr#13139 `_, John Spray, Patrick Donnelly) * mds: fix missing ll_get for ll_walk (`issue#18086 `_, `pr#13125 `_, Gui Hecheng) * mds: Fix mount root for ceph_mount users and change tarball format (`issue#18312 `_, `issue#18254 `_, `pr#12592 `_, Jeff Layton) * mds: fix null pointer dereference in Locker::handle_client_caps (`issue#18306 `_, `pr#13060 `_, Yan, Zheng) * mds: lookup of /.. in returns -ENOENT (`issue#18408 `_, `pr#12783 `_, Jeff Layton) * mds: MDS crashes on missing metadata object (`issue#18179 `_, `pr#13119 `_, Yan, Zheng) * mds: mds fails to respawn if executable has changed (`issue#17531 `_, `pr#11873 `_, Patrick Donnelly) * mds: MDS: false failing to respond to cache pressure warning (`issue#17716 `_, `pr#11856 `_, Yan, Zheng) * mds: MDS goes damaged on blacklist (failed to read JournalPointer: -108 ((108) Cannot send after transport endpoint shutdown) (`issue#17236 `_, `pr#11413 `_, John Spray) * mds: MDS long-time blocked ops. ceph-fuse locks up with getattr of file (`issue#17275 `_, `pr#11858 `_, Yan, Zheng) * mds: speed up readdir by skipping unwanted dn (`issue#18519 `_, `pr#12921 `_, Xiaoxi Chen) * mds: standby-replay daemons can sometimes miss events (`issue#17954 `_, `pr#13126 `_, John Spray) * mon: cache tiering: base pool last_force_resend not respected (racing read got wrong version) (`issue#18366 `_, `pr#13115 `_, Sage Weil) * mon: ceph osd down detection behaviour (`issue#18104 `_, `pr#12677 `_, xie xingguo) * mon: Error EINVAL: removing mon.a at 172.21.15.16:6789/0, there will be 1 monitors (`issue#17725 `_, `pr#11999 `_, Joao Eduardo Luis) * mon: health does not report pgs stuck in more than one state (`issue#17515 `_, `pr#11660 `_, Sage Weil) * mon: monitor assertion failure when deactivating mds in (invalid) fscid 0 (`issue#17518 `_, `pr#11862 `_, Patrick Donnelly) * mon: monitor cannot start because of FAILED assert(info.state == MDSMap::STATE_STANDBY) (`issue#18166 `_, `pr#13123 `_, John Spray, Patrick Donnelly) * mon: osd flag health message is misleading (`issue#18175 `_, `pr#13117 `_, Sage Weil) * mon: OSDMonitor: clear jewel+ feature bits when talking to Hammer OSD (`issue#18582 `_, `pr#13131 `_, Piotr Dałek) * mon: OSDs marked OUT wrongly after monitor failover (`issue#17719 `_, `pr#11947 `_, Dong Wu) * mon: peon wrongly delete routed pg stats op before receive pg stats ack (`issue#18458 `_, `pr#13045 `_, Mingxin Liu) * mon: send updated monmap to its subscribers (`issue#17558 `_, `pr#11743 `_, Kefu Chai) * msgr: don't truncate message sequence to 32-bits (`issue#16122 `_, `pr#12416 `_, Yan, Zheng) * msgr: msg/simple: clear_pipe when wait() is mopping up pipes (`issue#15784 `_, `pr#13062 `_, Sage Weil) * msgr: msg/simple/Pipe: error decoding addr (`issue#18072 `_, `pr#12291 `_, Sage Weil) * osd: Add config option to disable new scrubs during recovery (`issue#17866 `_, `pr#11944 `_, Wido den Hollander) * osd: collection_list shadow return value # (`issue#17713 `_, `pr#11737 `_, Haomai Wang) * osd: do not send ENXIO on misdirected op by default (`issue#18751 `_, `pr#13255 `_, Sage Weil) * osd: FileStore: fiemap cannot be totally retrieved in xfs when the number of extents > 1364 (`issue#17610 `_, `pr#11998 `_, Kefu Chai, Ning Yao) * osd: leveldb corruption leads to Operation not permitted not handled and assert (`issue#18037 `_, `pr#12789 `_, Nathan Cutler) * osd: limit omap data in push op (`issue#16128 `_, `pr#11991 `_, Wanlong Gao) * osd: osd crashes when radosgw-admin bi list --max-entries=1 command runing (`issue#17745 `_, `pr#11758 `_, weiqiaomiao) * osd: osd_max_backfills default has changed, documentation should reflect that. (`issue#17701 `_, `pr#11735 `_, huangjun) * osd: OSDMonitor: only reject MOSDBoot based on up_from if inst matches (`issue#17899 `_, `pr#12868 `_, Samuel Just) * osd: osd/PG: publish PG stats when backfill-related states change (`issue#18369 `_, `pr#12875 `_, Alexey Sheplyakov, Sage Weil) * osd: Remove extra call to reg_next_scrub() during splits (`issue#16474 `_, `pr#11606 `_, David Zafman) * osd: Revert "Merge pull request #12978 from asheplyakov/jewel-18581" (`issue#18809 `_, `pr#13280 `_, Samuel Just) * osd: update_log_missing does not order correctly with osd_ops (`issue#17789 `_, `pr#11997 `_, Samuel Just) * qa/tasks: backport rbd_fio fixes to jewel (`issue#13512 `_, `pr#13104 `_, Ilya Dryomov) * qa/tasks/workunits: backport misc fixes to jewel (`issue#18336 `_, `pr#12912 `_, Sage Weil) * rados: crash adding snap to purged_snaps in ReplicatedPG::WaitingOnReplicas (part 2) (`issue#15943 `_, `issue#18504 `_, `pr#12791 `_, Samuel Just) * rados: Memory leaks in object_list_begin and object_list_end (`issue#18252 `_, `pr#13118 `_, Brad Hubbard) * rados: The request lock RPC message might be incorrectly ignored (`issue#17030 `_, `pr#10865 `_, Jason Dillaman) * rbd: add image id block name prefix APIs (`issue#18270 `_, `pr#12529 `_, Jason Dillaman) * rbd: add max_part and nbds_max options in rbd nbd map, in order to keep consistent with (`issue#18186 `_, `pr#12426 `_, Pan Liu) * rbd: Attempting to remove an image w/ incompatible features results in partial removal (`issue#18315 `_, `pr#13156 `_, Dongsheng Yang) * rbd: bench-write will crash if --io-size is 4G (`issue#18422 `_, `pr#13129 `_, Gaurav Kumar Garg) * rbd: diff calculate can hide parent extents when examining first snapshot in clone (`issue#18068 `_, `pr#12322 `_, Jason Dillaman) * rbd: Exclusive lock improperly initialized on read-only image when using snap_set API (`issue#17618 `_, `pr#11852 `_, Jason Dillaman) * rbd: FAILED assert(m_processing == 0) while running test_lock_fence.sh (`issue#17973 `_, `pr#12323 `_, Venky Shankar) * rbd: Improve error reporting from rbd feature enable/disable (`issue#16985 `_, `pr#13157 `_, Gaurav Kumar Garg) * rbd: JournalMetadata flooding with errors when being blacklisted (`issue#18243 `_, `pr#12739 `_, Jason Dillaman) * rbd: librbd: use proper snapshot when computing diff parent overlap (`issue#18200 `_, `pr#12649 `_, Xiaoxi Chen) * rbd: partition func should be enabled When load nbd.ko for rbd-nbd (`issue#18115 `_, `pr#12754 `_, Pan Liu) * rbd: Potential race when removing two-way mirroring image (`issue#18447 `_, `pr#13233 `_, Mykola Golub) * rbd: [qa] crash in journal-enabled fsx run (`issue#18618 `_, `pr#13128 `_, Jason Dillaman) * rbd: 'rbd du' of missing image does not return error (`issue#16987 `_, `pr#11854 `_, Dongsheng Yang) * rbd: rbd-mirror: gmock warnings in bootstrap request unit tests (`issue#18048 `_, `issue#18012 `_, `issue#18156 `_, `issue#16991 `_, `issue#18051 `_, `pr#12425 `_, Mykola Golub) * rbd: rbd-mirror: image sync object map reload logs message (`issue#16179 `_, `pr#12753 `_, runsisi) * rbd: rbd-mirror: snap protect of non-layered image results in split-brain (`issue#16962 `_, `pr#11869 `_, Mykola Golub) * rbd: [rbd-mirror] sporadic image replayer shut down failure (`issue#18441 `_, `pr#13155 `_, Jason Dillaman) * rbd: rbd-nbd: disallow mapping images >2TB in size (`issue#17219 `_, `pr#11870 `_, Mykola Golub) * rbd: rbd-nbd: invalid error code for "failed to read nbd request" messages (`issue#18242 `_, `pr#12756 `_, Mykola Golub) * rbd: status json format has duplicated/overwritten key (`issue#18261 `_, `pr#12741 `_, Mykola Golub) * rbd: TestLibRBD.DiscardAfterWrite doesn't handle rbd_skip_partial_discard = true (`issue#17750 `_, `pr#11853 `_, Jason Dillaman) * rbd: truncate can cause unflushed snapshot data lose (`issue#17193 `_, `pr#12324 `_, Yan, Zheng) * : ReplicatedBackend: take read locks for clone sources during recovery (`issue#17831 `_, `issue#18583 `_, `pr#12978 `_, Samuel Just) * rgw: add option to log custom HTTP headers (rgw_log_http_headers) (`issue#18891 `_, `pr#12490 `_, Matt Benjamin) * rgw: add suport for Swift-at-root dependent features of Swift API (`issue#18526 `_, `issue#16673 `_, `pr#11497 `_, Pritha Srivastava, Radoslaw Zarzynski, Pete Zaitcev, Abhishek Lekshmanan) * rgw: add support for the prefix parameter in account listing of Swift API (`issue#17931 `_, `pr#12258 `_, Radoslaw Zarzynski) * rgw: Add workaround for upgrade issues for older jewel versions (`issue#17820 `_, `pr#12316 `_, Orit Wasserman) * rgw: be aware abount tenants on cls_user_bucket -> rgw_bucket conversion (`issue#18364 `_, `issue#16355 `_, `pr#13276 `_, Radoslaw Zarzynski) * rgw: bucket check remove _multipart_ prefix (`issue#13724 `_, `pr#11470 `_, Weijun Duan) * rgw: bucket resharding (`issue#17549 `_, `issue#17550 `_, `pr#13341 `_, Yehuda Sadeh, Robin H. Johnson) * rgw: disable virtual hosting of buckets when no hostnames are configured (`issue#17440 `_, `issue#15975 `_, `issue#17136 `_, `pr#11760 `_, Casey Bodley, Robin H. Johnson) * rgw: do not abort when accept a CORS request with short origin (`issue#18187 `_, `pr#12397 `_, LiuYang) * rgw: don't store empty chains in gc (`issue#17897 `_, `pr#12174 `_, Yehuda Sadeh) * rgw:fix for deleting objects name beginning and ending with underscores of one bucket using POST method of js sdk. (`issue#17888 `_, `pr#12320 `_, Casey Bodley) * rgw: fix period update crash (`issue#18631 `_, `pr#13273 `_, Orit Wasserman) * rgw: fix put_acls for objects starting and ending with underscore (`issue#17625 `_, `pr#11675 `_, Orit Wasserman) * rgw: fix use of marker in List::list_objects() (`issue#18331 `_, `pr#13358 `_, Yehuda Sadeh) * rgw: for the create_bucket api, if the input creation_time is zero, we … (`issue#16597 `_, `pr#11990 `_, weiqiaomiao) * rgw: Have a flavor of bucket deletion in radosgw-admin to bypass garbage collection (`issue#15557 `_, `pr#10661 `_, Pavan Rallabhandi) * rgw: json encode/decode of RGWBucketInfo missing index_type field (`issue#17755 `_, `pr#11759 `_, Yehuda Sadeh) * rgw: ldap: enforce simple_bind w/LDAPv3 redux (`issue#18339 `_, `pr#12678 `_, Weibing Zhang) * rgw: leak from RGWMetaSyncShardCR::incremental_sync (`issue#18412 `_, `issue#18300 `_, `pr#13004 `_, Casey Bodley, Sage Weil) * rgw: leak in RGWFetchAllMetaCR (`issue#17812 `_, `pr#11872 `_, Casey Bodley) * rgw: librgw: objects created from s3 apis are not visible from nfs mount point (`issue#18651 `_, `pr#13177 `_, Matt Benjamin) * rgw: log name instead of id for SystemMetaObj on failure (`issue#15776 `_, `pr#12622 `_, Wido den Hollander, Abhishek Lekshmanan) * rgw: multimds: mds entering up:replay and processing down mds aborts (`issue#17670 `_, `pr#11857 `_, Patrick Donnelly) * rgw: multipart upload copy (`issue#12790 `_, `pr#13068 `_, Yehuda Sadeh, Javier M. Mellid, Matt Benjamin) * rgw: multisite: after finishing full sync on a bucket, incremental sync starts over from the beginning (`issue#17661 `_, `issue#17624 `_, `pr#11864 `_, Zengran Zhang, Casey Bodley) * rgw: multisite: assert(next) failed in RGWMetaSyncCR (`issue#17044 `_, `pr#11477 `_, Casey Bodley) * rgw: multisite: coroutine deadlock assertion on error in FetchAllMetaCR (`issue#17571 `_, `pr#11866 `_, Casey Bodley) * rgw: multisite: coroutine deadlock in RGWMetaSyncCR after ECANCELED errors (`issue#17465 `_, `pr#12738 `_, Casey Bodley) * rgw: multisite doesn't retry RGWFetchAllMetaCR on failed lease (`issue#17047 `_, `pr#11476 `_, Casey Bodley) * rgw: multisite: ECANCELED & 500 error on bucket delete (`issue#17698 `_, `pr#12044 `_, Casey Bodley) * rgw: multisite: failed assertion in 'radosgw-admin bucket sync status' (`issue#18083 `_, `pr#12314 `_, Casey Bodley) * rgw: multisite: fix ref counting of completions (`issue#17792 `_, `issue#18414 `_, `issue#17793 `_, `issue#18407 `_, `pr#13001 `_, Casey Bodley) * rgw: multisite: metadata master can get the wrong value for 'oldest_log_period' (`issue#16894 `_, `pr#11868 `_, Casey Bodley) * rgw: multisite: obsolete 'radosgw-admin period prepare' command (`issue#17387 `_, `pr#11574 `_, Gaurav Kumar Garg) * rgw: multisite: race between ReadSyncStatus and InitSyncStatus leads to EIO errors (`issue#17568 `_, `pr#11865 `_, Casey Bodley) * rgw: multisite requests failing with '400 Bad Request' with civetweb 1.8 (`issue#17822 `_, `pr#12313 `_, Casey Bodley) * rgw: multisite: segfault after changing value of rgw_data_log_num_shards (`issue#18488 `_, `pr#13180 `_, Casey Bodley) * rgw: multisite: sync status reports master is on a different period (`issue#18064 `_, `pr#13175 `_, Abhishek Lekshmanan) * rgw: multisite upgrade from hammer -> jewel ignores rgw_region_root_pool (`issue#17963 `_, `pr#12156 `_, Casey Bodley) * rgw: radosgw-admin period update reverts deleted zonegroup (`issue#17239 `_, `pr#13171 `_, Orit Wasserman) * rgw: Realm set does not create a new period (`issue#18333 `_, `pr#13182 `_, Orit Wasserman) * rgw: remove spurious mount entries for RGW buckets (`issue#17850 `_, `pr#12045 `_, Matt Benjamin) * rgw: Replacing '+' with "%20" in canonical uri for s3 v4 auth. (`issue#17076 `_, `pr#12542 `_, Pritha Srivastava) * rgw: rgw-admin: missing command to modify placement targets (`issue#18078 `_, `pr#12428 `_, Yehuda Sadeh, Casey Bodley) * rgw: RGWRados::get_system_obj() sends unnecessary stat request before read (`issue#17580 `_, `pr#11867 `_, Casey Bodley) * rgw: rgw_rest_s3: apply missed base64 try-catch (`issue#17663 `_, `pr#11672 `_, Matt Benjamin) * rgw: RGW will not list Argonaut-era bucket via HTTP (but radosgw-admin works) (`issue#17372 `_, `pr#11863 `_, Yehuda Sadeh) * rgw: sends omap_getvals with (u64)-1 limit (`issue#17985 `_, `pr#12419 `_, Yehuda Sadeh, Sage Weil) * rgw: slave zonegroup cannot enable the bucket versioning (`issue#18003 `_, `pr#13173 `_, Orit Wasserman) * rgw: TempURL properly handles accounts created with the implicit tenant (`issue#17961 `_, `pr#12079 `_, Radoslaw Zarzynski) * rgw: the value of total_time is wrong in the result of 'radosgw-admin log show' opt (`issue#17598 `_, `pr#11876 `_, weiqiaomiao) * rgw: Unable to commit period zonegroup change (`issue#17364 `_, `pr#12315 `_, Orit Wasserman) * rgw: valgrind "invalid read size 4" RGWGetObj (`issue#18071 `_, `pr#12997 `_, Matt Benjamin) * rgw: work around curl_multi_wait bug with non-blocking reads (`issue#15915 `_, `issue#16368 `_, `issue#16695 `_, `pr#11627 `_, John Coyle, Casey Bodley) * tests: add require_jewel_osds before upgrading last hammer node (`issue#18719 `_, `pr#13161 `_, Nathan Cutler) * tests: add require_jewel_osds to upgrade/hammer-x/tiering (`issue#18920 `_, `pr#13404 `_, Nathan Cutler) * tests: assertion failure in a radosgw-admin related task (`issue#17167 `_, `pr#12764 `_, Orit Wasserman) * tests: Cannot reserve CentOS 7.2 smithi machines (`issue#18416 `_, `issue#18401 `_, `pr#13050 `_, Nathan Cutler, Sage Weil, Yuri Weinstein) * tests: ignore bogus ceph-objectstore-tool error in ceph_manager (`issue#16263 `_, `pr#13240 `_, Nathan Cutler, Kefu Chai) * tests: objecter_requests workunit fails on wip branches (`issue#18393 `_, `pr#12761 `_, Sage Weil) * tests: qa/suites/upgrade/hammer-x: break stress split ec symlinks (`issue#19006 `_, `pr#13533 `_, Nathan Cutler) * tests: qa/suites/upgrade/hammer-x/stress-split: finish thrashing before final upgrade (`issue#19004 `_, `pr#13222 `_, Sage Weil) * tests: qa/tasks/ceph_deploy.py: use dev option (`issue#18736 `_, `pr#13106 `_, Vasu Kulkarni) * tests: qa/workunits/rbd: use more recent qemu-iotests that support Xenial (`issue#18149 `_, `issue#10773 `_, `pr#13103 `_, Jason Dillaman) * tests: remove qa/suites/buildpackages (`issue#18846 `_, `pr#13299 `_, Loic Dachary) * tests: SUSE yaml facets in qa/distros/all are out of date (`issue#18856 `_, `issue#18846 `_, `pr#13331 `_, Nathan Cutler) * tests: update rbd/singleton/all/formatted-output.yaml to support ceph-ci (`issue#18440 `_, `pr#12822 `_, Nathan Cutler, Venky Shankar) * tests: update Ubuntu image url after ceph.com refactor (`issue#18542 `_, `pr#12959 `_, Jason Dillaman) * tests: upgrade:hammer-x: install firefly only on Ubuntu 14.04 (`issue#18089 `_, `pr#13153 `_, Nathan Cutler) * tests: use ceph-jewel branch for s3tests (`issue#18384 `_, `pr#12745 `_, Nathan Cutler) * tests: Workunits needlessly wget from git.ceph.com (`issue#18336 `_, `issue#18271 `_, `issue#18388 `_, `pr#12686 `_, Nathan Cutler, Sage Weil) * test: temporarily disable fork()'ing tests (`issue#16556 `_, `issue#17832 `_, `pr#11953 `_, John Spray) * test: test fails due to The UNIX domain socket path (`issue#16014 `_, `pr#12151 `_, Loic Dachary) * tools: ceph-disk: ceph-disk@.service races with ceph-osd@.service (`issue#17889 `_, `issue#17813 `_, `pr#12147 `_, Loic Dachary) * tools: ceph-disk --dmcrypt create must not require admin key (`issue#17849 `_, `pr#12033 `_, Loic Dachary) * tools: ceph-disk prepare writes osd log 0 with root owner (`issue#18538 `_, `pr#13025 `_, Samuel Matzek) * tools: crushtool --compile is create output despite of missing item (`issue#17306 `_, `pr#11410 `_, Kefu Chai) * tools: rados bench seq must verify the hostname (`issue#17526 `_, `pr#13049 `_, Loic Dachary) * tools: snapshotted RBD extent objects can't be manually evicted from a cache tier (`issue#17896 `_, `pr#11968 `_, Mingxin Liu) * tools: systemd/ceph-disk: reduce ceph-disk flock contention (`issue#18049 `_, `issue#13160 `_, `pr#12210 `_, David Disseldorp) v10.2.5 Jewel ============= This point release fixes an important `regression introduced in v10.2.4 `_. We recommend that all v10.2.x users upgrade. Notable Changes --------------- For more detailed information, see :download:`the complete changelog `. * msg/simple/Pipe: avoid returning 0 on poll timeout (`issue#18185 `_, `pr#12376 `_, Sage Weil) v10.2.4 Jewel ============= This point release fixes several important bugs in RBD mirroring, RGW multi-site, CephFS, and RADOS. We recommend that all v10.2.x users upgrade. Also note the following when upgrading from hammer Upgrading from hammer --------------------- When the last hammer OSD in a cluster containing jewel MONs is upgraded to jewel, as of 10.2.4 the jewel MONs will issue this warning: "all OSDs are running jewel or later but the 'require_jewel_osds' osdmap flag is not set" and change the cluster health status to HEALTH_WARN. This is a signal for the admin to do "ceph osd set require_jewel_osds" - by doing this, the upgrade path is complete and no more pre-Jewel OSDs may be added to the cluster. Notable Changes --------------- For more detailed information, see :download:`the complete changelog `. * build/ops: aarch64: Compiler-based detection of crc32 extended CPU type is broken (`issue#17516 `_, `pr#11492 `_, Alexander Graf) * build/ops: allow building RGW with LDAP disabled (`issue#17312 `_, `pr#11478 `_, Daniel Gryniewicz) * build/ops: backport 'logrotate: Run as root/ceph' (`issue#17381 `_, `pr#11201 `_, Boris Ranto) * build/ops: ceph installs stuff in %_udevrulesdir but does not own that directory (`issue#16949 `_, `pr#10862 `_, Nathan Cutler) * build/ops: ceph-osd-prestart.sh fails confusingly when data directory does not exist (`issue#17091 `_, `pr#10812 `_, Nathan Cutler) * build/ops: disable LTTng-UST in openSUSE builds (`issue#16937 `_, `pr#10794 `_, Michel Normand) * build/ops: i386 tarball gitbuilder failure on master (`issue#16398 `_, `pr#10855 `_, Vikhyat Umrao, Kefu Chai) * build/ops: include more files in "make dist" tarball (`issue#17560 `_, `pr#11431 `_, Ken Dreyer) * build/ops: incorrect value of CINIT_FLAG_DEFER_DROP_PRIVILEGES (`issue#16663 `_, `pr#10278 `_, Casey Bodley) * build/ops: remove SYSTEMD_RUN from initscript (`issue#7627 `_, `issue#16441 `_, `issue#16440 `_, `pr#9872 `_, Vladislav Odintsov) * build/ops: systemd: add install section to rbdmap.service file (`issue#17541 `_, `pr#11158 `_, Jelle vd Kooij) * common: Enable/Disable of features is allowed even the features are already enabled/disabled (`issue#16079 `_, `pr#11460 `_, Lu Shi) * common: Log.cc: Assign LOG_INFO priority to syslog calls (`issue#15808 `_, `pr#11231 `_, Brad Hubbard) * common: Proxied operations shouldn't result in error messages if replayed (`issue#16130 `_, `pr#11461 `_, Vikhyat Umrao) * common: Request exclusive lock if owner sends -ENOTSUPP for proxied maintenance op (`issue#16171 `_, `pr#10784 `_, Jason Dillaman) * common: msgr/async: Messenger thread long time lock hold risk (`issue#15758 `_, `pr#10761 `_, Wei Jin) * doc: fix description for rsize and rasize (`issue#17357 `_, `pr#11171 `_, Andreas Gerstmayr) * filestore: can get stuck in an unbounded loop during scrub (`issue#17859 `_, `pr#12001 `_, Sage Weil) * fs: Failure in snaptest-git-ceph.sh (`issue#17172 `_, `pr#11419 `_, Yan, Zheng) * fs: Log path as well as ino when detecting metadata damage (`issue#16973 `_, `pr#11418 `_, John Spray) * fs: client: FAILED assert(root_ancestor->qtree == __null) (`issue#16066 `_, `issue#16067 `_, `pr#10107 `_, Yan, Zheng) * fs: client: add missing client_lock for get_root (`issue#17197 `_, `pr#10921 `_, Patrick Donnelly) * fs: client: fix shutdown with open inodes (`issue#16764 `_, `pr#10958 `_, John Spray) * fs: client: nlink count is not maintained correctly (`issue#16668 `_, `pr#10877 `_, Jeff Layton) * fs: multimds: allow_multimds not required when max_mds is set in ceph.conf at startup (`issue#17105 `_, `pr#10997 `_, Patrick Donnelly) * librados: memory leaks from ceph::crypto (WITH_NSS) (`issue#17205 `_, `pr#11409 `_, Casey Bodley) * librados: modify Pipe::connect() to return the error code (`issue#15308 `_, `pr#11193 `_, Vikhyat Umrao) * librados: remove new setxattr overload to avoid breaking the C++ ABI (`issue#18058 `_, `pr#12207 `_, Josh Durgin) * librbd: cannot disable journaling or remove non-mirrored, non-primary image (`issue#16740 `_, `pr#11337 `_, Jason Dillaman) * librbd: discard after write can result in assertion failure (`issue#17695 `_, `pr#11644 `_, Jason Dillaman) * librbd::Operations: update notification failed: (2) No such file or directory (`issue#17549 `_, `pr#11420 `_, Jason Dillaman) * mds: Crash in Client::_invalidate_kernel_dcache when reconnecting during unmount (`issue#17253 `_, `pr#11414 `_, Yan, Zheng) * mds: Duplicate damage table entries (`issue#17173 `_, `pr#11412 `_, John Spray) * mds: Failure in dirfrag.sh (`issue#17286 `_, `pr#11416 `_, Yan, Zheng) * mds: Failure in snaptest-git-ceph.sh (`issue#17271 `_, `pr#11415 `_, Yan, Zheng) * mon: Ceph Status - Segmentation Fault (`issue#16266 `_, `pr#11408 `_, Brad Hubbard) * mon: Display full flag in ceph status if full flag is set (`issue#15809 `_, `pr#9388 `_, Vikhyat Umrao) * mon: Error EINVAL: removing mon.a at 172.21.15.16:6789/0, there will be 1 monitors (`issue#17725 `_, `pr#12267 `_, Joao Eduardo Luis) * mon: OSDMonitor: only reject MOSDBoot based on up_from if inst matches (`issue#17899 `_, `pr#12067 `_, Samuel Just) * mon: OSDMonitor: Missing nearfull flag set (`issue#17390 `_, `pr#11272 `_, Igor Podoski) * mon: Upgrading 0.94.6 -> 0.94.9 saturating mon node networking (`issue#17365 `_, `issue#17386 `_, `pr#11679 `_, Sage Weil, xie xingguo) * mon: ceph mon Segmentation fault after set crush_ruleset ceph 10.2.2 (`issue#16653 `_, `pr#10861 `_, song baisen) * mon: crash: crush/CrushWrapper.h: 940: FAILED assert(successful_detach) (`issue#16525 `_, `pr#10496 `_, Kefu Chai) * mon: don't crash on invalid standby_for_fscid (`issue#17466 `_, `pr#11389 `_, John Spray) * mon: fix missing osd metadata (again) (`issue#17685 `_, `pr#11642 `_, John Spray) * mon: osdmonitor: decouple adjust_heartbeat_grace and min_down_reporters (`issue#17055 `_, `pr#10757 `_, Zengran Zhang) * mon: the %USED of ceph df is wrong (`issue#16933 `_, `pr#10860 `_, Kefu Chai) * osd: condition OSDMap encoding on features (`issue#18015 `_, `pr#12167 `_, Sage Weil) * osd: PG::_update_calc_stats wrong for CRUSH_ITEM_NONE up set items (`issue#16998 `_, `pr#10883 `_, Samuel Just) * osd: PG::choose_acting valgrind error or ./common/hobject.h: 182: FAILED assert(!max || (\*this == hobject_t(hobject_t::get_max()))) (`issue#13967 `_, `pr#10885 `_, Tao Chang) * osd: Potential crash during journal::Replay shut down (`issue#16433 `_, `pr#10645 `_, Jason Dillaman) * osd: add peer_addr in heartbeat_check log message (`issue#15762 `_, `pr#9739 `_, Vikhyat Umrao, Sage Weil) * osd: adjust scrub boundary to object without SnapSet (`issue#17470 `_, `pr#11311 `_, Samuel Just) * osd: ceph osd df does not show summarized info correctly if one or more OSDs are out (`issue#16706 `_, `pr#10759 `_, xie xingguo) * osd: journal: do not prematurely flag object recorder as closed (`issue#17590 `_, `pr#11634 `_, Jason Dillaman) * osd: mark_all_unfound_lost() leaves unapplied changes (`issue#16156 `_, `pr#10886 `_, Samuel Just) * osd: segfault in ObjectCacher::FlusherThread (`issue#16610 `_, `pr#10864 `_, Yan, Zheng) * qa: remove EnumerateObjects from librados upgrade tests (`pr#11728 `_, Josh Durgin) * rbd: Disabling pool mirror mode with registered peers results orphaned mirrored images (`issue#16984 `_, `pr#10857 `_, Jason Dillaman) * rbd: ImageWatcher: use after free within C_UnwatchAndFlush (`issue#17289 `_, `issue#17254 `_, `pr#11466 `_, Jason Dillaman) * rbd: Prevent the creation of a clone from a non-primary mirrored image (`issue#16449 `_, `pr#10650 `_, Mykola Golub) * rbd: RBD should restrict mirror enable/disable actions on parents/clones (`issue#16056 `_, `pr#11459 `_, zhuangzeqiang) * rbd: TestJournalReplay: sporadic assert(m_state == STATE_READY || m_state == STATE_STOPPING) failure (`issue#17566 `_, `pr#11590 `_, Jason Dillaman) * rbd: bench io-size should not be larger than image size (`issue#16967 `_, `pr#10796 `_, Jason Dillaman) * rbd: ceph 10.2.2 rbd status on image format 2 returns (2) No such file or directory (`issue#16887 `_, `pr#10652 `_, Jason Dillaman) * rbd: helgrind: TestLibRBD.TestIOPP potential deadlock closing an image with read-ahead enabled (`issue#17198 `_, `pr#11463 `_, Jason Dillaman) * rbd: image.stat() call in librbdpy fails sometimes (`issue#17310 `_, `pr#11464 `_, Jason Dillaman) * rbd: krbd qa scripts and concurrent.sh test fix (`issue#17223 `_, `pr#11018 `_, Ilya Dryomov) * rbd: krbd-related CLI patches (`issue#17554 `_, `pr#11400 `_, Ilya Dryomov) * rbd: mirror: improve resiliency of stress test case (`issue#16855 `_, `issue#16555 `_, `issue#14738 `_, `issue#15259 `_, `issue#17446 `_, `issue#17355 `_, `issue#16538 `_, `issue#16974 `_, `issue#17283 `_, `issue#17317 `_, `issue#17416 `_, `issue#16227 `_, `pr#11433 `_, Mykola Golub, Ricardo Dias, Jason Dillaman) * rbd: rbd-nbd IO hang (`issue#16921 `_, `pr#11467 `_, Jason Dillaman) * rbd: update_features API needs to support backwards/forward compatibility (`issue#17330 `_, `pr#11462 `_, Jason Dillaman) * rgw: COPY broke multipart files uploaded under dumpling (`issue#16435 `_, `pr#10866 `_, Yehuda Sadeh) * rgw: Config parameter rgw keystone make new tenants in radosgw multitenancy does not work (`issue#17293 `_, `pr#11473 `_, SirishaGuduru) * rgw: Do not archive metadata by default (`issue#17256 `_, `pr#11321 `_, Pavan Rallabhandi, Matt Benjamin) * rgw: ERROR: got unexpected error when trying to read object: -2 (`issue#17111 `_, `pr#11472 `_, Yang Honggang) * rgw: Modification for TEST S3 ACCESS section in INSTALL CEPH OBJECT GATEWAY page (`issue#15603 `_, `pr#11475 `_, la-sguduru) * rgw: RGW loses realm/period/zonegroup/zone data: period overwritten if somewhere in the cluster is still running Hammer (`issue#17371 `_, `pr#11519 `_, Orit Wasserman) * rgw: RGWDataSyncCR fails on errors from RGWListBucketIndexesCR (`issue#17073 `_, `pr#11330 `_, Casey Bodley) * rgw: S3 object versioning fails when applied on a non-master zone (`issue#16494 `_, `pr#11367 `_, Yehuda Sadeh) * rgw: add orphan options to radosgw-admin --help and man page (`issue#17281 `_, `issue#17280 `_, `pr#11139 `_, Ken Dreyer, Thomas Serlin) * rgw: back off bucket sync on failures, don't store marker (`issue#16742 `_, `pr#11021 `_, Yehuda Sadeh) * rgw: combined LDAP backports (`issue#17544 `_, `issue#17185 `_, `pr#11332 `_, Harald Klein, Matt Benjamin) * rgw: cors auto memleak (`issue#16564 `_, `pr#10656 `_, Yan Jun) * rgw: default quota fixes (`issue#16410 `_, `pr#10832 `_, Pavan Rallabhandi, Daniel Gryniewicz) * rgw: doc: description of multipart part entity is wrong (`issue#17504 `_, `pr#11342 `_, weiqiaomiao) * rgw: don't loop forever when reading data from 0 sized segment. (`issue#17692 `_, `pr#11626 `_, Marcus Watts) * rgw: fix put_acls for objects starting and ending with underscore (`issue#17625 `_, `pr#11669 `_, Orit Wasserman) * rgw: fix regression with handling double underscore (`issue#17443 `_, `issue#16856 `_, `pr#11563 `_, Yehuda Sadeh, Orit Wasserman) * rgw: handle empty POST condition (`issue#17635 `_, `pr#11662 `_, Yehuda Sadeh) * rgw: metadata sync can skip markers for failed/incomplete entries (`issue#16759 `_, `pr#10657 `_, Yehuda Sadeh) * rgw: nfs backports (`issue#17393 `_, `issue#17311 `_, `issue#17367 `_, `issue#17319 `_, `issue#17321 `_, `issue#17322 `_, `issue#17323 `_, `issue#17325 `_, `issue#17326 `_, `issue#17327 `_, `pr#11335 `_, Min Chen, Yan Jun, Weibing Zhang, Matt Benjamin) * rgw: period commit loses zonegroup changes: region_map converted repeatedly (`issue#17051 `_, `pr#10890 `_, Casey Bodley) * rgw: period commit return error when the current period has a zonegroup which doesn't have a master zone (`issue#17110 `_, `pr#10867 `_, weiqiaomiao) * rgw: radosgw daemon core when reopen logs (`issue#17036 `_, `pr#10868 `_, weiqiaomiao) * rgw: rgw file uses too much CPU in gc/idle thread (`issue#16976 `_, `pr#10889 `_, Matt Benjamin) * rgw: s3tests-test-readwrite failing with 500 (`issue#16930 `_, `pr#11471 `_, Yehuda Sadeh) * rgw: upgrade from old multisite to new multisite fails (`issue#16751 `_, `pr#10891 `_, Orit Wasserman) * rgw:response information is error when geting token of swift account (`issue#15195 `_, `pr#11474 `_, Qiankun Zheng) * rgw:user email can modify to empty when it has values (`issue#13286 `_, `pr#11469 `_, Yehuda Sadeh, Weijun Duan) * tests: ceph-disk must ignore debug monc (`issue#17607 `_, `pr#11548 `_, Loic Dachary) * tests: fix TestClsRbd.mirror_image failure in upgrade:jewel-x-master-distro-basic-vps (`issue#16529 `_, `pr#10888 `_, Jason Dillaman) * tests: scsi_debug fails /dev/disk/by-partuuid (`issue#17100 `_, `pr#11411 `_, Loic Dachary) * tests: test/ceph_test_msgr: do not use Message::middle for holding transient… (`issue#17365 `_, `issue#17728 `_, `issue#16955 `_, `pr#11742 `_, Haomai Wang, Kefu Chai, Michal Jarzabek, Sage Weil) * tools: Missing comma in ceph-create-keys causes concatenation of arguments (`issue#17815 `_, `pr#11822 `_, Patrick Donnelly) * tools: add a tool to rebuild mon store from OSD (`issue#17179 `_, `issue#17400 `_, `pr#11126 `_, Kefu Chai, xie xingguo) * tools: ceph-create-keys: sometimes blocks forever if mds allow is set (`issue#16255 `_, `pr#11417 `_, John Spray) * tools: ceph-disk should timeout when a lock cannot be acquired (`issue#16580 `_, `pr#10758 `_, Loic Dachary) * tools: ceph-disk: expected systemd unit failures are confusing (`issue#15990 `_, `pr#10884 `_, Boris Ranto) * tools: ceph-disk: using a regular file as a journal fails (`issue#16280 `_, `issue#17662 `_, `pr#11657 `_, Jayashree Candadai, Anirudha Bose, Loic Dachary, Shylesh Kumar) * tools: ceph-objectstore-tool crashes if --journal-path (`issue#17307 `_, `pr#11407 `_, Kefu Chai) * tools: ceph-objectstore-tool: add a way to split filestore directories offline (`issue#17220 `_, `pr#11252 `_, Josh Durgin) * tools: ceph-post-file: use new ssh key (`issue#14267 `_, `pr#11746 `_, David Galloway) v10.2.3 Jewel ============= This point release fixes several important bugs in RBD mirroring, RGW multi-site, CephFS, and RADOS. We recommend that all v10.2.x users upgrade. For more detailed information, see :download:`the complete changelog `. Notable Changes --------------- * build/ops: 60-ceph-partuuid-workaround-rules still needed by debian jessie (udev 215-17) (`issue#16351 `_, `pr#10653 `_, runsisi, Loic Dachary) * build/ops: ceph Resource Agent does not work with systemd (`issue#14828 `_, `pr#9917 `_, Nathan Cutler) * build/ops: ceph-base requires parted (`issue#16095 `_, `pr#10008 `_, Ken Dreyer) * build/ops: ceph-osd-prestart.sh contains Upstart-specific code (`issue#15984 `_, `pr#10364 `_, Nathan Cutler) * build/ops: mount.ceph: move from ceph-base to ceph-common and add symlink in /sbin for SUSE (`issue#16598 `_, `issue#16645 `_, `pr#10357 `_, Nathan Cutler, Dan Horák, Ricardo Dias, Kefu Chai) * build/ops: need rocksdb commit 7ca731b12ce for ppc64le build (`issue#17092 `_, `pr#10816 `_, Nathan Cutler) * build/ops: rpm: OBS needs ExclusiveArch (`issue#16936 `_, `pr#10614 `_, Michel Normand) * cli: ceph command line tool chokes on ceph –w (the dash is unicode 'en dash' &ndash, copy-paste to reproduce) (`issue#12287 `_, `pr#10420 `_, Oleh Prypin, Kefu Chai) * common: expose buffer const_iterator symbols (`issue#16899 `_, `pr#10552 `_, Noah Watkins) * common: global-init: fixup chown of the run directory along with log and asok files (`issue#15607 `_, `pr#8754 `_, Karol Mroz) * fs: ceph-fuse: link to libtcmalloc or jemalloc (`issue#16655 `_, `pr#10303 `_, Yan, Zheng) * fs: client: crash in unmount when fuse_use_invalidate_cb is enabled (`issue#16137 `_, `pr#10106 `_, Yan, Zheng) * fs: client: fstat cap release (`issue#15723 `_, `pr#9562 `_, Yan, Zheng, Noah Watkins) * fs: essential backports for OpenStack Manila (`issue#15406 `_, `issue#15614 `_, `issue#15615 `_, `pr#10453 `_, John Spray, Ramana Raja, Xiaoxi Chen) * fs: fix double-unlock on shutdown (`issue#17126 `_, `pr#10847 `_, Greg Farnum) * fs: fix mdsmap print_summary with standby replays (`issue#15705 `_, `pr#9547 `_, John Spray) * fs: fuse mounted file systems fails SAMBA CTDB ping_pong rw test with v9.0.2 (`issue#12653 `_, `issue#15634 `_, `pr#10108 `_, Yan, Zheng) * librados: Missing export for rados_aio_get_version in src/include/rados/librados.h (`issue#15535 `_, `pr#9574 `_, Jim Wright) * librados: osd: bad flags can crash the osd (`issue#16012 `_, `pr#9997 `_, Sage Weil) * librbd: Close journal and object map before flagging exclusive lock as released (`issue#16450 `_, `pr#10053 `_, Jason Dillaman) * librbd: Crash when utilizing advisory locking API functions (`issue#16364 `_, `pr#10051 `_, Jason Dillaman) * librbd: ExclusiveLock object leaked when switching to snapshot (`issue#16446 `_, `pr#10054 `_, Jason Dillaman) * librbd: FAILED assert(object_no < m_object_map.size()) (`issue#16561 `_, `pr#10647 `_, Jason Dillaman) * librbd: Image removal doesn't necessarily clean up all rbd_mirroring entries (`issue#16471 `_, `pr#10009 `_, Jason Dillaman) * librbd: Object map/fast-diff invalidated if journal replays the same snap remove event (`issue#16350 `_, `pr#10010 `_, Jason Dillaman) * librbd: Timeout sending mirroring notification shouldn't result in failure (`issue#16470 `_, `pr#10052 `_, Jason Dillaman) * librbd: Whitelist EBUSY error from snap unprotect for journal replay (`issue#16445 `_, `pr#10055 `_, Jason Dillaman) * librbd: cancel all tasks should wait until finisher is done (`issue#16517 `_, `pr#9752 `_, Haomai Wang) * librbd: delay acquiring lock if image watch has failed (`issue#16923 `_, `pr#10827 `_, Jason Dillaman) * librbd: fix missing return statement if failed to get mirror image state (`issue#16600 `_, `pr#10144 `_, runsisi) * librbd: flag image as updated after proxying maintenance op (`issue#16404 `_, `pr#9883 `_, Jason Dillaman) * librbd: mkfs.xfs slow performance with discards and object map (`issue#16707 `_, `issue#16689 `_, `pr#10649 `_, Jason Dillaman) * librbd: potential use after free on refresh error (`issue#16519 `_, `pr#9952 `_, Mykola Golub) * librbd: rbd-nbd does not properly handle resize notifications (`issue#15715 `_, `pr#10679 `_, Mykola Golub) * librbd: the option 'rbd_cache_writethrough_until_flush=true' dosn't work (`issue#16740 `_, `issue#16386 `_, `issue#16708 `_, `issue#16654 `_, `issue#16478 `_, `pr#10797 `_, Mykola Golub, xinxin shu, Xiaowei Chen, Jason Dillaman) * mds: tell command blocks forever with async messenger (TestVolumeClient.test_evict_client failure) (`issue#16288 `_, `pr#10501 `_, Douglas Fuller) * mds: Confusing MDS log message when shut down with stalled journaler reads (`issue#15689 `_, `pr#9557 `_, John Spray) * mds: Deadlock on shutdown active rank while busy with metadata IO (`issue#16042 `_, `pr#10502 `_, Patrick Donnelly) * mds: Failing file operations on kernel based cephfs mount point leaves unaccessible file behind on hammer 0.94.7 (`issue#16013 `_, `pr#10199 `_, Yan, Zheng) * mds: Fix shutting down mds timed-out due to deadlock (`issue#16396 `_, `pr#10500 `_, Zhi Zhang) * mds: MDSMonitor fixes (`issue#16136 `_, `pr#9561 `_, xie xingguo) * mds: MDSMonitor::check_subs() is very buggy (`issue#16022 `_, `pr#10103 `_, Yan, Zheng) * mds: Session::check_access() is buggy (`issue#16358 `_, `pr#10105 `_, Yan, Zheng) * mds: StrayManager.cc: 520: FAILED assert(dnl->is_primary()) (`issue#15920 `_, `pr#9559 `_, Yan, Zheng) * mds: enforce a dirfrag limit on entries (`issue#16164 `_, `pr#10104 `_, Patrick Donnelly) * mds: fix SnapRealm::have_past_parents_open() (`issue#16299 `_, `pr#10499 `_, Yan, Zheng) * mds: fix getattr starve setattr (`issue#16154 `_, `pr#9560 `_, Yan, Zheng) * mds: wrongly treat symlink inode as normal file/dir when symlink inode is stale on kcephfs (`issue#15702 `_, `pr#9405 `_, Zhi Zhang) * mon: "mon metadata" fails when only one monitor exists (`issue#15866 `_, `pr#10654 `_, John Spray, Kefu Chai) * mon: Monitor: validate prefix on handle_command() (`issue#16297 `_, `pr#10036 `_, You Ji) * mon: OSDMonitor: drop pg temps from not the current primary (`issue#16127 `_, `pr#9998 `_, Samuel Just) * mon: prepare_pgtemp needs to only update up_thru if newer than the existing one (`issue#16185 `_, `pr#10001 `_, Samuel Just) * msgr: AsyncConnection::lockmsg/async lockdep cycle: AsyncMessenger::lock, MDSDaemon::mds_lock, AsyncConnection::lock (`issue#16237 `_, `pr#10004 `_, Haomai Wang) * msgr: async messenger mon crash (`issue#16378 `_, `issue#16418 `_, `pr#9996 `_, Haomai Wang) * msgr: backports of all asyncmsgr fixes to jewel (`issue#15503 `_, `issue#15372 `_, `pr#9633 `_, Yan Jun, Haomai Wang, Piotr Dałek) * msgr: msg/async: connection race hang (`issue#15849 `_, `pr#10003 `_, Haomai Wang) * osd: FileStore: umount hang because sync thread doesn't exit (`issue#15695 `_, `pr#9105 `_, Kefu Chai) * osd: Fixes for list-inconsistent-* (`issue#15766 `_, `issue#16192 `_, `issue#15719 `_, `pr#9565 `_, David Zafman) * osd: New pools have bogus stuck inactive/unclean HEALTH_ERR messages until they are first active and clean (`issue#14952 `_, `pr#10007 `_, Sage Weil) * osd: OSD crash with Hammer to Jewel Upgrade: void FileStore::init_temp_collections() (`issue#16672 `_, `pr#10561 `_, David Zafman) * osd: OSD failed to subscribe skipped osdmaps after ceph osd pause (`issue#17023 `_, `pr#10804 `_, Kefu Chai) * osd: ObjectCacher split BufferHead read fix (`issue#16002 `_, `pr#10074 `_, Greg Farnum) * osd: ReplicatedBackend doesn't increment stats on pull, only push (`issue#16277 `_, `pr#10421 `_, Kefu Chai) * osd: Scrub error: 0/1 pinned (`issue#15952 `_, `pr#9576 `_, Samuel Just) * osd: crash adding snap to purged_snaps in ReplicatedPG::WaitingOnReplicas (`issue#15943 `_, `pr#9575 `_, Samuel Just) * osd: partprobe intermittent issues during ceph-disk prepare (`issue#15176 `_, `pr#10497 `_, Marius Vollmer, Loic Dachary) * osd: saw valgrind issues in ReplicatedPG::new_repop (`issue#16801 `_, `pr#10760 `_, Kefu Chai) * osd: sparse_read on ec pool should return extends with correct offset (`issue#16138 `_, `pr#10006 `_, kofiliu) * osd:sched_time not actually randomized (`issue#15890 `_, `pr#9578 `_, xie xingguo) * rbd: ImageReplayer::is_replaying does not include flush state (`issue#16970 `_, `pr#10790 `_, Jason Dillaman) * rbd: Journal duplicate op detection can cause lockdep error (`issue#16363 `_, `pr#10044 `_, Jason Dillaman) * rbd: Journal needs to handle duplicate maintenance op tids (`issue#16362 `_, `pr#10045 `_, Jason Dillaman) * rbd: Unable to disable journaling feature if in unexpected mirror state (`issue#16348 `_, `pr#10042 `_, Jason Dillaman) * rbd: bashism in src/rbdmap (`issue#16608 `_, `pr#10786 `_, Jason Dillaman) * rbd: doc: format 2 now is the default image format (`issue#17026 `_, `pr#10732 `_, Chengwei Yang) * rbd: hen journaling is enabled, a flush request shouldn't flush the cache (`issue#15761 `_, `pr#10041 `_, Yuan Zhou) * rbd: possible race condition during journal transition from replay to ready (`issue#16198 `_, `pr#10047 `_, Jason Dillaman) * rbd: qa/workunits/rbd: respect RBD_CREATE_ARGS environment variable (`issue#16289 `_, `pr#9721 `_, Mykola Golub) * rbd: rbd-mirror should disable proxied maintenance ops for non-primary image (`issue#16411 `_, `pr#10050 `_, Jason Dillaman) * rbd: rbd-mirror: FAILED assert(m_local_image_ctx->object_map != nullptr) (`issue#16558 `_, `pr#10646 `_, Jason Dillaman) * rbd: rbd-mirror: FAILED assert(m_on_update_status_finish == nullptr) (`issue#16956 `_, `pr#10792 `_, Jason Dillaman) * rbd: rbd-mirror: FAILED assert(m_state == STATE_STOPPING) (`issue#16980 `_, `pr#10791 `_, Jason Dillaman) * rbd: rbd-mirror: ensure replay status formatter has completed before stopping replay (`issue#16352 `_, `pr#10043 `_, Jason Dillaman) * rbd: rbd-mirror: include local pool id in resync throttle unique key (`issue#16536 `_, `issue#15239 `_, `issue#16488 `_, `issue#16491 `_, `issue#16329 `_, `issue#15108 `_, `issue#15670 `_, `pr#10678 `_, Ricardo Dias, Jason Dillaman) * rbd: rbd-mirror: potential race condition accessing local image journal (`issue#16230 `_, `pr#10046 `_, Jason Dillaman) * rbd: rbd-mirror: reduce memory footprint during journal replay (`issue#16321 `_, `issue#16489 `_, `issue#16622 `_, `issue#16539 `_, `issue#16223 `_, `issue#16349 `_, `pr#10684 `_, Mykola Golub, Jason Dillaman) * rgw: A query on a static large object fails with 404 error (`issue#16015 `_, `pr#9544 `_, Radoslaw Zarzynski) * rgw: Add zone rename to radosgw_admin (`issue#16934 `_, `pr#10663 `_, Shilpa Jagannath) * rgw: Bucket index shards orphaned after bucket delete (`issue#16412 `_, `pr#10525 `_, Orit Wasserman) * rgw: Bug when using port 443s in rgw. (`issue#16548 `_, `pr#10664 `_, Pritha Srivastava) * rgw: Fallback to Host header for bucket name. (`issue#15975 `_, `pr#10693 `_, Robin H. Johnson) * rgw: Fix civetweb IPv6 (`issue#16928 `_, `pr#10580 `_, Robin H. Johnson) * rgw: Increase log level for messages occuring while running rgw admin command (`issue#16935 `_, `pr#10765 `_, Shilpa Jagannath) * rgw: No Last-Modified, Content-Size and X-Object-Manifest headers if no segments in DLO manifest (`issue#15812 `_, `pr#9265 `_, Radoslaw Zarzynski) * rgw: RGWPeriodPuller tries to pull from itself (`issue#16939 `_, `pr#10764 `_, Casey Bodley) * rgw: Set Access-Control-Allow-Origin to a Asterisk if allowed in a rule (`issue#15348 `_, `pr#9453 `_, Wido den Hollander) * rgw: Swift API returns double space usage and objects of account metadata (`issue#16188 `_, `pr#10148 `_, Albert Tu) * rgw: account/container metadata not actually present in a request are deleted during POST through Swift API (`issue#15977 `_, `issue#15779 `_, `pr#9542 `_, Radoslaw Zarzynski) * rgw: add socket backlog setting for via ceph.conf (`issue#16406 `_, `pr#10216 `_, Feng Guo) * rgw: add tenant support to multisite sync (`issue#16469 `_, `issue#16121 `_, `issue#16665 `_, `pr#10845 `_, Yehuda Sadeh, Josh Durgin, Casey Bodley, Pritha Srivastava) * rgw: add_zone only clears master_zone if --master=false (`issue#15901 `_, `pr#9327 `_, Casey Bodley) * rgw: aws4 parsing issue (`issue#15940 `_, `issue#15939 `_, `pr#9545 `_, Yehuda Sadeh) * rgw: aws4: add STREAMING-AWS4-HMAC-SHA256-PAYLOAD support (`issue#16146 `_, `pr#10167 `_, Radoslaw Zarzynski, Javier M. Mellid) * rgw: backport merge of static sites fixes (`issue#15555 `_, `issue#15532 `_, `issue#15531 `_, `pr#9568 `_, Robin H. Johnson) * rgw: can set negative max_buckets on RGWUserInfo (`issue#14534 `_, `pr#10655 `_, Yehuda Sadeh) * rgw: cleanup radosgw-admin temp command as it was deprecated (`issue#16023 `_, `pr#9390 `_, Vikhyat Umrao) * rgw: comparing return code to ERR_NOT_MODIFIED in rgw_rest_s3.cc (needs minus sign) (`issue#16327 `_, `pr#9790 `_, Nathan Cutler) * rgw: custom metadata aren't camelcased in Swift's responses (`issue#15902 `_, `pr#9267 `_, Radoslaw Zarzynski) * rgw: data sync stops after getting error in all data log sync shards (`issue#16530 `_, `pr#10073 `_, Yehuda Sadeh) * rgw: default zone and zonegroup cannot be added to a realm (`issue#16839 `_, `pr#10658 `_, Casey Bodley) * rgw: document multi tenancy (`issue#16635 `_, `pr#10217 `_, Pete Zaitcev) * rgw: don't unregister request if request is not connected to manager (`issue#15911 `_, `pr#9242 `_, Yehuda Sadeh) * rgw: failed to create bucket after upgrade from hammer to jewel (`issue#16627 `_, `pr#10524 `_, Orit Wasserman) * rgw: fix ldap bindpw parsing (`issue#16286 `_, `pr#10518 `_, Matt Benjamin) * rgw: fix multi-delete query param parsing. (`issue#16618 `_, `pr#10188 `_, Robin H. Johnson) * rgw: improve support for Swift's object versioning. (`issue#15925 `_, `pr#10710 `_, Radoslaw Zarzynski) * rgw: initial slashes are not properly handled in Swift's BulkDelete (`issue#15948 `_, `pr#9316 `_, Radoslaw Zarzynski) * rgw: master: build failures with boost > 1.58 (`issue#16392 `_, `issue#16391 `_, `pr#10026 `_, Abhishek Lekshmanan) * rgw: multisite segfault on ~RGWRealmWatcher if realm was deleted (`issue#16817 `_, `pr#10660 `_, Casey Bodley) * rgw: multisite sync races with deletes (`issue#16222 `_, `issue#16464 `_, `issue#16220 `_, `issue#16143 `_, `pr#10293 `_, Yehuda Sadeh, Casey Bodley) * rgw: multisite: preserve zone's extra pool (`issue#16712 `_, `pr#10537 `_, Abhishek Lekshmanan) * rgw: object expirer's hints might be trimmed without processing in some circumstances (`issue#16705 `_, `issue#16684 `_, `pr#10763 `_, Radoslaw Zarzynski) * rgw: radosgw-admin failure for user create after upgrade from hammer to jewel (`issue#15937 `_, `pr#9294 `_, Orit Wasserman, Abhishek Lekshmanan) * rgw: radosgw-admin: EEXIST messages for create operations (`issue#15720 `_, `pr#9268 `_, Abhishek Lekshmanan) * rgw: radosgw-admin: inconsistency in uid/email handling (`issue#13598 `_, `pr#10520 `_, Matt Benjamin) * rgw: realm pull fails when using apache frontend (`issue#15846 `_, `pr#9266 `_, Orit Wasserman) * rgw: retry on bucket sync errors (`issue#16108 `_, `pr#9425 `_, Yehuda Sadeh) * rgw: s3website: x-amz-website-redirect-location header returns malformed HTTP response (`issue#15531 `_, `pr#9099 `_, Robin H. Johnson) * rgw: segfault in RGWOp_MDLog_Notify (`issue#16666 `_, `pr#10662 `_, Casey Bodley) * rgw: segmentation fault on error_repo in data sync (`issue#16603 `_, `pr#10523 `_, Casey Bodley) * rgw: selinux denials in RGW (`issue#16126 `_, `pr#10519 `_, Boris Ranto) * rgw: support size suffixes for --max-size in radosgw-admin command (`issue#16004 `_, `pr#9743 `_, Vikhyat Umrao) * rgw: updating CORS/ACLs might not work in some circumstances (`issue#15976 `_, `pr#9543 `_, Radoslaw Zarzynski) * rgw: use zone endpoints instead of zonegroup endpoints (`issue#16834 `_, `pr#10659 `_, Casey Bodley) * tests: improve rbd-mirror test case coverage (`issue#16197 `_, `pr#9631 `_, Mykola Golub, Jason Dillaman) * tests: rados/test.sh workunit timesout on OpenStack (`issue#15403 `_, `pr#8904 `_, Loic Dachary) * tools: ceph-disk: Accept bcache devices as data disks (`issue#13278 `_, `pr#8497 `_, Peter Sabaini) * tools: rados: Add cleanup message with time to rados bench output (`issue#15704 `_, `pr#9740 `_, Vikhyat Umrao) * tools: src/script/subman fails with KeyError: 'nband' (`issue#16961 `_, `pr#10625 `_, Loic Dachary, Ali Maredia) v10.2.2 Jewel ============= This point release fixes several important bugs in RBD mirroring, RGW multi-site, CephFS, and RADOS. We recommend that all v10.2.x users upgrade. For more detailed information, see :download:`the complete changelog `. Notable Changes --------------- * ceph: cli: exception when pool name has non-ascii characters (`issue#15913 `_, `pr#9320 `_, Ricardo Dias) * ceph-disk: workaround gperftool hang (`issue#13522 `_, `issue#16103 `_, `pr#9427 `_, Loic Dachary) * cephfs: backports needed for Manila (`issue#15599 `_, `issue#15417 `_, `issue#15045 `_, `pr#9430 `_, John Spray, Ramana Raja, Xiaoxi Chen) * ceph.spec.in: drop support for RHEL<7 and SUSE<1210 in jewel and above (`issue#15725 `_, `issue#15627 `_, `issue#13445 `_, `issue#15822 `_, `issue#15472 `_, `issue#15987 `_, `issue#15516 `_, `issue#15549 `_, `pr#8938 `_, Boris Ranto, Sage Weil, Nathan Cutler, Lars Marowsky-Bree) * ceph_test_librbd_fsx crashes during journal replay shut down (`issue#16123 `_, `pr#9556 `_, Jason Dillaman) * client: fix bugs accidentally disabling readahead (`issue#16024 `_, `pr#9656 `_, Patrick Donnelly, Greg Farnum) * cls_journal: initialize empty commit position upon client register (`issue#15757 `_, `pr#9376 `_, runsisi, Venky Shankar) * cls::rbd: mirror_image_status_list returned max 64 items (`pr#9069 `_, Mykola Golub) * cls_rbd: mirror image status summary should read full directory (`issue#16178 `_, `pr#9608 `_, Jason Dillaman) * common: BackoffThrottle spins unnecessarily with very small backoff while the throttle is full (`issue#15953 `_, `pr#9579 `_, Samuel Just) * common: Do not link lttng into libglobal (`pr#9194 `_, Karol Mroz) * debian: install systemd target files (`issue#15573 `_, `pr#8815 `_, Kefu Chai, Sage Weil) * doc: update mirroring guide to include pool/image status commands (`issue#15746 `_, `pr#9180 `_, Mykola Golub) * librbd: Disabling journaling feature results in "Transport endpoint is not connected" error (`issue#15863 `_, `pr#9548 `_, Yuan Zhou) * librbd: do not shut down exclusive lock while acquiring' (`issue#16291 `_, `issue#16260 `_, `pr#9691 `_, Jason Dillaman) * librbd: Initial python APIs to support mirroring (`issue#15656 `_, `pr#9550 `_, Mykola Golub) * librbd: journal IO error results in failed assertion in AioCompletion (`issue#16077 `_, `issue#15034 `_, `issue#15791 `_, `pr#9611 `_, Hector Martin, Jason Dillaman) * librbd: journal: live replay might skip entries from previous object set (`issue#15864 `_, `issue#15665 `_, `pr#9217 `_, Jason Dillaman) * librbd: journal: support asynchronous shutdown (`issue#15949 `_, `issue#14530 `_, `issue#15993 `_, `pr#9373 `_, Jason Dillaman) * librbd: Metadata config overrides are applied synchronously (`issue#15928 `_, `pr#9318 `_, Jason Dillaman) * librbd: Object Map is showing as invalid, even when Object Map is disabled for that Image. (`issue#16076 `_, `pr#9555 `_, xinxin shu) * librbd: prevent error messages when journal externally disabled (`issue#16114 `_, `pr#9610 `_, Zhiqiang Wang, Jason Dillaman) * librbd: recursive lock possible when disabling journaling (`issue#16235 `_, `pr#9654 `_, Jason Dillaman) * librbd: refresh image if needed in mirror functions (`issue#16096 `_, `pr#9609 `_, Jon Bernard) * librbd: remove should ignore mirror errors from older OSDs (`issue#16268 `_, `pr#9692 `_, Jason Dillaman) * librbd: reuse ImageCtx::finisher and SafeTimer for lots of images case (`issue#13938 `_, `pr#9580 `_, Haomai Wang) * librbd: validate image metadata configuration overrides (`issue#15522 `_, `pr#9554 `_, zhuangzeqiang) * mds: order directories by hash and fix simultaneous readdir races (`issue#15508 `_, `pr#9655 `_, Yan, Zheng, Greg Farnum) * mon: Hammer (0.94.3) OSD does not delete old OSD Maps in a timely fashion (maybe at all?) (`issue#13990 `_, `pr#9100 `_, Kefu Chai) * mon/Monitor: memory leak on Monitor::handle_ping() (`issue#15793 `_, `pr#9270 `_, xie xingguo) * osd: acting_primary not updated on split (`issue#15523 `_, `pr#8968 `_, Sage Weil) * osd: boot race with noup being set (`issue#15678 `_, `pr#9101 `_, Sage Weil) * osd: deadlock in OSD::_committed_osd_maps (`issue#15701 `_, `pr#9103 `_, Xinze Chi) * osd: hobject_t::get_max() vs is_max() discrepancy (`issue#16113 `_, `pr#9614 `_, Samuel Just) * osd: LibRadosWatchNotifyPPTests/LibRadosWatchNotifyPP.WatchNotify2Timeout/1 segv (`issue#15760 `_, `pr#9104 `_, Sage Weil) * osd: remove reliance on FLAG_OMAP for reads (`pr#9638 `_, Samuel Just) * osd valgrind invalid reads/writes (`issue#15870 `_, `pr#9237 `_, Samuel Just) * pybind: rbd API should default features parameter to None (`issue#15982 `_, `pr#9553 `_, Mykola Golub) * qa: dynamic_features.sh races with image deletion (`issue#15500 `_, `pr#9552 `_, Mykola Golub) * qa/workunits: ensure replay has started before checking position (`issue#16248 `_, `pr#9674 `_, Jason Dillaman) * qa/workunits/rbd: fixed rbd_mirror teuthology runtime errors (`pr#9232 `_, Jason Dillaman) * radosgw-admin: fix 'period push' handling of --url (`issue#15926 `_, `pr#9210 `_, Casey Bodley) * rbd-mirror: Delete local image mirror when remote image mirroring is disabled (`issue#15916 `_, `issue#14421 `_, `pr#9372 `_, runsisi, Mykola Golub, Ricardo Dias) * rbd-mirror: do not propagate deletions when pool unavailable (`issue#16229 `_, `pr#9630 `_, Jason Dillaman) * rbd-mirror: do not re-use image id from mirror directory if creating image (`issue#16253 `_, `pr#9673 `_, Jason Dillaman) * rbd-mirror: FAILED assert(!m_status_watcher) (`issue#16245 `_, `issue#16290 `_, `pr#9690 `_, Mykola Golub) * rbd-mirror: fix deletion propagation edge cases (`issue#16226 `_, `pr#9629 `_, Jason Dillaman) * rbd-mirror: fix journal shut down ordering (`issue#16165 `_, `pr#9628 `_, Jason Dillaman) * rbd-mirror: potential crash during image status update (`issue#15909 `_, `pr#9226 `_, Mykola Golub, Jason Dillaman) * rbd-mirror: refresh image after creating sync point (`issue#16196 `_, `pr#9627 `_, Jason Dillaman) * rbd-mirror: replicate cloned images (`issue#14937 `_, `pr#9423 `_, Jason Dillaman) * rbd-mirror should disable the rbd cache for local images (`issue#15930 `_, `pr#9317 `_, Jason Dillaman) * rbd-mirror: support bootstrap canceling (`issue#16201 `_, `pr#9612 `_, Mykola Golub) * rbd-mirror: support multiple replicated pools (`issue#16045 `_, `pr#9409 `_, Jason Dillaman) * rgw: fix manager selection when APIs customized (`issue#15974 `_, `issue#15973 `_, `pr#9245 `_, Robin H. Johnson) * rgw: keep track of written_objs correctly (`issue#15886 `_, `pr#9239 `_, Yehuda Sadeh) * rpm: ceph gid mismatch on upgrade from hammer with pre-existing ceph user (SUSE) (`issue#15869 `_, `pr#9424 `_, Nathan Cutler) * systemd: ceph-{mds,mon,osd,radosgw} systemd unit files need wants=time-sync.target (`issue#15419 `_, `pr#8802 `_, Nathan Cutler) * test: failure in journal.sh workunit test (`issue#16011 `_, `pr#9377 `_, Mykola Golub) * tests: rm -fr /tmp/*virtualenv* (`issue#16087 `_, `pr#9403 `_, Loic Dachary) v10.2.1 Jewel ============= This is the first bugfix release for Jewel. It contains several annoying packaging and init system fixes and a range of important bugfixes across RBD, RGW, and CephFS. We recommend that all v10.2.x users upgrade. For more detailed information, see :download:`the complete changelog `. Notable Changes --------------- * cephfs: CephFSVolumeClient should isolate volumes by RADOS namespace (`issue#15400 `_, `pr#8787 `_, Xiaoxi Chen) * cephfs: handle standby-replay nodes properly in upgrades (`issue#15591 `_, `pr#8971 `_, John Spray) * ceph-{mds,mon,osd} packages need scriptlets with systemd code (`issue#14941 `_, `pr#8801 `_, Boris Ranto, Nathan Cutler) * ceph_test_keyvaluedb: fix (`issue#15435 `_, `pr#9051 `_, Allen Samuels, Sage Weil) * cmake: add missing source file to rbd_mirror/image_replayer (`pr#9052 `_, Casey Bodley) * cmake: fix rbd compile errors (`pr#9076 `_, runsisi, Jason Dillaman) * journal: incorrectly computed object offset within set (`issue#15765 `_, `pr#9038 `_, Jason Dillaman) * librbd: client-side handling for incompatible object map sizes (`issue#15642 `_, `pr#9039 `_, Jason Dillaman) * librbd: constrain size of AioWriteEvent journal entries (`issue#15750 `_, `pr#9048 `_, Jason Dillaman) * librbd: does not crash if image header is too short (`pr#9044 `_, Kefu Chai) * librbd: Errors encountered disabling object-map while flatten is in-progress (`issue#15572 `_, `pr#8869 `_, Jason Dillaman) * librbd: fix get/list mirror image status API (`issue#15771 `_, `pr#9036 `_, Mykola Golub) * librbd: Parent image is closed twice if error encountered while opening (`issue#15574 `_, `pr#8867 `_, Jason Dillaman) * librbd: possible double-free of object map invalidation request upon error (`issue#15643 `_, `pr#8865 `_, runsisi) * librbd: possible race condition leads to use-after-free (`issue#15690 `_, `pr#9009 `_, Jason Dillaman) * librbd: potential concurrent event processing during journal replay (`issue#15755 `_, `pr#9040 `_, Jason Dillaman) * librbd: Potential double free of SetSnapRequest instance (`issue#15571 `_, `pr#8803 `_, runsisi) * librbd: put the validation of image snap context earlier (`pr#9046 `_, runsisi) * librbd: reduce log level for image format 1 warning (`issue#15577 `_, `pr#9003 `_, Jason Dillaman) * mds/MDSAuthCap parse no longer fails on paths with hyphens (`issue#15465 `_, `pr#8969 `_, John Spray) * mds: MDS incarnation no longer gets lost after remove filesystem (`issue#15399 `_, `pr#8970 `_, John Spray) * mon/OSDMonitor: avoid underflow in reweight-by-utilization if max_change=1 (`issue#15655 `_, `pr#9006 `_, Samuel Just) * python: clone operation will fail if config overridden with "rbd default format = 1" (`issue#15685 `_, `pr#8972 `_, Jason Dillaman) * radosgw-admin: add missing --zonegroup-id to usage (`issue#15650 `_, `pr#9019 `_, Casey Bodley) * radosgw-admin: update usage for zone[group] modify (`issue#15651 `_, `pr#9016 `_, Casey Bodley) * radosgw-admin: zonegroup remove command (`issue#15684 `_, `pr#9015 `_, Casey Bodley) * rbd CLI to retrieve rbd mirror state for a pool / specific image (`issue#15144 `_, `issue#14420 `_, `pr#8868 `_, Mykola Golub) * rbd disk-usage CLI command should support calculating full image usage (`issue#14540 `_, `pr#8870 `_, Jason Dillaman) * rbd: helpful error message on map failure (`issue#15721 `_, `pr#9041 `_, Venky Shankar) * rbd: help message distinction between commands and aliases (`issue#15521 `_, `pr#9004 `_, Yongqiang He) * rbd-mirror: admin socket commands to start/stop/restart mirroring (`issue#15718 `_, `pr#9010 `_, Mykola Golub, Josh Durgin) * rbd-mirror can crash if start up is interrupted (`issue#15630 `_, `pr#8866 `_, Jason Dillaman) * rbd-mirror: image sync needs to handle snapshot size and protection status (`issue#15110 `_, `pr#9050 `_, Jason Dillaman) * rbd-mirror: lockdep error during bootstrap (`issue#15664 `_, `pr#9008 `_, Jason Dillaman) * rbd-nbd: fix rbd-nbd aio callback error handling (`issue#15604 `_, `pr#9005 `_, Chang-Yi Lee) * rgw: add AWS4 completion support for RGW_OP_SET_BUCKET_WEBSITE (`issue#15626 `_, `pr#9018 `_, Javier M. Mellid) * rgw admin output (`issue#15747 `_, `pr#9054 `_, Casey Bodley) * rgw: fix issue #15597 (`issue#15597 `_, `pr#9020 `_, Yehuda Sadeh) * rgw: fix printing wrong X-Storage-Url in Swift's TempAuth. (`issue#15667 `_, `pr#9021 `_, Radoslaw Zarzynski) * rgw: handle stripe transition when flushing final pending_data_bl (`issue#15745 `_, `pr#9053 `_, Yehuda Sadeh) * rgw: leak fixes (`issue#15792 `_, `pr#9022 `_, Yehuda Sadeh) * rgw: multisite: Issues with Deleting Buckets (`issue#15540 `_, `pr#8930 `_, Abhishek Lekshmanan) * rgw: period commit fix (`issue#15828 `_, `pr#9081 `_, Casey Bodley) * rgw: period delete fixes (`issue#15469 `_, `pr#9047 `_, Casey Bodley) * rgw: radosgw-admin zone set cuts pool names short if name starts with a period (`issue#15598 `_, `pr#9029 `_, Yehuda Sadeh) * rgw: segfault at RGWAsyncGetSystemObj (`issue#15565 `_, `issue#15625 `_, `pr#9017 `_, Yehuda Sadeh) * several backports (`issue#15588 `_, `issue#15655 `_, `pr#8853 `_, Alexandre Derumier, xie xingguo, Alfredo Deza) * systemd: fix typo in preset file (`pr#8843 `_, Nathan Cutler) * tests: make check fails on ext4 (`issue#15837 `_, `pr#9063 `_, Loic Dachary, Sage Weil) v10.2.0 Jewel ============= This major release of Ceph is the foundation for the next long-term stable release series. There have been many major changes since the Infernalis (9.2.x) and Hammer (0.94.x) releases, and the upgrade process is non-trivial. Please read these release notes carefully. Major Changes from Infernalis ----------------------------- - *CephFS*: * This is the first release in which CephFS is declared stable! Several features are disabled by default, including snapshots and multiple active MDS servers. * The repair and disaster recovery tools are now feature-complete. * A new cephfs-volume-manager module is included that provides a high-level interface for creating "shares" for OpenStack Manila and similar projects. * There is now experimental support for multiple CephFS file systems within a single cluster. - *RGW*: * The multisite feature has been almost completely rearchitected and rewritten to support any number of clusters/sites, bidirectional fail-over, and active/active configurations. * You can now access radosgw buckets via NFS (experimental). * The AWS4 authentication protocol is now supported. * There is now support for S3 request payer buckets. * The new multitenancy infrastructure improves compatibility with Swift, which provides a separate container namespace for each user/tenant. * The OpenStack Keystone v3 API is now supported. There are a range of other small Swift API features and compatibility improvements as well, including bulk delete and SLO (static large objects). - *RBD*: * There is new support for mirroring (asynchronous replication) of RBD images across clusters. This is implemented as a per-RBD image journal that can be streamed across a WAN to another site, and a new rbd-mirror daemon that performs the cross-cluster replication. * The exclusive-lock, object-map, fast-diff, and journaling features can be enabled or disabled dynamically. The deep-flatten features can be disabled dynamically but not re-enabled. * The RBD CLI has been rewritten to provide command-specific help and full bash completion support. * RBD snapshots can now be renamed. - *RADOS*: * BlueStore, a new OSD backend, is included as an experimental feature. The plan is for it to become the default backend in the K or L release. * The OSD now persists scrub results and provides a librados API to query results in detail. * We have revised our documentation to recommend *against* using ext4 as the underlying filesystem for Ceph OSD daemons due to problems supporting our long object name handling. Major Changes from Hammer ------------------------- - *General*: * Ceph daemons are now managed via systemd (with the exception of Ubuntu Trusty, which still uses upstart). * Ceph daemons run as 'ceph' user instead of 'root'. * On Red Hat distros, there is also an SELinux policy. - *RADOS*: * The RADOS cache tier can now proxy write operations to the base tier, allowing writes to be handled without forcing migration of an object into the cache. * The SHEC erasure coding support is no longer flagged as experimental. SHEC trades some additional storage space for faster repair. * There is now a unified queue (and thus prioritization) of client IO, recovery, scrubbing, and snapshot trimming. * There have been many improvements to low-level repair tooling (ceph-objectstore-tool). * The internal ObjectStore API has been significantly cleaned up in order to faciliate new storage backends like BlueStore. - *RGW*: * The Swift API now supports object expiration. * There are many Swift API compatibility improvements. - *RBD*: * The ``rbd du`` command shows actual usage (quickly, when object-map is enabled). * The object-map feature has seen many stability improvements. * The object-map and exclusive-lock features can be enabled or disabled dynamically. * You can now store user metadata and set persistent librbd options associated with individual images. * The new deep-flatten features allow flattening of a clone and all of its snapshots. (Previously snapshots could not be flattened.) * The export-diff command is now faster (it uses aio). There is also a new fast-diff feature. * The --size argument can be specified with a suffix for units (e.g., ``--size 64G``). * There is a new ``rbd status`` command that, for now, shows who has the image open/mapped. - *CephFS*: * You can now rename snapshots. * There have been ongoing improvements around administration, diagnostics, and the check and repair tools. * The caching and revocation of client cache state due to unused inodes has been dramatically improved. * The ceph-fuse client behaves better on 32-bit hosts. Distro compatibility -------------------- Starting with Infernalis, we have dropped support for many older distributions so that we can move to a newer compiler toolchain (e.g., C++11). Although it is still possible to build Ceph on older distributions by installing backported development tools, we are not building and publishing release packages for ceph.com. We now build packages for the following distributions and architectures: - x86_64: * CentOS 7.x. We have dropped support for CentOS 6 (and other RHEL 6 derivatives, like Scientific Linux 6). * Debian Jessie 8.x. Debian Wheezy 7.x's g++ has incomplete support for C++11 (and no systemd). * Ubuntu Xenial 16.04 and Trusty 14.04. Ubuntu Precise 12.04 is no longer supported. * Fedora 22 or later. - aarch64 / arm64: * Ubuntu Xenial 16.04. Upgrading from Infernalis or Hammer ----------------------------------- * We now recommend against using ``ext4`` as the underlying file system for Ceph OSDs, especially when RGW or other users of long RADOS object names are used. For more information about why, please see `Filesystem Recommendations`_. If you have an existing cluster that uses ext4 for the OSDs but uses only RBD and/or CephFS, then the ext4 limitations will not affect you. Before upgrading, be sure add the following to ``ceph.conf`` to allow the OSDs to start:: osd max object name len = 256 osd max object namespace len = 64 Keep in mind that if you set these lower object name limits and later decide to use RGW on this cluster, it will have problems storing S3/Swift objects with long names. This startup check can also be disabled via the below option, although this is not recommended:: osd check max object name len on startup = false .. _Filesystem Recommendations: ../configuration/filesystem-recommendations * There are no major compatibility changes since Infernalis. Simply upgrading the daemons on each host and restarting all daemons is sufficient. * The rbd CLI no longer accepts the deprecated '--image-features' option during create, import, and clone operations. The '--image-feature' option should be used instead. * The rbd legacy image format (version 1) is deprecated with the Jewel release. Attempting to create a new version 1 RBD image will result in a warning. Future releases of Ceph will remove support for version 1 RBD images. * The 'send_pg_creates' and 'map_pg_creates' mon CLI commands are obsolete and no longer supported. * A new configure option 'mon_election_timeout' is added to specifically limit max waiting time of monitor election process, which was previously restricted by 'mon_lease'. * CephFS filesystems created using versions older than Firefly (0.80) must use the new 'cephfs-data-scan tmap_upgrade' command after upgrading to Jewel. See 'Upgrading' in the CephFS documentation for more information. * The 'ceph mds setmap' command has been removed. * The default RBD image features for new images have been updated to enable the following: exclusive lock, object map, fast-diff, and deep-flatten. These features are not currently supported by the RBD kernel driver nor older RBD clients. They can be disabled on a per-image basis via the RBD CLI, or the default features can be updated to the pre-Jewel setting by adding the following to the client section of the Ceph configuration file:: rbd default features = 1 * The rbd legacy image format (version 1) is deprecated with the Jewel release. * After upgrading, users should set the 'sortbitwise' flag to enable the new internal object sort order:: ceph osd set sortbitwise This flag is important for the new object enumeration API and for new backends like BlueStore. * The rbd CLI no longer permits creating images and snapshots with potentially ambiguous names (e.g. the '/' and '@' characters are disallowed). The validation can be temporarily disabled by adding "--rbd-validate-names=false" to the rbd CLI when creating an image or snapshot. It can also be disabled by adding the following to the client section of the Ceph configuration file:: rbd validate names = false Upgrading from Hammer --------------------- * All cluster nodes must first upgrade to Hammer v0.94.4 or a later v0.94.z release; only then is it possible to upgrade to Jewel 10.2.z. * For all distributions that support systemd (CentOS 7, Fedora, Debian Jessie 8.x, OpenSUSE), ceph daemons are now managed using native systemd files instead of the legacy sysvinit scripts. For example,:: systemctl start ceph.target # start all daemons systemctl status ceph-osd@12 # check status of osd.12 The main notable distro that is *not* yet using systemd is Ubuntu trusty 14.04. (The next Ubuntu LTS, 16.04, will use systemd instead of upstart.) * Ceph daemons now run as user and group ``ceph`` by default. The ceph user has a static UID assigned by Fedora and Debian (also used by derivative distributions like RHEL/CentOS and Ubuntu). On SUSE the same UID/GID as in Fedora and Debian will be used, *provided it is not already assigned*. In the unlikely event the preferred UID or GID is assigned to a different user/group, ceph will get a dynamically assigned UID/GID. If your systems already have a ceph user, upgrading the package will cause problems. We suggest you first remove or rename the existing 'ceph' user and 'ceph' group before upgrading. When upgrading, administrators have two options: #. Add the following line to ``ceph.conf`` on all hosts:: setuser match path = /var/lib/ceph/$type/$cluster-$id This will make the Ceph daemons run as root (i.e., not drop privileges and switch to user ceph) if the daemon's data directory is still owned by root. Newly deployed daemons will be created with data owned by user ceph and will run with reduced privileges, but upgraded daemons will continue to run as root. #. Fix the data ownership during the upgrade. This is the preferred option, but it is more work and can be very time consuming. The process for each host is to: #. Upgrade the ceph package. This creates the ceph user and group. For example:: ceph-deploy install --stable jewel HOST #. Stop the daemon(s).:: service ceph stop # fedora, centos, rhel, debian stop ceph-all # ubuntu #. Fix the ownership:: chown -R ceph:ceph /var/lib/ceph chown -R ceph:ceph /var/log/ceph #. Restart the daemon(s).:: start ceph-all # ubuntu systemctl start ceph.target # debian, centos, fedora, rhel Alternatively, the same process can be done with a single daemon type, for example by stopping only monitors and chowning only ``/var/lib/ceph/mon``. * The on-disk format for the experimental KeyValueStore OSD backend has changed. You will need to remove any OSDs using that backend before you upgrade any test clusters that use it. * When a pool quota is reached, librados operations now block indefinitely, the same way they do when the cluster fills up. (Previously they would return -ENOSPC.) By default, a full cluster or pool will now block. If your librados application can handle ENOSPC or EDQUOT errors gracefully, you can get error returns instead by using the new librados OPERATION_FULL_TRY flag. * The return code for librbd's rbd_aio_read and Image::aio_read API methods no longer returns the number of bytes read upon success. Instead, it returns 0 upon success and a negative value upon failure. * 'ceph scrub', 'ceph compact' and 'ceph sync force' are now DEPRECATED. Users should instead use 'ceph mon scrub', 'ceph mon compact' and 'ceph mon sync force'. * 'ceph mon_metadata' should now be used as 'ceph mon metadata'. There is no need to deprecate this command (same major release since it was first introduced). * The `--dump-json` option of "osdmaptool" is replaced by `--dump json`. * The commands of "pg ls-by-{pool,primary,osd}" and "pg ls" now take "recovering" instead of "recovery", to include the recovering pgs in the listed pgs. Upgrading from Firefly ---------------------- Upgrading directly from Firefly v0.80.z is not recommended. It is possible to do a direct upgrade, but not without downtime, as all OSDs must be stopped, upgraded, and then restarted. We recommend that clusters be first upgraded to Hammer v0.94.6 or a later v0.94.z release; only then is it possible to upgrade to Jewel 10.2.z for an online upgrade (see below). To do an offline upgrade directly from Firefly, all Firefly OSDs must be stopped and marked down before any Jewel OSDs will be allowed to start up. This fencing is enforced by the Jewel monitor, so you should use an upgrade procedure like: #. Upgrade Ceph on monitor hosts #. Restart all ceph-mon daemons #. Set noout:: ceph osd set noout #. Upgrade Ceph on all OSD hosts #. Stop all ceph-osd daemons #. Mark all OSDs down with something like:: ceph osd down `seq 0 1000` #. Start all ceph-osd daemons #. Let the cluster settle and then unset noout:: ceph osd unset noout #. Upgrade and restart any remaining daemons (ceph-mds, radosgw) Notable Changes since Infernalis -------------------------------- * admin/build-doc: depend on zlib1g-dev and graphviz (`pr#7522 `_, Ken Dreyer) * auth: fail if rotating key is missing (do not spam log) (`pr#6473 `_, Qiankun Zheng) * auth: fix crash when bad keyring is passed (`pr#6698 `_, Dunrong Huang) * auth: make keyring without mon entity type return -EACCES (`pr#5734 `_, Xiaowei Chen) * AUTHORS: update email (`pr#7854 `_, Yehuda Sadeh) * authtool: update --help and manpage to match code. (`pr#8456 `_, Robin H. Johnson) * bluestore: latest and greatest (`issue#14210 `_, `issue#13801 `_, `pr#6896 `_, xie.xingguo, Jianpeng Ma, YiQiang Chen, Sage Weil, Ning Yao) * buffer: add symmetry operator==() and operator!=() (`pr#7974 `_, Kefu Chai) * buffer: fix internal iterator invalidation on rebuild, get_contiguous (`pr#6962 `_, Sage Weil) * buffer: hide iterator_impl symbols (`issue#14788 `_, `pr#7688 `_, Kefu Chai) * buffer: increment history alloc as well in raw_combined (`issue#14955 `_, `pr#7910 `_, Samuel Just) * buffer: make usable outside of ceph source again (`pr#6863 `_, Josh Durgin) * buffer: raw_combined allocations buffer and ref count together (`pr#7612 `_, Sage Weil) * buffer: use move construct to append/push_back/push_front (`pr#7455 `_, Haomai Wang) * build: Adding build requires (`pr#7742 `_, Erwan Velu) * build: a few armhf (32-bit build) fixes (`pr#7999 `_, Eric Lee, Sage Weil) * build: allow jemalloc with rocksdb-static (`pr#7368 `_, Somnath Roy) * build: build internal plugins and classes as modules (`pr#6462 `_, James Page) * build: cmake check fixes (`pr#6787 `_, Orit Wasserman) * build: cmake tweaks (`pr#6254 `_, John Spray) * build: fix a few warnings (`pr#6847 `_, Orit Wasserman) * build: fix bz2-dev dependency (`pr#6948 `_, Samuel Just) * build: fix compiling warnings (`pr#8366 `_, Dongsheng Yang) * build: Fixing BTRFS issue at 'make check' (`pr#7805 `_, Erwan Velu) * build: fix Jenkins make check errors due to deep-scrub randomization (`pr#6671 `_, David Zafman) * build: fix the autotools and cmake build (the new fusestore needs libfuse) (`pr#7393 `_, Kefu Chai) * build: fix warnings (`pr#7197 `_, Kefu Chai, xie xingguo) * build: fix warnings (`pr#7315 `_, Kefu Chai) * build: FreeBSD related fixes (`pr#7170 `_, Mykola Golub) * build: Gentoo: _FORTIFY_SOURCE fix. (`issue#13920 `_, `pr#6739 `_, Robin H. Johnson) * build: kill warnings (`pr#7397 `_, Kefu Chai) * build: misc make check fixes (`pr#7153 `_, Sage Weil) * build: more CMake package check fixes (`pr#6108 `_, Daniel Gryniewicz) * build: move libexec scripts to standardize across distros (`issue#14687 `_, `issue#14705 `_, `issue#14723 `_, `pr#7636 `_, Nathan Cutler, Kefu Chai) * build/ops: enable CR in CentOS 7 (`issue#13997 `_, `pr#6844 `_, Loic Dachary) * build/ops: rbd-replay moved from ceph-test-dbg to ceph-common-dbg (`issue#13785 `_, `pr#6578 `_, Loic Dachary) * build/ops: systemd ceph-disk unit must not assume /bin/flock (`issue#13975 `_, `pr#6803 `_, Loic Dachary) * build: Refrain from versioning and packaging EC testing plugins (`issue#14756 `_, `issue#14723 `_, `pr#7637 `_, Nathan Cutler, Kefu Chai) * build: Respect TMPDIR for virtualenv. (`pr#8457 `_, Robin H. Johnson) * build: spdk submodule; cmake (`pr#7503 `_, Kefu Chai) * build: workaround an automake bug for "make check" (`issue#14723 `_, `pr#7626 `_, Kefu Chai) * ceph: bash auto complete for CLI based on mon command descriptions (`pr#7693 `_, Adam Kupczyk) * ceph_daemon.py: Resolved ImportError to work with python3 (`pr#7937 `_, Sarthak Munshi) * ceph-detect-init: add debian/jessie test (`pr#8074 `_, Kefu Chai) * ceph-detect-init: add missing test case (`pr#8105 `_, Nathan Cutler) * ceph-detect-init: fix py3 test (`pr#7025 `_, Kefu Chai) * ceph-detect-init: fix py3 test (`pr#7243 `_, Kefu Chai) * ceph_detect_init/__init__.py: remove shebang (`pr#7731 `_, Nathan Cutler) * ceph-detect-init: return correct value on recent SUSE distros (`issue#14770 `_, `pr#7909 `_, Nathan Cutler) * ceph-detect-init/run-tox.sh: FreeBSD: No init detect (`pr#8373 `_, Willem Jan Withagen) * ceph-detect-init: Ubuntu >= 15.04 uses systemd (`pr#6873 `_, James Page) * ceph-disk: Add destroy and deactivate option (`issue#7454 `_, `pr#5867 `_, Vicente Cheng) * ceph-disk: add -f flag for btrfs mkfs (`pr#7222 `_, Darrell Enns) * ceph-disk: Add --setuser and --setgroup options for ceph-disk (`pr#7351 `_, Mike Shuey) * ceph-disk: ceph-disk list fails on /dev/cciss!c0d0 (`issue#13970 `_, `issue#14233 `_, `issue#14230 `_, `pr#6879 `_, Loic Dachary) * ceph-disk: compare parted output with the dereferenced path (`issue#13438 `_, `pr#6219 `_, Joe Julian) * ceph-disk: deactivate / destroy PATH arg are optional (`pr#7756 `_, Loic Dachary) * ceph-disk: do not always fail when re-using a partition (`pr#8508 `_, You Ji) * ceph-disk: fix failures when preparing disks with udev > 214 (`issue#14080 `_, `issue#14094 `_, `pr#6926 `_, Loic Dachary, Ilya Dryomov) * ceph-disk: fix prepare --help (`pr#7758 `_, Loic Dachary) * ceph-disk: Fix trivial typo (`pr#7472 `_, Brad Hubbard) * ceph-disk: flake8 fixes (`pr#7646 `_, Loic Dachary) * ceph-disk: get Nonetype when ceph-disk list with --format plain on single device. (`pr#6410 `_, Vicente Cheng) * ceph-disk: Improving 'make check' for ceph-disk (`pr#7762 `_, Erwan Velu) * ceph-disk: key management support (`issue#14669 `_, `pr#7552 `_, Loic Dachary) * ceph-disk: make some arguments as required if necessary (`pr#7687 `_, Dongsheng Yang) * ceph-disk: s/dmcrpyt/dmcrypt/ (`issue#14838 `_, `pr#7744 `_, Loic Dachary, Frode Sandholtbraaten) * ceph-disk: support bluestore (`issue#13422 `_, `pr#7218 `_, Loic Dachary, Sage Weil) * ceph-disk/test: fix test_prepare.py::TestPrepare tests (`pr#7549 `_, Kefu Chai) * ceph-disk: warn for prepare partitions with bad GUIDs (`issue#13943 `_, `pr#6760 `_, David Disseldorp) * ceph: fix tell behavior (`pr#6329 `_, David Zafman) * cephfs-data-scan: scan_frags (`pr#5941 `_, John Spray) * cephfs-data-scan: scrub tag filtering (#12133 and #12145) (`issue#12133 `_, `issue#12145 `_, `pr#5685 `_, John Spray) * ceph-fuse: add process to ceph-fuse --help (`pr#6821 `_, Wei Feng) * ceph-fuse: fix double decreasing the count to trim caps (`issue#14319 `_, `pr#7229 `_, Zhi Zhang) * ceph-fuse: fix double free of args (`pr#7015 `_, Ilya Shipitsin) * ceph-fuse: fix fsync() (`pr#6388 `_, Yan, Zheng) * ceph-fuse: Fix potential filehandle ref leak at umount (`issue#14800 `_, `pr#7686 `_, Zhi Zhang) * ceph-fuse,libcephfs: Fix client handling of "lost" open directories on shutdown (`issue#14996 `_, `pr#7994 `_, Yan, Zheng) * ceph-fuse,libcephfs: fix free fds being exhausted eventually because freed fds are never put back (`issue#14798 `_, `pr#7685 `_, Zhi Zhang) * ceph-fuse:print usage information when no parameter specified (`pr#6868 `_, Bo Cai) * ceph-fuse: rotate log file (`pr#8485 `_, Sage Weil) * ceph-fuse: While starting ceph-fuse, start the log thread first (`issue#13443 `_, `pr#6224 `_, Wenjun Huang) * ceph: improve the error message (`issue#11101 `_, `pr#7106 `_, Kefu Chai) * ceph.in: avoid a broken pipe error when use ceph command (`issue#14354 `_, `pr#7212 `_, Bo Cai) * ceph.in: correct dev python path for automake builds (`pr#8360 `_, Josh Durgin) * ceph.in: fix python libpath for automake as well (`pr#8362 `_, Josh Durgin) * ceph.in: Minor python3 specific changes (`pr#7947 `_, Sarthak Munshi) * ceph-kvstore-tool: handle bad out file on command line (`pr#6093 `_, Kefu Chai) * ceph-mds:add --help/-h (`pr#6850 `_, Cilang Zhao) * ceph_objectstore_bench: fix race condition, bugs (`issue#13516 `_, `pr#6681 `_, Igor Fedotov) * ceph-objectstore-tool: fix --dry-run for many ceph-objectstore-tool operations (`pr#6545 `_, David Zafman) * ceph-rest-api: fix fs/flag/set (`pr#8428 `_, Sage Weil) * ceph.spec.in: add BuildRequires: systemd (`issue#13860 `_, `pr#6692 `_, Nathan Cutler) * ceph.spec.in: add copyright notice (`issue#14694 `_, `pr#7569 `_, Nathan Cutler) * ceph.spec.in: add license declaration (`pr#7574 `_, Nathan Cutler) * ceph.spec.in: disable lttng and babeltrace explicitly (`issue#14844 `_, `pr#7857 `_, Kefu Chai) * ceph.spec.in: do not install Ceph RA on systemd platforms (`issue#14828 `_, `pr#7894 `_, Nathan Cutler) * ceph.spec.in: fix openldap and openssl build dependencies for SUSE (`issue#15138 `_, `pr#8120 `_, Nathan Cutler) * ceph.spec.in: limit _smp_mflags when lowmem_builder is set in SUSE's OBS (`issue#13858 `_, `pr#6691 `_, Nathan Cutler) * ceph.spec.in: terminate if statement in %pre scriptlet (`pr#8628 `_, Nathan Cutler) * ceph_test_libcephfs: tolerate duplicated entries in readdir (`issue#14377 `_, `pr#7246 `_, Yan, Zheng) * ceph_test_msgr: reduce test size to fix memory size (`pr#8127 `_, Haomai Wang) * ceph_test_msgr: Use send_message instead of keepalive to wakeup connection (`pr#6605 `_, Haomai Wang) * ceph_test_rados: make long name ~300 chars, (not ~800) (`pr#8600 `_, Sage Weil) * ceph_test_rados_misc: shorten mount timeout (`pr#8209 `_, Sage Weil) * check-generated.sh: can't source bash from sh (`pr#8521 `_, Michal Jarzabek) * cleanup (`pr#8058 `_, Yehuda Sadeh, Orit Wasserman) * cleanup: remove misc dead code (`pr#7201 `_, Erwan Velu) * client: a better check for MDS availability (`pr#6253 `_, John Spray) * client: add option to control how directory size is calculated (`pr#7323 `_, Yan, Zheng) * client: avoid creating orphan object in Client::check_pool_perm() (`issue#13782 `_, `pr#6603 `_, Yan, Zheng) * client: check if Fh is readable when processing a read (`issue#11517 `_, `pr#7209 `_, Yan, Zheng) * client: close mds sessions in shutdown() (`pr#6269 `_, John Spray) * client: don't invalidate page cache when inode is no longer used (`pr#6380 `_, Yan, Zheng) * client: don't mark_down on command reply (`pr#6204 `_, John Spray) * client: drop prefix from ints (`pr#6275 `_, John Coyle) * client: flush kernel pagecache before creating snapshot (`issue#10436 `_, `pr#7495 `_, Yan, Zheng) * client: modify a word in log (`pr#6906 `_, YongQiang He) * client: properly trim unlinked inode (`issue#13903 `_, `pr#7297 `_, Yan, Zheng) * client: removed unused Mutex from MetaRequest (`pr#7655 `_, Greg Farnum) * client: sys/file.h includes for flock operations (`pr#6282 `_, John Coyle) * client: use null snapc to check pool permission (`issue#13714 `_, `pr#6497 `_, Yan, Zheng) * cls/cls_rbd.cc: fix misused metadata_name_from_key (`issue#13922 `_, `pr#6661 `_, Xiaoxi Chen) * cls/cls_rbd: pass string by reference (`pr#7232 `_, Jeffrey Lu) * cls_hello: Fix grammatical error in description comment (`pr#7951 `_, Brad Hubbard) * cls_journal: fix -EEXIST checking (`pr#8413 `_, runsisi) * cls_rbd: add guards for error cases (`issue#14316 `_, `issue#14317 `_, `pr#7165 `_, xie xingguo) * cls_rbd: change object_map_update to return 0 on success, add logging (`pr#6467 `_, Douglas Fuller) * cls_rbd: enable object map checksums for object_map_save (`issue#14280 `_, `pr#7149 `_, Douglas Fuller) * cls_rbd: fix -EEXIST checking in cls::rbd::image_set (`pr#8371 `_, runsisi) * cls_rbd: fix the test for ceph-dencoder (`pr#7793 `_, Kefu Chai) * cls_rbd: mirror_image_list should return global image id (`pr#8297 `_, Jason Dillaman) * cls_rbd: mirroring directory (`issue#14419 `_, `pr#7620 `_, Josh Durgin) * cls_rbd: pass WILLNEED fadvise flags during object map update (`issue#15332 `_, `pr#8380 `_, Jason Dillaman) * cls_rbd: protect against excessively large object maps (`issue#15121 `_, `pr#8099 `_, Jason Dillaman) * cls_rbd: read_peers: update last_read on next cls_cxx_map_get_vals (`pr#8374 `_, Mykola Golub) * cls/rgw: fix FTBFS (`pr#8142 `_, Kefu Chai) * cls/rgw: fix use of timespan (`issue#15181 `_, `pr#8212 `_, Yehuda Sadeh) * cmake: add common/fs_types.cc to libcommon (`pr#7898 `_, Orit Wasserman) * cmake: Add common/PluginRegistry.cc to CMakeLists.txt (`pr#6805 `_, Pete Zaitcev) * cmake: Added new unittests to make check (`pr#7572 `_, Ali Maredia) * cmake: Add ENABLE_GIT_VERSION to avoid rebuilding (`pr#7171 `_, Kefu Chai) * cmake: add ErasureCode.cc to jerasure plugins (`pr#7808 `_, Casey Bodley) * cmake: add FindOpenSSL.cmake (`pr#8106 `_, Marcus Watts, Matt Benjamin) * cmake: add KernelDevice.cc to libos_srcs (`pr#7507 `_, Kefu Chai) * cmake: add missing check for HAVE_EXECINFO_H (`pr#7270 `_, Casey Bodley) * cmake: add missing librbd image_watcher sources (`issue#14823 `_, `pr#7717 `_, Casey Bodley) * cmake: add missing librbd/MirrorWatcher.cc and librd/ObjectWatcher.cc (`pr#8399 `_, Orit Wasserman) * cmake: add nss as a suffix for pk11pub.h (`pr#6556 `_, Samuel Just) * cmake: add rgw_basic_types.cc to librgw.a (`pr#6786 `_, Orit Wasserman) * cmake: add StandardPolicy.cc to librbd (`pr#8368 `_, Kefu Chai) * cmake: add TracepointProvider.cc to libcommon (`pr#6823 `_, Orit Wasserman) * cmake: avoid false-positive LDAP header detect (`pr#8100 `_, Matt Benjamin) * cmake: Build cython modules and change paths to bin/, lib/ (`pr#8351 `_, John Spray, Ali Maredia) * cmake: check for libsnappy in default path also (`pr#7366 `_, Kefu Chai) * cmake: cleanups and more features from automake (`pr#7103 `_, Casey Bodley, Ali Maredia) * cmake: define STRERROR_R_CHAR_P for GNU-specific strerror_r (`pr#6751 `_, Ilya Dryomov) * cmake: detect bzip2 and lz4 (`pr#7126 `_, Kefu Chai) * cmake: feb5 (`pr#7541 `_, Matt Benjamin) * cmake: fix build with bluestore (`pr#7099 `_, John Spray) * cmake: fix files list (`pr#6539 `_, Yehuda Sadeh) * cmake: fix mrun to handle cmake build structure (`pr#8237 `_, Orit Wasserman) * cmake: fix paths to various EC source files (`pr#7748 `_, Ali Maredia, Matt Benjamin) * cmake: fix the build of test_rados_api_list (`pr#8438 `_, Kefu Chai) * cmake: fix the build of tests (`pr#7523 `_, Kefu Chai) * cmake: fix the build on trusty (`pr#7249 `_, Kefu Chai) * cmake: For CMake version <= 2.8.11, use LINK_PRIVATE and LINK_PUBLIC (`pr#7474 `_, Tao Chang) * cmake: For CMake version <= 2.8.11, use LINK_PRIVATE (`pr#8422 `_, Haomai Wang) * cmake: let ceph-client-debug link with tcmalloc (`pr#7314 `_, Kefu Chai) * cmake: librbd and libjournal build fixes (`pr#6557 `_, Ilya Dryomov) * cmake: made rocksdb an imported library (`pr#7131 `_, Ali Maredia) * cmake: make check (`pr#7912 `_, Kefu Chai, Ali Maredia) * cmake: no need to run configure from run-cmake-check.sh (`pr#6959 `_, Orit Wasserman) * cmake (`pr#7849 `_, Ali Maredia) * cmake/pybind: fix include paths for cephfs (`pr#8381 `_, Josh Durgin) * cmake: Remove duplicate find_package libcurl line. (`pr#7972 `_, Brad Hubbard) * cmake: support ccache via a WITH_CCACHE build option (`pr#6875 `_, John Coyle) * cmake: test_build_libcephfs needs ${ALLOC_LIBS} (`pr#7300 `_, Ali Maredia) * cmake: update for recent librbd changes (`pr#6715 `_, John Spray) * cmake: update for recent rbd changes (`pr#6818 `_, Mykola Golub) * cmake: Use uname instead of arch. (`pr#6358 `_, John Coyle) * coc: fix typo in the apt-get command (`pr#6659 `_, Chris Holcombe) * common: add generic plugin infrastructure (`pr#6696 `_, Sage Weil) * common: add latency perf counter for finisher (`pr#6175 `_, Xinze Chi) * common/address_help.cc: fix the leak in entity_addr_from_url() (`issue#14132 `_, `pr#6987 `_, Qiankun Zheng) * common: add thread names (`pr#5882 `_, Igor Podoski) * common: add zlib compression plugin (`pr#7437 `_, Alyona Kiseleva, Kiseleva Alyona) * common: admin socket commands for tcmalloc heap get/set operations (`pr#7512 `_, Samuel Just) * common: ake ceph_time clocks work under BSD (`pr#7340 `_, Adam C. Emerson) * common: allow enable/disable of optracker at runtime (`pr#5168 `_, Jianpeng Ma) * common: Allow OPT_INT settings with negative values (`issue#13829 `_, `pr#7390 `_, Brad Hubbard, Kefu Chai) * common: assert: abort() rather than throw (`pr#6804 `_, Adam C. Emerson) * common: assert: __STRING macro is not defined by musl libc. (`pr#6210 `_, John Coyle) * common/bit_vector: use hard-coded value for block size (`issue#14747 `_, `pr#7610 `_, Jason Dillaman) * common: buffer: add cached_crc and cached_crc_adjust counts to perf dump (`pr#6535 `_, Ning Yao) * common: buffer/assert minor fixes (`pr#6990 `_, Matt Benjamin) * common: buffer: put a guard for stat() syscall during read_file (`pr#7956 `_, xie xingguo) * common: buffer: remove unneeded list destructor (`pr#6456 `_, Michal Jarzabek) * common/buffer: replace RWLock with spinlocks (`pr#7294 `_, Piotr Dałek) * common/ceph_context.cc:fix order of initialisers (`pr#6838 `_, Michal Jarzabek) * common: change the type of counter total/unhealthy_workers (`pr#7254 `_, Guang Yang) * common: default cluster name to config file prefix (`pr#7364 `_, Javen Wu) * common: Deprecate or free up a bunch of feature bits (`pr#8214 `_, Samuel Just) * common: Do not use non-portable constants in mutex_debug (`pr#7766 `_, Adam C. Emerson) * common: don't reverse hobject_t hash bits when zero (`pr#6653 `_, Piotr Dałek) * common: fix OpTracker age histogram calculation (`pr#5065 `_, Zhiqiang Wang) * common: fix race during optracker switches between enabled/disabled mode (`pr#8330 `_, xie xingguo) * common: fix reset max in Throttle using perf reset command (`issue#13517 `_, `pr#6300 `_, Xinze Chi) * common: fix time_t cast in decode (`issue#15330 `_, `pr#8419 `_, Adam C. Emerson) * common/Formatter: avoid newline if there is no output (`pr#5351 `_, Aran85) * common: improve shared_cache and simple_cache efficiency with hash table (`pr#6909 `_, Ning Yao) * common/lockdep: increase max lock names (`pr#6961 `_, Sage Weil) * common: log: Assign LOG_DEBUG priority to syslog calls (`issue#13993 `_, `pr#6815 `_, Brad Hubbard) * common: log: predict log message buffer allocation size (`pr#6641 `_, Adam Kupczyk) * common/MemoryModel: Added explicit feature check for mallinfo(). (`pr#6252 `_, John Coyle) * common: new timekeeping common code, and Objecter conversion (`pr#5782 `_, Adam C. Emerson) * common/obj_bencher.cc: bump the precision of bandwidth field (`pr#8021 `_, Piotr Dałek) * common/obj_bencher.cc: faster object name generation (`pr#7863 `_, Piotr Dałek) * common/obj_bencher.cc: fix verification crashing when there's no objects (`pr#5853 `_, Piotr Dałek) * common/obj_bencher.cc: make verify error fatal (`issue#14971 `_, `pr#7897 `_, Piotr Dałek) * common: optimize debug logging code (`pr#6441 `_, Adam Kupczyk) * common: optimize debug logging (`pr#6307 `_, Adam Kupczyk) * common/page.cc: _page_mask has too many bits (`pr#7588 `_, Dan Mick) * common: perf counter for bufferlist history total alloc (`pr#6198 `_, Xinze Chi) * common: reduce CPU usage by making stringstream in stringify function thread local (`pr#6543 `_, Evgeniy Firsov) * common: re-enable backtrace support (`pr#6771 `_, Jason Dillaman) * common: set thread name from correct thread (`pr#7845 `_, Igor Podoski) * common: signal_handler: added support for using reentrant strsignal() implementations vs. sys_siglist[] (`pr#6796 `_, John Coyle) * common: snappy decompressor may assert when handling segmented input bufferlist (`issue#14400 `_, `pr#7268 `_, Igor Fedotov) * common: string.h: return type from str_len(...) need not be const (`pr#7679 `_, Matt Benjamin) * common/str_map: cleanup: replaced get_str_map() function overloading by using default parameters for delimiters (`pr#7266 `_, Sahithi R V) * common/strtol.cc: fix the coverity warnings (`pr#7967 `_, Kefu Chai) * common: SubProcess: Avoid buffer corruption when calling err() (`issue#15011 `_, `pr#8054 `_, Erwan Velu) * common: SubProcess: fix multiple definition bug (`pr#6790 `_, Yunchuan Wen) * common: Thread: move copy constructor and assignment op (`pr#5133 `_, Michal Jarzabek) * common: time: have skewing-now call non-skewing now (`pr#7466 `_, Adam C. Emerson) * common/TrackedOp: fix inaccurate counting for slow requests (`issue#14804 `_, `pr#7690 `_, xie xingguo) * common: unit test for interval_set implementations (`pr#6 `_, Igor Fedotov) * common: use namespace instead of subclasses for buffer (`pr#6686 `_, Michal Jarzabek) * common: various fixes from SCA runs (`pr#7680 `_, Danny Al-Gaaf) * common: WeightedPriorityQueue Re-Add Round Robin for Classes (`pr#7984 `_, Robert LeBlanc) * common: WorkQueue: new PointerWQ base class for ContextWQ (`issue#13636 `_, `pr#6525 `_, Jason Dillaman) * compat: use prefixed typeof extension (`pr#6216 `_, John Coyle) * config: add $data_dir/config to config search path (`pr#7377 `_, Sage Weil) * config: complains when a setting is not tracked (`issue#11692 `_, `pr#7085 `_, Kefu Chai) * config: fix osd_crush_initial_weight (`pr#7975 `_, You Ji) * config: increase default async op threads (`pr#7802 `_, Piotr Dałek) * config_opts: disable filestore throttle soft backoff by default (`pr#8265 `_, Samuel Just) * configure.ac: boost_iostreams is required, not optional (`pr#7816 `_, Hector Martin) * configure.ac: macro fix (`pr#6769 `_, Igor Podoski) * configure.ac: make "--with-librocksdb-static" default to 'check' (`issue#14463 `_, `pr#7317 `_, Dan Mick) * configure.ac: update help strings for cython (`pr#7856 `_, Josh Durgin) * configure: Add -D_LARGEFILE64_SOURCE to Linux build. (`pr#8402 `_, Ira Cooper) * configure: detect bz2 and lz4 (`issue#13850 `_, `issue#13981 `_, `pr#7030 `_, Kefu Chai) * correct radosgw-admin command (`pr#7006 `_, YankunLi) * crush: add chooseleaf_stable tunable (`pr#6572 `_, Sangdi Xu, Sage Weil) * crush: add safety assert (`issue#14496 `_, `pr#7344 `_, songbaisen) * crush: clean up whitespace removal (`issue#14302 `_, `pr#7157 `_, songbaisen) * crush/CrushTester: check for overlapped rules (`pr#7139 `_, Kefu Chai) * crush/CrushTester: workaround a bug in boost::icl (`pr#7560 `_, Kefu Chai) * crush: fix cli tests for new crush tunables (`pr#8107 `_, Sage Weil) * crush: fix error log (`pr#8430 `_, Wei Jin) * crush: fix typo (`pr#8518 `_, Wei Jin) * crush: reply quickly from get_immediate_parent (`issue#14334 `_, `pr#7181 `_, song baisen) * crushtool: Don't crash when called on a file that isn't a crushmap (`issue#8286 `_, `pr#8038 `_, Brad Hubbard) * crushtool: improve usage/tip messages (`pr#7142 `_, xie xingguo) * crushtool: set type 0 name "device" for --build option (`pr#6824 `_, Sangdi Xu) * crush: update tunable docs. change default profile to jewel (`pr#7964 `_, Sage Weil) * crush: validate bucket id before indexing buckets array (`issue#13477 `_, `pr#6246 `_, Sage Weil) * debian/changelog: Remove stray 'v' in version (`pr#7936 `_, Dan Mick) * debian/changelog: Remove stray 'v' in version (`pr#7938 `_, Dan Mick) * debian: include cpio in build-requiers (`pr#7533 `_, Rémi BUISSON) * debian: package librgw_file* tests (`pr#7930 `_, Ken Dreyer) * debian: packaging fixes for jewel (`pr#7807 `_, Ken Dreyer, Ali Maredia) * debian/rpm split servers (`issue#10587 `_, `pr#7746 `_, Ken Dreyer) * debian/rules: put init-ceph in /etc/init.d/ceph, not ceph-base (`issue#15329 `_, `pr#8406 `_, Dan Mick) * debian: start ceph-mon-all and ceph-osd-all on package install (`issue#15098 `_, `pr#8617 `_, Sage Weil) * doc: add ceph-detect-init(8) source to dist tarball (`pr#7933 `_, Ken Dreyer) * doc: add cinder backend section to rbd-openstack.rst (`pr#7923 `_, RustShen) * doc: adding "--allow-shrink" in decreasing the size of the rbd block to distinguish from the increasing option (`pr#7020 `_, Yehua) * doc: Adding documentation on how to use new dynamic throttle scheme (`pr#8069 `_, Somnath Roy) * doc: add orphans commands to radosgw-admin(8) (`issue#14637 `_, `pr#7518 `_, Ken Dreyer) * doc: add v0.80.11 to the release timeline (`pr#6658 `_, Loic Dachary) * doc: admin/build-doc: add lxml dependencies on debian (`pr#6610 `_, Ken Dreyer) * doc: admin/build-doc: make paths absolute (`pr#7119 `_, Dan Mick) * doc: amend Fixes instructions in SubmittingPatches (`pr#8312 `_, Nathan Cutler) * doc: amend the rados.8 (`pr#7251 `_, Kefu Chai) * doc/architecture.rst: remove redundant word "across" (`pr#8179 `_, Zhao Junwang) * doc/cephfs/posix: update (`pr#6922 `_, Sage Weil) * doc: Clarify usage on starting single osd/mds/mon. (`pr#7641 `_, Patrick Donnelly) * doc: CodingStyle: fix broken URLs (`pr#6733 `_, Kefu Chai) * doc: correct typo 'restared' to 'restarted' (`pr#6734 `_, Yilong Zhao) * doc: detailed description of bugfixing workflow (`pr#7941 `_, Nathan Cutler) * doc/dev: add "Deploy a cluster for manual testing" section (`issue#15218 `_, `pr#8228 `_, Nathan Cutler) * doc/dev: add section on interrupting a running suite (`pr#8116 `_, Nathan Cutler) * doc/dev: continue writing Testing in the cloud chapter (`pr#7960 `_, Nathan Cutler) * doc: dev: document ceph-qa-suite (`pr#6955 `_, Loic Dachary) * doc/dev/index: refactor/reorg (`pr#6792 `_, Nathan Cutler) * doc/dev/index.rst: begin writing Contributing to Ceph (`pr#6727 `_, Nathan Cutler) * doc/dev/index.rst: fix headings (`pr#6780 `_, Nathan Cutler) * doc/dev: integrate testing into the narrative (`pr#7946 `_, Nathan Cutler) * doc: dev: introduction to tests (`pr#6910 `_, Loic Dachary) * doc/dev: various refinements (`pr#7954 `_, Nathan Cutler) * doc: document "readforward" and "readproxy" cache mode (`pr#7023 `_, Kefu Chai) * doc: download GPG key from download.ceph.com (`issue#13603 `_, `pr#6384 `_, Ken Dreyer) * doc: draft notes for jewel (`pr#8211 `_, Loic Dachary, Sage Weil) * doc: file must be empty when writing layout fields of file use "setfattr" (`pr#6848 `_, Cilang Zhao) * doc: fix 0.94.4 and 0.94.5 ordering (`pr#7763 `_, Loic Dachary) * doc: fix dependencies (`pr#8587 `_, Etienne Menguy) * doc: Fixed incorrect name of a "List Multipart Upload Parts" Response Entity (`issue#14003 `_, `pr#6829 `_, Lenz Grimmer) * doc: Fixes a CRUSH map step take argument (`pr#7327 `_, Ivan Grcic) * doc: Fixes a spelling error (`pr#6705 `_, Jeremy Qian) * doc: Fixes headline different font size and type (`pr#8328 `_, scienceluo) * doc: fixing image in section ERASURE CODING (`pr#7298 `_, Rachana Patel) * doc: fix misleading configuration guide on cache tiering (`pr#7000 `_, Yuan Zhou) * doc: fix "mon osd down out subtree limit" option name (`pr#7164 `_, François Lafont) * doc: fix outdated content in cache tier (`pr#6272 `_, Yuan Zhou) * doc: fix S3 C# example (`pr#7027 `_, Dunrong Huang) * doc: fix typo, duplicated content etc. for Jewel release notes (`pr#8342 `_, xie xingguo) * doc: fix typo in cephfs/quota (`pr#6745 `_, Drunkard Zhang) * doc: fix typo, indention etc. (`pr#7829 `_, xie xingguo) * doc: fix typo in developer guide (`pr#6943 `_, Nathan Cutler) * doc: fix typo (`pr#7004 `_, tianqing) * doc: fix wrong type of hyphen (`pr#8252 `_, xie xingguo) * doc: initial draft of RBD mirroring admin documentation (`issue#15041 `_, `pr#8169 `_, Jason Dillaman) * doc: INSTALL redirect to online documentation (`pr#6749 `_, Loic Dachary) * doc: little improvements for troubleshooting scrub issues (`pr#6827 `_, Mykola Golub) * doc: Modified a note section in rbd-snapshot doc. (`pr#6908 `_, Nilamdyuti Goswami) * doc: note that cephfs auth stuff is new in jewel (`pr#6858 `_, John Spray) * doc: osd-config Add Configuration Options for op queue. (`pr#7837 `_, Robert LeBlanc) * doc: osd: s/schedued/scheduled/ (`pr#6872 `_, Loic Dachary) * doc/rados/api/librados-intro.rst: fix typo (`pr#7879 `_, xie xingguo) * doc/rados/operations/crush: fix the formatting (`pr#8306 `_, Kefu Chai) * doc: recommend against ext4 (`pr#8556 `_, Sage Weil) * doc: reinstate accidentally removed section header (`pr#8569 `_, Josh Durgin) * doc: release-notes: draft v0.80.11 release notes (`pr#6374 `_, Loic Dachary) * doc: release-notes: draft v10.0.0 release notes (`pr#6666 `_, Loic Dachary) * doc/release-notes: fix indents (`pr#8345 `_, Kefu Chai) * doc/release-notes: jewel updates (`pr#8590 `_, Nathan Cutler, Sage Weil) * doc/release-notes: v9.1.0 (`pr#6281 `_, Loic Dachary) * doc/releases-notes: fix build error (`pr#6483 `_, Kefu Chai) * doc: Remove Ceph Monitors do lots of fsync() (`issue#15288 `_, `pr#8327 `_, Vikhyat Umrao) * doc: remove redundant space in ceph-authtool/monmaptool doc (`pr#7244 `_, Jiaying Ren) * doc: remove toctree items under Create CephFS (`pr#6241 `_, Jevon Qiao) * doc: remove unnecessary period in headline (`pr#6775 `_, Marc Koderer) * doc: rename the "Create a Ceph User" section and add verbage about… (`issue#13502 `_, `pr#6297 `_, ritz303) * doc: revise SubmittingPatches (`pr#7292 `_, Kefu Chai) * doc: rgw adding a very basic multisite doc (`issue#15247 `_, `pr#8281 `_, Abhishek Lekshmanan) * doc: rgw admin uses "region list" not "regions list" (`pr#8517 `_, Kris Jurka) * doc: rgw explain keystone's verify ssl switch (`pr#7862 `_, Abhishek Lekshmanan) * doc: rgw multisite, add pools section & minor cosmetic improvements (`pr#8653 `_, Abhishek Lekshmanan) * doc: rgw: port changes from downstream to upstream (`pr#7264 `_, Bara Ancincova) * doc: rgw_region_root_pool option should be in [global] (`issue#15244 `_, `pr#8271 `_, Vikhyat Umrao) * doc: rst style fix for pools document (`pr#6816 `_, Drunkard Zhang) * doc: script and guidelines for mirroring Ceph (`pr#7384 `_, Wido den Hollander) * docs: Fix styling of newly added mirror docs (`pr#6127 `_, Wido den Hollander) * doc: small fixes (`pr#7813 `_, xiexingguo) * doc: standardize @param (not @parma, @parmam, @params) (`pr#7714 `_, Nathan Cutler) * doc: SubmittingPatches: there is no next; only jewel (`pr#6811 `_, Nathan Cutler) * doc, tests: update all http://ceph.com/ to download.ceph.com (`pr#6435 `_, Alfredo Deza) * doc: Update ceph-disk manual page with new feature deactivate/destroy. (`pr#6637 `_, Vicente Cheng) * doc: Updated CloudStack RBD documentation (`pr#8308 `_, Wido den Hollander) * doc: update doc for with new pool settings (`pr#5951 `_, Guang Yang) * doc: Updated the rados command man page to include the --run-name opt… (`issue#12899 `_, `pr#5900 `_, ritz303) * doc: update infernalis release notes (`pr#6575 `_, vasukulkarni) * doc: Update list of admin/build-doc dependencies (`issue#14070 `_, `pr#6934 `_, Nathan Cutler) * doc: update radosgw-admin example (`pr#6256 `_, YankunLi) * doc: update the OS recommendations for newer Ceph releases (`pr#6355 `_, ritz303) * doc: use 'ceph auth get-or-create' for creating RGW keyring (`pr#6930 `_, Wido den Hollander) * doc: very basic doc on mstart (`pr#8207 `_, Abhishek Lekshmanan) * drop envz.h includes (`pr#6285 `_, John Coyle) * fix FTBFS introduced by d0af316 (`pr#7792 `_, Kefu Chai) * fix: use right init_flags to finish CephContext (`pr#6549 `_, Yunchuan Wen) * fs: be more careful about the "mds setmap" command to prevent breakage (`issue#14380 `_, `pr#7262 `_, Yan, Zheng) * ghobject_t: use # instead of ! as a separator (`pr#8055 `_, Sage Weil) * global: do not start two daemons with a single pid-file (`issue#13422 `_, `pr#7075 `_, shun song) * global: do not start two daemons with a single pid-file (part 2) (`issue#13422 `_, `pr#7463 `_, Loic Dachary) * global/global_init: expand metavariables in setuser_match_path (`issue#15365 `_, `pr#8433 `_, Sage Weil) * global/signal_handler: print thread name in signal handler (`pr#8177 `_, Jianpeng Ma) * helgrind: additional race conditionslibrbd: journal replay should honor inter-event dependencies (`pr#7274 `_, Jason Dillaman) * helgrind: fix real (and imaginary) race conditions (`issue#14163 `_, `pr#7208 `_, Jason Dillaman) * include/encoding: do not try to be clever with list encoding (`pr#7913 `_, Sage Weil) * init-ceph: do umount when the path exists. (`pr#6866 `_, Xiaoxi Chen) * init-ceph.in: allow case-insensitive true in `osd crush update on start' (`pr#7943 `_, Eric Cook) * init-ceph.in: skip ceph-disk if it is not present (`issue#10587 `_, `pr#7286 `_, Ken Dreyer) * init-ceph: use getopt to make option processing more flexible (`issue#3015 `_, `pr#6089 `_, Nathan Cutler) * journal: async methods to (un)register and update client (`pr#7832 `_, Mykola Golub) * journal: disconnect watch after watch error (`issue#14168 `_, `pr#7113 `_, Jason Dillaman) * journal: fire replay complete event after reading last object (`issue#13924 `_, `pr#6762 `_, Jason Dillaman) * journal: fix final result for JournalTrimmer::C_RemoveSet (`pr#8516 `_, runsisi) * journal: fix race condition between Future and journal shutdown (`issue#15364 `_, `pr#8477 `_, Jason Dillaman) * journal: flush commit position on metadata shutdown (`pr#7385 `_, Mykola Golub) * journal: improve commit position tracking (`pr#7776 `_, Jason Dillaman) * journal: incremental improvements and fixes (`pr#6552 `_, Mykola Golub) * journal: possible race condition during live replay (`issue#15352 `_, `pr#8494 `_, Jason Dillaman) * journal: prevent race injecting new records into overflowed object (`issue#15202 `_, `pr#8220 `_, Jason Dillaman) * journal: reset commit_position_task_ctx pointer after task complete (`pr#7480 `_, Mykola Golub) * journal: re-use common threads between journalers (`pr#7906 `_, Jason Dillaman) * journal: support replaying beyond skipped splay objects (`pr#6687 `_, Jason Dillaman) * krbd: remove deprecated --quiet param from udevadm (`issue#13560 `_, `pr#6394 `_, Jason Dillaman) * kv: fix bug in kv key optimization (`pr#6511 `_, Sage Weil) * kv: implement value_as_ptr() and use it in .get() (`pr#7052 `_, Piotr Dałek) * kv/KineticStore: fix broken split_key (`pr#6574 `_, Haomai Wang) * kv: optimize and clean up internal key/value interface (`pr#6312 `_, Piotr Dałek, Sage Weil) * libcephfs: fix python tests and fix getcwd on missing dir (`pr#7901 `_, John Spray) * libcephfs: Improve portability by replacing loff_t type usage with off_t (`pr#6301 `_, John Coyle) * libcephfs: only check file offset on glibc platforms (`pr#6288 `_, John Coyle) * libcephfs: update LIBCEPHFS_VERSION to indicate the interface was changed (`pr#7551 `_, Jevon Qiao) * librados: add c++ style osd/pg command interface (`pr#6893 `_, Yunchuan Wen) * librados: aix gcc librados port (`pr#6675 `_, Rohan Mars) * librados: avoid malloc(0) (which can return NULL on some platforms) (`issue#13944 `_, `pr#6779 `_, Dan Mick) * librados: cancel aio notification linger op upon completion (`pr#8102 `_, Jason Dillaman) * librados: check connection state in rados_monitor_log (`issue#14499 `_, `pr#7350 `_, David Disseldorp) * librados: clean up Objecter.h (`pr#6731 `_, Jie Wang) * librados: detect laggy ops with objecter_timeout, not osd_timeout (`pr#7629 `_, Greg Farnum) * librados: do cleanup (`pr#6488 `_, xie xingguo) * librados: do not clear handle for aio_watch() (`pr#7771 `_, xie xingguo) * librados: fix examples/librados/Makefile error. (`pr#6320 `_, You Ji) * librados: fix pool alignment API overflow issue (`issue#13715 `_, `pr#6489 `_, xie xingguo) * librados: fix potential null pointer access when do pool_snap_list (`issue#13639 `_, `pr#6422 `_, xie xingguo) * librados: fix PromoteOn2ndRead test for EC (`pr#6373 `_, Sage Weil) * librados: fix rare race where pool op callback may hang forever (`issue#13642 `_, `pr#6426 `_, xie xingguo) * librados: fix several flaws introduced by the enumeration_objects API (`issue#14299 `_, `issue#14301 `_, `issue#14300 `_, `pr#7156 `_, xie xingguo) * librados: fix test failure with new aio watch/unwatch API (`pr#7824 `_, Jason Dillaman) * librados: implement async watch/unwatch (`pr#7649 `_, Haomai Wang) * librados: include/rados/librados.h: fix typo (`pr#6741 `_, Nathan Cutler) * librados: init crush_location from config file. (`issue#13473 `_, `pr#6243 `_, Wei Luo) * librados: mix lock cycle (un)registering asok commands (`pr#7581 `_, John Spray) * librados: move to c++11 concurrency types (`pr#5931 `_, Adam C. Emerson) * librados: new style (sharded) object listing (`pr#6405 `_, John Spray, Sage Weil) * librados: potential null pointer access in list_(n)objects (`issue#13822 `_, `pr#6639 `_, xie xingguo) * librados: race condition on aio_notify completion handling (`pr#7864 `_, Jason Dillaman) * librados: remove duplicate definitions for rados pool_stat_t and cluster_stat_t (`pr#7330 `_, Igor Fedotov) * librados: Revert "rados: Add new field flags for ceph_osd_op.copy_get." (`pr#8486 `_, Sage Weil) * librados: shutdown finisher in a more graceful way (`pr#7519 `_, xie xingguo) * librados: Solaris port (`pr#6416 `_, Rohan Mars) * librados: stat2 with higher time precision (`pr#7915 `_, Yehuda Sadeh, Matt Benjamin) * librados: Striper: Fix incorrect push_front -> append_zero change (`pr#7578 `_, Haomai Wang) * librados_test_stub: protect against notify/unwatch race (`pr#7540 `_, Jason Dillaman) * librados: wrongly passed in argument for stat command (`issue#13703 `_, `pr#6476 `_, xie xingguo) * librbd: allocate new journal tag after acquiring exclusive lock (`pr#7884 `_, Jason Dillaman) * librbd: API: async open and close (`issue#14264 `_, `pr#7259 `_, Mykola Golub) * librbd: automatically flush IO after blocking write operations (`issue#13913 `_, `pr#6742 `_, Jason Dillaman) * librbd: Avoid create two threads per image (`pr#7400 `_, Haomai Wang) * librbd: avoid throwing error if mirroring is unsupported (`pr#8417 `_, Jason Dillaman) * librbd: better handling of exclusive lock transition period (`pr#7204 `_, Jason Dillaman) * librbd: block maintenance ops until after journal is ready (`issue#14510 `_, `pr#7382 `_, Jason Dillaman) * librbd: block read requests until journal replayed (`pr#7627 `_, Jason Dillaman) * librbd: check for presence of journal before attempting to remove (`issue#13912 `_, `pr#6737 `_, Jason Dillaman) * librbd: clear error when older OSD doesn't support image flags (`issue#14122 `_, `pr#7035 `_, Jason Dillaman) * librbd: correct include guard in RenameRequest.h (`pr#7143 `_, Jason Dillaman) * librbd: correct issues discovered during teuthology testing (`issue#14108 `_, `issue#14107 `_, `pr#6974 `_, Jason Dillaman) * librbd: correct issues discovered via valgrind memcheck (`pr#8132 `_, Jason Dillaman) * librbd: correct issues discovered when cache is disabled (`issue#14123 `_, `pr#6979 `_, Jason Dillaman) * librbd: correct race conditions discovered during unit testing (`issue#14060 `_, `pr#6923 `_, Jason Dillaman) * librbd: deadlock while attempting to flush AIO requests (`issue#13726 `_, `pr#6508 `_, Jason Dillaman) * librbd: differentiate journal replay flush vs shut down (`pr#7698 `_, Jason Dillaman) * librbd: disable copy-on-read when not exclusive lock owner (`issue#14167 `_, `pr#7129 `_, Jason Dillaman) * librbd: disable image mirroring when image is removed (`issue#15265 `_, `pr#8375 `_, Ricardo Dias) * librbd: disallow unsafe rbd_op_threads values (`issue#15034 `_, `pr#8459 `_, Josh Durgin) * librbd: do not ignore self-managed snapshot release result (`issue#14170 `_, `pr#7043 `_, Jason Dillaman) * librbd: enable/disable image mirroring automatically for pool mode (`issue#15143 `_, `pr#8204 `_, Ricardo Dias) * librbd: ensure copy-on-read requests are complete prior to closing parent image (`pr#6740 `_, Jason Dillaman) * librbd: ensure librados callbacks are flushed prior to destroying (`issue#14092 `_, `pr#7040 `_, Jason Dillaman) * librbd: exit if parent's snap is gone during clone (`issue#14118 `_, `pr#6968 `_, xie xingguo) * librbd: fix enable objectmap feature issue (`issue#13558 `_, `pr#6339 `_, xinxin shu) * librbd: fix handling of proxied maintenance operations during shut down (`issue#15471 `_, `issue#15456 `_, `pr#8565 `_, Jason Dillaman) * librbd: fix internal handling of dynamic feature updates (`pr#7299 `_, Jason Dillaman) * librbd: fix journal iohint (`pr#6917 `_, Jianpeng Ma) * librbd: fix known test case race condition failures (`issue#13969 `_, `pr#6800 `_, Jason Dillaman) * librbd: fix merge-diff for >2GB diff-files (`issue#14030 `_, `pr#6889 `_, Yunchuan Wen) * librbd: fix potential memory leak (`issue#14332 `_, `issue#14333 `_, `pr#7174 `_, xie xingguo) * librbd: fix snap_exists API return code overflow (`issue#14129 `_, `pr#6986 `_, xie xingguo) * librbd: fix state machine race conditions during shut down (`pr#7761 `_, Jason Dillaman) * librbd: fix test case race condition for journaling ops (`pr#6877 `_, Jason Dillaman) * librbd: fix tracepoint parameter in diff_iterate (`pr#6892 `_, Yunchuan Wen) * librbd: flush and invalidate cache via admin socket (`issue#2468 `_, `pr#6453 `_, Mykola Golub) * librbd: handle unregistering the image watcher when disconnected (`pr#8094 `_, Jason Dillaman) * librbd: image refresh code paths converted to async state machines (`pr#6859 `_, Jason Dillaman) * librbd: include missing header for bool type (`pr#6798 `_, Mykola Golub) * librbd: initial collection of state machine unit tests (`pr#6703 `_, Jason Dillaman) * librbd: integrate journaling for maintenance operations (`pr#6625 `_, Jason Dillaman) * librbd: integrate journaling support for IO operations (`pr#6541 `_, Jason Dillaman) * librbd: integrate journal replay with fsx testing (`pr#7583 `_, Jason Dillaman) * librbd: IO deadlock when dynamically enabling/disabling features (`issue#15102 `_, `pr#8511 `_, Jason Dillaman) * librbd: journal framework for tracking exclusive lock transitions (`issue#13298 `_, `pr#7529 `_, Jason Dillaman) * librbd: journaling-related lock dependency cleanup (`pr#6777 `_, Jason Dillaman) * librbd: journal replay needs to support re-executing maintenance ops (`issue#14822 `_, `pr#7785 `_, Jason Dillaman) * librbd: journal replay should honor inter-event dependencies (`pr#7019 `_, Jason Dillaman) * librbd: journal shut down flush race condition (`issue#14434 `_, `pr#7302 `_, Jason Dillaman) * librbd: not necessary to hold owner_lock while releasing snap id (`issue#13914 `_, `pr#6736 `_, Jason Dillaman) * librbd: only send signal when AIO completions queue empty (`pr#6729 `_, Jianpeng Ma) * librbd: optionally validate new RBD pools for snapshot support (`issue#13633 `_, `pr#6925 `_, Jason Dillaman) * librbd: partial revert of commit 9b0e359 (`issue#13969 `_, `pr#6789 `_, Jason Dillaman) * librbd: perf counters might not be initialized on error (`issue#13740 `_, `pr#6523 `_, Jason Dillaman) * librbd: perf section name: use hyphen to separate components (`issue#13719 `_, `pr#6516 `_, Mykola Golub) * librbd: properly handle replay of snap remove RPC message (`issue#14164 `_, `pr#7042 `_, Jason Dillaman) * librbd: reduce mem copies to user-buffer during read (`pr#7548 `_, Jianpeng Ma) * librbd: reduce verbosity of common error condition logging (`issue#14234 `_, `pr#7114 `_, Jason Dillaman) * librbd: refresh image if required before replaying journal ops (`issue#14908 `_, `pr#7978 `_, Jason Dillaman) * librbd: remove canceled tasks from timer thread (`issue#14476 `_, `pr#7329 `_, Douglas Fuller) * librbd: remove duplicate read_only test in librbd::async_flatten (`pr#5856 `_, runsisi) * librbd: remove last synchronous librados calls from open/close state machine (`pr#7839 `_, Jason Dillaman) * librbd: replaying a journal op post-refresh requires locking (`pr#8028 `_, Jason Dillaman) * librbd: resize should only update image size within header (`issue#13674 `_, `pr#6447 `_, Jason Dillaman) * librbd: retrieve image name when opening by id (`pr#7736 `_, Mykola Golub) * librbd: return error if we fail to delete object_map head object (`issue#14098 `_, `pr#6958 `_, xie xingguo) * librbd: Revert "librbd: use task finisher per CephContext" (`issue#14780 `_, `pr#7667 `_, Josh Durgin) * librbd: send notifications for mirroring status updates (`pr#8355 `_, Jason Dillaman) * librbd: several race conditions discovered under single CPU environment (`pr#7653 `_, Jason Dillaman) * librbd: simplify IO method signatures for 32bit environments (`pr#6700 `_, Jason Dillaman) * librbd: small fixes for error messages and readahead counter (`issue#14127 `_, `pr#6983 `_, xie xingguo) * librbd: start perf counters after id is initialized (`issue#13720 `_, `pr#6494 `_, Mykola Golub) * librbd: support eventfd for AIO completion notifications (`pr#5465 `_, Haomai Wang) * librbd: truncate does not need to mark the object as existing in the object map (`issue#14789 `_, `pr#7772 `_, xinxin shu) * librbd: uninitialized state in snap remove state machine (`pr#6982 `_, Jason Dillaman) * librbd: update of mirror pool mode and mirror peer handling (`pr#7718 `_, Jason Dillaman) * librbd: use async librados notifications (`pr#7668 `_, Jason Dillaman) * log: do not repeat errors to stderr (`issue#14616 `_, `pr#7983 `_, Sage Weil) * log: fix stack overflow when flushing large log lines (`issue#14707 `_, `pr#7599 `_, Igor Fedotov) * log: segv in a portable way (`issue#14856 `_, `pr#7790 `_, Kefu Chai) * log: use delete[] (`pr#7904 `_, Sage Weil) * mailmap: Abhishek Lekshmanan affiliation (`pr#8615 `_, Abhishek Lekshmanan) * mailmap: add UMCloud affiliation (`pr#6820 `_, Jiaying Ren) * mailmap for 10.0.4 (`pr#7932 `_, Abhishek Lekshmanan) * mailmap: hange organization for Dongmao Zhang (`pr#7173 `_, Dongmao Zhang) * mailmap: Igor Podoski affiliation (`pr#7219 `_, Igor Podoski) * mailmap: Jewel updates (`pr#6750 `_, Abhishek Lekshmanan) * mailmap: modify member info (`pr#6468 `_, Xiaowei Chen) * mailmap: revise organization (`pr#6519 `_, Li Wang) * mailmap: Ubuntu Kylin name changed to Kylin Cloud (`pr#6532 `_, Loic Dachary) * mailmap: update .organizationmap (`pr#6565 `_, chenji-kael) * mailmap update (`pr#7210 `_, M Ranga Swami Reddy) * mailmap update (`pr#8522 `_, M Ranga Swami Reddy) * mailmap update (`pr#8608 `_, M Ranga Swami Reddy) * mailmap: updates for infernalis. (`pr#6495 `_, Yann Dupont) * mailmap: updates (`pr#6258 `_, M Ranga Swami Reddy) * mailmap: updates (`pr#6594 `_, chenji-kael) * mailmap updates (`pr#6992 `_, Loic Dachary) * mailmap updates (`pr#7189 `_, Loic Dachary) * mailmap updates (`pr#7528 `_, Yann Dupont) * mailmap updates (`pr#8256 `_, Loic Dachary) * mailmap: Xie Xingguo affiliation (`pr#6409 `_, Loic Dachary) * Makefile-env.am: set a default for CEPH_BUILD_VIRTUALENV (part 2) (`pr#8320 `_, Loic Dachary) * makefile: fix rbdmap manpage (`pr#8310 `_, Kefu Chai) * makefile: remove libedit from libclient.la (`pr#7284 `_, Kefu Chai) * makefiles: remove bz2-dev from dependencies (`issue#13981 `_, `pr#6939 `_, Piotr Dałek) * man/8/ceph-disk: fix formatting issue (`pr#8003 `_, Sage Weil) * man/8/ceph-disk: fix formatting issue (`pr#8012 `_, Sage Weil) * man: document listwatchers cmd in "rados" manpage (`pr#7021 `_, Kefu Chai) * mdsa: A few more snapshot fixes, mostly around snapshotted inode/dentry tracking (`pr#7798 `_, Yan, Zheng) * mds: Add cmapv to ESessions default constructor initializer list (`pr#8403 `_, John Coyle) * mds: add 'p' flag in auth caps to control setting pool in layout (`pr#6567 `_, John Spray) * mds: advance clientreplay when replying (`issue#14357 `_, `pr#7216 `_, John Spray) * mds: allow client to request caps when opening file (`issue#14360 `_, `pr#7952 `_, Yan, Zheng) * mds: Be more careful about directory fragmentation and scrubbing (`issue#15167 `_, `pr#8180 `_, Yan, Zheng) * mds, client: add namespace to file_layout_t (previously ceph_file_layout) (`pr#7098 `_, Yan, Zheng, Sage Weil) * mds, client: fix locking around handle_conf_change (`issue#14365 `_, `issue#14374 `_, `pr#7312 `_, John Spray) * mds: don't double-shutdown the timer when suiciding (`issue#14697 `_, `pr#7616 `_, Greg Farnum) * mds: expose state of recovery to status ASOK command (`issue#14146 `_, `pr#7068 `_, Yan, Zheng) * mds: filelock deadlock (`pr#7713 `_, Yan, Zheng) * mds: fix client capabilities during reconnect (client.XXXX isn't responding to mclientcaps(revoke)) (`issue#11482 `_, `pr#6432 `_, Yan, Zheng) * mds: fix client cap/message replay order on restart (`issue#14254 `_, `issue#13546 `_, `pr#7199 `_, Yan, Zheng) * mds: fix file_layout_t legacy encoding snafu (`pr#8455 `_, Sage Weil) * mds: fix fsmap decode (`pr#8063 `_, Greg Farnum) * mds: fix FSMap upgrade with daemons in the map (`pr#8073 `_, John Spray, Greg Farnum) * mds: fix inode_t::compare() (`issue#15038 `_, `pr#8014 `_, Yan, Zheng) * mds: fix scrub_path (`pr#6684 `_, John Spray) * mds: fix setvxattr (broken in a536d114) (`issue#14029 `_, `pr#6941 `_, John Spray) * mds: fix standby replay thread creation (`issue#14144 `_, `pr#7132 `_, John Spray) * mds: fix stray purging in 'stripe_count > 1' case (`issue#15050 `_, `pr#8040 `_, Yan, Zheng) * mds: FSmap decode fix (`pr#8604 `_, Greg Farnum) * mds/FSMap: use _ in key name (`pr#8614 `_, Sage Weil) * mds: function parameter 'df' should be passed by reference (`pr#7490 `_, Na Xie) * mds: implement snapshot rename (`pr#5645 `_, xinxin shu) * mds: judgment added to avoid the risk of visiting the NULL pointer (`pr#7358 `_, Kongming Wu) * mds: messages/MOSDOp: cast in assert to eliminate warnings (`issue#13625 `_, `pr#6414 `_, David Zafman) * mds: Minor fixes around data scan in some scenarios (`pr#8115 `_, Yan, Zheng) * mds: Multi-filesystem support (`issue#14952 `_, `pr#6953 `_, John Spray, Sage Weil) * mds: new filtered MDS tell commands for sessions (`pr#6180 `_, John Spray) * mds: properly set STATE_STRAY/STATE_ORPHAN for stray dentry/inode (`issue#13777 `_, `pr#6553 `_, Yan, Zheng) * mds: repair the command option "--hot-standby" (`pr#6454 `_, Wei Feng) * mds: ScrubStack and "tag path" command (`pr#5662 `_, Yan, Zheng, John Spray, Greg Farnum) * mds/Session: use projected parent for auth path check (`issue#13364 `_, `pr#6200 `_, Sage Weil) * mds: tear down connections from `tell` commands (`issue#14048 `_, `pr#6933 `_, John Spray) * mds: we should wait messenger when MDSDaemon suicide (`pr#6996 `_, Wei Feng) * messages/MOSDOp: clear reqid inc for v6 encoding (`issue#15230 `_, `pr#8299 `_, Sage Weil) * mirrors: Change contact e-mail address for se.ceph.com (`pr#8007 `_, Wido den Hollander) * mirrors: Updated scripts and documentation for mirrors (`pr#7847 `_, Wido den Hollander) * misc: use make_shared while creating shared_ptr (`pr#7769 `_, Somnath Roy) * mon: add an independent option for max election time (`pr#7245 `_, Sangdi Xu) * mon: add `osd blacklist clear` (`pr#6945 `_, John Spray) * mon: add RAW USED column to ceph df detail (`pr#7087 `_, Ruifeng Yang) * mon: block 'ceph osd pg-temp ...' if pg_temp update is already pending (`pr#6704 `_, Sage Weil) * mon: cleanup set-quota error msg (`pr#7371 `_, Abhishek Lekshmanan) * monclient: avoid key renew storm on clock skew (`issue#12065 `_, `pr#8258 `_, Alexey Sheplyakov) * mon: compact full epochs also (`issue#14537 `_, `pr#7396 `_, Kefu Chai) * mon: consider pool size when creating pool (`issue#14509 `_, `pr#7359 `_, songbaisen) * mon: consider the pool size when setting pool crush rule (`issue#14495 `_, `pr#7341 `_, song baisen) * mon: degrade a log message to level 2 (`pr#6929 `_, Kongming Wu) * mon: do not send useless pg_create messages for split pgs (`pr#8247 `_, Sage Weil) * mon: don't require OSD W for MRemoveSnaps (`issue#13777 `_, `pr#6601 `_, John Spray) * mon: drop useless rank init assignment (`issue#14508 `_, `pr#7321 `_, huanwen ren) * mon: enable 'mon osd prime pg temp' by default (`pr#7838 `_, Robert LeBlanc) * mon: fix calculation of %USED (`pr#7881 `_, Adam Kupczyk) * mon: fix ceph df pool available calculation for 0-weighted OSDs (`pr#6660 `_, Chengyuan Li) * mon: fix coding-style on PG related Monitor files (`pr#6881 `_, Wido den Hollander) * mon: fixes related to mondbstore->get() changes (`pr#6564 `_, Piotr Dałek) * mon: fix keyring permissions (`issue#14950 `_, `pr#7880 `_, Owen Synge) * mon: fix locking in preinit error paths (`issue#14473 `_, `pr#7353 `_, huanwen ren) * mon: fix monmap creation stamp (`pr#7459 `_, duanweijun) * mon: fix reuse of osd ids (clear osd info on osd deletion) (`issue#13988 `_, `pr#6900 `_, Loic Dachary, Sage Weil) * mon: fix routed_request_tids leak (`pr#6102 `_, Ning Yao) * mon: fix sync of config-key data (`pr#7363 `_, Xiaowei Chen) * mon: fix the can't change subscribe level bug in monitoring log (`pr#7031 `_, Zhiqiang Wang) * mon: go into ERR state if multiple PGs are stuck inactive (`issue#13923 `_, `pr#7253 `_, Wido den Hollander) * mon: initialize last_* timestamps on new pgs to creation time (`issue#14952 `_, `pr#7980 `_, Sage Weil) * mon: initialize recorded election epoch properly even when standalone (`issue#13627 `_, `pr#6407 `_, huanwen ren) * mon: make clock skew checks sane (`issue#14175 `_, `pr#7141 `_, Joao Eduardo Luis) * mon: mark_down_pgs in lockstep with pg_map's osdmap epoch (`pr#8208 `_, Sage Weil) * mon/MDSMonitor: add confirmation to "ceph mds rmfailed" (`issue#14379 `_, `pr#7248 `_, Yan, Zheng) * mon/MDSMonitor.cc: properly note beacon when health metrics changes (`issue#14684 `_, `pr#7757 `_, Yan, Zheng) * mon: modify a dout level in OSDMonitor.cc (`pr#6928 `_, Yongqiang He) * mon/MonClient: avoid null pointer error when configured incorrectly (`issue#14405 `_, `pr#7276 `_, Bo Cai) * mon/MonClient: fix shutdown race (`issue#13992 `_, `pr#8335 `_, Sage Weil) * mon/monitor: some clean up (`pr#7520 `_, huanwen ren) * mon: MonmapMonitor: don't expose uncommitted state to client (`pr#6854 `_, Joao Eduardo Luis) * mon/OSDMonitor: osdmap laggy set a maximum limit for interval (`pr#7109 `_, Zengran Zhang) * mon: osd [test-]reweight-by-{pg,utilization} command updates (`pr#7890 `_, Dan van der Ster, Sage Weil) * mon: paxos is_recovering calc error (`pr#7227 `_, Weijun Duan) * mon/PGMap: show rd/wr iops separately in status reports (`pr#7072 `_, Cilang Zhao) * mon: PGMonitor: acting primary diff with cur_stat, should not set pg to stale (`pr#7083 `_, Xiaowei Chen) * mon/PGMonitor: reliably mark PGs state (`pr#8089 `_, Sage Weil) * mon: PG Monitor should report waiting for backfill (`issue#12744 `_, `pr#7398 `_, Abhishek Lekshmanan) * mon/pgmonitor: use appropriate forced conversions in get_rule_avail (`pr#7705 `_, huanwen ren) * mon: reduce CPU and memory manager pressure of pg health check (`pr#7482 `_, Piotr Dałek) * mon: remove 'mds setmap' (`issue#15136 `_, `pr#8121 `_, Sage Weil) * mon: remove remove_legacy_versions() (`pr#8324 `_, Kefu Chai) * mon: remove unnecessary comment for update_from_paxos (`pr#8400 `_, Qinghua Jin) * mon: remove unused variable (`issue#15292 `_, `pr#8337 `_, Javier M. Mellid) * mon: revert MonitorDBStore's WholeStoreIteratorImpl::get (`issue#13742 `_, `pr#6522 `_, Piotr Dałek) * mon: should not set isvalid = true when cephx_verify_authorizer return false (`issue#13525 `_, `pr#6306 `_, Ruifeng Yang) * mon: show the pool quota info on ceph df detail command (`issue#14216 `_, `pr#7094 `_, song baisen) * mon: some cleanup in MonmapMonitor.cc (`pr#7418 `_, huanwen ren) * mon: standardize Ceph removal commands (`pr#7939 `_, Dongsheng Yang) * mon: support min_down_reporter by subtree level (default by host) (`pr#6709 `_, Xiaoxi Chen) * mon: unconfuse object count skew message (`pr#7882 `_, Piotr Dałek) * mon: unregister command on shutdown (`pr#7504 `_, huanwen ren) * mon: warn if pg(s) not scrubbed (`issue#13142 `_, `pr#6440 `_, Michal Jarzabek) * mount.ceph: memory leaks (`pr#6905 `_, Qiankun Zheng) * mount.fuse.ceph: better parsing of arguments passed to mount.fuse.ceph by mount command (`issue#14735 `_, `pr#7607 `_, Florent Bautista) * mrun: update path to cmake binaries (`pr#8447 `_, Casey Bodley) * msg: add override to virutal methods (`pr#6977 `_, Michal Jarzabek) * msg: add thread safety for "random" Messenger + fix wrong usage of random functions (`pr#7650 `_, Avner BenHanoch) * msg/async: AsyncConnection: avoid debug log in cleanup_handler (`pr#7547 `_, Haomai Wang) * msg/async: AsyncConnection: avoid is_connected require connection's lock (`issue#15440 `_, `pr#8520 `_, Haomai Wang) * msg/async: AsyncMessenger: fix several bugs (`pr#7831 `_, Haomai Wang) * msg/async: AsyncMessenger: fix valgrind leak (`pr#7725 `_, Haomai Wang) * msg/async: avoid log spam on throttle (`issue#15031 `_, `pr#8263 `_, Kefu Chai) * msg/async: bunch of fixes (`pr#7379 `_, Piotr Dałek) * msg/async: cleanup dead connection and misc things (`pr#7158 `_, Haomai Wang) * msg/async: don't calculate msg header crc when not needed (`pr#7815 `_, Piotr Dałek) * msg/async: don't use shared_ptr to manage EventCallback (`pr#7028 `_, Haomai Wang) * msg/async: Event: fix clock skew problem (`pr#7949 `_, Wei Jin) * msg/async: fix array boundary (`pr#7451 `_, Wei Jin) * msg: async: fix perf counter description and simplify _send_keepalive_or_ack (`pr#8046 `_, xie xingguo) * msg/async: fix potential race condition (`pr#7453 `_, Haomai Wang) * msg/async: fix send closed local_connection message problem (`pr#7255 `_, Haomai Wang) * msg/async: let receiver ack message ASAP (`pr#6478 `_, Haomai Wang) * msg/async: reduce extra tcp packet for message ack (`pr#7380 `_, Haomai Wang) * msg/async: remove experiment feature (`pr#7820 `_, Haomai Wang) * msg: async: small cleanups (`pr#7871 `_, xie xingguo) * msg/async: smarter MSG_MORE (`pr#7625 `_, Piotr Dałek) * msg: async: start over after failing to bind a port in specified range (`issue#14928 `_, `issue#13002 `_, `pr#7852 `_, xie xingguo) * msg/async: support of non-block connect in async messenger (`issue#12802 `_, `pr#5848 `_, Jianhui Yuan) * msg/async: _try_send trim already sent for outcoming_bl more efficient (`pr#7970 `_, Yan Jun) * msg/async: will crash if enabling async msg because of an assertion (`pr#6640 `_, Zhi Zhang) * msg: filter out lo addr when bind osd addr (`pr#7012 `_, Ji Chen) * msg: removed unneeded includes from Dispatcher (`pr#6814 `_, Michal Jarzabek) * msg: remove duplicated code - local_delivery will now call 'enqueue' (`pr#7948 `_, Avner BenHanoch) * msg: remove unneeded inline (`pr#6989 `_, Michal Jarzabek) * msgr: fix large message data content length causing overflow (`pr#6809 `_, Jun Huang, Haomai Wang) * msg: significantly reduce minimal memory usage of connections (`pr#7567 `_, Piotr Dałek) * msg/simple: pipe: memory leak when signature check failed (`pr#7096 `_, Ruifeng Yang) * msg/simple: remove unneeded friend declarations (`pr#6924 `_, Michal Jarzabek) * msg/xio: fix compilation (`pr#7479 `_, Roi Dayan) * msg/xio: fixes (`pr#7603 `_, Roi Dayan) * msg/xio: simple fixes (`pr#8555 `_, Avner BenHanoch) * mstart: start rgw on different ports as well (`pr#8167 `_, Abhishek Lekshmanan) * nfs for rgw (Matt Benjamin, Orit Wasserman) (`pr#7634 `_, Yehuda Sadeh, Matt Benjamin) * objecter: avoid recursive lock of Objecter::rwlock (`pr#7343 `_, Yan, Zheng) * organizationmap: modify org mail info. (`pr#7240 `_, Xiaowei Chen) * os/bluestore: a few fixes (`pr#8193 `_, Sage Weil) * os/bluestore/BlueFS: Before reap ioct, it should wait io complete (`pr#8178 `_, Jianpeng Ma) * os/bluestore/BlueStore: Don't leak trim overlay data before write. (`pr#7895 `_, Jianpeng Ma) * os/bluestore: ceph-bluefs-tool fixes (`issue#15261 `_, `pr#8292 `_, Venky Shankar) * os/bluestore: clone overlay data (`pr#7860 `_, Jianpeng Ma) * os/bluestore: fix assert (`issue#14436 `_, `pr#7293 `_, xie xingguo) * os/bluestore: fix a typo in SPDK path parsing (`pr#7601 `_, Jianjian Huo) * os/bluestore: fix bluestore_wal_transaction_t encoding test (`pr#7342 `_, Kefu Chai) * os/bluestore: fix bluestore_wal_transaction_t encoding test (`pr#7419 `_, Kefu Chai, Brad Hubbard) * os/bluestore: fix shortened bluefs paths in env mirror case (`pr#8498 `_, Sage Weil) * os/bluestore: insert new onode to the front position of onode LRU (`pr#7492 `_, Jianjian Huo) * os/bluestore/KernelDevice: force block size (`pr#8006 `_, Sage Weil) * os/bluestore: make bluestore_sync_transaction = true can work. (`pr#7674 `_, Jianpeng Ma) * os/bluestore/NVMEDevice: make IO thread using dpdk launch (`pr#8160 `_, Haomai Wang) * os/bluestore/NVMEDevice: refactor probe/attach codes and support zero command (`pr#7647 `_, Haomai Wang) * os/bluestore: revamp BlueFS bdev management and add perfcounters (`issue#15376 `_, `pr#8431 `_, Sage Weil) * os/bluestore: small fixes in bluestore StupidAllocator (`pr#8101 `_, Jianjian Huo) * os/bluestore: use intrusive_ptr for Dir (`pr#7247 `_, Igor Fedotov) * osd: add cache hint when pushing raw clone during recovery (`pr#7069 `_, Zhiqiang Wang) * osd: Add config option osd_read_ec_check_for_errors for testing (`pr#5865 `_, David Zafman) * osd: add missing newline to usage message (`pr#7613 `_, Willem Jan Withagen) * osd: add osd op queue latency perfcounter (`pr#5793 `_, Haomai Wang) * osd: add pin/unpin support to cache tier (11066) (`pr#6326 `_, Zhiqiang Wang) * osd: add 'proxy' cache mode (`issue#12814 `_, `pr#8210 `_, Sage Weil) * osd: add scrub persist/query API (`issue#13505 `_, `pr#6898 `_, Kefu Chai, Samuel Just) * osd: add the support of per pool scrub priority (`pr#7062 `_, Zhiqiang Wang) * osd: a fix for HeartbeatDispatcher and cleanups (`pr#7550 `_, Kefu Chai) * osd: Allow repair of history.last_epoch_started using config (`pr#6793 `_, David Zafman) * osd: auto repair EC pool (`issue#12754 `_, `pr#6196 `_, Guang Yang) * osd: avoid calculating crush mapping for most ops (`pr#6371 `_, Sage Weil) * osd: avoid debug std::string initialization in PG::get/put (`pr#7117 `_, Evgeniy Firsov) * osd: avoid double-check for replaying and can_checkpoint() in FileStore::_check_replay_guard (`pr#6471 `_, Ning Yao) * osd: avoid duplicate op->mark_started in ReplicatedBackend (`pr#6689 `_, Jacek J. Łakis) * osd: avoid FORCE updating digest been overwritten by MAYBE when comparing scrub map (`pr#7051 `_, Zhiqiang Wang) * osd: avoid osd_op_thread suicide because osd_scrub_sleep (`pr#7009 `_, Jianpeng Ma) * osd: bail out of _committed_osd_maps if we are shutting down (`pr#8267 `_, Samuel Just) * osd: blockdevice: avoid implicit cast and add guard (`pr#7460 `_, xie xingguo) * osd: bluefs: fix alignment for odd page sizes (`pr#7900 `_, Dan Mick) * osd: bluestore: add 'override' to virtual functions (`pr#7886 `_, Michal Jarzabek) * osd: bluestore: allow _dump_onode dynamic accept log level (`pr#7995 `_, Jianpeng Ma) * osd: bluestore/blockdevice: use std::mutex et al (`pr#7568 `_, Sage Weil) * osd: bluestore: bluefs: fix several small bugs (`issue#14344 `_, `issue#14343 `_, `pr#7200 `_, xie xingguo) * osd: bluestore/BlueFS: initialize super block_size earlier in mkfs (`pr#7535 `_, Sage Weil) * osd: bluestore: don't include when building without libaio (`issue#14207 `_, `pr#7169 `_, Mykola Golub) * osd: bluestore: fix bluestore onode_t attr leak (`pr#7125 `_, Ning Yao) * osd: bluestore: fix bluestore_wal_transaction_t encoding test (`pr#7168 `_, Kefu Chai) * osd: bluestore: fix check for write falling within the same extent (`issue#14954 `_, `pr#7892 `_, Jianpeng Ma) * osd: BlueStore: fix fsck and blockdevice read-relevant issue (`pr#7362 `_, xie xingguo) * osd: BlueStore: fix null pointer access (`issue#14561 `_, `pr#7435 `_, xie xingguo) * osd: bluestore: fix several bugs (`issue#14259 `_, `issue#14353 `_, `issue#14260 `_, `issue#14261 `_, `pr#7122 `_, xie xingguo) * osd: bluestore: fix space rebalancing, collection split, buffered reads (`pr#7196 `_, Sage Weil) * osd: bluestore: for overwrite a extent, allocate new extent on min_alloc_size write (`pr#7996 `_, Jianpeng Ma) * osd: bluestore: improve fs-type verification and tidy up (`pr#7651 `_, xie xingguo) * osd: bluestore, kstore: fix nid overwritten logic (`issue#14407 `_, `issue#14433 `_, `pr#7283 `_, xie xingguo) * osd: bluestore: misc fixes (`pr#7658 `_, Jianpeng Ma) * osd: bluestore: more fixes (`pr#7130 `_, Sage Weil) * osd: BlueStore/NVMEDevice: fix compiling and fd leak (`pr#7496 `_, xie xingguo) * osd: bluestore: NVMEDevice: fix error handling (`pr#7799 `_, xie xingguo) * osd: bluestore: remove unneeded includes (`pr#7870 `_, Michal Jarzabek) * osd: bluestore: Revert NVMEDevice task cstor and refresh interface changes (`pr#7729 `_, Haomai Wang) * osd: bluestore updates, scrub fixes (`pr#8035 `_, Sage Weil) * osd: bluestore: use btree_map for allocator (`pr#7269 `_, Igor Fedotov, Sage Weil) * osd: cache tier: add config option for eviction check list size (`pr#6997 `_, Yuan Zhou) * osd: call on_new_interval on newly split child PG (`issue#13962 `_, `pr#6778 `_, Sage Weil) * osd: cancel failure reports if we fail to rebind network (`pr#6278 `_, Xinze Chi) * osdc: Fix race condition with tick_event and shutdown (`issue#14256 `_, `pr#7151 `_, Adam C. Emerson) * osd: change mutex to spinlock to optimize thread context switch. (`pr#6492 `_, Xiaowei Chen) * osd: check do_shutdown before do_restart (`pr#6547 `_, Xiaoxi Chen) * osd: check health state before pre_booting (`issue#14181 `_, `pr#7053 `_, Xiaoxi Chen) * osd: clarify the scrub result report (`pr#6534 `_, Li Wang) * osd/ClassHandler: only dlclose() the classes not missing (`pr#8354 `_, Kefu Chai) * osd: clean up CMPXATTR checks (`pr#5961 `_, Jianpeng Ma) * osd: clean up temp object if copy-from fails (`pr#8487 `_, Sage Weil) * osd: clear pg_stat_queue after stopping pgs (`issue#14212 `_, `pr#7091 `_, Sage Weil) * osd: clear requeue_scrub_on_unblock flag after requeue scrub (`pr#8570 `_, xie xingguo) * osdc/Objecter: dout log after assign tid (`pr#8202 `_, Xinze Chi) * osdc/Objecter: fix narrow race with tid assignment (`issue#14364 `_, `pr#7981 `_, Sage Weil) * osdc/Objecter: use full pgid hash in PGNLS ops (`pr#8378 `_, Sage Weil) * osd: consider high/low mode when putting agent to sleep (`issue#14752 `_, `pr#7631 `_, Sage Weil) * osd: correctly handle small osd_scrub_interval_randomize_ratio (`pr#7147 `_, Samuel Just) * osdc/Striper: reduce assemble_result log level (`pr#8426 `_, Jason Dillaman) * osd: defer decoding of MOSDRepOp/MOSDRepOpReply (`pr#6503 `_, Xinze Chi) * osd: delay populating in-memory PG log hashmaps (`pr#6425 `_, Piotr Dałek) * osd: disable filestore_xfs_extsize by default (`issue#14397 `_, `pr#7265 `_, Ken Dreyer) * osd: do not keep ref of old osdmap in pg (`issue#13990 `_, `pr#7007 `_, Kefu Chai) * osd: don't do random deep scrubs for user initiated scrubs (`pr#6673 `_, David Zafman) * osd: don't update epoch and rollback_info objects attrs if there is no need (`pr#6555 `_, Ning Yao) * osd: drop deprecated removal pg type (`pr#6970 `_, Igor Podoski) * osd: drop fiemap len=0 logic (`pr#7267 `_, Sage Weil) * osd: drop the interim set from load_pgs() (`pr#6277 `_, Piotr Dałek) * osd: dump number of missing objects for each peer with pg query (`pr#6058 `_, Guang Yang) * osd: duplicated clear for peer_missing (`pr#8315 `_, Ning Yao) * osd: enable perfcounters on sharded work queue mutexes (`pr#6455 `_, Jacek J. Łakis) * osd: ensure new osdmaps commit before publishing them to pgs (`issue#15073 `_, `pr#8096 `_, Sage Weil) * osd: FileJournal: _fdump wrongly returns if journal is currently unreadable. (`issue#13626 `_, `pr#6406 `_, xie xingguo) * osd: FileJournal: fix return code of create method (`issue#14134 `_, `pr#6988 `_, xie xingguo) * osd: FileJournal: reduce locking scope in write_aio_bl (`issue#12789 `_, `pr#5670 `_, Zhi Zhang) * osd: filejournal: report journal entry count (`pr#7643 `_, tianqing) * osd: FileJournal: support batch peak and pop from writeq (`pr#6701 `_, Xinze Chi) * osd: FileStore: add a field indicate xattr only one chunk for set xattr. (`pr#6244 `_, Jianpeng Ma) * osd: FileStore: Added O_DSYNC write scheme (`pr#7752 `_, Somnath Roy) * osd: FileStore: add error check for object_map->sync() (`pr#7281 `_, Chendi Xue) * osd: FileStore: cleanup: remove obsolete option "filestore_xattr_use_omap" (`issue#14356 `_, `pr#7217 `_, Vikhyat Umrao) * osd: FileStore: conditional collection of drive metadata (`pr#6956 `_, Somnath Roy) * osd: filestore: FALLOC_FL_PUNCH_HOLE must be used with FALLOC_FL_KEEP_SIZE (`pr#7768 `_, xinxin shu) * osd: filestore: fast abort if statfs encounters ENOENT (`pr#7703 `_, xie xingguo) * osd: FileStore: fix initialization order for m_disable_wbthrottle (`pr#8067 `_, Samuel Just) * osd: filestore: fix race condition with split vs collection_move_rename and long object names (`issue#14766 `_, `pr#8136 `_, Samuel Just) * osd: filestore: fix result code overwritten for clone (`issue#14817 `_, `issue#14827 `_, `pr#7711 `_, xie xingguo) * osd: filestore: fix wrong scope of result code for error cases during mkfs (`issue#14814 `_, `pr#7704 `_, xie xingguo) * osd: filestore: fix wrong scope of result code for error cases during mount (`issue#14815 `_, `pr#7707 `_, xie xingguo) * osd: FileStore: LFNIndex: remove redundant local variable 'obj'. (`issue#13552 `_, `pr#6333 `_, xiexingguo) * osd: FileStore: modify the format of colon (`pr#7333 `_, Donghai Xu) * osd: FileStore:: optimize lfn_unlink (`pr#6649 `_, Jianpeng Ma) * osd: FileStore: potential memory leak if _fgetattrs fails (`issue#13597 `_, `pr#6377 `_, xie xingguo) * osd: FileStore: print file name before osd assert if read file failed (`pr#7111 `_, Ji Chen) * osd: FileStore: remove __SWORD_TYPE dependency (`pr#6263 `_, John Coyle) * osd: FileStore: remove unused local variable 'handle' (`pr#6381 `_, xie xingguo) * osd: filestore: restructure journal and op queue throttling (`pr#7767 `_, Samuel Just) * osd: FileStore: support multiple ondisk finish and apply finishers (`pr#6486 `_, Xinze Chi, Haomai Wang) * osd: FileStore: use pwritev instead of lseek+writev (`pr#7349 `_, Haomai Wang, Tao Chang) * osd: fix bogus scrub results when missing a clone (`issue#12738 `_, `issue#12740 `_, `pr#5783 `_, David Zafman) * osd: fix broken balance / localized read handling (`issue#13491 `_, `pr#6364 `_, Jason Dillaman) * osd: fix bug in last_* PG state timestamps (`pr#6517 `_, Li Wang) * osd: fix bugs for omap ops (`pr#8230 `_, Jianpeng Ma) * osd: fix ClassHandler::ClassData::get_filter() (`pr#6747 `_, Yan, Zheng) * osd: fix/clean up full map request handling (`pr#8446 `_, Sage Weil) * osd: fix debug message in OSD::is_healthy (`pr#6226 `_, Xiaoxi Chen) * osd: fix dirtying info without correctly setting drity_info field (`pr#8275 `_, xie xingguo) * osd: fix dump_ops_in_flight races (`issue#8885 `_, `pr#8044 `_, David Zafman) * osd: fix epoch check in handle_pg_create (`pr#8382 `_, Samuel Just) * osd: fixes for several cases where op result code was not checked or set (`issue#13566 `_, `pr#6347 `_, xie xingguo) * osd: fix failure report handling during ms_handle_connect() (`pr#8348 `_, xie xingguo) * osd: fix FileStore::_destroy_collection error return code (`pr#6612 `_, Ruifeng Yang) * osd: fix forced prmootion for CALL ops (`issue#14745 `_, `pr#7617 `_, Sage Weil) * osd: fix fusestore hanging during stop/quit (`issue#14786 `_, `pr#7677 `_, xie xingguo) * osd: fix inaccurate counter and skip over queueing an empty transaction (`pr#7754 `_, xie xingguo) * osd: fix incorrect throttle in WBThrottle (`pr#6713 `_, Zhang Huan) * osd: fix invalid list traversal in process_copy_chunk (`pr#7511 `_, Samuel Just) * osd: fix lack of object unblock when flush fails (`issue#14511 `_, `pr#7584 `_, Igor Fedotov) * osd: fix log info (`pr#8273 `_, Wei Jin) * osd: fix MOSDOp encoding (`pr#6174 `_, Sage Weil) * osd: fix MOSDRepScrub reference counter in replica_scrub (`pr#6730 `_, Jie Wang) * osd: fix null pointer access and race condition (`issue#14072 `_, `pr#6916 `_, xie xingguo) * osd: fix overload of '==' operator for pg_stat_t (`issue#14921 `_, `pr#7842 `_, xie xingguo) * osd: fix race condition for heartbeat_need_update (`issue#14387 `_, `pr#7739 `_, xie xingguo) * osd: fix reactivate (check OSDSuperblock in mkfs() when we already have the superblock) (`issue#13586 `_, `pr#6385 `_, Vicente Cheng) * osd: fix reference count, rare race condition etc. (`pr#8254 `_, xie xingguo) * osd: fix return value from maybe_handle_cache_detail() (`pr#7593 `_, Igor Fedotov) * osd: fix rollback_info_trimmed_to before index() (`issue#13965 `_, `pr#6801 `_, Samuel Just) * osd: fix scrub start hobject (`pr#7467 `_, Sage Weil) * osd: fix sparse-read result code checking logic (`issue#14151 `_, `pr#7016 `_, xie xingguo) * osd: fix temp object removal after upgrade (`issue#13862 `_, `pr#6976 `_, David Zafman) * osd: fix tick relevant issues (`pr#8369 `_, xie xingguo) * osd: fix trivial scrub bug (`pr#6533 `_, Li Wang) * osd: fix two scrub relevant issues (`pr#8462 `_, xie xingguo) * osd: fix unnecessary object promotion when deleting from cache pool (`issue#13894 `_, `pr#7537 `_, Igor Fedotov) * osd: fix watch reconnect race (`issue#15441 `_, `pr#8586 `_, Sage Weil) * osd: fix wip (l_osd_op_wip) perf counter and remove repop_map (`pr#7077 `_, Xinze Chi) * osd: fix wrongly placed assert and some cleanups (`pr#6766 `_, xiexingguo, xie xingguo) * osd: fix wrong return type of find_osd_on_ip() (`issue#14872 `_, `pr#7812 `_, xie xingguo) * osd: fix wrong use of right parenthesis in localized read logic (`pr#6566 `_, Jie Wang) * osd: ghobject_t: use ! instead of @ as a separator (`pr#7595 `_, Sage Weil) * osd: handle dup pg_create that races with pg deletion (`pr#8033 `_, Sage Weil) * osd: improve temperature calculation for cache tier agent (`pr#4737 `_, MingXin Liu) * osd: initialize last_recalibrate field at construction (`pr#8071 `_, xie xingguo) * osd: init started to 0 (`issue#13206 `_, `pr#6107 `_, Sage Weil) * osd: KeyValueStore: don't queue NULL context (`pr#6783 `_, Haomai Wang) * osd: KeyValueStore: fix return code of mkfs (`pr#7036 `_, xie xingguo) * osd: KeyValueStore: fix the name's typo of keyvaluestore_default_strip_size (`pr#6375 `_, Zhi Zhang) * osd: KeyValueStore: fix wrongly placed assert (`issue#14176 `_, `issue#14178 `_, `pr#7047 `_, xie xingguo) * osd: kstore: fix a race condition in _txc_finish() (`pr#7804 `_, Jianjian Huo) * osd: kstore: latency breakdown (`pr#7850 `_, James Liu) * osd: kstore: several small fixes (`issue#14351 `_, `issue#14352 `_, `pr#7213 `_, xie xingguo) * osd: kstore: small fixes to kstore (`issue#14204 `_, `pr#7095 `_, xie xingguo) * osd: kstore: sync up kstore with recent bluestore updates (`pr#7681 `_, Jianjian Huo) * osd: make backend and block device code a bit more generic (`pr#6759 `_, Sage Weil) * osd: make list_missing query missing_loc.needs_recovery_map (`pr#6298 `_, Guang Yang) * osdmap: remove unused local variables (`pr#6864 `_, luo kexue) * osdmap: rm nonused variable (`pr#8423 `_, Wei Jin) * osd: memstore: fix alignment of Page for test_pageset (`pr#7587 `_, Casey Bodley) * osd: memstore: fix two bugs (`pr#6963 `_, Casey Bodley, Sage Weil) * osd: merge local_t and op_t txn to single one (`pr#6439 `_, Xinze Chi) * osd: min_write_recency_for_promote & min_read_recency_for_promote are tiering only (`pr#8081 `_, huanwen ren) * osd: misc FileStore fixes (`issue#14192 `_, `issue#14188 `_, `issue#14194 `_, `issue#14187 `_, `issue#14186 `_, `pr#7059 `_, xie xingguo) * osd: misc optimization for map utilization (`pr#6950 `_, Ning Yao) * osd, mon: fix exit issue (`pr#7420 `_, Jiaying Ren) * osd,mon: log leveldb and rocksdb to ceph log (`pr#6921 `_, Sage Weil) * osd: more fixes for incorrectly dirtying info; resend reply for duplicated scrub-reserve req (`pr#8291 `_, xie xingguo) * osd: move newest decode version of MOSDOp and MOSDOpReply to the front (`pr#6642 `_, Jacek J. Łakis) * osd: new and delete ObjectStore::Transaction in a function is not necessary (`pr#6299 `_, Ruifeng Yang) * osd: newstore: misc updates (including kv and os/fs stuff) (`pr#6609 `_, Sage Weil) * osd: note down the number of missing clones (`pr#6654 `_, Kefu Chai) * osd: Omap small bugs adapted (`pr#6669 `_, Jianpeng Ma, David Zafman) * osd: optimize clone write path if object-map is enabled (`pr#6403 `_, xinxin shu) * osd: optimize get_object_context (`pr#6305 `_, Jianpeng Ma) * osd: optimize MOSDOp/do_op/handle_op (`pr#5211 `_, Jacek J. Lakis) * osd: optimize scrub subset_last_update calculation (`pr#6518 `_, Li Wang) * osd: optimize the session_handle_reset function (`issue#14182 `_, `pr#7054 `_, songbaisen) * osd: os/chain_xattr: On linux use linux/limits.h for XATTR_NAME_MAX. (`pr#6343 `_, John Coyle) * osd/OSD.cc: finish full_map_request every MOSDMap message. (`issue#15130 `_, `pr#8147 `_, Xiaoxi Chen) * osd/OSD: fix build_past_intervals_parallel (`pr#8215 `_, David Zafman) * osd/OSDMap: fix typo in summarize_mapping_stats (`pr#8088 `_, Sage Weil) * osd: OSDMap: reset osd_primary_affinity shared_ptr when deepish_copy_from (`issue#14686 `_, `pr#7553 `_, Xinze Chi) * osd: OSDService: Fix typo in osdmap comment (`pr#7275 `_, Brad Hubbard) * osd: os: skip checking pg_meta object existance in FileStore (`pr#6870 `_, Ning Yao) * osd: partial revert of "ReplicatedPG: result code not correctly set in some cases." (`issue#13796 `_, `pr#6622 `_, Sage Weil) * osd: PG::activate(): handle unexpected cached_removed_snaps more gracefully (`issue#14428 `_, `pr#7309 `_, Alexey Sheplyakov) * osd/PG: indicate in pg query output whether ignore_history_les would help (`pr#8156 `_, Sage Weil) * osd: PGLog: clean up read_log (`pr#7092 `_, Jie Wang) * osd/PGLog: fix warning (`pr#8057 `_, Sage Weil) * osd/PG: pg down state blocked by osd.x, lost osd.x cannot solve peering stuck (`issue#13531 `_, `pr#6317 `_, Xiaowei Chen) * osd: pg_pool_t: add dictionary for pool options (`issue#13077 `_, `pr#6081 `_, Mykola Golub) * osd: PG: set epoch_created and parent_split_bits for child pg (`issue#15426 `_, `pr#8552 `_, Kefu Chai) * osd: prevent osd_recovery_sleep from causing recovery-thread suicide (`pr#7065 `_, Jianpeng Ma) * osd: probabilistic cache tier promotion throttling (`pr#7465 `_, Sage Weil) * osd: randomize deep scrubbing (`pr#6550 `_, Dan van der Ster, Herve Rousseau) * osd: reduce memory consumption of some structs (`pr#6475 `_, Piotr Dałek) * osd: reduce string use in coll_t::calc_str() (`pr#6505 `_, Igor Podoski) * osd: refuse to start if object name limits are not possible (`pr#8560 `_, Sage Weil) * osd: release related sources when scrub is interrupted (`pr#6744 `_, Jianpeng Ma) * osd: release the message throttle when OpRequest unregistered (`issue#14248 `_, `pr#7148 `_, Samuel Just) * osd: remove __SWORD_TYPE dependency (`pr#6262 `_, John Coyle) * osd: remove unused OSDMap::set_weightf() (`issue#14369 `_, `pr#7231 `_, huanwen ren) * osd: remove up_thru_pending field, which is never used (`pr#7991 `_, xie xingguo) * osd: reorder bool fields in PGLog struct (`pr#6279 `_, Piotr Dałek) * osd: Replace snprintf with faster implementation in eversion_t::get_key_name (`pr#7121 `_, Evgeniy Firsov) * osd/ReplicatedPG: be more careful about calling publish_stats_to_osd() (`issue#14962 `_, `pr#8039 `_, Greg Farnum) * osd: replicatedpg: break out loop if we encounter fatal error during do_pg_op() (`issue#14922 `_, `pr#7844 `_, xie xingguo) * osd: ReplicatedPG: clean up unused function (`pr#7211 `_, Xiaowei Chen) * osd/ReplicatedPG: clear watches on change after applying repops (`issue#15151 `_, `pr#8163 `_, Sage Weil) * osd/ReplicatedPG: fix implementation of register_on_success() (`pr#8612 `_, xie xingguo) * osd/ReplicatedPG: fix promotion recency logic (`issue#14320 `_, `pr#6702 `_, Sage Weil) * osd/ReplicatedPG: make handle_watch_timeout no-op if !active (`issue#15391 `_, `pr#8505 `_, Sage Weil) * osd: ReplicatedPG: remove unused local variables (`issue#13575 `_, `pr#6360 `_, xiexingguo) * osd/ReplicatedPG::_rollback_to: update the OMAP flag (`issue#14777 `_, `pr#8495 `_, Samuel Just) * osd: repop and lost-unfound overhaul (`pr#7765 `_, Samuel Just) * osd: reset primary and up_primary when building a new past_interval. (`issue#13471 `_, `pr#6240 `_, xiexingguo) * osd: resolve boot vs NOUP set + clear race (`pr#7483 `_, Sage Weil) * osd: scrub: do not assign value if read error (`pr#6568 `_, Li Wang) * osd/ScrubStore: remove unused function (`pr#8045 `_, Kefu Chai) * osd: several small cleanups (`pr#7055 `_, xie xingguo) * osd: shut down if we flap too many times in a short period (`pr#6708 `_, Xiaoxi Chen) * osd: skip promote for writefull w/ FADVISE_DONTNEED/NOCACHE (`pr#7010 `_, Jianpeng Ma) * osd: slightly reduce actual size of pg_log_entry_t (`pr#6690 `_, Piotr Dałek) * osd: small fixes to memstore (`issue#14228 `_, `issue#14229 `_, `issue#14227 `_, `pr#7107 `_, xie xingguo) * osd: some debug output for old osdmap trimming (`pr#8613 `_, Sage Weil) * osd: support pool level recovery_priority and recovery_op_priority (`pr#5953 `_, Guang Yang) * osd: try evicting after flushing is done (`pr#5630 `_, Zhiqiang Wang) * osd: use atomic to generate ceph_tid (`pr#7017 `_, Evgeniy Firsov) * osd: use optimized is_zero in object_stat_sum_t.is_zero() (`pr#7203 `_, Piotr Dałek) * osd: use pg id (without shard) when referring the PG (`pr#6236 `_, Guang Yang) * osd: utime_t, eversion_t, osd_stat_sum_t encoding optimization (`pr#6902 `_, Xinze Chi) * osd: WeightedPriorityQueue: move to intrusive containers (`pr#7654 `_, Robert LeBlanc) * osd: write file journal optimization (`pr#6484 `_, Xinze Chi) * os/filestore/FileJournal: set block size via config option (`pr#7628 `_, Sage Weil) * os/filestore: fix punch hole usage in _zero (`pr#8050 `_, Sage Weil) * os/filestore: fix result handling logic of destroy_collection (`pr#7721 `_, xie xingguo) * os/filestore: fix return type mismatch for lfn parse (`pr#8545 `_, xie xingguo) * os/filestore: force lfn attrs to be written atomically, restructure name length limits (`pr#8496 `_, Samuel Just) * os/filestore: require offset == length == 0 for full object read; add test (`pr#7957 `_, Jianpeng Ma) * os/fs: fix io_getevents argument (`pr#7355 `_, Jingkai Yuan) * os/fusestore: add error handling (`pr#7395 `_, xie xingguo) * os/keyvaluestore: kill KeyValueStore (`pr#7320 `_, Haomai Wang) * os/kstore: insert new onode to the front position of onode LRU (`pr#7505 `_, xie xingguo) * os/ObjectStore: add custom move operations for ObjectStore::Transaction (`pr#7303 `_, Casey Bodley) * os/ObjectStore: add noexcept to ensure move ctor is used (`pr#8421 `_, Kefu Chai) * os/ObjectStore: fix _update_op for split dest_cid (`pr#8364 `_, Sage Weil) * os/ObjectStore: implement more efficient get_encoded_bytes() (`pr#7775 `_, Piotr Dałek) * os/ObjectStore: make device uuid probe output something friendly (`pr#8418 `_, Sage Weil) * os/ObjectStore: try_move_rename in transaction append and add coverage to store_test (`issue#15205 `_, `pr#8359 `_, Samuel Just) * packaging: add build dependency on python devel package (`pr#7205 `_, Josh Durgin) * packaging: make infernalis -> jewel upgrade work (`issue#15047 `_, `pr#8034 `_, Nathan Cutler) * packaging: move cephfs repair tools to ceph-common (`issue#15145 `_, `pr#8133 `_, Boris Ranto, Ken Dreyer) * pybind: add ceph_volume_client interface for Manila and similar frameworks (`pr#6205 `_, John Spray) * pybind: add flock to libcephfs python bindings (`pr#7902 `_, John Spray) * pybind/cephfs: add symlink and its unit test (`pr#6323 `_, Shang Ding) * pybind: decode empty string in conf_parse_argv() correctly (`pr#6711 `_, Josh Durgin) * pybind: Ensure correct python flags are passed (`pr#7663 `_, James Page) * pybind: fix build failure, remove extraneous semicolon in method (`issue#14371 `_, `pr#7235 `_, Abhishek Lekshmanan) * pybind: flag an RBD image as closed regardless of result code (`pr#8005 `_, Jason Dillaman) * pybind: Implementation of rados_ioctx_snapshot_rollback (`pr#6878 `_, Florent Manens) * pybind/Makefile.am: Prevent race creating CYTHON_BUILD_DIR (`issue#15276 `_, `pr#8356 `_, Dan Mick) * pybind: move cephfs to Cython (`pr#7745 `_, John Spray, Mehdi Abaakouk) * pybind: port the rbd bindings to Cython (`issue#13115 `_, `pr#6768 `_, Hector Martin) * pybind/rados: fix object lifetime issues and other bugs in aio (`pr#7778 `_, Hector Martin) * pybind/rados: fix omap method return values (`pr#8603 `_, Roland Mechler) * pybind/rados: python3 fix (`pr#8331 `_, Mehdi Abaakouk) * pybind/rados: use __dealloc__ since __del__ is ignored by cython (`pr#7692 `_, Mehdi Abaakouk) * pybind: remove next() on iterators (`pr#7706 `_, Mehdi Abaakouk) * pybind: replace __del__ with __dealloc__ for rbd (`pr#7708 `_, Josh Durgin) * pybind: support ioctx:exec (`pr#6795 `_, Noah Watkins) * pybind/test_rbd: fix test_create_defaults (`issue#14279 `_, `pr#7155 `_, Josh Durgin) * pybind: use correct subdir for rados install-exec rule (`pr#7684 `_, Josh Durgin) * python binding of librados with cython (`pr#7621 `_, Mehdi Abaakouk) * python: use pip instead of python setup.py (`pr#7605 `_, Loic Dachary) * qa: add workunit to run ceph_test_rbd_mirror (`pr#8221 `_, Josh Durgin) * qa: disable rbd/qemu-iotests test case 055 on RHEL/CentOSlibrbd: journal replay should honor inter-event dependencies (`issue#14385 `_, `pr#7272 `_, Jason Dillaman) * qa: erasure-code benchmark plugin selection (`pr#6685 `_, Loic Dachary) * qa/krbd: Expunge generic/247 (`pr#6831 `_, Douglas Fuller) * qa: update rest test cephfs calls (`issue#15309 `_, `pr#8372 `_, John Spray) * qa: update rest test cephfs calls (part 2) (`issue#15309 `_, `pr#8393 `_, John Spray) * qa/workunits/cephtool/test.sh: false positive fail on /tmp/obj1. (`pr#6837 `_, Robin H. Johnson) * qa/workunits/cephtool/test.sh: no ./ (`pr#6748 `_, Sage Weil) * qa/workunits/cephtool/test.sh: wait longer in ceph_watch_start() (`issue#14910 `_, `pr#7861 `_, Kefu Chai) * qa/workunits: merge_diff shouldn't attempt to use striping (`issue#14165 `_, `pr#7041 `_, Jason Dillaman) * qa/workunits/rados: Test exit values on test.sh, fix tier.cc (`issue#15165 `_, `pr#8266 `_, Samuel Just) * qa/workunits/rados/test.sh: capture stderr too (`pr#8004 `_, Sage Weil) * qa/workunits/rados/test.sh: test tmap_migrate (`pr#8114 `_, Sage Weil) * qa/workunits/rbd: do not use object map during read flag testing (`pr#8104 `_, Jason Dillaman) * qa/workunits/rbd: new online maintenance op tests (`pr#8216 `_, Jason Dillaman) * qa/workunits/rbd: rbd-nbd test should use sudo for map/unmap ops (`issue#14221 `_, `pr#7101 `_, Jason Dillaman) * qa/workunits/rbd: use POSIX function definition (`issue#15104 `_, `pr#8068 `_, Nathan Cutler) * qa/workunits/rest/test.py: add confirmation to 'mds setmap' (`issue#14606 `_, `pr#7982 `_, Sage Weil) * qa/workunits/rest/test.py: don't use newfs (`pr#8191 `_, Sage Weil) * qa/workunits/snaps: move snap tests into fs sub-directory (`pr#6496 `_, Yan, Zheng) * rados: add ceph:: namespace to bufferlist type (`pr#8059 `_, Noah Watkins) * rados: bench: fix off-by-one to avoid writing past object_size (`pr#6677 `_, Tao Chang) * rados: fix bug for write bench (`pr#7851 `_, James Liu) * radosgw-admin: allow 'period pull --url' without full multisite config (`pr#8567 `_, Casey Bodley) * radosgw-admin: allow (`pr#8529 `_, Orit Wasserman) * radosgw-admin: Checking the legality of the parameters (`issue#13018 `_, `pr#5879 `_, Qiankun Zheng) * radosgw-admin: Create --secret-key alias for --secret (`issue#5821 `_, `pr#5335 `_, Yuan Zhou) * radosgw-admin: fix for 'realm pull' (`pr#8404 `_, Casey Bodley) * radosgw-admin: metadata list user should return an empty list when user pool is empty (`issue#13596 `_, `pr#6465 `_, Orit Wasserman) * radosgw-admin: 'period commit' supplies user-readable error messages (`pr#8264 `_, Casey Bodley) * rados: implement rm --force option to force remove when full (`pr#6202 `_, Xiaowei Chen) * rados: new options for write benchmark (`pr#6340 `_, Joaquim Rocha) * rbd: accept --user, refuse -i command-line optionals (`pr#6590 `_, Ilya Dryomov) * rbd: additional validation for striping parameters (`pr#6914 `_, Na Xie) * rbd: add missing command aliases to refactored CLI (`issue#13806 `_, `pr#6606 `_, Jason Dillaman) * rbd: add --object-size option, deprecate --order (`issue#12112 `_, `pr#6830 `_, Vikhyat Umrao) * rbd: add pool name to disambiguate rbd admin socket commands (`pr#6904 `_, wuxiangwei) * rbd: add RBD pool mirroring configuration API + CLI (`pr#6129 `_, Jason Dillaman) * rbd: add support for mirror image promotion/demotion/resync (`pr#8138 `_, Jason Dillaman) * rbd: allow librados to prune the command-line for config overrides (`issue#15250 `_, `pr#8282 `_, Jason Dillaman) * rbd: clone operation should default to image format 2 (`pr#8119 `_, Jason Dillaman) * rbd: correct an output string for merge-diff (`pr#7046 `_, Kongming Wu) * rbd: deprecate image format 1 (`pr#7841 `_, Jason Dillaman) * rbd: dynamically generated bash completion (`issue#13494 `_, `pr#6316 `_, Jason Dillaman) * rbd: fix build with "--without-rbd" (`issue#14058 `_, `pr#6899 `_, Piotr Dałek) * rbd: fix clone isssue (`issue#13553 `_, `pr#6334 `_, xinxin shu) * rbd: fixes for refactored CLI and related tests (`pr#6738 `_, Ilya Dryomov) * rbd: fix init-rbdmap CMDPARAMS (`issue#13214 `_, `pr#6109 `_, Sage Weil) * rbd: fix static initialization ordering issues (`pr#6978 `_, Mykola Golub) * rbd-fuse: image name can not include snap name (`pr#7044 `_, Yongqiang He) * rbd-fuse: implement mv operation (`pr#6938 `_, wuxiangwei) * rbd: journal: configuration via conf, cli, api and some fixes (`pr#6665 `_, Mykola Golub) * rbd: journal reset should disable/re-enable journaling feature (`issue#15097 `_, `pr#8490 `_, Jason Dillaman) * rbd: loosen image spec validation requirements (`pr#8606 `_, Jason Dillaman) * rbd: make config changes actually apply (`pr#6520 `_, Mykola Golub) * rbdmap: add manpage (`issue#15212 `_, `pr#8224 `_, Nathan Cutler) * rbdmap: systemd support (`issue#13374 `_, `pr#6479 `_, Boris Ranto) * rbd: merge_diff test should use new --object-size parameter instead of --order (`issue#14106 `_, `pr#6972 `_, Na Xie, Jason Dillaman) * rbd-mirror: asok commands to get status and flush on Mirror and Replayer level (`pr#8235 `_, Mykola Golub) * rbd-mirror: enabling/disabling pool mirroring should update the mirroring directory (`issue#15217 `_, `pr#8261 `_, Ricardo Dias) * rbd-mirror: fix image replay test failures (`pr#8158 `_, Jason Dillaman) * rbd-mirror: fix long termination due to 30sec wait in main loop (`pr#8185 `_, Mykola Golub) * rbd-mirror: fix missing increment for iterators (`pr#8352 `_, runsisi) * rbd-mirror: ImageReplayer async start/stop (`pr#7944 `_, Mykola Golub) * rbd-mirror: ImageReplayer improvements (`pr#7759 `_, Mykola Golub) * rbd-mirror: implement ImageReplayer (`pr#7614 `_, Mykola Golub) * rbd-mirror: initial failover / failback support (`pr#8287 `_, Jason Dillaman) * rbd-mirror: integrate with image sync state machine (`pr#8079 `_, Jason Dillaman) * rbd-mirror: make remote context respect env and argv config params (`pr#8182 `_, Mykola Golub) * rbd-mirror: minor fix-ups for initial skeleton implementation (`pr#7958 `_, Mykola Golub) * rbd-mirror: prevent enabling/disabling an image's mirroring when not in image mode (`issue#15267 `_, `pr#8332 `_, Ricardo Dias) * rbd-mirror: remote to local cluster image sync (`pr#7979 `_, Jason Dillaman) * rbd-mirror: switch fsid over to mirror uuid (`issue#15238 `_, `pr#8280 `_, Ricardo Dias) * rbd-mirror: use pool/image names in asok commands (`pr#8159 `_, Mykola Golub) * rbd-mirror: use the mirroring directory to detect candidate images (`issue#15142 `_, `pr#8162 `_, Ricardo Dias) * rbd-mirror: workaround for intermingled lockdep singletons (`pr#8476 `_, Jason Dillaman) * rbd: must specify both of stripe-unit and stripe-count when specifying stripingv2 feature (`pr#7026 `_, Donghai Xu) * rbd-nbd: add copyright (`pr#7166 `_, Li Wang) * rbd-nbd: fix up return code handling (`pr#7215 `_, Mykola Golub) * rbd-nbd: network block device (NBD) support for RBD (`pr#6657 `_, Yunchuan Wen, Li Wang) * rbd-nbd: small improvements in logging and forking (`pr#7127 `_, Mykola Golub) * rbd: output formatter may not be closed upon error (`issue#13711 `_, `pr#6706 `_, xie xingguo) * rbd: rbdmap improvements (`pr#6445 `_, Boris Ranto) * rbd: rbd order will be place in 22, when set to 0 in the config_opt (`issue#14139 `_, `issue#14047 `_, `pr#6886 `_, huanwen ren) * rbd: refactor cli command handling (`pr#5987 `_, Jason Dillaman) * rbd/run_cli_tests.sh: Reflect test failures (`issue#14825 `_, `pr#7781 `_, Zack Cerza) * rbd: stripe unit/count set incorrectly from config (`pr#6593 `_, Mykola Golub) * rbd: striping parameters should support 64bit integers (`pr#6942 `_, Na Xie) * rbd: support for enabling/disabling mirroring on specific images (`issue#13296 `_, `pr#8056 `_, Ricardo Dias) * rbd: support negative boolean command-line optionals (`issue#13784 `_, `pr#6607 `_, Jason Dillaman) * rbd: unbreak rbd map + cephx_sign_messages option (`pr#6583 `_, Ilya Dryomov) * rbd: update default image features (`pr#7846 `_, Jason Dillaman) * rbd: use default order from configuration when not specified (`pr#6965 `_, Yunchuan Wen) * release-notes: draft v0.94.4 release notes (`pr#5907 `_, Loic Dachary) * release-notes: draft v0.94.4 release notes (`pr#6195 `_, Loic Dachary) * release-notes: draft v0.94.4 release notes (`pr#6238 `_, Loic Dachary) * release-notes: draft v0.94.6 release notes (`issue#13356 `_, `pr#7689 `_, Abhishek Varshney, Loic Dachary) * release-notes: draft v10.0.3 release notes (`pr#7592 `_, Loic Dachary) * release-notes: draft v10.0.4 release notes (`pr#7966 `_, Loic Dachary) * release-notes: draft v9.2.1 release notes (`issue#13750 `_, `pr#7694 `_, Abhishek Varshney) * release-notes: v10.1.2 release notes (`pr#8594 `_, Loic Dachary) * releases: what is merged where and when ? (`pr#8358 `_, Loic Dachary) * rgw: accept data only at the first time in response to a request (`pr#8084 `_, sunspot) * rgw: add a few more help options in admin interface (`pr#8410 `_, Abhishek Lekshmanan) * rgw: add a method to purge all associate keys when removing a subuser (`issue#12890 `_, `pr#6002 `_, Sangdi Xu) * rgw: add a missing cap type (`pr#6774 `_, Yehuda Sadeh) * rgw: add an inspection to the field of type when assigning user caps (`pr#6051 `_, Kongming Wu) * rgw: Add a test for multi-tenancy (`pr#8592 `_, Pete Zaitcev) * rgw: add bucket request payment feature usage statistics integration (`issue#13834 `_, `pr#6656 `_, Javier M. Mellid) * rgw: add compat header for TEMP_FAILURE_RETRY (`pr#6294 `_, John Coyle) * rgw: add default quota config (`pr#6400 `_, Daniel Gryniewicz) * rgw: add LifeCycle feature (`pr#6331 `_, Ji Chen) * rgw: add missing error code for admin op API (`pr#7037 `_, Dunrong Huang) * rgw: adds the radosgw-admin sync status command that gives a human readable status of the sync process at a specific zone (`pr#8030 `_, Yehuda Sadeh) * rgw: add support for caching of Keystone admin token. (`pr#7630 `_, Radoslaw Zarzynski) * rgw: add support for "end_marker" parameter for GET on Swift account. (`issue#10682 `_, `pr#4216 `_, Radoslaw Zarzynski) * rgw: add support for getting Swift's DLO without manifest handling (`pr#6206 `_, Radoslaw Zarzynski) * rgw: add support for metadata upload during PUT on Swift container. (`pr#8002 `_, Radoslaw Zarzynski) * rgw: add support for Static Large Objects of Swift API (`issue#12886 `_, `issue#13452 `_, `pr#6643 `_, Yehuda Sadeh, Radoslaw Zarzynski) * rgw: add support for system requests over Swift API (`pr#7666 `_, Radoslaw Zarzynski) * rgw: add zone delete to rgw-admin help (`pr#8184 `_, Abhishek Lekshmanan) * rgw: adjust error code when bucket does not exist in copy operation (`issue#14975 `_, `pr#7916 `_, Yehuda Sadeh) * rgw: adjust the request_uri to support absoluteURI of http request (`issue#12917 `_, `pr#7675 `_, Wenjun Huang) * rgw: admin api for retrieving usage info (Ji Chen) (`pr#8031 `_, Yehuda Sadeh, Ji Chen) * rgw_admin: improve period update errors (`issue#15251 `_, `pr#8564 `_, Abhishek Lekshmanan) * rgw_admin: orphans finish segfaults (`pr#6652 `_, Igor Fedotov) * rgw-admin: remove unused iterator and fix error message (`pr#8507 `_, Karol Mroz) * rgw_admin: remove unused parent_period arg (`pr#8411 `_, Abhishek Lekshmanan) * rgw: Allow an implicit tenant in case of Keystone (`pr#8139 `_, Pete Zaitcev) * rgw: allow authentication keystone with self signed certs (`issue#14853 `_, `issue#13422 `_, `pr#7777 `_, Abhishek Lekshmanan) * rgw: approximate AmazonS3 HostId error field. (`pr#7444 `_, Robin H. Johnson) * rgw: aws4: handle UNSIGNED-PAYLOAD under header auth (`issue#15499 `_, `pr#8601 `_, Javier M. Mellid) * rgw: aws4 subdomain calling bugfix (`issue#15369 `_, `pr#8472 `_, Javier M. Mellid) * rgw: bucket link now set the bucket.instance acl (bug fix) (`issue#11076 `_, `pr#8037 `_, Zengran Zhang) * rgw: bucket request payment support (`issue#13427 `_, `pr#6214 `_, Javier M. Mellid) * rgw: Bug fix for mtime anomalies in RadosGW and other places (`pr#7328 `_, Adam C. Emerson, Casey Bodley) * rgw: build-related fixes (`pr#8076 `_, Yehuda Sadeh, Matt Benjamin) * rgw: calculate payload hash in RGWPutObj_ObjStore only when necessary. (`pr#7869 `_, Radoslaw Zarzynski) * rgw: call rgw_log_usage_finalize() on reconfiguration (`pr#8585 `_, Casey Bodley) * rgw: Check return code in RGWFileHandle::write (`pr#7875 `_, Brad Hubbard) * rgw: check the return value when call fe->run() (`issue#14585 `_, `pr#7457 `_, wei qiaomiao) * rgw: clarify the error message when trying to create an existed user (`pr#5938 `_, Zeqiang Zhuang) * rgw: cleanups to comments and messages (`pr#7633 `_, Pete Zaitcev) * rgw: content length (`issue#13582 `_, `pr#6975 `_, Yehuda Sadeh) * rgw: convert plain object to versioned (with null version) when removing (`issue#15243 `_, `pr#8268 `_, Yehuda Sadeh) * rgw: delete default zone (`pr#7005 `_, YankunLi) * rgw: do not abort radowgw server when using admin op API with bad parameters (`issue#14190 `_, `issue#14191 `_, `pr#7063 `_, Dunrong Huang) * rgw: Do not send a Content-Type on a '304 Not Modified' response (`issue#15119 `_, `pr#8253 `_, Wido den Hollander) * rgw: don't use s->bucket for metadata api path entry (`issue#14549 `_, `pr#7408 `_, Yehuda Sadeh) * rgw: Drop a debugging message (`pr#7280 `_, Pete Zaitcev) * rgw: drop permissions of rgw/civetweb after startup (`issue#13600 `_, `pr#8019 `_, Karol Mroz) * rgw: Drop unused usage_exit from rgw_admin.cc (`pr#7632 `_, Pete Zaitcev) * rgw: Ensure xmlns is consistent on S3 responses. (`pr#8526 `_, Robin H. Johnson) * rgw: extend rgw_extended_http_attrs to affect Swift accounts and containers as well (`pr#5969 `_, Radoslaw Zarzynski) * rgw: fcgi should include acconfig (`pr#7760 `_, Abhishek Lekshmanan) * rgw_file: set owner uid, gid, and Unix mode on new objects (`pr#8321 `_, Matt Benjamin) * rgw: file unix (`pr#8563 `_, Matt Benjamin) * rgw: fix a glaring syntax error (`pr#6888 `_, Pavan Rallabhandi) * rgw: fix a typo in error message (`pr#8434 `_, Abhishek Lekshmanan) * rgw: fix a typo in init-radosgw (`pr#6817 `_, Zhi Zhang) * rgw: fix binfo_cache, RGWShardCollectCR (`pr#8619 `_, Yehuda Sadeh) * rgw: fix compilation warning (`pr#7160 `_, Yehuda Sadeh) * rgw: fix compiling error (`pr#8394 `_, xie xingguo) * rgw: fixes for per-period metadata logs (`pr#7827 `_, Casey Bodley) * rgw: fix lockdep false positive (`pr#8284 `_, Yehuda Sadeh) * rgw: fix mdlog (`pr#8183 `_, Orit Wasserman) * rgw: fix objects can not be displayed which object name does not cont… (`issue#12963 `_, `pr#5738 `_, Weijun Duan) * rgw: fix openssl linkage (`pr#6513 `_, Yehuda Sadeh) * rgw: fix partial read issue in rgw_admin and rgw_tools (`pr#6761 `_, Jiaying Ren) * rgw: fix problem deleting objects begining with double underscores (`issue#15318 `_, `pr#8488 `_, Orit Wasserman) * rgw: fix reload on non Debian systems. (`pr#6482 `_, Hervé Rousseau) * rgw: fix response of delete expired objects (`issue#13469 `_, `pr#6228 `_, Yuan Zhou) * rgw: Fix subuser harder with tenants (`pr#7618 `_, Pete Zaitcev) * rgw: fix swift API returning incorrect account metadata (`issue#13140 `_, `pr#6047 `_, Sangdi Xu) * rgw: fix the build failure (`pr#6927 `_, Kefu Chai) * rgw: fix typo in RGWHTTPClient::process error message (`pr#6424 `_, Brad Hubbard) * rgw: fix unsafe c_str() usage (`issue#15463 `_, `pr#8559 `_, Sage Weil) * rgw: fix wrong check for parse() return (`pr#6797 `_, Dunrong Huang) * rgw: fix wrong etag calculation during POST on S3 bucket. (`issue#11241 `_, `pr#6030 `_, Radoslaw Zarzynski) * rgw: fix wrong handling of limit=0 during listing of Swift account. (`issue#14903 `_, `pr#7821 `_, Radoslaw Zarzynski) * rgw: handle no current_period in is_syncing_bucket_meta (`pr#8597 `_, Orit Wasserman) * rgw: highres time stamps (`pr#8108 `_, Yehuda Sadeh, Adam C. Emerson, Matt Benjamin) * rgw: improve error handling in S3/Keystone integration (`pr#7597 `_, Radoslaw Zarzynski) * rgw: increase verbosity level on RGWObjManifest line (`pr#7285 `_, magicrobotmonkey) * rgw: indexless (`pr#7786 `_, Yehuda Sadeh) * rgw: jewel nfs fixes (`pr#8460 `_, Matt Benjamin) * rgw: keystone v3 (`pr#7719 `_, Mark Barnes, Radoslaw Zarzynski) * rgw: ldap fixes (`pr#8168 `_, Matt Benjamin) * rgw_ldap: make ldap.h inclusion conditional (`pr#8500 `_, Matt Benjamin) * rgw: ldap (Matt Benjamin) (`pr#7985 `_, Matt Benjamin) * rgw leaks fixes (`pr#8636 `_, Yehuda Sadeh) * rgw: let radosgw-admin bucket stats return a standard josn (`pr#7029 `_, Ruifeng Yang) * rgw: link against system openssl (instead of dlopen at runtime) (`pr#6419 `_, Sage Weil) * rgw: link civetweb with openssl (Sage, Marcus Watts) (`pr#7825 `_, Marcus Watts, Sage Weil) * rgw: link payer info to usage logging (`pr#7918 `_, Yehuda Sadeh, Javier M. Mellid) * rgw: mdlog trim add usage prompt (`pr#6059 `_, Weijun Duan) * rgw: modify command stucking when operating radosgw-admin metadata list user (`pr#7032 `_, Peiyang Liu) * rgw: modify documents and help infos' descriptions to the usage of option date when executing command "log show" (`pr#6080 `_, Kongming Wu) * rgw: modify the conditional statement in parse_metadata_key method. (`pr#5875 `_, Zengran Zhang) * rgw: move signal.h dependency from rgw_front.h (`pr#7678 `_, Matt Benjamin) * rgw: Multipart ListPartsResult ETag quotes (`issue#15334 `_, `pr#8387 `_, Robin H. Johnson) * rgw: multiple improvements regarding etag calculation for SLO/DLO of Swift API. (`pr#7764 `_, Radoslaw Zarzynski) * rgw: multiple Swift API compliance improvements for TempURL (Radoslaw Zarzynsk) (`issue#14806 `_, `issue#11163 `_, `pr#7891 `_, Radoslaw Zarzynski) * rgw: multisite fixes (`pr#8013 `_, Yehuda Sadeh) * rgw: multitenancy support (`pr#6784 `_, Yehuda Sadeh, Pete Zaitcev) * rgw: new multisite merge (`issue#14549 `_, `pr#7709 `_, Yehuda Sadeh, Orit Wasserman, Casey Bodley, Daniel Gryniewicz) * rgw: Parse --subuser better (`pr#7279 `_, Pete Zaitcev) * rgw: prevent anonymous user from reading bucket with authenticated read ACL (`issue#13207 `_, `pr#6057 `_, root) * rgw: radosgw-admin bucket check --fix not work (`pr#7093 `_, Weijun Duan) * rgw-rados: return RGWSystemMetaObj init directly (`pr#8534 `_, Karol Mroz) * rgw: refuse to calculate digest when the s3 secret key is empty (`issue#13133 `_, `pr#6045 `_, Sangdi Xu) * rgw : region to zonegroup related fixes (`pr#8620 `_, Yehuda Sadeh) * rgw: remove duplicated code in RGWRados::get_bucket_info() (`pr#7413 `_, liyankun) * rgw: remove extra check in RGWGetObj::execute (`issue#12352 `_, `pr#5262 `_, Javier M. Mellid) * rgw: Remove unused code in PutMetadataAccount:execute (`pr#6668 `_, Pete Zaitcev) * rgw: remove unused variable in RGWPutMetadataBucket::execute (`pr#6735 `_, Radoslaw Zarzynski) * rgw: remove unused vector (`pr#7990 `_, Na Xie) * rgw: reset return code in when iterating over the bucket the objects (`issue#14826 `_, `pr#7803 `_, Orit Wasserman) * rgw: retry RGWRemoteMetaLog::read_log_info() while master is down (`pr#8453 `_, Casey Bodley) * rgw: Revert "rgw ldap" (`pr#8075 `_, Yehuda Sadeh) * rgw/rgw_admin:fix bug about list and stats command (`pr#8200 `_, Qiankun Zheng) * rgw/rgw_common.h: fix the RGWBucketInfo decoding (`pr#8154 `_, Kefu Chai) * rgw/rgw_common.h: fix the RGWBucketInfo decoding (`pr#8165 `_, Kefu Chai) * rgw: RGWLib::env is not used so remove it (`pr#7874 `_, Brad Hubbard) * rgw/rgw_orphan: check the return value of save_state (`pr#7544 `_, Boris Ranto) * rgw: RGWPeriod::reflect() sets master zonegroup as default (`pr#8566 `_, Casey Bodley) * rgw/rgw_resolve: fallback to res_query when res_nquery not implemented (`pr#6292 `_, John Coyle) * rgw: RGWZoneParams::create should not handle -EEXIST error (`pr#7927 `_, Orit Wasserman) * rgw: S3: set EncodingType in ListBucketResult (`pr#7712 `_, Victor Makarov) * rgw: signature mismatch with escaped characters in url query portion (`issue#15358 `_, `pr#8445 `_, Javier M. Mellid) * rgw: static large objects (Radoslaw Zarzynski, Yehuda Sadeh) * rgw: store system object meta in cache when creating it (`issue#14678 `_, `pr#7615 `_, Yehuda Sadeh) * rgw: support core file limit for radosgw daemon (`pr#6346 `_, Guang Yang) * rgw: support for aws authentication v4 (Javier M. Mellid) (`issue#10333 `_, `pr#7720 `_, Yehuda Sadeh, Javier M. Mellid) * rgw: support json format for admin policy API (Dunrong Huang) (`issue#14090 `_, `pr#8036 `_, Yehuda Sadeh, Dunrong Huang) * rgw: swift bulk delete (Radoslaw Zarzynski) * rgw: swift use Civetweb ssl can not get right url (`issue#13628 `_, `pr#6408 `_, Weijun Duan) * rgw: swift versioning disabled (`pr#8066 `_, Yehuda Sadeh, Radoslaw Zarzynski) * rgw: sync fixes 3 (`pr#8170 `_, Yehuda Sadeh) * rgw: sync fixes 4 (`pr#8190 `_, Yehuda Sadeh) * rgw sync fixes (`pr#8095 `_, Yehuda Sadeh) * rgw: the map 'headers' is assigned a wrong value (`pr#8481 `_, weiqiaomiao) * rgw: try to parse Keystone token in order appropriate to configuration. (`pr#7822 `_, Radoslaw Zarzynski) * rgw: try to use current period id in a few more cases (`pr#8588 `_, Yehuda Sadeh) * rgw:Use count fn in RGWUserBuckets for quota check (`pr#8294 `_, Abhishek Lekshmanan) * rgw: use pimpl pattern for RGWPeriodHistory (`pr#7809 `_, Casey Bodley) * rgw: user quota may not adjust on bucket removal (`issue#14507 `_, `pr#7586 `_, root) * rgw: use smart pointer for C_Reinitwatch (`pr#6767 `_, Orit Wasserman) * rgw: warn on suspicious civetweb frontend parameters (`pr#6944 `_, Matt Benjamin) * rocksdb: build with PORTABLE=1 (`pr#6311 `_, Sage Weil) * rocksdb: remove rdb source files from dist tarball (`issue#13554 `_, `pr#6379 `_, Kefu Chai) * rocksdb: remove rdb sources from dist tarball (`issue#13554 `_, `pr#7105 `_, Venky Shankar) * rocksdb: use native rocksdb makefile (and our autotools) (`pr#6290 `_, Sage Weil) * rpm: ceph.spec.in: correctly declare systemd dependency for SLE/openSUSE (`pr#6114 `_, Nathan Cutler) * rpm: ceph.spec.in: fix libs-compat / devel-compat conditional (`issue#12315 `_, `pr#5219 `_, Ken Dreyer) * rpm,deb: remove conditional BuildRequires for btrfs-progs (`issue#15042 `_, `pr#8016 `_, Erwan Velu) * rpm: move %post(un) ldconfig calls to ceph-base (`issue#14940 `_, `pr#7867 `_, Nathan Cutler) * rpm: move runtime dependencies to ceph-base and fix other packaging issues (`issue#14864 `_, `pr#7826 `_, Nathan Cutler) * rpm: prefer UID/GID 167 when creating ceph user/group (`issue#15246 `_, `pr#8277 `_, Nathan Cutler) * rpm: remove sub-package dependencies on "ceph" (`issue#15146 `_, `pr#8137 `_, Ken Dreyer) * rpm: rhel 5.9 librados compile fix, moved blkid to RBD check/compilation (`issue#13177 `_, `pr#5954 `_, Rohan Mars) * script: add missing stop_rgw variable to stop.sh script (`pr#7959 `_, Karol Mroz) * scripts: adjust mstart and mstop script to run with cmake build (`pr#6920 `_, Orit Wasserman) * scripts: release_notes can track original issue (`pr#6009 `_, Abhishek Lekshmanan) * script: subscription-manager support (`issue#14972 `_, `pr#7907 `_, Loic Dachary) * selinux: allow log files to be located in /var/log/radosgw (`pr#7604 `_, Boris Ranto) * selinux: Update policy to grant additional access (`issue#14870 `_, `pr#7971 `_, Boris Ranto) * set 128MB tcmalloc cache size by bytes (`pr#8427 `_, Star Guo) * stringify outputted error code and fix unmatched parentheses. (`pr#6998 `_, xie.xingguo, xie xingguo) * submodules: revert an accidental change (`pr#7929 `_, Yehuda Sadeh) * systemd: correctly escape block device paths (`issue#14706 `_, `pr#7579 `_, James Page) * systemd: drop any systemd imposed process/thread limits (`pr#8450 `_, James Page) * systemd: fix typos (`pr#6679 `_, Tobias Suckow) * systemd: set up environment in rbdmap unit file (`issue#14984 `_, `pr#8222 `_, Nathan Cutler) * systemd: start/stop/restart ceph services by daemon type (`issue#13497 `_, `pr#6276 `_, Zhi Zhang) * systemd: Use the same restart limits as upstart (`pr#8188 `_, Boris Ranto) * sysvinit: allow custom cluster names (`pr#6732 `_, Richard Chan) * test: add missing shut_down mock method (`pr#8125 `_, Jason Dillaman) * test/bufferlist: Avoid false-positive tests (`pr#7955 `_, Erwan Velu) * test: ceph_test_rados: use less CPU (`pr#7513 `_, Samuel Just) * test/cli-integration/rbd: disable progress output (`issue#14931 `_, `pr#7858 `_, Josh Durgin) * test: correct librbd errors discovered with unoptimized cmake build (`pr#7914 `_, Jason Dillaman) * test: create pools for rbd tests with different prefix (`pr#7738 `_, Mykola Golub) * test: enable test for bug #2339 which has been resolved. (`pr#7743 `_, You Ji) * test/encoding/readable.sh fix (`pr#6714 `_, Igor Podoski) * test/encoding/readable: use [ for "test" not (( (`pr#8574 `_, Kefu Chai) * test: fix issues discovered via the rbd permissions test case (`pr#8129 `_, Jason Dillaman) * test: fix osd-scrub-snaps.sh (`pr#6697 `_, Xinze Chi) * test: Fix test to run with btrfs which has snap_### dirs (`issue#15347 `_, `pr#8420 `_, David Zafman) * test: fixup and improvements for rbd-mirror test (`pr#8090 `_, Mykola Golub) * test: fix ut test failure caused by lfn change (`issue#15464 `_, `pr#8544 `_, xie xingguo) * test: fix valgrind memcheck issues for rbd-mirror test cases (`issue#15354 `_, `pr#8493 `_, Jason Dillaman) * test: handle exception thrown from close during rbd lock test (`pr#8124 `_, Jason Dillaman) * test: image replayer needs dummy remote mirror peer uuid (`pr#8584 `_, Jason Dillaman) * test/libcephfs/flock: add sys/file.h include for flock operations (`pr#6310 `_, John Coyle) * test/librados/test.cc: clean up EC pools' crush rules too (`issue#13878 `_, `pr#6788 `_, Loic Dachary, Dan Mick) * test/librbd/fsx: Use c++11 std::mt19937 generator instead of random_r() (`pr#6332 `_, John Coyle) * test/mon/osd-erasure-code-profile: pick new mon port (`pr#7161 `_, Sage Weil) * test: more debug logging for TestWatchNotify (`pr#7737 `_, Mykola Golub) * test: new librbd flatten test case (`pr#7609 `_, Jason Dillaman) * test/osd: Relax the timing intervals in osd-markdown.sh (`pr#7899 `_, Dan Mick) * test_pool_create.sh: put test files in the test dir so they are cleaned up (`pr#8219 `_, Josh Durgin) * test/pybind/test_ceph_argparse: fix reweight-by-utilization tests (`pr#8027 `_, Kefu Chai, Sage Weil) * test/radosgw-admin: update the expected usage outputs (`pr#7723 `_, Kefu Chai) * test: rbd-mirror: add "switch to the next tag" test (`pr#8149 `_, Mykola Golub) * test: rbd-mirror: compare positions using all fields (`pr#8172 `_, Mykola Golub) * test: rbd-mirror: script improvements for manual testing (`pr#8325 `_, Mykola Golub) * test: reproducer for writeback CoW deadlock (`pr#8009 `_, Jason Dillaman) * test/rgw: add multisite test for meta sync across periods (`pr#7887 `_, Casey Bodley) * test_rgw_admin: use freopen for output redirection. (`pr#6303 `_, John Coyle) * tests: add const for ec test (`pr#6911 `_, Michal Jarzabek) * tests: add Ubuntu 16.04 xenial dockerfile (`pr#8519 `_, Loic Dachary) * tests: allow docker-test.sh to run under root (`issue#13355 `_, `pr#6173 `_, Loic Dachary) * tests: allow object corpus readable test to skip specific incompat instances (`pr#6932 `_, Igor Podoski) * tests: centos7 needs the Continuous Release (CR) Repository enabled for (`issue#13997 `_, `pr#6842 `_, Brad Hubbard) * tests: ceph-disk.sh: should use "readlink -f" instead (`pr#7594 `_, Kefu Chai) * tests: ceph-disk.sh: use "readlink -f" instead for fullpath (`pr#7606 `_, Kefu Chai) * tests: ceph-disk workunit uses configobj (`pr#6342 `_, Loic Dachary) * tests: ceph-helpers assert success getting backfills (`pr#6699 `_, Loic Dachary) * tests: ceph_test_keyvaluedb_iterators: fix broken test (`pr#6597 `_, Haomai Wang) * tests: concatenate test_rados_test_tool from src and qa (`issue#13691 `_, `pr#6464 `_, Loic Dachary) * tests: configure with rocksdb by default (`issue#14220 `_, `pr#7100 `_, Loic Dachary) * tests: destroy testprofile before creating one (`issue#13664 `_, `pr#6446 `_, Loic Dachary) * test: set a default $CEPH_ROOT env variable (`pr#8645 `_, Kefu Chai) * tests: fix a few build warnings (`pr#7608 `_, Sage Weil) * tests: fix failure for osd-scrub-snap.sh (`issue#13986 `_, `pr#6890 `_, Loic Dachary, Ning Yao) * tests: Fix for make check. (`pr#7102 `_, David Zafman) * tests: Fixing broken test/cephtool-test-mon.sh test (`pr#8429 `_, Erwan Velu) * tests: fix race condition testing auto scrub (`issue#13592 `_, `pr#6724 `_, Xinze Chi, Loic Dachary) * tests: fix test_rados_tools.sh rados lookup (`issue#13691 `_, `pr#6502 `_, Loic Dachary) * tests: fix typo in TestClsRbd.snapshots test case (`issue#13727 `_, `pr#6504 `_, Jason Dillaman) * tests: flush op work queue prior to destroying MockImageCtx (`issue#14092 `_, `pr#7002 `_, Jason Dillaman) * tests: ignore test-suite.log (`pr#6584 `_, Loic Dachary) * tests: Improving 'make check' execution time (`pr#8131 `_, Erwan Velu) * tests: notification slave needs to wait for master (`issue#13810 `_, `pr#7220 `_, Jason Dillaman) * tests: --osd-scrub-load-threshold=2000 for more consistency (`issue#14027 `_, `pr#6871 `_, Loic Dachary) * tests: osd-scrub-snaps.sh to display full osd logs on error (`issue#13986 `_, `pr#6857 `_, Loic Dachary) * tests: port uniqueness reminder (`pr#6387 `_, Loic Dachary) * tests: restore run-cli-tests (`pr#6571 `_, Loic Dachary, Sage Weil, Jason Dillaman) * tests: snap rename and rebuild object map in client update test (`pr#7224 `_, Jason Dillaman) * tests: sync ceph-erasure-code-corpus for mktemp -d (`pr#7596 `_, Loic Dachary) * tests: test/librados/test.cc must create profile (`issue#13664 `_, `pr#6452 `_, Loic Dachary) * tests: test_pidfile.sh lingering processes (`issue#14834 `_, `pr#7734 `_, Loic Dachary) * tests: unittest_bufferlist: fix hexdump test (`pr#7152 `_, Sage Weil) * tests: unittest_ipaddr: fix segv (`pr#7154 `_, Sage Weil) * test/system/rados_list_parallel: print oid if rados_write fails (`issue#15240 `_, `pr#8309 `_, Kefu Chai) * test/system/\*: use dynamically generated pool name (`issue#15240 `_, `pr#8318 `_, Kefu Chai) * test/test-erasure-code.sh: disable pg_temp priming (`issue#15211 `_, `pr#8260 `_, Sage Weil) * test: TestMirroringWatcher test cases were not closing images (`pr#8435 `_, Jason Dillaman) * test/TestPGLog: fix the FTBFS (`issue#14930 `_, `pr#7855 `_, Kefu Chai) * test/test_pool_create.sh: fix port (`pr#8361 `_, Sage Weil) * test/time: no need to abs(uint64_t) for comparing (`pr#7726 `_, Kefu Chai) * test: update rbd integration cram tests for new default features (`pr#8001 `_, Jason Dillaman) * test: use sequential journal_tid for object cacher test (`issue#13877 `_, `pr#6710 `_, Josh Durgin) * tools: add cephfs-table-tool 'take_inos' (`pr#6655 `_, John Spray) * tools/cephfs: add tmap_upgrade (`pr#7003 `_, John Spray) * tools/cephfs: fix overflow writing header to fixed size buffer (#13816) (`pr#6617 `_, John Spray) * tools/cephfs: fix overflow writing header to fixed size buffer (#13816) (`pr#6617 `_, John Spray) * tools/cephfs: fix tmap_upgrade (`issue#15135 `_, `pr#8128 `_, John Spray) * tools: ceph_monstore_tool: add inflate-pgmap command (`issue#14217 `_, `pr#7097 `_, Kefu Chai) * tools: ceph-monstore-update-crush: add "--test" when testing crushmap (`pr#6418 `_, Kefu Chai) * tools: Fix layout handing in cephfs-data-scan (#13898) (`pr#6719 `_, John Spray) * tools: monstore: add 'show-versions' command. (`pr#7073 `_, Cilang Zhao) * tools/rados: reduce "rados put" memory usage by op_size (`pr#7928 `_, Piotr Dałek) * tools:remove duplicate references (`pr#5917 `_, Bo Cai) * tools: support printing part cluster map in readable fashion (`issue#13079 `_, `pr#5921 `_, Bo Cai) * unittest_compression_zlib: do not assume buffer will be null terminated (`pr#8064 `_, Sage Weil) * unittest_erasure_code_plugin: fix deadlock (Alpine) (`pr#8314 `_, John Coyle) * unittest_osdmap: default crush tunables now firefly (`pr#8098 `_, Sage Weil) * vstart: fix up cmake paths when VSTART_DEST is given (`pr#8363 `_, Casey Bodley) * vstart: grant full access to Swift testing account (`pr#6239 `_, Yuan Zhou) * vstart: make -k with optional mon_num. (`pr#8251 `_, Jianpeng Ma) * vstart: set cephfs root uid/gid to caller (`pr#6255 `_, John Spray) * vstart.sh: add mstart, mstop, mrun wrappers for running multiple vstart-style test clusters out of src tree (`pr#6901 `_, Yehuda Sadeh) * vstart.sh: avoid race condition starting rgw via vstart.sh (`issue#14829 `_, `pr#7727 `_, Javier M. Mellid) * vstart.sh: silence a harmless msg where btrfs is not found (`pr#7640 `_, Patrick Donnelly) * xio: add prefix to xio msgr logs (`pr#8148 `_, Roi Dayan) * xio: fix compilation against latest accelio (`pr#8022 `_, Roi Dayan) * xio: fix incorrect ip being assigned in case of multiple RDMA ports (`pr#7747 `_, Subramanyam Varanasi) * xio: remove duplicate assignment of peer addr (`pr#8025 `_, Roi Dayan) * xio: remove redundant magic methods (`pr#7773 `_, Roi Dayan) * xio: remove unused variable (`pr#8023 `_, Roi Dayan) * xio: xio_init needs to be called before any other xio function (`pr#8227 `_, Roi Dayan) * xxhash: use clone of xxhash.git; add .gitignore (`pr#7986 `_, Sage Weil) Notable Changes since Hammer ---------------------------- * aarch64: add optimized version of crc32c (Yazen Ghannam, Steve Capper) * auth: cache/reuse crypto lib key objects, optimize msg signature check (Sage Weil) * auth: reinit NSS after fork() (#11128 Yan, Zheng) * autotools: fix out of tree build (Krxysztof Kosinski) * autotools: improve make check output (Loic Dachary) * buffer: add invalidate_crc() (Piotr Dalek) * buffer: fix zero bug (#12252 Haomai Wang) * buffer: some cleanup (Michal Jarzabek) * build: allow tcmalloc-minimal (Thorsten Behrens) * build: C++11 now supported * build: cmake: fix nss linking (Danny Al-Gaaf) * build: cmake: misc fixes (Orit Wasserman, Casey Bodley) * build: disable LTTNG by default (#11333 Josh Durgin) * build: do not build ceph-dencoder with tcmalloc (#10691 Boris Ranto) * build: fix junit detection on Fedora 22 (Ira Cooper) * build: fix pg ref disabling (William A. Kennington III) * build: fix ppc build (James Page) * build: install-deps: misc fixes (Loic Dachary) * build: install-deps.sh improvements (Loic Dachary) * build: install-deps: support OpenSUSE (Loic Dachary) * build: make_dist_tarball.sh (Sage Weil) * build: many cmake improvements * build: misc cmake fixes (Matt Benjamin) * build: misc fixes (Boris Ranto, Ken Dreyer, Owen Synge) * build: OSX build fixes (Yan, Zheng) * build: remove rest-bench * ceph-authtool: fix return code on error (Gerhard Muntingh) * ceph-detect-init: added Linux Mint (Michal Jarzabek) * ceph-detect-init: robust init system detection (Owen Synge) * ceph-disk: ensure 'zap' only operates on a full disk (#11272 Loic Dachary) * ceph-disk: fix zap sgdisk invocation (Owen Synge, Thorsten Behrens) * ceph-disk: follow ceph-osd hints when creating journal (#9580 Sage Weil) * ceph-disk: handle re-using existing partition (#10987 Loic Dachary) * ceph-disk: improve parted output parsing (#10983 Loic Dachary) * ceph-disk: install pip > 6.1 (#11952 Loic Dachary) * ceph-disk: make suppression work for activate-all and activate-journal (Dan van der Ster) * ceph-disk: many fixes (Loic Dachary, Alfredo Deza) * ceph-disk: fixes to respect init system (Loic Dachary, Owen Synge) * ceph-disk: pass --cluster arg on prepare subcommand (Kefu Chai) * ceph-disk: support for multipath devices (Loic Dachary) * ceph-disk: support NVMe device partitions (#11612 Ilja Slepnev) * ceph: fix 'df' units (Zhe Zhang) * ceph: fix parsing in interactive cli mode (#11279 Kefu Chai) * cephfs-data-scan: many additions, improvements (John Spray) * ceph-fuse: do not require successful remount when unmounting (#10982 Greg Farnum) * ceph-fuse, libcephfs: don't clear COMPLETE when trimming null (Yan, Zheng) * ceph-fuse, libcephfs: drop inode when rmdir finishes (#11339 Yan, Zheng) * ceph-fuse,libcephfs: fix uninline (#11356 Yan, Zheng) * ceph-fuse, libcephfs: hold exclusive caps on dirs we "own" (#11226 Greg Farnum) * ceph-fuse: mostly behave on 32-bit hosts (Yan, Zheng) * ceph: improve error output for 'tell' (#11101 Kefu Chai) * ceph-monstore-tool: fix store-copy (Huangjun) * ceph: new 'ceph daemonperf' command (John Spray, Mykola Golub) * ceph-objectstore-tool: many many improvements (David Zafman) * ceph-objectstore-tool: refactoring and cleanup (John Spray) * ceph-post-file: misc fixes (Joey McDonald, Sage Weil) * ceph_test_rados: test pipelined reads (Zhiqiang Wang) * client: avoid sending unnecessary FLUSHSNAP messages (Yan, Zheng) * client: exclude setfilelock when calculating oldest tid (Yan, Zheng) * client: fix error handling in check_pool_perm (John Spray) * client: fsync waits only for inode's caps to flush (Yan, Zheng) * client: invalidate kernel dcache when cache size exceeds limits (Yan, Zheng) * client: make fsync wait for unsafe dir operations (Yan, Zheng) * client: pin lookup dentry to avoid inode being freed (Yan, Zheng) * common: add descriptions to perfcounters (Kiseleva Alyona) * common: add perf counter descriptions (Alyona Kiseleva) * common: bufferlist performance tuning (Piotr Dalek, Sage Weil) * common: detect overflow of int config values (#11484 Kefu Chai) * common: fix bit_vector extent calc (#12611 Jason Dillaman) * common: fix json parsing of utf8 (#7387 Tim Serong) * common: fix leak of pthread_mutexattr (#11762 Ketor Meng) * common: fix LTTNG vs fork issue (Josh Durgin) * common: fix throttle max change (Henry Chang) * common: make mutex more efficient * common: make work queue addition/removal thread safe (#12662 Jason Dillaman) * common: optracker improvements (Zhiqiang Wang, Jianpeng Ma) * common: PriorityQueue tests (Kefu Chai) * common: some async compression infrastructure (Haomai Wang) * crush: add --check to validate dangling names, max osd id (Kefu Chai) * crush: cleanup, sync with kernel (Ilya Dryomov) * crush: fix crash from invalid 'take' argument (#11602 Shiva Rkreddy, Sage Weil) * crush: fix divide-by-2 in straw2 (#11357 Yann Dupont, Sage Weil) * crush: fix has_v4_buckets (#11364 Sage Weil) * crush: fix subtree base weight on adjust_subtree_weight (#11855 Sage Weil) * crush: respect default replicated ruleset config on map creation (Ilya Dryomov) * crushtool: fix order of operations, usage (Sage Weil) * crypto: fix NSS leak (Jason Dillaman) * crypto: fix unbalanced init/shutdown (#12598 Zheng Yan) * deb: fix rest-bench-dbg and ceph-test-dbg dependendies (Ken Dreyer) * debian: minor package reorg (Ken Dreyer) * deb, rpm: move ceph-objectstore-tool to ceph (Ken Dreyer) * doc: docuemnt object corpus generation (#11099 Alexis Normand) * doc: document region hostnames (Robin H. Johnson) * doc: fix gender neutrality (Alexandre Maragone) * doc: fix install doc (#10957 Kefu Chai) * doc: fix sphinx issues (Kefu Chai) * doc: man page updates (Kefu Chai) * doc: mds data structure docs (Yan, Zheng) * doc: misc updates (Fracois Lafont, Ken Dreyer, Kefu Chai, Owen Synge, Gael Fenet-Garde, Loic Dachary, Yannick Atchy-Dalama, Jiaying Ren, Kevin Caradant, Robert Maxime, Nicolas Yong, Germain Chipaux, Arthur Gorjux, Gabriel Sentucq, Clement Lebrun, Jean-Remi Deveaux, Clair Massot, Robin Tang, Thomas Laumondais, Jordan Dorne, Yuan Zhou, Valentin Thomas, Pierre Chaumont, Benjamin Troquereau, Benjamin Sesia, Vikhyat Umrao, Nilamdyuti Goswami, Vartika Rai, Florian Haas, Loic Dachary, Simon Guinot, Andy Allan, Alistair Israel, Ken Dreyer, Robin Rehu, Lee Revell, Florian Marsylle, Thomas Johnson, Bosse Klykken, Travis Rhoden, Ian Kelling) * doc: swift tempurls (#10184 Abhishek Lekshmanan) * doc: switch doxygen integration back to breathe (#6115 Kefu Chai) * doc: update release schedule docs (Loic Dachary) * erasure-code: cleanup (Kefu Chai) * erasure-code: improve tests (Loic Dachary) * erasure-code: shec: fix recovery bugs (Takanori Nakao, Shotaro Kawaguchi) * erasure-code: update ISA-L to 2.13 (Yuan Zhou) * gmock: switch to submodule (Danny Al-Gaaf, Loic Dachary) * hadoop: add terasort test (Noah Watkins) * init-radosgw: merge with sysv version; fix enumeration (Sage Weil) * java: fix libcephfs bindings (Noah Watkins) * libcephfs: add pread, pwrite (Jevon Qiao) * libcephfs,ceph-fuse: cache cleanup (Zheng Yan) * libcephfs,ceph-fuse: fix request resend on cap reconnect (#10912 Yan, Zheng) * librados: add config observer (Alistair Strachan) * librados: add FULL_TRY and FULL_FORCE flags for dealing with full clusters or pools (Sage Weil) * librados: add src_fadvise_flags for copy-from (Jianpeng Ma) * librados: define C++ flags from C constants (Josh Durgin) * librados: fadvise flags per op (Jianpeng Ma) * librados: fix last_force_resent handling (#11026 Jianpeng Ma) * librados: fix memory leak from C_TwoContexts (Xiong Yiliang) * librados: fix notify completion race (#13114 Sage Weil) * librados: fix striper when stripe_count = 1 and stripe_unit != object_size (#11120 Yan, Zheng) * librados, libcephfs: randomize client nonces (Josh Durgin) * librados: op perf counters (John Spray) * librados: pybind: fix binary omap values (Robin H. Johnson) * librados: pybind: fix write() method return code (Javier Guerra) * librados: respect default_crush_ruleset on pool_create (#11640 Yuan Zhou) * libradosstriper: fix leak (Danny Al-Gaaf) * librbd: add const for single-client-only features (Josh Durgin) * librbd: add deep-flatten operation (Jason Dillaman) * librbd: add purge_on_error cache behavior (Jianpeng Ma) * librbd: allow additional metadata to be stored with the image (Haomai Wang) * librbd: avoid blocking aio API methods (#11056 Jason Dillaman) * librbd: better handling for dup flatten requests (#11370 Jason Dillaman) * librbd: cancel in-flight ops on watch error (#11363 Jason Dillaman) * librbd: default new images to format 2 (#11348 Jason Dillaman) * librbd: fadvise for copy, export, import (Jianpeng Ma) * librbd: fast diff implementation that leverages object map (Jason Dillaman) * librbd: fix fast diff bugs (#11553 Jason Dillaman) * librbd: fix image format detection (Zhiqiang Wang) * librbd: fix lock ordering issue (#11577 Jason Dillaman) * librbd: fix reads larger than the cache size (Lu Shi) * librbd: fix snapshot creation when other snap is active (#11475 Jason Dillaman) * librbd: flatten/copyup fixes (Jason Dillaman) * librbd: handle NOCACHE fadvise flag (Jinapeng Ma) * librbd: lockdep, helgrind validation (Jason Dillaman, Josh Durgin) * librbd: metadata filter fixes (Haomai Wang) * librbd: misc aio fixes (#5488 Jason Dillaman) * librbd: misc rbd fixes (#11478 #11113 #11342 #11380 Jason Dillaman, Zhiqiang Wang) * librbd: new diff_iterate2 API (Jason Dillaman) * librbd: object map rebuild support (Jason Dillaman) * librbd: only update image flags while hold exclusive lock (#11791 Jason Dillaman) * librbd: optionally disable allocation hint (Haomai Wang) * librbd: prevent race between resize requests (#12664 Jason Dillaman) * librbd: readahead fixes (Zhiqiang Wang) * librbd: return result code from close (#12069 Jason Dillaman) * librbd: store metadata, including config options, in image (Haomai Wang) * librbd: tolerate old osds when getting image metadata (#11549 Jason Dillaman) * librbd: use write_full when possible (Zhiqiang Wang) * log: fix data corruption race resulting from log rotation (#12465 Samuel Just) * logrotate.d: prefer service over invoke-rc.d (#11330 Win Hierman, Sage Weil) * mds: add 'damaged' state to MDSMap (John Spray) * mds: add nicknames for perfcounters (John Spray) * mds: avoid emitting cap warnigns before evicting session (John Spray) * mds: avoid getting stuck in XLOCKDONE (#11254 Yan, Zheng) * mds: disable problematic rstat propagation into snap parents (Yan, Zheng) * mds: do not add snapped items to bloom filter (Yan, Zheng) * mds: expose frags via asok (John Spray) * mds: fix expected holes in journal objects (#13167 Yan, Zheng) * mds: fix handling for missing mydir dirfrag (#11641 John Spray) * mds: fix integer truncateion on large client ids (Henry Chang) * mds: fix mydir replica issue with shutdown (#10743 John Spray) * mds: fix out-of-order messages (#11258 Yan, Zheng) * mds: fix rejoin (Yan, Zheng) * mds: fix setting entire file layout in one setxattr (John Spray) * mds: fix shutdown (John Spray) * mds: fix shutdown with strays (#10744 John Spray) * mds: fix SnapServer crash on deleted pool (John Spray) * mds: fix snapshot bugs (Yan, Zheng) * mds: fix stray reintegration (Yan, Zheng) * mds: fix stray handling (John Spray) * mds: fix suicide beacon (John Spray) * mds: flush immediately in do_open_truncate (#11011 John Spray) * mds: handle misc corruption issues (John Spray) * mds: improve dump methods (John Spray) * mds: many fixes (Yan, Zheng, John Spray, Greg Farnum) * mds: many snapshot and stray fixes (Yan, Zheng) * mds: misc fixes (Jianpeng Ma, Dan van der Ster, Zhang Zhi) * mds: misc journal cleanups and fixes (#10368 John Spray) * mds: misc repair improvements (John Spray) * mds: misc snap fixes (Zheng Yan) * mds: misc snapshot fixes (Yan, Zheng) * mds: new SessionMap storage using omap (#10649 John Spray) * mds: persist completed_requests reliably (#11048 John Spray) * mds: reduce memory consumption (Yan, Zheng) * mds: respawn instead of suicide on blacklist (John Spray) * mds: separate safe_pos in Journaler (#10368 John Spray) * mds: snapshot rename support (#3645 Yan, Zheng) * mds: store layout on header object (#4161 John Spray) * mds: throttle purge stray operations (#10390 John Spray) * mds: tolerate clock jumping backwards (#11053 Yan, Zheng) * mds: warn when clients fail to advance oldest_client_tid (#10657 Yan, Zheng) * misc cleanups and fixes (Danny Al-Gaaf) * misc coverity fixes (Danny Al-Gaaf) * misc performance and cleanup (Nathan Cutler, Xinxin Shu) * mon: add cache over MonitorDBStore (Kefu Chai) * mon: add 'mon_metadata ' command (Kefu Chai) * mon: add 'node ls ...' command (Kefu Chai) * mon: add NOFORWARD, OBSOLETE, DEPRECATE flags for mon commands (Joao Eduardo Luis) * mon: add PG count to 'ceph osd df' output (Michal Jarzabek) * mon: 'ceph osd metadata' can dump all osds (Haomai Wang) * mon: clean up, reorg some mon commands (Joao Eduardo Luis) * monclient: flush_log (John Spray) * mon: detect kv backend failures (Sage Weil) * mon: disallow >2 tiers (#11840 Kefu Chai) * mon: disallow ec pools as tiers (#11650 Samuel Just) * mon: do not deactivate last mds (#10862 John Spray) * mon: fix average utilization calc for 'osd df' (Mykola Golub) * mon: fix CRUSH map test for new pools (Sage Weil) * mon: fix log dump crash when debugging (Mykola Golub) * mon: fix mds beacon replies (#11590 Kefu Chai) * mon: fix metadata update race (Mykola Golub) * mon: fix min_last_epoch_clean tracking (Kefu Chai) * mon: fix 'pg ls' sort order, state names (#11569 Kefu Chai) * mon: fix refresh (#11470 Joao Eduardo Luis) * mon: fix variance calc in 'osd df' (Sage Weil) * mon: improve callout to crushtool (Mykola Golub) * mon: make blocked op messages more readable (Jianpeng Ma) * mon: make osd get pool 'all' only return applicable fields (#10891 Michal Jarzabek) * mon: misc scaling fixes (Sage Weil) * mon: normalize erasure-code profile for storage and comparison (Loic Dachary) * mon: only send mon metadata to supporting peers (Sage Weil) * mon: optionally specify osd id on 'osd create' (Mykola Golub) * mon: 'osd tree' fixes (Kefu Chai) * mon: periodic background scrub (Joao Eduardo Luis) * mon: prevent bucket deletion when referenced by a crush rule (#11602 Sage Weil) * mon: prevent pgp_num > pg_num (#12025 Xinxin Shu) * mon: prevent pool with snapshot state from being used as a tier (#11493 Sage Weil) * mon: prime pg_temp when CRUSH map changes (Sage Weil) * mon: refine check_remove_tier checks (#11504 John Spray) * mon: reject large max_mds values (#12222 John Spray) * mon: remove spurious who arg from 'mds rm ...' (John Spray) * mon: streamline session handling, fix memory leaks (Sage Weil) * mon: upgrades must pass through hammer (Sage Weil) * mon: warn on bogus cache tier config (Jianpeng Ma) * msgr: add ceph_perf_msgr tool (Hoamai Wang) * msgr: async: fix seq handling (Haomai Wang) * msgr: async: many many fixes (Haomai Wang) * msgr: simple: fix clear_pipe (#11381 Haomai Wang) * msgr: simple: fix connect_seq assert (Haomai Wang) * msgr: xio: fastpath improvements (Raju Kurunkad) * msgr: xio: fix ip and nonce (Raju Kurunkad) * msgr: xio: improve lane assignment (Vu Pham) * msgr: xio: sync with accellio v1.4 (Vu Pham) * msgr: xio: misc fixes (#10735 Matt Benjamin, Kefu Chai, Danny Al-Gaaf, Raju Kurunkad, Vu Pham, Casey Bodley) * msg: unit tests (Haomai Wang) * objectcacher: misc bug fixes (Jianpeng Ma) * osd: add latency perf counters for tier operations (Xinze Chi) * osd: add misc perfcounters (Xinze Chi) * osd: add simple sleep injection in recovery (Sage Weil) * osd: allow SEEK_HOLE/SEEK_DATA for sparse read (Zhiqiang Wang) * osd: avoid dup omap sets for in pg metadata (Sage Weil) * osd: avoid multiple hit set insertions (Zhiqiang Wang) * osd: avoid transaction append in some cases (Sage Weil) * osd: break PG removal into multiple iterations (#10198 Guang Yang) * osd: cache proxy-write support (Zhiqiang Wang, Samuel Just) * osd: check scrub state when handling map (Jianpeng Ma) * osd: clean up some constness, privateness (Kefu Chai) * osd: clean up temp object if promotion fails (Jianpeng Ma) * osd: configure promotion based on write recency (Zhiqiang Wang) * osd: constrain collections to meta and PGs (normal and temp) (Sage Weil) * osd: don't send dup MMonGetOSDMap requests (Sage Weil, Kefu Chai) * osd: EIO injection (David Zhang) * osd: elminiate txn apend, ECSubWrite copy (Samuel Just) * osd: erasure-code: drop entries according to LRU (Andreas-Joachim Peters) * osd: erasure-code: fix SHEC floating point bug (#12936 Loic Dachary) * osd: erasure-code: update to ISA-L 2.14 (Yuan Zhou) * osd: filejournal: cleanup (David Zafman) * osd: filestore: clone using splice (Jianpeng Ma) * osd: filestore: fix recursive lock (Xinxin Shu) * osd: fix check_for_full (Henry Chang) * osd: fix dirty accounting in make_writeable (Zhiqiang Wang) * osd: fix dup promotion lost op bug (Zhiqiang Wang) * osd: fix endless repair when object is unrecoverable (Jianpeng Ma, Kefu Chai) * osd: fix hitset object naming to use GMT (Kefu Chai) * osd: fix misc memory leaks (Sage Weil) * osd: fix negative degraded stats during backfill (Guang Yang) * osd: fix osdmap dump of blacklist items (John Spray) * osd: fix peek_queue locking in FileStore (Xinze Chi) * osd: fix pg resurrection (#11429 Samuel Just) * osd: fix promotion vs full cache tier (Samuel Just) * osd: fix replay requeue when pg is still activating (#13116 Samuel Just) * osd: fix scrub stat bugs (Sage Weil, Samuel Just) * osd: fix snap flushing from cache tier (again) (#11787 Samuel Just) * osd: fix snap handling on promotion (#11296 Sam Just) * osd: fix temp-clearing (David Zafman) * osd: force promotion for ops EC can't handle (Zhiqiang Wang) * osd: handle log split with overlapping entries (#11358 Samuel Just) * osd: ignore non-existent osds in unfound calc (#10976 Mykola Golub) * osd: improve behavior on machines with large memory pages (Steve Capper) * osd: include a temp namespace within each collection/pgid (Sage Weil) * osd: increase default max open files (Owen Synge) * osd: keyvaluestore: misc fixes (Varada Kari) * osd: low and high speed flush modes (Mingxin Liu) * osd: make suicide timeouts individually configurable (Samuel Just) * osd: merge multiple setattr calls into a setattrs call (Xinxin Shu) * osd: misc fixes (Ning Yao, Kefu Chai, Xinze Chi, Zhiqiang Wang, Jianpeng Ma) * osd: move scrub in OpWQ (Samuel Just) * osd: newstore prototype (Sage Weil) * osd: ObjectStore internal API refactor (Sage Weil) * osd: peer_features includes self (David Zafman) * osd: pool size change triggers new interval (#11771 Samuel Just) * osd: prepopulate needs_recovery_map when only one peer has missing (#9558 Guang Yang) * osd: randomize scrub times (#10973 Kefu Chai) * osd: recovery, peering fixes (#11687 Samuel Just) * osd: refactor scrub and digest recording (Sage Weil) * osd: refuse first write to EC object at non-zero offset (Jianpeng Ma) * osd: relax reply order on proxy read (#11211 Zhiqiang Wang) * osd: require firefly features (David Zafman) * osd: set initial crush weight with more precision (Sage Weil) * osd: SHEC no longer experimental * osd: skip promotion for flush/evict op (Zhiqiang Wang) * osd: stripe over small xattrs to fit in XFS's 255 byte inline limit (Sage Weil, Ning Yao) * osd: sync object_map on syncfs (Samuel Just) * osd: take excl lock of op is rw (Samuel Just) * osd: throttle evict ops (Yunchuan Wen) * osd: upgrades must pass through hammer (Sage Weil) * osd: use a temp object for recovery (Sage Weil) * osd: use blkid to collection partition information (Joseph Handzik) * osd: use SEEK_HOLE / SEEK_DATA for sparse copy (Xinxin Shu) * osd: WBThrottle cleanups (Jianpeng Ma) * osd: write journal header on clean shutdown (Xinze Chi) * osdc/Objecter: allow per-pool calls to op_cancel_writes (John Spray) * os/filestore: enlarge getxattr buffer size (Jianpeng Ma) * pybind: pep8 cleanups (Danny Al-Gaaf) * pycephfs: many fixes for bindings (Haomai Wang) * qa: fix filelock_interrupt.py test (Yan, Zheng) * qa: improve ceph-disk tests (Loic Dachary) * qa: improve docker build layers (Loic Dachary) * qa: run-make-check.sh script (Loic Dachary) * rados: add --striper option to use libradosstriper (#10759 Sebastien Ponce) * rados: bench: add --no-verify option to improve performance (Piotr Dalek) * rados bench: misc fixes (Dmitry Yatsushkevich) * rados: fix error message on failed pool removal (Wido den Hollander) * radosgw-admin: add 'bucket check' function to repair bucket index (Yehuda Sadeh) * radosgw-admin: fix subuser modify output (#12286 Guce) * rados: handle --snapid arg properly (Abhishek Lekshmanan) * rados: improve bench buffer handling, performance (Piotr Dalek) * rados: misc bench fixes (Dmitry Yatsushkevich) * rados: new pool import implementation (John Spray) * rados: translate errno to string in CLI (#10877 Kefu Chai) * rbd: accept map options config option (Ilya Dryomov) * rbd: add disk usage tool (#7746 Jason Dillaman) * rbd: allow unmapping by spec (Ilya Dryomov) * rbd: cli: fix arg parsing with --io-pattern (Dmitry Yatsushkevich) * rbd: deprecate --new-format option (Jason Dillman) * rbd: fix error messages (#2862 Rajesh Nambiar) * rbd: fix link issues (Jason Dillaman) * rbd: improve CLI arg parsing, usage (Ilya Dryomov) * rbd: rbd-replay-prep and rbd-replay improvements (Jason Dillaman) * rbd: recognize queue_depth kernel option (Ilya Dryomov) * rbd: support G and T units for CLI (Abhishek Lekshmanan) * rbd: update rbd man page (Ilya Dryomov) * rbd: update xfstests tests (Douglas Fuller) * rbd: use image-spec and snap-spec in help (Vikhyat Umrao, Ilya Dryomov) * rest-bench: misc fixes (Shawn Chen) * rest-bench: support https (#3968 Yuan Zhou) * rgw: add max multipart upload parts (#12146 Abshishek Dixit) * rgw: add missing headers to Swift container details (#10666 Ahmad Faheem, Dmytro Iurchenko) * rgw: add stats to headers for account GET (#10684 Yuan Zhou) * rgw: add Trasnaction-Id to response (Abhishek Dixit) * rgw: add X-Timestamp for Swift containers (#10938 Radoslaw Zarzynski) * rgw: always check if token is expired (#11367 Anton Aksola, Riku Lehto) * rgw: conversion tool to repair broken multipart objects (#12079 Yehuda Sadeh) * rgw: document layout of pools and objects (Pete Zaitcev) * rgw: do not enclose bucket header in quotes (#11860 Wido den Hollander) * rgw: do not prefetch data for HEAD requests (Guang Yang) * rgw: do not preserve ACLs when copying object (#12370 Yehuda Sadeh) * rgw: do not set content-type if length is 0 (#11091 Orit Wasserman) * rgw: don't clobber bucket/object owner when setting ACLs (#10978 Yehuda Sadeh) * rgw: don't use end_marker for namespaced object listing (#11437 Yehuda Sadeh) * rgw: don't use rgw_socket_path if frontend is configured (#11160 Yehuda Sadeh) * rgw: enforce Content-Length for POST on Swift cont/obj (#10661 Radoslaw Zarzynski) * rgw: error out if frontend did not send all data (#11851 Yehuda Sadeh) * rgw: expose the number of unhealthy workers through admin socket (Guang Yang) * rgw: fail if parts not specified on multipart upload (#11435 Yehuda Sadeh) * rgw: fix assignment of copy obj attributes (#11563 Yehuda Sadeh) * rgw: fix broken stats in container listing (#11285 Radoslaw Zarzynski) * rgw: fix bug in domain/subdomain splitting (Robin H. Johnson) * rgw: fix casing of Content-Type header (Robin H. Johnson) * rgw: fix civetweb max threads (#10243 Yehuda Sadeh) * rgw: fix Connection: header handling (#12298 Wido den Hollander) * rgw: fix copy metadata, support X-Copied-From for swift (#10663 Radoslaw Zarzynski) * rgw: fix data corruptions race condition (#11749 Wuxingyi) * rgw: fix decoding of X-Object-Manifest from GET on Swift DLO (Radslow Rzarzynski) * rgw: fix GET on swift account when limit == 0 (#10683 Radoslaw Zarzynski) * rgw: fix handling empty metadata items on Swift container (#11088 Radoslaw Zarzynski) * rgw: fix JSON response when getting user quota (#12117 Wuxingyi) * rgw: fix locator for objects starting with _ (#11442 Yehuda Sadeh) * rgw: fix log rotation (Wuxingyi) * rgw: fix mulitipart upload in retry path (#11604 Yehuda Sadeh) * rgw: fix quota enforcement on POST (#11323 Sergey Arkhipov) * rgw: fix reset_loc (#11974 Yehuda Sadeh) * rgw: fix return code on missing upload (#11436 Yehuda Sadeh) * rgw: fix sysvinit script * rgw: fix sysvinit script w/ multiple instances (Sage Weil, Pavan Rallabhandi) * rgw: force content_type for swift bucket stats requests (#12095 Orit Wasserman) * rgw: force content type header on responses with no body (#11438 Orit Wasserman) * rgw: generate Date header for civetweb (#10873 Radoslaw Zarzynski) * rgw: generate new object tag when setting attrs (#11256 Yehuda Sadeh) * rgw: improve content-length env var handling (#11419 Robin H. Johnson) * rgw: improved support for swift account metadata (Radoslaw Zarzynski) * rgw: improve handling of already removed buckets in expirer (Radoslaw Rzarzynski) * rgw: issue aio for first chunk before flush cached data (#11322 Guang Yang) * rgw: log to /var/log/ceph instead of /var/log/radosgw * rgw: make init script wait for radosgw to stop (#11140 Dmitry Yatsushkevich) * rgw: make max put size configurable (#6999 Yuan Zhou) * rgw: make quota/gc threads configurable (#11047 Guang Yang) * rgw: make read user buckets backward compat (#10683 Radoslaw Zarzynski) * rgw: merge manifests properly with prefix override (#11622 Yehuda Sadeh) * rgw: only scan for objects not in a namespace (#11984 Yehuda Sadeh) * rgw: orphan detection tool (Yehuda Sadeh) * rgw: pass in civetweb configurables (#10907 Yehuda Sadeh) * rgw: rectify 202 Accepted in PUT response (#11148 Radoslaw Zarzynski) * rgw: remove meta file after deleting bucket (#11149 Orit Wasserman) * rgw: remove trailing :port from HTTP_HOST header (Sage Weil) * rgw: return 412 on bad limit when listing buckets (#11613 Yehuda Sadeh) * rgw: rework X-Trans-Id header to conform with Swift API (Radoslaw Rzarzynski) * rgw: s3 encoding-type for get bucket (Jeff Weber) * rgw: send ETag, Last-Modified for swift (#11087 Radoslaw Zarzynski) * rgw: set content length on container GET, PUT, DELETE, HEAD (#10971, #11036 Radoslaw Zarzynski) * rgw: set max buckets per user in ceph.conf (Vikhyat Umrao) * rgw: shard work over multiple librados instances (Pavan Rallabhandi) * rgw: support end marker on swift container GET (#10682 Radoslaw Zarzynski) * rgw: support for Swift expiration API (Radoslaw Rzarzynski, Yehuda Sadeh) * rgw: swift: allow setting attributes with COPY (#10662 Ahmad Faheem, Dmytro Iurchenko) * rgw: swift: do not override sent content type (#12363 Orit Wasserman) * rgw: swift: enforce Content-Type in response (#12157 Radoslaw Zarzynski) * rgw: swift: fix account listing (#11501 Radoslaw Zarzynski) * rgw: swift: fix metadata handling on copy (#10645 Radoslaw Zarzynski) * rgw: swift: send Last-Modified header (#10650 Radoslaw Zarzynski) * rgw: swift: set Content-Length for account GET (#12158 Radoslav Zarzynski) * rgw: swift: set content-length on keystone tokens (#11473 Herv Rousseau) * rgw: update keystone cache with token info (#11125 Yehuda Sadeh) * rgw: update to latest civetweb, enable config for IPv6 (#10965 Yehuda Sadeh) * rgw: use attrs from source bucket on copy (#11639 Javier M. Mellid) * rgw: use correct oid for gc chains (#11447 Yehuda Sadeh) * rgw: user rm is idempotent (Orit Wasserman) * rgw: use unique request id for civetweb (#10295 Orit Wasserman) * rocksdb: add perf counters for get/put latency (Xinxin Shu) * rocksdb, leveldb: fix compact_on_mount (Xiaoxi Chen) * rocksdb: pass options as single string (Xiaoxi Chen) * rocksdb: update to latest (Xiaoxi Chen) * rpm: add suse firewall files (Tim Serong) * rpm: always rebuild and install man pages for rpm (Owen Synge) * rpm: loosen ceph-test dependencies (Ken Dreyer) * rpm: many spec file fixes (Owen Synge, Ken Dreyer) * rpm: misc fixes (Boris Ranto, Owen Synge, Ken Dreyer, Ira Cooper) * rpm: misc systemd and SUSE fixes (Owen Synge, Nathan Cutler) * selinux policy (Boris Ranto, Milan Broz) * systemd: logrotate fixes (Tim Serong, Lars Marowsky-Bree, Nathan Cutler) * systemd: many fixes (Sage Weil, Owen Synge, Boris Ranto, Dan van der Ster) * systemd: run daemons as user ceph * sysvinit compat: misc fixes (Owen Synge) * test: misc fs test improvements (John Spray, Loic Dachary) * test: python tests, linter cleanup (Alfredo Deza) * tests: fixes for rbd xstests (Douglas Fuller) * tests: fix tiering health checks (Loic Dachary) * tests for low-level performance (Haomai Wang) * tests: many ec non-regression improvements (Loic Dachary) * tests: many many ec test improvements (Loic Dachary) * upstart: throttle restarts (#11798 Sage Weil, Greg Farnum) v10.1.2 Jewel (release candidate) ================================= This is the third (and likely final) release candidate for Jewel. Notable Changes since v10.1.1 ----------------------------- * rgw: Jewel nfs fixes 3 (`pr#8460 `_, Matt Benjamin) * osd/PG: set epoch_created and parent_split_bits for child pg (`issue#15426 `_, `pr#8552 `_, Kefu Chai) * rgw: signature mismatch with escaped characters in url query portion (`issue#15358 `_, `pr#8445 `_, Javier M. Mellid) * authtool: update --help and manpage to match code. (`pr#8456 `_, Robin H. Johnson) * build: Respect TMPDIR for virtualenv. (`pr#8457 `_, Robin H. Johnson) * ceph-disk: do not always fail when re-using a partition (`pr#8508 `_, You Ji) * ceph-fuse: rotate log file (`pr#8485 `_, Sage Weil) * ceph-rest-api: fix fs/flag/set (`pr#8428 `_, Sage Weil) * check-generated.sh: can't source bash from sh (`pr#8521 `_, Michal Jarzabek) * common: buffer: put a guard for stat() syscall during read_file (`pr#7956 `_, xie xingguo) * common: fix time_t cast in decode (`issue#15330 `_, `pr#8419 `_, Adam C. Emerson) * crush: fix typo (`pr#8518 `_, Wei Jin) * doc: rgw admin uses "region list" not "regions list" (`pr#8517 `_, Kris Jurka) * journal: fix final result for JournalTrimmer::C_RemoveSet (`pr#8516 `_, runsisi) * journal: fix race condition between Future and journal shutdown (`issue#15364 `_, `pr#8477 `_, Jason Dillaman) * librados: Revert "rados: Add new field flags for ceph_osd_op.copy_get." (`pr#8486 `_, Sage Weil) * librbd: disallow unsafe rbd_op_threads values (`issue#15034 `_, `pr#8459 `_, Josh Durgin) * mailmap update (`pr#8522 `_, M Ranga Swami Reddy) * mds: Add cmapv to ESessions default constructor initializer list (`pr#8403 `_, John Coyle) * mds: fix file_layout_t legacy encoding snafu (`pr#8455 `_, Sage Weil) * mds: Protect a number of unstable/experimental features behind durable flags (`pr#8383 `_, Greg Farnum) * mds: Extend the existing pool access checking to include specific RADOS namespacse. (`pr#8444 `_, Yan, Zheng) * os/ObjectStore: make device uuid probe output something friendly (`pr#8418 `_, Sage Weil) * os/bluestore: revamp BlueFS bdev management and add perfcounters (`issue#15376 `_, `pr#8431 `_, Sage Weil) * os/filestore: force lfn attrs to be written atomically, restructure name length limits (`pr#8496 `_, Samuel Just) * osd/ReplicatedPG::_rollback_to: update the OMAP flag (`issue#14777 `_, `pr#8495 `_, Samuel Just) * osd: clean up temp object if copy-from fails (`pr#8487 `_, Sage Weil) * osd: fix two scrub relevant issues (`pr#8462 `_, xie xingguo) * osd: fix/clean up full map request handling (`pr#8446 `_, Sage Weil) * osdc/Objecter: fix narrow race with tid assignment (`issue#14364 `_, `pr#7981 `_, Sage Weil) * radosgw-admin: allow (`pr#8529 `_, Orit Wasserman) * rbd-mirror: workaround for intermingled lockdep singletons (`pr#8476 `_, Jason Dillaman) * rbd: journal reset should disable/re-enable journaling feature (`issue#15097 `_, `pr#8490 `_, Jason Dillaman) * rgw-admin: remove unused iterator and fix error message (`pr#8507 `_, Karol Mroz) * rgw: aws4 subdomain calling bugfix (`issue#15369 `_, `pr#8472 `_, Javier M. Mellid) * rgw: fix a typo in error message (`pr#8434 `_, Abhishek Lekshmanan) * rgw: fix problem deleting objects begining with double underscores (`issue#15318 `_, `pr#8488 `_, Orit Wasserman) * rgw: retry RGWRemoteMetaLog::read_log_info() while master is down (`pr#8453 `_, Casey Bodley) * rgw: the map 'headers' is assigned a wrong value (`pr#8481 `_, weiqiaomiao) * rgw_ldap: make ldap.h inclusion conditional (`pr#8500 `_, Matt Benjamin) * systemd: drop any systemd imposed process/thread limits (`pr#8450 `_, James Page) * test: fix ut test failure caused by lfn change (`issue#15464 `_, `pr#8544 `_, xie xingguo) * test: fix valgrind memcheck issues for rbd-mirror test cases (`issue#15354 `_, `pr#8493 `_, Jason Dillaman) * tests: add Ubuntu 16.04 xenial dockerfile (`pr#8519 `_, Loic Dachary) v10.1.1 Jewel (release candidate) ================================= This is the first release build that includes arm64/aarch64 packages for Ubuntu Xenial 16.04. Know issues with this release candidate --------------------------------------- There are a few known issues to watch out for: * Old CephFS clusters will mangle the layouts with this release; the fix was committed just after it was cut. Wait for the next RC or the release if you're upgrading a cluster that has a CephFS data pool as pool 0. * The upstart ceph-mds-all.conf file is missing. Notable Changes since v10.1.0 ----------------------------- * Adding documentation on how to use new dynamic throttle scheme (`pr#8069 `_, Somnath Roy) * Be more careful about directory fragmentation and scrubbing (`issue#15167 `_, `pr#8180 `_, Yan, Zheng) * CMake: For CMake version <= 2.8.11, use LINK_PRIVATE (`pr#8422 `_, Haomai Wang) * Makefile-env.am: set a default for CEPH_BUILD_VIRTUALENV (part 2) (`pr#8320 `_, Loic Dachary) * Minor fixes around data scan in some scenarios (`pr#8115 `_, Yan, Zheng) * PG: pg down state blocked by osd.x, lost osd.x cannot solve peering stuck (`issue#13531 `_, `pr#6317 `_, Xiaowei Chen) * osd/PG: indicate in pg query output whether ignore_history_les would help (`pr#8156 `_, Sage Weil) * Striper: reduce assemble_result log level (`pr#8426 `_, Jason Dillaman) * Test exit values on test.sh, fix tier.cc (`issue#15165 `_, `pr#8266 `_, Samuel Just) * ceph.in: correct dev python path for automake builds (`pr#8360 `_, Josh Durgin) * osd/OSD: fix build_past_intervals_parallel (`pr#8215 `_, David Zafman) * rgw: sync fixes 4 (`pr#8190 `_, Yehuda Sadeh) * [rgw] Check return code in RGWFileHandle::write (`pr#7875 `_, Brad Hubbard) * build: fix compiling warnings (`pr#8366 `_, Dongsheng Yang) * ceph-detect-init/run-tox.sh: FreeBSD: No init detect (`pr#8373 `_, Willem Jan Withagen) * ceph.in: fix python libpath for automake as well (`pr#8362 `_, Josh Durgin) * ceph: bash auto complete for CLI based on mon command descriptions (`pr#7693 `_, Adam Kupczyk) * cls_journal: fix -EEXIST checking (`pr#8413 `_, runsisi) * cls_rbd: fix -EEXIST checking in cls::rbd::image_set (`pr#8371 `_, runsisi) * cls_rbd: mirror_image_list should return global image id (`pr#8297 `_, Jason Dillaman) * cls_rbd: pass WILLNEED fadvise flags during object map update (`issue#15332 `_, `pr#8380 `_, Jason Dillaman) * cls_rbd: read_peers: update last_read on next cls_cxx_map_get_vals (`pr#8374 `_, Mykola Golub) * cmake: Build cython modules and change paths to bin/, lib/ (`pr#8351 `_, John Spray, Ali Maredia) * cmake: add FindOpenSSL.cmake (`pr#8106 `_, Marcus Watts, Matt Benjamin) * cmake: add StandardPolicy.cc to librbd (`pr#8368 `_, Kefu Chai) * cmake: add missing librbd/MirrorWatcher.cc and librd/ObjectWatcher.cc (`pr#8399 `_, Orit Wasserman) * cmake: fix mrun to handle cmake build structure (`pr#8237 `_, Orit Wasserman) * cmake: fix the build of test_rados_api_list (`pr#8438 `_, Kefu Chai) * common: fix race during optracker switches between enabled/disabled mode (`pr#8330 `_, xie xingguo) * config_opts: disable filestore throttle soft backoff by default (`pr#8265 `_, Samuel Just) * configure: Add -D_LARGEFILE64_SOURCE to Linux build. (`pr#8402 `_, Ira Cooper) * crush: fix error log (`pr#8430 `_, Wei Jin) * crushtool: Don't crash when called on a file that isn't a crushmap (`issue#8286 `_, `pr#8038 `_, Brad Hubbard) * debian/rules: put init-ceph in /etc/init.d/ceph, not ceph-base (`issue#15329 `_, `pr#8406 `_, Dan Mick) * doc/dev: add "Deploy a cluster for manual testing" section (`issue#15218 `_, `pr#8228 `_, Nathan Cutler) * doc/rados/operations/crush: fix the formatting (`pr#8306 `_, Kefu Chai) * doc/release-notes: fix indents (`pr#8345 `_, Kefu Chai) * doc: Fixes headline different font size and type (`pr#8328 `_, scienceluo) * doc: Remove Ceph Monitors do lots of fsync() (`issue#15288 `_, `pr#8327 `_, Vikhyat Umrao) * doc: Updated CloudStack RBD documentation (`pr#8308 `_, Wido den Hollander) * doc: amend Fixes instructions in SubmittingPatches (`pr#8312 `_, Nathan Cutler) * doc: draft notes for jewel (`pr#8211 `_, Loic Dachary, Sage Weil) * doc: fix typo, duplicated content etc. for Jewel release notes (`pr#8342 `_, xie xingguo) * doc: fix wrong type of hyphen (`pr#8252 `_, xie xingguo) * doc: rgw_region_root_pool option should be in [global] (`issue#15244 `_, `pr#8271 `_, Vikhyat Umrao) * doc: very basic doc on mstart (`pr#8207 `_, Abhishek Lekshmanan) * global/global_init: expand metavariables in setuser_match_path (`issue#15365 `_, `pr#8433 `_, Sage Weil) * global/signal_handler: print thread name in signal handler (`pr#8177 `_, Jianpeng Ma) * libcephfs: fix python tests and fix getcwd on missing dir (`pr#7901 `_, John Spray) * librbd: avoid throwing error if mirroring is unsupported (`pr#8417 `_, Jason Dillaman) * librbd: disable image mirroring when image is removed (`issue#15265 `_, `pr#8375 `_, Ricardo Dias) * librbd: send notifications for mirroring status updates (`pr#8355 `_, Jason Dillaman) * mailmap updates (`pr#8256 `_, Loic Dachary) * makefile: fix rbdmap manpage (`pr#8310 `_, Kefu Chai) * mds: allow client to request caps when opening file (`issue#14360 `_, `pr#7952 `_, Yan, Zheng) * messages/MOSDOp: clear reqid inc for v6 encoding (`issue#15230 `_, `pr#8299 `_, Sage Weil) * mon/MonClient: fix shutdown race (`issue#13992 `_, `pr#8335 `_, Sage Weil) * mon: do not send useless pg_create messages for split pgs (`pr#8247 `_, Sage Weil) * mon: mark_down_pgs in lockstep with pg_map's osdmap epoch (`pr#8208 `_, Sage Weil) * mon: remove remove_legacy_versions() (`pr#8324 `_, Kefu Chai) * mon: remove unnecessary comment for update_from_paxos (`pr#8400 `_, Qinghua Jin) * mon: remove unused variable (`issue#15292 `_, `pr#8337 `_, Javier M. Mellid) * mon: show the pool quota info on ceph df detail command (`issue#14216 `_, `pr#7094 `_, song baisen) * monclient: avoid key renew storm on clock skew (`issue#12065 `_, `pr#8258 `_, Alexey Sheplyakov) * mrun: update path to cmake binaries (`pr#8447 `_, Casey Bodley) * msg/async: avoid log spam on throttle (`issue#15031 `_, `pr#8263 `_, Kefu Chai) * msg/async: remove experiment feature (`pr#7820 `_, Haomai Wang) * os/ObjectStore: add noexcept to ensure move ctor is used (`pr#8421 `_, Kefu Chai) * os/ObjectStore: fix _update_op for split dest_cid (`pr#8364 `_, Sage Weil) * os/ObjectStore: try_move_rename in transaction append and add coverage to store_test (`issue#15205 `_, `pr#8359 `_, Samuel Just) * os/bluestore: a few fixes (`pr#8193 `_, Sage Weil) * os/bluestore: ceph-bluefs-tool fixes (`issue#15261 `_, `pr#8292 `_, Venky Shankar) * osd/ClassHandler: only dlclose() the classes not missing (`pr#8354 `_, Kefu Chai) * osd/OSD.cc: finish full_map_request every MOSDMap message. (`issue#15130 `_, `pr#8147 `_, Xiaoxi Chen) * osd: add 'proxy' cache mode (`issue#12814 `_, `pr#8210 `_, Sage Weil) * osd: add the support of per pool scrub priority (`pr#7062 `_, Zhiqiang Wang) * osd: bail out of _committed_osd_maps if we are shutting down (`pr#8267 `_, Samuel Just) * osd: duplicated clear for peer_missing (`pr#8315 `_, Ning Yao) * osd: fix bugs for omap ops (`pr#8230 `_, Jianpeng Ma) * osd: fix dirtying info without correctly setting drity_info field (`pr#8275 `_, xie xingguo) * osd: fix dump_ops_in_flight races (`issue#8885 `_, `pr#8044 `_, David Zafman) * osd: fix epoch check in handle_pg_create (`pr#8382 `_, Samuel Just) * osd: fix failure report handling during ms_handle_connect() (`pr#8348 `_, xie xingguo) * osd: fix log info (`pr#8273 `_, Wei Jin) * osd: fix reference count, rare race condition etc. (`pr#8254 `_, xie xingguo) * osd: fix tick relevant issues (`pr#8369 `_, xie xingguo) * osd: more fixes for incorrectly dirtying info; resend reply for duplicated scrub-reserve req (`pr#8291 `_, xie xingguo) * osdc/Objecter: dout log after assign tid (`pr#8202 `_, Xinze Chi) * osdc/Objecter: use full pgid hash in PGNLS ops (`pr#8378 `_, Sage Weil) * osdmap: rm nonused variable (`pr#8423 `_, Wei Jin) * pybind/Makefile.am: Prevent race creating CYTHON_BUILD_DIR (`issue#15276 `_, `pr#8356 `_, Dan Mick) * pybind/rados: python3 fix (`pr#8331 `_, Mehdi Abaakouk) * pybind: add flock to libcephfs python bindings (`pr#7902 `_, John Spray) * qa: update rest test cephfs calls (`issue#15309 `_, `pr#8372 `_, John Spray) * qa: update rest test cephfs calls (part 2) (`issue#15309 `_, `pr#8393 `_, John Spray) * radosgw-admin: 'period commit' supplies user-readable error messages (`pr#8264 `_, Casey Bodley) * radosgw-admin: fix for 'realm pull' (`pr#8404 `_, Casey Bodley) * rbd-mirror: asok commands to get status and flush on Mirror and Replayer level (`pr#8235 `_, Mykola Golub) * rbd-mirror: enabling/disabling pool mirroring should update the mirroring directory (`issue#15217 `_, `pr#8261 `_, Ricardo Dias) * rbd-mirror: fix missing increment for iterators (`pr#8352 `_, runsisi) * rbd-mirror: initial failover / failback support (`pr#8287 `_, Jason Dillaman) * rbd-mirror: prevent enabling/disabling an image's mirroring when not in image mode (`issue#15267 `_, `pr#8332 `_, Ricardo Dias) * rbd-mirror: switch fsid over to mirror uuid (`issue#15238 `_, `pr#8280 `_, Ricardo Dias) * rbd: allow librados to prune the command-line for config overrides (`issue#15250 `_, `pr#8282 `_, Jason Dillaman) * rbdmap: add manpage (`issue#15212 `_, `pr#8224 `_, Nathan Cutler) * releases: what is merged where and when ? (`pr#8358 `_, Loic Dachary) * rgw/rgw_admin:fix bug about list and stats command (`pr#8200 `_, Qiankun Zheng) * rgw: Do not send a Content-Type on a '304 Not Modified' response (`issue#15119 `_, `pr#8253 `_, Wido den Hollander) * rgw: Multipart ListPartsResult ETag quotes (`issue#15334 `_, `pr#8387 `_, Robin H. Johnson) * rgw: S3: set EncodingType in ListBucketResult (`pr#7712 `_, Victor Makarov) * rgw: accept data only at the first time in response to a request (`pr#8084 `_, sunspot) * rgw: add a few more help options in admin interface (`pr#8410 `_, Abhishek Lekshmanan) * rgw: add zone delete to rgw-admin help (`pr#8184 `_, Abhishek Lekshmanan) * rgw: convert plain object to versioned (with null version) when removing (`issue#15243 `_, `pr#8268 `_, Yehuda Sadeh) * rgw: fix compiling error (`pr#8394 `_, xie xingguo) * rgw: fix lockdep false positive (`pr#8284 `_, Yehuda Sadeh) * rgw:Use count fn in RGWUserBuckets for quota check (`pr#8294 `_, Abhishek Lekshmanan) * rgw_admin: remove unused parent_period arg (`pr#8411 `_, Abhishek Lekshmanan) * rgw_file: set owner uid, gid, and Unix mode on new objects (`pr#8321 `_, Matt Benjamin) * rpm: prefer UID/GID 167 when creating ceph user/group (`issue#15246 `_, `pr#8277 `_, Nathan Cutler) * script: subscription-manager support (`issue#14972 `_, `pr#7907 `_, Loic Dachary) * set 128MB tcmalloc cache size by bytes (`pr#8427 `_, Star Guo) * systemd: set up environment in rbdmap unit file (`issue#14984 `_, `pr#8222 `_, Nathan Cutler) * test/system/\*: use dynamically generated pool name (`issue#15240 `_, `pr#8318 `_, Kefu Chai) * test/system/rados_list_parallel: print oid if rados_write fails (`issue#15240 `_, `pr#8309 `_, Kefu Chai) * test/test-erasure-code.sh: disable pg_temp priming (`issue#15211 `_, `pr#8260 `_, Sage Weil) * test/test_pool_create.sh: fix port (`pr#8361 `_, Sage Weil) * test: Fix test to run with btrfs which has snap_### dirs (`issue#15347 `_, `pr#8420 `_, David Zafman) * test: TestMirroringWatcher test cases were not closing images (`pr#8435 `_, Jason Dillaman) * test: rbd-mirror: script improvements for manual testing (`pr#8325 `_, Mykola Golub) * tests: Fixing broken test/cephtool-test-mon.sh test (`pr#8429 `_, Erwan Velu) * tests: Improving 'make check' execution time (`pr#8131 `_, Erwan Velu) * unittest_erasure_code_plugin: fix deadlock (Alpine) (`pr#8314 `_, John Coyle) * vstart: fix up cmake paths when VSTART_DEST is given (`pr#8363 `_, Casey Bodley) * vstart: make -k with optional mon_num. (`pr#8251 `_, Jianpeng Ma) * xio: add prefix to xio msgr logs (`pr#8148 `_, Roi Dayan) * xio: fix compilation against latest accelio (`pr#8022 `_, Roi Dayan) * xio: xio_init needs to be called before any other xio function (`pr#8227 `_, Roi Dayan) * ceph.spec.in: disable lttng and babeltrace explicitly (`issue#14844 `_, `pr#7857 `_, Kefu Chai) v10.1.0 Jewel (release candidate) ================================= There are a few known issues with this release candidate; see below. Known Issues with v10.1.0 ------------------------- * While running a mixed version cluster of jewel and infernalis or hammer monitors, any MDSMap updates will cause the pre-jewel monitors to crash. Workaround is to simply upgrde all monitors. There is a fix but it is still being tested. * Some of the rbd-mirror functionality for switching between active and replica images is not yet merged. Notable Changes since v10.0.4 ----------------------------- * ceph.spec.in: do not install Ceph RA on systemd platforms (`issue#14828 `_, `pr#7894 `_, Nathan Cutler) * mdsa: A few more snapshot fixes, mostly around snapshotted inode/dentry tracking (`pr#7798 `_, Yan, Zheng) * AUTHORS: update email (`pr#7854 `_, Yehuda Sadeh) * ceph-disk: Add --setuser and --setgroup options for ceph-disk (`pr#7351 `_, Mike Shuey) * build: Adding build requires (`pr#7742 `_, Erwan Velu) * msg/async: AsyncMessenger: fix several bugs (`pr#7831 `_, Haomai Wang) * msg/async: AsyncMessenger: fix valgrind leak (`pr#7725 `_, Haomai Wang) * doc: Clarify usage on starting single osd/mds/mon. (`pr#7641 `_, Patrick Donnelly) * common: Deprecate or free up a bunch of feature bits (`pr#8214 `_, Samuel Just) * msg/async: Event: fix clock skew problem (`pr#7949 `_, Wei Jin) * osd: FileStore: Added O_DSYNC write scheme (`pr#7752 `_, Somnath Roy) * osd: FileStore: fix initialization order for m_disable_wbthrottle (`pr#8067 `_, Samuel Just) * build: Fixing BTRFS issue at 'make check' (`pr#7805 `_, Erwan Velu) * build: FreeBSD related fixes (`pr#7170 `_, Mykola Golub) * ceph-disk: Improving 'make check' for ceph-disk (`pr#7762 `_, Erwan Velu) * rgw: increase verbosity level on RGWObjManifest line (`pr#7285 `_, magicrobotmonkey) * build: workaround an automake bug for "make check" (`issue#14723 `_, `pr#7626 `_, Kefu Chai) * ceph-fuse,libcephfs: Fix client handling of "lost" open directories on shutdown (`issue#14996 `_, `pr#7994 `_, Yan, Zheng) * mds: Multi-filesystem support (`issue#14952 `_, `pr#6953 `_, John Spray, Sage Weil) * os/bluestore/NVMEDevice: refactor probe/attach codes and support zero command (`pr#7647 `_, Haomai Wang) * librados: detect laggy ops with objecter_timeout, not osd_timeout (`pr#7629 `_, Greg Farnum) * ceph.spec.in: fix openldap and openssl build dependencies for SUSE (`issue#15138 `_, `pr#8120 `_, Nathan Cutler) * osd: repop and lost-unfound overhaul (`pr#7765 `_, Samuel Just) * librbd: Revert "librbd: use task finisher per CephContext" (`issue#14780 `_, `pr#7667 `_, Josh Durgin) * rgw: Fix subuser harder with tenants (`pr#7618 `_, Pete Zaitcev) * rgw: ldap fixes (`pr#8168 `_, Matt Benjamin) * rgw: check the return value when call fe->run() (`issue#14585 `_, `pr#7457 `_, wei qiaomiao) * rgw: Revert "rgw ldap" (`pr#8075 `_, Yehuda Sadeh) * librados: Striper: Fix incorrect push_front -> append_zero change (`pr#7578 `_, Haomai Wang) * rgw: build-related fixes (`pr#8076 `_, Yehuda Sadeh, Matt Benjamin) * mirrors: Updated scripts and documentation for mirrors (`pr#7847 `_, Wido den Hollander) * misc: use make_shared while creating shared_ptr (`pr#7769 `_, Somnath Roy) * cmake (`pr#7849 `_, Ali Maredia) * mds: filelock deadlock (`pr#7713 `_, Yan, Zheng) * mds: fix fsmap decode (`pr#8063 `_, Greg Farnum) * rgw: fix mdlog (`pr#8183 `_, Orit Wasserman) * rgw: highres time stamps (`pr#8108 `_, Yehuda Sadeh, Adam C. Emerson, Matt Benjamin) * rgw: swift versioning disabled (`pr#8066 `_, Yehuda Sadeh, Radoslaw Zarzynski) * rgw: sync fixes 3 (`pr#8170 `_, Yehuda Sadeh) * msg/xio: fixes (`pr#7603 `_, Roi Dayan) * ceph-fuse,libcephfs: fix free fds being exhausted eventually because freed fds are never put back (`issue#14798 `_, `pr#7685 `_, Zhi Zhang) * rgw: RGWLib::env is not used so remove it (`pr#7874 `_, Brad Hubbard) * build: a few armhf (32-bit build) fixes (`pr#7999 `_, Eric Lee, Sage Weil) * osd: add scrub persist/query API (`issue#13505 `_, `pr#6898 `_, Kefu Chai, Samuel Just) * rgw: adds the radosgw-admin sync status command that gives a human readable status of the sync process at a specific zone (`pr#8030 `_, Yehuda Sadeh) * scripts: adjust mstart and mstop script to run with cmake build (`pr#6920 `_, Orit Wasserman) * buffer: add symmetry operator==() and operator!=() (`pr#7974 `_, Kefu Chai) * buffer: hide iterator_impl symbols (`issue#14788 `_, `pr#7688 `_, Kefu Chai) * buffer: increment history alloc as well in raw_combined (`issue#14955 `_, `pr#7910 `_, Samuel Just) * buffer: raw_combined allocations buffer and ref count together (`pr#7612 `_, Sage Weil) * ceph-detect-init: add debian/jessie test (`pr#8074 `_, Kefu Chai) * ceph-detect-init: add missing test case (`pr#8105 `_, Nathan Cutler) * ceph-detect-init: fix py3 test (`pr#7243 `_, Kefu Chai) * ceph-detect-init: return correct value on recent SUSE distros (`issue#14770 `_, `pr#7909 `_, Nathan Cutler) * ceph-disk: deactivate / destroy PATH arg are optional (`pr#7756 `_, Loic Dachary) * ceph-disk: fix prepare --help (`pr#7758 `_, Loic Dachary) * ceph-disk: flake8 fixes (`pr#7646 `_, Loic Dachary) * ceph-disk: key management support (`issue#14669 `_, `pr#7552 `_, Loic Dachary) * ceph-disk: make some arguments as required if necessary (`pr#7687 `_, Dongsheng Yang) * ceph-disk: s/dmcrpyt/dmcrypt/ (`issue#14838 `_, `pr#7744 `_, Loic Dachary, Frode Sandholtbraaten) * ceph-fuse: Fix potential filehandle ref leak at umount (`issue#14800 `_, `pr#7686 `_, Zhi Zhang) * ceph.in: Minor python3 specific changes (`pr#7947 `_, Sarthak Munshi) * ceph_daemon.py: Resolved ImportError to work with python3 (`pr#7937 `_, Sarthak Munshi) * ceph_detect_init/__init__.py: remove shebang (`pr#7731 `_, Nathan Cutler) * ceph_test_msgr: reduce test size to fix memory size (`pr#8127 `_, Haomai Wang) * ceph_test_rados_misc: shorten mount timeout (`pr#8209 `_, Sage Weil) * cleanup (`pr#8058 `_, Yehuda Sadeh, Orit Wasserman) * client: flush kernel pagecache before creating snapshot (`issue#10436 `_, `pr#7495 `_, Yan, Zheng) * client: removed unused Mutex from MetaRequest (`pr#7655 `_, Greg Farnum) * cls/rgw: fix FTBFS (`pr#8142 `_, Kefu Chai) * cls/rgw: fix use of timespan (`issue#15181 `_, `pr#8212 `_, Yehuda Sadeh) * cls_hello: Fix grammatical error in description comment (`pr#7951 `_, Brad Hubbard) * cls_rbd: fix the test for ceph-dencoder (`pr#7793 `_, Kefu Chai) * cls_rbd: mirroring directory (`issue#14419 `_, `pr#7620 `_, Josh Durgin) * cls_rbd: protect against excessively large object maps (`issue#15121 `_, `pr#8099 `_, Jason Dillaman) * cmake: Remove duplicate find_package libcurl line. (`pr#7972 `_, Brad Hubbard) * cmake: add ErasureCode.cc to jerasure plugins (`pr#7808 `_, Casey Bodley) * cmake: add common/fs_types.cc to libcommon (`pr#7898 `_, Orit Wasserman) * cmake: add missing librbd image_watcher sources (`issue#14823 `_, `pr#7717 `_, Casey Bodley) * cmake: avoid false-positive LDAP header detect (`pr#8100 `_, Matt Benjamin) * cmake: fix paths to various EC source files (`pr#7748 `_, Ali Maredia, Matt Benjamin) * cmake: fix the build of tests (`pr#7523 `_, Kefu Chai) * common/TrackedOp: fix inaccurate counting for slow requests (`issue#14804 `_, `pr#7690 `_, xie xingguo) * common/bit_vector: use hard-coded value for block size (`issue#14747 `_, `pr#7610 `_, Jason Dillaman) * common/obj_bencher.cc: bump the precision of bandwidth field (`pr#8021 `_, Piotr Dałek) * common/obj_bencher.cc: faster object name generation (`pr#7863 `_, Piotr Dałek) * common/obj_bencher.cc: make verify error fatal (`issue#14971 `_, `pr#7897 `_, Piotr Dałek) * common/page.cc: _page_mask has too many bits (`pr#7588 `_, Dan Mick) * common/strtol.cc: fix the coverity warnings (`pr#7967 `_, Kefu Chai) * common: Do not use non-portable constants in mutex_debug (`pr#7766 `_, Adam C. Emerson) * common: SubProcess: Avoid buffer corruption when calling err() (`issue#15011 `_, `pr#8054 `_, Erwan Velu) * common: default cluster name to config file prefix (`pr#7364 `_, Javen Wu) * common: set thread name from correct thread (`pr#7845 `_, Igor Podoski) * common: various fixes from SCA runs (`pr#7680 `_, Danny Al-Gaaf) * config: fix osd_crush_initial_weight (`pr#7975 `_, You Ji) * config: increase default async op threads (`pr#7802 `_, Piotr Dałek) * configure.ac: boost_iostreams is required, not optional (`pr#7816 `_, Hector Martin) * configure.ac: update help strings for cython (`pr#7856 `_, Josh Durgin) * crush/CrushTester: workaround a bug in boost::icl (`pr#7560 `_, Kefu Chai) * crush: fix cli tests for new crush tunables (`pr#8107 `_, Sage Weil) * crush: update tunable docs. change default profile to jewel (`pr#7964 `_, Sage Weil) * debian/changelog: Remove stray 'v' in version (`pr#7936 `_, Dan Mick) * debian/changelog: Remove stray 'v' in version (`pr#7938 `_, Dan Mick) * debian: include cpio in build-requiers (`pr#7533 `_, Rémi BUISSON) * debian: package librgw_file* tests (`pr#7930 `_, Ken Dreyer) * doc/architecture.rst: remove redundant word "across" (`pr#8179 `_, Zhao Junwang) * doc/dev: add section on interrupting a running suite (`pr#8116 `_, Nathan Cutler) * doc/dev: continue writing Testing in the cloud chapter (`pr#7960 `_, Nathan Cutler) * doc/dev: integrate testing into the narrative (`pr#7946 `_, Nathan Cutler) * doc/dev: various refinements (`pr#7954 `_, Nathan Cutler) * doc/rados/api/librados-intro.rst: fix typo (`pr#7879 `_, xie xingguo) * doc: add ceph-detect-init(8) source to dist tarball (`pr#7933 `_, Ken Dreyer) * doc: add cinder backend section to rbd-openstack.rst (`pr#7923 `_, RustShen) * doc: detailed description of bugfixing workflow (`pr#7941 `_, Nathan Cutler) * doc: fix 0.94.4 and 0.94.5 ordering (`pr#7763 `_, Loic Dachary) * doc: fix typo, indention etc. (`pr#7829 `_, xie xingguo) * doc: initial draft of RBD mirroring admin documentation (`issue#15041 `_, `pr#8169 `_, Jason Dillaman) * doc: osd-config Add Configuration Options for op queue. (`pr#7837 `_, Robert LeBlanc) * doc: rgw explain keystone's verify ssl switch (`pr#7862 `_, Abhishek Lekshmanan) * doc: small fixes (`pr#7813 `_, xiexingguo) * doc: standardize @param (not @parma, @parmam, @params) (`pr#7714 `_, Nathan Cutler) * fix FTBFS introduced by d0af316 (`pr#7792 `_, Kefu Chai) * ghobject_t: use # instead of ! as a separator (`pr#8055 `_, Sage Weil) * include/encoding: do not try to be clever with list encoding (`pr#7913 `_, Sage Weil) * init-ceph.in: allow case-insensitive true in `osd crush update on start' (`pr#7943 `_, Eric Cook) * init-ceph.in: skip ceph-disk if it is not present (`issue#10587 `_, `pr#7286 `_, Ken Dreyer) * journal: async methods to (un)register and update client (`pr#7832 `_, Mykola Golub) * journal: improve commit position tracking (`pr#7776 `_, Jason Dillaman) * journal: prevent race injecting new records into overflowed object (`issue#15202 `_, `pr#8220 `_, Jason Dillaman) * librados: cancel aio notification linger op upon completion (`pr#8102 `_, Jason Dillaman) * librados: check connection state in rados_monitor_log (`issue#14499 `_, `pr#7350 `_, David Disseldorp) * librados: do not clear handle for aio_watch() (`pr#7771 `_, xie xingguo) * librados: fix test failure with new aio watch/unwatch API (`pr#7824 `_, Jason Dillaman) * librados: implement async watch/unwatch (`pr#7649 `_, Haomai Wang) * librados: mix lock cycle (un)registering asok commands (`pr#7581 `_, John Spray) * librados: race condition on aio_notify completion handling (`pr#7864 `_, Jason Dillaman) * librados: stat2 with higher time precision (`pr#7915 `_, Yehuda Sadeh, Matt Benjamin) * librbd: allocate new journal tag after acquiring exclusive lock (`pr#7884 `_, Jason Dillaman) * librbd: block read requests until journal replayed (`pr#7627 `_, Jason Dillaman) * librbd: correct issues discovered via valgrind memcheck (`pr#8132 `_, Jason Dillaman) * librbd: differentiate journal replay flush vs shut down (`pr#7698 `_, Jason Dillaman) * librbd: enable/disable image mirroring automatically for pool mode (`issue#15143 `_, `pr#8204 `_, Ricardo Dias) * librbd: fix state machine race conditions during shut down (`pr#7761 `_, Jason Dillaman) * librbd: handle unregistering the image watcher when disconnected (`pr#8094 `_, Jason Dillaman) * librbd: integrate journal replay with fsx testing (`pr#7583 `_, Jason Dillaman) * librbd: journal replay needs to support re-executing maintenance ops (`issue#14822 `_, `pr#7785 `_, Jason Dillaman) * librbd: reduce mem copies to user-buffer during read (`pr#7548 `_, Jianpeng Ma) * librbd: refresh image if required before replaying journal ops (`issue#14908 `_, `pr#7978 `_, Jason Dillaman) * librbd: remove last synchronous librados calls from open/close state machine (`pr#7839 `_, Jason Dillaman) * librbd: replaying a journal op post-refresh requires locking (`pr#8028 `_, Jason Dillaman) * librbd: retrieve image name when opening by id (`pr#7736 `_, Mykola Golub) * librbd: several race conditions discovered under single CPU environment (`pr#7653 `_, Jason Dillaman) * librbd: truncate does not need to mark the object as existing in the object map (`issue#14789 `_, `pr#7772 `_, xinxin shu) * librbd: update of mirror pool mode and mirror peer handling (`pr#7718 `_, Jason Dillaman) * librbd: use async librados notifications (`pr#7668 `_, Jason Dillaman) * log: do not repeat errors to stderr (`issue#14616 `_, `pr#7983 `_, Sage Weil) * log: fix stack overflow when flushing large log lines (`issue#14707 `_, `pr#7599 `_, Igor Fedotov) * log: segv in a portable way (`issue#14856 `_, `pr#7790 `_, Kefu Chai) * log: use delete[] (`pr#7904 `_, Sage Weil) * mailmap for 10.0.4 (`pr#7932 `_, Abhishek Lekshmanan) * mailmap updates (`pr#7528 `_, Yann Dupont) * man/8/ceph-disk: fix formatting issue (`pr#8012 `_, Sage Weil) * man/8/ceph-disk: fix formatting issue (`pr#8003 `_, Sage Weil) * mds, client: add namespace to file_layout_t (previously ceph_file_layout) (`pr#7098 `_, Yan, Zheng, Sage Weil) * mds: don't double-shutdown the timer when suiciding (`issue#14697 `_, `pr#7616 `_, Greg Farnum) * mds: fix FSMap upgrade with daemons in the map (`pr#8073 `_, John Spray, Greg Farnum) * mds: fix inode_t::compare() (`issue#15038 `_, `pr#8014 `_, Yan, Zheng) * mds: fix stray purging in 'stripe_count > 1' case (`issue#15050 `_, `pr#8040 `_, Yan, Zheng) * mds: function parameter 'df' should be passed by reference (`pr#7490 `_, Na Xie) * mirrors: Change contact e-mail address for se.ceph.com (`pr#8007 `_, Wido den Hollander) * mon/PGMonitor: reliably mark PGs state (`pr#8089 `_, Sage Weil) * mon/monitor: some clean up (`pr#7520 `_, huanwen ren) * mon/pgmonitor: use appropriate forced conversions in get_rule_avail (`pr#7705 `_, huanwen ren) * mon: cleanup set-quota error msg (`pr#7371 `_, Abhishek Lekshmanan) * mon: consider pool size when creating pool (`issue#14509 `_, `pr#7359 `_, songbaisen) * mon: enable 'mon osd prime pg temp' by default (`pr#7838 `_, Robert LeBlanc) * mon: fix calculation of %USED (`pr#7881 `_, Adam Kupczyk) * mon: fix keyring permissions (`issue#14950 `_, `pr#7880 `_, Owen Synge) * mon: initialize last_* timestamps on new pgs to creation time (`issue#14952 `_, `pr#7980 `_, Sage Weil) * mon: make clock skew checks sane (`issue#14175 `_, `pr#7141 `_, Joao Eduardo Luis) * mon: osd [test-]reweight-by-{pg,utilization} command updates (`pr#7890 `_, Dan van der Ster, Sage Weil) * mon: remove 'mds setmap' (`issue#15136 `_, `pr#8121 `_, Sage Weil) * mon: standardize Ceph removal commands (`pr#7939 `_, Dongsheng Yang) * mon: unconfuse object count skew message (`pr#7882 `_, Piotr Dałek) * mon: unregister command on shutdown (`pr#7504 `_, huanwen ren) * mount.fuse.ceph: better parsing of arguments passed to mount.fuse.ceph by mount command (`issue#14735 `_, `pr#7607 `_, Florent Bautista) * msg/async: _try_send trim already sent for outcoming_bl more efficient (`pr#7970 `_, Yan Jun) * msg/async: don't calculate msg header crc when not needed (`pr#7815 `_, Piotr Dałek) * msg/async: smarter MSG_MORE (`pr#7625 `_, Piotr Dałek) * msg: add thread safety for "random" Messenger + fix wrong usage of random functions (`pr#7650 `_, Avner BenHanoch) * msg: async: fix perf counter description and simplify _send_keepalive_or_ack (`pr#8046 `_, xie xingguo) * msg: async: small cleanups (`pr#7871 `_, xie xingguo) * msg: async: start over after failing to bind a port in specified range (`issue#14928 `_, `issue#13002 `_, `pr#7852 `_, xie xingguo) * msg: remove duplicated code - local_delivery will now call 'enqueue' (`pr#7948 `_, Avner BenHanoch) * msg: significantly reduce minimal memory usage of connections (`pr#7567 `_, Piotr Dałek) * mstart: start rgw on different ports as well (`pr#8167 `_, Abhishek Lekshmanan) * nfs for rgw (Matt Benjamin, Orit Wasserman) (`pr#7634 `_, Yehuda Sadeh, Matt Benjamin) * os/ObjectStore: implement more efficient get_encoded_bytes() (`pr#7775 `_, Piotr Dałek) * os/bluestore/BlueFS: Before reap ioct, it should wait io complete (`pr#8178 `_, Jianpeng Ma) * os/bluestore/BlueStore: Don't leak trim overlay data before write. (`pr#7895 `_, Jianpeng Ma) * os/bluestore/KernelDevice: force block size (`pr#8006 `_, Sage Weil) * os/bluestore/NVMEDevice: make IO thread using dpdk launch (`pr#8160 `_, Haomai Wang) * os/bluestore: clone overlay data (`pr#7860 `_, Jianpeng Ma) * os/bluestore: fix a typo in SPDK path parsing (`pr#7601 `_, Jianjian Huo) * os/bluestore: make bluestore_sync_transaction = true can work. (`pr#7674 `_, Jianpeng Ma) * os/bluestore: small fixes in bluestore StupidAllocator (`pr#8101 `_, Jianjian Huo) * os/filestore/FileJournal: set block size via config option (`pr#7628 `_, Sage Weil) * os/filestore: fix punch hole usage in _zero (`pr#8050 `_, Sage Weil) * os/filestore: fix result handling logic of destroy_collection (`pr#7721 `_, xie xingguo) * os/filestore: require offset == length == 0 for full object read; add test (`pr#7957 `_, Jianpeng Ma) * osd/OSDMap: fix typo in summarize_mapping_stats (`pr#8088 `_, Sage Weil) * osd/PGLog: fix warning (`pr#8057 `_, Sage Weil) * osd/ReplicatedPG: be more careful about calling publish_stats_to_osd() (`issue#14962 `_, `pr#8039 `_, Greg Farnum) * osd/ReplicatedPG: clear watches on change after applying repops (`issue#15151 `_, `pr#8163 `_, Sage Weil) * osd/ScrubStore: remove unused function (`pr#8045 `_, Kefu Chai) * osd: BlueStore/NVMEDevice: fix compiling and fd leak (`pr#7496 `_, xie xingguo) * osd: FileStore: use pwritev instead of lseek+writev (`pr#7349 `_, Haomai Wang, Tao Chang) * osd: OSDMap: reset osd_primary_affinity shared_ptr when deepish_copy_from (`issue#14686 `_, `pr#7553 `_, Xinze Chi) * osd: Replace snprintf with faster implementation in eversion_t::get_key_name (`pr#7121 `_, Evgeniy Firsov) * osd: WeightedPriorityQueue: move to intrusive containers (`pr#7654 `_, Robert LeBlanc) * osd: a fix for HeartbeatDispatcher and cleanups (`pr#7550 `_, Kefu Chai) * osd: add missing newline to usage message (`pr#7613 `_, Willem Jan Withagen) * osd: avoid FORCE updating digest been overwritten by MAYBE when comparing scrub map (`pr#7051 `_, Zhiqiang Wang) * osd: bluefs: fix alignment for odd page sizes (`pr#7900 `_, Dan Mick) * osd: bluestore updates, scrub fixes (`pr#8035 `_, Sage Weil) * osd: bluestore/blockdevice: use std::mutex et al (`pr#7568 `_, Sage Weil) * osd: bluestore: NVMEDevice: fix error handling (`pr#7799 `_, xie xingguo) * osd: bluestore: Revert NVMEDevice task cstor and refresh interface changes (`pr#7729 `_, Haomai Wang) * osd: bluestore: add 'override' to virtual functions (`pr#7886 `_, Michal Jarzabek) * osd: bluestore: allow _dump_onode dynamic accept log level (`pr#7995 `_, Jianpeng Ma) * osd: bluestore: fix check for write falling within the same extent (`issue#14954 `_, `pr#7892 `_, Jianpeng Ma) * osd: bluestore: for overwrite a extent, allocate new extent on min_alloc_size write (`pr#7996 `_, Jianpeng Ma) * osd: bluestore: improve fs-type verification and tidy up (`pr#7651 `_, xie xingguo) * osd: bluestore: misc fixes (`pr#7658 `_, Jianpeng Ma) * osd: bluestore: remove unneeded includes (`pr#7870 `_, Michal Jarzabek) * osd: clean up CMPXATTR checks (`pr#5961 `_, Jianpeng Ma) * osd: consider high/low mode when putting agent to sleep (`issue#14752 `_, `pr#7631 `_, Sage Weil) * osd: ensure new osdmaps commit before publishing them to pgs (`issue#15073 `_, `pr#8096 `_, Sage Weil) * osd: filejournal: report journal entry count (`pr#7643 `_, tianqing) * osd: filestore: FALLOC_FL_PUNCH_HOLE must be used with FALLOC_FL_KEEP_SIZE (`pr#7768 `_, xinxin shu) * osd: filestore: fast abort if statfs encounters ENOENT (`pr#7703 `_, xie xingguo) * osd: filestore: fix race condition with split vs collection_move_rename and long object names (`issue#14766 `_, `pr#8136 `_, Samuel Just) * osd: filestore: fix result code overwritten for clone (`issue#14817 `_, `issue#14827 `_, `pr#7711 `_, xie xingguo) * osd: filestore: fix wrong scope of result code for error cases during mkfs (`issue#14814 `_, `pr#7704 `_, xie xingguo) * osd: filestore: fix wrong scope of result code for error cases during mount (`issue#14815 `_, `pr#7707 `_, xie xingguo) * osd: filestore: restructure journal and op queue throttling (`pr#7767 `_, Samuel Just) * osd: fix forced prmootion for CALL ops (`issue#14745 `_, `pr#7617 `_, Sage Weil) * osd: fix fusestore hanging during stop/quit (`issue#14786 `_, `pr#7677 `_, xie xingguo) * osd: fix inaccurate counter and skip over queueing an empty transaction (`pr#7754 `_, xie xingguo) * osd: fix lack of object unblock when flush fails (`issue#14511 `_, `pr#7584 `_, Igor Fedotov) * osd: fix overload of '==' operator for pg_stat_t (`issue#14921 `_, `pr#7842 `_, xie xingguo) * osd: fix race condition for heartbeat_need_update (`issue#14387 `_, `pr#7739 `_, xie xingguo) * osd: fix return value from maybe_handle_cache_detail() (`pr#7593 `_, Igor Fedotov) * osd: fix unnecessary object promotion when deleting from cache pool (`issue#13894 `_, `pr#7537 `_, Igor Fedotov) * osd: fix wrong return type of find_osd_on_ip() (`issue#14872 `_, `pr#7812 `_, xie xingguo) * osd: ghobject_t: use ! instead of @ as a separator (`pr#7595 `_, Sage Weil) * osd: handle dup pg_create that races with pg deletion (`pr#8033 `_, Sage Weil) * osd: initialize last_recalibrate field at construction (`pr#8071 `_, xie xingguo) * osd: kstore: fix a race condition in _txc_finish() (`pr#7804 `_, Jianjian Huo) * osd: kstore: latency breakdown (`pr#7850 `_, James Liu) * osd: kstore: sync up kstore with recent bluestore updates (`pr#7681 `_, Jianjian Huo) * osd: memstore: fix alignment of Page for test_pageset (`pr#7587 `_, Casey Bodley) * osd: min_write_recency_for_promote & min_read_recency_for_promote are tiering only (`pr#8081 `_, huanwen ren) * osd: probabilistic cache tier promotion throttling (`pr#7465 `_, Sage Weil) * osd: remove up_thru_pending field, which is never used (`pr#7991 `_, xie xingguo) * osd: replicatedpg: break out loop if we encounter fatal error during do_pg_op() (`issue#14922 `_, `pr#7844 `_, xie xingguo) * osd: resolve boot vs NOUP set + clear race (`pr#7483 `_, Sage Weil) * packaging: make infernalis -> jewel upgrade work (`issue#15047 `_, `pr#8034 `_, Nathan Cutler) * packaging: move cephfs repair tools to ceph-common (`issue#15145 `_, `pr#8133 `_, Boris Ranto, Ken Dreyer) * pybind/rados: fix object lifetime issues and other bugs in aio (`pr#7778 `_, Hector Martin) * pybind/rados: use __dealloc__ since __del__ is ignored by cython (`pr#7692 `_, Mehdi Abaakouk) * pybind: Ensure correct python flags are passed (`pr#7663 `_, James Page) * pybind: flag an RBD image as closed regardless of result code (`pr#8005 `_, Jason Dillaman) * pybind: move cephfs to Cython (`pr#7745 `_, John Spray, Mehdi Abaakouk) * pybind: remove next() on iterators (`pr#7706 `_, Mehdi Abaakouk) * pybind: replace __del__ with __dealloc__ for rbd (`pr#7708 `_, Josh Durgin) * pybind: use correct subdir for rados install-exec rule (`pr#7684 `_, Josh Durgin) * python binding of librados with cython (`pr#7621 `_, Mehdi Abaakouk) * python: use pip instead of python setup.py (`pr#7605 `_, Loic Dachary) * qa/workunits/cephtool/test.sh: wait longer in ceph_watch_start() (`issue#14910 `_, `pr#7861 `_, Kefu Chai) * qa/workunits/rados/test.sh: capture stderr too (`pr#8004 `_, Sage Weil) * qa/workunits/rados/test.sh: test tmap_migrate (`pr#8114 `_, Sage Weil) * qa/workunits/rbd: do not use object map during read flag testing (`pr#8104 `_, Jason Dillaman) * qa/workunits/rbd: new online maintenance op tests (`pr#8216 `_, Jason Dillaman) * qa/workunits/rbd: use POSIX function definition (`issue#15104 `_, `pr#8068 `_, Nathan Cutler) * qa/workunits/rest/test.py: add confirmation to 'mds setmap' (`issue#14606 `_, `pr#7982 `_, Sage Weil) * qa/workunits/rest/test.py: don't use newfs (`pr#8191 `_, Sage Weil) * qa: add workunit to run ceph_test_rbd_mirror (`pr#8221 `_, Josh Durgin) * rados: add ceph:: namespace to bufferlist type (`pr#8059 `_, Noah Watkins) * rados: fix bug for write bench (`pr#7851 `_, James Liu) * rbd-mirror: ImageReplayer async start/stop (`pr#7944 `_, Mykola Golub) * rbd-mirror: ImageReplayer improvements (`pr#7759 `_, Mykola Golub) * rbd-mirror: fix image replay test failures (`pr#8158 `_, Jason Dillaman) * rbd-mirror: fix long termination due to 30sec wait in main loop (`pr#8185 `_, Mykola Golub) * rbd-mirror: implement ImageReplayer (`pr#7614 `_, Mykola Golub) * rbd-mirror: integrate with image sync state machine (`pr#8079 `_, Jason Dillaman) * rbd-mirror: minor fix-ups for initial skeleton implementation (`pr#7958 `_, Mykola Golub) * rbd-mirror: remote to local cluster image sync (`pr#7979 `_, Jason Dillaman) * rbd-mirror: use pool/image names in asok commands (`pr#8159 `_, Mykola Golub) * rbd-mirror: use the mirroring directory to detect candidate images (`issue#15142 `_, `pr#8162 `_, Ricardo Dias) * rbd/run_cli_tests.sh: Reflect test failures (`issue#14825 `_, `pr#7781 `_, Zack Cerza) * rbd: add support for mirror image promotion/demotion/resync (`pr#8138 `_, Jason Dillaman) * rbd: clone operation should default to image format 2 (`pr#8119 `_, Jason Dillaman) * rbd: deprecate image format 1 (`pr#7841 `_, Jason Dillaman) * rbd: support for enabling/disabling mirroring on specific images (`issue#13296 `_, `pr#8056 `_, Ricardo Dias) * release-notes: draft v0.94.6 release notes (`issue#13356 `_, `pr#7689 `_, Abhishek Varshney, Loic Dachary) * release-notes: draft v10.0.3 release notes (`pr#7592 `_, Loic Dachary) * release-notes: draft v10.0.4 release notes (`pr#7966 `_, Loic Dachary) * release-notes: draft v9.2.1 release notes (`issue#13750 `_, `pr#7694 `_, Abhishek Varshney) * rgw: ldap (Matt Benjamin) (`pr#7985 `_, Matt Benjamin) * rgw: multisite fixes (`pr#8013 `_, Yehuda Sadeh) * rgw: support for aws authentication v4 (Javier M. Mellid) (`issue#10333 `_, `pr#7720 `_, Yehuda Sadeh, Javier M. Mellid) * rgw sync fixes (`pr#8095 `_, Yehuda Sadeh) * rgw/rgw_common.h: fix the RGWBucketInfo decoding (`pr#8165 `_, Kefu Chai) * rgw/rgw_common.h: fix the RGWBucketInfo decoding (`pr#8154 `_, Kefu Chai) * rgw/rgw_orphan: check the return value of save_state (`pr#7544 `_, Boris Ranto) * rgw: Allow an implicit tenant in case of Keystone (`pr#8139 `_, Pete Zaitcev) * rgw: Drop unused usage_exit from rgw_admin.cc (`pr#7632 `_, Pete Zaitcev) * rgw: RGWZoneParams::create should not handle -EEXIST error (`pr#7927 `_, Orit Wasserman) * rgw: add bucket request payment feature usage statistics integration (`issue#13834 `_, `pr#6656 `_, Javier M. Mellid) * rgw: add support for caching of Keystone admin token. (`pr#7630 `_, Radoslaw Zarzynski) * rgw: add support for metadata upload during PUT on Swift container. (`pr#8002 `_, Radoslaw Zarzynski) * rgw: add support for system requests over Swift API (`pr#7666 `_, Radoslaw Zarzynski) * rgw: adjust the request_uri to support absoluteURI of http request (`issue#12917 `_, `pr#7675 `_, Wenjun Huang) * rgw: admin api for retrieving usage info (Ji Chen) (`pr#8031 `_, Yehuda Sadeh, Ji Chen) * rgw: allow authentication keystone with self signed certs (`issue#14853 `_, `issue#13422 `_, `pr#7777 `_, Abhishek Lekshmanan) * rgw: approximate AmazonS3 HostId error field. (`pr#7444 `_, Robin H. Johnson) * rgw: calculate payload hash in RGWPutObj_ObjStore only when necessary. (`pr#7869 `_, Radoslaw Zarzynski) * rgw: cleanups to comments and messages (`pr#7633 `_, Pete Zaitcev) * rgw: don't use s->bucket for metadata api path entry (`issue#14549 `_, `pr#7408 `_, Yehuda Sadeh) * rgw: drop permissions of rgw/civetweb after startup (`issue#13600 `_, `pr#8019 `_, Karol Mroz) * rgw: fcgi should include acconfig (`pr#7760 `_, Abhishek Lekshmanan) * rgw: fix wrong handling of limit=0 during listing of Swift account. (`issue#14903 `_, `pr#7821 `_, Radoslaw Zarzynski) * rgw: fixes for per-period metadata logs (`pr#7827 `_, Casey Bodley) * rgw: improve error handling in S3/Keystone integration (`pr#7597 `_, Radoslaw Zarzynski) * rgw: link civetweb with openssl (Sage, Marcus Watts) (`pr#7825 `_, Marcus Watts, Sage Weil) * rgw: link payer info to usage logging (`pr#7918 `_, Yehuda Sadeh, Javier M. Mellid) * rgw: move signal.h dependency from rgw_front.h (`pr#7678 `_, Matt Benjamin) * rgw: multiple Swift API compliance improvements for TempURL (Radoslaw Zarzynsk) (`issue#14806 `_, `issue#11163 `_, `pr#7891 `_, Radoslaw Zarzynski) * rgw: multiple improvements regarding etag calculation for SLO/DLO of Swift API. (`pr#7764 `_, Radoslaw Zarzynski) * rgw: remove duplicated code in RGWRados::get_bucket_info() (`pr#7413 `_, liyankun) * rgw: remove unused vector (`pr#7990 `_, Na Xie) * rgw: reset return code in when iterating over the bucket the objects (`issue#14826 `_, `pr#7803 `_, Orit Wasserman) * rgw: store system object meta in cache when creating it (`issue#14678 `_, `pr#7615 `_, Yehuda Sadeh) * rgw: support json format for admin policy API (Dunrong Huang) (`issue#14090 `_, `pr#8036 `_, Yehuda Sadeh, Dunrong Huang) * rgw: try to parse Keystone token in order appropriate to configuration. (`pr#7822 `_, Radoslaw Zarzynski) * rgw: use pimpl pattern for RGWPeriodHistory (`pr#7809 `_, Casey Bodley) * rgw: user quota may not adjust on bucket removal (`issue#14507 `_, `pr#7586 `_, root) * rgw:bucket link now set the bucket.instance acl (bug fix) (`issue#11076 `_, `pr#8037 `_, Zengran Zhang) * rpm,deb: remove conditional BuildRequires for btrfs-progs (`issue#15042 `_, `pr#8016 `_, Erwan Velu) * rpm: remove sub-package dependencies on "ceph" (`issue#15146 `_, `pr#8137 `_, Ken Dreyer) * script: add missing stop_rgw variable to stop.sh script (`pr#7959 `_, Karol Mroz) * selinux: Update policy to grant additional access (`issue#14870 `_, `pr#7971 `_, Boris Ranto) * selinux: allow log files to be located in /var/log/radosgw (`pr#7604 `_, Boris Ranto) * sstring.hh: return type from str_len(...) need not be const (`pr#7679 `_, Matt Benjamin) * submodules: revert an accidental change (`pr#7929 `_, Yehuda Sadeh) * systemd: correctly escape block device paths (`issue#14706 `_, `pr#7579 `_, James Page) * test/TestPGLog: fix the FTBFS (`issue#14930 `_, `pr#7855 `_, Kefu Chai) * test/bufferlist: Avoid false-positive tests (`pr#7955 `_, Erwan Velu) * test/cli-integration/rbd: disable progress output (`issue#14931 `_, `pr#7858 `_, Josh Durgin) * test/osd: Relax the timing intervals in osd-markdown.sh (`pr#7899 `_, Dan Mick) * test/pybind/test_ceph_argparse: fix reweight-by-utilization tests (`pr#8027 `_, Kefu Chai, Sage Weil) * test/radosgw-admin: update the expected usage outputs (`pr#7723 `_, Kefu Chai) * test/rgw: add multisite test for meta sync across periods (`pr#7887 `_, Casey Bodley) * test/time: no need to abs(uint64_t) for comparing (`pr#7726 `_, Kefu Chai) * test: add missing shut_down mock method (`pr#8125 `_, Jason Dillaman) * test: correct librbd errors discovered with unoptimized cmake build (`pr#7914 `_, Jason Dillaman) * test: create pools for rbd tests with different prefix (`pr#7738 `_, Mykola Golub) * test: enable test for bug #2339 which has been resolved. (`pr#7743 `_, You Ji) * test: fix issues discovered via the rbd permissions test case (`pr#8129 `_, Jason Dillaman) * test: fixup and improvements for rbd-mirror test (`pr#8090 `_, Mykola Golub) * test: handle exception thrown from close during rbd lock test (`pr#8124 `_, Jason Dillaman) * test: more debug logging for TestWatchNotify (`pr#7737 `_, Mykola Golub) * test: new librbd flatten test case (`pr#7609 `_, Jason Dillaman) * test: rbd-mirror: add "switch to the next tag" test (`pr#8149 `_, Mykola Golub) * test: rbd-mirror: compare positions using all fields (`pr#8172 `_, Mykola Golub) * test: reproducer for writeback CoW deadlock (`pr#8009 `_, Jason Dillaman) * test: update rbd integration cram tests for new default features (`pr#8001 `_, Jason Dillaman) * test_pool_create.sh: put test files in the test dir so they are cleaned up (`pr#8219 `_, Josh Durgin) * tests: ceph-disk.sh: should use "readlink -f" instead (`pr#7594 `_, Kefu Chai) * tests: ceph-disk.sh: use "readlink -f" instead for fullpath (`pr#7606 `_, Kefu Chai) * tests: fix a few build warnings (`pr#7608 `_, Sage Weil) * tests: sync ceph-erasure-code-corpus for mktemp -d (`pr#7596 `_, Loic Dachary) * tests: test_pidfile.sh lingering processes (`issue#14834 `_, `pr#7734 `_, Loic Dachary) * tools/cephfs: add tmap_upgrade (`pr#7003 `_, John Spray) * tools/cephfs: fix tmap_upgrade (`issue#15135 `_, `pr#8128 `_, John Spray) * tools/rados: reduce "rados put" memory usage by op_size (`pr#7928 `_, Piotr Dałek) * unittest_compression_zlib: do not assume buffer will be null terminated (`pr#8064 `_, Sage Weil) * unittest_osdmap: default crush tunables now firefly (`pr#8098 `_, Sage Weil) * vstart.sh: avoid race condition starting rgw via vstart.sh (`issue#14829 `_, `pr#7727 `_, Javier M. Mellid) * vstart.sh: silence a harmless msg where btrfs is not found (`pr#7640 `_, Patrick Donnelly) * xio: fix incorrect ip being assigned in case of multiple RDMA ports (`pr#7747 `_, Subramanyam Varanasi) * xio: remove duplicate assignment of peer addr (`pr#8025 `_, Roi Dayan) * xio: remove redundant magic methods (`pr#7773 `_, Roi Dayan) * xio: remove unused variable (`pr#8023 `_, Roi Dayan) * xxhash: use clone of xxhash.git; add .gitignore (`pr#7986 `_, Sage Weil) * rbd: update default image features (`pr#7846 `_, Jason Dillaman) * rbd-mirror: make remote context respect env and argv config params (`pr#8182 `_, Mykola Golub) * journal: re-use common threads between journalers (`pr#7906 `_, Jason Dillaman) * client: add option to control how directory size is calculated (`pr#7323 `_, Yan, Zheng) * rgw: keystone v3 (`pr#7719 `_, Mark Barnes, Radoslaw Zarzynski) * rgw: new multisite merge (`issue#14549 `_, `pr#7709 `_, Yehuda Sadeh, Orit Wasserman, Casey Bodley, Daniel Gryniewicz) * rgw: adjust error code when bucket does not exist in copy operation (`issue#14975 `_, `pr#7916 `_, Yehuda Sadeh) * rgw: indexless (`pr#7786 `_, Yehuda Sadeh) v10.0.5 ======= This is identical to v10.0.4 and was only created because of a git tagging mistake. v10.0.4 ======= This is the fifth and last development release before Jewel. The next release will be a release candidate with the final set of features. Big items include RGW static website support, librbd journal framework, fixed mon sync of config-key data, C++11 updates, and bluestore/kstore. Notable Changes --------------- * admin/build-doc: depend on zlib1g-dev and graphviz (`pr#7522 `_, Ken Dreyer) * buffer: use move construct to append/push_back/push_front (`pr#7455 `_, Haomai Wang) * build: allow jemalloc with rocksdb-static (`pr#7368 `_, Somnath Roy) * build: fix the autotools and cmake build (the new fusestore needs libfuse) (`pr#7393 `_, Kefu Chai) * build: fix warnings (`pr#7197 `_, Kefu Chai, xie xingguo) * build: fix warnings (`pr#7315 `_, Kefu Chai) * build: kill warnings (`pr#7397 `_, Kefu Chai) * build: move libexec scripts to standardize across distros (`issue#14687 `_, `issue#14705 `_, `issue#14723 `_, `pr#7636 `_, Nathan Cutler, Kefu Chai) * build: Refrain from versioning and packaging EC testing plugins (`issue#14756 `_, `issue#14723 `_, `pr#7637 `_, Nathan Cutler, Kefu Chai) * build: spdk submodule; cmake (`pr#7503 `_, Kefu Chai) * ceph-disk: support bluestore (`issue#13422 `_, `pr#7218 `_, Loic Dachary, Sage Weil) * ceph-disk/test: fix test_prepare.py::TestPrepare tests (`pr#7549 `_, Kefu Chai) * cleanup: remove misc dead code (`pr#7201 `_, Erwan Velu) * cls/cls_rbd: pass string by reference (`pr#7232 `_, Jeffrey Lu) * cmake: Added new unittests to make check (`pr#7572 `_, Ali Maredia) * cmake: add KernelDevice.cc to libos_srcs (`pr#7507 `_, Kefu Chai) * cmake: check for libsnappy in default path also (`pr#7366 `_, Kefu Chai) * cmake: feb5 (`pr#7541 `_, Matt Benjamin) * cmake: For CMake version <= 2.8.11, use LINK_PRIVATE and LINK_PUBLIC (`pr#7474 `_, Tao Chang) * cmake: let ceph-client-debug link with tcmalloc (`pr#7314 `_, Kefu Chai) * cmake: support ccache via a WITH_CCACHE build option (`pr#6875 `_, John Coyle) * common: add zlib compression plugin (`pr#7437 `_, Alyona Kiseleva, Kiseleva Alyona) * common: admin socket commands for tcmalloc heap get/set operations (`pr#7512 `_, Samuel Just) * common: ake ceph_time clocks work under BSD (`pr#7340 `_, Adam C. Emerson) * common: Allow OPT_INT settings with negative values (`issue#13829 `_, `pr#7390 `_, Brad Hubbard, Kefu Chai) * common/buffer: replace RWLock with spinlocks (`pr#7294 `_, Piotr Dałek) * common: change the type of counter total/unhealthy_workers (`pr#7254 `_, Guang Yang) * common: snappy decompressor may assert when handling segmented input bufferlist (`issue#14400 `_, `pr#7268 `_, Igor Fedotov) * common/str_map: cleanup: replaced get_str_map() function overloading by using default parameters for delimiters (`pr#7266 `_, Sahithi R V) * common: time: have skewing-now call non-skewing now (`pr#7466 `_, Adam C. Emerson) * common: unit test for interval_set implementations (`pr#6 `_, Igor Fedotov) * config: add $data_dir/config to config search path (`pr#7377 `_, Sage Weil) * configure.ac: make "--with-librocksdb-static" default to 'check' (`issue#14463 `_, `pr#7317 `_, Dan Mick) * crush: add safety assert (`issue#14496 `_, `pr#7344 `_, songbaisen) * crush: reply quickly from get_immediate_parent (`issue#14334 `_, `pr#7181 `_, song baisen) * debian: packaging fixes for jewel (`pr#7807 `_, Ken Dreyer, Ali Maredia) * debian/rpm split servers (`issue#10587 `_, `pr#7746 `_, Ken Dreyer) * doc: add orphans commands to radosgw-admin(8) (`issue#14637 `_, `pr#7518 `_, Ken Dreyer) * doc: amend the rados.8 (`pr#7251 `_, Kefu Chai) * doc: Fixes a CRUSH map step take argument (`pr#7327 `_, Ivan Grcic) * doc: fixing image in section ERASURE CODING (`pr#7298 `_, Rachana Patel) * doc: fix misleading configuration guide on cache tiering (`pr#7000 `_, Yuan Zhou) * doc: fix S3 C# example (`pr#7027 `_, Dunrong Huang) * doc: remove redundant space in ceph-authtool/monmaptool doc (`pr#7244 `_, Jiaying Ren) * doc: revise SubmittingPatches (`pr#7292 `_, Kefu Chai) * doc: rgw: port changes from downstream to upstream (`pr#7264 `_, Bara Ancincova) * doc: script and guidelines for mirroring Ceph (`pr#7384 `_, Wido den Hollander) * doc: use 'ceph auth get-or-create' for creating RGW keyring (`pr#6930 `_, Wido den Hollander) * global: do not start two daemons with a single pid-file (`issue#13422 `_, `pr#7075 `_, shun song) * global: do not start two daemons with a single pid-file (part 2) (`issue#13422 `_, `pr#7463 `_, Loic Dachary) * journal: flush commit position on metadata shutdown (`pr#7385 `_, Mykola Golub) * journal: reset commit_position_task_ctx pointer after task complete (`pr#7480 `_, Mykola Golub) * libcephfs: update LIBCEPHFS_VERSION to indicate the interface was changed (`pr#7551 `_, Jevon Qiao) * librados: move to c++11 concurrency types (`pr#5931 `_, Adam C. Emerson) * librados: remove duplicate definitions for rados pool_stat_t and cluster_stat_t (`pr#7330 `_, Igor Fedotov) * librados: shutdown finisher in a more graceful way (`pr#7519 `_, xie xingguo) * librados_test_stub: protect against notify/unwatch race (`pr#7540 `_, Jason Dillaman) * librbd: API: async open and close (`issue#14264 `_, `pr#7259 `_, Mykola Golub) * librbd: Avoid create two threads per image (`pr#7400 `_, Haomai Wang) * librbd: block maintenance ops until after journal is ready (`issue#14510 `_, `pr#7382 `_, Jason Dillaman) * librbd: fix internal handling of dynamic feature updates (`pr#7299 `_, Jason Dillaman) * librbd: journal framework for tracking exclusive lock transitions (`issue#13298 `_, `pr#7529 `_, Jason Dillaman) * librbd: journal shut down flush race condition (`issue#14434 `_, `pr#7302 `_, Jason Dillaman) * librbd: remove canceled tasks from timer thread (`issue#14476 `_, `pr#7329 `_, Douglas Fuller) * makefile: remove libedit from libclient.la (`pr#7284 `_, Kefu Chai) * mds, client: fix locking around handle_conf_change (`issue#14365 `_, `issue#14374 `_, `pr#7312 `_, John Spray) * mds: judgment added to avoid the risk of visiting the NULL pointer (`pr#7358 `_, Kongming Wu) * mon: add an independent option for max election time (`pr#7245 `_, Sangdi Xu) * mon: compact full epochs also (`issue#14537 `_, `pr#7396 `_, Kefu Chai) * mon: consider the pool size when setting pool crush rule (`issue#14495 `_, `pr#7341 `_, song baisen) * mon: drop useless rank init assignment (`issue#14508 `_, `pr#7321 `_, huanwen ren) * mon: fix locking in preinit error paths (`issue#14473 `_, `pr#7353 `_, huanwen ren) * mon: fix monmap creation stamp (`pr#7459 `_, duanweijun) * mon: fix sync of config-key data (`pr#7363 `_, Xiaowei Chen) * mon: go into ERR state if multiple PGs are stuck inactive (`issue#13923 `_, `pr#7253 `_, Wido den Hollander) * mon/MDSMonitor.cc: properly note beacon when health metrics changes (`issue#14684 `_, `pr#7757 `_, Yan, Zheng) * mon/MonClient: avoid null pointer error when configured incorrectly (`issue#14405 `_, `pr#7276 `_, Bo Cai) * mon: PG Monitor should report waiting for backfill (`issue#12744 `_, `pr#7398 `_, Abhishek Lekshmanan) * mon: reduce CPU and memory manager pressure of pg health check (`pr#7482 `_, Piotr Dałek) * mon: some cleanup in MonmapMonitor.cc (`pr#7418 `_, huanwen ren) * mon: warn if pg(s) not scrubbed (`issue#13142 `_, `pr#6440 `_, Michal Jarzabek) * msg/async: AsyncConnection: avoid debug log in cleanup_handler (`pr#7547 `_, Haomai Wang) * msg/async: bunch of fixes (`pr#7379 `_, Piotr Dałek) * msg/async: fix array boundary (`pr#7451 `_, Wei Jin) * msg/async: fix potential race condition (`pr#7453 `_, Haomai Wang) * msg/async: fix send closed local_connection message problem (`pr#7255 `_, Haomai Wang) * msg/async: reduce extra tcp packet for message ack (`pr#7380 `_, Haomai Wang) * msg/xio: fix compilation (`pr#7479 `_, Roi Dayan) * organizationmap: modify org mail info. (`pr#7240 `_, Xiaowei Chen) * os/bluestore: fix assert (`issue#14436 `_, `pr#7293 `_, xie xingguo) * os/bluestore: fix bluestore_wal_transaction_t encoding test (`pr#7342 `_, Kefu Chai) * os/bluestore: insert new onode to the front position of onode LRU (`pr#7492 `_, Jianjian Huo) * os/bluestore: use intrusive_ptr for Dir (`pr#7247 `_, Igor Fedotov) * osd: blockdevice: avoid implicit cast and add guard (`pr#7460 `_, xie xingguo) * osd: bluestore/BlueFS: initialize super block_size earlier in mkfs (`pr#7535 `_, Sage Weil) * osd: BlueStore: fix fsck and blockdevice read-relevant issue (`pr#7362 `_, xie xingguo) * osd: BlueStore: fix null pointer access (`issue#14561 `_, `pr#7435 `_, xie xingguo) * osd: bluestore, kstore: fix nid overwritten logic (`issue#14407 `_, `issue#14433 `_, `pr#7283 `_, xie xingguo) * osd: bluestore: use btree_map for allocator (`pr#7269 `_, Igor Fedotov, Sage Weil) * osd: drop fiemap len=0 logic (`pr#7267 `_, Sage Weil) * osd: FileStore: add error check for object_map->sync() (`pr#7281 `_, Chendi Xue) * osd: FileStore: cleanup: remove obsolete option "filestore_xattr_use_omap" (`issue#14356 `_, `pr#7217 `_, Vikhyat Umrao) * osd: FileStore: modify the format of colon (`pr#7333 `_, Donghai Xu) * osd: FileStore: print file name before osd assert if read file failed (`pr#7111 `_, Ji Chen) * osd: fix invalid list traversal in process_copy_chunk (`pr#7511 `_, Samuel Just) * osd, mon: fix exit issue (`pr#7420 `_, Jiaying Ren) * osd: PG::activate(): handle unexpected cached_removed_snaps more gracefully (`issue#14428 `_, `pr#7309 `_, Alexey Sheplyakov) * os/fs: fix io_getevents argument (`pr#7355 `_, Jingkai Yuan) * os/fusestore: add error handling (`pr#7395 `_, xie xingguo) * os/keyvaluestore: kill KeyValueStore (`pr#7320 `_, Haomai Wang) * os/kstore: insert new onode to the front position of onode LRU (`pr#7505 `_, xie xingguo) * os/ObjectStore: add custom move operations for ObjectStore::Transaction (`pr#7303 `_, Casey Bodley) * rgw: Bug fix for mtime anomalies in RadosGW and other places (`pr#7328 `_, Adam C. Emerson, Casey Bodley) * rpm: move %post(un) ldconfig calls to ceph-base (`issue#14940 `_, `pr#7867 `_, Nathan Cutler) * rpm: move runtime dependencies to ceph-base and fix other packaging issues (`issue#14864 `_, `pr#7826 `_, Nathan Cutler) * test: ceph_test_rados: use less CPU (`pr#7513 `_, Samuel Just) v10.0.3 ======= This is the fourth development release for Jewel. Several big pieces have been added this release, including BlueStore (a new backend for OSD to replace FileStore), many ceph-disk fixes, a new CRUSH tunable that improves mapping stability, a new librados object enumeration API, and a whole slew of OSD and RADOS optimizations. Note that, due to general developer busyness, we aren't building official release packages for this dev release. You can fetch autobuilt gitbuilder packages from the usual location (gitbuilder.ceph.com). Notable Changes --------------- * bluestore: latest and greatest (`issue#14210 `_, `issue#13801 `_, `pr#6896 `_, xie.xingguo, Jianpeng Ma, YiQiang Chen, Sage Weil, Ning Yao) * buffer: fix internal iterator invalidation on rebuild, get_contiguous (`pr#6962 `_, Sage Weil) * build: fix a few warnings (`pr#6847 `_, Orit Wasserman) * build: misc make check fixes (`pr#7153 `_, Sage Weil) * ceph-detect-init: fix py3 test (`pr#7025 `_, Kefu Chai) * ceph-disk: add -f flag for btrfs mkfs (`pr#7222 `_, Darrell Enns) * ceph-disk: ceph-disk list fails on /dev/cciss!c0d0 (`issue#13970 `_, `issue#14233 `_, `issue#14230 `_, `pr#6879 `_, Loic Dachary) * ceph-disk: fix failures when preparing disks with udev > 214 (`issue#14080 `_, `issue#14094 `_, `pr#6926 `_, Loic Dachary, Ilya Dryomov) * ceph-disk: Fix trivial typo (`pr#7472 `_, Brad Hubbard) * ceph-disk: warn for prepare partitions with bad GUIDs (`issue#13943 `_, `pr#6760 `_, David Disseldorp) * ceph-fuse: fix double decreasing the count to trim caps (`issue#14319 `_, `pr#7229 `_, Zhi Zhang) * ceph-fuse: fix double free of args (`pr#7015 `_, Ilya Shipitsin) * ceph-fuse: fix fsync() (`pr#6388 `_, Yan, Zheng) * ceph-fuse:print usage information when no parameter specified (`pr#6868 `_, Bo Cai) * ceph: improve the error message (`issue#11101 `_, `pr#7106 `_, Kefu Chai) * ceph.in: avoid a broken pipe error when use ceph command (`issue#14354 `_, `pr#7212 `_, Bo Cai) * ceph.spec.in: add copyright notice (`issue#14694 `_, `pr#7569 `_, Nathan Cutler) * ceph.spec.in: add license declaration (`pr#7574 `_, Nathan Cutler) * ceph_test_libcephfs: tolerate duplicated entries in readdir (`issue#14377 `_, `pr#7246 `_, Yan, Zheng) * client: check if Fh is readable when processing a read (`issue#11517 `_, `pr#7209 `_, Yan, Zheng) * client: properly trim unlinked inode (`issue#13903 `_, `pr#7297 `_, Yan, Zheng) * cls_rbd: add guards for error cases (`issue#14316 `_, `issue#14317 `_, `pr#7165 `_, xie xingguo) * cls_rbd: enable object map checksums for object_map_save (`issue#14280 `_, `pr#7149 `_, Douglas Fuller) * cmake: Add ENABLE_GIT_VERSION to avoid rebuilding (`pr#7171 `_, Kefu Chai) * cmake: add missing check for HAVE_EXECINFO_H (`pr#7270 `_, Casey Bodley) * cmake: cleanups and more features from automake (`pr#7103 `_, Casey Bodley, Ali Maredia) * cmake: detect bzip2 and lz4 (`pr#7126 `_, Kefu Chai) * cmake: fix build with bluestore (`pr#7099 `_, John Spray) * cmake: fix the build on trusty (`pr#7249 `_, Kefu Chai) * cmake: made rocksdb an imported library (`pr#7131 `_, Ali Maredia) * cmake: no need to run configure from run-cmake-check.sh (`pr#6959 `_, Orit Wasserman) * cmake: test_build_libcephfs needs ${ALLOC_LIBS} (`pr#7300 `_, Ali Maredia) * common/address_help.cc: fix the leak in entity_addr_from_url() (`issue#14132 `_, `pr#6987 `_, Qiankun Zheng) * common: add thread names (`pr#5882 `_, Igor Podoski) * common: assert: abort() rather than throw (`pr#6804 `_, Adam C. Emerson) * common: buffer/assert minor fixes (`pr#6990 `_, Matt Benjamin) * common/Formatter: avoid newline if there is no output (`pr#5351 `_, Aran85) * common: improve shared_cache and simple_cache efficiency with hash table (`pr#6909 `_, Ning Yao) * common/lockdep: increase max lock names (`pr#6961 `_, Sage Weil) * common: new timekeeping common code, and Objecter conversion (`pr#5782 `_, Adam C. Emerson) * common: signal_handler: added support for using reentrant strsignal() implementations vs. sys_siglist[] (`pr#6796 `_, John Coyle) * config: complains when a setting is not tracked (`issue#11692 `_, `pr#7085 `_, Kefu Chai) * configure: detect bz2 and lz4 (`issue#13850 `_, `issue#13981 `_, `pr#7030 `_, Kefu Chai) * correct radosgw-admin command (`pr#7006 `_, YankunLi) * crush: add chooseleaf_stable tunable (`pr#6572 `_, Sangdi Xu, Sage Weil) * crush: clean up whitespace removal (`issue#14302 `_, `pr#7157 `_, songbaisen) * crush/CrushTester: check for overlapped rules (`pr#7139 `_, Kefu Chai) * crushtool: improve usage/tip messages (`pr#7142 `_, xie xingguo) * crushtool: set type 0 name "device" for --build option (`pr#6824 `_, Sangdi Xu) * doc: adding "--allow-shrink" in decreasing the size of the rbd block to distinguish from the increasing option (`pr#7020 `_, Yehua) * doc: admin/build-doc: make paths absolute (`pr#7119 `_, Dan Mick) * doc: dev: document ceph-qa-suite (`pr#6955 `_, Loic Dachary) * doc: document "readforward" and "readproxy" cache mode (`pr#7023 `_, Kefu Chai) * doc: fix "mon osd down out subtree limit" option name (`pr#7164 `_, François Lafont) * doc: fix typo (`pr#7004 `_, tianqing) * doc: Updated the rados command man page to include the --run-name opt… (`issue#12899 `_, `pr#5900 `_, ritz303) * fs: be more careful about the "mds setmap" command to prevent breakage (`issue#14380 `_, `pr#7262 `_, Yan, Zheng) * helgrind: additional race conditionslibrbd: journal replay should honor inter-event dependencies (`pr#7274 `_, Jason Dillaman) * helgrind: fix real (and imaginary) race conditions (`issue#14163 `_, `pr#7208 `_, Jason Dillaman) * kv: implement value_as_ptr() and use it in .get() (`pr#7052 `_, Piotr Dałek) * librados: add c++ style osd/pg command interface (`pr#6893 `_, Yunchuan Wen) * librados: fix several flaws introduced by the enumeration_objects API (`issue#14299 `_, `issue#14301 `_, `issue#14300 `_, `pr#7156 `_, xie xingguo) * librados: new style (sharded) object listing (`pr#6405 `_, John Spray, Sage Weil) * librados: potential null pointer access in list_(n)objects (`issue#13822 `_, `pr#6639 `_, xie xingguo) * librbd: exit if parent's snap is gone during clone (`issue#14118 `_, `pr#6968 `_, xie xingguo) * librbd: fix potential memory leak (`issue#14332 `_, `issue#14333 `_, `pr#7174 `_, xie xingguo) * librbd: fix snap_exists API return code overflow (`issue#14129 `_, `pr#6986 `_, xie xingguo) * librbd: journal replay should honor inter-event dependencies (`pr#7019 `_, Jason Dillaman) * librbd: return error if we fail to delete object_map head object (`issue#14098 `_, `pr#6958 `_, xie xingguo) * librbd: small fixes for error messages and readahead counter (`issue#14127 `_, `pr#6983 `_, xie xingguo) * librbd: uninitialized state in snap remove state machine (`pr#6982 `_, Jason Dillaman) * mailmap: hange organization for Dongmao Zhang (`pr#7173 `_, Dongmao Zhang) * mailmap: Igor Podoski affiliation (`pr#7219 `_, Igor Podoski) * mailmap update (`pr#7210 `_, M Ranga Swami Reddy) * mailmap updates (`pr#6992 `_, Loic Dachary) * mailmap updates (`pr#7189 `_, Loic Dachary) * man: document listwatchers cmd in "rados" manpage (`pr#7021 `_, Kefu Chai) * mds: advance clientreplay when replying (`issue#14357 `_, `pr#7216 `_, John Spray) * mds: expose state of recovery to status ASOK command (`issue#14146 `_, `pr#7068 `_, Yan, Zheng) * mds: fix client cap/message replay order on restart (`issue#14254 `_, `issue#13546 `_, `pr#7199 `_, Yan, Zheng) * mds: fix standby replay thread creation (`issue#14144 `_, `pr#7132 `_, John Spray) * mds: we should wait messenger when MDSDaemon suicide (`pr#6996 `_, Wei Feng) * mon: add `osd blacklist clear` (`pr#6945 `_, John Spray) * mon: add RAW USED column to ceph df detail (`pr#7087 `_, Ruifeng Yang) * mon: degrade a log message to level 2 (`pr#6929 `_, Kongming Wu) * mon: fix coding-style on PG related Monitor files (`pr#6881 `_, Wido den Hollander) * mon: fixes related to mondbstore->get() changes (`pr#6564 `_, Piotr Dałek) * mon: fix reuse of osd ids (clear osd info on osd deletion) (`issue#13988 `_, `pr#6900 `_, Loic Dachary, Sage Weil) * mon: fix the can't change subscribe level bug in monitoring log (`pr#7031 `_, Zhiqiang Wang) * mon/MDSMonitor: add confirmation to "ceph mds rmfailed" (`issue#14379 `_, `pr#7248 `_, Yan, Zheng) * mon: modify a dout level in OSDMonitor.cc (`pr#6928 `_, Yongqiang He) * mon: MonmapMonitor: don't expose uncommitted state to client (`pr#6854 `_, Joao Eduardo Luis) * mon/OSDMonitor: osdmap laggy set a maximum limit for interval (`pr#7109 `_, Zengran Zhang) * mon: paxos is_recovering calc error (`pr#7227 `_, Weijun Duan) * mon/PGMap: show rd/wr iops separately in status reports (`pr#7072 `_, Cilang Zhao) * mon: PGMonitor: acting primary diff with cur_stat, should not set pg to stale (`pr#7083 `_, Xiaowei Chen) * msg: add override to virutal methods (`pr#6977 `_, Michal Jarzabek) * msg/async: cleanup dead connection and misc things (`pr#7158 `_, Haomai Wang) * msg/async: don't use shared_ptr to manage EventCallback (`pr#7028 `_, Haomai Wang) * msg: filter out lo addr when bind osd addr (`pr#7012 `_, Ji Chen) * msg: removed unneeded includes from Dispatcher (`pr#6814 `_, Michal Jarzabek) * msg: remove unneeded inline (`pr#6989 `_, Michal Jarzabek) * msgr: fix large message data content length causing overflow (`pr#6809 `_, Jun Huang, Haomai Wang) * msg/simple: pipe: memory leak when signature check failed (`pr#7096 `_, Ruifeng Yang) * msg/simple: remove unneeded friend declarations (`pr#6924 `_, Michal Jarzabek) * objecter: avoid recursive lock of Objecter::rwlock (`pr#7343 `_, Yan, Zheng) * os/bluestore: fix bluestore_wal_transaction_t encoding test (`pr#7419 `_, Kefu Chai, Brad Hubbard) * osd: add cache hint when pushing raw clone during recovery (`pr#7069 `_, Zhiqiang Wang) * osd: avoid debug std::string initialization in PG::get/put (`pr#7117 `_, Evgeniy Firsov) * osd: avoid osd_op_thread suicide because osd_scrub_sleep (`pr#7009 `_, Jianpeng Ma) * osd: bluestore: bluefs: fix several small bugs (`issue#14344 `_, `issue#14343 `_, `pr#7200 `_, xie xingguo) * osd: bluestore: don't include when building without libaio (`issue#14207 `_, `pr#7169 `_, Mykola Golub) * osd: bluestore: fix bluestore onode_t attr leak (`pr#7125 `_, Ning Yao) * osd: bluestore: fix bluestore_wal_transaction_t encoding test (`pr#7168 `_, Kefu Chai) * osd: bluestore: fix several bugs (`issue#14259 `_, `issue#14353 `_, `issue#14260 `_, `issue#14261 `_, `pr#7122 `_, xie xingguo) * osd: bluestore: fix space rebalancing, collection split, buffered reads (`pr#7196 `_, Sage Weil) * osd: bluestore: more fixes (`pr#7130 `_, Sage Weil) * osd: cache tier: add config option for eviction check list size (`pr#6997 `_, Yuan Zhou) * osdc: Fix race condition with tick_event and shutdown (`issue#14256 `_, `pr#7151 `_, Adam C. Emerson) * osd: check health state before pre_booting (`issue#14181 `_, `pr#7053 `_, Xiaoxi Chen) * osd: clear pg_stat_queue after stopping pgs (`issue#14212 `_, `pr#7091 `_, Sage Weil) * osd: delay populating in-memory PG log hashmaps (`pr#6425 `_, Piotr Dałek) * osd: disable filestore_xfs_extsize by default (`issue#14397 `_, `pr#7265 `_, Ken Dreyer) * osd: do not keep ref of old osdmap in pg (`issue#13990 `_, `pr#7007 `_, Kefu Chai) * osd: drop deprecated removal pg type (`pr#6970 `_, Igor Podoski) * osd: FileJournal: fix return code of create method (`issue#14134 `_, `pr#6988 `_, xie xingguo) * osd: FileJournal: support batch peak and pop from writeq (`pr#6701 `_, Xinze Chi) * osd: FileStore: conditional collection of drive metadata (`pr#6956 `_, Somnath Roy) * osd: FileStore:: optimize lfn_unlink (`pr#6649 `_, Jianpeng Ma) * osd: fix null pointer access and race condition (`issue#14072 `_, `pr#6916 `_, xie xingguo) * osd: fix scrub start hobject (`pr#7467 `_, Sage Weil) * osd: fix sparse-read result code checking logic (`issue#14151 `_, `pr#7016 `_, xie xingguo) * osd: fix temp object removal after upgrade (`issue#13862 `_, `pr#6976 `_, David Zafman) * osd: fix wip (l_osd_op_wip) perf counter and remove repop_map (`pr#7077 `_, Xinze Chi) * osd: fix wrongly placed assert and some cleanups (`pr#6766 `_, xiexingguo, xie xingguo) * osd: KeyValueStore: fix return code of mkfs (`pr#7036 `_, xie xingguo) * osd: KeyValueStore: fix wrongly placed assert (`issue#14176 `_, `issue#14178 `_, `pr#7047 `_, xie xingguo) * osd: kstore: several small fixes (`issue#14351 `_, `issue#14352 `_, `pr#7213 `_, xie xingguo) * osd: kstore: small fixes to kstore (`issue#14204 `_, `pr#7095 `_, xie xingguo) * osd: make list_missing query missing_loc.needs_recovery_map (`pr#6298 `_, Guang Yang) * osdmap: remove unused local variables (`pr#6864 `_, luo kexue) * osd: memstore: fix two bugs (`pr#6963 `_, Casey Bodley, Sage Weil) * osd: misc FileStore fixes (`issue#14192 `_, `issue#14188 `_, `issue#14194 `_, `issue#14187 `_, `issue#14186 `_, `pr#7059 `_, xie xingguo) * osd: misc optimization for map utilization (`pr#6950 `_, Ning Yao) * osd,mon: log leveldb and rocksdb to ceph log (`pr#6921 `_, Sage Weil) * osd: Omap small bugs adapted (`pr#6669 `_, Jianpeng Ma, David Zafman) * osd: optimize the session_handle_reset function (`issue#14182 `_, `pr#7054 `_, songbaisen) * osd: OSDService: Fix typo in osdmap comment (`pr#7275 `_, Brad Hubbard) * osd: os: skip checking pg_meta object existance in FileStore (`pr#6870 `_, Ning Yao) * osd: PGLog: clean up read_log (`pr#7092 `_, Jie Wang) * osd: prevent osd_recovery_sleep from causing recovery-thread suicide (`pr#7065 `_, Jianpeng Ma) * osd: reduce string use in coll_t::calc_str() (`pr#6505 `_, Igor Podoski) * osd: release related sources when scrub is interrupted (`pr#6744 `_, Jianpeng Ma) * osd: remove unused OSDMap::set_weightf() (`issue#14369 `_, `pr#7231 `_, huanwen ren) * osd: ReplicatedPG: clean up unused function (`pr#7211 `_, Xiaowei Chen) * osd/ReplicatedPG: fix promotion recency logic (`issue#14320 `_, `pr#6702 `_, Sage Weil) * osd: several small cleanups (`pr#7055 `_, xie xingguo) * osd: shut down if we flap too many times in a short period (`pr#6708 `_, Xiaoxi Chen) * osd: skip promote for writefull w/ FADVISE_DONTNEED/NOCACHE (`pr#7010 `_, Jianpeng Ma) * osd: small fixes to memstore (`issue#14228 `_, `issue#14229 `_, `issue#14227 `_, `pr#7107 `_, xie xingguo) * osd: try evicting after flushing is done (`pr#5630 `_, Zhiqiang Wang) * osd: use atomic to generate ceph_tid (`pr#7017 `_, Evgeniy Firsov) * osd: use optimized is_zero in object_stat_sum_t.is_zero() (`pr#7203 `_, Piotr Dałek) * osd: utime_t, eversion_t, osd_stat_sum_t encoding optimization (`pr#6902 `_, Xinze Chi) * pybind: add ceph_volume_client interface for Manila and similar frameworks (`pr#6205 `_, John Spray) * pybind: fix build failure, remove extraneous semicolon in method (`issue#14371 `_, `pr#7235 `_, Abhishek Lekshmanan) * pybind/test_rbd: fix test_create_defaults (`issue#14279 `_, `pr#7155 `_, Josh Durgin) * qa: disable rbd/qemu-iotests test case 055 on RHEL/CentOSlibrbd: journal replay should honor inter-event dependencies (`issue#14385 `_, `pr#7272 `_, Jason Dillaman) * qa/workunits: merge_diff shouldn't attempt to use striping (`issue#14165 `_, `pr#7041 `_, Jason Dillaman) * qa/workunits/snaps: move snap tests into fs sub-directory (`pr#6496 `_, Yan, Zheng) * rados: implement rm --force option to force remove when full (`pr#6202 `_, Xiaowei Chen) * rbd: additional validation for striping parameters (`pr#6914 `_, Na Xie) * rbd: add pool name to disambiguate rbd admin socket commands (`pr#6904 `_, wuxiangwei) * rbd: correct an output string for merge-diff (`pr#7046 `_, Kongming Wu) * rbd: fix static initialization ordering issues (`pr#6978 `_, Mykola Golub) * rbd-fuse: image name can not include snap name (`pr#7044 `_, Yongqiang He) * rbd-fuse: implement mv operation (`pr#6938 `_, wuxiangwei) * rbd: must specify both of stripe-unit and stripe-count when specifying stripingv2 feature (`pr#7026 `_, Donghai Xu) * rbd-nbd: add copyright (`pr#7166 `_, Li Wang) * rbd-nbd: fix up return code handling (`pr#7215 `_, Mykola Golub) * rbd-nbd: small improvements in logging and forking (`pr#7127 `_, Mykola Golub) * rbd: rbd order will be place in 22, when set to 0 in the config_opt (`issue#14139 `_, `issue#14047 `_, `pr#6886 `_, huanwen ren) * rbd: striping parameters should support 64bit integers (`pr#6942 `_, Na Xie) * rbd: use default order from configuration when not specified (`pr#6965 `_, Yunchuan Wen) * rgw: add a method to purge all associate keys when removing a subuser (`issue#12890 `_, `pr#6002 `_, Sangdi Xu) * rgw: add missing error code for admin op API (`pr#7037 `_, Dunrong Huang) * rgw: add support for "end_marker" parameter for GET on Swift account. (`issue#10682 `_, `pr#4216 `_, Radoslaw Zarzynski) * rgw_admin: orphans finish segfaults (`pr#6652 `_, Igor Fedotov) * rgw: content length (`issue#13582 `_, `pr#6975 `_, Yehuda Sadeh) * rgw: delete default zone (`pr#7005 `_, YankunLi) * rgw: do not abort radowgw server when using admin op API with bad parameters (`issue#14190 `_, `issue#14191 `_, `pr#7063 `_, Dunrong Huang) * rgw: Drop a debugging message (`pr#7280 `_, Pete Zaitcev) * rgw: fix a typo in init-radosgw (`pr#6817 `_, Zhi Zhang) * rgw: fix compilation warning (`pr#7160 `_, Yehuda Sadeh) * rgw: fix wrong check for parse() return (`pr#6797 `_, Dunrong Huang) * rgw: let radosgw-admin bucket stats return a standard josn (`pr#7029 `_, Ruifeng Yang) * rgw: modify command stucking when operating radosgw-admin metadata list user (`pr#7032 `_, Peiyang Liu) * rgw: modify documents and help infos' descriptions to the usage of option date when executing command "log show" (`pr#6080 `_, Kongming Wu) * rgw: Parse --subuser better (`pr#7279 `_, Pete Zaitcev) * rgw: radosgw-admin bucket check --fix not work (`pr#7093 `_, Weijun Duan) * rgw: warn on suspicious civetweb frontend parameters (`pr#6944 `_, Matt Benjamin) * rocksdb: remove rdb sources from dist tarball (`issue#13554 `_, `pr#7105 `_, Venky Shankar) * stringify outputted error code and fix unmatched parentheses. (`pr#6998 `_, xie.xingguo, xie xingguo) * test/librbd/fsx: Use c++11 std::mt19937 generator instead of random_r() (`pr#6332 `_, John Coyle) * test/mon/osd-erasure-code-profile: pick new mon port (`pr#7161 `_, Sage Weil) * tests: add const for ec test (`pr#6911 `_, Michal Jarzabek) * tests: configure with rocksdb by default (`issue#14220 `_, `pr#7100 `_, Loic Dachary) * tests: Fix for make check. (`pr#7102 `_, David Zafman) * tests: notification slave needs to wait for master (`issue#13810 `_, `pr#7220 `_, Jason Dillaman) * tests: snap rename and rebuild object map in client update test (`pr#7224 `_, Jason Dillaman) * tests: unittest_bufferlist: fix hexdump test (`pr#7152 `_, Sage Weil) * tests: unittest_ipaddr: fix segv (`pr#7154 `_, Sage Weil) * tools: ceph_monstore_tool: add inflate-pgmap command (`issue#14217 `_, `pr#7097 `_, Kefu Chai) * tools: monstore: add 'show-versions' command. (`pr#7073 `_, Cilang Zhao) v10.0.2 ======= This development release includes a raft of changes and improvements for Jewel. Key additions include CephFS scrub/repair improvements, an AIX and Solaris port of librados, many librbd journaling additions and fixes, extended per-pool options, and NBD driver for RBD (rbd-nbd) that allows librbd to present a kernel-level block device on Linux, multitenancy support for RGW, RGW bucket lifecycle support, RGW support for Swift static large objects (SLO), and RGW support for Swift bulk delete. There are also lots of smaller optimizations and performance fixes going in all over the tree, particular in the OSD and common code. Notable Changes --------------- * auth: fail if rotating key is missing (do not spam log) (`pr#6473 `_, Qiankun Zheng) * auth: fix crash when bad keyring is passed (`pr#6698 `_, Dunrong Huang) * auth: make keyring without mon entity type return -EACCES (`pr#5734 `_, Xiaowei Chen) * buffer: make usable outside of ceph source again (`pr#6863 `_, Josh Durgin) * build: cmake check fixes (`pr#6787 `_, Orit Wasserman) * build: fix bz2-dev dependency (`pr#6948 `_, Samuel Just) * build: Gentoo: _FORTIFY_SOURCE fix. (`issue#13920 `_, `pr#6739 `_, Robin H. Johnson) * build/ops: systemd ceph-disk unit must not assume /bin/flock (`issue#13975 `_, `pr#6803 `_, Loic Dachary) * ceph-detect-init: Ubuntu >= 15.04 uses systemd (`pr#6873 `_, James Page) * cephfs-data-scan: scan_frags (`pr#5941 `_, John Spray) * cephfs-data-scan: scrub tag filtering (#12133 and #12145) (`issue#12133 `_, `issue#12145 `_, `pr#5685 `_, John Spray) * ceph-fuse: add process to ceph-fuse --help (`pr#6821 `_, Wei Feng) * ceph-kvstore-tool: handle bad out file on command line (`pr#6093 `_, Kefu Chai) * ceph-mds:add --help/-h (`pr#6850 `_, Cilang Zhao) * ceph_objectstore_bench: fix race condition, bugs (`issue#13516 `_, `pr#6681 `_, Igor Fedotov) * ceph.spec.in: add BuildRequires: systemd (`issue#13860 `_, `pr#6692 `_, Nathan Cutler) * client: a better check for MDS availability (`pr#6253 `_, John Spray) * client: close mds sessions in shutdown() (`pr#6269 `_, John Spray) * client: don't invalidate page cache when inode is no longer used (`pr#6380 `_, Yan, Zheng) * client: modify a word in log (`pr#6906 `_, YongQiang He) * cls/cls_rbd.cc: fix misused metadata_name_from_key (`issue#13922 `_, `pr#6661 `_, Xiaoxi Chen) * cmake: Add common/PluginRegistry.cc to CMakeLists.txt (`pr#6805 `_, Pete Zaitcev) * cmake: add rgw_basic_types.cc to librgw.a (`pr#6786 `_, Orit Wasserman) * cmake: add TracepointProvider.cc to libcommon (`pr#6823 `_, Orit Wasserman) * cmake: define STRERROR_R_CHAR_P for GNU-specific strerror_r (`pr#6751 `_, Ilya Dryomov) * cmake: update for recent librbd changes (`pr#6715 `_, John Spray) * cmake: update for recent rbd changes (`pr#6818 `_, Mykola Golub) * common: add generic plugin infrastructure (`pr#6696 `_, Sage Weil) * common: add latency perf counter for finisher (`pr#6175 `_, Xinze Chi) * common: buffer: add cached_crc and cached_crc_adjust counts to perf dump (`pr#6535 `_, Ning Yao) * common: buffer: remove unneeded list destructor (`pr#6456 `_, Michal Jarzabek) * common/ceph_context.cc:fix order of initialisers (`pr#6838 `_, Michal Jarzabek) * common: don't reverse hobject_t hash bits when zero (`pr#6653 `_, Piotr Dałek) * common: log: Assign LOG_DEBUG priority to syslog calls (`issue#13993 `_, `pr#6815 `_, Brad Hubbard) * common: log: predict log message buffer allocation size (`pr#6641 `_, Adam Kupczyk) * common: optimize debug logging code (`pr#6441 `_, Adam Kupczyk) * common: perf counter for bufferlist history total alloc (`pr#6198 `_, Xinze Chi) * common: reduce CPU usage by making stringstream in stringify function thread local (`pr#6543 `_, Evgeniy Firsov) * common: re-enable backtrace support (`pr#6771 `_, Jason Dillaman) * common: SubProcess: fix multiple definition bug (`pr#6790 `_, Yunchuan Wen) * common: use namespace instead of subclasses for buffer (`pr#6686 `_, Michal Jarzabek) * configure.ac: macro fix (`pr#6769 `_, Igor Podoski) * doc: admin/build-doc: add lxml dependencies on debian (`pr#6610 `_, Ken Dreyer) * doc/cephfs/posix: update (`pr#6922 `_, Sage Weil) * doc: CodingStyle: fix broken URLs (`pr#6733 `_, Kefu Chai) * doc: correct typo 'restared' to 'restarted' (`pr#6734 `_, Yilong Zhao) * doc/dev/index: refactor/reorg (`pr#6792 `_, Nathan Cutler) * doc/dev/index.rst: begin writing Contributing to Ceph (`pr#6727 `_, Nathan Cutler) * doc/dev/index.rst: fix headings (`pr#6780 `_, Nathan Cutler) * doc: dev: introduction to tests (`pr#6910 `_, Loic Dachary) * doc: file must be empty when writing layout fields of file use "setfattr" (`pr#6848 `_, Cilang Zhao) * doc: Fixed incorrect name of a "List Multipart Upload Parts" Response Entity (`issue#14003 `_, `pr#6829 `_, Lenz Grimmer) * doc: Fixes a spelling error (`pr#6705 `_, Jeremy Qian) * doc: fix typo in cephfs/quota (`pr#6745 `_, Drunkard Zhang) * doc: fix typo in developer guide (`pr#6943 `_, Nathan Cutler) * doc: INSTALL redirect to online documentation (`pr#6749 `_, Loic Dachary) * doc: little improvements for troubleshooting scrub issues (`pr#6827 `_, Mykola Golub) * doc: Modified a note section in rbd-snapshot doc. (`pr#6908 `_, Nilamdyuti Goswami) * doc: note that cephfs auth stuff is new in jewel (`pr#6858 `_, John Spray) * doc: osd: s/schedued/scheduled/ (`pr#6872 `_, Loic Dachary) * doc: remove unnecessary period in headline (`pr#6775 `_, Marc Koderer) * doc: rst style fix for pools document (`pr#6816 `_, Drunkard Zhang) * doc: Update list of admin/build-doc dependencies (`issue#14070 `_, `pr#6934 `_, Nathan Cutler) * init-ceph: do umount when the path exists. (`pr#6866 `_, Xiaoxi Chen) * journal: disconnect watch after watch error (`issue#14168 `_, `pr#7113 `_, Jason Dillaman) * journal: fire replay complete event after reading last object (`issue#13924 `_, `pr#6762 `_, Jason Dillaman) * journal: support replaying beyond skipped splay objects (`pr#6687 `_, Jason Dillaman) * librados: aix gcc librados port (`pr#6675 `_, Rohan Mars) * librados: avoid malloc(0) (which can return NULL on some platforms) (`issue#13944 `_, `pr#6779 `_, Dan Mick) * librados: clean up Objecter.h (`pr#6731 `_, Jie Wang) * librados: include/rados/librados.h: fix typo (`pr#6741 `_, Nathan Cutler) * librbd: automatically flush IO after blocking write operations (`issue#13913 `_, `pr#6742 `_, Jason Dillaman) * librbd: better handling of exclusive lock transition period (`pr#7204 `_, Jason Dillaman) * librbd: check for presence of journal before attempting to remove (`issue#13912 `_, `pr#6737 `_, Jason Dillaman) * librbd: clear error when older OSD doesn't support image flags (`issue#14122 `_, `pr#7035 `_, Jason Dillaman) * librbd: correct include guard in RenameRequest.h (`pr#7143 `_, Jason Dillaman) * librbd: correct issues discovered during teuthology testing (`issue#14108 `_, `issue#14107 `_, `pr#6974 `_, Jason Dillaman) * librbd: correct issues discovered when cache is disabled (`issue#14123 `_, `pr#6979 `_, Jason Dillaman) * librbd: correct race conditions discovered during unit testing (`issue#14060 `_, `pr#6923 `_, Jason Dillaman) * librbd: disable copy-on-read when not exclusive lock owner (`issue#14167 `_, `pr#7129 `_, Jason Dillaman) * librbd: do not ignore self-managed snapshot release result (`issue#14170 `_, `pr#7043 `_, Jason Dillaman) * librbd: ensure copy-on-read requests are complete prior to closing parent image (`pr#6740 `_, Jason Dillaman) * librbd: ensure librados callbacks are flushed prior to destroying (`issue#14092 `_, `pr#7040 `_, Jason Dillaman) * librbd: fix journal iohint (`pr#6917 `_, Jianpeng Ma) * librbd: fix known test case race condition failures (`issue#13969 `_, `pr#6800 `_, Jason Dillaman) * librbd: fix merge-diff for >2GB diff-files (`issue#14030 `_, `pr#6889 `_, Yunchuan Wen) * librbd: fix test case race condition for journaling ops (`pr#6877 `_, Jason Dillaman) * librbd: fix tracepoint parameter in diff_iterate (`pr#6892 `_, Yunchuan Wen) * librbd: image refresh code paths converted to async state machines (`pr#6859 `_, Jason Dillaman) * librbd: include missing header for bool type (`pr#6798 `_, Mykola Golub) * librbd: initial collection of state machine unit tests (`pr#6703 `_, Jason Dillaman) * librbd: integrate journaling for maintenance operations (`pr#6625 `_, Jason Dillaman) * librbd: journaling-related lock dependency cleanup (`pr#6777 `_, Jason Dillaman) * librbd: not necessary to hold owner_lock while releasing snap id (`issue#13914 `_, `pr#6736 `_, Jason Dillaman) * librbd: only send signal when AIO completions queue empty (`pr#6729 `_, Jianpeng Ma) * librbd: optionally validate new RBD pools for snapshot support (`issue#13633 `_, `pr#6925 `_, Jason Dillaman) * librbd: partial revert of commit 9b0e359 (`issue#13969 `_, `pr#6789 `_, Jason Dillaman) * librbd: properly handle replay of snap remove RPC message (`issue#14164 `_, `pr#7042 `_, Jason Dillaman) * librbd: reduce verbosity of common error condition logging (`issue#14234 `_, `pr#7114 `_, Jason Dillaman) * librbd: simplify IO method signatures for 32bit environments (`pr#6700 `_, Jason Dillaman) * librbd: support eventfd for AIO completion notifications (`pr#5465 `_, Haomai Wang) * mailmap: add UMCloud affiliation (`pr#6820 `_, Jiaying Ren) * mailmap: Jewel updates (`pr#6750 `_, Abhishek Lekshmanan) * makefiles: remove bz2-dev from dependencies (`issue#13981 `_, `pr#6939 `_, Piotr Dałek) * mds: add 'p' flag in auth caps to control setting pool in layout (`pr#6567 `_, John Spray) * mds: fix client capabilities during reconnect (client.XXXX isn't responding to mclientcaps(revoke)) (`issue#11482 `_, `pr#6432 `_, Yan, Zheng) * mds: fix setvxattr (broken in a536d114) (`issue#14029 `_, `pr#6941 `_, John Spray) * mds: repair the command option "--hot-standby" (`pr#6454 `_, Wei Feng) * mds: tear down connections from `tell` commands (`issue#14048 `_, `pr#6933 `_, John Spray) * mon: fix ceph df pool available calculation for 0-weighted OSDs (`pr#6660 `_, Chengyuan Li) * mon: fix routed_request_tids leak (`pr#6102 `_, Ning Yao) * mon: support min_down_reporter by subtree level (default by host) (`pr#6709 `_, Xiaoxi Chen) * mount.ceph: memory leaks (`pr#6905 `_, Qiankun Zheng) * osd: add osd op queue latency perfcounter (`pr#5793 `_, Haomai Wang) * osd: Allow repair of history.last_epoch_started using config (`pr#6793 `_, David Zafman) * osd: avoid duplicate op->mark_started in ReplicatedBackend (`pr#6689 `_, Jacek J. Łakis) * osd: cancel failure reports if we fail to rebind network (`pr#6278 `_, Xinze Chi) * osd: correctly handle small osd_scrub_interval_randomize_ratio (`pr#7147 `_, Samuel Just) * osd: defer decoding of MOSDRepOp/MOSDRepOpReply (`pr#6503 `_, Xinze Chi) * osd: don't update epoch and rollback_info objects attrs if there is no need (`pr#6555 `_, Ning Yao) * osd: dump number of missing objects for each peer with pg query (`pr#6058 `_, Guang Yang) * osd: enable perfcounters on sharded work queue mutexes (`pr#6455 `_, Jacek J. Łakis) * osd: FileJournal: reduce locking scope in write_aio_bl (`issue#12789 `_, `pr#5670 `_, Zhi Zhang) * osd: FileStore: remove __SWORD_TYPE dependency (`pr#6263 `_, John Coyle) * osd: fix FileStore::_destroy_collection error return code (`pr#6612 `_, Ruifeng Yang) * osd: fix incorrect throttle in WBThrottle (`pr#6713 `_, Zhang Huan) * osd: fix MOSDRepScrub reference counter in replica_scrub (`pr#6730 `_, Jie Wang) * osd: fix rollback_info_trimmed_to before index() (`issue#13965 `_, `pr#6801 `_, Samuel Just) * osd: fix trivial scrub bug (`pr#6533 `_, Li Wang) * osd: KeyValueStore: don't queue NULL context (`pr#6783 `_, Haomai Wang) * osd: make backend and block device code a bit more generic (`pr#6759 `_, Sage Weil) * osd: move newest decode version of MOSDOp and MOSDOpReply to the front (`pr#6642 `_, Jacek J. Łakis) * osd: pg_pool_t: add dictionary for pool options (`issue#13077 `_, `pr#6081 `_, Mykola Golub) * osd: reduce memory consumption of some structs (`pr#6475 `_, Piotr Dałek) * osd: release the message throttle when OpRequest unregistered (`issue#14248 `_, `pr#7148 `_, Samuel Just) * osd: remove __SWORD_TYPE dependency (`pr#6262 `_, John Coyle) * osd: slightly reduce actual size of pg_log_entry_t (`pr#6690 `_, Piotr Dałek) * osd: support pool level recovery_priority and recovery_op_priority (`pr#5953 `_, Guang Yang) * osd: use pg id (without shard) when referring the PG (`pr#6236 `_, Guang Yang) * packaging: add build dependency on python devel package (`pr#7205 `_, Josh Durgin) * pybind/cephfs: add symlink and its unit test (`pr#6323 `_, Shang Ding) * pybind: decode empty string in conf_parse_argv() correctly (`pr#6711 `_, Josh Durgin) * pybind: Implementation of rados_ioctx_snapshot_rollback (`pr#6878 `_, Florent Manens) * pybind: port the rbd bindings to Cython (`issue#13115 `_, `pr#6768 `_, Hector Martin) * pybind: support ioctx:exec (`pr#6795 `_, Noah Watkins) * qa: erasure-code benchmark plugin selection (`pr#6685 `_, Loic Dachary) * qa/krbd: Expunge generic/247 (`pr#6831 `_, Douglas Fuller) * qa/workunits/cephtool/test.sh: false positive fail on /tmp/obj1. (`pr#6837 `_, Robin H. Johnson) * qa/workunits/cephtool/test.sh: no ./ (`pr#6748 `_, Sage Weil) * qa/workunits/rbd: rbd-nbd test should use sudo for map/unmap ops (`issue#14221 `_, `pr#7101 `_, Jason Dillaman) * rados: bench: fix off-by-one to avoid writing past object_size (`pr#6677 `_, Tao Chang) * rbd: add --object-size option, deprecate --order (`issue#12112 `_, `pr#6830 `_, Vikhyat Umrao) * rbd: add RBD pool mirroring configuration API + CLI (`pr#6129 `_, Jason Dillaman) * rbd: fix build with "--without-rbd" (`issue#14058 `_, `pr#6899 `_, Piotr Dałek) * rbd: journal: configuration via conf, cli, api and some fixes (`pr#6665 `_, Mykola Golub) * rbd: merge_diff test should use new --object-size parameter instead of --order (`issue#14106 `_, `pr#6972 `_, Na Xie, Jason Dillaman) * rbd-nbd: network block device (NBD) support for RBD (`pr#6657 `_, Yunchuan Wen, Li Wang) * rbd: output formatter may not be closed upon error (`issue#13711 `_, `pr#6706 `_, xie xingguo) * rgw: add a missing cap type (`pr#6774 `_, Yehuda Sadeh) * rgw: add an inspection to the field of type when assigning user caps (`pr#6051 `_, Kongming Wu) * rgw: add LifeCycle feature (`pr#6331 `_, Ji Chen) * rgw: add support for Static Large Objects of Swift API (`issue#12886 `_, `issue#13452 `_, `pr#6643 `_, Yehuda Sadeh, Radoslaw Zarzynski) * rgw: fix a glaring syntax error (`pr#6888 `_, Pavan Rallabhandi) * rgw: fix the build failure (`pr#6927 `_, Kefu Chai) * rgw: multitenancy support (`pr#6784 `_, Yehuda Sadeh, Pete Zaitcev) * rgw: Remove unused code in PutMetadataAccount:execute (`pr#6668 `_, Pete Zaitcev) * rgw: remove unused variable in RGWPutMetadataBucket::execute (`pr#6735 `_, Radoslaw Zarzynski) * rgw/rgw_resolve: fallback to res_query when res_nquery not implemented (`pr#6292 `_, John Coyle) * rgw: static large objects (Radoslaw Zarzynski, Yehuda Sadeh) * rgw: swift bulk delete (Radoslaw Zarzynski) * systemd: start/stop/restart ceph services by daemon type (`issue#13497 `_, `pr#6276 `_, Zhi Zhang) * sysvinit: allow custom cluster names (`pr#6732 `_, Richard Chan) * test/encoding/readable.sh fix (`pr#6714 `_, Igor Podoski) * test: fix osd-scrub-snaps.sh (`pr#6697 `_, Xinze Chi) * test/librados/test.cc: clean up EC pools' crush rules too (`issue#13878 `_, `pr#6788 `_, Loic Dachary, Dan Mick) * tests: allow object corpus readable test to skip specific incompat instances (`pr#6932 `_, Igor Podoski) * tests: ceph-helpers assert success getting backfills (`pr#6699 `_, Loic Dachary) * tests: ceph_test_keyvaluedb_iterators: fix broken test (`pr#6597 `_, Haomai Wang) * tests: fix failure for osd-scrub-snap.sh (`issue#13986 `_, `pr#6890 `_, Loic Dachary, Ning Yao) * tests: fix race condition testing auto scrub (`issue#13592 `_, `pr#6724 `_, Xinze Chi, Loic Dachary) * tests: flush op work queue prior to destroying MockImageCtx (`issue#14092 `_, `pr#7002 `_, Jason Dillaman) * tests: --osd-scrub-load-threshold=2000 for more consistency (`issue#14027 `_, `pr#6871 `_, Loic Dachary) * tests: osd-scrub-snaps.sh to display full osd logs on error (`issue#13986 `_, `pr#6857 `_, Loic Dachary) * test: use sequential journal_tid for object cacher test (`issue#13877 `_, `pr#6710 `_, Josh Durgin) * tools: add cephfs-table-tool 'take_inos' (`pr#6655 `_, John Spray) * tools: Fix layout handing in cephfs-data-scan (#13898) (`pr#6719 `_, John Spray) * tools: support printing part cluster map in readable fashion (`issue#13079 `_, `pr#5921 `_, Bo Cai) * vstart.sh: add mstart, mstop, mrun wrappers for running multiple vstart-style test clusters out of src tree (`pr#6901 `_, Yehuda Sadeh) v10.0.1 ======= This is the second development release for the Jewel cycle. Highlights include some KeyValueDB interface optimizations, initial journaling support in librbd, MDS scrubbing progress, and a bunch of OSD optimizations and improvements (removal of an unnecessary CRUSH calculation in the IO path, sharding for FileStore completions, improved temperature calculation for cache tiering, fixed randomization of scrub times, and various optimizations). Notable Changes --------------- * build: build internal plugins and classes as modules (`pr#6462 `_, James Page) * build: fix Jenkins make check errors due to deep-scrub randomization (`pr#6671 `_, David Zafman) * build/ops: enable CR in CentOS 7 (`issue#13997 `_, `pr#6844 `_, Loic Dachary) * build/ops: rbd-replay moved from ceph-test-dbg to ceph-common-dbg (`issue#13785 `_, `pr#6578 `_, Loic Dachary) * ceph-disk: Add destroy and deactivate option (`issue#7454 `_, `pr#5867 `_, Vicente Cheng) * ceph-disk: compare parted output with the dereferenced path (`issue#13438 `_, `pr#6219 `_, Joe Julian) * ceph-objectstore-tool: fix --dry-run for many ceph-objectstore-tool operations (`pr#6545 `_, David Zafman) * ceph.spec.in: limit _smp_mflags when lowmem_builder is set in SUSE's OBS (`issue#13858 `_, `pr#6691 `_, Nathan Cutler) * ceph_test_msgr: Use send_message instead of keepalive to wakeup connection (`pr#6605 `_, Haomai Wang) * client: avoid creating orphan object in Client::check_pool_perm() (`issue#13782 `_, `pr#6603 `_, Yan, Zheng) * client: use null snapc to check pool permission (`issue#13714 `_, `pr#6497 `_, Yan, Zheng) * cmake: add nss as a suffix for pk11pub.h (`pr#6556 `_, Samuel Just) * cmake: fix files list (`pr#6539 `_, Yehuda Sadeh) * cmake: librbd and libjournal build fixes (`pr#6557 `_, Ilya Dryomov) * coc: fix typo in the apt-get command (`pr#6659 `_, Chris Holcombe) * common: allow enable/disable of optracker at runtime (`pr#5168 `_, Jianpeng Ma) * common: fix reset max in Throttle using perf reset command (`issue#13517 `_, `pr#6300 `_, Xinze Chi) * doc: add v0.80.11 to the release timeline (`pr#6658 `_, Loic Dachary) * doc: release-notes: draft v0.80.11 release notes (`pr#6374 `_, Loic Dachary) * doc: release-notes: draft v10.0.0 release notes (`pr#6666 `_, Loic Dachary) * doc: SubmittingPatches: there is no next; only jewel (`pr#6811 `_, Nathan Cutler) * doc: Update ceph-disk manual page with new feature deactivate/destroy. (`pr#6637 `_, Vicente Cheng) * doc: update infernalis release notes (`pr#6575 `_, vasukulkarni) * fix: use right init_flags to finish CephContext (`pr#6549 `_, Yunchuan Wen) * init-ceph: use getopt to make option processing more flexible (`issue#3015 `_, `pr#6089 `_, Nathan Cutler) * journal: incremental improvements and fixes (`pr#6552 `_, Mykola Golub) * krbd: remove deprecated --quiet param from udevadm (`issue#13560 `_, `pr#6394 `_, Jason Dillaman) * kv: fix bug in kv key optimization (`pr#6511 `_, Sage Weil) * kv/KineticStore: fix broken split_key (`pr#6574 `_, Haomai Wang) * kv: optimize and clean up internal key/value interface (`pr#6312 `_, Piotr Dałek, Sage Weil) * librados: do cleanup (`pr#6488 `_, xie xingguo) * librados: fix pool alignment API overflow issue (`issue#13715 `_, `pr#6489 `_, xie xingguo) * librados: fix potential null pointer access when do pool_snap_list (`issue#13639 `_, `pr#6422 `_, xie xingguo) * librados: fix PromoteOn2ndRead test for EC (`pr#6373 `_, Sage Weil) * librados: fix rare race where pool op callback may hang forever (`issue#13642 `_, `pr#6426 `_, xie xingguo) * librados: Solaris port (`pr#6416 `_, Rohan Mars) * librbd: flush and invalidate cache via admin socket (`issue#2468 `_, `pr#6453 `_, Mykola Golub) * librbd: integrate journaling support for IO operations (`pr#6541 `_, Jason Dillaman) * librbd: perf counters might not be initialized on error (`issue#13740 `_, `pr#6523 `_, Jason Dillaman) * librbd: perf section name: use hyphen to separate components (`issue#13719 `_, `pr#6516 `_, Mykola Golub) * librbd: resize should only update image size within header (`issue#13674 `_, `pr#6447 `_, Jason Dillaman) * librbd: start perf counters after id is initialized (`issue#13720 `_, `pr#6494 `_, Mykola Golub) * mailmap: revise organization (`pr#6519 `_, Li Wang) * mailmap: Ubuntu Kylin name changed to Kylin Cloud (`pr#6532 `_, Loic Dachary) * mailmap: update .organizationmap (`pr#6565 `_, chenji-kael) * mailmap: updates for infernalis. (`pr#6495 `_, Yann Dupont) * mailmap: updates (`pr#6594 `_, chenji-kael) * mds: fix scrub_path (`pr#6684 `_, John Spray) * mds: properly set STATE_STRAY/STATE_ORPHAN for stray dentry/inode (`issue#13777 `_, `pr#6553 `_, Yan, Zheng) * mds: ScrubStack and "tag path" command (`pr#5662 `_, Yan, Zheng, John Spray, Greg Farnum) * mon: block 'ceph osd pg-temp ...' if pg_temp update is already pending (`pr#6704 `_, Sage Weil) * mon: don't require OSD W for MRemoveSnaps (`issue#13777 `_, `pr#6601 `_, John Spray) * mon: initialize recorded election epoch properly even when standalone (`issue#13627 `_, `pr#6407 `_, huanwen ren) * mon: revert MonitorDBStore's WholeStoreIteratorImpl::get (`issue#13742 `_, `pr#6522 `_, Piotr Dałek) * msg/async: let receiver ack message ASAP (`pr#6478 `_, Haomai Wang) * msg/async: support of non-block connect in async messenger (`issue#12802 `_, `pr#5848 `_, Jianhui Yuan) * msg/async: will crash if enabling async msg because of an assertion (`pr#6640 `_, Zhi Zhang) * osd: avoid calculating crush mapping for most ops (`pr#6371 `_, Sage Weil) * osd: avoid double-check for replaying and can_checkpoint() in FileStore::_check_replay_guard (`pr#6471 `_, Ning Yao) * osd: call on_new_interval on newly split child PG (`issue#13962 `_, `pr#6778 `_, Sage Weil) * osd: change mutex to spinlock to optimize thread context switch. (`pr#6492 `_, Xiaowei Chen) * osd: check do_shutdown before do_restart (`pr#6547 `_, Xiaoxi Chen) * osd: clarify the scrub result report (`pr#6534 `_, Li Wang) * osd: don't do random deep scrubs for user initiated scrubs (`pr#6673 `_, David Zafman) * osd: FileStore: support multiple ondisk finish and apply finishers (`pr#6486 `_, Xinze Chi, Haomai Wang) * osd: fix broken balance / localized read handling (`issue#13491 `_, `pr#6364 `_, Jason Dillaman) * osd: fix bug in last_* PG state timestamps (`pr#6517 `_, Li Wang) * osd: fix ClassHandler::ClassData::get_filter() (`pr#6747 `_, Yan, Zheng) * osd: fixes for several cases where op result code was not checked or set (`issue#13566 `_, `pr#6347 `_, xie xingguo) * osd: fix reactivate (check OSDSuperblock in mkfs() when we already have the superblock) (`issue#13586 `_, `pr#6385 `_, Vicente Cheng) * osd: fix wrong use of right parenthesis in localized read logic (`pr#6566 `_, Jie Wang) * osd: improve temperature calculation for cache tier agent (`pr#4737 `_, MingXin Liu) * osd: merge local_t and op_t txn to single one (`pr#6439 `_, Xinze Chi) * osd: newstore: misc updates (including kv and os/fs stuff) (`pr#6609 `_, Sage Weil) * osd: note down the number of missing clones (`pr#6654 `_, Kefu Chai) * osd: optimize clone write path if object-map is enabled (`pr#6403 `_, xinxin shu) * osd: optimize scrub subset_last_update calculation (`pr#6518 `_, Li Wang) * osd: partial revert of "ReplicatedPG: result code not correctly set in some cases." (`issue#13796 `_, `pr#6622 `_, Sage Weil) * osd: randomize deep scrubbing (`pr#6550 `_, Dan van der Ster, Herve Rousseau) * osd: scrub: do not assign value if read error (`pr#6568 `_, Li Wang) * osd: write file journal optimization (`pr#6484 `_, Xinze Chi) * rbd: accept --user, refuse -i command-line optionals (`pr#6590 `_, Ilya Dryomov) * rbd: add missing command aliases to refactored CLI (`issue#13806 `_, `pr#6606 `_, Jason Dillaman) * rbd: dynamically generated bash completion (`issue#13494 `_, `pr#6316 `_, Jason Dillaman) * rbd: fixes for refactored CLI and related tests (`pr#6738 `_, Ilya Dryomov) * rbd: make config changes actually apply (`pr#6520 `_, Mykola Golub) * rbd: refactor cli command handling (`pr#5987 `_, Jason Dillaman) * rbd: stripe unit/count set incorrectly from config (`pr#6593 `_, Mykola Golub) * rbd: support negative boolean command-line optionals (`issue#13784 `_, `pr#6607 `_, Jason Dillaman) * rbd: unbreak rbd map + cephx_sign_messages option (`pr#6583 `_, Ilya Dryomov) * rgw: bucket request payment support (`issue#13427 `_, `pr#6214 `_, Javier M. Mellid) * rgw: extend rgw_extended_http_attrs to affect Swift accounts and containers as well (`pr#5969 `_, Radoslaw Zarzynski) * rgw: fix openssl linkage (`pr#6513 `_, Yehuda Sadeh) * rgw: fix partial read issue in rgw_admin and rgw_tools (`pr#6761 `_, Jiaying Ren) * rgw: fix reload on non Debian systems. (`pr#6482 `_, Hervé Rousseau) * rgw: fix response of delete expired objects (`issue#13469 `_, `pr#6228 `_, Yuan Zhou) * rgw: fix swift API returning incorrect account metadata (`issue#13140 `_, `pr#6047 `_, Sangdi Xu) * rgw: link against system openssl (instead of dlopen at runtime) (`pr#6419 `_, Sage Weil) * rgw: prevent anonymous user from reading bucket with authenticated read ACL (`issue#13207 `_, `pr#6057 `_, root) * rgw: use smart pointer for C_Reinitwatch (`pr#6767 `_, Orit Wasserman) * systemd: fix typos (`pr#6679 `_, Tobias Suckow) * tests: centos7 needs the Continuous Release (CR) Repository enabled for (`issue#13997 `_, `pr#6842 `_, Brad Hubbard) * tests: concatenate test_rados_test_tool from src and qa (`issue#13691 `_, `pr#6464 `_, Loic Dachary) * tests: fix test_rados_tools.sh rados lookup (`issue#13691 `_, `pr#6502 `_, Loic Dachary) * tests: fix typo in TestClsRbd.snapshots test case (`issue#13727 `_, `pr#6504 `_, Jason Dillaman) * tests: ignore test-suite.log (`pr#6584 `_, Loic Dachary) * tests: restore run-cli-tests (`pr#6571 `_, Loic Dachary, Sage Weil, Jason Dillaman) * tools/cephfs: fix overflow writing header to fixed size buffer (#13816) (`pr#6617 `_, John Spray) v10.0.0 ======= This is the first development release for the Jewel cycle. Notable Changes --------------- * build: cmake tweaks (`pr#6254 `_, John Spray) * build: more CMake package check fixes (`pr#6108 `_, Daniel Gryniewicz) * ceph-disk: get Nonetype when ceph-disk list with --format plain on single device. (`pr#6410 `_, Vicente Cheng) * ceph: fix tell behavior (`pr#6329 `_, David Zafman) * ceph-fuse: While starting ceph-fuse, start the log thread first (`issue#13443 `_, `pr#6224 `_, Wenjun Huang) * client: don't mark_down on command reply (`pr#6204 `_, John Spray) * client: drop prefix from ints (`pr#6275 `_, John Coyle) * client: sys/file.h includes for flock operations (`pr#6282 `_, John Coyle) * cls_rbd: change object_map_update to return 0 on success, add logging (`pr#6467 `_, Douglas Fuller) * cmake: Use uname instead of arch. (`pr#6358 `_, John Coyle) * common: assert: __STRING macro is not defined by musl libc. (`pr#6210 `_, John Coyle) * common: fix OpTracker age histogram calculation (`pr#5065 `_, Zhiqiang Wang) * common/MemoryModel: Added explicit feature check for mallinfo(). (`pr#6252 `_, John Coyle) * common/obj_bencher.cc: fix verification crashing when there's no objects (`pr#5853 `_, Piotr Dałek) * common: optimize debug logging (`pr#6307 `_, Adam Kupczyk) * common: Thread: move copy constructor and assignment op (`pr#5133 `_, Michal Jarzabek) * common: WorkQueue: new PointerWQ base class for ContextWQ (`issue#13636 `_, `pr#6525 `_, Jason Dillaman) * compat: use prefixed typeof extension (`pr#6216 `_, John Coyle) * crush: validate bucket id before indexing buckets array (`issue#13477 `_, `pr#6246 `_, Sage Weil) * doc: download GPG key from download.ceph.com (`issue#13603 `_, `pr#6384 `_, Ken Dreyer) * doc: fix outdated content in cache tier (`pr#6272 `_, Yuan Zhou) * doc/release-notes: v9.1.0 (`pr#6281 `_, Loic Dachary) * doc/releases-notes: fix build error (`pr#6483 `_, Kefu Chai) * doc: remove toctree items under Create CephFS (`pr#6241 `_, Jevon Qiao) * doc: rename the "Create a Ceph User" section and add verbage about… (`issue#13502 `_, `pr#6297 `_, ritz303) * docs: Fix styling of newly added mirror docs (`pr#6127 `_, Wido den Hollander) * doc, tests: update all http://ceph.com/ to download.ceph.com (`pr#6435 `_, Alfredo Deza) * doc: update doc for with new pool settings (`pr#5951 `_, Guang Yang) * doc: update radosgw-admin example (`pr#6256 `_, YankunLi) * doc: update the OS recommendations for newer Ceph releases (`pr#6355 `_, ritz303) * drop envz.h includes (`pr#6285 `_, John Coyle) * libcephfs: Improve portability by replacing loff_t type usage with off_t (`pr#6301 `_, John Coyle) * libcephfs: only check file offset on glibc platforms (`pr#6288 `_, John Coyle) * librados: fix examples/librados/Makefile error. (`pr#6320 `_, You Ji) * librados: init crush_location from config file. (`issue#13473 `_, `pr#6243 `_, Wei Luo) * librados: wrongly passed in argument for stat command (`issue#13703 `_, `pr#6476 `_, xie xingguo) * librbd: deadlock while attempting to flush AIO requests (`issue#13726 `_, `pr#6508 `_, Jason Dillaman) * librbd: fix enable objectmap feature issue (`issue#13558 `_, `pr#6339 `_, xinxin shu) * librbd: remove duplicate read_only test in librbd::async_flatten (`pr#5856 `_, runsisi) * mailmap: modify member info (`pr#6468 `_, Xiaowei Chen) * mailmap: updates (`pr#6258 `_, M Ranga Swami Reddy) * mailmap: Xie Xingguo affiliation (`pr#6409 `_, Loic Dachary) * mds: implement snapshot rename (`pr#5645 `_, xinxin shu) * mds: messages/MOSDOp: cast in assert to eliminate warnings (`issue#13625 `_, `pr#6414 `_, David Zafman) * mds: new filtered MDS tell commands for sessions (`pr#6180 `_, John Spray) * mds/Session: use projected parent for auth path check (`issue#13364 `_, `pr#6200 `_, Sage Weil) * mon: should not set isvalid = true when cephx_verify_authorizer return false (`issue#13525 `_, `pr#6306 `_, Ruifeng Yang) * osd: Add config option osd_read_ec_check_for_errors for testing (`pr#5865 `_, David Zafman) * osd: add pin/unpin support to cache tier (11066) (`pr#6326 `_, Zhiqiang Wang) * osd: auto repair EC pool (`issue#12754 `_, `pr#6196 `_, Guang Yang) * osd: drop the interim set from load_pgs() (`pr#6277 `_, Piotr Dałek) * osd: FileJournal: _fdump wrongly returns if journal is currently unreadable. (`issue#13626 `_, `pr#6406 `_, xie xingguo) * osd: FileStore: add a field indicate xattr only one chunk for set xattr. (`pr#6244 `_, Jianpeng Ma) * osd: FileStore: LFNIndex: remove redundant local variable 'obj'. (`issue#13552 `_, `pr#6333 `_, xiexingguo) * osd: FileStore: potential memory leak if _fgetattrs fails (`issue#13597 `_, `pr#6377 `_, xie xingguo) * osd: FileStore: remove unused local variable 'handle' (`pr#6381 `_, xie xingguo) * osd: fix bogus scrub results when missing a clone (`issue#12738 `_, `issue#12740 `_, `pr#5783 `_, David Zafman) * osd: fix debug message in OSD::is_healthy (`pr#6226 `_, Xiaoxi Chen) * osd: fix MOSDOp encoding (`pr#6174 `_, Sage Weil) * osd: init started to 0 (`issue#13206 `_, `pr#6107 `_, Sage Weil) * osd: KeyValueStore: fix the name's typo of keyvaluestore_default_strip_size (`pr#6375 `_, Zhi Zhang) * osd: new and delete ObjectStore::Transaction in a function is not necessary (`pr#6299 `_, Ruifeng Yang) * osd: optimize get_object_context (`pr#6305 `_, Jianpeng Ma) * osd: optimize MOSDOp/do_op/handle_op (`pr#5211 `_, Jacek J. Lakis) * osd: os/chain_xattr: On linux use linux/limits.h for XATTR_NAME_MAX. (`pr#6343 `_, John Coyle) * osd: reorder bool fields in PGLog struct (`pr#6279 `_, Piotr Dałek) * osd: ReplicatedPG: remove unused local variables (`issue#13575 `_, `pr#6360 `_, xiexingguo) * osd: reset primary and up_primary when building a new past_interval. (`issue#13471 `_, `pr#6240 `_, xiexingguo) * radosgw-admin: Checking the legality of the parameters (`issue#13018 `_, `pr#5879 `_, Qiankun Zheng) * radosgw-admin: Create --secret-key alias for --secret (`issue#5821 `_, `pr#5335 `_, Yuan Zhou) * radosgw-admin: metadata list user should return an empty list when user pool is empty (`issue#13596 `_, `pr#6465 `_, Orit Wasserman) * rados: new options for write benchmark (`pr#6340 `_, Joaquim Rocha) * rbd: fix clone isssue (`issue#13553 `_, `pr#6334 `_, xinxin shu) * rbd: fix init-rbdmap CMDPARAMS (`issue#13214 `_, `pr#6109 `_, Sage Weil) * rbdmap: systemd support (`issue#13374 `_, `pr#6479 `_, Boris Ranto) * rbd: rbdmap improvements (`pr#6445 `_, Boris Ranto) * release-notes: draft v0.94.4 release notes (`pr#5907 `_, Loic Dachary) * release-notes: draft v0.94.4 release notes (`pr#6195 `_, Loic Dachary) * release-notes: draft v0.94.4 release notes (`pr#6238 `_, Loic Dachary) * rgw: add compat header for TEMP_FAILURE_RETRY (`pr#6294 `_, John Coyle) * rgw: add default quota config (`pr#6400 `_, Daniel Gryniewicz) * rgw: add support for getting Swift's DLO without manifest handling (`pr#6206 `_, Radoslaw Zarzynski) * rgw: clarify the error message when trying to create an existed user (`pr#5938 `_, Zeqiang Zhuang) * rgw: fix objects can not be displayed which object name does not cont… (`issue#12963 `_, `pr#5738 `_, Weijun Duan) * rgw: fix typo in RGWHTTPClient::process error message (`pr#6424 `_, Brad Hubbard) * rgw: fix wrong etag calculation during POST on S3 bucket. (`issue#11241 `_, `pr#6030 `_, Radoslaw Zarzynski) * rgw: mdlog trim add usage prompt (`pr#6059 `_, Weijun Duan) * rgw: modify the conditional statement in parse_metadata_key method. (`pr#5875 `_, Zengran Zhang) * rgw: refuse to calculate digest when the s3 secret key is empty (`issue#13133 `_, `pr#6045 `_, Sangdi Xu) * rgw: remove extra check in RGWGetObj::execute (`issue#12352 `_, `pr#5262 `_, Javier M. Mellid) * rgw: support core file limit for radosgw daemon (`pr#6346 `_, Guang Yang) * rgw: swift use Civetweb ssl can not get right url (`issue#13628 `_, `pr#6408 `_, Weijun Duan) * rocksdb: build with PORTABLE=1 (`pr#6311 `_, Sage Weil) * rocksdb: remove rdb source files from dist tarball (`issue#13554 `_, `pr#6379 `_, Kefu Chai) * rocksdb: use native rocksdb makefile (and our autotools) (`pr#6290 `_, Sage Weil) * rpm: ceph.spec.in: correctly declare systemd dependency for SLE/openSUSE (`pr#6114 `_, Nathan Cutler) * rpm: ceph.spec.in: fix libs-compat / devel-compat conditional (`issue#12315 `_, `pr#5219 `_, Ken Dreyer) * rpm: rhel 5.9 librados compile fix, moved blkid to RBD check/compilation (`issue#13177 `_, `pr#5954 `_, Rohan Mars) * scripts: release_notes can track original issue (`pr#6009 `_, Abhishek Lekshmanan) * test/libcephfs/flock: add sys/file.h include for flock operations (`pr#6310 `_, John Coyle) * test_rgw_admin: use freopen for output redirection. (`pr#6303 `_, John Coyle) * tests: allow docker-test.sh to run under root (`issue#13355 `_, `pr#6173 `_, Loic Dachary) * tests: ceph-disk workunit uses configobj (`pr#6342 `_, Loic Dachary) * tests: destroy testprofile before creating one (`issue#13664 `_, `pr#6446 `_, Loic Dachary) * tests: port uniqueness reminder (`pr#6387 `_, Loic Dachary) * tests: test/librados/test.cc must create profile (`issue#13664 `_, `pr#6452 `_, Loic Dachary) * tools/cephfs: fix overflow writing header to fixed size buffer (#13816) (`pr#6617 `_, John Spray) * tools: ceph-monstore-update-crush: add "--test" when testing crushmap (`pr#6418 `_, Kefu Chai) * tools:remove duplicate references (`pr#5917 `_, Bo Cai) * vstart: grant full access to Swift testing account (`pr#6239 `_, Yuan Zhou) * vstart: set cephfs root uid/gid to caller (`pr#6255 `_, John Spray) v9.2.1 Infernalis ================= This Infernalis point release fixes several packagins and init script issues, enables the librbd objectmap feature by default, a few librbd bugs, and a range of miscellaneous bug fixes across the system. We recommend that all infernalis v9.2.0 users upgrade. For more detailed information, see :download:`the complete changelog `. Upgrading --------- * Some symbols wrongly exposed by the C++ interface for librados in v9.1.0 and v9.2.0 were removed. If you compiled your own application against librados shipped with these releases, it is very likely referencing these removed symbols. So you will need to recompile it. Notable Changes --------------- * build/ops: Ceph daemon failed to start, because the service name was already used. (`issue#13474 `_, `pr#6833 `_, Chuanhong Wang) * build/ops: ceph upstart script rbdmap.conf incorrectly processes parameters (`issue#13214 `_, `pr#6396 `_, Sage Weil) * build/ops: libunwind package missing on CentOS 7 (`issue#13997 `_, `pr#6845 `_, Loic Dachary) * build/ops: rbd-replay-* moved from ceph-test-dbg to ceph-common-dbg as well (`issue#13785 `_, `pr#6628 `_, Loic Dachary) * build/ops: systemd/ceph-disk@.service assumes /bin/flock (`issue#13975 `_, `pr#6852 `_, Loic Dachary) * build/ops: systemd: no rbdmap systemd unit file (`issue#13374 `_, `pr#6500 `_, Boris Ranto) * common: auth/cephx: large amounts of log are produced by osd (`issue#13610 `_, `pr#6836 `_, Qiankun Zheng) * common: log: Log.cc: Assign LOG_DEBUG priority to syslog calls (`issue#13993 `_, `pr#6993 `_, Brad Hubbard) * crush: crash if we see CRUSH_ITEM_NONE in early rule step (`issue#13477 `_, `pr#6626 `_, Sage Weil) * fs: Ceph file system is not freeing space (`issue#13777 `_, `pr#7431 `_, Yan, Zheng, John Spray) * fs: Ceph-fuse won't start correctly when the option log_max_new in ceph.conf set to zero (`issue#13443 `_, `pr#6395 `_, Wenjun Huang) * fs: Segmentation fault accessing file using fuse mount (`issue#13714 `_, `pr#6853 `_, Yan, Zheng) * librbd: Avoid re-writing old-format image header on resize (`issue#13674 `_, `pr#6630 `_, Jason Dillaman) * librbd: ImageWatcher shouldn't block the notification thread (`issue#14373 `_, `pr#7406 `_, Jason Dillaman) * librbd: QEMU hangs after creating snapshot and stopping VM (`issue#13726 `_, `pr#6632 `_, Jason Dillaman) * librbd: Verify self-managed snapshot functionality on image create (`issue#13633 `_, `pr#7080 `_, Jason Dillaman) * librbd: [ FAILED ] TestLibRBD.SnapRemoveViaLockOwner (`issue#14164 `_, `pr#7079 `_, Jason Dillaman) * librbd: enable feature objectmap (`issue#13558 `_, `pr#6477 `_, xinxin shu) * librbd: fix merge-diff for >2GB diff-files (`issue#14030 `_, `pr#6981 `_, Jason Dillaman) * librbd: flattening an rbd image with active IO can lead to hang (`issue#14092 `_, `issue#14483 `_, `pr#7484 `_, Jason Dillaman) * mds: fix client capabilities during reconnect (client.XXXX isn't responding to mclientcaps warning) (`issue#11482 `_, `pr#6752 `_, Yan, Zheng) * mon: Ceph Pools' MAX AVAIL is 0 if some OSDs' weight is 0 (`issue#13840 `_, `pr#6907 `_, Chengyuan Li) * mon: should not set isvalid = true when cephx_verify_authorizer retur... (`issue#13525 `_, `pr#6392 `_, Ruifeng Yang) * objecter: pool op callback may hang forever. (`issue#13642 `_, `pr#6627 `_, xie xingguo) * objecter: potential null pointer access when do pool_snap_list. (`issue#13639 `_, `pr#6840 `_, xie xingguo) * osd: FileStore: potential memory leak if getattrs fails. (`issue#13597 `_, `pr#6846 `_, xie xingguo) * osd: OSD::build_past_intervals_parallel() shall reset primary and up_primary when begin a new past_interval. (`issue#13471 `_, `pr#6397 `_, xiexingguo) * osd: call on_new_interval on newly split child PG (`issue#13962 `_, `pr#6849 `_, Sage Weil) * osd: ceph-disk list fails on /dev/cciss!c0d0 (`issue#13970 `_, `issue#14230 `_, `pr#6880 `_, Loic Dachary) * osd: ceph-disk: use blkid instead of sgdisk -i (`issue#14080 `_, `pr#7001 `_, Loic Dachary, Ilya Dryomov) * osd: fix race condition during send_failures (`issue#13821 `_, `pr#6694 `_, Sage Weil) * osd: osd/PG.cc: 288: FAILED assert(info.last_epoch_started >= info.history.last_epoch_started) (`issue#14015 `_, `pr#6851 `_, David Zafman) * osd: pgs stuck inconsistent after infernalis upgrade (`issue#13862 `_, `pr#7421 `_, David Zafman) * rbd: TaskFinisher::cancel should remove event from SafeTimer (`issue#14476 `_, `pr#7426 `_, Douglas Fuller) * rbd: cls_rbd: object_map_save should enable checksums (`issue#14280 `_, `pr#7428 `_, Douglas Fuller) * rbd: misdirected op in rbd balance-reads test (`issue#13491 `_, `pr#6629 `_, Jason Dillaman) * rbd: pure virtual method called (`issue#13636 `_, `pr#6633 `_, Jason Dillaman) * rbd: rbd clone issue (`issue#13553 `_, `pr#6474 `_, xinxin shu) * rbd: rbd-replay does not check for EOF and goes to endless loop (`issue#14452 `_, `pr#7427 `_, Mykola Golub) * rbd: unknown argument --quiet in udevadm settle (`issue#13560 `_, `pr#6634 `_, Jason Dillaman) * rgw: init script reload doesn't work on EL7 (`issue#13709 `_, `pr#6650 `_, Hervé Rousseau) * rgw: radosgw-admin --help doesn't show the orphans find command (`issue#14516 `_, `pr#7543 `_, Yehuda Sadeh) * tests: ceph-disk workunit uses configobj (`issue#14004 `_, `pr#6828 `_, Loic Dachary) * tests: fsx failed to compile (`issue#14384 `_, `pr#7429 `_, Greg Farnum) * tests: notification slave needs to wait for master (`issue#13810 `_, `pr#7225 `_, Jason Dillaman) * tests: rebuild exclusive lock test should acquire exclusive lock (`issue#14121 `_, `pr#7038 `_, Jason Dillaman) * tests: testprofile must be removed before it is re-created (`issue#13664 `_, `pr#6449 `_, Loic Dachary) * tests: verify it is possible to reuse an OSD id (`issue#13988 `_, `pr#6882 `_, Loic Dachary) v9.2.0 Infernalis ================= This major release will be the foundation for the next stable series. There have been some major changes since v0.94.x Hammer, and the upgrade process is non-trivial. Please read these release notes carefully. Major Changes from Hammer ------------------------- - *General*: * Ceph daemons are now managed via systemd (with the exception of Ubuntu Trusty, which still uses upstart). * Ceph daemons run as 'ceph' user instead root. * On Red Hat distros, there is also an SELinux policy. - *RADOS*: * The RADOS cache tier can now proxy write operations to the base tier, allowing writes to be handled without forcing migration of an object into the cache. * The SHEC erasure coding support is no longer flagged as experimental. SHEC trades some additional storage space for faster repair. * There is now a unified queue (and thus prioritization) of client IO, recovery, scrubbing, and snapshot trimming. * There have been many improvements to low-level repair tooling (ceph-objectstore-tool). * The internal ObjectStore API has been significantly cleaned up in order to faciliate new storage backends like NewStore. - *RGW*: * The Swift API now supports object expiration. * There are many Swift API compatibility improvements. - *RBD*: * The ``rbd du`` command shows actual usage (quickly, when object-map is enabled). * The object-map feature has seen many stability improvements. * Object-map and exclusive-lock features can be enabled or disabled dynamically. * You can now store user metadata and set persistent librbd options associated with individual images. * The new deep-flatten features allows flattening of a clone and all of its snapshots. (Previously snapshots could not be flattened.) * The export-diff command command is now faster (it uses aio). There is also a new fast-diff feature. * The --size argument can be specified with a suffix for units (e.g., ``--size 64G``). * There is a new ``rbd status`` command that, for now, shows who has the image open/mapped. - *CephFS*: * You can now rename snapshots. * There have been ongoing improvements around administration, diagnostics, and the check and repair tools. * The caching and revocation of client cache state due to unused inodes has been dramatically improved. * The ceph-fuse client behaves better on 32-bit hosts. Distro compatibility -------------------- We have decided to drop support for many older distributions so that we can move to a newer compiler toolchain (e.g., C++11). Although it is still possible to build Ceph on older distributions by installing backported development tools, we are not building and publishing release packages for ceph.com. We now build packages for: * CentOS 7 or later. We have dropped support for CentOS 6 (and other RHEL 6 derivatives, like Scientific Linux 6). * Debian Jessie 8.x or later. Debian Wheezy 7.x's g++ has incomplete support for C++11 (and no systemd). * Ubuntu Trusty 14.04 or later. Ubuntu Precise 12.04 is no longer supported. * Fedora 22 or later. Upgrading from Firefly ---------------------- Upgrading directly from Firefly v0.80.z is not recommended. It is possible to do a direct upgrade, but not without downtime. We recommend that clusters are first upgraded to Hammer v0.94.4 or a later v0.94.z release; only then is it possible to upgrade to Infernalis 9.2.z for an online upgrade (see below). To do an offline upgrade directly from Firefly, all Firefly OSDs must be stopped and marked down before any Infernalis OSDs will be allowed to start up. This fencing is enforced by the Infernalis monitor, so use an upgrade procedure like: #. Upgrade Ceph on monitor hosts #. Restart all ceph-mon daemons #. Upgrade Ceph on all OSD hosts #. Stop all ceph-osd daemons #. Mark all OSDs down with something like:: ceph osd down `seq 0 1000` #. Start all ceph-osd daemons #. Upgrade and restart remaining daemons (ceph-mds, radosgw) Upgrading from Hammer --------------------- * All cluster nodes must first upgrade to Hammer v0.94.4 or a later v0.94.z release; only then is it possible to upgrade to Infernalis 9.2.z. * For all distributions that support systemd (CentOS 7, Fedora, Debian Jessie 8.x, OpenSUSE), ceph daemons are now managed using native systemd files instead of the legacy sysvinit scripts. For example,:: systemctl start ceph.target # start all daemons systemctl status ceph-osd@12 # check status of osd.12 The main notable distro that is *not* yet using systemd is Ubuntu trusty 14.04. (The next Ubuntu LTS, 16.04, will use systemd instead of upstart.) * Ceph daemons now run as user and group ``ceph`` by default. The ceph user has a static UID assigned by Fedora and Debian (also used by derivative distributions like RHEL/CentOS and Ubuntu). On SUSE the ceph user will currently get a dynamically assigned UID when the user is created. If your systems already have a ceph user, upgrading the package will cause problems. We suggest you first remove or rename the existing 'ceph' user and 'ceph' group before upgrading. When upgrading, administrators have two options: #. Add the following line to ``ceph.conf`` on all hosts:: setuser match path = /var/lib/ceph/$type/$cluster-$id This will make the Ceph daemons run as root (i.e., not drop privileges and switch to user ceph) if the daemon's data directory is still owned by root. Newly deployed daemons will be created with data owned by user ceph and will run with reduced privileges, but upgraded daemons will continue to run as root. #. Fix the data ownership during the upgrade. This is the preferred option, but it is more work and can be very time consuming. The process for each host is to: #. Upgrade the ceph package. This creates the ceph user and group. For example:: ceph-deploy install --stable infernalis HOST #. Stop the daemon(s).:: service ceph stop # fedora, centos, rhel, debian stop ceph-all # ubuntu #. Fix the ownership:: chown -R ceph:ceph /var/lib/ceph chown -R ceph:ceph /var/log/ceph #. Restart the daemon(s).:: start ceph-all # ubuntu systemctl start ceph.target # debian, centos, fedora, rhel Alternatively, the same process can be done with a single daemon type, for example by stopping only monitors and chowning only ``/var/lib/ceph/mon``. * The on-disk format for the experimental KeyValueStore OSD backend has changed. You will need to remove any OSDs using that backend before you upgrade any test clusters that use it. * When a pool quota is reached, librados operations now block indefinitely, the same way they do when the cluster fills up. (Previously they would return -ENOSPC). By default, a full cluster or pool will now block. If your librados application can handle ENOSPC or EDQUOT errors gracefully, you can get error returns instead by using the new librados OPERATION_FULL_TRY flag. * The return code for librbd's rbd_aio_read and Image::aio_read API methods no longer returns the number of bytes read upon success. Instead, it returns 0 upon success and a negative value upon failure. * 'ceph scrub', 'ceph compact' and 'ceph sync force are now DEPRECATED. Users should instead use 'ceph mon scrub', 'ceph mon compact' and 'ceph mon sync force'. * 'ceph mon_metadata' should now be used as 'ceph mon metadata'. There is no need to deprecate this command (same major release since it was first introduced). * The `--dump-json` option of "osdmaptool" is replaced by `--dump json`. * The commands of "pg ls-by-{pool,primary,osd}" and "pg ls" now take "recovering" instead of "recovery", to include the recovering pgs in the listed pgs. Notable Changes since Hammer ---------------------------- * aarch64: add optimized version of crc32c (Yazen Ghannam, Steve Capper) * auth: cache/reuse crypto lib key objects, optimize msg signature check (Sage Weil) * auth: reinit NSS after fork() (#11128 Yan, Zheng) * autotools: fix out of tree build (Krxysztof Kosinski) * autotools: improve make check output (Loic Dachary) * buffer: add invalidate_crc() (Piotr Dalek) * buffer: fix zero bug (#12252 Haomai Wang) * buffer: some cleanup (Michal Jarzabek) * build: allow tcmalloc-minimal (Thorsten Behrens) * build: C++11 now supported * build: cmake: fix nss linking (Danny Al-Gaaf) * build: cmake: misc fixes (Orit Wasserman, Casey Bodley) * build: disable LTTNG by default (#11333 Josh Durgin) * build: do not build ceph-dencoder with tcmalloc (#10691 Boris Ranto) * build: fix junit detection on Fedora 22 (Ira Cooper) * build: fix pg ref disabling (William A. Kennington III) * build: fix ppc build (James Page) * build: install-deps: misc fixes (Loic Dachary) * build: install-deps.sh improvements (Loic Dachary) * build: install-deps: support OpenSUSE (Loic Dachary) * build: make_dist_tarball.sh (Sage Weil) * build: many cmake improvements * build: misc cmake fixes (Matt Benjamin) * build: misc fixes (Boris Ranto, Ken Dreyer, Owen Synge) * build: OSX build fixes (Yan, Zheng) * build: remove rest-bench * ceph-authtool: fix return code on error (Gerhard Muntingh) * ceph-detect-init: added Linux Mint (Michal Jarzabek) * ceph-detect-init: robust init system detection (Owen Synge) * ceph-disk: ensure 'zap' only operates on a full disk (#11272 Loic Dachary) * ceph-disk: fix zap sgdisk invocation (Owen Synge, Thorsten Behrens) * ceph-disk: follow ceph-osd hints when creating journal (#9580 Sage Weil) * ceph-disk: handle re-using existing partition (#10987 Loic Dachary) * ceph-disk: improve parted output parsing (#10983 Loic Dachary) * ceph-disk: install pip > 6.1 (#11952 Loic Dachary) * ceph-disk: make suppression work for activate-all and activate-journal (Dan van der Ster) * ceph-disk: many fixes (Loic Dachary, Alfredo Deza) * ceph-disk: fixes to respect init system (Loic Dachary, Owen Synge) * ceph-disk: pass --cluster arg on prepare subcommand (Kefu Chai) * ceph-disk: support for multipath devices (Loic Dachary) * ceph-disk: support NVMe device partitions (#11612 Ilja Slepnev) * ceph: fix 'df' units (Zhe Zhang) * ceph: fix parsing in interactive cli mode (#11279 Kefu Chai) * cephfs-data-scan: many additions, improvements (John Spray) * ceph-fuse: do not require successful remount when unmounting (#10982 Greg Farnum) * ceph-fuse, libcephfs: don't clear COMPLETE when trimming null (Yan, Zheng) * ceph-fuse, libcephfs: drop inode when rmdir finishes (#11339 Yan, Zheng) * ceph-fuse,libcephfs: fix uninline (#11356 Yan, Zheng) * ceph-fuse, libcephfs: hold exclusive caps on dirs we "own" (#11226 Greg Farnum) * ceph-fuse: mostly behave on 32-bit hosts (Yan, Zheng) * ceph: improve error output for 'tell' (#11101 Kefu Chai) * ceph-monstore-tool: fix store-copy (Huangjun) * ceph: new 'ceph daemonperf' command (John Spray, Mykola Golub) * ceph-objectstore-tool: many many improvements (David Zafman) * ceph-objectstore-tool: refactoring and cleanup (John Spray) * ceph-post-file: misc fixes (Joey McDonald, Sage Weil) * ceph_test_rados: test pipelined reads (Zhiqiang Wang) * client: avoid sending unnecessary FLUSHSNAP messages (Yan, Zheng) * client: exclude setfilelock when calculating oldest tid (Yan, Zheng) * client: fix error handling in check_pool_perm (John Spray) * client: fsync waits only for inode's caps to flush (Yan, Zheng) * client: invalidate kernel dcache when cache size exceeds limits (Yan, Zheng) * client: make fsync wait for unsafe dir operations (Yan, Zheng) * client: pin lookup dentry to avoid inode being freed (Yan, Zheng) * common: add descriptions to perfcounters (Kiseleva Alyona) * common: add perf counter descriptions (Alyona Kiseleva) * common: bufferlist performance tuning (Piotr Dalek, Sage Weil) * common: detect overflow of int config values (#11484 Kefu Chai) * common: fix bit_vector extent calc (#12611 Jason Dillaman) * common: fix json parsing of utf8 (#7387 Tim Serong) * common: fix leak of pthread_mutexattr (#11762 Ketor Meng) * common: fix LTTNG vs fork issue (Josh Durgin) * common: fix throttle max change (Henry Chang) * common: make mutex more efficient * common: make work queue addition/removal thread safe (#12662 Jason Dillaman) * common: optracker improvements (Zhiqiang Wang, Jianpeng Ma) * common: PriorityQueue tests (Kefu Chai) * common: some async compression infrastructure (Haomai Wang) * crush: add --check to validate dangling names, max osd id (Kefu Chai) * crush: cleanup, sync with kernel (Ilya Dryomov) * crush: fix crash from invalid 'take' argument (#11602 Shiva Rkreddy, Sage Weil) * crush: fix divide-by-2 in straw2 (#11357 Yann Dupont, Sage Weil) * crush: fix has_v4_buckets (#11364 Sage Weil) * crush: fix subtree base weight on adjust_subtree_weight (#11855 Sage Weil) * crush: respect default replicated ruleset config on map creation (Ilya Dryomov) * crushtool: fix order of operations, usage (Sage Weil) * crypto: fix NSS leak (Jason Dillaman) * crypto: fix unbalanced init/shutdown (#12598 Zheng Yan) * deb: fix rest-bench-dbg and ceph-test-dbg dependendies (Ken Dreyer) * debian: minor package reorg (Ken Dreyer) * deb, rpm: move ceph-objectstore-tool to ceph (Ken Dreyer) * doc: docuemnt object corpus generation (#11099 Alexis Normand) * doc: document region hostnames (Robin H. Johnson) * doc: fix gender neutrality (Alexandre Maragone) * doc: fix install doc (#10957 Kefu Chai) * doc: fix sphinx issues (Kefu Chai) * doc: man page updates (Kefu Chai) * doc: mds data structure docs (Yan, Zheng) * doc: misc updates (Fracois Lafont, Ken Dreyer, Kefu Chai, Owen Synge, Gael Fenet-Garde, Loic Dachary, Yannick Atchy-Dalama, Jiaying Ren, Kevin Caradant, Robert Maxime, Nicolas Yong, Germain Chipaux, Arthur Gorjux, Gabriel Sentucq, Clement Lebrun, Jean-Remi Deveaux, Clair Massot, Robin Tang, Thomas Laumondais, Jordan Dorne, Yuan Zhou, Valentin Thomas, Pierre Chaumont, Benjamin Troquereau, Benjamin Sesia, Vikhyat Umrao, Nilamdyuti Goswami, Vartika Rai, Florian Haas, Loic Dachary, Simon Guinot, Andy Allan, Alistair Israel, Ken Dreyer, Robin Rehu, Lee Revell, Florian Marsylle, Thomas Johnson, Bosse Klykken, Travis Rhoden, Ian Kelling) * doc: swift tempurls (#10184 Abhishek Lekshmanan) * doc: switch doxygen integration back to breathe (#6115 Kefu Chai) * doc: update release schedule docs (Loic Dachary) * erasure-code: cleanup (Kefu Chai) * erasure-code: improve tests (Loic Dachary) * erasure-code: shec: fix recovery bugs (Takanori Nakao, Shotaro Kawaguchi) * erasure-code: update ISA-L to 2.13 (Yuan Zhou) * gmock: switch to submodule (Danny Al-Gaaf, Loic Dachary) * hadoop: add terasort test (Noah Watkins) * init-radosgw: merge with sysv version; fix enumeration (Sage Weil) * java: fix libcephfs bindings (Noah Watkins) * libcephfs: add pread, pwrite (Jevon Qiao) * libcephfs,ceph-fuse: cache cleanup (Zheng Yan) * libcephfs,ceph-fuse: fix request resend on cap reconnect (#10912 Yan, Zheng) * librados: add config observer (Alistair Strachan) * librados: add FULL_TRY and FULL_FORCE flags for dealing with full clusters or pools (Sage Weil) * librados: add src_fadvise_flags for copy-from (Jianpeng Ma) * librados: define C++ flags from C constants (Josh Durgin) * librados: fadvise flags per op (Jianpeng Ma) * librados: fix last_force_resent handling (#11026 Jianpeng Ma) * librados: fix memory leak from C_TwoContexts (Xiong Yiliang) * librados: fix notify completion race (#13114 Sage Weil) * librados: fix striper when stripe_count = 1 and stripe_unit != object_size (#11120 Yan, Zheng) * librados, libcephfs: randomize client nonces (Josh Durgin) * librados: op perf counters (John Spray) * librados: pybind: fix binary omap values (Robin H. Johnson) * librados: pybind: fix write() method return code (Javier Guerra) * librados: respect default_crush_ruleset on pool_create (#11640 Yuan Zhou) * libradosstriper: fix leak (Danny Al-Gaaf) * librbd: add const for single-client-only features (Josh Durgin) * librbd: add deep-flatten operation (Jason Dillaman) * librbd: add purge_on_error cache behavior (Jianpeng Ma) * librbd: allow additional metadata to be stored with the image (Haomai Wang) * librbd: avoid blocking aio API methods (#11056 Jason Dillaman) * librbd: better handling for dup flatten requests (#11370 Jason Dillaman) * librbd: cancel in-flight ops on watch error (#11363 Jason Dillaman) * librbd: default new images to format 2 (#11348 Jason Dillaman) * librbd: fadvise for copy, export, import (Jianpeng Ma) * librbd: fast diff implementation that leverages object map (Jason Dillaman) * librbd: fix fast diff bugs (#11553 Jason Dillaman) * librbd: fix image format detection (Zhiqiang Wang) * librbd: fix lock ordering issue (#11577 Jason Dillaman) * librbd: fix reads larger than the cache size (Lu Shi) * librbd: fix snapshot creation when other snap is active (#11475 Jason Dillaman) * librbd: flatten/copyup fixes (Jason Dillaman) * librbd: handle NOCACHE fadvise flag (Jinapeng Ma) * librbd: lockdep, helgrind validation (Jason Dillaman, Josh Durgin) * librbd: metadata filter fixes (Haomai Wang) * librbd: misc aio fixes (#5488 Jason Dillaman) * librbd: misc rbd fixes (#11478 #11113 #11342 #11380 Jason Dillaman, Zhiqiang Wang) * librbd: new diff_iterate2 API (Jason Dillaman) * librbd: object map rebuild support (Jason Dillaman) * librbd: only update image flags while hold exclusive lock (#11791 Jason Dillaman) * librbd: optionally disable allocation hint (Haomai Wang) * librbd: prevent race between resize requests (#12664 Jason Dillaman) * librbd: readahead fixes (Zhiqiang Wang) * librbd: return result code from close (#12069 Jason Dillaman) * librbd: store metadata, including config options, in image (Haomai Wang) * librbd: tolerate old osds when getting image metadata (#11549 Jason Dillaman) * librbd: use write_full when possible (Zhiqiang Wang) * log: fix data corruption race resulting from log rotation (#12465 Samuel Just) * logrotate.d: prefer service over invoke-rc.d (#11330 Win Hierman, Sage Weil) * mds: add 'damaged' state to MDSMap (John Spray) * mds: add nicknames for perfcounters (John Spray) * mds: avoid emitting cap warnigns before evicting session (John Spray) * mds: avoid getting stuck in XLOCKDONE (#11254 Yan, Zheng) * mds: disable problematic rstat propagation into snap parents (Yan, Zheng) * mds: do not add snapped items to bloom filter (Yan, Zheng) * mds: expose frags via asok (John Spray) * mds: fix expected holes in journal objects (#13167 Yan, Zheng) * mds: fix handling for missing mydir dirfrag (#11641 John Spray) * mds: fix integer truncateion on large client ids (Henry Chang) * mds: fix mydir replica issue with shutdown (#10743 John Spray) * mds: fix out-of-order messages (#11258 Yan, Zheng) * mds: fix rejoin (Yan, Zheng) * mds: fix setting entire file layout in one setxattr (John Spray) * mds: fix shutdown (John Spray) * mds: fix shutdown with strays (#10744 John Spray) * mds: fix SnapServer crash on deleted pool (John Spray) * mds: fix snapshot bugs (Yan, Zheng) * mds: fix stray reintegration (Yan, Zheng) * mds: fix stray handling (John Spray) * mds: fix suicide beacon (John Spray) * mds: flush immediately in do_open_truncate (#11011 John Spray) * mds: handle misc corruption issues (John Spray) * mds: improve dump methods (John Spray) * mds: many fixes (Yan, Zheng, John Spray, Greg Farnum) * mds: many snapshot and stray fixes (Yan, Zheng) * mds: misc fixes (Jianpeng Ma, Dan van der Ster, Zhang Zhi) * mds: misc journal cleanups and fixes (#10368 John Spray) * mds: misc repair improvements (John Spray) * mds: misc snap fixes (Zheng Yan) * mds: misc snapshot fixes (Yan, Zheng) * mds: new SessionMap storage using omap (#10649 John Spray) * mds: persist completed_requests reliably (#11048 John Spray) * mds: reduce memory consumption (Yan, Zheng) * mds: respawn instead of suicide on blacklist (John Spray) * mds: separate safe_pos in Journaler (#10368 John Spray) * mds: snapshot rename support (#3645 Yan, Zheng) * mds: store layout on header object (#4161 John Spray) * mds: throttle purge stray operations (#10390 John Spray) * mds: tolerate clock jumping backwards (#11053 Yan, Zheng) * mds: warn when clients fail to advance oldest_client_tid (#10657 Yan, Zheng) * misc cleanups and fixes (Danny Al-Gaaf) * misc coverity fixes (Danny Al-Gaaf) * misc performance and cleanup (Nathan Cutler, Xinxin Shu) * mon: add cache over MonitorDBStore (Kefu Chai) * mon: add 'mon_metadata ' command (Kefu Chai) * mon: add 'node ls ...' command (Kefu Chai) * mon: add NOFORWARD, OBSOLETE, DEPRECATE flags for mon commands (Joao Eduardo Luis) * mon: add PG count to 'ceph osd df' output (Michal Jarzabek) * mon: 'ceph osd metadata' can dump all osds (Haomai Wang) * mon: clean up, reorg some mon commands (Joao Eduardo Luis) * monclient: flush_log (John Spray) * mon: detect kv backend failures (Sage Weil) * mon: disallow >2 tiers (#11840 Kefu Chai) * mon: disallow ec pools as tiers (#11650 Samuel Just) * mon: do not deactivate last mds (#10862 John Spray) * mon: fix average utilization calc for 'osd df' (Mykola Golub) * mon: fix CRUSH map test for new pools (Sage Weil) * mon: fix log dump crash when debugging (Mykola Golub) * mon: fix mds beacon replies (#11590 Kefu Chai) * mon: fix metadata update race (Mykola Golub) * mon: fix min_last_epoch_clean tracking (Kefu Chai) * mon: fix 'pg ls' sort order, state names (#11569 Kefu Chai) * mon: fix refresh (#11470 Joao Eduardo Luis) * mon: fix variance calc in 'osd df' (Sage Weil) * mon: improve callout to crushtool (Mykola Golub) * mon: make blocked op messages more readable (Jianpeng Ma) * mon: make osd get pool 'all' only return applicable fields (#10891 Michal Jarzabek) * mon: misc scaling fixes (Sage Weil) * mon: normalize erasure-code profile for storage and comparison (Loic Dachary) * mon: only send mon metadata to supporting peers (Sage Weil) * mon: optionally specify osd id on 'osd create' (Mykola Golub) * mon: 'osd tree' fixes (Kefu Chai) * mon: periodic background scrub (Joao Eduardo Luis) * mon: prevent bucket deletion when referenced by a crush rule (#11602 Sage Weil) * mon: prevent pgp_num > pg_num (#12025 Xinxin Shu) * mon: prevent pool with snapshot state from being used as a tier (#11493 Sage Weil) * mon: prime pg_temp when CRUSH map changes (Sage Weil) * mon: refine check_remove_tier checks (#11504 John Spray) * mon: reject large max_mds values (#12222 John Spray) * mon: remove spurious who arg from 'mds rm ...' (John Spray) * mon: streamline session handling, fix memory leaks (Sage Weil) * mon: upgrades must pass through hammer (Sage Weil) * mon: warn on bogus cache tier config (Jianpeng Ma) * msgr: add ceph_perf_msgr tool (Hoamai Wang) * msgr: async: fix seq handling (Haomai Wang) * msgr: async: many many fixes (Haomai Wang) * msgr: simple: fix clear_pipe (#11381 Haomai Wang) * msgr: simple: fix connect_seq assert (Haomai Wang) * msgr: xio: fastpath improvements (Raju Kurunkad) * msgr: xio: fix ip and nonce (Raju Kurunkad) * msgr: xio: improve lane assignment (Vu Pham) * msgr: xio: sync with accellio v1.4 (Vu Pham) * msgr: xio: misc fixes (#10735 Matt Benjamin, Kefu Chai, Danny Al-Gaaf, Raju Kurunkad, Vu Pham, Casey Bodley) * msg: unit tests (Haomai Wang) * objectcacher: misc bug fixes (Jianpeng Ma) * osd: add latency perf counters for tier operations (Xinze Chi) * osd: add misc perfcounters (Xinze Chi) * osd: add simple sleep injection in recovery (Sage Weil) * osd: allow SEEK_HOLE/SEEK_DATA for sparse read (Zhiqiang Wang) * osd: avoid dup omap sets for in pg metadata (Sage Weil) * osd: avoid multiple hit set insertions (Zhiqiang Wang) * osd: avoid transaction append in some cases (Sage Weil) * osd: break PG removal into multiple iterations (#10198 Guang Yang) * osd: cache proxy-write support (Zhiqiang Wang, Samuel Just) * osd: check scrub state when handling map (Jianpeng Ma) * osd: clean up some constness, privateness (Kefu Chai) * osd: clean up temp object if promotion fails (Jianpeng Ma) * osd: configure promotion based on write recency (Zhiqiang Wang) * osd: constrain collections to meta and PGs (normal and temp) (Sage Weil) * osd: don't send dup MMonGetOSDMap requests (Sage Weil, Kefu Chai) * osd: EIO injection (David Zhang) * osd: elminiate txn apend, ECSubWrite copy (Samuel Just) * osd: erasure-code: drop entries according to LRU (Andreas-Joachim Peters) * osd: erasure-code: fix SHEC floating point bug (#12936 Loic Dachary) * osd: erasure-code: update to ISA-L 2.14 (Yuan Zhou) * osd: filejournal: cleanup (David Zafman) * osd: filestore: clone using splice (Jianpeng Ma) * osd: filestore: fix recursive lock (Xinxin Shu) * osd: fix check_for_full (Henry Chang) * osd: fix dirty accounting in make_writeable (Zhiqiang Wang) * osd: fix dup promotion lost op bug (Zhiqiang Wang) * osd: fix endless repair when object is unrecoverable (Jianpeng Ma, Kefu Chai) * osd: fix hitset object naming to use GMT (Kefu Chai) * osd: fix misc memory leaks (Sage Weil) * osd: fix negative degraded stats during backfill (Guang Yang) * osd: fix osdmap dump of blacklist items (John Spray) * osd: fix peek_queue locking in FileStore (Xinze Chi) * osd: fix pg resurrection (#11429 Samuel Just) * osd: fix promotion vs full cache tier (Samuel Just) * osd: fix replay requeue when pg is still activating (#13116 Samuel Just) * osd: fix scrub stat bugs (Sage Weil, Samuel Just) * osd: fix snap flushing from cache tier (again) (#11787 Samuel Just) * osd: fix snap handling on promotion (#11296 Sam Just) * osd: fix temp-clearing (David Zafman) * osd: force promotion for ops EC can't handle (Zhiqiang Wang) * osd: handle log split with overlapping entries (#11358 Samuel Just) * osd: ignore non-existent osds in unfound calc (#10976 Mykola Golub) * osd: improve behavior on machines with large memory pages (Steve Capper) * osd: include a temp namespace within each collection/pgid (Sage Weil) * osd: increase default max open files (Owen Synge) * osd: keyvaluestore: misc fixes (Varada Kari) * osd: low and high speed flush modes (Mingxin Liu) * osd: make suicide timeouts individually configurable (Samuel Just) * osd: merge multiple setattr calls into a setattrs call (Xinxin Shu) * osd: misc fixes (Ning Yao, Kefu Chai, Xinze Chi, Zhiqiang Wang, Jianpeng Ma) * osd: move scrub in OpWQ (Samuel Just) * osd: newstore prototype (Sage Weil) * osd: ObjectStore internal API refactor (Sage Weil) * osd: peer_features includes self (David Zafman) * osd: pool size change triggers new interval (#11771 Samuel Just) * osd: prepopulate needs_recovery_map when only one peer has missing (#9558 Guang Yang) * osd: randomize scrub times (#10973 Kefu Chai) * osd: recovery, peering fixes (#11687 Samuel Just) * osd: refactor scrub and digest recording (Sage Weil) * osd: refuse first write to EC object at non-zero offset (Jianpeng Ma) * osd: relax reply order on proxy read (#11211 Zhiqiang Wang) * osd: require firefly features (David Zafman) * osd: set initial crush weight with more precision (Sage Weil) * osd: SHEC no longer experimental * osd: skip promotion for flush/evict op (Zhiqiang Wang) * osd: stripe over small xattrs to fit in XFS's 255 byte inline limit (Sage Weil, Ning Yao) * osd: sync object_map on syncfs (Samuel Just) * osd: take excl lock of op is rw (Samuel Just) * osd: throttle evict ops (Yunchuan Wen) * osd: upgrades must pass through hammer (Sage Weil) * osd: use a temp object for recovery (Sage Weil) * osd: use blkid to collection partition information (Joseph Handzik) * osd: use SEEK_HOLE / SEEK_DATA for sparse copy (Xinxin Shu) * osd: WBThrottle cleanups (Jianpeng Ma) * osd: write journal header on clean shutdown (Xinze Chi) * osdc/Objecter: allow per-pool calls to op_cancel_writes (John Spray) * os/filestore: enlarge getxattr buffer size (Jianpeng Ma) * pybind: pep8 cleanups (Danny Al-Gaaf) * pycephfs: many fixes for bindings (Haomai Wang) * qa: fix filelock_interrupt.py test (Yan, Zheng) * qa: improve ceph-disk tests (Loic Dachary) * qa: improve docker build layers (Loic Dachary) * qa: run-make-check.sh script (Loic Dachary) * rados: add --striper option to use libradosstriper (#10759 Sebastien Ponce) * rados: bench: add --no-verify option to improve performance (Piotr Dalek) * rados bench: misc fixes (Dmitry Yatsushkevich) * rados: fix error message on failed pool removal (Wido den Hollander) * radosgw-admin: add 'bucket check' function to repair bucket index (Yehuda Sadeh) * radosgw-admin: fix subuser modify output (#12286 Guce) * rados: handle --snapid arg properly (Abhishek Lekshmanan) * rados: improve bench buffer handling, performance (Piotr Dalek) * rados: misc bench fixes (Dmitry Yatsushkevich) * rados: new pool import implementation (John Spray) * rados: translate errno to string in CLI (#10877 Kefu Chai) * rbd: accept map options config option (Ilya Dryomov) * rbd: add disk usage tool (#7746 Jason Dillaman) * rbd: allow unmapping by spec (Ilya Dryomov) * rbd: cli: fix arg parsing with --io-pattern (Dmitry Yatsushkevich) * rbd: deprecate --new-format option (Jason Dillman) * rbd: fix error messages (#2862 Rajesh Nambiar) * rbd: fix link issues (Jason Dillaman) * rbd: improve CLI arg parsing, usage (Ilya Dryomov) * rbd: rbd-replay-prep and rbd-replay improvements (Jason Dillaman) * rbd: recognize queue_depth kernel option (Ilya Dryomov) * rbd: support G and T units for CLI (Abhishek Lekshmanan) * rbd: update rbd man page (Ilya Dryomov) * rbd: update xfstests tests (Douglas Fuller) * rbd: use image-spec and snap-spec in help (Vikhyat Umrao, Ilya Dryomov) * rest-bench: misc fixes (Shawn Chen) * rest-bench: support https (#3968 Yuan Zhou) * rgw: add max multipart upload parts (#12146 Abshishek Dixit) * rgw: add missing headers to Swift container details (#10666 Ahmad Faheem, Dmytro Iurchenko) * rgw: add stats to headers for account GET (#10684 Yuan Zhou) * rgw: add Trasnaction-Id to response (Abhishek Dixit) * rgw: add X-Timestamp for Swift containers (#10938 Radoslaw Zarzynski) * rgw: always check if token is expired (#11367 Anton Aksola, Riku Lehto) * rgw: conversion tool to repair broken multipart objects (#12079 Yehuda Sadeh) * rgw: document layout of pools and objects (Pete Zaitcev) * rgw: do not enclose bucket header in quotes (#11860 Wido den Hollander) * rgw: do not prefetch data for HEAD requests (Guang Yang) * rgw: do not preserve ACLs when copying object (#12370 Yehuda Sadeh) * rgw: do not set content-type if length is 0 (#11091 Orit Wasserman) * rgw: don't clobber bucket/object owner when setting ACLs (#10978 Yehuda Sadeh) * rgw: don't use end_marker for namespaced object listing (#11437 Yehuda Sadeh) * rgw: don't use rgw_socket_path if frontend is configured (#11160 Yehuda Sadeh) * rgw: enforce Content-Length for POST on Swift cont/obj (#10661 Radoslaw Zarzynski) * rgw: error out if frontend did not send all data (#11851 Yehuda Sadeh) * rgw: expose the number of unhealthy workers through admin socket (Guang Yang) * rgw: fail if parts not specified on multipart upload (#11435 Yehuda Sadeh) * rgw: fix assignment of copy obj attributes (#11563 Yehuda Sadeh) * rgw: fix broken stats in container listing (#11285 Radoslaw Zarzynski) * rgw: fix bug in domain/subdomain splitting (Robin H. Johnson) * rgw: fix casing of Content-Type header (Robin H. Johnson) * rgw: fix civetweb max threads (#10243 Yehuda Sadeh) * rgw: fix Connection: header handling (#12298 Wido den Hollander) * rgw: fix copy metadata, support X-Copied-From for swift (#10663 Radoslaw Zarzynski) * rgw: fix data corruptions race condition (#11749 Wuxingyi) * rgw: fix decoding of X-Object-Manifest from GET on Swift DLO (Radslow Rzarzynski) * rgw: fix GET on swift account when limit == 0 (#10683 Radoslaw Zarzynski) * rgw: fix handling empty metadata items on Swift container (#11088 Radoslaw Zarzynski) * rgw: fix JSON response when getting user quota (#12117 Wuxingyi) * rgw: fix locator for objects starting with _ (#11442 Yehuda Sadeh) * rgw: fix log rotation (Wuxingyi) * rgw: fix mulitipart upload in retry path (#11604 Yehuda Sadeh) * rgw: fix quota enforcement on POST (#11323 Sergey Arkhipov) * rgw: fix reset_loc (#11974 Yehuda Sadeh) * rgw: fix return code on missing upload (#11436 Yehuda Sadeh) * rgw: fix sysvinit script * rgw: fix sysvinit script w/ multiple instances (Sage Weil, Pavan Rallabhandi) * rgw: force content_type for swift bucket stats requests (#12095 Orit Wasserman) * rgw: force content type header on responses with no body (#11438 Orit Wasserman) * rgw: generate Date header for civetweb (#10873 Radoslaw Zarzynski) * rgw: generate new object tag when setting attrs (#11256 Yehuda Sadeh) * rgw: improve content-length env var handling (#11419 Robin H. Johnson) * rgw: improved support for swift account metadata (Radoslaw Zarzynski) * rgw: improve handling of already removed buckets in expirer (Radoslaw Rzarzynski) * rgw: issue aio for first chunk before flush cached data (#11322 Guang Yang) * rgw: log to /var/log/ceph instead of /var/log/radosgw * rgw: make init script wait for radosgw to stop (#11140 Dmitry Yatsushkevich) * rgw: make max put size configurable (#6999 Yuan Zhou) * rgw: make quota/gc threads configurable (#11047 Guang Yang) * rgw: make read user buckets backward compat (#10683 Radoslaw Zarzynski) * rgw: merge manifests properly with prefix override (#11622 Yehuda Sadeh) * rgw: only scan for objects not in a namespace (#11984 Yehuda Sadeh) * rgw: orphan detection tool (Yehuda Sadeh) * rgw: pass in civetweb configurables (#10907 Yehuda Sadeh) * rgw: rectify 202 Accepted in PUT response (#11148 Radoslaw Zarzynski) * rgw: remove meta file after deleting bucket (#11149 Orit Wasserman) * rgw: remove trailing :port from HTTP_HOST header (Sage Weil) * rgw: return 412 on bad limit when listing buckets (#11613 Yehuda Sadeh) * rgw: rework X-Trans-Id header to conform with Swift API (Radoslaw Rzarzynski) * rgw: s3 encoding-type for get bucket (Jeff Weber) * rgw: send ETag, Last-Modified for swift (#11087 Radoslaw Zarzynski) * rgw: set content length on container GET, PUT, DELETE, HEAD (#10971, #11036 Radoslaw Zarzynski) * rgw: set max buckets per user in ceph.conf (Vikhyat Umrao) * rgw: shard work over multiple librados instances (Pavan Rallabhandi) * rgw: support end marker on swift container GET (#10682 Radoslaw Zarzynski) * rgw: support for Swift expiration API (Radoslaw Rzarzynski, Yehuda Sadeh) * rgw: swift: allow setting attributes with COPY (#10662 Ahmad Faheem, Dmytro Iurchenko) * rgw: swift: do not override sent content type (#12363 Orit Wasserman) * rgw: swift: enforce Content-Type in response (#12157 Radoslaw Zarzynski) * rgw: swift: fix account listing (#11501 Radoslaw Zarzynski) * rgw: swift: fix metadata handling on copy (#10645 Radoslaw Zarzynski) * rgw: swift: send Last-Modified header (#10650 Radoslaw Zarzynski) * rgw: swift: set Content-Length for account GET (#12158 Radoslav Zarzynski) * rgw: swift: set content-length on keystone tokens (#11473 Herv Rousseau) * rgw: update keystone cache with token info (#11125 Yehuda Sadeh) * rgw: update to latest civetweb, enable config for IPv6 (#10965 Yehuda Sadeh) * rgw: use attrs from source bucket on copy (#11639 Javier M. Mellid) * rgw: use correct oid for gc chains (#11447 Yehuda Sadeh) * rgw: user rm is idempotent (Orit Wasserman) * rgw: use unique request id for civetweb (#10295 Orit Wasserman) * rocksdb: add perf counters for get/put latency (Xinxin Shu) * rocksdb, leveldb: fix compact_on_mount (Xiaoxi Chen) * rocksdb: pass options as single string (Xiaoxi Chen) * rocksdb: update to latest (Xiaoxi Chen) * rpm: add suse firewall files (Tim Serong) * rpm: always rebuild and install man pages for rpm (Owen Synge) * rpm: loosen ceph-test dependencies (Ken Dreyer) * rpm: many spec file fixes (Owen Synge, Ken Dreyer) * rpm: misc fixes (Boris Ranto, Owen Synge, Ken Dreyer, Ira Cooper) * rpm: misc systemd and SUSE fixes (Owen Synge, Nathan Cutler) * selinux policy (Boris Ranto, Milan Broz) * systemd: logrotate fixes (Tim Serong, Lars Marowsky-Bree, Nathan Cutler) * systemd: many fixes (Sage Weil, Owen Synge, Boris Ranto, Dan van der Ster) * systemd: run daemons as user ceph * sysvinit compat: misc fixes (Owen Synge) * test: misc fs test improvements (John Spray, Loic Dachary) * test: python tests, linter cleanup (Alfredo Deza) * tests: fixes for rbd xstests (Douglas Fuller) * tests: fix tiering health checks (Loic Dachary) * tests for low-level performance (Haomai Wang) * tests: many ec non-regression improvements (Loic Dachary) * tests: many many ec test improvements (Loic Dachary) * upstart: throttle restarts (#11798 Sage Weil, Greg Farnum) v9.1.0 Infernalis release candidate =================================== This is the first Infernalis release candidate. There have been some major changes since Hammer, and the upgrade process is non-trivial. Please read carefully. Getting the release candidate ----------------------------- The v9.1.0 packages are pushed to the development release repositories:: http://download.ceph.com/rpm-testing http://download.ceph.com/debian-testing For for info, see:: http://docs.ceph.com/docs/master/install/get-packages/ Or install with ceph-deploy via:: ceph-deploy install --testing HOST Known issues ------------ * librbd and librados ABI compatibility is broken. Be careful installing this RC on client machines (e.g., those running qemu). It will be fixed in the final v9.2.0 release. Major Changes from Hammer ------------------------- - *General*: * Ceph daemons are now managed via systemd (with the exception of Ubuntu Trusty, which still uses upstart). * Ceph daemons run as 'ceph' user instead of root. * On Red Hat distros, there is also an SELinux policy. - *RADOS*: * The RADOS cache tier can now proxy write operations to the base tier, allowing writes to be handled without forcing migration of an object into the cache. * The SHEC erasure coding support is no longer flagged as experimental. SHEC trades some additional storage space for faster repair. * There is now a unified queue (and thus prioritization) of client IO, scrubbing, and snapshot trimming. * There have been many improvements to low-level repair tooling (ceph-objectstore-tool). * The internal ObjectStore API has been significantly cleaned up in order to faciliate new storage backends like NewStore. - *RGW*: * The Swift API now supports object expiration. * There are many Swift API compatibility improvements. - *RBD*: * The ``rbd du`` command shows actual usage (quickly, when object-map is enabled). * The object-map feature has seen many stability improvements. * Object-map and exclusive-lock features can be enabled or disabled dynamically. * You can now store user metadata and set persistent librbd options associated with individual images. * The new deep-flatten features allows flattening of a clone and all of its snapshots. (Previously snapshots could not be flattened.) * The export-diff command command is now faster (it uses aio). There is also a new fast-diff feature. * The --size argument can be specified with a suffix for units (e.g., ``--size 64G``). * There is a new ``rbd status`` command that, for now, shows who has the image open/mapped. - *CephFS*: * You can now rename snapshots. * There have been ongoing improvements around administration, diagnostics, and the check and repair tools. * The caching and revocation of client cache state due to unused inodes has been dramatically improved. * The ceph-fuse client behaves better on 32-bit hosts. Distro compatibility -------------------- We have decided to drop support for many older distributions so that we can move to a newer compiler toolchain (e.g., C++11). Although it is still possible to build Ceph on older distributions by installing backported development tools, we are not building and publishing release packages for them on ceph.com. In particular, * CentOS 7 or later; we have dropped support for CentOS 6 (and other RHEL 6 derivatives, like Scientific Linux 6). * Debian Jessie 8.x or later; Debian Wheezy 7.x's g++ has incomplete support for C++11 (and no systemd). * Ubuntu Trusty 14.04 or later; Ubuntu Precise 12.04 is no longer supported. * Fedora 22 or later. Upgrading from Firefly ---------------------- Upgrading directly from Firefly v0.80.z is not possible. All clusters must first upgrade to Hammer v0.94.4 or a later v0.94.z release; only then is it possible to do online upgrade to Infernalis 9.2.z. User can upgrade to latest hammer v0.94.z from gitbuilder with(also refer the hammer release notes for more details):: ceph-deploy install --release hammer HOST Upgrading from Hammer --------------------- * All cluster nodes must first upgrade to Hammer v0.94.4 or a later v0.94.z release; only then is it possible to do online upgrade to Infernalis 9.2.z. * For all distributions that support systemd (CentOS 7, Fedora, Debian Jessie 8.x, OpenSUSE), ceph daemons are now managed using native systemd files instead of the legacy sysvinit scripts. For example,:: systemctl start ceph.target # start all daemons systemctl status ceph-osd@12 # check status of osd.12 The main notable distro that is *not* yet using systemd is Ubuntu trusty 14.04. (The next Ubuntu LTS, 16.04, will use systemd instead of upstart.) * Ceph daemons now run as user and group ``ceph`` by default. The ceph user has a static UID assigned by Fedora and Debian (also used by derivative distributions like RHEL/CentOS and Ubuntu). On SUSE the ceph user will currently get a dynamically assigned UID when the user is created. If your systems already have a ceph user, the package upgrade process will usually fail with an error. We suggest you first remove or rename the existing 'ceph' user and then upgrade. When upgrading, administrators have two options: #. Add the following line to ``ceph.conf`` on all hosts:: setuser match path = /var/lib/ceph/$type/$cluster-$id This will make the Ceph daemons run as root (i.e., not drop privileges and switch to user ceph) if the daemon's data directory is still owned by root. Newly deployed daemons will be created with data owned by user ceph and will run with reduced privileges, but upgraded daemons will continue to run as root. #. Fix the data ownership during the upgrade. This is the preferred option, but is more work. The process for each host would be to: #. Upgrade the ceph package. This creates the ceph user and group. For example:: ceph-deploy install --stable infernalis HOST #. Stop the daemon(s).:: service ceph stop # fedora, centos, rhel, debian stop ceph-all # ubuntu #. Fix the ownership:: chown -R ceph:ceph /var/lib/ceph chown -R ceph:ceph /var/log/ceph #. Restart the daemon(s).:: start ceph-all # ubuntu systemctl start ceph.target # debian, centos, fedora, rhel * The on-disk format for the experimental KeyValueStore OSD backend has changed. You will need to remove any OSDs using that backend before you upgrade any test clusters that use it. Upgrade notes ------------- * When a pool quota is reached, librados operations now block indefinitely, the same way they do when the cluster fills up. (Previously they would return -ENOSPC). By default, a full cluster or pool will now block. If your librados application can handle ENOSPC or EDQUOT errors gracefully, you can get error returns instead by using the new librados OPERATION_FULL_TRY flag. Notable changes --------------- NOTE: These notes are somewhat abbreviated while we find a less time-consuming process for generating them. * build: C++11 now supported * build: many cmake improvements * build: OSX build fixes (Yan, Zheng) * build: remove rest-bench * ceph-disk: many fixes (Loic Dachary) * ceph-disk: support for multipath devices (Loic Dachary) * ceph-fuse: mostly behave on 32-bit hosts (Yan, Zheng) * ceph-objectstore-tool: many improvements (David Zafman) * common: bufferlist performance tuning (Piotr Dalek, Sage Weil) * common: make mutex more efficient * common: some async compression infrastructure (Haomai Wang) * librados: add FULL_TRY and FULL_FORCE flags for dealing with full clusters or pools (Sage Weil) * librados: fix notify completion race (#13114 Sage Weil) * librados, libcephfs: randomize client nonces (Josh Durgin) * librados: pybind: fix binary omap values (Robin H. Johnson) * librbd: fix reads larger than the cache size (Lu Shi) * librbd: metadata filter fixes (Haomai Wang) * librbd: use write_full when possible (Zhiqiang Wang) * mds: avoid emitting cap warnigns before evicting session (John Spray) * mds: fix expected holes in journal objects (#13167 Yan, Zheng) * mds: fix SnapServer crash on deleted pool (John Spray) * mds: many fixes (Yan, Zheng, John Spray, Greg Farnum) * mon: add cache over MonitorDBStore (Kefu Chai) * mon: 'ceph osd metadata' can dump all osds (Haomai Wang) * mon: detect kv backend failures (Sage Weil) * mon: fix CRUSH map test for new pools (Sage Weil) * mon: fix min_last_epoch_clean tracking (Kefu Chai) * mon: misc scaling fixes (Sage Weil) * mon: streamline session handling, fix memory leaks (Sage Weil) * mon: upgrades must pass through hammer (Sage Weil) * msg/async: many fixes (Haomai Wang) * osd: cache proxy-write support (Zhiqiang Wang, Samuel Just) * osd: configure promotion based on write recency (Zhiqiang Wang) * osd: don't send dup MMonGetOSDMap requests (Sage Weil, Kefu Chai) * osd: erasure-code: fix SHEC floating point bug (#12936 Loic Dachary) * osd: erasure-code: update to ISA-L 2.14 (Yuan Zhou) * osd: fix hitset object naming to use GMT (Kefu Chai) * osd: fix misc memory leaks (Sage Weil) * osd: fix peek_queue locking in FileStore (Xinze Chi) * osd: fix promotion vs full cache tier (Samuel Just) * osd: fix replay requeue when pg is still activating (#13116 Samuel Just) * osd: fix scrub stat bugs (Sage Weil, Samuel Just) * osd: force promotion for ops EC can't handle (Zhiqiang Wang) * osd: improve behavior on machines with large memory pages (Steve Capper) * osd: merge multiple setattr calls into a setattrs call (Xinxin Shu) * osd: newstore prototype (Sage Weil) * osd: ObjectStore internal API refactor (Sage Weil) * osd: SHEC no longer experimental * osd: throttle evict ops (Yunchuan Wen) * osd: upgrades must pass through hammer (Sage Weil) * osd: use SEEK_HOLE / SEEK_DATA for sparse copy (Xinxin Shu) * rbd: rbd-replay-prep and rbd-replay improvements (Jason Dillaman) * rgw: expose the number of unhealthy workers through admin socket (Guang Yang) * rgw: fix casing of Content-Type header (Robin H. Johnson) * rgw: fix decoding of X-Object-Manifest from GET on Swift DLO (Radslow Rzarzynski) * rgw: fix sysvinit script * rgw: fix sysvinit script w/ multiple instances (Sage Weil, Pavan Rallabhandi) * rgw: improve handling of already removed buckets in expirer (Radoslaw Rzarzynski) * rgw: log to /var/log/ceph instead of /var/log/radosgw * rgw: rework X-Trans-Id header to be conform with Swift API (Radoslaw Rzarzynski) * rgw: s3 encoding-type for get bucket (Jeff Weber) * rgw: set max buckets per user in ceph.conf (Vikhyat Umrao) * rgw: support for Swift expiration API (Radoslaw Rzarzynski, Yehuda Sadeh) * rgw: user rm is idempotent (Orit Wasserman) * selinux policy (Boris Ranto, Milan Broz) * systemd: many fixes (Sage Weil, Owen Synge, Boris Ranto, Dan van der Ster) * systemd: run daemons as user ceph v9.0.3 ====== This is the second to last batch of development work for the Infernalis cycle. The most intrusive change is an internal (non user-visible) change to the OSD's ObjectStore interface. Many fixes and improvements elsewhere across RGW, RBD, and another big pile of CephFS scrub/repair improvements. Upgrading --------- * The return code for librbd's rbd_aio_read and Image::aio_read API methods no longer returns the number of bytes read upon success. Instead, it returns 0 upon success and a negative value upon failure. * 'ceph scrub', 'ceph compact' and 'ceph sync force' are now deprecated. Users should instead use 'ceph mon scrub', 'ceph mon compact' and 'ceph mon sync force'. * 'ceph mon_metadata' should now be used as 'ceph mon metadata'. * The `--dump-json` option of "osdmaptool" is replaced by `--dump json`. * The commands of 'pg ls-by-{pool,primary,osd}' and 'pg ls' now take 'recovering' instead of 'recovery' to include the recovering pgs in the listed pgs. Notable Changes --------------- * autotools: fix out of tree build (Krxysztof Kosinski) * autotools: improve make check output (Loic Dachary) * buffer: add invalidate_crc() (Piotr Dalek) * buffer: fix zero bug (#12252 Haomai Wang) * build: fix junit detection on Fedora 22 (Ira Cooper) * ceph-disk: install pip > 6.1 (#11952 Loic Dachary) * cephfs-data-scan: many additions, improvements (John Spray) * ceph: improve error output for 'tell' (#11101 Kefu Chai) * ceph-objectstore-tool: misc improvements (David Zafman) * ceph-objectstore-tool: refactoring and cleanup (John Spray) * ceph_test_rados: test pipelined reads (Zhiqiang Wang) * common: fix bit_vector extent calc (#12611 Jason Dillaman) * common: make work queue addition/removal thread safe (#12662 Jason Dillaman) * common: optracker improvements (Zhiqiang Wang, Jianpeng Ma) * crush: add --check to validate dangling names, max osd id (Kefu Chai) * crush: cleanup, sync with kernel (Ilya Dryomov) * crush: fix subtree base weight on adjust_subtree_weight (#11855 Sage Weil) * crypo: fix NSS leak (Jason Dillaman) * crypto: fix unbalanced init/shutdown (#12598 Zheng Yan) * doc: misc updates (Kefu Chai, Owen Synge, Gael Fenet-Garde, Loic Dachary, Yannick Atchy-Dalama, Jiaying Ren, Kevin Caradant, Robert Maxime, Nicolas Yong, Germain Chipaux, Arthur Gorjux, Gabriel Sentucq, Clement Lebrun, Jean-Remi Deveaux, Clair Massot, Robin Tang, Thomas Laumondais, Jordan Dorne, Yuan Zhou, Valentin Thomas, Pierre Chaumont, Benjamin Troquereau, Benjamin Sesia, Vikhyat Umrao) * erasure-code: cleanup (Kefu Chai) * erasure-code: improve tests (Loic Dachary) * erasure-code: shec: fix recovery bugs (Takanori Nakao, Shotaro Kawaguchi) * libcephfs: add pread, pwrite (Jevon Qiao) * libcephfs,ceph-fuse: cache cleanup (Zheng Yan) * librados: add src_fadvise_flags for copy-from (Jianpeng Ma) * librados: respect default_crush_ruleset on pool_create (#11640 Yuan Zhou) * librbd: fadvise for copy, export, import (Jianpeng Ma) * librbd: handle NOCACHE fadvise flag (Jinapeng Ma) * librbd: optionally disable allocation hint (Haomai Wang) * librbd: prevent race between resize requests (#12664 Jason Dillaman) * log: fix data corruption race resulting from log rotation (#12465 Samuel Just) * mds: expose frags via asok (John Spray) * mds: fix setting entire file layout in one setxattr (John Spray) * mds: fix shutdown (John Spray) * mds: handle misc corruption issues (John Spray) * mds: misc fixes (Jianpeng Ma, Dan van der Ster, Zhang Zhi) * mds: misc snap fixes (Zheng Yan) * mds: store layout on header object (#4161 John Spray) * misc performance and cleanup (Nathan Cutler, Xinxin Shu) * mon: add NOFORWARD, OBSOLETE, DEPRECATE flags for mon commands (Joao Eduardo Luis) * mon: add PG count to 'ceph osd df' output (Michal Jarzabek) * mon: clean up, reorg some mon commands (Joao Eduardo Luis) * mon: disallow >2 tiers (#11840 Kefu Chai) * mon: fix log dump crash when debugging (Mykola Golub) * mon: fix metadata update race (Mykola Golub) * mon: fix refresh (#11470 Joao Eduardo Luis) * mon: make blocked op messages more readable (Jianpeng Ma) * mon: only send mon metadata to supporting peers (Sage Weil) * mon: periodic background scrub (Joao Eduardo Luis) * mon: prevent pgp_num > pg_num (#12025 Xinxin Shu) * mon: reject large max_mds values (#12222 John Spray) * msgr: add ceph_perf_msgr tool (Hoamai Wang) * msgr: async: fix seq handling (Haomai Wang) * msgr: xio: fastpath improvements (Raju Kurunkad) * msgr: xio: sync with accellio v1.4 (Vu Pham) * osd: clean up temp object if promotion fails (Jianpeng Ma) * osd: constrain collections to meta and PGs (normal and temp) (Sage Weil) * osd: filestore: clone using splice (Jianpeng Ma) * osd: filestore: fix recursive lock (Xinxin Shu) * osd: fix dup promotion lost op bug (Zhiqiang Wang) * osd: fix temp-clearing (David Zafman) * osd: include a temp namespace within each collection/pgid (Sage Weil) * osd: low and high speed flush modes (Mingxin Liu) * osd: peer_features includes self (David Zafman) * osd: recovery, peering fixes (#11687 Samuel Just) * osd: require firefly features (David Zafman) * osd: set initial crush weight with more precision (Sage Weil) * osd: use a temp object for recovery (Sage Weil) * osd: use blkid to collection partition information (Joseph Handzik) * rados: add --striper option to use libradosstriper (#10759 Sebastien Ponce) * radosgw-admin: fix subuser modify output (#12286 Guce) * rados: handle --snapid arg properly (Abhishek Lekshmanan) * rados: improve bench buffer handling, performance (Piotr Dalek) * rados: new pool import implementation (John Spray) * rbd: fix link issues (Jason Dillaman) * rbd: improve CLI arg parsing, usage (Ilya Dryomov) * rbd: recognize queue_depth kernel option (Ilya Dryomov) * rbd: support G and T units for CLI (Abhishek Lekshmanan) * rbd: use image-spec and snap-spec in help (Vikhyat Umrao, Ilya Dryomov) * rest-bench: misc fixes (Shawn Chen) * rest-bench: support https (#3968 Yuan Zhou) * rgw: add max multipart upload parts (#12146 Abshishek Dixit) * rgw: add Trasnaction-Id to response (Abhishek Dixit) * rgw: document layout of pools and objects (Pete Zaitcev) * rgw: do not preserve ACLs when copying object (#12370 Yehuda Sadeh) * rgw: fix Connection: header handling (#12298 Wido den Hollander) * rgw: fix data corruptions race condition (#11749 Wuxingyi) * rgw: fix JSON response when getting user quota (#12117 Wuxingyi) * rgw: force content_type for swift bucket stats requests (#12095 Orit Wasserman) * rgw: improved support for swift account metadata (Radoslaw Zarzynski) * rgw: make max put size configurable (#6999 Yuan Zhou) * rgw: orphan detection tool (Yehuda Sadeh) * rgw: swift: do not override sent content type (#12363 Orit Wasserman) * rgw: swift: set Content-Length for account GET (#12158 Radoslav Zarzynski) * rpm: always rebuild and install man pages for rpm (Owen Synge) * rpm: misc fixes (Boris Ranto, Owen Synge, Ken Dreyer, Ira Cooper) * systemd: logrotate fixes (Tim Seron, Lars Marowsky-Bree, Nathan Cutler) * sysvinit compat: misc fixes (Owen Synge) * test: misc fs test improvements (John Spray, Loic Dachary) * test: python tests, linter cleanup (Alfredo Deza) v9.0.2 ====== This development release features more of the OSD work queue unification, randomized osd scrub times, a huge pile of librbd fixes, more MDS repair and snapshot fixes, and a significant amount of work on the tests and build infrastructure. Notable Changes --------------- * buffer: some cleanup (Michal Jarzabek) * build: cmake: fix nss linking (Danny Al-Gaaf) * build: cmake: misc fixes (Orit Wasserman, Casey Bodley) * build: install-deps: misc fixes (Loic Dachary) * build: make_dist_tarball.sh (Sage Weil) * ceph-detect-init: added Linux Mint (Michal Jarzabek) * ceph-detect-init: robust init system detection (Owen Synge, Loic Dachary) * ceph-disk: ensure 'zap' only operates on a full disk (#11272 Loic Dachary) * ceph-disk: misc fixes to respect init system (Loic Dachary, Owen Synge) * ceph-disk: support NVMe device partitions (#11612 Ilja Slepnev) * ceph: fix 'df' units (Zhe Zhang) * ceph: fix parsing in interactive cli mode (#11279 Kefu Chai) * ceph-objectstore-tool: many many changes (David Zafman) * ceph-post-file: misc fixes (Joey McDonald, Sage Weil) * client: avoid sending unnecessary FLUSHSNAP messages (Yan, Zheng) * client: exclude setfilelock when calculating oldest tid (Yan, Zheng) * client: fix error handling in check_pool_perm (John Spray) * client: fsync waits only for inode's caps to flush (Yan, Zheng) * client: invalidate kernel dcache when cache size exceeds limits (Yan, Zheng) * client: make fsync wait for unsafe dir operations (Yan, Zheng) * client: pin lookup dentry to avoid inode being freed (Yan, Zheng) * common: detect overflow of int config values (#11484 Kefu Chai) * common: fix json parsing of utf8 (#7387 Tim Serong) * common: fix leak of pthread_mutexattr (#11762 Ketor Meng) * crush: respect default replicated ruleset config on map creation (Ilya Dryomov) * deb, rpm: move ceph-objectstore-tool to ceph (Ken Dreyer) * doc: man page updates (Kefu Chai) * doc: misc updates (#11396 Nilamdyuti, Fracois Lafont, Ken Dreyer, Kefu Chai) * init-radosgw: merge with sysv version; fix enumeration (Sage Weil) * librados: add config observer (Alistair Strachan) * librbd: add const for single-client-only features (Josh Durgin) * librbd: add deep-flatten operation (Jason Dillaman) * librbd: avoid blocking aio API methods (#11056 Jason Dillaman) * librbd: fix fast diff bugs (#11553 Jason Dillaman) * librbd: fix image format detection (Zhiqiang Wang) * librbd: fix lock ordering issue (#11577 Jason Dillaman) * librbd: flatten/copyup fixes (Jason Dillaman) * librbd: lockdep, helgrind validation (Jason Dillaman, Josh Durgin) * librbd: only update image flags while hold exclusive lock (#11791 Jason Dillaman) * librbd: return result code from close (#12069 Jason Dillaman) * librbd: tolerate old osds when getting image metadata (#11549 Jason Dillaman) * mds: do not add snapped items to bloom filter (Yan, Zheng) * mds: fix handling for missing mydir dirfrag (#11641 John Spray) * mds: fix rejoin (Yan, Zheng) * mds: fix stra reintegration (Yan, Zheng) * mds: fix suicide beason (John Spray) * mds: misc repair improvements (John Spray) * mds: misc snapshot fixes (Yan, Zheng) * mds: respawn instead of suicide on blacklist (John Spray) * misc coverity fixes (Danny Al-Gaaf) * mon: add 'mon_metadata ' command (Kefu Chai) * mon: add 'node ls ...' command (Kefu Chai) * mon: disallow ec pools as tiers (#11650 Samuel Just) * mon: fix mds beacon replies (#11590 Kefu Chai) * mon: fix 'pg ls' sort order, state names (#11569 Kefu Chai) * mon: normalize erasure-code profile for storage and comparison (Loic Dachary) * mon: optionally specify osd id on 'osd create' (Mykola Golub) * mon: 'osd tree' fixes (Kefu Chai) * mon: prevent pool with snapshot state from being used as a tier (#11493 Sage Weil) * mon: refine check_remove_tier checks (#11504 John Spray) * mon: remove spurious who arg from 'mds rm ...' (John Spray) * msgr: async: misc fixes (Haomai Wang) * msgr: xio: fix ip and nonce (Raju Kurunkad) * msgr: xio: improve lane assignment (Vu Pham) * msgr: xio: misc fixes (Vu Pham, Cosey Bodley) * osd: avoid transaction append in some cases (Sage Weil) * osdc/Objecter: allow per-pool calls to op_cancel_writes (John Spray) * osd: elminiate txn apend, ECSubWrite copy (Samuel Just) * osd: filejournal: cleanup (David Zafman) * osd: fix check_for_full (Henry Chang) * osd: fix dirty accounting in make_writeable (Zhiqiang Wang) * osd: fix osdmap dump of blacklist items (John Spray) * osd: fix snap flushing from cache tier (again) (#11787 Samuel Just) * osd: fix snap handling on promotion (#11296 Sam Just) * osd: handle log split with overlapping entries (#11358 Samuel Just) * osd: keyvaluestore: misc fixes (Varada Kari) * osd: make suicide timeouts individually configurable (Samuel Just) * osd: move scrub in OpWQ (Samuel Just) * osd: pool size change triggers new interval (#11771 Samuel Just) * osd: randomize scrub times (#10973 Kefu Chai) * osd: refactor scrub and digest recording (Sage Weil) * osd: refuse first write to EC object at non-zero offset (Jianpeng Ma) * osd: stripe over small xattrs to fit in XFS's 255 byte inline limit (Sage Weil, Ning Yao) * osd: sync object_map on syncfs (Samuel Just) * osd: take excl lock of op is rw (Samuel Just) * osd: WBThrottle cleanups (Jianpeng Ma) * pycephfs: many fixes for bindings (Haomai Wang) * rados: bench: add --no-verify option to improve performance (Piotr Dalek) * rados: misc bench fixes (Dmitry Yatsushkevich) * rbd: add disk usage tool (#7746 Jason Dillaman) * rgw: alwasy check if token is expired (#11367 Anton Aksola, Riku Lehto) * rgw: conversion tool to repair broken multipart objects (#12079 Yehuda Sadeh) * rgw: do not enclose bucket header in quotes (#11860 Wido den Hollander) * rgw: error out if frontend did not send all data (#11851 Yehuda Sadeh) * rgw: fix assignment of copy obj attributes (#11563 Yehuda Sadeh) * rgw: fix reset_loc (#11974 Yehuda Sadeh) * rgw: improve content-length env var handling (#11419 Robin H. Johnson) * rgw: only scan for objects not in a namespace (#11984 Yehuda Sadeh) * rgw: remove trailing :port from HTTP_HOST header (Sage Weil) * rgw: shard work over multiple librados instances (Pavan Rallabhandi) * rgw: swift: enforce Content-Type in response (#12157 Radoslaw Zarzynski) * rgw: use attrs from source bucket on copy (#11639 Javier M. Mellid) * rocksdb: pass options as single string (Xiaoxi Chen) * rpm: many spec file fixes (Owen Synge, Ken Dreyer) * tests: fixes for rbd xstests (Douglas Fuller) * tests: fix tiering health checks (Loic Dachary) * tests for low-level performance (Haomai Wang) * tests: many ec non-regression improvements (Loic Dachary) * tests: many many ec test improvements (Loic Dachary) * upstart: throttle restarts (#11798 Sage Weil, Greg Farnum) v9.0.1 ====== This development release is delayed a bit due to tooling changes in the build environment. As a result the next one (v9.0.2) will have a bit more work than is usual. Highlights here include lots of RGW Swift fixes, RBD feature work surrounding the new object map feature, more CephFS snapshot fixes, and a few important CRUSH fixes. Notable Changes --------------- * auth: cache/reuse crypto lib key objects, optimize msg signature check (Sage Weil) * build: allow tcmalloc-minimal (Thorsten Behrens) * build: do not build ceph-dencoder with tcmalloc (#10691 Boris Ranto) * build: fix pg ref disabling (William A. Kennington III) * build: install-deps.sh improvements (Loic Dachary) * build: misc fixes (Boris Ranto, Ken Dreyer, Owen Synge) * ceph-authtool: fix return code on error (Gerhard Muntingh) * ceph-disk: fix zap sgdisk invocation (Owen Synge, Thorsten Behrens) * ceph-disk: pass --cluster arg on prepare subcommand (Kefu Chai) * ceph-fuse, libcephfs: drop inode when rmdir finishes (#11339 Yan, Zheng) * ceph-fuse,libcephfs: fix uninline (#11356 Yan, Zheng) * ceph-monstore-tool: fix store-copy (Huangjun) * common: add perf counter descriptions (Alyona Kiseleva) * common: fix throttle max change (Henry Chang) * crush: fix crash from invalid 'take' argument (#11602 Shiva Rkreddy, Sage Weil) * crush: fix divide-by-2 in straw2 (#11357 Yann Dupont, Sage Weil) * deb: fix rest-bench-dbg and ceph-test-dbg dependendies (Ken Dreyer) * doc: document region hostnames (Robin H. Johnson) * doc: update release schedule docs (Loic Dachary) * init-radosgw: run radosgw as root (#11453 Ken Dreyer) * librados: fadvise flags per op (Jianpeng Ma) * librbd: allow additional metadata to be stored with the image (Haomai Wang) * librbd: better handling for dup flatten requests (#11370 Jason Dillaman) * librbd: cancel in-flight ops on watch error (#11363 Jason Dillaman) * librbd: default new images to format 2 (#11348 Jason Dillaman) * librbd: fast diff implementation that leverages object map (Jason Dillaman) * librbd: fix snapshot creation when other snap is active (#11475 Jason Dillaman) * librbd: new diff_iterate2 API (Jason Dillaman) * librbd: object map rebuild support (Jason Dillaman) * logrotate.d: prefer service over invoke-rc.d (#11330 Win Hierman, Sage Weil) * mds: avoid getting stuck in XLOCKDONE (#11254 Yan, Zheng) * mds: fix integer truncateion on large client ids (Henry Chang) * mds: many snapshot and stray fixes (Yan, Zheng) * mds: persist completed_requests reliably (#11048 John Spray) * mds: separate safe_pos in Journaler (#10368 John Spray) * mds: snapshot rename support (#3645 Yan, Zheng) * mds: warn when clients fail to advance oldest_client_tid (#10657 Yan, Zheng) * misc cleanups and fixes (Danny Al-Gaaf) * mon: fix average utilization calc for 'osd df' (Mykola Golub) * mon: fix variance calc in 'osd df' (Sage Weil) * mon: improve callout to crushtool (Mykola Golub) * mon: prevent bucket deletion when referenced by a crush rule (#11602 Sage Weil) * mon: prime pg_temp when CRUSH map changes (Sage Weil) * monclient: flush_log (John Spray) * msgr: async: many many fixes (Haomai Wang) * msgr: simple: fix clear_pipe (#11381 Haomai Wang) * osd: add latency perf counters for tier operations (Xinze Chi) * osd: avoid multiple hit set insertions (Zhiqiang Wang) * osd: break PG removal into multiple iterations (#10198 Guang Yang) * osd: check scrub state when handling map (Jianpeng Ma) * osd: fix endless repair when object is unrecoverable (Jianpeng Ma, Kefu Chai) * osd: fix pg resurrection (#11429 Samuel Just) * osd: ignore non-existent osds in unfound calc (#10976 Mykola Golub) * osd: increase default max open files (Owen Synge) * osd: prepopulate needs_recovery_map when only one peer has missing (#9558 Guang Yang) * osd: relax reply order on proxy read (#11211 Zhiqiang Wang) * osd: skip promotion for flush/evict op (Zhiqiang Wang) * osd: write journal header on clean shutdown (Xinze Chi) * qa: run-make-check.sh script (Loic Dachary) * rados bench: misc fixes (Dmitry Yatsushkevich) * rados: fix error message on failed pool removal (Wido den Hollander) * radosgw-admin: add 'bucket check' function to repair bucket index (Yehuda Sadeh) * rbd: allow unmapping by spec (Ilya Dryomov) * rbd: deprecate --new-format option (Jason Dillman) * rgw: do not set content-type if length is 0 (#11091 Orit Wasserman) * rgw: don't use end_marker for namespaced object listing (#11437 Yehuda Sadeh) * rgw: fail if parts not specified on multipart upload (#11435 Yehuda Sadeh) * rgw: fix GET on swift account when limit == 0 (#10683 Radoslaw Zarzynski) * rgw: fix broken stats in container listing (#11285 Radoslaw Zarzynski) * rgw: fix bug in domain/subdomain splitting (Robin H. Johnson) * rgw: fix civetweb max threads (#10243 Yehuda Sadeh) * rgw: fix copy metadata, support X-Copied-From for swift (#10663 Radoslaw Zarzynski) * rgw: fix locator for objects starting with _ (#11442 Yehuda Sadeh) * rgw: fix mulitipart upload in retry path (#11604 Yehuda Sadeh) * rgw: fix quota enforcement on POST (#11323 Sergey Arkhipov) * rgw: fix return code on missing upload (#11436 Yehuda Sadeh) * rgw: force content type header on responses with no body (#11438 Orit Wasserman) * rgw: generate new object tag when setting attrs (#11256 Yehuda Sadeh) * rgw: issue aio for first chunk before flush cached data (#11322 Guang Yang) * rgw: make read user buckets backward compat (#10683 Radoslaw Zarzynski) * rgw: merge manifests properly with prefix override (#11622 Yehuda Sadeh) * rgw: return 412 on bad limit when listing buckets (#11613 Yehuda Sadeh) * rgw: send ETag, Last-Modified for swift (#11087 Radoslaw Zarzynski) * rgw: set content length on container GET, PUT, DELETE, HEAD (#10971, #11036 Radoslaw Zarzynski) * rgw: support end marker on swift container GET (#10682 Radoslaw Zarzynski) * rgw: swift: fix account listing (#11501 Radoslaw Zarzynski) * rgw: swift: set content-length on keystone tokens (#11473 Herv Rousseau) * rgw: use correct oid for gc chains (#11447 Yehuda Sadeh) * rgw: use unique request id for civetweb (#10295 Orit Wasserman) * rocksdb, leveldb: fix compact_on_mount (Xiaoxi Chen) * rocksdb: add perf counters for get/put latency (Xinxin Shu) * rpm: add suse firewall files (Tim Serong) * rpm: misc systemd and suse fixes (Owen Synge, Nathan Cutler) v9.0.0 ====== This is the first development release for the Infernalis cycle, and the first Ceph release to sport a version number from the new numbering scheme. The "9" indicates this is the 9th release cycle--I (for Infernalis) is the 9th letter. The first "0" indicates this is a development release ("1" will mean release candidate and "2" will mean stable release), and the final "0" indicates this is the first such development release. A few highlights include: * a new 'ceph daemonperf' command to watch perfcounter stats in realtime * reduced MDS memory usage * many MDS snapshot fixes * librbd can now store options in the image itself * many fixes for RGW Swift API support * OSD performance improvements * many doc updates and misc bug fixes Notable Changes --------------- * aarch64: add optimized version of crc32c (Yazen Ghannam, Steve Capper) * auth: reinit NSS after fork() (#11128 Yan, Zheng) * build: disable LTTNG by default (#11333 Josh Durgin) * build: fix ppc build (James Page) * build: install-deps: support OpenSUSE (Loic Dachary) * build: misc cmake fixes (Matt Benjamin) * ceph-disk: follow ceph-osd hints when creating journal (#9580 Sage Weil) * ceph-disk: handle re-using existing partition (#10987 Loic Dachary) * ceph-disk: improve parted output parsing (#10983 Loic Dachary) * ceph-disk: make suppression work for activate-all and activate-journal (Dan van der Ster) * ceph-disk: misc fixes (Alfredo Deza) * ceph-fuse, libcephfs: don't clear COMPLETE when trimming null (Yan, Zheng) * ceph-fuse, libcephfs: hold exclusive caps on dirs we "own" (#11226 Greg Farnum) * ceph-fuse: do not require successful remount when unmounting (#10982 Greg Farnum) * ceph: new 'ceph daemonperf' command (John Spray, Mykola Golub) * common: PriorityQueue tests (Kefu Chai) * common: add descriptions to perfcounters (Kiseleva Alyona) * common: fix LTTNG vs fork issue (Josh Durgin) * crush: fix has_v4_buckets (#11364 Sage Weil) * crushtool: fix order of operations, usage (Sage Weil) * debian: minor package reorg (Ken Dreyer) * doc: docuemnt object corpus generation (#11099 Alexis Normand) * doc: fix gender neutrality (Alexandre Maragone) * doc: fix install doc (#10957 Kefu Chai) * doc: fix sphinx issues (Kefu Chai) * doc: mds data structure docs (Yan, Zheng) * doc: misc updates (Nilamdyuti Goswami, Vartika Rai, Florian Haas, Loic Dachary, Simon Guinot, Andy Allan, Alistair Israel, Ken Dreyer, Robin Rehu, Lee Revell, Florian Marsylle, Thomas Johnson, Bosse Klykken, Travis Rhoden, Ian Kelling) * doc: swift tempurls (#10184 Abhishek Lekshmanan) * doc: switch doxygen integration back to breathe (#6115 Kefu Chai) * erasure-code: update ISA-L to 2.13 (Yuan Zhou) * gmock: switch to submodule (Danny Al-Gaaf, Loic Dachary) * hadoop: add terasort test (Noah Watkins) * java: fix libcephfs bindings (Noah Watkins) * libcephfs,ceph-fuse: fix request resend on cap reconnect (#10912 Yan, Zheng) * librados: define C++ flags from C constants (Josh Durgin) * librados: fix last_force_resent handling (#11026 Jianpeng Ma) * librados: fix memory leak from C_TwoContexts (Xiong Yiliang) * librados: fix striper when stripe_count = 1 and stripe_unit != object_size (#11120 Yan, Zheng) * librados: op perf counters (John Spray) * librados: pybind: fix write() method return code (Javier Guerra) * libradosstriper: fix leak (Danny Al-Gaaf) * librbd: add purge_on_error cache behavior (Jianpeng Ma) * librbd: misc aio fixes (#5488 Jason Dillaman) * librbd: misc rbd fixes (#11478 #11113 #11342 #11380 Jason Dillaman, Zhiqiang Wang) * librbd: readahead fixes (Zhiqiang Wang) * librbd: store metadata, including config options, in image (Haomai Wang) * mds: add 'damaged' state to MDSMap (John Spray) * mds: add nicknames for perfcounters (John Spray) * mds: disable problematic rstat propagation into snap parents (Yan, Zheng) * mds: fix mydir replica issue with shutdown (#10743 John Spray) * mds: fix out-of-order messages (#11258 Yan, Zheng) * mds: fix shutdown with strays (#10744 John Spray) * mds: fix snapshot fixes (Yan, Zheng) * mds: fix stray handling (John Spray) * mds: flush immediately in do_open_truncate (#11011 John Spray) * mds: improve dump methods (John Spray) * mds: misc journal cleanups and fixes (#10368 John Spray) * mds: new SessionMap storage using omap (#10649 John Spray) * mds: reduce memory consumption (Yan, Zheng) * mds: throttle purge stray operations (#10390 John Spray) * mds: tolerate clock jumping backwards (#11053 Yan, Zheng) * misc coverity fixes (Danny Al-Gaaf) * mon: do not deactivate last mds (#10862 John Spray) * mon: make osd get pool 'all' only return applicable fields (#10891 Michal Jarzabek) * mon: warn on bogus cache tier config (Jianpeng Ma) * msg/async: misc bug fixes and updates (Haomai Wang) * msg/simple: fix connect_seq assert (Haomai Wang) * msg/xio: misc fixes (#10735 Matt Benjamin, Kefu Chai, Danny Al-Gaaf, Raju Kurunkad, Vu Pham) * msg: unit tests (Haomai Wang) * objectcacher: misc bug fixes (Jianpeng Ma) * os/filestore: enlarge getxattr buffer size (Jianpeng Ma) * osd: EIO injection (David Zhang) * osd: add misc perfcounters (Xinze Chi) * osd: add simple sleep injection in recovery (Sage Weil) * osd: allow SEEK_HOLE/SEEK_DATA for sparse read (Zhiqiang Wang) * osd: avoid dup omap sets for in pg metadata (Sage Weil) * osd: clean up some constness, privateness (Kefu Chai) * osd: erasure-code: drop entries according to LRU (Andreas-Joachim Peters) * osd: fix negative degraded stats during backfill (Guang Yang) * osd: misc fixes (Ning Yao, Kefu Chai, Xinze Chi, Zhiqiang Wang, Jianpeng Ma) * pybind: pep8 cleanups (Danny Al-Gaaf) * qa: fix filelock_interrupt.py test (Yan, Zheng) * qa: improve ceph-disk tests (Loic Dachary) * qa: improve docker build layers (Loic Dachary) * rados: translate erno to string in CLI (#10877 Kefu Chai) * rbd: accept map options config option (Ilya Dryomov) * rbd: cli: fix arg parsing with --io-pattern (Dmitry Yatsushkevich) * rbd: fix error messages (#2862 Rajesh Nambiar) * rbd: update rbd man page (Ilya Dryomov) * rbd: update xfstests tests (Douglas Fuller) * rgw: add X-Timestamp for Swift containers (#10938 Radoslaw Zarzynski) * rgw: add missing headers to Swift container details (#10666 Ahmad Faheem, Dmytro Iurchenko) * rgw: add stats to headers for account GET (#10684 Yuan Zhou) * rgw: do not prefecth data for HEAD requests (Guang Yang) * rgw: don't clobber bucket/object owner when setting ACLs (#10978 Yehuda Sadeh) * rgw: don't use rgw_socket_path if frontend is configured (#11160 Yehuda Sadeh) * rgw: enforce Content-Lenth for POST on Swift cont/obj (#10661 Radoslaw Zarzynski) * rgw: fix handling empty metadata items on Swift container (#11088 Radoslaw Zarzynski) * rgw: fix log rotation (Wuxingyi) * rgw: generate Date header for civetweb (#10873 Radoslaw Zarzynski) * rgw: make init script wait for radosgw to stop (#11140 Dmitry Yatsushkevich) * rgw: make quota/gc threads configurable (#11047 Guang Yang) * rgw: pass in civetweb configurables (#10907 Yehuda Sadeh) * rgw: rectify 202 Accepted in PUT response (#11148 Radoslaw Zarzynski) * rgw: remove meta file after deleting bucket (#11149 Orit Wasserman) * rgw: swift: allow setting attributes with COPY (#10662 Ahmad Faheem, Dmytro Iurchenko) * rgw: swift: fix metadata handling on copy (#10645 Radoslaw Zarzynski) * rgw: swift: send Last-Modified header (#10650 Radoslaw Zarzynski) * rgw: update keystone cache with token info (#11125 Yehuda Sadeh) * rgw: update to latest civetweb, enable config for IPv6 (#10965 Yehuda Sadeh) * rocksdb: update to latest (Xiaoxi Chen) * rpm: loosen ceph-test dependencies (Ken Dreyer) v0.94.10 Hammer =============== This Hammer point release fixes several bugs and adds two new features. We recommend that all hammer v0.94.x users upgrade. For more detailed information, see :download:`the complete changelog `. New Features ------------ ceph-objectstore-tool and ceph-monstore-tool now enable user to rebuild the monitor database from OSDs. (This feature is especially useful when all monitors fail to boot due to leveldb corruption.) In RADOS Gateway, it is now possible to reshard an existing bucket's index using an off-line tool. Usage: $ radosgw-admin bucket reshard --bucket= --num_shards= This will create a new linked bucket instance that points to the newly created index objects. The old bucket instance still exists and currently it's up to the user to manually remove the old bucket index objects. (Note that bucket resharding currently requires that all IO (especially writes) to the specific bucket is quiesced.) Other Notable Changes --------------------- * build/ops: ceph-create-keys loops forever (`issue#17753 `_, `pr#12805 `_, Alfredo Deza) * build/ops: improve ceph.in error message (`issue#11101 `_, `pr#10905 `_, Kefu Chai) * build/ops: make stop.sh more portable (`issue#16918 `_, `pr#10569 `_, Mykola Golub) * build/ops: remove SYSTEMD_RUN from initscript (`issue#16440 `_, `issue#7627 `_, `pr#9873 `_, Vladislav Odintsov) * cephx: Fix multiple segfaults due to attempts to encrypt or decrypt (`issue#16266 `_, `pr#11930 `_, Brad Hubbard) * common: SIGABRT in TrackedOp::dump() via dump_ops_in_flight() (`issue#8885 `_, `pr#12121 `_, Jianpeng Ma, Zhiqiang Wang, David Zafman) * common: os/ObjectStore: fix _update_op for split dest_cid (`issue#15345 `_, `pr#12071 `_, Sage Weil) * crush: reset bucket->h.items[i] when removing tree item (`issue#16525 `_, `pr#10724 `_, Kefu Chai) * doc: add "Upgrading to Hammer" section (`issue#17386 `_, `pr#11372 `_, Kefu Chai) * doc: add orphan options to radosgw-admin --help and man page (`issue#17281 `_, `issue#17280 `_, `pr#11140 `_, Abhishek Lekshmanan, Casey Bodley, Ken Dreyer, Thomas Serlin) * doc: clarify that RGW bucket object versioning is supported (`issue#16574 `_, `pr#10437 `_, Yuan Zhou, shawn chen) * librados: bad flags can crash the osd (`issue#16012 `_, `pr#11936 `_, Jianpeng Ma, Sage Weil) * librbd: ceph 10.2.2 rbd status on image format 2 returns "(2) No such file or directory" (`issue#16887 `_, `pr#10987 `_, Jason Dillaman) * librbd: diffs to clone's first snapshot should include parent diffs (`issue#18068 `_, `pr#12446 `_, Jason Dillaman) * librbd: image.stat() call in librbdpy fails sometimes (`issue#17310 `_, `pr#11949 `_, Jason Dillaman) * librbd: request exclusive lock if current owner cannot execute op (`issue#16171 `_, `pr#12018 `_, Mykola Golub) * mds: fix cephfs-java ftruncate unit test failure (`issue#11258 `_, `pr#11939 `_, Yan, Zheng) * mon: %USED of ceph df is wrong (`issue#16933 `_, `pr#11934 `_, Kefu Chai) * mon: MonmapMonitor should return success when MON will be removed (`issue#17725 `_, `pr#12006 `_, Joao Eduardo Luis) * mon: OSDMonitor: Missing nearfull flag set (`issue#17390 `_, `pr#11273 `_, Igor Podoski) * mon: OSDs marked OUT wrongly after monitor failover (`issue#17719 `_, `pr#11946 `_, Dong Wu) * mon: fix memory leak in prepare_beacon (`issue#17285 `_, `pr#10238 `_, Igor Podoski) * mon: osd flag health message is misleading (`issue#18175 `_, `pr#12687 `_, Sage Weil) * mon: prepare_pgtemp needs to only update up_thru if newer than the existing one (`issue#16185 `_, `pr#11937 `_, Samuel Just) * mon: return size_t from MonitorDBStore::Transaction::size() (`issue#14217 `_, `pr#10904 `_, Kefu Chai) * mon: send updated monmap to its subscribers (`issue#17558 `_, `pr#11457 `_, Kefu Chai) * msgr: OpTracker needs to release the message throttle in _unregistered (`issue#14248 `_, `pr#11938 `_, Samuel Just) * msgr: simple/Pipe: error decoding addr (`issue#18072 `_, `pr#12266 `_, Sage Weil) * osd: PG::_update_calc_stats wrong for CRUSH_ITEM_NONE up set items (`issue#16998 `_, `pr#11933 `_, Samuel Just) * osd: PG::choose_acting valgrind error or ./common/hobject.h: 182: FAILED assert(!max || (\*this == hobject_t(hobject_t::get_max()))) (`issue#13967 `_, `pr#11932 `_, Tao Chang) * osd: ReplicatedBackend::build_push_op: add a second config to limit omap entries/chunk independently of object data (`issue#16128 `_, `pr#12417 `_, Wanlong Gao) * osd: crash on EIO during deep-scrubbing (`issue#16034 `_, `pr#11935 `_, Nathan Cutler) * osd: filestore: FALLOC_FL_PUNCH_HOLE must be used with FALLOC_FL_KEEP_SIZE (`issue#18446 `_, `pr#13041 `_, xinxin shu) * osd: fix cached_removed_snaps bug in PGPool::update after map gap (`issue#18628 `_, `issue#15943 `_, `pr#12906 `_, Samuel Just) * osd: fix collection_list shadow return value (`issue#17713 `_, `pr#11927 `_, Haomai Wang) * osd: fix fiemap issue in xfs when #extents > 1364 (`issue#17610 `_, `pr#11615 `_, Kefu Chai, Ning Yao) * osd: update PGPool to detect map gaps and reset cached_removed_snaps (`issue#15943 `_, `pr#11676 `_, Samuel Just) * rbd: export diff should open image as read-only (`issue#17671 `_, `pr#11948 `_, liyankun) * rbd: fix parameter check (`issue#18237 `_, `pr#12312 `_, Yankun Li) * rbd: fix possible rbd data corruption (`issue#16002 `_, `pr#11618 `_, Yan, Zheng, Greg Farnum) * rgw: Anonymous user is able to read bucket with authenticated read ACL (`issue#13207 `_, `pr#11045 `_, rahul.1aggarwal@gmail.com) * rgw: COPY broke multipart files uploaded under dumpling (`issue#16435 `_, `pr#11950 `_, Yehuda Sadeh) * rgw: TempURL in radosgw behaves now like its Swift's counterpart. (`issue#18316 `_, `pr#12619 `_, Radoslaw Zarzynski) * rgw: default quota fixes (`issue#16410 `_, `pr#10839 `_, Pavan Rallabhandi, Daniel Gryniewicz) * rgw: do not abort when accept a CORS request with short origin (`issue#18187 `_, `pr#12398 `_, LiuYang) * rgw: do not omap_getvals with (u64)-1 max (`issue#17985 `_, `pr#12418 `_, Yehuda Sadeh, Sage Weil) * rgw: fix crash when client posts object with null condition (`issue#17635 `_, `pr#11809 `_, Yehuda Sadeh) * rgw: fix inconsistent uid/email handling in radosgw-admin (`issue#13598 `_, `pr#11952 `_, Matt Benjamin) * rgw: implement offline resharding command (`issue#17745 `_, `pr#12227 `_, Yehuda Sadeh, Orit Wasserman, weiqiaomiao) * rgw: swift: ranged request on a DLO provides wrong values in Content-Range HTTP header (`issue#13452 `_, `pr#11951 `_, Radoslaw Zarzynski) * rgw: the value of total_time is wrong in the result of 'radosgw-admin log show' opt (`issue#17598 `_, `pr#11899 `_, weiqiaomiao) * tests: Cannot clone ceph/s3-tests.git (missing branch) (`issue#18384 `_, `pr#12744 `_, Orit Wasserman) * tests: Cannot reserve CentOS 7.2 smithi machines (`issue#18401 `_, `pr#12762 `_, Nathan Cutler) * tests: OSDs commit suicide in rbd suite when testing on btrfs (`issue#18397 `_, `pr#12758 `_, Nathan Cutler) * tests: Workunits needlessly wget from git.ceph.com (`issue#18336 `_, `issue#18271 `_, `issue#18388 `_, `pr#12685 `_, Sage Weil, Nathan Cutler) * tests: cephfs test failures (ceph.com/qa is broken, should be download.ceph.com/qa) (`issue#18574 `_, `pr#13022 `_, John Spray) * tests: merge ceph-qa-suite (`pr#12455 `_, Sage Weil) * tests: objecter_requests workunit fails on wip branches (`issue#18393 `_, `pr#12759 `_, Sage Weil) * tests: populate mnt_point in qa/tasks/ceph.py (`issue#18383 `_, `pr#12743 `_, Nathan Cutler) * tests: qemu/tests/qemu-iotests/077 fails in dumpling, hammer, and jewel (`issue#10773 `_, `pr#12423 `_, Jason Dillaman) * tests: run fs/thrash on xfs instead of btrfs (`issue#17151 `_, `pr#13039 `_, Nathan Cutler) * tests: update Ubuntu image url after ceph.com refactor (`issue#18542 `_, `pr#12957 `_, Jason Dillaman) * tests: update rbd/singleton/all/formatted-output.yaml to support ceph-ci * (`issue#18440 `_, `pr#12824 * `_, Venky Shankar, Nathan Cutler) * tools: add a tool to rebuild mon store from OSD (`issue#17179 `_, `issue#17400 `_, `pr#11125 `_, Kefu Chai, xie xingguo) * tools: ceph-objectstore-tool crashes if --journal-path (`issue#17307 `_, `pr#11929 `_, Kefu Chai) * tools: ceph-objectstore-tool: add a way to split filestore directories offline (`issue#17220 `_, `pr#11253 `_, Josh Durgin) * tools: crushtool --compile generates output despite missing item (`issue#17306 `_, `pr#11931 `_, Kefu Chai) v0.94.9 Hammer ============== This Hammer point release fixes a build issue present in 0.94.8 that prevented us from generating packages for Ubuntu Precise and CentOS 6.x. We recommend all users of v0.94.7 or older upgrade. For more detailed information, see :download:`the complete changelog `. Notable Changes --------------- * build/ops: revert: boost uuid makes valgrind complain (`pr#10913 `_, Sage Weil) v0.94.8 Hammer ============== This Hammer point release fixes several bugs. We recommend that all hammer v0.94.x users upgrade. For more detailed information, see :download:`the complete changelog `. Notable Changes --------------- * build/ops: rocksdb do not link against tcmalloc if it's disabled (`issue#14799 `_, `pr#10750 `_, Sage Weil, Kefu Chai) * build/ops: Add -D_LARGEFILE64_SOURCE to Linux build. (`issue#16611 `_, `pr#10182 `_, Ira Cooper) * build/ops: boost uuid makes valgrind complain (`issue#12736 `_, `pr#9741 `_, Sage Weil, Rohan Mars) * build/ops: ceph-disk s/by-parttype-uuid/by-parttypeuuid/ (`issue#15867 `_, `pr#9107 `_, Nathan Cutler) * common: add units to rados bench output and clean up formatting (`issue#12248 `_, `pr#8960 `_, Dmitry Yatsushkevich, Brad Hubbard, Gu Zhongyan) * common: config set with negative value results in "error setting 'filestore_merge_threshold' to '-40': (22) Invalid argument" (`issue#13829 `_, `pr#10291 `_, Brad Hubbard, Kefu Chai) * common: linking to -lrbd causes process startup times to balloon (`issue#15225 `_, `pr#8538 `_, Richard W.M. Jones) * doc: fix by-parttypeuuid in ceph-disk(8) nroff (`issue#15867 `_, `pr#10699 `_, Ken Dreyer) * fs: double decreased the count to trim caps which will cause failing to respond to cache pressure (`issue#14319 `_, `pr#8804 `_, Zhi Zhang) * log: do not repeat errors to stderr (`issue#14616 `_, `pr#10227 `_, Sage Weil) * mds: failing file operations on kernel based cephfs mount point leaves unaccessible file behind on hammer 0.94.7 (`issue#16013 `_, `pr#10198 `_, Yan, Zheng) * mds: fix stray purging in 'stripe_count > 1' case (`issue#15050 `_, `pr#8042 `_, Yan, Zheng) * mds: wrongly treat symlink inode as normal file/dir when symlink inode is stale on kcephfs (`issue#15702 `_, `pr#9404 `_, Zhi Zhang) * mon: LibRadosMiscConnectFailure.ConnectFailure (not so intermittent) failure in upgrade/hammer-x (`issue#13992 `_, `pr#8806 `_, Sage Weil) * mon: Monitor: validate prefix on handle_command() (`issue#16297 `_, `pr#10038 `_, You Ji) * mon: drop pg temps from not the current primary in OSDMonitor (`issue#16127 `_, `pr#9893 `_, Samuel Just) * mon: fix calculation of %USED (`issue#15641 `_, `pr#9125 `_, Ruifeng Yang, David Zafman) * mon: improve reweight_by_utilization() logic (`issue#15686 `_, `pr#9416 `_, xie xingguo) * mon: pool quota alarm is not in effect (`issue#15478 `_, `pr#8593 `_, Danny Al-Gaaf) * mon: wrong ceph get mdsmap assertion (`issue#14681 `_, `pr#7542 `_, Vicente Cheng) * msgr: ceph-osd valgrind invalid reads/writes (`issue#15870 `_, `pr#9238 `_, Samuel Just) * objecter: LibRadosWatchNotifyPPTests/LibRadosWatchNotifyPP.WatchNotify2Timeout/1 segv (`issue#15760 `_, `pr#9400 `_, Sage Weil) * osd: OSD reporting ENOTEMPTY and crashing (`issue#14766 `_, `pr#9277 `_, Samuel Just) * osd: When generating past intervals due to an import end at pg epoch and fix build_past_intervals_parallel (`issue#12387 `_, `issue#14438 `_, `pr#8464 `_, David Zafman) * osd: acting_primary not updated on split (`issue#15523 `_, `pr#9001 `_, Sage Weil) * osd: assert(!actingbackfill.empty()): old watch timeout tries to queue repop on replica (`issue#15391 `_, `pr#8665 `_, Sage Weil) * osd: assert(rollback_info_trimmed_to == head) in PGLog (`issue#13965 `_, `pr#8849 `_, Samuel Just) * osd: delete one of the repeated op->mark_started in ReplicatedBackend::sub_op_modify_impl (`issue#16572 `_, `pr#9977 `_, shun-s) * osd: fix omap digest compare when scrub (`issue#16000 `_, `pr#9271 `_, Xinze Chi) * osd: is_split crash in handle_pg_create (`issue#15426 `_, `pr#8805 `_, Kefu Chai) * osd: objects unfound after repair (fixed by repeering the pg) (`issue#15006 `_, `pr#7961 `_, Jianpeng Ma, Loic Dachary, Kefu Chai) * osd: rados cppool omap to ec pool crashes osd (`issue#14695 `_, `pr#8845 `_, Jianpeng Ma) * osd: remove all stale osdmaps in handle_osd_map() (`issue#13990 `_, `pr#9090 `_, Kefu Chai) * osd: send write and read sub ops on behalf of client ops at normal priority in ECBackend (`issue#14313 `_, `pr#8573 `_, Samuel Just) * rbd: snap rollback: restore the link to parent (`issue#14512 `_, `pr#8535 `_, Alexey Sheplyakov) * rgw: S3: set EncodingType in ListBucketResult (`issue#15896 `_, `pr#8987 `_, Victor Makarov, Robin H. Johnson) * rgw: backport rgwx-copy-if-newer for radosgw-agent (`issue#16262 `_, `pr#9671 `_, Yehuda Sadeh) * rgw: bucket listing following object delete is partial (`issue#14826 `_, `pr#10555 `_, Orit Wasserman) * rgw: convert plain object to versioned (with null version) when removing (`issue#15243 `_, `pr#8755 `_, Yehuda Sadeh) * rgw: fix multi-delete query param parsing. (`issue#16618 `_, `pr#10189 `_, Robin H. Johnson) * rgw: have a flavor of bucket deletion to bypass GC and to trigger (`issue#15557 `_, `pr#10509 `_, Pavan Rallabhandi) * rgw: keep track of written_objs correctly (`issue#15886 `_, `pr#9240 `_, Yehuda Sadeh) * rgw: multipart ListPartsResult has missing quotes on ETag (`issue#15334 `_, `pr#8475 `_, xie xingguo, Robin H. Johnson) * rgw: no Last-Modified, Content-Size and X-Object-Manifest headers if no segments in DLO manifest (`issue#15812 `_, `pr#9402 `_, Radoslaw Zarzynski) * rgw: radosgw server abort when user passed bad parameters to set quota (`issue#14190 `_, `issue#14191 `_, `pr#8313 `_, Dunrong Huang) * rgw: radosgw-admin region-map set is not reporting the bucket quota correctly (`issue#16815 `_, `pr#10554 `_, Yehuda Sadeh, Orit Wasserman) * rgw: refrain from sending Content-Type/Content-Length for 304 responses (`issue#16327 `_, `issue#13582 `_, `issue#15119 `_, `issue#14005 `_, `pr#8379 `_, Yehuda Sadeh, Nathan Cutler, Wido den Hollander) * rgw: remove bucket index objects when deleting the bucket (`issue#16412 `_, `pr#10530 `_, Orit Wasserman) * rgw: set Access-Control-Allow-Origin to an asterisk if allowed in a rule (`issue#15348 `_, `pr#8528 `_, Wido den Hollander) * rgw: subset of uploaded objects via radosgw are unretrievable when using EC pool (`issue#15745 `_, `pr#9407 `_, Yehuda Sadeh) * rgw: subuser rm fails with status 125 (`issue#14375 `_, `pr#9961 `_, Orit Wasserman) * rgw: the swift key remains after removing a subuser (`issue#12890 `_, `issue#14375 `_, `pr#10718 `_, Orit Wasserman, Sangdi Xu) * rgw: user quota may not adjust on bucket removal (`issue#14507 `_, `pr#8113 `_, Edward Yang) * tests: be more generous with test timeout (`issue#15403 `_, `pr#8470 `_, Loic Dachary) * tests: qa/workunits/rbd: respect RBD_CREATE_ARGS environment variable (`issue#16289 `_, `pr#9722 `_, Mykola Golub) v0.94.7 Hammer ============== This Hammer point release fixes several minor bugs. It also includes a backport of an improved 'ceph osd reweight-by-utilization' command for handling OSDs with higher-than-average utilizations. We recommend that all hammer v0.94.x users upgrade. For more detailed information, see :download:`the complete changelog `. Notable Changes --------------- * auth: keyring permisions for mon deamon (`issue#14950 `_, `pr#8049 `_, Owen Synge) * auth: PK11_DestroyContext() is called twice if PK11_DigestFinal() fails (`issue#14958 `_, `pr#7922 `_, Brad Hubbard, Dunrong Huang) * auth: use libnss more safely (`issue#14620 `_, `pr#7488 `_, Sage Weil) * ceph-disk: use blkid instead of sgdisk -i (`issue#14080 `_, `issue#14094 `_, `pr#7475 `_, Ilya Dryomov, Loic Dachary) * ceph-fuse: fix ceph-fuse writing to stale log file after log rotation (`issue#12350 `_, `pr#7110 `_, Zhi Zhang) * ceph init script unconditionally sources /lib/lsb/init-functions (`issue#14402 `_, `pr#7797 `_, Yan, Zheng) * ceph.in: Notify user that 'tell' can't be used in interactive mode (`issue#14773 `_, `pr#7656 `_, David Zafman) * ceph-objectstore-tool, osd: Fix import handling (`issue#10794 `_, `issue#13382 `_, `pr#7917 `_, Sage Weil, David Zafman) * client: added permission check based on getgrouplist (`issue#13268 `_, `pr#6604 `_, Yan, Zheng, Danny Al-Gaaf) * client: inoderef (`issue#13729 `_, `pr#6551 `_, Yan, Zheng) * common: clock skew report is incorrect by ceph health detail command (`issue#14175 `_, `pr#8051 `_, Joao Eduardo Luis) * global/pidfile: do not start two daemons with a single pid-file (`issue#13422 `_, `pr#7671 `_, Loic Dachary, shun song) * librados: segfault in Objecter::handle_watch_notify (`issue#13805 `_, `pr#7992 `_, Sage Weil) * librbd: flattening an rbd image with active IO can lead to hang (`issue#14092 `_, `issue#14483 `_, `pr#7485 `_, Jason Dillaman) * librbd: possible QEMU deadlock after creating image snapshots (`issue#14988 `_, `pr#8011 `_, Jason Dillaman) * mon: Bucket owner isn't changed after unlink/link (`issue#11076 `_, `pr#8583 `_, Zengran Zhang) * monclient: avoid key renew storm on clock skew (`issue#12065 `_, `pr#8398 `_, Alexey Sheplyakov) * mon: implement reweight-by-utilization feature (`issue#15054 `_, `pr#8026 `_, Kefu Chai, Dan van der Ster, Sage Weil) * mon/LogMonitor: use the configured facility if log to syslog (`issue#13748 `_, `pr#7648 `_, Kefu Chai) * mon: mon sync does not copy config-key (`issue#14577 `_, `pr#7576 `_, Xiaowei Chen) * mon/OSDMonitor: avoid underflow in reweight-by-utilization if max_change=1 (`issue#15655 `_, `pr#8979 `_, Samuel Just) * osd: consume_maps clearing of waiting_for_pg needs to check the spg_t shard for acting set membership (`issue#14278 `_, `pr#7577 `_, Samuel Just) * osd: log inconsistent shard sizes (`issue#14009 `_, `pr#6946 `_, Loic Dachary) * osd: OSD coredumps with leveldb compact on mount = true (`issue#14748 `_, `pr#7645 `_, Xiaoxi Chen) * osd/OSDMap: reset osd_primary_affinity shared_ptr when deepish_copy_from (`issue#14686 `_, `pr#7590 `_, Xinze Chi) * osd: Protect against excessively large object map sizes (`issue#15121 `_, `pr#8401 `_, Jason Dillaman) * osd/ReplicatedPG: do not proxy read *and* process op locally (`issue#15171 `_, `pr#8187 `_, Sage Weil) * osd: scrub bogus results when missing a clone (`issue#14875 `_, `issue#14874 `_, `issue#14877 `_, `issue#10098 `_, `issue#14878 `_, `issue#14881 `_, `issue#14882 `_, `issue#14883 `_, `issue#14879 `_, `issue#10290 `_, `issue#12740 `_, `issue#12738 `_, `issue#14880 `_, `issue#11135 `_, `issue#14876 `_, `issue#10809 `_, `issue#12193 `_, `issue#11237 `_, `pr#7702 `_, Xinze Chi, Sage Weil, John Spray, Kefu Chai, Mykola Golub, David Zafman) * osd: Unable to bring up OSD's after dealing with FULL cluster (OSD assert with /include/interval_set.h: 386: FAILED assert(_size >= 0)) (`issue#14428 `_, `pr#7415 `_, Alexey Sheplyakov) * osd: use GMT time for the object name of hitsets (`issue#13192 `_, `issue#9732 `_, `issue#12968 `_, `pr#7883 `_, Kefu Chai, David Zafman) * qa/workunits/post-file.sh: sudo (`issue#14586 `_, `pr#7456 `_, Sage Weil) * qa/workunits: remove 'mds setmap' from workunits (`pr#8123 `_, Sage Weil) * rgw: default quota params (`issue#12997 `_, `pr#7188 `_, Daniel Gryniewicz) * rgw: make rgw_fronends more forgiving of whitespace (`issue#12038 `_, `pr#7414 `_, Matt Benjamin) * rgw: radosgw-admin bucket check --fix not work (`issue#14215 `_, `pr#7185 `_, Weijun Duan) * rpm package building fails if the build machine has lttng and babeltrace development packages installed locally (`issue#14844 `_, `pr#8440 `_, Kefu Chai) * rpm: redhat-lsb-core dependency was dropped, but is still needed (`issue#14906 `_, `pr#7876 `_, Nathan Cutler) * test_bit_vector.cc uses magic numbers against #defines that vary (`issue#14747 `_, `pr#7672 `_, Jason Dillaman) * test/librados/tier.cc doesn't completely clean up EC pools (`issue#13878 `_, `pr#8052 `_, Loic Dachary, Dan Mick) * tests: bufferlist: do not expect !is_page_aligned() after unaligned rebuild (`issue#15305 `_, `pr#8272 `_, Kefu Chai) * tools: fix race condition in seq/rand bench (part 1) (`issue#14968 `_, `issue#14873 `_, `pr#7896 `_, Alexey Sheplyakov, Piotr Dałek) * tools: fix race condition in seq/rand bench (part 2) (`issue#14873 `_, `pr#7817 `_, Alexey Sheplyakov) * tools/rados: add bench smoke tests (`issue#14971 `_, `pr#7903 `_, Piotr Dałek) * tools, test: Add ceph-objectstore-tool to operate on the meta collection (`issue#14977 `_, `pr#7911 `_, David Zafman) * unittest_crypto: benchmark 100,000 CryptoKey::encrypt() calls (`issue#14863 `_, `pr#7801 `_, Sage Weil) v0.94.6 Hammer ====================== This Hammer point release fixes a range of bugs, most notably a fix for unbounded growth of the monitor's leveldb store, and a workaround in the OSD to keep most xattrs small enough to be stored inline in XFS inodes. We recommend that all hammer v0.94.x users upgrade. For more detailed information, see :download:`the complete changelog `. Notable Changes --------------- * build/ops: Ceph daemon failed to start, because the service name was already used. (`issue#13474 `_, `pr#6832 `_, Chuanhong Wang) * build/ops: LTTng-UST tracing should be dynamically enabled (`issue#13274 `_, `pr#6415 `_, Jason Dillaman) * build/ops: ceph upstart script rbdmap.conf incorrectly processes parameters (`issue#13214 `_, `pr#6159 `_, Sage Weil) * build/ops: ceph.spec.in License line does not reflect COPYING (`issue#12935 `_, `pr#6680 `_, Nathan Cutler) * build/ops: ceph.spec.in libcephfs_jni1 has no %post and %postun (`issue#12927 `_, `pr#5789 `_, Owen Synge) * build/ops: configure.ac: no use to add "+" before ac_ext=c (`issue#14330 `_, `pr#6973 `_, Kefu Chai, Robin H. Johnson) * build/ops: deb: strip tracepoint libraries from Wheezy/Precise builds (`issue#14801 `_, `pr#7316 `_, Jason Dillaman) * build/ops: init script reload doesn't work on EL7 (`issue#13709 `_, `pr#7187 `_, Hervé Rousseau) * build/ops: init-rbdmap uses distro-specific functions (`issue#12415 `_, `pr#6528 `_, Boris Ranto) * build/ops: logrotate reload error on Ubuntu 14.04 (`issue#11330 `_, `pr#5787 `_, Sage Weil) * build/ops: miscellaneous spec file fixes (`issue#12931 `_, `issue#12994 `_, `issue#12924 `_, `issue#12360 `_, `pr#5790 `_, Boris Ranto, Nathan Cutler, Owen Synge, Travis Rhoden, Ken Dreyer) * build/ops: pass tcmalloc env through to ceph-os (`issue#14802 `_, `pr#7365 `_, Sage Weil) * build/ops: rbd-replay-* moved from ceph-test-dbg to ceph-common-dbg as well (`issue#13785 `_, `pr#6580 `_, Loic Dachary) * build/ops: unknown argument --quiet in udevadm settle (`issue#13560 `_, `pr#6530 `_, Jason Dillaman) * common: Objecter: pool op callback may hang forever. (`issue#13642 `_, `pr#6588 `_, xie xingguo) * common: Objecter: potential null pointer access when do pool_snap_list. (`issue#13639 `_, `pr#6839 `_, xie xingguo) * common: ThreadPool add/remove work queue methods not thread safe (`issue#12662 `_, `pr#5889 `_, Jason Dillaman) * common: auth/cephx: large amounts of log are produced by osd (`issue#13610 `_, `pr#6835 `_, Qiankun Zheng) * common: client nonce collision due to unshared pid namespaces (`issue#13032 `_, `pr#6151 `_, Josh Durgin) * common: common/Thread:pthread_attr_destroy(thread_attr) when done with it (`issue#12570 `_, `pr#6157 `_, Piotr Dałek) * common: log: Log.cc: Assign LOG_DEBUG priority to syslog calls (`issue#13993 `_, `pr#6994 `_, Brad Hubbard) * common: objecter: cancellation bugs (`issue#13071 `_, `pr#6155 `_, Jianpeng Ma) * common: pure virtual method called (`issue#13636 `_, `pr#6587 `_, Jason Dillaman) * common: small probability sigabrt when setting rados_osd_op_timeout (`issue#13208 `_, `pr#6143 `_, Ruifeng Yang) * common: wrong conditional for boolean function KeyServer::get_auth() (`issue#9756 `_, `issue#13424 `_, `pr#6213 `_, Nathan Cutler) * crush: crash if we see CRUSH_ITEM_NONE in early rule step (`issue#13477 `_, `pr#6430 `_, Sage Weil) * doc: man: document listwatchers cmd in "rados" manpage (`issue#14556 `_, `pr#7434 `_, Kefu Chai) * doc: regenerate man pages, add orphans commands to radosgw-admin(8) (`issue#14637 `_, `pr#7524 `_, Ken Dreyer) * fs: CephFS restriction on removing cache tiers is overly strict (`issue#11504 `_, `pr#6402 `_, John Spray) * fs: fsstress.sh fails (`issue#12710 `_, `pr#7454 `_, Yan, Zheng) * librados: LibRadosWatchNotify.WatchNotify2Timeout (`issue#13114 `_, `pr#6336 `_, Sage Weil) * librbd: ImageWatcher shouldn't block the notification thread (`issue#14373 `_, `pr#7407 `_, Jason Dillaman) * librbd: diff_iterate needs to handle holes in parent images (`issue#12885 `_, `pr#6097 `_, Jason Dillaman) * librbd: fix merge-diff for >2GB diff-files (`issue#14030 `_, `pr#6980 `_, Jason Dillaman) * librbd: invalidate object map on error even w/o holding lock (`issue#13372 `_, `pr#6289 `_, Jason Dillaman) * librbd: reads larger than cache size hang (`issue#13164 `_, `pr#6354 `_, Lu Shi) * mds: ceph mds add_data_pool check for EC pool is wrong (`issue#12426 `_, `pr#5766 `_, John Spray) * mon: MonitorDBStore: get_next_key() only if prefix matches (`issue#11786 `_, `pr#5361 `_, Joao Eduardo Luis) * mon: OSDMonitor: do not assume a session exists in send_incremental() (`issue#14236 `_, `pr#7150 `_, Joao Eduardo Luis) * mon: check for store writeablility before participating in election (`issue#13089 `_, `pr#6144 `_, Sage Weil) * mon: compact full epochs also (`issue#14537 `_, `pr#7446 `_, Kefu Chai) * mon: include min_last_epoch_clean as part of PGMap::print_summary and PGMap::dump (`issue#13198 `_, `pr#6152 `_, Guang Yang) * mon: map_cache can become inaccurate if osd does not receive the osdmaps (`issue#10930 `_, `pr#5773 `_, Kefu Chai) * mon: should not set isvalid = true when cephx_verify_authorizer return false (`issue#13525 `_, `pr#6391 `_, Ruifeng Yang) * osd: Ceph Pools' MAX AVAIL is 0 if some OSDs' weight is 0 (`issue#13840 `_, `pr#6834 `_, Chengyuan Li) * osd: FileStore calls syncfs(2) even it is not supported (`issue#12512 `_, `pr#5530 `_, Kefu Chai) * osd: FileStore: potential memory leak if getattrs fails. (`issue#13597 `_, `pr#6420 `_, xie xingguo) * osd: IO error on kvm/rbd with an erasure coded pool tier (`issue#12012 `_, `pr#5897 `_, Kefu Chai) * osd: OSD::build_past_intervals_parallel() shall reset primary and up_primary when begin a new past_interval. (`issue#13471 `_, `pr#6398 `_, xiexingguo) * osd: ReplicatedBackend: populate recovery_info.size for clone (bug symptom is size mismatch on replicated backend on a clone in scrub) (`issue#12828 `_, `pr#6153 `_, Samuel Just) * osd: ReplicatedPG: wrong result code checking logic during sparse_read (`issue#14151 `_, `pr#7179 `_, xie xingguo) * osd: ReplicatedPG::hit_set_trim osd/ReplicatedPG.cc: 11006: FAILED assert(obc) (`issue#13192 `_, `issue#9732 `_, `issue#12968 `_, `pr#5825 `_, Kefu Chai, Zhiqiang Wang, Samuel Just, David Zafman) * osd: avoid multi set osd_op.outdata in tier pool (`issue#12540 `_, `pr#6060 `_, Xinze Chi) * osd: bug with cache/tiering and snapshot reads (`issue#12748 `_, `pr#6589 `_, Kefu Chai) * osd: ceph osd pool stats broken in hammer (`issue#13843 `_, `pr#7180 `_, BJ Lougee) * osd: ceph-disk prepare fails if device is a symlink (`issue#13438 `_, `pr#7176 `_, Joe Julian) * osd: check for full before changing the cached obc (hammer) (`issue#13098 `_, `pr#6918 `_, Alexey Sheplyakov) * osd: config_opts: increase suicide timeout to 300 to match recovery (`issue#14376 `_, `pr#7236 `_, Samuel Just) * osd: disable filestore_xfs_extsize by default (`issue#14397 `_, `pr#7411 `_, Ken Dreyer) * osd: do not cache unused memory in attrs (`issue#12565 `_, `pr#6499 `_, Xinze Chi, Ning Yao) * osd: dumpling incrementals do not work properly on hammer and newer (`issue#13234 `_, `pr#6132 `_, Samuel Just) * osd: filestore: fix peek_queue for OpSequencer (`issue#13209 `_, `pr#6145 `_, Xinze Chi) * osd: hit set clear repops fired in same epoch as map change -- segfault since they fall into the new interval even though the repops are cleared (`issue#12809 `_, `pr#5890 `_, Samuel Just) * osd: object_info_t::decode() has wrong version (`issue#13462 `_, `pr#6335 `_, David Zafman) * osd: osd/OSD.cc: 2469: FAILED assert(pg_stat_queue.empty()) on shutdown (`issue#14212 `_, `pr#7178 `_, Sage Weil) * osd: osd/PG.cc: 288: FAILED assert(info.last_epoch_started >= info.history.last_epoch_started) (`issue#14015 `_, `pr#7177 `_, David Zafman) * osd: osd/PG.cc: 3837: FAILED assert(0 == "Running incompatible OSD") (`issue#11661 `_, `pr#7206 `_, David Zafman) * osd: osd/ReplicatedPG: Recency fix (`issue#14320 `_, `pr#7207 `_, Sage Weil, Robert LeBlanc) * osd: pg stuck in replay (`issue#13116 `_, `pr#6401 `_, Sage Weil) * osd: race condition detected during send_failures (`issue#13821 `_, `pr#6755 `_, Sage Weil) * osd: randomize scrub times (`issue#10973 `_, `pr#6199 `_, Kefu Chai) * osd: requeue_scrub when kick_object_context_blocked (`issue#12515 `_, `pr#5891 `_, Xinze Chi) * osd: revert: use GMT time for hitsets (`issue#13812 `_, `pr#6644 `_, Loic Dachary) * osd: segfault in agent_work (`issue#13199 `_, `pr#6146 `_, Samuel Just) * osd: should recalc the min_last_epoch_clean when decode PGMap (`issue#13112 `_, `pr#6154 `_, Kefu Chai) * osd: smaller object_info_t xattrs (`issue#14803 `_, `pr#6544 `_, Sage Weil) * osd: we do not ignore notify from down osds (`issue#12990 `_, `pr#6158 `_, Samuel Just) * rbd: QEMU hangs after creating snapshot and stopping VM (`issue#13726 `_, `pr#6586 `_, Jason Dillaman) * rbd: TaskFinisher::cancel should remove event from SafeTimer (`issue#14476 `_, `pr#7417 `_, Douglas Fuller) * rbd: avoid re-writing old-format image header on resize (`issue#13674 `_, `pr#6585 `_, Jason Dillaman) * rbd: fix bench-write (`issue#14225 `_, `pr#7183 `_, Sage Weil) * rbd: rbd-replay does not check for EOF and goes to endless loop (`issue#14452 `_, `pr#7416 `_, Mykola Golub) * rbd: rbd-replay-prep and rbd-replay improvements (`issue#13221 `_, `issue#13220 `_, `issue#13378 `_, `pr#6286 `_, Jason Dillaman) * rbd: verify self-managed snapshot functionality on image create (`issue#13633 `_, `pr#7182 `_, Jason Dillaman) * rgw: Make RGW_MAX_PUT_SIZE configurable (`issue#6999 `_, `pr#7441 `_, Vladislav Odintsov, Yuan Zhou) * rgw: Setting ACL on Object removes ETag (`issue#12955 `_, `pr#6620 `_, Brian Felton) * rgw: backport content-type casing (`issue#12939 `_, `pr#5910 `_, Robin H. Johnson) * rgw: bucket listing hangs on versioned buckets (`issue#12913 `_, `pr#6352 `_, Yehuda Sadeh) * rgw: fix wrong etag calculation during POST on S3 bucket. (`issue#11241 `_, `pr#7442 `_, Vladislav Odintsov, Radoslaw Zarzynski) * rgw: get bucket location returns region name, not region api name (`issue#13458 `_, `pr#6349 `_, Yehuda Sadeh) * rgw: missing handling of encoding-type=url when listing keys in bucket (`issue#12735 `_, `pr#6527 `_, Jeff Weber) * rgw: orphan tool should be careful about removing head objects (`issue#12958 `_, `pr#6351 `_, Yehuda Sadeh) * rgw: orphans finish segfaults (`issue#13824 `_, `pr#7186 `_, Igor Fedotov) * rgw: rgw-admin: document orphans commands in usage (`issue#14516 `_, `pr#7526 `_, Yehuda Sadeh) * rgw: swift API returns more than real object count and bytes used when retrieving account metadata (`issue#13140 `_, `pr#6512 `_, Sangdi Xu) * rgw: swift use Civetweb ssl can not get right url (`issue#13628 `_, `pr#6491 `_, Weijun Duan) * rgw: value of Swift API's X-Object-Manifest header is not url_decoded during segment look up (`issue#12728 `_, `pr#6353 `_, Radoslaw Zarzynski) * tests: fixed broken Makefiles after integration of ttng into rados (`issue#13210 `_, `pr#6322 `_, Sebastien Ponce) * tests: fsx failed to compile (`issue#14384 `_, `pr#7501 `_, Greg Farnum) * tests: notification slave needs to wait for master (`issue#13810 `_, `pr#7226 `_, Jason Dillaman) * tests: qa: remove legacy OS support from rbd/qemu-iotests (`issue#13483 `_, `issue#14385 `_, `pr#7252 `_, Vasu Kulkarni, Jason Dillaman) * tests: testprofile must be removed before it is re-created (`issue#13664 `_, `pr#6450 `_, Loic Dachary) * tools: ceph-monstore-tool must do out_store.close() (`issue#10093 `_, `pr#7347 `_, huangjun) * tools: heavy memory shuffling in rados bench (`issue#12946 `_, `pr#5810 `_, Piotr Dałek) * tools: race condition in rados bench (`issue#12947 `_, `pr#6791 `_, Piotr Dałek) * tools: tool for artificially inflate the leveldb of the mon store for testing purposes (`issue#10093 `_, `issue#11815 `_, `issue#14217 `_, `pr#7412 `_, Cilang Zhao, Bo Cai, Kefu Chai, huangjun, Joao Eduardo Luis) v0.94.5 Hammer ============== This Hammer point release fixes a critical regression in librbd that can cause QEMU/KVM to crash when caching is enabled on images that have been cloned. All v0.94.4 Hammer users are strongly encouraged to upgrade. Notable Changes --------------- * librbd: potential assertion failure during cache read (`issue#13559 `_, `pr#6348 `_, Jason Dillaman) * osd: osd/ReplicatedPG: remove stray debug line (`issue#13455 `_, `pr#6362 `_, Sage Weil) * tests: qemu workunit refers to apt-mirror.front.sepia.ceph.com (`issue#13420 `_, `pr#6330 `_, Yuan Zhou) For more detailed information, see :download:`the complete changelog `. v0.94.4 Hammer ============== This Hammer point release fixes several important bugs in Hammer, as well as fixing interoperability issues that are required before an upgrade to Infernalis. That is, all users of earlier version of Hammer or any version of Firefly will first need to upgrade to hammer v0.94.4 or later before upgrading to Infernalis (or future releases). All v0.94.x Hammer users are strongly encouraged to upgrade. Notable Changes --------------- * build/ops: ceph.spec.in: 50-rbd.rules conditional is wrong (`issue#12166 `_, `pr#5207 `_, Nathan Cutler) * build/ops: ceph.spec.in: ceph-common needs python-argparse on older distros, but doesn't require it (`issue#12034 `_, `pr#5216 `_, Nathan Cutler) * build/ops: ceph.spec.in: radosgw requires apache for SUSE only -- makes no sense (`issue#12358 `_, `pr#5411 `_, Nathan Cutler) * build/ops: ceph.spec.in: rpm: cephfs_java not fully conditionalized (`issue#11991 `_, `pr#5202 `_, Nathan Cutler) * build/ops: ceph.spec.in: rpm: not possible to turn off Java (`issue#11992 `_, `pr#5203 `_, Owen Synge) * build/ops: ceph.spec.in: running fdupes unnecessarily (`issue#12301 `_, `pr#5223 `_, Nathan Cutler) * build/ops: ceph.spec.in: snappy-devel for all supported distros (`issue#12361 `_, `pr#5264 `_, Nathan Cutler) * build/ops: ceph.spec.in: SUSE/openSUSE builds need libbz2-devel (`issue#11629 `_, `pr#5204 `_, Nathan Cutler) * build/ops: ceph.spec.in: useless %py_requires breaks SLE11-SP3 build (`issue#12351 `_, `pr#5412 `_, Nathan Cutler) * build/ops: error in ext_mime_map_init() when /etc/mime.types is missing (`issue#11864 `_, `pr#5385 `_, Ken Dreyer) * build/ops: upstart: limit respawn to 3 in 30 mins (instead of 5 in 30s) (`issue#11798 `_, `pr#5930 `_, Sage Weil) * build/ops: With root as default user, unable to have multiple RGW instances running (`issue#10927 `_, `pr#6161 `_, Sage Weil) * build/ops: With root as default user, unable to have multiple RGW instances running (`issue#11140 `_, `pr#6161 `_, Sage Weil) * build/ops: With root as default user, unable to have multiple RGW instances running (`issue#11686 `_, `pr#6161 `_, Sage Weil) * build/ops: With root as default user, unable to have multiple RGW instances running (`issue#12407 `_, `pr#6161 `_, Sage Weil) * cli: ceph: cli throws exception on unrecognized errno (`issue#11354 `_, `pr#5368 `_, Kefu Chai) * cli: ceph tell: broken error message / misleading hinting (`issue#11101 `_, `pr#5371 `_, Kefu Chai) * common: arm: all programs that link to librados2 hang forever on startup (`issue#12505 `_, `pr#5366 `_, Boris Ranto) * common: buffer: critical bufferlist::zero bug (`issue#12252 `_, `pr#5365 `_, Haomai Wang) * common: ceph-object-corpus: add 0.94.2-207-g88e7ee7 hammer objects (`issue#13070 `_, `pr#5551 `_, Sage Weil) * common: do not insert emtpy ptr when rebuild emtpy bufferlist (`issue#12775 `_, `pr#5764 `_, Xinze Chi) * common: [ FAILED ] TestLibRBD.BlockingAIO (`issue#12479 `_, `pr#5768 `_, Jason Dillaman) * common: LibCephFS.GetPoolId failure (`issue#12598 `_, `pr#5887 `_, Yan, Zheng) * common: Memory leak in Mutex.cc, pthread_mutexattr_init without pthread_mutexattr_destroy (`issue#11762 `_, `pr#5378 `_, Ketor Meng) * common: object_map_update fails with -EINVAL return code (`issue#12611 `_, `pr#5559 `_, Jason Dillaman) * common: Pipe: Drop connect_seq increase line (`issue#13093 `_, `pr#5908 `_, Haomai Wang) * common: recursive lock of md_config_t (0) (`issue#12614 `_, `pr#5759 `_, Josh Durgin) * crush: ceph osd crush reweight-subtree does not reweight parent node (`issue#11855 `_, `pr#5374 `_, Sage Weil) * doc: update docs to point to download.ceph.com (`issue#13162 `_, `pr#6156 `_, Alfredo Deza) * fs: ceph-fuse 0.94.2-1trusty segfaults / aborts (`issue#12297 `_, `pr#5381 `_, Greg Farnum) * fs: segfault launching ceph-fuse with bad --name (`issue#12417 `_, `pr#5382 `_, John Spray) * librados: Change radosgw pools default crush ruleset (`issue#11640 `_, `pr#5754 `_, Yuan Zhou) * librbd: correct issues discovered via lockdep / helgrind (`issue#12345 `_, `pr#5296 `_, Jason Dillaman) * librbd: Crash during TestInternal.MultipleResize (`issue#12664 `_, `pr#5769 `_, Jason Dillaman) * librbd: deadlock during cooperative exclusive lock transition (`issue#11537 `_, `pr#5319 `_, Jason Dillaman) * librbd: Possible crash while concurrently writing and shrinking an image (`issue#11743 `_, `pr#5318 `_, Jason Dillaman) * mon: add a cache layer over MonitorDBStore (`issue#12638 `_, `pr#5697 `_, Kefu Chai) * mon: fix crush testing for new pools (`issue#13400 `_, `pr#6192 `_, Sage Weil) * mon: get pools health'info have error (`issue#12402 `_, `pr#5369 `_, renhwztetecs) * mon: implicit erasure code crush ruleset is not validated (`issue#11814 `_, `pr#5276 `_, Loic Dachary) * mon: PaxosService: call post_refresh() instead of post_paxos_update() (`issue#11470 `_, `pr#5359 `_, Joao Eduardo Luis) * mon: pgmonitor: wrong at/near target max“ reporting (`issue#12401 `_, `pr#5370 `_, huangjun) * mon: register_new_pgs() should check ruleno instead of its index (`issue#12210 `_, `pr#5377 `_, Xinze Chi) * mon: Show osd as NONE in ceph osd map output (`issue#11820 `_, `pr#5376 `_, Shylesh Kumar) * mon: the output is wrong when runing ceph osd reweight (`issue#12251 `_, `pr#5372 `_, Joao Eduardo Luis) * osd: allow peek_map_epoch to return an error (`issue#13060 `_, `pr#5892 `_, Sage Weil) * osd: cache agent is idle although one object is left in the cache (`issue#12673 `_, `pr#5765 `_, Loic Dachary) * osd: copy-from doesn't preserve truncate_{seq,size} (`issue#12551 `_, `pr#5885 `_, Samuel Just) * osd: crash creating/deleting pools (`issue#12429 `_, `pr#5527 `_, John Spray) * osd: fix repair when recorded digest is wrong (`issue#12577 `_, `pr#5468 `_, Sage Weil) * osd: include/ceph_features: define HAMMER_0_94_4 feature (`issue#13026 `_, `pr#5687 `_, Sage Weil) * osd: is_new_interval() fixes (`issue#10399 `_, `pr#5691 `_, Jason Dillaman) * osd: is_new_interval() fixes (`issue#11771 `_, `pr#5691 `_, Jason Dillaman) * osd: long standing slow requests: connection->session->waiting_for_map->connection ref cycle (`issue#12338 `_, `pr#5761 `_, Samuel Just) * osd: Mutex Assert from PipeConnection::try_get_pipe (`issue#12437 `_, `pr#5758 `_, David Zafman) * osd: pg_interval_t::check_new_interval - for ec pool, should not rely on min_size to determine if the PG was active at the interval (`issue#12162 `_, `pr#5373 `_, Guang G Yang) * osd: PGLog.cc: 732: FAILED assert(log.log.size() == log_keys_debug.size()) (`issue#12652 `_, `pr#5763 `_, Sage Weil) * osd: PGLog::proc_replica_log: correctly handle case where entries between olog.head and log.tail were split out (`issue#11358 `_, `pr#5380 `_, Samuel Just) * osd: read on chunk-aligned xattr not handled (`issue#12309 `_, `pr#5367 `_, Sage Weil) * osd: suicide timeout during peering - search for missing objects (`issue#12523 `_, `pr#5762 `_, Guang G Yang) * osd: WBThrottle::clear_object: signal on cond when we reduce throttle values (`issue#12223 `_, `pr#5757 `_, Samuel Just) * rbd: crash during shutdown after writeback blocked by IO errors (`issue#12597 `_, `pr#5767 `_, Jianpeng Ma) * rgw: add delimiter to prefix only when path is specified (`issue#12960 `_, `pr#5860 `_, Sylvain Baubeau) * rgw: create a tool for orphaned objects cleanup (`issue#9604 `_, `pr#5717 `_, Yehuda Sadeh) * rgw: don't preserve acls when copying object (`issue#11563 `_, `pr#6039 `_, Yehuda Sadeh) * rgw: don't preserve acls when copying object (`issue#12370 `_, `pr#6039 `_, Yehuda Sadeh) * rgw: don't preserve acls when copying object (`issue#13015 `_, `pr#6039 `_, Yehuda Sadeh) * rgw: Ensure that swift keys don't include backslashes (`issue#7647 `_, `pr#5716 `_, Yehuda Sadeh) * rgw: GWWatcher::handle_error -> common/Mutex.cc: 95: FAILED assert(r == 0) (`issue#12208 `_, `pr#6164 `_, Yehuda Sadeh) * rgw: HTTP return code is not being logged by CivetWeb (`issue#12432 `_, `pr#5498 `_, Yehuda Sadeh) * rgw: init_rados failed leads to repeated delete (`issue#12978 `_, `pr#6165 `_, Xiaowei Chen) * rgw: init some manifest fields when handling explicit objs (`issue#11455 `_, `pr#5732 `_, Yehuda Sadeh) * rgw: Keystone Fernet tokens break auth (`issue#12761 `_, `pr#6162 `_, Abhishek Lekshmanan) * rgw: region data still exist in region-map after region-map update (`issue#12964 `_, `pr#6163 `_, dwj192) * rgw: remove trailing :port from host for purposes of subdomain matching (`issue#12353 `_, `pr#6042 `_, Yehuda Sadeh) * rgw: rest-bench common/WorkQueue.cc: 54: FAILED assert(_threads.empty()) (`issue#3896 `_, `pr#5383 `_, huangjun) * rgw: returns requested bucket name raw in Bucket response header (`issue#12537 `_, `pr#5715 `_, Yehuda Sadeh) * rgw: segmentation fault when rgw_gc_max_objs > HASH_PRIME (`issue#12630 `_, `pr#5719 `_, Ruifeng Yang) * rgw: segments are read during HEAD on Swift DLO (`issue#12780 `_, `pr#6160 `_, Yehuda Sadeh) * rgw: setting max number of buckets for user via ceph.conf option (`issue#12714 `_, `pr#6166 `_, Vikhyat Umrao) * rgw: Swift API: X-Trans-Id header is wrongly formatted (`issue#12108 `_, `pr#5721 `_, Radoslaw Zarzynski) * rgw: testGetContentType and testHead failed (`issue#11091 `_, `pr#5718 `_, Radoslaw Zarzynski) * rgw: testGetContentType and testHead failed (`issue#11438 `_, `pr#5718 `_, Radoslaw Zarzynski) * rgw: testGetContentType and testHead failed (`issue#12157 `_, `pr#5718 `_, Radoslaw Zarzynski) * rgw: testGetContentType and testHead failed (`issue#12158 `_, `pr#5718 `_, Radoslaw Zarzynski) * rgw: testGetContentType and testHead failed (`issue#12363 `_, `pr#5718 `_, Radoslaw Zarzynski) * rgw: the arguments 'domain' should not be assigned when return false (`issue#12629 `_, `pr#5720 `_, Ruifeng Yang) * tests: qa/workunits/cephtool/test.sh: don't assume crash_replay_interval=45 (`issue#13406 `_, `pr#6172 `_, Sage Weil) * tests: TEST_crush_rule_create_erasure consistently fails on i386 builder (`issue#12419 `_, `pr#6201 `_, Loic Dachary) * tools: ceph-disk zap should ensure block device (`issue#11272 `_, `pr#5755 `_, Loic Dachary) For more detailed information, see :download:`the complete changelog `. v0.94.3 Hammer ============== This Hammer point release fixes a critical (though rare) data corruption bug that could be triggered when logs are rotated via SIGHUP. It also fixes a range of other important bugs in the OSD, monitor, RGW, RGW, and CephFS. All v0.94.x Hammer users are strongly encouraged to upgrade. Upgrading --------- * The ``pg ls-by-{pool,primary,osd}`` commands and ``pg ls`` now take the argument ``recovering`` instead of ``recovery`` in order to include the recovering pgs in the listed pgs. Notable Changes --------------- * librbd: aio calls may block (`issue#11770 `_, `pr#4875 `_, Jason Dillaman) * osd: make the all osd/filestore thread pool suicide timeouts separately configurable (`issue#11701 `_, `pr#5159 `_, Samuel Just) * mon: ceph fails to compile with boost 1.58 (`issue#11982 `_, `pr#5122 `_, Kefu Chai) * tests: TEST_crush_reject_empty must not run a mon (`issue#12285,11975 `_, `pr#5208 `_, Kefu Chai) * osd: FAILED assert(!old_value.deleted()) in upgrade:giant-x-hammer-distro-basic-multi run (`issue#11983 `_, `pr#5121 `_, Samuel Just) * build/ops: linking ceph to tcmalloc causes segfault on SUSE SLE11-SP3 (`issue#12368 `_, `pr#5265 `_, Thorsten Behrens) * common: utf8 and old gcc breakage on RHEL6.5 (`issue#7387 `_, `pr#4687 `_, Kefu Chai) * crush: take crashes due to invalid arg (`issue#11740 `_, `pr#4891 `_, Sage Weil) * rgw: need conversion tool to handle fixes following #11974 (`issue#12502 `_, `pr#5384 `_, Yehuda Sadeh) * rgw: Swift API: support for 202 Accepted response code on container creation (`issue#12299 `_, `pr#5214 `_, Radoslaw Zarzynski) * common: Log::reopen_log_file: take m_flush_mutex (`issue#12520 `_, `pr#5405 `_, Samuel Just) * rgw: Properly respond to the Connection header with Civetweb (`issue#12398 `_, `pr#5284 `_, Wido den Hollander) * rgw: multipart list part response returns incorrect field (`issue#12399 `_, `pr#5285 `_, Henry Chang) * build/ops: ceph.spec.in: 95-ceph-osd.rules, mount.ceph, and mount.fuse.ceph not installed properly on SUSE (`issue#12397 `_, `pr#5283 `_, Nathan Cutler) * rgw: radosgw-admin dumps user info twice (`issue#12400 `_, `pr#5286 `_, guce) * doc: fix doc build (`issue#12180 `_, `pr#5095 `_, Kefu Chai) * tests: backport 11493 fixes, and test, preventing ec cache pools (`issue#12314 `_, `pr#4961 `_, Samuel Just) * rgw: does not send Date HTTP header when civetweb frontend is used (`issue#11872 `_, `pr#5228 `_, Radoslaw Zarzynski) * mon: pg ls is broken (`issue#11910 `_, `pr#5160 `_, Kefu Chai) * librbd: A client opening an image mid-resize can result in the object map being invalidated (`issue#12237 `_, `pr#5279 `_, Jason Dillaman) * doc: missing man pages for ceph-create-keys, ceph-disk-* (`issue#11862 `_, `pr#4846 `_, Nathan Cutler) * tools: ceph-post-file fails on rhel7 (`issue#11876 `_, `pr#5038 `_, Sage Weil) * build/ops: rcceph script is buggy (`issue#12090 `_, `pr#5028 `_, Owen Synge) * rgw: Bucket header is enclosed by quotes (`issue#11874 `_, `pr#4862 `_, Wido den Hollander) * build/ops: packaging: add SuSEfirewall2 service files (`issue#12092 `_, `pr#5030 `_, Tim Serong) * rgw: Keystone PKI token expiration is not enforced (`issue#11722 `_, `pr#4884 `_, Anton Aksola) * build/ops: debian/control: ceph-common (>> 0.94.2) must be >= 0.94.2-2 (`issue#12529,11998 `_, `pr#5417 `_, Loic Dachary) * mon: Clock skew causes missing summary and confuses Calamari (`issue#11879 `_, `pr#4868 `_, Thorsten Behrens) * rgw: rados objects wronly deleted (`issue#12099 `_, `pr#5117 `_, wuxingyi) * tests: kernel_untar_build fails on EL7 (`issue#12098 `_, `pr#5119 `_, Greg Farnum) * fs: Fh ref count will leak if readahead does not need to do read from osd (`issue#12319 `_, `pr#5427 `_, Zhi Zhang) * mon: OSDMonitor: allow addition of cache pool with non-empty snaps with co… (`issue#12595 `_, `pr#5252 `_, Samuel Just) * mon: MDSMonitor: handle MDSBeacon messages properly (`issue#11979 `_, `pr#5123 `_, Kefu Chai) * tools: ceph-disk: get_partition_type fails on /dev/cciss... (`issue#11760 `_, `pr#4892 `_, islepnev) * build/ops: max files open limit for OSD daemon is too low (`issue#12087 `_, `pr#5026 `_, Owen Synge) * mon: add an "osd crush tree" command (`issue#11833 `_, `pr#5248 `_, Kefu Chai) * mon: mon crashes when "ceph osd tree 85 --format json" (`issue#11975 `_, `pr#4936 `_, Kefu Chai) * build/ops: ceph / ceph-dbg steal ceph-objecstore-tool from ceph-test / ceph-test-dbg (`issue#11806 `_, `pr#5069 `_, Loic Dachary) * rgw: DragonDisk fails to create directories via S3: MissingContentLength (`issue#12042 `_, `pr#5118 `_, Yehuda Sadeh) * build/ops: /usr/bin/ceph from ceph-common is broken without installing ceph (`issue#11998 `_, `pr#5206 `_, Ken Dreyer) * build/ops: systemd: Increase max files open limit for OSD daemon (`issue#11964 `_, `pr#5040 `_, Owen Synge) * build/ops: rgw/logrotate.conf calls service with wrong init script name (`issue#12044 `_, `pr#5055 `_, wuxingyi) * common: OPT_INT option interprets 3221225472 as -1073741824, and crashes in Throttle::Throttle() (`issue#11738 `_, `pr#4889 `_, Kefu Chai) * doc: doc/release-notes: v0.94.2 (`issue#11492 `_, `pr#4934 `_, Sage Weil) * common: admin_socket: close socket descriptor in destructor (`issue#11706 `_, `pr#4657 `_, Jon Bernard) * rgw: Object copy bug (`issue#11755 `_, `pr#4885 `_, Javier M. Mellid) * rgw: empty json response when getting user quota (`issue#12245 `_, `pr#5237 `_, wuxingyi) * fs: cephfs Dumper tries to load whole journal into memory at once (`issue#11999 `_, `pr#5120 `_, John Spray) * rgw: Fix tool for #11442 does not correctly fix objects created via multipart uploads (`issue#12242 `_, `pr#5229 `_, Yehuda Sadeh) * rgw: Civetweb RGW appears to report full size of object as downloaded when only partially downloaded (`issue#12243 `_, `pr#5231 `_, Yehuda Sadeh) * osd: stuck incomplete (`issue#12362 `_, `pr#5269 `_, Samuel Just) * osd: start_flush: filter out removed snaps before determining snapc's (`issue#11911 `_, `pr#4899 `_, Samuel Just) * librbd: internal.cc: 1967: FAILED assert(watchers.size() == 1) (`issue#12239 `_, `pr#5243 `_, Jason Dillaman) * librbd: new QA client upgrade tests (`issue#12109 `_, `pr#5046 `_, Jason Dillaman) * librbd: [ FAILED ] TestLibRBD.ExclusiveLockTransition (`issue#12238 `_, `pr#5241 `_, Jason Dillaman) * rgw: Swift API: XML document generated in response for GET on account does not contain account name (`issue#12323 `_, `pr#5227 `_, Radoslaw Zarzynski) * rgw: keystone does not support chunked input (`issue#12322 `_, `pr#5226 `_, Hervé Rousseau) * mds: MDS is crashed (mds/CDir.cc: 1391: FAILED assert(!is_complete())) (`issue#11737 `_, `pr#4886 `_, Yan, Zheng) * cli: ceph: cli interactive mode does not understand quotes (`issue#11736 `_, `pr#4776 `_, Kefu Chai) * librbd: add valgrind memory checks for unit tests (`issue#12384 `_, `pr#5280 `_, Zhiqiang Wang) * build/ops: admin/build-doc: script fails silently under certain circumstances (`issue#11902 `_, `pr#4877 `_, John Spray) * osd: Fixes for rados ops with snaps (`issue#11908 `_, `pr#4902 `_, Samuel Just) * build/ops: ceph.spec.in: ceph-common subpackage def needs tweaking for SUSE/openSUSE (`issue#12308 `_, `pr#4883 `_, Nathan Cutler) * fs: client: reference counting 'struct Fh' (`issue#12088 `_, `pr#5222 `_, Yan, Zheng) * build/ops: ceph.spec: update OpenSUSE BuildRequires (`issue#11611 `_, `pr#4667 `_, Loic Dachary) For more detailed information, see :download:`the complete changelog `. v0.94.2 Hammer ============== This Hammer point release fixes a few critical bugs in RGW that can prevent objects starting with underscore from behaving properly and that prevent garbage collection of deleted objects when using the Civetweb standalone mode. All v0.94.x Hammer users are strongly encouraged to upgrade, and to make note of the repair procedure below if RGW is in use. Upgrading from previous Hammer release -------------------------------------- Bug #11442 introduced a change that made rgw objects that start with underscore incompatible with previous versions. The fix to that bug reverts to the previous behavior. In order to be able to access objects that start with an underscore and were created in prior Hammer releases, following the upgrade it is required to run (for each affected bucket):: $ radosgw-admin bucket check --check-head-obj-locator \ --bucket= [--fix] Notable changes --------------- * build: compilation error: No high-precision counter available (armhf, powerpc..) (#11432, James Page) * ceph-dencoder links to libtcmalloc, and shouldn't (#10691, Boris Ranto) * ceph-disk: disk zap sgdisk invocation (#11143, Owen Synge) * ceph-disk: use a new disk as journal disk,ceph-disk prepare fail (#10983, Loic Dachary) * ceph-objectstore-tool should be in the ceph server package (#11376, Ken Dreyer) * librados: can get stuck in redirect loop if osdmap epoch == last_force_op_resend (#11026, Jianpeng Ma) * librbd: A retransmit of proxied flatten request can result in -EINVAL (Jason Dillaman) * librbd: ImageWatcher should cancel in-flight ops on watch error (#11363, Jason Dillaman) * librbd: Objectcacher setting max object counts too low (#7385, Jason Dillaman) * librbd: Periodic failure of TestLibRBD.DiffIterateStress (#11369, Jason Dillaman) * librbd: Queued AIO reference counters not properly updated (#11478, Jason Dillaman) * librbd: deadlock in image refresh (#5488, Jason Dillaman) * librbd: notification race condition on snap_create (#11342, Jason Dillaman) * mds: Hammer uclient checking (#11510, John Spray) * mds: remove caps from revoking list when caps are voluntarily released (#11482, Yan, Zheng) * messenger: double clear of pipe in reaper (#11381, Haomai Wang) * mon: Total size of OSDs is a maginitude less than it is supposed to be. (#11534, Zhe Zhang) * osd: don't check order in finish_proxy_read (#11211, Zhiqiang Wang) * osd: handle old semi-deleted pgs after upgrade (#11429, Samuel Just) * osd: object creation by write cannot use an offset on an erasure coded pool (#11507, Jianpeng Ma) * rgw: Improve rgw HEAD request by avoiding read the body of the first chunk (#11001, Guang Yang) * rgw: civetweb is hitting a limit (number of threads 1024) (#10243, Yehuda Sadeh) * rgw: civetweb should use unique request id (#10295, Orit Wasserman) * rgw: critical fixes for hammer (#11447, #11442, Yehuda Sadeh) * rgw: fix swift COPY headers (#10662, #10663, #11087, #10645, Radoslaw Zarzynski) * rgw: improve performance for large object (multiple chunks) GET (#11322, Guang Yang) * rgw: init-radosgw: run RGW as root (#11453, Ken Dreyer) * rgw: keystone token cache does not work correctly (#11125, Yehuda Sadeh) * rgw: make quota/gc thread configurable for starting (#11047, Guang Yang) * rgw: make swift responses of RGW return last-modified, content-length, x-trans-id headers.(#10650, Radoslaw Zarzynski) * rgw: merge manifests correctly when there's prefix override (#11622, Yehuda Sadeh) * rgw: quota not respected in POST object (#11323, Sergey Arkhipov) * rgw: restore buffer of multipart upload after EEXIST (#11604, Yehuda Sadeh) * rgw: shouldn't need to disable rgw_socket_path if frontend is configured (#11160, Yehuda Sadeh) * rgw: swift: Response header of GET request for container does not contain X-Container-Object-Count, X-Container-Bytes-Used and x-trans-id headers (#10666, Dmytro Iurchenko) * rgw: swift: Response header of POST request for object does not contain content-length and x-trans-id headers (#10661, Radoslaw Zarzynski) * rgw: swift: response for GET/HEAD on container does not contain the X-Timestamp header (#10938, Radoslaw Zarzynski) * rgw: swift: response for PUT on /container does not contain the mandatory Content-Length header when FCGI is used (#11036, #10971, Radoslaw Zarzynski) * rgw: swift: wrong handling of empty metadata on Swift container (#11088, Radoslaw Zarzynski) * tests: TestFlatIndex.cc races with TestLFNIndex.cc (#11217, Xinze Chi) * tests: ceph-helpers kill_daemons fails when kill fails (#11398, Loic Dachary) For more detailed information, see :download:`the complete changelog `. v0.94.1 Hammer ============== This bug fix release fixes a few critical issues with CRUSH. The most important addresses a bug in feature bit enforcement that may prevent pre-hammer clients from communicating with the cluster during an upgrade. This only manifests in some cases (for example, when the 'rack' type is in use in the CRUSH map, and possibly other cases), but for safety we strongly recommend that all users use 0.94.1 instead of 0.94 when upgrading. There is also a fix in the new straw2 buckets when OSD weights are 0. We recommend that all v0.94 users upgrade. Notable changes --------------- * crush: fix divide-by-0 in straw2 (#11357 Sage Weil) * crush: fix has_v4_buckets (#11364 Sage Weil) * osd: fix negative degraded objects during backfilling (#7737 Guang Yang) For more detailed information, see :download:`the complete changelog `. v0.94 Hammer ============ This major release is expected to form the basis of the next long-term stable series. It is intended to supersede v0.80.x Firefly. Highlights since Giant include: * *RADOS Performance*: a range of improvements have been made in the OSD and client-side librados code that improve the throughput on flash backends and improve parallelism and scaling on fast machines. * *Simplified RGW deployment*: the ceph-deploy tool now has a new 'ceph-deploy rgw create HOST' command that quickly deploys a instance of the S3/Swift gateway using the embedded Civetweb server. This is vastly simpler than the previous Apache-based deployment. There are a few rough edges (e.g., around SSL support) but we encourage users to try `the new method`_. * *RGW object versioning*: RGW now supports the S3 object versioning API, which preserves old version of objects instead of overwriting them. * *RGW bucket sharding*: RGW can now shard the bucket index for large buckets across, improving performance for very large buckets. * *RBD object maps*: RBD now has an object map function that tracks which parts of the image are allocating, improving performance for clones and for commands like export and delete. * *RBD mandatory locking*: RBD has a new mandatory locking framework (still disabled by default) that adds additional safeguards to prevent multiple clients from using the same image at the same time. * *RBD copy-on-read*: RBD now supports copy-on-read for image clones, improving performance for some workloads. * *CephFS snapshot improvements*: Many many bugs have been fixed with CephFS snapshots. Although they are still disabled by default, stability has improved significantly. * *CephFS Recovery tools*: We have built some journal recovery and diagnostic tools. Stability and performance of single-MDS systems is vastly improved in Giant, and more improvements have been made now in Hammer. Although we still recommend caution when storing important data in CephFS, we do encourage testing for non-critical workloads so that we can better guage the feature, usability, performance, and stability gaps. * *CRUSH improvements*: We have added a new straw2 bucket algorithm that reduces the amount of data migration required when changes are made to the cluster. * *Shingled erasure codes (SHEC)*: The OSDs now have experimental support for shingled erasure codes, which allow a small amount of additional storage to be traded for improved recovery performance. * *RADOS cache tiering*: A series of changes have been made in the cache tiering code that improve performance and reduce latency. * *RDMA support*: There is now experimental support the RDMA via the Accelio (libxio) library. * *New administrator commands*: The 'ceph osd df' command shows pertinent details on OSD disk utilizations. The 'ceph pg ls ...' command makes it much simpler to query PG states while diagnosing cluster issues. .. _the new method: ../start/quick-ceph-deploy/#add-an-rgw-instance Other highlights since Firefly include: * *CephFS*: we have fixed a raft of bugs in CephFS and built some basic journal recovery and diagnostic tools. Stability and performance of single-MDS systems is vastly improved in Giant. Although we do not yet recommend CephFS for production deployments, we do encourage testing for non-critical workloads so that we can better guage the feature, usability, performance, and stability gaps. * *Local Recovery Codes*: the OSDs now support an erasure-coding scheme that stores some additional data blocks to reduce the IO required to recover from single OSD failures. * *Degraded vs misplaced*: the Ceph health reports from 'ceph -s' and related commands now make a distinction between data that is degraded (there are fewer than the desired number of copies) and data that is misplaced (stored in the wrong location in the cluster). The distinction is important because the latter does not compromise data safety. * *Tiering improvements*: we have made several improvements to the cache tiering implementation that improve performance. Most notably, objects are not promoted into the cache tier by a single read; they must be found to be sufficiently hot before that happens. * *Monitor performance*: the monitors now perform writes to the local data store asynchronously, improving overall responsiveness. * *Recovery tools*: the ceph-objectstore-tool is greatly expanded to allow manipulation of an individual OSDs data store for debugging and repair purposes. This is most heavily used by our QA infrastructure to exercise recovery code. I would like to take this opportunity to call out the amazing growth in contributors to Ceph beyond the core development team from Inktank. Hammer features major new features and improvements from Intel, Fujitsu, UnitedStack, Yahoo, UbuntuKylin, CohortFS, Mellanox, CERN, Deutsche Telekom, Mirantis, and SanDisk. Dedication ---------- This release is dedicated in memoriam to Sandon Van Ness, aka Houkouonchi, who unexpectedly passed away a few weeks ago. Sandon was responsible for maintaining the large and complex Sepia lab that houses the Ceph project's build and test infrastructure. His efforts have made an important impact on our ability to reliably test Ceph with a relatively small group of people. He was a valued member of the team and we will miss him. H is also for Houkouonchi. Upgrading --------- * If your existing cluster is running a version older than v0.80.x Firefly, please first upgrade to the latest Firefly release before moving on to Giant. We have not tested upgrades directly from Emperor, Dumpling, or older releases. We *have* tested: * Firefly to Hammer * Giant to Hammer * Dumpling to Firefly to Hammer * Please upgrade daemons in the following order: #. Monitors #. OSDs #. MDSs and/or radosgw Note that the relative ordering of OSDs and monitors should not matter, but we primarily tested upgrading monitors first. * The ceph-osd daemons will perform a disk-format upgrade improve the PG metadata layout and to repair a minor bug in the on-disk format. It may take a minute or two for this to complete, depending on how many objects are stored on the node; do not be alarmed if they do not marked "up" by the cluster immediately after starting. * If upgrading from v0.93, set osd enable degraded writes = false on all osds prior to upgrading. The degraded writes feature has been reverted due to 11155. * The LTTNG tracing in librbd and librados is disabled in the release packages until we find a way to avoid violating distro security policies when linking libust. Upgrading from v0.87.x Giant ---------------------------- * librbd and librados include lttng tracepoints on distros with liblttng 2.4 or later (only Ubuntu Trusty for the ceph.com packages). When running a daemon that uses these libraries, i.e. an application that calls fork(2) or clone(2) without exec(3), you must set LD_PRELOAD=liblttng-ust-fork.so.0 to prevent a crash in the lttng atexit handler when the process exits. The only ceph tool that requires this is rbd-fuse. * If rgw_socket_path is defined and rgw_frontends defines a socket_port and socket_host, we now allow the rgw_frontends settings to take precedence. This change should only affect users who have made non-standard changes to their radosgw configuration. * If you are upgrading specifically from v0.92, you must stop all OSD daemons and flush their journals (``ceph-osd -i NNN --flush-journal``) before upgrading. There was a transaction encoding bug in v0.92 that broke compatibility. Upgrading from v0.93, v0.91, or anything earlier is safe. * The experimental 'keyvaluestore-dev' OSD backend has been renamed 'keyvaluestore' (for simplicity) and marked as experimental. To enable this untested feature and acknowledge that you understand that it is untested and may destroy data, you need to add the following to your ceph.conf:: enable experimental unrecoverable data corrupting featuers = keyvaluestore * The following librados C API function calls take a 'flags' argument whose value is now correctly interpreted: rados_write_op_operate() rados_aio_write_op_operate() rados_read_op_operate() rados_aio_read_op_operate() The flags were not correctly being translated from the librados constants to the internal values. Now they are. Any code that is passing flags to these methods should be audited to ensure that they are using the correct LIBRADOS_OP_FLAG_* constants. * The 'rados' CLI 'copy' and 'cppool' commands now use the copy-from operation, which means the latest CLI cannot run these commands against pre-firefly OSDs. * The librados watch/notify API now includes a watch_flush() operation to flush the async queue of notify operations. This should be called by any watch/notify user prior to rados_shutdown(). * The 'category' field for objects has been removed. This was originally added to track PG stat summations over different categories of objects for use by radosgw. It is no longer has any known users and is prone to abuse because it can lead to a pg_stat_t structure that is unbounded. The librados API calls that accept this field now ignore it, and the OSD no longers tracks the per-category summations. * The output for 'rados df' has changed. The 'category' level has been eliminated, so there is now a single stat object per pool. The structure of the JSON output is different, and the plaintext output has one less column. * The 'rados create [category]' optional category argument is no longer supported or recognized. * rados.py's Rados class no longer has a __del__ method; it was causing problems on interpreter shutdown and use of threads. If your code has Rados objects with limited lifetimes and you're concerned about locked resources, call Rados.shutdown() explicitly. * There is a new version of the librados watch/notify API with vastly improved semantics. Any applications using this interface are encouraged to migrate to the new API. The old API calls are marked as deprecated and will eventually be removed. * The librados rados_unwatch() call used to be safe to call on an invalid handle. The new version has undefined behavior when passed a bogus value (for example, when rados_watch() returns an error and handle is not defined). * The structure of the formatted 'pg stat' command is changed for the portion that counts states by name to avoid using the '+' character (which appears in state names) as part of the XML token (it is not legal). * Previously, the formatted output of 'ceph pg stat -f ...' was a full pg dump that included all metadata about all PGs in the system. It is now a concise summary of high-level PG stats, just like the unformatted 'ceph pg stat' command. * All JSON dumps of floating point values were incorrecting surrounding the value with quotes. These quotes have been removed. Any consumer of structured JSON output that was consuming the floating point values was previously having to interpret the quoted string and will most likely need to be fixed to take the unquoted number. * New ability to list all objects from all namespaces that can fail or return incomplete results when not all OSDs have been upgraded. Features rados --all ls, rados cppool, rados export, rados cache-flush-evict-all and rados cache-try-flush-evict-all can also fail or return incomplete results. * Due to a change in the Linux kernel version 3.18 and the limits of the FUSE interface, ceph-fuse needs be mounted as root on at least some systems. See issues #9997, #10277, and #10542 for details. Upgrading from v0.80x Firefly (additional notes) ------------------------------------------------ * The client-side caching for librbd is now enabled by default (rbd cache = true). A safety option (rbd cache writethrough until flush = true) is also enabled so that writeback caching is not used until the library observes a 'flush' command, indicating that the librbd users is passing that operation through from the guest VM. This avoids potential data loss when used with older versions of qemu that do not support flush. leveldb_write_buffer_size = 8*1024*1024 = 33554432 // 8MB leveldb_cache_size = 512*1024*1204 = 536870912 // 512MB leveldb_block_size = 64*1024 = 65536 // 64KB leveldb_compression = false leveldb_log = "" OSDs will still maintain the following osd-specific defaults: leveldb_log = "" * The 'rados getxattr ...' command used to add a gratuitous newline to the attr value; it now does not. * The ``*_kb perf`` counters on the monitor have been removed. These are replaced with a new set of ``*_bytes`` counters (e.g., ``cluster_osd_kb`` is replaced by ``cluster_osd_bytes``). * The ``rd_kb`` and ``wr_kb`` fields in the JSON dumps for pool stats (accessed via the ``ceph df detail -f json-pretty`` and related commands) have been replaced with corresponding ``*_bytes`` fields. Similarly, the ``total_space``, ``total_used``, and ``total_avail`` fields are replaced with ``total_bytes``, ``total_used_bytes``, and ``total_avail_bytes`` fields. * The ``rados df --format=json`` output ``read_bytes`` and ``write_bytes`` fields were incorrectly reporting ops; this is now fixed. * The ``rados df --format=json`` output previously included ``read_kb`` and ``write_kb`` fields; these have been removed. Please use ``read_bytes`` and ``write_bytes`` instead (and divide by 1024 if appropriate). * The experimental keyvaluestore-dev OSD backend had an on-disk format change that prevents existing OSD data from being upgraded. This affects developers and testers only. * mon-specific and osd-specific leveldb options have been removed. From this point onward users should use the `leveldb_*` generic options and add the options in the appropriate sections of their configuration files. Monitors will still maintain the following monitor-specific defaults: leveldb_write_buffer_size = 8*1024*1024 = 33554432 // 8MB leveldb_cache_size = 512*1024*1204 = 536870912 // 512MB leveldb_block_size = 64*1024 = 65536 // 64KB leveldb_compression = false leveldb_log = "" OSDs will still maintain the following osd-specific defaults: leveldb_log = "" * CephFS support for the legacy anchor table has finally been removed. Users with file systems created before firefly should ensure that inodes with multiple hard links are modified *prior* to the upgrade to ensure that the backtraces are written properly. For example:: sudo find /mnt/cephfs -type f -links +1 -exec touch \{\} \; * We disallow nonsensical 'tier cache-mode' transitions. From this point onward, 'writeback' can only transition to 'forward' and 'forward' can transition to 1) 'writeback' if there are dirty objects, or 2) any if there are no dirty objects. Notable changes since v0.93 --------------------------- * build: a few cmake fixes (Matt Benjamin) * build: fix build on RHEL/CentOS 5.9 (Rohan Mars) * build: reorganize Makefile to allow modular builds (Boris Ranto) * ceph-fuse: be more forgiving on remount (#10982 Greg Farnum) * ceph: improve CLI parsing (#11093 David Zafman) * common: fix cluster logging to default channel (#11177 Sage Weil) * crush: fix parsing of straw2 buckets (#11015 Sage Weil) * doc: update man pages (David Zafman) * librados: fix leak in C_TwoContexts (Xiong Yiliang) * librados: fix leak in watch/notify path (Sage Weil) * librbd: fix and improve AIO cache invalidation (#10958 Jason Dillaman) * librbd: fix memory leak (Jason Dillaman) * librbd: fix ordering/queueing of resize operations (Jason Dillaman) * librbd: validate image is r/w on resize/flatten (Jason Dillaman) * librbd: various internal locking fixes (Jason Dillaman) * lttng: tracing is disabled until we streamline dependencies (Josh Durgin) * mon: add bootstrap-rgw profile (Sage Weil) * mon: do not pollute mon dir with CSV files from CRUSH check (Loic Dachary) * mon: fix clock drift time check interval (#10546 Joao Eduardo Luis) * mon: fix units in store stats (Joao Eduardo Luis) * mon: improve error handling on erasure code profile set (#10488, #11144 Loic Dachary) * mon: set {read,write}_tier on 'osd tier add-cache ...' (Jianpeng Ma) * ms: xio: fix misc bugs (Matt Benjamin, Vu Pham) * osd: DBObjectMap: fix locking to prevent rare crash (#9891 Samuel Just) * osd: fix and document last_epoch_started semantics (Samuel Just) * osd: fix divergent entry handling on PG split (Samuel Just) * osd: fix leak on shutdown (Kefu Chai) * osd: fix recording of digest on scrub (Samuel Just) * osd: fix whiteout handling (Sage Weil) * rbd: allow v2 striping parameters for clones and imports (Jason Dillaman) * rbd: fix formatted output of image features (Jason Dillaman) * rbd: updat eman page (Ilya Dryomov) * rgw: don't overwrite bucket/object owner when setting ACLs (#10978 Yehuda Sadeh) * rgw: enable IPv6 for civetweb (#10965 Yehuda Sadeh) * rgw: fix sysvinit script when rgw_socket_path is not defined (#11159 Yehuda Sadeh, Dan Mick) * rgw: pass civetweb configurables through (#10907 Yehuda Sadeh) * rgw: use new watch/notify API (Yehuda Sadeh, Sage Weil) * osd: reverted degraded writes feature due to 11155 Notable changes since v0.87.x Giant ----------------------------------- * add experimental features option (Sage Weil) * arch: fix NEON feaeture detection (#10185 Loic Dachary) * asyncmsgr: misc fixes (Haomai Wang) * buffer: add 'shareable' construct (Matt Benjamin) * buffer: add list::get_contiguous (Sage Weil) * buffer: avoid rebuild if buffer already contiguous (Jianpeng Ma) * build: CMake support (Ali Maredia, Casey Bodley, Adam Emerson, Marcus Watts, Matt Benjamin) * build: a few cmake fixes (Matt Benjamin) * build: aarch64 build fixes (Noah Watkins, Haomai Wang) * build: adjust build deps for yasm, virtualenv (Jianpeng Ma) * build: fix 'make check' races (#10384 Loic Dachary) * build: fix build on RHEL/CentOS 5.9 (Rohan Mars) * build: fix pkg names when libkeyutils is missing (Pankag Garg, Ken Dreyer) * build: improve build dependency tooling (Loic Dachary) * build: reorganize Makefile to allow modular builds (Boris Ranto) * build: support for jemalloc (Shishir Gowda) * ceph-disk: Scientific Linux support (Dan van der Ster) * ceph-disk: allow journal partition re-use (#10146 Loic Dachary, Dav van der Ster) * ceph-disk: call partx/partprobe consistency (#9721 Loic Dachary) * ceph-disk: do not re-use partition if encryption is required (Loic Dachary) * ceph-disk: fix dmcrypt key permissions (Loic Dachary) * ceph-disk: fix umount race condition (#10096 Blaine Gardner) * ceph-disk: improved systemd support (Owen Synge) * ceph-disk: init=none option (Loic Dachary) * ceph-disk: misc fixes (Christos Stavrakakis) * ceph-disk: respect --statedir for keyring (Loic Dachary) * ceph-disk: set guid if reusing journal partition (Dan van der Ster) * ceph-disk: support LUKS for encrypted partitions (Andrew Bartlett, Loic Dachary) * ceph-fuse, libcephfs: POSIX file lock support (Yan, Zheng) * ceph-fuse, libcephfs: allow xattr caps in inject_release_failure (#9800 John Spray) * ceph-fuse, libcephfs: fix I_COMPLETE_ORDERED checks (#9894 Yan, Zheng) * ceph-fuse, libcephfs: fix cap flush overflow (Greg Farnum, Yan, Zheng) * ceph-fuse, libcephfs: fix root inode xattrs (Yan, Zheng) * ceph-fuse, libcephfs: preserve dir ordering (#9178 Yan, Zheng) * ceph-fuse, libcephfs: trim inodes before reconnecting to MDS (Yan, Zheng) * ceph-fuse,libcephfs: add support for O_NOFOLLOW and O_PATH (Greg Farnum) * ceph-fuse,libcephfs: resend requests before completing cap reconnect (#10912 Yan, Zheng) * ceph-fuse: be more forgiving on remount (#10982 Greg Farnum) * ceph-fuse: fix dentry invalidation on 3.18+ kernels (#9997 Yan, Zheng) * ceph-fuse: fix kernel cache trimming (#10277 Yan, Zheng) * ceph-fuse: select kernel cache invalidation mechanism based on kernel version (Greg Farnum) * ceph-monstore-tool: fix shutdown (#10093 Loic Dachary) * ceph-monstore-tool: fix/improve CLI (Joao Eduardo Luis) * ceph-objectstore-tool: fix import (#10090 David Zafman) * ceph-objectstore-tool: improved import (David Zafman) * ceph-objectstore-tool: many improvements and tests (David Zafman) * ceph-objectstore-tool: many many improvements (David Zafman) * ceph-objectstore-tool: misc improvements, fixes (#9870 #9871 David Zafman) * ceph.spec: package rbd-replay-prep (Ken Dreyer) * ceph: add 'ceph osd df [tree]' command (#10452 Mykola Golub) * ceph: do not parse injectargs twice (Loic Dachary) * ceph: fix 'ceph tell ...' command validation (#10439 Joao Eduardo Luis) * ceph: improve 'ceph osd tree' output (Mykola Golub) * ceph: improve CLI parsing (#11093 David Zafman) * ceph: make 'ceph -s' output more readable (Sage Weil) * ceph: make 'ceph -s' show PG state counts in sorted order (Sage Weil) * ceph: make 'ceph tell mon.* version' work (Mykola Golub) * ceph: new 'ceph tell mds.$name_or_rank_or_gid' (John Spray) * ceph: show primary-affinity in 'ceph osd tree' (Mykola Golub) * ceph: test robustness (Joao Eduardo Luis) * ceph_objectstore_tool: behave with sharded flag (#9661 David Zafman) * cephfs-journal-tool: add recover_dentries function (#9883 John Spray) * cephfs-journal-tool: fix journal import (#10025 John Spray) * cephfs-journal-tool: skip up to expire_pos (#9977 John Spray) * cleanup rados.h definitions with macros (Ilya Dryomov) * common: add 'perf reset ...' admin command (Jianpeng Ma) * common: add TableFormatter (Andreas Peters) * common: add newline to flushed json output (Sage Weil) * common: check syncfs() return code (Jianpeng Ma) * common: do not unlock rwlock on destruction (Federico Simoncelli) * common: filtering for 'perf dump' (John Spray) * common: fix Formatter factory breakage (#10547 Loic Dachary) * common: fix block device discard check (#10296 Sage Weil) * common: make json-pretty output prettier (Sage Weil) * common: remove broken CEPH_LOCKDEP optoin (Kefu Chai) * common: shared_cache unit tests (Cheng Cheng) * common: support new gperftools header locations (Key Dreyer) * config: add $cctid meta variable (Adam Crume) * crush: fix buffer overrun for poorly formed rules (#9492 Johnu George) * crush: fix detach_bucket (#10095 Sage Weil) * crush: fix parsing of straw2 buckets (#11015 Sage Weil) * crush: fix several bugs in adjust_item_weight (Rongze Zhu) * crush: fix tree bucket behavior (Rongze Zhu) * crush: improve constness (Loic Dachary) * crush: new and improved straw2 bucket type (Sage Weil, Christina Anderson, Xiaoxi Chen) * crush: straw bucket weight calculation fixes (#9998 Sage Weil) * crush: update tries stats for indep rules (#10349 Loic Dachary) * crush: use larger choose_tries value for erasure code rulesets (#10353 Loic Dachary) * crushtool: add --location command (Sage Weil, Loic Dachary) * debian,rpm: move RBD udev rules to ceph-common (#10864 Ken Dreyer) * debian: split python-ceph into python-{rbd,rados,cephfs} (Boris Ranto) * default to libnss instead of crypto++ (Federico Gimenez) * doc: CephFS disaster recovery guidance (John Spray) * doc: CephFS for early adopters (John Spray) * doc: add build-doc guidlines for Fedora and CentOS/RHEL (Nilamdyuti Goswami) * doc: add dumpling to firefly upgrade section (#7679 John Wilkins) * doc: ceph osd reweight vs crush weight (Laurent Guerby) * doc: do not suggest dangerous XFS nobarrier option (Dan van der Ster) * doc: document erasure coded pool operations (#9970 Loic Dachary) * doc: document the LRC per-layer plugin configuration (Yuan Zhou) * doc: enable rbd cache on openstack deployments (Sebastien Han) * doc: erasure code doc updates (Loic Dachary) * doc: file system osd config settings (Kevin Dalley) * doc: fix OpenStack Glance docs (#10478 Sebastien Han) * doc: improved installation nots on CentOS/RHEL installs (John Wilkins) * doc: key/value store config reference (John Wilkins) * doc: misc cleanups (Adam Spiers, Sebastien Han, Nilamdyuti Goswami, Ken Dreyer, John Wilkins) * doc: misc improvements (Nilamdyuti Goswami, John Wilkins, Chris Holcombe) * doc: misc updates (#9793 #9922 #10204 #10203 Travis Rhoden, Hazem, Ayari, Florian Coste, Andy Allan, Frank Yu, Baptiste Veuillez-Mainard, Yuan Zhou, Armando Segnini, Robert Jansen, Tyler Brekke, Viktor Suprun) * doc: misc updates (Alfredo Deza, VRan Liu) * doc: misc updates (Nilamdyuti Goswami, John Wilkins) * doc: new man pages (Nilamdyuti Goswami) * doc: preflight doc fixes (John Wilkins) * doc: replace cloudfiles with swiftclient Python Swift example (Tim Freund) * doc: update PG count guide (Gerben Meijer, Laurent Guerby, Loic Dachary) * doc: update man pages (David Zafman) * doc: update openstack docs for Juno (Sebastien Han) * doc: update release descriptions (Ken Dreyer) * doc: update sepia hardware inventory (Sandon Van Ness) * erasure-code: add mSHEC erasure code support (Takeshi Miyamae) * erasure-code: improved docs (#10340 Loic Dachary) * erasure-code: set max_size to 20 (#10363 Loic Dachary) * fix cluster logging from non-mon daemons (Sage Weil) * init-ceph: check for systemd-run before using it (Boris Ranto) * install-deps.sh: do not require sudo when root (Loic Dachary) * keyvaluestore: misc fixes (Haomai Wang) * keyvaluestore: performance improvements (Haomai Wang) * libcephfs,ceph-fuse: add 'status' asok (John Spray) * libcephfs,ceph-fuse: fix getting zero-length xattr (#10552 Yan, Zheng) * libcephfs: fix dirfrag trimming (#10387 Yan, Zheng) * libcephfs: fix mount timeout (#10041 Yan, Zheng) * libcephfs: fix test (#10415 Yan, Zheng) * libcephfs: fix use-afer-free on umount (#10412 Yan, Zheng) * libcephfs: include ceph and git version in client metadata (Sage Weil) * librados, osd: new watch/notify implementation (Sage Weil) * librados: add blacklist_add convenience method (Jason Dillaman) * librados: add rados_pool_get_base_tier() call (Adam Crume) * librados: add watch_flush() operation (Sage Weil, Haomai Wang) * librados: avoid memcpy on getxattr, read (Jianpeng Ma) * librados: cap buffer length (Loic Dachary) * librados: create ioctx by pool id (Jason Dillaman) * librados: do notify completion in fast-dispatch (Sage Weil) * librados: drop 'category' feature (Sage Weil) * librados: expose rados_{read|write}_op_assert_version in C API (Kim Vandry) * librados: fix infinite loop with skipped map epochs (#9986 Ding Dinghua) * librados: fix iterator operator= bugs (#10082 David Zafman, Yehuda Sadeh) * librados: fix leak in C_TwoContexts (Xiong Yiliang) * librados: fix leak in watch/notify path (Sage Weil) * librados: fix null deref when pool DNE (#9944 Sage Weil) * librados: fix objecter races (#9617 Josh Durgin) * librados: fix pool deletion handling (#10372 Sage Weil) * librados: fix pool name caching (#10458 Radoslaw Zarzynski) * librados: fix resource leak, misc bugs (#10425 Radoslaw Zarzynski) * librados: fix some watch/notify locking (Jason Dillaman, Josh Durgin) * librados: fix timer race from recent refactor (Sage Weil) * librados: new fadvise API (Ma Jianpeng) * librados: only export public API symbols (Jason Dillaman) * librados: remove shadowed variable (Kefu Chain) * librados: translate op flags from C APIs (Matthew Richards) * libradosstriper: fix remove() (Dongmao Zhang) * libradosstriper: fix shutdown hang (Dongmao Zhang) * libradosstriper: fix stat strtoll (Dongmao Zhang) * libradosstriper: fix trunc method (#10129 Sebastien Ponce) * libradosstriper: fix write_full when ENOENT (#10758 Sebastien Ponce) * libradosstriper: misc fixes (Sebastien Ponce) * librbd: CRC protection for RBD image map (Jason Dillaman) * librbd: add missing python docstrings (Jason Dillaman) * librbd: add per-image object map for improved performance (Jason Dillaman) * librbd: add readahead (Adam Crume) * librbd: add support for an "object map" indicating which objects exist (Jason Dillaman) * librbd: adjust internal locking (Josh Durgin, Jason Dillaman) * librbd: better handling of watch errors (Jason Dillaman) * librbd: complete pending ops before closing image (#10299 Josh Durgin) * librbd: coordinate maint operations through lock owner (Jason Dillaman) * librbd: copy-on-read (Min Chen, Li Wang, Yunchuan Wen, Cheng Cheng, Jason Dillaman) * librbd: differentiate between R/O vs R/W features (Jason Dillaman) * librbd: don't close a closed parent in failure path (#10030 Jason Dillaman) * librbd: enforce write ordering with a snapshot (Jason Dillaman) * librbd: exclusive image locking (Jason Dillaman) * librbd: fadvise API (Ma Jianpeng) * librbd: fadvise-style hints; add misc hints for certain operations (Jianpeng Ma) * librbd: fix and improve AIO cache invalidation (#10958 Jason Dillaman) * librbd: fix cache tiers in list_children and snap_unprotect (Adam Crume) * librbd: fix coverity false-positives (Jason Dillaman) * librbd: fix diff test (#10002 Josh Durgin) * librbd: fix list_children from invalid pool ioctxs (#10123 Jason Dillaman) * librbd: fix locking for readahead (#10045 Jason Dillaman) * librbd: fix memory leak (Jason Dillaman) * librbd: fix ordering/queueing of resize operations (Jason Dillaman) * librbd: fix performance regression in ObjectCacher (#9513 Adam Crume) * librbd: fix snap create races (Jason Dillaman) * librbd: fix write vs import race (#10590 Jason Dillaman) * librbd: flush AIO operations asynchronously (#10714 Jason Dillaman) * librbd: gracefully handle deleted/renamed pools (#10270 Jason Dillaman) * librbd: lttng tracepoints (Adam Crume) * librbd: make async versions of long-running maint operations (Jason Dillaman) * librbd: misc fixes (Xinxin Shu, Jason Dillaman) * librbd: mock tests (Jason Dillaman) * librbd: only export public API symbols (Jason Dillaman) * librbd: optionally blacklist clients before breaking locks (#10761 Jason Dillaman) * librbd: prevent copyup during shrink (Jason Dillaman) * librbd: refactor unit tests to use fixtures (Jason Dillaman) * librbd: validate image is r/w on resize/flatten (Jason Dillaman) * librbd: various internal locking fixes (Jason Dillaman) * many coverity fixes (Danny Al-Gaaf) * many many coverity cleanups (Danny Al-Gaaf) * mds: 'flush journal' admin command (John Spray) * mds: ENOSPC and OSDMap epoch barriers (#7317 John Spray) * mds: a whole bunch of initial scrub infrastructure (Greg Farnum) * mds: add cephfs-table-tool (John Spray) * mds: asok command for fetching subtree map (John Spray) * mds: avoid sending traceless replies in most cases (Yan, Zheng) * mds: constify MDSCacheObjects (John Spray) * mds: dirfrag buf fix (Yan, Zheng) * mds: disallow most commands on inactive MDS's (Greg Farnum) * mds: drop dentries, leases on deleted directories (#10164 Yan, Zheng) * mds: export dir asok command (John Spray) * mds: fix MDLog IO callback deadlock (John Spray) * mds: fix compat_version for MClientSession (#9945 John Spray) * mds: fix deadlock during journal probe vs purge (#10229 Yan, Zheng) * mds: fix race trimming log segments (Yan, Zheng) * mds: fix reply snapbl (Yan, Zheng) * mds: fix sessionmap lifecycle bugs (Yan, Zheng) * mds: fix stray/purge perfcounters (#10388 John Spray) * mds: handle heartbeat_reset during shutdown (#10382 John Spray) * mds: handle zero-size xattr (#10335 Yan, Zheng) * mds: initialize root inode xattr version (Yan, Zheng) * mds: introduce auth caps (John Spray) * mds: many many snapshot-related fixes (Yan, Zheng) * mds: misc bugs (Greg Farnum, John Spray, Yan, Zheng, Henry Change) * mds: refactor, improve Session storage (John Spray) * mds: store backtrace for stray dir (Yan, Zheng) * mds: subtree quota support (Yunchuan Wen) * mds: verify backtrace when fetching dirfrag (#9557 Yan, Zheng) * memstore: free space tracking (John Spray) * misc cleanup (Danny Al-Gaaf, David Anderson) * misc coverity fixes (Danny Al-Gaaf) * misc coverity fixes (Danny Al-Gaaf) * misc: various valgrind fixes and cleanups (Danny Al-Gaaf) * mon: 'osd crush reweight-all' command (Sage Weil) * mon: add 'ceph osd rename-bucket ...' command (Loic Dachary) * mon: add bootstrap-rgw profile (Sage Weil) * mon: add max pgs per osd warning (Sage Weil) * mon: add noforward flag for some mon commands (Mykola Golub) * mon: allow adding tiers to fs pools (#10135 John Spray) * mon: allow full flag to be manually cleared (#9323 Sage Weil) * mon: clean up auth list output (Loic Dachary) * mon: delay failure injection (Joao Eduardo Luis) * mon: disallow empty pool names (#10555 Wido den Hollander) * mon: do not deactivate last mds (#10862 John Spray) * mon: do not pollute mon dir with CSV files from CRUSH check (Loic Dachary) * mon: drop old ceph_mon_store_converter (Sage Weil) * mon: fix 'ceph pg dump_stuck degraded' (Xinxin Shu) * mon: fix 'mds fail' for standby MDSs (John Spray) * mon: fix 'osd crush link' id resolution (John Spray) * mon: fix 'profile osd' use of config-key function on mon (#10844 Joao Eduardo Luis) * mon: fix *_ratio* units and types (Sage Weil) * mon: fix JSON dumps to dump floats as flots and not strings (Sage Weil) * mon: fix MDS health status from peons (#10151 John Spray) * mon: fix caching for min_last_epoch_clean (#9987 Sage Weil) * mon: fix clock drift time check interval (#10546 Joao Eduardo Luis) * mon: fix compatset initalization during mkfs (Joao Eduardo Luis) * mon: fix error output for add_data_pool (#9852 Joao Eduardo Luis) * mon: fix feature tracking during elections (Joao Eduardo Luis) * mon: fix formatter 'pg stat' command output (Sage Weil) * mon: fix mds gid/rank/state parsing (John Spray) * mon: fix misc error paths (Joao Eduardo Luis) * mon: fix paxos off-by-one corner case (#9301 Sage Weil) * mon: fix paxos timeouts (#10220 Joao Eduardo Luis) * mon: fix stashed monmap encoding (#5203 Xie Rui) * mon: fix units in store stats (Joao Eduardo Luis) * mon: get canonical OSDMap from leader (#10422 Sage Weil) * mon: ignore failure reports from before up_from (#10762 Dan van der Ster, Sage Weil) * mon: implement 'fs reset' command (John Spray) * mon: improve error handling on erasure code profile set (#10488, #11144 Loic Dachary) * mon: improved corrupt CRUSH map detection (Joao Eduardo Luis) * mon: include entity name in audit log for forwarded requests (#9913 Joao Eduardo Luis) * mon: include pg_temp count in osdmap summary (Sage Weil) * mon: log health summary to cluster log (#9440 Joao Eduardo Luis) * mon: make 'mds fail' idempotent (John Spray) * mon: make pg dump {sum,pgs,pgs_brief} work for format=plain (#5963 #6759 Mykola Golub) * mon: new 'ceph pool ls [detail]' command (Sage Weil) * mon: new pool safety flags nodelete, nopgchange, nosizechange (#9792 Mykola Golub) * mon: new, friendly 'ceph pg ls ...' command (Xinxin Shu) * mon: paxos: allow reads while proposing (#9321 #9322 Joao Eduardo Luis) * mon: prevent MDS transition from STOPPING (#10791 Greg Farnum) * mon: propose all pending work in one transaction (Sage Weil) * mon: remove pg_temps for nonexistent pools (Joao Eduardo Luis) * mon: require mon_allow_pool_delete option to remove pools (Sage Weil) * mon: respect down flag when promoting standbys (John Spray) * mon: set globalid prealloc to larger value (Sage Weil) * mon: set {read,write}_tier on 'osd tier add-cache ...' (Jianpeng Ma) * mon: skip zeroed osd stats in get_rule_avail (#10257 Joao Eduardo Luis) * mon: validate min_size range (Jianpeng Ma) * mon: wait for writeable before cross-proposing (#9794 Joao Eduardo Luis) * mount.ceph: fix suprious error message (#10351 Yan, Zheng) * ms: xio: fix misc bugs (Matt Benjamin, Vu Pham) * msgr: async: bind threads to CPU cores, improved poll (Haomai Wang) * msgr: async: many fixes, unit tests (Haomai Wang) * msgr: async: several fixes (Haomai Wang) * msgr: asyncmessenger: add kqueue support (#9926 Haomai Wang) * msgr: avoid useless new/delete (Haomai Wang) * msgr: fix RESETSESSION bug (#10080 Greg Farnum) * msgr: fix crc configuration (Mykola Golub) * msgr: fix delay injection bug (#9910 Sage Weil, Greg Farnum) * msgr: misc unit tests (Haomai Wang) * msgr: new AsymcMessenger alternative implementation (Haomai Wang) * msgr: prefetch data when doing recv (Yehuda Sadeh) * msgr: simple: fix rare deadlock (Greg Farnum) * msgr: simple: retry binding to port on failure (#10029 Wido den Hollander) * msgr: xio: XioMessenger RDMA support (Casey Bodley, Vu Pham, Matt Benjamin) * objectstore: deprecate collection attrs (Sage Weil) * osd, librados: fadvise-style librados hints (Jianpeng Ma) * osd, librados: fix xattr_cmp_u64 (Dongmao Zhang) * osd, librados: revamp PG listing API to handle namespaces (#9031 #9262 #9438 David Zafman) * osd, mds: 'ops' as shorthand for 'dump_ops_in_flight' on asok (Sage Weil) * osd, mon: add checksums to all OSDMaps (Sage Weil) * osd, mon: send intiial pg create time from mon to osd (#9887 David Zafman) * osd,mon: add 'norebalance' flag (Kefu Chai) * osd,mon: specify OSD features explicitly in MOSDBoot (#10911 Sage Weil) * osd: DBObjectMap: fix locking to prevent rare crash (#9891 Samuel Just) * osd: EIO on whole-object reads when checksum is wrong (Sage Weil) * osd: add erasure code corpus (Loic Dachary) * osd: add fadvise flags to ObjectStore API (Jianpeng Ma) * osd: add get_latest_osdmap asok command (#9483 #9484 Mykola Golub) * osd: add misc tests (Loic Dachary, Danny Al-Gaaf) * osd: add option to prioritize heartbeat network traffic (Jian Wen) * osd: add support for the SHEC erasure-code algorithm (Takeshi Miyamae, Loic Dachary) * osd: allow deletion of objects with watcher (#2339 Sage Weil) * osd: allow recovery while below min_size (Samuel Just) * osd: allow recovery with fewer than min_size OSDs (Samuel Just) * osd: allow sparse read for Push/Pull (Haomai Wang) * osd: allow whiteout deletion in cache pool (Sage Weil) * osd: allow writes to degraded objects (Samuel Just) * osd: allow writes to degraded objects (Samuel Just) * osd: avoid publishing unchanged PG stats (Sage Weil) * osd: batch pg log trim (Xinze Chi) * osd: cache pool: ignore min flush age when cache is full (Xinze Chi) * osd: cache recent ObjectContexts (Dong Yuan) * osd: cache reverse_nibbles hash value (Dong Yuan) * osd: clean up internal ObjectStore interface (Sage Weil) * osd: cleanup boost optionals (William Kennington) * osd: clear cache on interval change (Samuel Just) * osd: do no proxy reads unless target OSDs are new (#10788 Sage Weil) * osd: do not abort deep scrub on missing hinfo (#10018 Loic Dachary) * osd: do not update digest on inconsistent object (#10524 Samuel Just) * osd: don't record digests for snapdirs (#10536 Samuel Just) * osd: drop upgrade support for pre-dumpling (Sage Weil) * osd: enable and use posix_fadvise (Sage Weil) * osd: erasure coding: allow bench.sh to test ISA backend (Yuan Zhou) * osd: erasure-code: encoding regression tests, corpus (#9420 Loic Dachary) * osd: erasure-code: enforce chunk size alignment (#10211 Loic Dachary) * osd: erasure-code: jerasure support for NEON (Loic Dachary) * osd: erasure-code: relax cauchy w restrictions (#10325 David Zhang, Loic Dachary) * osd: erasure-code: update gf-complete to latest upstream (Loic Dachary) * osd: expose non-journal backends via ceph-osd CLI (Hoamai Wang) * osd: filejournal: don't cache journal when not using direct IO (Jianpeng Ma) * osd: fix JSON output for stray OSDs (Loic Dachary) * osd: fix OSDCap parser on old (el6) boost::spirit (#10757 Kefu Chai) * osd: fix OSDCap parsing on el6 (#10757 Kefu Chai) * osd: fix ObjectStore::Transaction encoding version (#10734 Samuel Just) * osd: fix WBTHrottle perf counters (Haomai Wang) * osd: fix and document last_epoch_started semantics (Samuel Just) * osd: fix auth object selection during repair (#10524 Samuel Just) * osd: fix backfill bug (#10150 Samuel Just) * osd: fix bug in pending digest updates (#10840 Samuel Just) * osd: fix cancel_proxy_read_ops (Sage Weil) * osd: fix cleanup of interrupted pg deletion (#10617 Sage Weil) * osd: fix divergent entry handling on PG split (Samuel Just) * osd: fix ghobject_t formatted output to include shard (#10063 Loic Dachary) * osd: fix ioprio option (Mykola Golub) * osd: fix ioprio options (Loic Dachary) * osd: fix journal shutdown race (Sage Weil) * osd: fix journal wrapping bug (#10883 David Zafman) * osd: fix leak in SnapTrimWQ (#10421 Kefu Chai) * osd: fix leak on shutdown (Kefu Chai) * osd: fix memstore free space calculation (Xiaoxi Chen) * osd: fix mixed-version peering issues (Samuel Just) * osd: fix object age eviction (Zhiqiang Wang) * osd: fix object atime calculation (Xinze Chi) * osd: fix object digest update bug (#10840 Samuel Just) * osd: fix occasional peering stalls (#10431 Sage Weil) * osd: fix ordering issue with new transaction encoding (#10534 Dong Yuan) * osd: fix osd peer check on scrub messages (#9555 Sage Weil) * osd: fix past_interval display bug (#9752 Loic Dachary) * osd: fix past_interval generation (#10427 #10430 David Zafman) * osd: fix pgls filter ops (#9439 David Zafman) * osd: fix recording of digest on scrub (Samuel Just) * osd: fix scrub delay bug (#10693 Samuel Just) * osd: fix scrub vs try-flush bug (#8011 Samuel Just) * osd: fix short read handling on push (#8121 David Zafman) * osd: fix stderr with -f or -d (Dan Mick) * osd: fix transaction accounting (Jianpeng Ma) * osd: fix watch reconnect race (#10441 Sage Weil) * osd: fix watch timeout cache state update (#10784 David Zafman) * osd: fix whiteout handling (Sage Weil) * osd: flush snapshots from cache tier immediately (Sage Weil) * osd: force promotion of watch/notify ops (Zhiqiang Wang) * osd: handle no-op write with snapshot (#10262 Sage Weil) * osd: improve idempotency detection across cache promotion/demotion (#8935 Sage Weil, Samuel Just) * osd: include activating peers in blocked_by (#10477 Sage Weil) * osd: jerasure and gf-complete updates from upstream (#10216 Loic Dachary) * osd: journal: check fsync/fdatasync result (Jianpeng Ma) * osd: journal: fix alignment checks, avoid useless memmove (Jianpeng Ma) * osd: journal: fix hang on shutdown (#10474 David Zafman) * osd: journal: fix header.committed_up_to (Xinze Chi) * osd: journal: fix journal zeroing when direct IO is enabled (Xie Rui) * osd: journal: initialize throttle (Ning Yao) * osd: journal: misc bug fixes (#6003 David Zafman, Samuel Just) * osd: journal: update committed_thru after replay (#6756 Samuel Just) * osd: keyvaluestore: cleanup dead code (Ning Yao) * osd: keyvaluestore: fix getattr semantics (Haomai Wang) * osd: keyvaluestore: fix key ordering (#10119 Haomai Wang) * osd: keyvaluestore_dev: optimization (Chendi Xue) * osd: limit in-flight read requests (Jason Dillaman) * osd: log when scrub or repair starts (Loic Dachary) * osd: make misdirected op checks robust for EC pools (#9835 Sage Weil) * osd: memstore: fix size limit (Xiaoxi Chen) * osd: misc FIEMAP fixes (Ma Jianpeng) * osd: misc cleanup (Xinze Chi, Yongyue Sun) * osd: misc optimizations (Xinxin Shu, Zhiqiang Wang, Xinze Chi) * osd: misc scrub fixes (#10017 Loic Dachary) * osd: new 'activating' state between peering and active (Sage Weil) * osd: new optimized encoding for ObjectStore::Transaction (Dong Yuan) * osd: optimize Finisher (Xinze Chi) * osd: optimize WBThrottle map with unordered_map (Ning Yao) * osd: optimize filter_snapc (Ning Yao) * osd: preserve reqids for idempotency checks for promote/demote (Sage Weil, Zhiqiang Wang, Samuel Just) * osd: proxy read support (Zhiqiang Wang) * osd: proxy reads during cache promote (Zhiqiang Wang) * osd: remove dead locking code (Xinxin Shu) * osd: remove legacy classic scrub code (Sage Weil) * osd: remove unused fields in MOSDSubOp (Xiaoxi Chen) * osd: removed some dead code (Xinze Chi) * osd: replace MOSDSubOp messages with simpler, optimized MOSDRepOp (Xiaoxi Chen) * osd: restrict scrub to certain times of day (Xinze Chi) * osd: rocksdb: fix shutdown (Hoamai Wang) * osd: store PG metadata in per-collection objects for better concurrency (Sage Weil) * osd: store whole-object checksums on scrub, write_full (Sage Weil) * osd: support for discard for journal trim (Jianpeng Ma) * osd: use FIEMAP_FLAGS_SYNC instead of fsync (Jianpeng Ma) * osd: verify kernel is new enough before using XFS extsize ioctl, enable by default (#9956 Sage Weil) * pybind: fix memory leak in librados bindings (Billy Olsen) * pyrados: add object lock support (#6114 Mehdi Abaakouk) * pyrados: fix misnamed wait_* routings (#10104 Dan Mick) * pyrados: misc cleanups (Kefu Chai) * qa: add large auth ticket tests (Ilya Dryomov) * qa: fix mds tests (#10539 John Spray) * qa: fix osd create dup tests (#10083 Loic Dachary) * qa: ignore duplicates in rados ls (Josh Durgin) * qa: improve hadoop tests (Noah Watkins) * qa: many 'make check' improvements (Loic Dachary) * qa: misc tests (Loic Dachary, Yan, Zheng) * qa: parallelize make check (Loic Dachary) * qa: reorg fs quota tests (Greg Farnum) * qa: tolerate nearly-full disk for make check (Loic Dachary) * rados: fix put of /dev/null (Loic Dachary) * rados: fix usage (Jianpeng Ma) * rados: parse command-line arguments more strictly (#8983 Adam Crume) * rados: use copy-from operation for copy, cppool (Sage Weil) * radosgw-admin: add replicalog update command (Yehuda Sadeh) * rbd-fuse: clean up on shutdown (Josh Durgin) * rbd-fuse: fix memory leak (Adam Crume) * rbd-replay-many (Adam Crume) * rbd-replay: --anonymize flag to rbd-replay-prep (Adam Crume) * rbd: add 'merge-diff' function (MingXin Liu, Yunchuan Wen, Li Wang) * rbd: allow v2 striping parameters for clones and imports (Jason Dillaman) * rbd: fix 'rbd diff' for non-existent objects (Adam Crume) * rbd: fix buffer handling on image import (#10590 Jason Dillaman) * rbd: fix error when striping with format 1 (Sebastien Han) * rbd: fix export for image sizes over 2GB (Vicente Cheng) * rbd: fix formatted output of image features (Jason Dillaman) * rbd: leave exclusive lockin goff by default (Jason Dillaman) * rbd: updat eman page (Ilya Dryomov) * rbd: update init-rbdmap to fix dup mount point (Karel Striegel) * rbd: use IO hints for import, export, and bench operations (#10462 Jason Dillaman) * rbd: use rolling average for rbd bench-write throughput (Jason Dillaman) * rbd_recover_tool: RBD image recovery tool (Min Chen) * rgw: S3-style object versioning support (Yehuda Sadeh) * rgw: add location header when object is in another region (VRan Liu) * rgw: change multipart upload id magic (#10271 Yehuda Sadeh) * rgw: check keystone auth for S3 POST requests (#10062 Abhishek Lekshmanan) * rgw: check timestamp on s3 keystone auth (#10062 Abhishek Lekshmanan) * rgw: conditional PUT on ETag (#8562 Ray Lv) * rgw: create subuser if needed when creating user (#10103 Yehuda Sadeh) * rgw: decode http query params correction (#10271 Yehuda Sadeh) * rgw: don't overwrite bucket/object owner when setting ACLs (#10978 Yehuda Sadeh) * rgw: enable IPv6 for civetweb (#10965 Yehuda Sadeh) * rgw: extend replica log API (purge-all) (Yehuda Sadeh) * rgw: fail S3 POST if keystone not configured (#10688 Valery Tschopp, Yehuda Sadeh) * rgw: fix If-Modified-Since (VRan Liu) * rgw: fix XML header on get ACL request (#10106 Yehuda Sadeh) * rgw: fix bucket removal with data purge (Yehuda Sadeh) * rgw: fix content length check (#10701 Axel Dunkel, Yehuda Sadeh) * rgw: fix content-length update (#9576 Yehuda Sadeh) * rgw: fix disabling of max_size quota (#9907 Dong Lei) * rgw: fix error codes (#10334 #10329 Yehuda Sadeh) * rgw: fix incorrect len when len is 0 (#9877 Yehuda Sadeh) * rgw: fix object copy content type (#9478 Yehuda Sadeh) * rgw: fix partial GET in swift (#10553 Yehuda Sadeh) * rgw: fix replica log indexing (#8251 Yehuda Sadeh) * rgw: fix shutdown (#10472 Yehuda Sadeh) * rgw: fix swift metadata header name (Dmytro Iurchenko) * rgw: fix sysvinit script when rgw_socket_path is not defined (#11159 Yehuda Sadeh, Dan Mick) * rgw: fix user stags in get-user-info API (#9359 Ray Lv) * rgw: include XML ns on get ACL request (#10106 Yehuda Sadeh) * rgw: index swift keys appropriately (#10471 Yehuda Sadeh) * rgw: make sysvinit script set ulimit -n properly (Sage Weil) * rgw: misc fixes (#10307 Yehuda Sadeh) * rgw: only track cleanup for objects we write (#10311 Yehuda Sadeh) * rgw: pass civetweb configurables through (#10907 Yehuda Sadeh) * rgw: prevent illegal bucket policy that doesn't match placement rule (Yehuda Sadeh) * rgw: remove multipart entries from bucket index on abort (#10719 Yehuda Sadeh) * rgw: remove swift user manifest (DLO) hash calculation (#9973 Yehuda Sadeh) * rgw: respond with 204 to POST on containers (#10667 Yuan Zhou) * rgw: return timestamp on GET/HEAD (#8911 Yehuda Sadeh) * rgw: reuse fcgx connection struct (#10194 Yehuda Sadeh) * rgw: run radosgw as apache with systemd (#10125 Loic Dachary) * rgw: send explicit HTTP status string (Yehuda Sadeh) * rgw: set ETag on object copy (#9479 Yehuda Sadeh) * rgw: set length for keystone token validation request (#7796 Yehuda Sadeh, Mark Kirkwood) * rgw: support X-Storage-Policy header for Swift storage policy compat (Yehuda Sadeh) * rgw: support multiple host names (#7467 Yehuda Sadeh) * rgw: swift: dump container's custom metadata (#10665 Ahmad Faheem, Dmytro Iurchenko) * rgw: swift: support Accept header for response format (#10746 Dmytro Iurchenko) * rgw: swift: support for X-Remove-Container-Meta-{key} (#10475 Dmytro Iurchenko) * rgw: tweak error codes (#10329 #10334 Yehuda Sadeh) * rgw: update bucket index on attr changes, for multi-site sync (#5595 Yehuda Sadeh) * rgw: use \r\n for http headers (#9254 Yehuda Sadeh) * rgw: use gc for multipart abort (#10445 Aaron Bassett, Yehuda Sadeh) * rgw: use new watch/notify API (Yehuda Sadeh, Sage Weil) * rpm: misc fixes (Key Dreyer) * rpm: move rgw logrotate to radosgw subpackage (Ken Dreyer) * systemd: better systemd unit files (Owen Synge) * sysvinit: fix race in 'stop' (#10389 Loic Dachary) * test: fix bufferlist tests (Jianpeng Ma) * tests: ability to run unit tests under docker (Loic Dachary) * tests: centos-6 dockerfile (#10755 Loic Dachary) * tests: improve docker-based tests (Loic Dachary) * tests: unit tests for shared_cache (Dong Yuan) * udev: fix rules for CentOS7/RHEL7 (Loic Dachary) * use clock_gettime instead of gettimeofday (Jianpeng Ma) * vstart.sh: set up environment for s3-tests (Luis Pabon) * vstart.sh: work with cmake (Yehuda Sadeh) v0.93 ===== This is the first release candidate for Hammer, and includes all of the features that will be present in the final release. We welcome and encourage any and all testing in non-production clusters to identify any problems with functionality, stability, or performance before the final Hammer release. We suggest some caution in one area: librbd. There is a lot of new functionality around object maps and locking that is disabled by default but may still affect stability for existing images. We are continuing to shake out those bugs so that the final Hammer release (probably v0.94) will be rock solid. Major features since Giant include: * cephfs: journal scavenger repair tool (John Spray) * crush: new and improved straw2 bucket type (Sage Weil, Christina Anderson, Xiaoxi Chen) * doc: improved guidance for CephFS early adopters (John Spray) * librbd: add per-image object map for improved performance (Jason Dillaman) * librbd: copy-on-read (Min Chen, Li Wang, Yunchuan Wen, Cheng Cheng) * librados: fadvise-style IO hints (Jianpeng Ma) * mds: many many snapshot-related fixes (Yan, Zheng) * mon: new 'ceph osd df' command (Mykola Golub) * mon: new 'ceph pg ls ...' command (Xinxin Shu) * osd: improved performance for high-performance backends * osd: improved recovery behavior (Samuel Just) * osd: improved cache tier behavior with reads (Zhiqiang Wang) * rgw: S3-compatible bucket versioning support (Yehuda Sadeh) * rgw: large bucket index sharding (Guang Yang, Yehuda Sadeh) * RDMA "xio" messenger support (Matt Benjamin, Vu Pham) Upgrading --------- * If you are upgrading from v0.92, you must stop all OSD daemons and flush their journals (``ceph-osd -i NNN --flush-journal``) before upgrading. There was a transaction encoding bug in v0.92 that broke compatibility. Upgrading from v0.91 or anything earlier is safe. * No special restrictions when upgrading from firefly or giant. Notable Changes --------------- * build: CMake support (Ali Maredia, Casey Bodley, Adam Emerson, Marcus Watts, Matt Benjamin) * ceph-disk: do not re-use partition if encryption is required (Loic Dachary) * ceph-disk: support LUKS for encrypted partitions (Andrew Bartlett, Loic Dachary) * ceph-fuse,libcephfs: add support for O_NOFOLLOW and O_PATH (Greg Farnum) * ceph-fuse,libcephfs: resend requests before completing cap reconnect (#10912 Yan, Zheng) * ceph-fuse: select kernel cache invalidation mechanism based on kernel version (Greg Farnum) * ceph-objectstore-tool: improved import (David Zafman) * ceph-objectstore-tool: misc improvements, fixes (#9870 #9871 David Zafman) * ceph: add 'ceph osd df [tree]' command (#10452 Mykola Golub) * ceph: fix 'ceph tell ...' command validation (#10439 Joao Eduardo Luis) * ceph: improve 'ceph osd tree' output (Mykola Golub) * cephfs-journal-tool: add recover_dentries function (#9883 John Spray) * common: add newline to flushed json output (Sage Weil) * common: filtering for 'perf dump' (John Spray) * common: fix Formatter factory breakage (#10547 Loic Dachary) * common: make json-pretty output prettier (Sage Weil) * crush: new and improved straw2 bucket type (Sage Weil, Christina Anderson, Xiaoxi Chen) * crush: update tries stats for indep rules (#10349 Loic Dachary) * crush: use larger choose_tries value for erasure code rulesets (#10353 Loic Dachary) * debian,rpm: move RBD udev rules to ceph-common (#10864 Ken Dreyer) * debian: split python-ceph into python-{rbd,rados,cephfs} (Boris Ranto) * doc: CephFS disaster recovery guidance (John Spray) * doc: CephFS for early adopters (John Spray) * doc: fix OpenStack Glance docs (#10478 Sebastien Han) * doc: misc updates (#9793 #9922 #10204 #10203 Travis Rhoden, Hazem, Ayari, Florian Coste, Andy Allan, Frank Yu, Baptiste Veuillez-Mainard, Yuan Zhou, Armando Segnini, Robert Jansen, Tyler Brekke, Viktor Suprun) * doc: replace cloudfiles with swiftclient Python Swift example (Tim Freund) * erasure-code: add mSHEC erasure code support (Takeshi Miyamae) * erasure-code: improved docs (#10340 Loic Dachary) * erasure-code: set max_size to 20 (#10363 Loic Dachary) * libcephfs,ceph-fuse: fix getting zero-length xattr (#10552 Yan, Zheng) * librados: add blacklist_add convenience method (Jason Dillaman) * librados: expose rados_{read|write}_op_assert_version in C API (Kim Vandry) * librados: fix pool name caching (#10458 Radoslaw Zarzynski) * librados: fix resource leak, misc bugs (#10425 Radoslaw Zarzynski) * librados: fix some watch/notify locking (Jason Dillaman, Josh Durgin) * libradosstriper: fix write_full when ENOENT (#10758 Sebastien Ponce) * librbd: CRC protection for RBD image map (Jason Dillaman) * librbd: add per-image object map for improved performance (Jason Dillaman) * librbd: add support for an "object map" indicating which objects exist (Jason Dillaman) * librbd: adjust internal locking (Josh Durgin, Jason Dillaman) * librbd: better handling of watch errors (Jason Dillaman) * librbd: coordinate maint operations through lock owner (Jason Dillaman) * librbd: copy-on-read (Min Chen, Li Wang, Yunchuan Wen, Cheng Cheng, Jason Dillaman) * librbd: enforce write ordering with a snapshot (Jason Dillaman) * librbd: fadvise-style hints; add misc hints for certain operations (Jianpeng Ma) * librbd: fix coverity false-positives (Jason Dillaman) * librbd: fix snap create races (Jason Dillaman) * librbd: flush AIO operations asynchronously (#10714 Jason Dillaman) * librbd: make async versions of long-running maint operations (Jason Dillaman) * librbd: mock tests (Jason Dillaman) * librbd: optionally blacklist clients before breaking locks (#10761 Jason Dillaman) * librbd: prevent copyup during shrink (Jason Dillaman) * mds: add cephfs-table-tool (John Spray) * mds: avoid sending traceless replies in most cases (Yan, Zheng) * mds: export dir asok command (John Spray) * mds: fix stray/purge perfcounters (#10388 John Spray) * mds: handle heartbeat_reset during shutdown (#10382 John Spray) * mds: many many snapshot-related fixes (Yan, Zheng) * mds: refactor, improve Session storage (John Spray) * misc coverity fixes (Danny Al-Gaaf) * mon: add noforward flag for some mon commands (Mykola Golub) * mon: disallow empty pool names (#10555 Wido den Hollander) * mon: do not deactivate last mds (#10862 John Spray) * mon: drop old ceph_mon_store_converter (Sage Weil) * mon: fix 'ceph pg dump_stuck degraded' (Xinxin Shu) * mon: fix 'profile osd' use of config-key function on mon (#10844 Joao Eduardo Luis) * mon: fix compatset initalization during mkfs (Joao Eduardo Luis) * mon: fix feature tracking during elections (Joao Eduardo Luis) * mon: fix mds gid/rank/state parsing (John Spray) * mon: ignore failure reports from before up_from (#10762 Dan van der Ster, Sage Weil) * mon: improved corrupt CRUSH map detection (Joao Eduardo Luis) * mon: include pg_temp count in osdmap summary (Sage Weil) * mon: log health summary to cluster log (#9440 Joao Eduardo Luis) * mon: make 'mds fail' idempotent (John Spray) * mon: make pg dump {sum,pgs,pgs_brief} work for format=plain (#5963 #6759 Mykola Golub) * mon: new pool safety flags nodelete, nopgchange, nosizechange (#9792 Mykola Golub) * mon: new, friendly 'ceph pg ls ...' command (Xinxin Shu) * mon: prevent MDS transition from STOPPING (#10791 Greg Farnum) * mon: propose all pending work in one transaction (Sage Weil) * mon: remove pg_temps for nonexistent pools (Joao Eduardo Luis) * mon: require mon_allow_pool_delete option to remove pools (Sage Weil) * mon: set globalid prealloc to larger value (Sage Weil) * mon: skip zeroed osd stats in get_rule_avail (#10257 Joao Eduardo Luis) * mon: validate min_size range (Jianpeng Ma) * msgr: async: bind threads to CPU cores, improved poll (Haomai Wang) * msgr: fix crc configuration (Mykola Golub) * msgr: misc unit tests (Haomai Wang) * msgr: xio: XioMessenger RDMA support (Casey Bodley, Vu Pham, Matt Benjamin) * osd, librados: fadvise-style librados hints (Jianpeng Ma) * osd, librados: fix xattr_cmp_u64 (Dongmao Zhang) * osd,mon: add 'norebalance' flag (Kefu Chai) * osd,mon: specify OSD features explicitly in MOSDBoot (#10911 Sage Weil) * osd: add option to prioritize heartbeat network traffic (Jian Wen) * osd: add support for the SHEC erasure-code algorithm (Takeshi Miyamae, Loic Dachary) * osd: allow recovery while below min_size (Samuel Just) * osd: allow recovery with fewer than min_size OSDs (Samuel Just) * osd: allow writes to degraded objects (Samuel Just) * osd: allow writes to degraded objects (Samuel Just) * osd: avoid publishing unchanged PG stats (Sage Weil) * osd: cache recent ObjectContexts (Dong Yuan) * osd: clear cache on interval change (Samuel Just) * osd: do no proxy reads unless target OSDs are new (#10788 Sage Weil) * osd: do not update digest on inconsistent object (#10524 Samuel Just) * osd: don't record digests for snapdirs (#10536 Samuel Just) * osd: fix OSDCap parser on old (el6) boost::spirit (#10757 Kefu Chai) * osd: fix OSDCap parsing on el6 (#10757 Kefu Chai) * osd: fix ObjectStore::Transaction encoding version (#10734 Samuel Just) * osd: fix auth object selection during repair (#10524 Samuel Just) * osd: fix bug in pending digest updates (#10840 Samuel Just) * osd: fix cancel_proxy_read_ops (Sage Weil) * osd: fix cleanup of interrupted pg deletion (#10617 Sage Weil) * osd: fix journal wrapping bug (#10883 David Zafman) * osd: fix leak in SnapTrimWQ (#10421 Kefu Chai) * osd: fix memstore free space calculation (Xiaoxi Chen) * osd: fix mixed-version peering issues (Samuel Just) * osd: fix object digest update bug (#10840 Samuel Just) * osd: fix ordering issue with new transaction encoding (#10534 Dong Yuan) * osd: fix past_interval generation (#10427 #10430 David Zafman) * osd: fix short read handling on push (#8121 David Zafman) * osd: fix watch timeout cache state update (#10784 David Zafman) * osd: force promotion of watch/notify ops (Zhiqiang Wang) * osd: improve idempotency detection across cache promotion/demotion (#8935 Sage Weil, Samuel Just) * osd: include activating peers in blocked_by (#10477 Sage Weil) * osd: jerasure and gf-complete updates from upstream (#10216 Loic Dachary) * osd: journal: check fsync/fdatasync result (Jianpeng Ma) * osd: journal: fix hang on shutdown (#10474 David Zafman) * osd: journal: fix header.committed_up_to (Xinze Chi) * osd: journal: initialize throttle (Ning Yao) * osd: journal: misc bug fixes (#6003 David Zafman, Samuel Just) * osd: misc cleanup (Xinze Chi, Yongyue Sun) * osd: new 'activating' state between peering and active (Sage Weil) * osd: preserve reqids for idempotency checks for promote/demote (Sage Weil, Zhiqiang Wang, Samuel Just) * osd: remove dead locking code (Xinxin Shu) * osd: restrict scrub to certain times of day (Xinze Chi) * osd: rocksdb: fix shutdown (Hoamai Wang) * pybind: fix memory leak in librados bindings (Billy Olsen) * qa: fix mds tests (#10539 John Spray) * qa: ignore duplicates in rados ls (Josh Durgin) * qa: improve hadoop tests (Noah Watkins) * qa: reorg fs quota tests (Greg Farnum) * rados: fix usage (Jianpeng Ma) * radosgw-admin: add replicalog update command (Yehuda Sadeh) * rbd-fuse: clean up on shutdown (Josh Durgin) * rbd: add 'merge-diff' function (MingXin Liu, Yunchuan Wen, Li Wang) * rbd: fix buffer handling on image import (#10590 Jason Dillaman) * rbd: leave exclusive lockin goff by default (Jason Dillaman) * rbd: update init-rbdmap to fix dup mount point (Karel Striegel) * rbd: use IO hints for import, export, and bench operations (#10462 Jason Dillaman) * rbd_recover_tool: RBD image recovery tool (Min Chen) * rgw: S3-style object versioning support (Yehuda Sadeh) * rgw: check keystone auth for S3 POST requests (#10062 Abhishek Lekshmanan) * rgw: extend replica log API (purge-all) (Yehuda Sadeh) * rgw: fail S3 POST if keystone not configured (#10688 Valery Tschopp, Yehuda Sadeh) * rgw: fix XML header on get ACL request (#10106 Yehuda Sadeh) * rgw: fix bucket removal with data purge (Yehuda Sadeh) * rgw: fix replica log indexing (#8251 Yehuda Sadeh) * rgw: fix swift metadata header name (Dmytro Iurchenko) * rgw: remove multipart entries from bucket index on abort (#10719 Yehuda Sadeh) * rgw: respond with 204 to POST on containers (#10667 Yuan Zhou) * rgw: reuse fcgx connection struct (#10194 Yehuda Sadeh) * rgw: support multiple host names (#7467 Yehuda Sadeh) * rgw: swift: dump container's custom metadata (#10665 Ahmad Faheem, Dmytro Iurchenko) * rgw: swift: support Accept header for response format (#10746 Dmytro Iurchenko) * rgw: swift: support for X-Remove-Container-Meta-{key} (#10475 Dmytro Iurchenko) * rpm: move rgw logrotate to radosgw subpackage (Ken Dreyer) * tests: centos-6 dockerfile (#10755 Loic Dachary) * tests: unit tests for shared_cache (Dong Yuan) * vstart.sh: work with cmake (Yehuda Sadeh) v0.92 ===== This is the second-to-last chunk of new stuff before Hammer. Big items include additional checksums on OSD objects, proxied reads in the cache tier, image locking in RBD, optimized OSD Transaction and replication messages, and a big pile of RGW and MDS bug fixes. Upgrading --------- * The experimental 'keyvaluestore-dev' OSD backend has been renamed 'keyvaluestore' (for simplicity) and marked as experimental. To enable this untested feature and acknowledge that you understand that it is untested and may destroy data, you need to add the following to your ceph.conf:: enable experimental unrecoverable data corrupting featuers = keyvaluestore * The following librados C API function calls take a 'flags' argument whose value is now correctly interpreted: rados_write_op_operate() rados_aio_write_op_operate() rados_read_op_operate() rados_aio_read_op_operate() The flags were not correctly being translated from the librados constants to the internal values. Now they are. Any code that is passing flags to these methods should be audited to ensure that they are using the correct LIBRADOS_OP_FLAG_* constants. * The 'rados' CLI 'copy' and 'cppool' commands now use the copy-from operation, which means the latest CLI cannot run these commands against pre-firefly OSDs. * The librados watch/notify API now includes a watch_flush() operation to flush the async queue of notify operations. This should be called by any watch/notify user prior to rados_shutdown(). Notable Changes --------------- * add experimental features option (Sage Weil) * build: fix 'make check' races (#10384 Loic Dachary) * build: fix pkg names when libkeyutils is missing (Pankag Garg, Ken Dreyer) * ceph: make 'ceph -s' show PG state counts in sorted order (Sage Weil) * ceph: make 'ceph tell mon.* version' work (Mykola Golub) * ceph-monstore-tool: fix/improve CLI (Joao Eduardo Luis) * ceph: show primary-affinity in 'ceph osd tree' (Mykola Golub) * common: add TableFormatter (Andreas Peters) * common: check syncfs() return code (Jianpeng Ma) * doc: do not suggest dangerous XFS nobarrier option (Dan van der Ster) * doc: misc updates (Nilamdyuti Goswami, John Wilkins) * install-deps.sh: do not require sudo when root (Loic Dachary) * libcephfs: fix dirfrag trimming (#10387 Yan, Zheng) * libcephfs: fix mount timeout (#10041 Yan, Zheng) * libcephfs: fix test (#10415 Yan, Zheng) * libcephfs: fix use-afer-free on umount (#10412 Yan, Zheng) * libcephfs: include ceph and git version in client metadata (Sage Weil) * librados: add watch_flush() operation (Sage Weil, Haomai Wang) * librados: avoid memcpy on getxattr, read (Jianpeng Ma) * librados: create ioctx by pool id (Jason Dillaman) * librados: do notify completion in fast-dispatch (Sage Weil) * librados: remove shadowed variable (Kefu Chain) * librados: translate op flags from C APIs (Matthew Richards) * librbd: differentiate between R/O vs R/W features (Jason Dillaman) * librbd: exclusive image locking (Jason Dillaman) * librbd: fix write vs import race (#10590 Jason Dillaman) * librbd: gracefully handle deleted/renamed pools (#10270 Jason Dillaman) * mds: asok command for fetching subtree map (John Spray) * mds: constify MDSCacheObjects (John Spray) * misc: various valgrind fixes and cleanups (Danny Al-Gaaf) * mon: fix 'mds fail' for standby MDSs (John Spray) * mon: fix stashed monmap encoding (#5203 Xie Rui) * mon: implement 'fs reset' command (John Spray) * mon: respect down flag when promoting standbys (John Spray) * mount.ceph: fix suprious error message (#10351 Yan, Zheng) * msgr: async: many fixes, unit tests (Haomai Wang) * msgr: simple: retry binding to port on failure (#10029 Wido den Hollander) * osd: add fadvise flags to ObjectStore API (Jianpeng Ma) * osd: add get_latest_osdmap asok command (#9483 #9484 Mykola Golub) * osd: EIO on whole-object reads when checksum is wrong (Sage Weil) * osd: filejournal: don't cache journal when not using direct IO (Jianpeng Ma) * osd: fix ioprio option (Mykola Golub) * osd: fix scrub delay bug (#10693 Samuel Just) * osd: fix watch reconnect race (#10441 Sage Weil) * osd: handle no-op write with snapshot (#10262 Sage Weil) * osd: journal: fix journal zeroing when direct IO is enabled (Xie Rui) * osd: keyvaluestore: cleanup dead code (Ning Yao) * osd, mds: 'ops' as shorthand for 'dump_ops_in_flight' on asok (Sage Weil) * osd: memstore: fix size limit (Xiaoxi Chen) * osd: misc scrub fixes (#10017 Loic Dachary) * osd: new optimized encoding for ObjectStore::Transaction (Dong Yuan) * osd: optimize filter_snapc (Ning Yao) * osd: optimize WBThrottle map with unordered_map (Ning Yao) * osd: proxy reads during cache promote (Zhiqiang Wang) * osd: proxy read support (Zhiqiang Wang) * osd: remove legacy classic scrub code (Sage Weil) * osd: remove unused fields in MOSDSubOp (Xiaoxi Chen) * osd: replace MOSDSubOp messages with simpler, optimized MOSDRepOp (Xiaoxi Chen) * osd: store whole-object checksums on scrub, write_full (Sage Weil) * osd: verify kernel is new enough before using XFS extsize ioctl, enable by default (#9956 Sage Weil) * rados: use copy-from operation for copy, cppool (Sage Weil) * rgw: change multipart upload id magic (#10271 Yehuda Sadeh) * rgw: decode http query params correction (#10271 Yehuda Sadeh) * rgw: fix content length check (#10701 Axel Dunkel, Yehuda Sadeh) * rgw: fix partial GET in swift (#10553 Yehuda Sadeh) * rgw: fix shutdown (#10472 Yehuda Sadeh) * rgw: include XML ns on get ACL request (#10106 Yehuda Sadeh) * rgw: misc fixes (#10307 Yehuda Sadeh) * rgw: only track cleanup for objects we write (#10311 Yehuda Sadeh) * rgw: tweak error codes (#10329 #10334 Yehuda Sadeh) * rgw: use gc for multipart abort (#10445 Aaron Bassett, Yehuda Sadeh) * sysvinit: fix race in 'stop' (#10389 Loic Dachary) * test: fix bufferlist tests (Jianpeng Ma) * tests: improve docker-based tests (Loic Dachary) v0.91 ===== We are quickly approaching the Hammer feature freeze but have a few more dev releases to go before we get there. The headline items are subtree-based quota support in CephFS (ceph-fuse/libcephfs client support only for now), a rewrite of the watch/notify librados API used by RBD and RGW, OSDMap checksums to ensure that maps are always consistent inside the cluster, new API calls in librados and librbd for IO hinting modeled after posix_fadvise, and improved storage of per-PG state. We expect two more releases before the Hammer feature freeze (v0.93). Upgrading --------- * The 'category' field for objects has been removed. This was originally added to track PG stat summations over different categories of objects for use by radosgw. It is no longer has any known users and is prone to abuse because it can lead to a pg_stat_t structure that is unbounded. The librados API calls that accept this field now ignore it, and the OSD no longers tracks the per-category summations. * The output for 'rados df' has changed. The 'category' level has been eliminated, so there is now a single stat object per pool. The structure of the JSON output is different, and the plaintext output has one less column. * The 'rados create [category]' optional category argument is no longer supported or recognized. * rados.py's Rados class no longer has a __del__ method; it was causing problems on interpreter shutdown and use of threads. If your code has Rados objects with limited lifetimes and you're concerned about locked resources, call Rados.shutdown() explicitly. * There is a new version of the librados watch/notify API with vastly improved semantics. Any applications using this interface are encouraged to migrate to the new API. The old API calls are marked as deprecated and will eventually be removed. * The librados rados_unwatch() call used to be safe to call on an invalid handle. The new version has undefined behavior when passed a bogus value (for example, when rados_watch() returns an error and handle is not defined). * The structure of the formatted 'pg stat' command is changed for the portion that counts states by name to avoid using the '+' character (which appears in state names) as part of the XML token (it is not legal). Notable Changes --------------- * asyncmsgr: misc fixes (Haomai Wang) * buffer: add 'shareable' construct (Matt Benjamin) * build: aarch64 build fixes (Noah Watkins, Haomai Wang) * build: support for jemalloc (Shishir Gowda) * ceph-disk: allow journal partition re-use (#10146 Loic Dachary, Dav van der Ster) * ceph-disk: misc fixes (Christos Stavrakakis) * ceph-fuse: fix kernel cache trimming (#10277 Yan, Zheng) * ceph-objectstore-tool: many many improvements (David Zafman) * common: support new gperftools header locations (Key Dreyer) * crush: straw bucket weight calculation fixes (#9998 Sage Weil) * doc: misc improvements (Nilamdyuti Goswami, John Wilkins, Chris Holcombe) * libcephfs,ceph-fuse: add 'status' asok (John Spray) * librados, osd: new watch/notify implementation (Sage Weil) * librados: drop 'category' feature (Sage Weil) * librados: fix pool deletion handling (#10372 Sage Weil) * librados: new fadvise API (Ma Jianpeng) * libradosstriper: fix remove() (Dongmao Zhang) * librbd: complete pending ops before closing image (#10299 Josh Durgin) * librbd: fadvise API (Ma Jianpeng) * mds: ENOSPC and OSDMap epoch barriers (#7317 John Spray) * mds: dirfrag buf fix (Yan, Zheng) * mds: disallow most commands on inactive MDS's (Greg Farnum) * mds: drop dentries, leases on deleted directories (#10164 Yan, Zheng) * mds: handle zero-size xattr (#10335 Yan, Zheng) * mds: subtree quota support (Yunchuan Wen) * memstore: free space tracking (John Spray) * misc cleanup (Danny Al-Gaaf, David Anderson) * mon: 'osd crush reweight-all' command (Sage Weil) * mon: allow full flag to be manually cleared (#9323 Sage Weil) * mon: delay failure injection (Joao Eduardo Luis) * mon: fix paxos timeouts (#10220 Joao Eduardo Luis) * mon: get canonical OSDMap from leader (#10422 Sage Weil) * msgr: fix RESETSESSION bug (#10080 Greg Farnum) * objectstore: deprecate collection attrs (Sage Weil) * osd, mon: add checksums to all OSDMaps (Sage Weil) * osd: allow deletion of objects with watcher (#2339 Sage Weil) * osd: allow sparse read for Push/Pull (Haomai Wang) * osd: cache reverse_nibbles hash value (Dong Yuan) * osd: drop upgrade support for pre-dumpling (Sage Weil) * osd: enable and use posix_fadvise (Sage Weil) * osd: erasure-code: enforce chunk size alignment (#10211 Loic Dachary) * osd: erasure-code: jerasure support for NEON (Loic Dachary) * osd: erasure-code: relax cauchy w restrictions (#10325 David Zhang, Loic Dachary) * osd: erasure-code: update gf-complete to latest upstream (Loic Dachary) * osd: fix WBTHrottle perf counters (Haomai Wang) * osd: fix backfill bug (#10150 Samuel Just) * osd: fix occasional peering stalls (#10431 Sage Weil) * osd: fix scrub vs try-flush bug (#8011 Samuel Just) * osd: fix stderr with -f or -d (Dan Mick) * osd: misc FIEMAP fixes (Ma Jianpeng) * osd: optimize Finisher (Xinze Chi) * osd: store PG metadata in per-collection objects for better concurrency (Sage Weil) * pyrados: add object lock support (#6114 Mehdi Abaakouk) * pyrados: fix misnamed wait_* routings (#10104 Dan Mick) * pyrados: misc cleanups (Kefu Chai) * qa: add large auth ticket tests (Ilya Dryomov) * qa: many 'make check' improvements (Loic Dachary) * qa: misc tests (Loic Dachary, Yan, Zheng) * rgw: conditional PUT on ETag (#8562 Ray Lv) * rgw: fix error codes (#10334 #10329 Yehuda Sadeh) * rgw: index swift keys appropriately (#10471 Yehuda Sadeh) * rgw: prevent illegal bucket policy that doesn't match placement rule (Yehuda Sadeh) * rgw: run radosgw as apache with systemd (#10125 Loic Dachary) * rgw: support X-Storage-Policy header for Swift storage policy compat (Yehuda Sadeh) * rgw: use \r\n for http headers (#9254 Yehuda Sadeh) * rpm: misc fixes (Key Dreyer) v0.90 ===== This is the last development release before Christmas. There are some API cleanups for librados and librbd, and lots of bug fixes across the board for the OSD, MDS, RGW, and CRUSH. The OSD also gets support for discard (potentially helpful on SSDs, although it is off by default), and there are several improvements to ceph-disk. The next two development releases will be getting a slew of new functionality for hammer. Stay tuned! Upgrading --------- * Previously, the formatted output of 'ceph pg stat -f ...' was a full pg dump that included all metadata about all PGs in the system. It is now a concise summary of high-level PG stats, just like the unformatted 'ceph pg stat' command. * All JSON dumps of floating point values were incorrecting surrounding the value with quotes. These quotes have been removed. Any consumer of structured JSON output that was consuming the floating point values was previously having to interpret the quoted string and will most likely need to be fixed to take the unquoted number. Notable Changes --------------- * arch: fix NEON feaeture detection (#10185 Loic Dachary) * build: adjust build deps for yasm, virtualenv (Jianpeng Ma) * build: improve build dependency tooling (Loic Dachary) * ceph-disk: call partx/partprobe consistency (#9721 Loic Dachary) * ceph-disk: fix dmcrypt key permissions (Loic Dachary) * ceph-disk: fix umount race condition (#10096 Blaine Gardner) * ceph-disk: init=none option (Loic Dachary) * ceph-monstore-tool: fix shutdown (#10093 Loic Dachary) * ceph-objectstore-tool: fix import (#10090 David Zafman) * ceph-objectstore-tool: many improvements and tests (David Zafman) * ceph.spec: package rbd-replay-prep (Ken Dreyer) * common: add 'perf reset ...' admin command (Jianpeng Ma) * common: do not unlock rwlock on destruction (Federico Simoncelli) * common: fix block device discard check (#10296 Sage Weil) * common: remove broken CEPH_LOCKDEP optoin (Kefu Chai) * crush: fix tree bucket behavior (Rongze Zhu) * doc: add build-doc guidlines for Fedora and CentOS/RHEL (Nilamdyuti Goswami) * doc: enable rbd cache on openstack deployments (Sebastien Han) * doc: improved installation nots on CentOS/RHEL installs (John Wilkins) * doc: misc cleanups (Adam Spiers, Sebastien Han, Nilamdyuti Goswami, Ken Dreyer, John Wilkins) * doc: new man pages (Nilamdyuti Goswami) * doc: update release descriptions (Ken Dreyer) * doc: update sepia hardware inventory (Sandon Van Ness) * librados: only export public API symbols (Jason Dillaman) * libradosstriper: fix stat strtoll (Dongmao Zhang) * libradosstriper: fix trunc method (#10129 Sebastien Ponce) * librbd: fix list_children from invalid pool ioctxs (#10123 Jason Dillaman) * librbd: only export public API symbols (Jason Dillaman) * many coverity fixes (Danny Al-Gaaf) * mds: 'flush journal' admin command (John Spray) * mds: fix MDLog IO callback deadlock (John Spray) * mds: fix deadlock during journal probe vs purge (#10229 Yan, Zheng) * mds: fix race trimming log segments (Yan, Zheng) * mds: store backtrace for stray dir (Yan, Zheng) * mds: verify backtrace when fetching dirfrag (#9557 Yan, Zheng) * mon: add max pgs per osd warning (Sage Weil) * mon: fix *_ratio* units and types (Sage Weil) * mon: fix JSON dumps to dump floats as flots and not strings (Sage Weil) * mon: fix formatter 'pg stat' command output (Sage Weil) * msgr: async: several fixes (Haomai Wang) * msgr: simple: fix rare deadlock (Greg Farnum) * osd: batch pg log trim (Xinze Chi) * osd: clean up internal ObjectStore interface (Sage Weil) * osd: do not abort deep scrub on missing hinfo (#10018 Loic Dachary) * osd: fix ghobject_t formatted output to include shard (#10063 Loic Dachary) * osd: fix osd peer check on scrub messages (#9555 Sage Weil) * osd: fix pgls filter ops (#9439 David Zafman) * osd: flush snapshots from cache tier immediately (Sage Weil) * osd: keyvaluestore: fix getattr semantics (Haomai Wang) * osd: keyvaluestore: fix key ordering (#10119 Haomai Wang) * osd: limit in-flight read requests (Jason Dillaman) * osd: log when scrub or repair starts (Loic Dachary) * osd: support for discard for journal trim (Jianpeng Ma) * qa: fix osd create dup tests (#10083 Loic Dachary) * rgw: add location header when object is in another region (VRan Liu) * rgw: check timestamp on s3 keystone auth (#10062 Abhishek Lekshmanan) * rgw: make sysvinit script set ulimit -n properly (Sage Weil) * systemd: better systemd unit files (Owen Synge) * tests: ability to run unit tests under docker (Loic Dachary) v0.89 ===== This is the second development release since Giant. The big items include the first batch of scrub patchs from Greg for CephFS, a rework in the librados object listing API to properly handle namespaces, and a pile of bug fixes for RGW. There are also several smaller issues fixed up in the performance area with buffer alignment and memory copies, osd cache tiering agent, and various CephFS fixes. Upgrading --------- * New ability to list all objects from all namespaces can fail or return incomplete results when not all OSDs have been upgraded. Features rados --all ls, rados cppool, rados export, rados cache-flush-evict-all and rados cache-try-flush-evict-all can also fail or return incomplete results. Notable Changes --------------- * buffer: add list::get_contiguous (Sage Weil) * buffer: avoid rebuild if buffer already contiguous (Jianpeng Ma) * ceph-disk: improved systemd support (Owen Synge) * ceph-disk: set guid if reusing journal partition (Dan van der Ster) * ceph-fuse, libcephfs: allow xattr caps in inject_release_failure (#9800 John Spray) * ceph-fuse, libcephfs: fix I_COMPLETE_ORDERED checks (#9894 Yan, Zheng) * ceph-fuse: fix dentry invalidation on 3.18+ kernels (#9997 Yan, Zheng) * crush: fix detach_bucket (#10095 Sage Weil) * crush: fix several bugs in adjust_item_weight (Rongze Zhu) * doc: add dumpling to firefly upgrade section (#7679 John Wilkins) * doc: document erasure coded pool operations (#9970 Loic Dachary) * doc: file system osd config settings (Kevin Dalley) * doc: key/value store config reference (John Wilkins) * doc: update openstack docs for Juno (Sebastien Han) * fix cluster logging from non-mon daemons (Sage Weil) * init-ceph: check for systemd-run before using it (Boris Ranto) * librados: fix infinite loop with skipped map epochs (#9986 Ding Dinghua) * librados: fix iterator operator= bugs (#10082 David Zafman, Yehuda Sadeh) * librados: fix null deref when pool DNE (#9944 Sage Weil) * librados: fix timer race from recent refactor (Sage Weil) * libradosstriper: fix shutdown hang (Dongmao Zhang) * librbd: don't close a closed parent in failure path (#10030 Jason Dillaman) * librbd: fix diff test (#10002 Josh Durgin) * librbd: fix locking for readahead (#10045 Jason Dillaman) * librbd: refactor unit tests to use fixtures (Jason Dillaman) * many many coverity cleanups (Danny Al-Gaaf) * mds: a whole bunch of initial scrub infrastructure (Greg Farnum) * mds: fix compat_version for MClientSession (#9945 John Spray) * mds: fix reply snapbl (Yan, Zheng) * mon: allow adding tiers to fs pools (#10135 John Spray) * mon: fix MDS health status from peons (#10151 John Spray) * mon: fix caching for min_last_epoch_clean (#9987 Sage Weil) * mon: fix error output for add_data_pool (#9852 Joao Eduardo Luis) * mon: include entity name in audit log for forwarded requests (#9913 Joao Eduardo Luis) * mon: paxos: allow reads while proposing (#9321 #9322 Joao Eduardo Luis) * msgr: asyncmessenger: add kqueue support (#9926 Haomai Wang) * osd, librados: revamp PG listing API to handle namespaces (#9031 #9262 #9438 David Zafman) * osd, mon: send intiial pg create time from mon to osd (#9887 David Zafman) * osd: allow whiteout deletion in cache pool (Sage Weil) * osd: cache pool: ignore min flush age when cache is full (Xinze Chi) * osd: erasure coding: allow bench.sh to test ISA backend (Yuan Zhou) * osd: erasure-code: encoding regression tests, corpus (#9420 Loic Dachary) * osd: fix journal shutdown race (Sage Weil) * osd: fix object age eviction (Zhiqiang Wang) * osd: fix object atime calculation (Xinze Chi) * osd: fix past_interval display bug (#9752 Loic Dachary) * osd: journal: fix alignment checks, avoid useless memmove (Jianpeng Ma) * osd: journal: update committed_thru after replay (#6756 Samuel Just) * osd: keyvaluestore_dev: optimization (Chendi Xue) * osd: make misdirected op checks robust for EC pools (#9835 Sage Weil) * osd: removed some dead code (Xinze Chi) * qa: parallelize make check (Loic Dachary) * qa: tolerate nearly-full disk for make check (Loic Dachary) * rgw: create subuser if needed when creating user (#10103 Yehuda Sadeh) * rgw: fix If-Modified-Since (VRan Liu) * rgw: fix content-length update (#9576 Yehuda Sadeh) * rgw: fix disabling of max_size quota (#9907 Dong Lei) * rgw: fix incorrect len when len is 0 (#9877 Yehuda Sadeh) * rgw: fix object copy content type (#9478 Yehuda Sadeh) * rgw: fix user stags in get-user-info API (#9359 Ray Lv) * rgw: remove swift user manifest (DLO) hash calculation (#9973 Yehuda Sadeh) * rgw: return timestamp on GET/HEAD (#8911 Yehuda Sadeh) * rgw: set ETag on object copy (#9479 Yehuda Sadeh) * rgw: update bucket index on attr changes, for multi-site sync (#5595 Yehuda Sadeh) v0.88 ===== This is the first development release after Giant. The two main features merged this round are the new AsyncMessenger (an alternative implementation of the network layer) from Haomai Wang at UnitedStack, and support for POSIX file locks in ceph-fuse and libcephfs from Yan, Zheng. There is also a big pile of smaller items that re merged while we were stabilizing Giant, including a range of smaller performance and bug fixes and some new tracepoints for LTTNG. Notable Changes --------------- * ceph-disk: Scientific Linux support (Dan van der Ster) * ceph-disk: respect --statedir for keyring (Loic Dachary) * ceph-fuse, libcephfs: POSIX file lock support (Yan, Zheng) * ceph-fuse, libcephfs: fix cap flush overflow (Greg Farnum, Yan, Zheng) * ceph-fuse, libcephfs: fix root inode xattrs (Yan, Zheng) * ceph-fuse, libcephfs: preserve dir ordering (#9178 Yan, Zheng) * ceph-fuse, libcephfs: trim inodes before reconnecting to MDS (Yan, Zheng) * ceph: do not parse injectargs twice (Loic Dachary) * ceph: make 'ceph -s' output more readable (Sage Weil) * ceph: new 'ceph tell mds.$name_or_rank_or_gid' (John Spray) * ceph: test robustness (Joao Eduardo Luis) * ceph_objectstore_tool: behave with sharded flag (#9661 David Zafman) * cephfs-journal-tool: fix journal import (#10025 John Spray) * cephfs-journal-tool: skip up to expire_pos (#9977 John Spray) * cleanup rados.h definitions with macros (Ilya Dryomov) * common: shared_cache unit tests (Cheng Cheng) * config: add $cctid meta variable (Adam Crume) * crush: fix buffer overrun for poorly formed rules (#9492 Johnu George) * crush: improve constness (Loic Dachary) * crushtool: add --location command (Sage Weil, Loic Dachary) * default to libnss instead of crypto++ (Federico Gimenez) * doc: ceph osd reweight vs crush weight (Laurent Guerby) * doc: document the LRC per-layer plugin configuration (Yuan Zhou) * doc: erasure code doc updates (Loic Dachary) * doc: misc updates (Alfredo Deza, VRan Liu) * doc: preflight doc fixes (John Wilkins) * doc: update PG count guide (Gerben Meijer, Laurent Guerby, Loic Dachary) * keyvaluestore: misc fixes (Haomai Wang) * keyvaluestore: performance improvements (Haomai Wang) * librados: add rados_pool_get_base_tier() call (Adam Crume) * librados: cap buffer length (Loic Dachary) * librados: fix objecter races (#9617 Josh Durgin) * libradosstriper: misc fixes (Sebastien Ponce) * librbd: add missing python docstrings (Jason Dillaman) * librbd: add readahead (Adam Crume) * librbd: fix cache tiers in list_children and snap_unprotect (Adam Crume) * librbd: fix performance regression in ObjectCacher (#9513 Adam Crume) * librbd: lttng tracepoints (Adam Crume) * librbd: misc fixes (Xinxin Shu, Jason Dillaman) * mds: fix sessionmap lifecycle bugs (Yan, Zheng) * mds: initialize root inode xattr version (Yan, Zheng) * mds: introduce auth caps (John Spray) * mds: misc bugs (Greg Farnum, John Spray, Yan, Zheng, Henry Change) * misc coverity fixes (Danny Al-Gaaf) * mon: add 'ceph osd rename-bucket ...' command (Loic Dachary) * mon: clean up auth list output (Loic Dachary) * mon: fix 'osd crush link' id resolution (John Spray) * mon: fix misc error paths (Joao Eduardo Luis) * mon: fix paxos off-by-one corner case (#9301 Sage Weil) * mon: new 'ceph pool ls [detail]' command (Sage Weil) * mon: wait for writeable before cross-proposing (#9794 Joao Eduardo Luis) * msgr: avoid useless new/delete (Haomai Wang) * msgr: fix delay injection bug (#9910 Sage Weil, Greg Farnum) * msgr: new AsymcMessenger alternative implementation (Haomai Wang) * msgr: prefetch data when doing recv (Yehuda Sadeh) * osd: add erasure code corpus (Loic Dachary) * osd: add misc tests (Loic Dachary, Danny Al-Gaaf) * osd: cleanup boost optionals (William Kennington) * osd: expose non-journal backends via ceph-osd CLI (Hoamai Wang) * osd: fix JSON output for stray OSDs (Loic Dachary) * osd: fix ioprio options (Loic Dachary) * osd: fix transaction accounting (Jianpeng Ma) * osd: misc optimizations (Xinxin Shu, Zhiqiang Wang, Xinze Chi) * osd: use FIEMAP_FLAGS_SYNC instead of fsync (Jianpeng Ma) * rados: fix put of /dev/null (Loic Dachary) * rados: parse command-line arguments more strictly (#8983 Adam Crume) * rbd-fuse: fix memory leak (Adam Crume) * rbd-replay-many (Adam Crume) * rbd-replay: --anonymize flag to rbd-replay-prep (Adam Crume) * rbd: fix 'rbd diff' for non-existent objects (Adam Crume) * rbd: fix error when striping with format 1 (Sebastien Han) * rbd: fix export for image sizes over 2GB (Vicente Cheng) * rbd: use rolling average for rbd bench-write throughput (Jason Dillaman) * rgw: send explicit HTTP status string (Yehuda Sadeh) * rgw: set length for keystone token validation request (#7796 Yehuda Sadeh, Mark Kirkwood) * udev: fix rules for CentOS7/RHEL7 (Loic Dachary) * use clock_gettime instead of gettimeofday (Jianpeng Ma) * vstart.sh: set up environment for s3-tests (Luis Pabon) v0.87.2 Giant ============= This is the second (and possibly final) point release for Giant. We recommend all v0.87.x Giant users upgrade to this release. Notable Changes --------------- * ceph-objectstore-tool: only output unsupported features when incompatible (#11176 David Zafman) * common: do not implicitly unlock rwlock on destruction (Federico Simoncelli) * common: make wait timeout on empty queue configurable (#10818 Samuel Just) * crush: pick ruleset id that matches and rule id (Xiaoxi Chen) * crush: set_choose_tries = 100 for new erasure code rulesets (#10353 Loic Dachary) * librados: check initialized atomic safely (#9617 Josh Durgin) * librados: fix failed tick_event assert (#11183 Zhiqiang Wang) * librados: fix looping on skipped maps (#9986 Ding Dinghua) * librados: fix op submit with timeout (#10340 Samuel Just) * librados: pybind: fix memory leak (#10723 Billy Olsen) * librados: pybind: keep reference to callbacks (#10775 Josh Durgin) * librados: translate operation flags from C APIs (Matthew Richards) * libradosstriper: fix write_full on ENOENT (#10758 Sebastien Ponce) * libradosstriper: use strtoll instead of strtol (Dongmao Zhang) * mds: fix assertion caused by system time moving backwards (#11053 Yan, Zheng) * mon: allow injection of random delays on writes (Joao Eduardo Luis) * mon: do not trust small osd epoch cache values (#10787 Sage Weil) * mon: fail non-blocking flush if object is being scrubbed (#8011 Samuel Just) * mon: fix division by zero in stats dump (Joao Eduardo Luis) * mon: fix get_rule_avail when no osds (#10257 Joao Eduardo Luis) * mon: fix timeout rounds period (#10546 Joao Eduardo Luis) * mon: ignore osd failures before up_from (#10762 Dan van der Ster, Sage Weil) * mon: paxos: reset accept timeout before writing to store (#10220 Joao Eduardo Luis) * mon: return if fs exists on 'fs new' (Joao Eduardo Luis) * mon: use EntityName when expanding profiles (#10844 Joao Eduardo Luis) * mon: verify cross-service proposal preconditions (#10643 Joao Eduardo Luis) * mon: wait for osdmon to be writeable when requesting proposal (#9794 Joao Eduardo Luis) * mount.ceph: avoid spurious error message about /etc/mtab (#10351 Yan, Zheng) * msg/simple: allow RESETSESSION when we forget an endpoint (#10080 Greg Farnum) * msg/simple: discard delay queue before incoming queue (#9910 Sage Weil) * osd: clear_primary_state when leaving Primary (#10059 Samuel Just) * osd: do not ignore deleted pgs on startup (#10617 Sage Weil) * osd: fix FileJournal wrap to get header out first (#10883 David Zafman) * osd: fix PG leak in SnapTrimWQ (#10421 Kefu Chai) * osd: fix journalq population in do_read_entry (#6003 Samuel Just) * osd: fix operator== for op_queue_age_hit and fs_perf_stat (#10259 Samuel Just) * osd: fix rare assert after split (#10430 David Zafman) * osd: get pgid ancestor from last_map when building past intervals (#10430 David Zafman) * osd: include rollback_info_trimmed_to in {read,write}_log (#10157 Samuel Just) * osd: lock header_lock in DBObjectMap::sync (#9891 Samuel Just) * osd: requeue blocked op before flush it was blocked on (#10512 Sage Weil) * osd: tolerate missing object between list and attr get on backfill (#10150 Samuel Just) * osd: use correct atime for eviction decision (Xinze Chi) * rgw: flush XML header on get ACL request (#10106 Yehuda Sadeh) * rgw: index swift keys appropriately (#10471 Hemant Bruman, Yehuda Sadeh) * rgw: send cancel for bucket index pending ops (#10770 Baijiaruo, Yehuda Sadeh) * rgw: swift: support X_Remove_Container-Meta-{key} (#01475 Dmytro Iurchenko) For more detailed information, see :download:`the complete changelog `. v0.87.1 Giant ============= This is the first (and possibly final) point release for Giant. Our focus on stability fixes will be directed towards Hammer and Firefly. We recommend that all v0.87 Giant users upgrade to this release. Upgrading --------- * Due to a change in the Linux kernel version 3.18 and the limits of the FUSE interface, ceph-fuse needs be mounted as root on at least some systems. See issues #9997, #10277, and #10542 for details. Notable Changes --------------- * build: disable stack-execute bit on assembler objects (#10114 Dan Mick) * build: support boost 1.57.0 (#10688 Ken Dreyer) * ceph-disk: fix dmcrypt file permissions (#9785 Loic Dachary) * ceph-disk: run partprobe after zap, behave with partx or partprobe (#9665 #9721 Loic Dachary) * cephfs-journal-tool: fix import for aged journals (#9977 John Spray) * cephfs-journal-tool: fix journal import (#10025 John Spray) * ceph-fuse: use remount to trim kernel dcache (#10277 Yan, Zheng) * common: add cctid meta variable (#6228 Adam Crume) * common: fix dump of shard for ghobject_t (#10063 Loic Dachary) * crush: fix bucket weight underflow (#9998 Pawel Sadowski) * erasure-code: enforce chunk size alignment (#10211 Loic Dachary) * erasure-code: regression test suite (#9420 Loic Dachary) * erasure-code: relax caucy w restrictions (#10325 Loic Dachary) * libcephfs,ceph-fuse: allow xattr caps on inject_release_failure (#9800 John Spray) * libcephfs,ceph-fuse: fix cap flush tid comparison (#9869 Greg Farnum) * libcephfs,ceph-fuse: new flag to indicated sorted dcache (#9178 Yan, Zheng) * libcephfs,ceph-fuse: prune cache before reconnecting to MDS (Yan, Zheng) * librados: limit number of in-flight read requests (#9854 Jason Dillaman) * libradospy: fix thread shutdown (#8797 Dan Mick) * libradosstriper: fix locking issue in truncate (#10129 Sebastien Ponce) * librbd: complete pending ops before closing mage (#10299 Jason Dillaman) * librbd: fix error path on image open failure (#10030 Jason Dillaman) * librbd: gracefully handle deleted/renamed pools (#10270 Jason Dillaman) * librbd: handle errors when creating ioctx while listing children (#10123 Jason Dillaman) * mds: fix compat version in MClientSession (#9945 John Spray) * mds: fix journaler write error handling (#10011 John Spray) * mds: fix locking for file size recovery (#10229 Yan, Zheng) * mds: handle heartbeat_reset during shutdown (#10382 John Spray) * mds: store backtrace for straydir (Yan, Zheng) * mon: allow tiers for FS pools (#10135 John Spray) * mon: fix caching of last_epoch_clean, osdmap trimming (#9987 Sage Weil) * mon: fix 'fs ls' on peons (#10288 John Spray) * mon: fix MDS health status from peons (#10151 John Spray) * mon: fix paxos off-by-one (#9301 Sage Weil) * msgr: simple: do not block on takeover while holding global lock (#9921 Greg Farnum) * osd: deep scrub must not abort if hinfo is missing (#10018 Loic Dachary) * osd: fix misdirected op detection (#9835 Sage Weil) * osd: fix past_interval display for acting (#9752 Loic Dachary) * osd: fix PG peering backoff when behind on osdmaps (#10431 Sage Weil) * osd: handle no-op write with snapshot case (#10262 Ssage Weil) * osd: use fast-dispatch (Sage Weil, Greg Farnum) * rados: fix write to /dev/null (Loic Dachary) * radosgw-admin: create subuser when needed (#10103 Yehuda Sadeh) * rbd: avoid invalidating aio_write buffer during image import (#10590 Jason Dillaman) * rbd: fix export with images > 2GB (Vicente Cheng) * rgw: change multipart upload id magic (#10271 Georgios Dimitrakakis, Yehuda Sadeh) * rgw: check keystone auth for S3 POST (#10062 Abhishek Lekshmanan) * rgw: check timestamp for S3 keystone auth (#10062 Abhishek Lekshmanan) * rgw: fix partial GET with swift (#10553 Yehuda Sadeh) * rgw: fix quota disable (#9907 Dong Lei) * rgw: fix rare corruption of object metadata on put (#9576 Yehuda Sadeh) * rgw: fix S3 object copy content-type (#9478 Yehuda Sadeh) * rgw: headers end with \r\n (#9254 Benedikt Fraunhofer, Yehuda Sadeh) * rgw: remove swift user manifest DLO hash calculation (#9973 Yehuda Sadeh) * rgw: return correct len when len is 0 (#9877 Yehuda Sadeh) * rgw: return X-Timestamp field (#8911 Yehuda Sadeh) * rgw: run radosgw as apache with systemd (#10125) * rgw: sent ETag on S3 object copy (#9479 Yehuda Sadeh) * rgw: sent HTTP status reason explicitly in fastcgi (Yehuda Sadeh) * rgw: set length for keystone token validation (#7796 Mark Kirkwood, Yehuda Sadeh) * rgw: set ulimit -n on sysvinit before starting daemon (#9587 Sage Weil) * rgw: update bucket index on set_attrs (#5595 Yehuda Sadeh) * rgw: update swift subuser permission masks when authenticating (#9918 Yehuda Sadeh) * rgw: URL decode HTTP query params correction (#10271 Georgios Dimitrakakis, Yehuda Sadeh) * rgw: use cached attrs while reading object attrs (#10307 Yehuda Sadeh) * rgw: use strict_strtoll for content length (#10701 Axel Dunkel, Yehuda Sadeh) For more detailed information, see :download:`the complete changelog `. v0.87 Giant =========== This release will form the basis for the stable release Giant, v0.87.x. Highlights for Giant include: * *RADOS Performance*: a range of improvements have been made in the OSD and client-side librados code that improve the throughput on flash backends and improve parallelism and scaling on fast machines. * *CephFS*: we have fixed a raft of bugs in CephFS and built some basic journal recovery and diagnostic tools. Stability and performance of single-MDS systems is vastly improved in Giant. Although we do not yet recommend CephFS for production deployments, we do encourage testing for non-critical workloads so that we can better guage the feature, usability, performance, and stability gaps. * *Local Recovery Codes*: the OSDs now support an erasure-coding scheme that stores some additional data blocks to reduce the IO required to recover from single OSD failures. * *Degraded vs misplaced*: the Ceph health reports from 'ceph -s' and related commands now make a distinction between data that is degraded (there are fewer than the desired number of copies) and data that is misplaced (stored in the wrong location in the cluster). The distinction is important because the latter does not compromise data safety. * *Tiering improvements*: we have made several improvements to the cache tiering implementation that improve performance. Most notably, objects are not promoted into the cache tier by a single read; they must be found to be sufficiently hot before that happens. * *Monitor performance*: the monitors now perform writes to the local data store asynchronously, improving overall responsiveness. * *Recovery tools*: the ceph_objectstore_tool is greatly expanded to allow manipulation of an individual OSDs data store for debugging and repair purposes. This is most heavily used by our QA infrastructure to exercise recovery code. Upgrade Sequencing ------------------ * If your existing cluster is running a version older than v0.80.x Firefly, please first upgrade to the latest Firefly release before moving on to Giant. We have not tested upgrades directly from Emperor, Dumpling, or older releases. We *have* tested: * Firefly to Giant * Dumpling to Firefly to Giant * Please upgrade daemons in the following order: #. Monitors #. OSDs #. MDSs and/or radosgw Note that the relative ordering of OSDs and monitors should not matter, but we primarily tested upgrading monitors first. Upgrading from v0.80x Firefly ----------------------------- * The client-side caching for librbd is now enabled by default (rbd cache = true). A safety option (rbd cache writethrough until flush = true) is also enabled so that writeback caching is not used until the library observes a 'flush' command, indicating that the librbd users is passing that operation through from the guest VM. This avoids potential data loss when used with older versions of qemu that do not support flush. leveldb_write_buffer_size = 8*1024*1024 = 33554432 // 8MB leveldb_cache_size = 512*1024*1204 = 536870912 // 512MB leveldb_block_size = 64*1024 = 65536 // 64KB leveldb_compression = false leveldb_log = "" OSDs will still maintain the following osd-specific defaults: leveldb_log = "" * The 'rados getxattr ...' command used to add a gratuitous newline to the attr value; it now does not. * The ``*_kb perf`` counters on the monitor have been removed. These are replaced with a new set of ``*_bytes`` counters (e.g., ``cluster_osd_kb`` is replaced by ``cluster_osd_bytes``). * The ``rd_kb`` and ``wr_kb`` fields in the JSON dumps for pool stats (accessed via the ``ceph df detail -f json-pretty`` and related commands) have been replaced with corresponding ``*_bytes`` fields. Similarly, the ``total_space``, ``total_used``, and ``total_avail`` fields are replaced with ``total_bytes``, ``total_used_bytes``, and ``total_avail_bytes`` fields. * The ``rados df --format=json`` output ``read_bytes`` and ``write_bytes`` fields were incorrectly reporting ops; this is now fixed. * The ``rados df --format=json`` output previously included ``read_kb`` and ``write_kb`` fields; these have been removed. Please use ``read_bytes`` and ``write_bytes`` instead (and divide by 1024 if appropriate). * The experimental keyvaluestore-dev OSD backend had an on-disk format change that prevents existing OSD data from being upgraded. This affects developers and testers only. * mon-specific and osd-specific leveldb options have been removed. From this point onward users should use the `leveldb_*` generic options and add the options in the appropriate sections of their configuration files. Monitors will still maintain the following monitor-specific defaults: leveldb_write_buffer_size = 8*1024*1024 = 33554432 // 8MB leveldb_cache_size = 512*1024*1204 = 536870912 // 512MB leveldb_block_size = 64*1024 = 65536 // 64KB leveldb_compression = false leveldb_log = "" OSDs will still maintain the following osd-specific defaults: leveldb_log = "" * CephFS support for the legacy anchor table has finally been removed. Users with file systems created before firefly should ensure that inodes with multiple hard links are modified *prior* to the upgrade to ensure that the backtraces are written properly. For example:: sudo find /mnt/cephfs -type f -links +1 -exec touch \{\} \; * We disallow nonsensical 'tier cache-mode' transitions. From this point onward, 'writeback' can only transition to 'forward' and 'forward' can transition to 1) 'writeback' if there are dirty objects, or 2) any if there are no dirty objects. Notable Changes since v0.86 --------------------------- * ceph-disk: use new udev rules for centos7/rhel7 (#9747 Loic Dachary) * libcephfs-java: fix fstat mode (Noah Watkins) * librados: fix deadlock when listing PG contents (Guang Yang) * librados: misc fixes to the new threading model (#9582 #9706 #9845 #9873 Sage Weil) * mds: fix inotable initialization (Henry C Chang) * mds: gracefully handle unknown lock type in flock requests (Yan, Zheng) * mon: add read-only, read-write, and role-definer rols (Joao Eduardo Luis) * mon: fix mon cap checks (Joao Eduardo Luis) * mon: misc fixes for new paxos async writes (#9635 Sage Weil) * mon: set scrub timestamps on PG creation (#9496 Joao Eduardo Luis) * osd: erasure code: fix buffer alignment (Janne Grunau, Loic Dachary) * osd: fix alloc hint induced crashes on mixed clusters (#9419 David Zafman) * osd: fix backfill reservation release on rejection (#9626, Samuel Just) * osd: fix ioprio option parsing (#9676 #9677 Loic Dachary) * osd: fix memory leak during snap trimming (#9113 Samuel Just) * osd: misc peering and recovery fixes (#9614 #9696 #9731 #9718 #9821 #9875 Samuel Just, Guang Yang) Notable Changes since v0.80.x Firefly ------------------------------------- * bash completion improvements (Wido den Hollander) * brag: fixes, improvements (Loic Dachary) * buffer: improve rebuild_page_aligned (Ma Jianpeng) * build: fix build on alpha (Michael Cree, Dmitry Smirnov) * build: fix CentOS 5 (Gerben Meijer) * build: fix yasm check for x32 (Daniel Schepler, Sage Weil) * ceph-brag: add tox tests (Alfredo Deza) * ceph-conf: flush log on exit (Sage Weil) * ceph.conf: update sample (Sebastien Han) * ceph-dencoder: refactor build a bit to limit dependencies (Sage Weil, Dan Mick) * ceph-disk: add Scientific Linux support (Dan van der Ster) * ceph-disk: do not inadvertantly create directories (Owne Synge) * ceph-disk: fix dmcrypt support (Sage Weil) * ceph-disk: fix dmcrypt support (Stephen Taylor) * ceph-disk: handle corrupt volumes (Stuart Longlang) * ceph-disk: linter cleanup, logging improvements (Alfredo Deza) * ceph-disk: partprobe as needed (Eric Eastman) * ceph-disk: show information about dmcrypt in 'ceph-disk list' output (Sage Weil) * ceph-disk: use partition type UUIDs and blkid (Sage Weil) * ceph: fix for non-default cluster names (#8944, Dan Mick) * ceph-fuse, libcephfs: asok hooks for handling session resets, timeouts (Yan, Zheng) * ceph-fuse, libcephfs: fix crash in trim_caps (John Spray) * ceph-fuse, libcephfs: improve cap trimming (John Spray) * ceph-fuse, libcephfs: improve traceless reply handling (Sage Weil) * ceph-fuse, libcephfs: virtual xattrs for rstat (Yan, Zheng) * ceph_objectstore_tool: vastly improved and extended tool for working offline with OSD data stores (David Zafman) * ceph.spec: many fixes (Erik Logtenberg, Boris Ranto, Dan Mick, Sandon Van Ness) * ceph.spec: split out ceph-common package, other fixes (Sandon Van Ness) * ceph_test_librbd_fsx: fix RNG, make deterministic (Ilya Dryomov) * cephtool: fix help (Yilong Zhao) * cephtool: refactor and improve CLI tests (Joao Eduardo Luis) * cephtool: test cleanup (Joao Eduardo Luis) * clang build fixes (John Spray, Danny Al-Gaaf) * client: improved MDS session dumps (John Spray) * common: add config diff admin socket command (Joao Eduardo Luis) * common: add rwlock assertion checks (Yehuda Sadeh) * common: fix dup log messages (#9080, Sage Weil) * common: perfcounters now use atomics and go faster (Sage Weil) * config: support G, M, K, etc. suffixes (Joao Eduardo Luis) * coverity cleanups (Danny Al-Gaaf) * crush: clean up CrushWrapper interface (Xioaxi Chen) * crush: include new tunables in dump (Sage Weil) * crush: make ruleset ids unique (Xiaoxi Chen, Loic Dachary) * crush: only require rule features if the rule is used (#8963, Sage Weil) * crushtool: send output to stdout, not stderr (Wido den Hollander) * doc: cache tiering (John Wilkins) * doc: CRUSH updates (John Wilkins) * doc: document new upstream wireshark dissector (Kevin Cox) * doc: improve manual install docs (Francois Lafont) * doc: keystone integration docs (John Wilkins) * doc: librados example fixes (Kevin Dalley) * doc: many doc updates (John Wilkins) * doc: many install doc updates (John Wilkins) * doc: misc updates (John Wilkins, Loic Dachary, David Moreau Simard, Wido den Hollander. Volker Voigt, Alfredo Deza, Stephen Jahl, Dan van der Ster) * doc: osd primary affinity (John Wilkins) * doc: pool quotas (John Wilkins) * doc: pre-flight doc improvements (Kevin Dalley) * doc: switch to an unencumbered font (Ross Turk) * doc: updated simple configuration guides (John Wilkins) * doc: update erasure docs (Loic Dachary, Venky Shankar) * doc: update openstack docs (Josh Durgin) * filestore: disable use of XFS hint (buggy on old kernels) (Samuel Just) * filestore: fix xattr spillout (Greg Farnum, Haomai Wang) * fix hppa arch build (Dmitry Smirnov) * fix i386 builds (Sage Weil) * fix struct vs class inconsistencies (Thorsten Behrens) * global: write pid file even when running in foreground (Alexandre Oliva) * hadoop: improve tests (Huamin Chen, Greg Farnum, John Spray) * hadoop: update hadoop tests for Hadoop 2.0 (Haumin Chen) * init-ceph: continue starting other daemons on crush or mount failure (#8343, Sage Weil) * journaler: fix locking (Zheng, Yan) * keyvaluestore: fix hint crash (#8381, Haomai Wang) * keyvaluestore: header cache (Haomai Wang) * libcephfs-java: build against older JNI headers (Greg Farnum) * libcephfs-java: fix gcj-jdk build (Dmitry Smirnov) * librados: fix crash on read op timeout (#9362 Matthias Kiefer, Sage Weil) * librados: fix lock leaks in error paths (#9022, Paval Rallabhandi) * librados: fix pool existence check (#8835, Pavan Rallabhandi) * librados: fix rados_pool_list bounds checks (Sage Weil) * librados: fix shutdown race (#9130 Sage Weil) * librados: fix watch/notify test (#7934 David Zafman) * librados: fix watch reregistration on acting set change (#9220 Samuel Just) * librados: give Objecter fine-grained locks (Yehuda Sadeh, Sage Weil, John Spray) * librados: lttng tracepoitns (Adam Crume) * librados, osd: return ETIMEDOUT on failed notify (Sage Weil) * librados: pybind: fix reads when \0 is present (#9547 Mohammad Salehe) * librados_striper: striping library for librados (Sebastien Ponce) * librbd, ceph-fuse: reduce cache flush overhead (Haomai Wang) * librbd: check error code on cache invalidate (Josh Durgin) * librbd: enable caching by default (Sage Weil) * librbd: enforce cache size on read requests (Jason Dillaman) * librbd: fix crash using clone of flattened image (#8845, Josh Durgin) * librbd: fix error path when opening image (#8912, Josh Durgin) * librbd: handle blacklisting during shutdown (#9105 John Spray) * librbd: lttng tracepoints (Adam Crume) * librbd: new libkrbd library for kernel map/unmap/showmapped (Ilya Dryomov) * librbd: store and retrieve snapshot metadata based on id (Josh Durgin) * libs3: update to latest (Danny Al-Gaaf) * log: fix derr level (Joao Eduardo Luis) * logrotate: fix osd log rotation on ubuntu (Sage Weil) * lttng: tracing infrastructure (Noah Watkins, Adam Crume) * mailmap: many updates (Loic Dachary) * mailmap: updates (Loic Dachary, Abhishek Lekshmanan, M Ranga Swami Reddy) * Makefile: fix out of source builds (Stefan Eilemann) * many many coverity fixes, cleanups (Danny Al-Gaaf) * mds: adapt to new Objecter locking, give types to all Contexts (John Spray) * mds: add file system name, enabled flag (John Spray) * mds: add internal health checks (John Spray) * mds: add min/max UID for snapshot creation/deletion (#9029, Wido den Hollander) * mds: avoid tight mon reconnect loop (#9428 Sage Weil) * mds: boot refactor, cleanup (John Spray) * mds: cephfs-journal-tool (John Spray) * mds: fix crash killing sessions (#9173 John Spray) * mds: fix ctime updates (#9514 Greg Farnum) * mds: fix journal conversion with standby-replay (John Spray) * mds: fix replay locking (Yan, Zheng) * mds: fix standby-replay cache trimming (#8648 Zheng, Yan) * mds: fix xattr bug triggered by ACLs (Yan, Zheng) * mds: give perfcounters meaningful names (Sage Weil) * mds: improve health reporting to monitor (John Spray) * mds: improve Journaler on-disk format (John Spray) * mds: improve journal locking (Zheng, Yan) * mds, libcephfs: use client timestamp for mtime/ctime (Sage Weil) * mds: make max file recoveries tunable (Sage Weil) * mds: misc encoding improvements (John Spray) * mds: misc fixes for multi-mds (Yan, Zheng) * mds: multi-mds fixes (Yan, Zheng) * mds: OPTracker integration, dump_ops_in_flight (Greg Farnum) * mds: prioritize file recovery when appropriate (Sage Weil) * mds: refactor beacon, improve reliability (John Spray) * mds: remove legacy anchor table (Yan, Zheng) * mds: remove legacy discover ino (Yan, Zheng) * mds: restart on EBLACKLISTED (John Spray) * mds: separate inode recovery queue (John Spray) * mds: session ls, evict commands (John Spray) * mds: submit log events in async thread (Yan, Zheng) * mds: track RECALL progress, report failure (#9284 John Spray) * mds: update segment references during journal write (John Spray, Greg Farnum) * mds: use client-provided timestamp for user-visible file metadata (Yan, Zheng) * mds: use meaningful names for clients (John Spray) * mds: validate journal header on load and save (John Spray) * mds: warn clients which aren't revoking caps (Zheng, Yan, John Spray) * misc build errors/warnings for Fedora 20 (Boris Ranto) * misc build fixes for OS X (John Spray) * misc cleanup (Christophe Courtaut) * misc integer size cleanups (Kevin Cox) * misc memory leaks, cleanups, fixes (Danny Al-Gaaf, Sahid Ferdjaoui) * misc suse fixes (Danny Al-Gaaf) * misc word size fixes (Kevin Cox) * mon: add audit log for all admin commands (Joao Eduardo Luis) * mon: add cluster fingerprint (Sage Weil) * mon: add get-quota commands (Joao Eduardo Luis) * mon: add 'osd blocked-by' command to easily see which OSDs are blocking peering progress (Sage Weil) * mon: add 'osd reweight-by-pg' command (Sage Weil, Guang Yang) * mon: add perfcounters for paxos operations (Sage Weil) * mon: avoid creating unnecessary rule on pool create (#9304 Loic Dachary) * monclient: fix hang (Sage Weil) * mon: create default EC profile if needed (Loic Dachary) * mon: do not create file system by default (John Spray) * mon: do not spam log (Aanchal Agrawal, Sage Weil) * mon: drop mon- and osd- specific leveldb options (Joao Eduardo Luis) * mon: ec pool profile fixes (Loic Dachary) * mon: fix bug when no auth keys are present (#8851, Joao Eduardo Luis) * mon: fix 'ceph df' output for available space (Xiaoxi Chen) * mon: fix compat version for MForward (Joao Eduardo Luis) * mon: fix crash on loopback messages and paxos timeouts (#9062, Sage Weil) * mon: fix default replication pool ruleset choice (#8373, John Spray) * mon: fix divide by zero when pg_num is adjusted before OSDs are added (#9101, Sage Weil) * mon: fix double-free of old MOSDBoot (Sage Weil) * mon: fix health down messages (Sage Weil) * mon: fix occasional memory leak after session reset (#9176, Sage Weil) * mon: fix op write latency perfcounter (#9217 Xinxin Shu) * mon: fix 'osd perf' reported latency (#9269 Samuel Just) * mon: fix quorum feature check (#8738, Greg Farnum) * mon: fix ruleset/ruleid bugs (#9044, Loic Dachary) * mon: fix set cache_target_full_ratio (#8440, Geoffrey Hartz) * mon: fix store check on startup (Joao Eduardo Luis) * mon: include per-pool 'max avail' in df output (Sage Weil) * mon: make paxos transaction commits asynchronous (Sage Weil) * mon: make usage dumps in terms of bytes, not kB (Sage Weil) * mon: 'osd crush reweight-subtree ...' (Sage Weil) * mon, osd: relax client EC support requirements (Sage Weil) * mon: preload erasure plugins (#9153 Loic Dachary) * mon: prevent cache pools from being used directly by CephFS (#9435 John Spray) * mon: prevent EC pools from being used with cephfs (Joao Eduardo Luis) * mon: prevent implicit destruction of OSDs with 'osd setmaxosd ...' (#8865, Anand Bhat) * mon: prevent nonsensical cache-mode transitions (Joao Eduardo Luis) * mon: restore original weight when auto-marked out OSDs restart (Sage Weil) * mon: restrict some pool properties to tiered pools (Joao Eduardo Luis) * mon: some instrumentation (Sage Weil) * mon: use msg header tid for MMonGetVersionReply (Ilya Dryomov) * mon: use user-provided ruleset for replicated pool (Xiaoxi Chen) * mon: verify all quorum members are contiguous at end of Paxos round (#9053, Sage Weil) * mon: verify available disk space on startup (#9502 Joao Eduardo Luis) * mon: verify erasure plugin version on load (Loic Dachary) * msgr: avoid big lock when sending (most) messages (Greg Farnum) * msgr: fix logged address (Yongyue Sun) * msgr: misc locking fixes for fast dispatch (#8891, Sage Weil) * msgr: refactor to cleanly separate SimpleMessenger implemenetation, move toward Connection-based calls (Matt Benjamin, Sage Wei) * objecter: flag operations that are redirected by caching (Sage Weil) * objectstore: clean up KeyValueDB interface for key/value backends (Sage Weil) * osd: account for hit_set_archive bytes (Sage Weil) * osd: add ability to prehash filestore directories (Guang Yang) * osd: add 'dump_reservations' admin socket command (Sage Weil) * osd: add feature bit for erasure plugins (Loic Dachary) * osd: add header cache for KeyValueStore (Haomai Wang) * osd: add ISA erasure plugin table cache (Andreas-Joachim Peters) * osd: add local_mtime for use by cache agent (Zhiqiang Wang) * osd: add local recovery code (LRC) erasure plugin (Loic Dachary) * osd: add prototype KineticStore based on Seagate Kinetic (Josh Durgin) * osd: add READFORWARD caching mode (Luis Pabon) * osd: add superblock for KeyValueStore backend (Haomai Wang) * osd: add support for Intel ISA-L erasure code library (Andreas-Joachim Peters) * osd: allow map cache size to be adjusted at runtime (Sage Weil) * osd: avoid refcounting overhead by passing a few things by ref (Somnath Roy) * osd: avoid sharing PG info that is not durable (Samuel Just) * osd: bound osdmap epoch skew between PGs (Sage Weil) * osd: cache tier flushing fixes for snapped objects (Samuel Just) * osd: cap hit_set size (#9339 Samuel Just) * osd: clean up shard_id_t, shard_t (Loic Dachary) * osd: clear FDCache on unlink (#8914 Loic Dachary) * osd: clear slow request latency info on osd up/down (Sage Weil) * osd: do not evict blocked objects (#9285 Zhiqiang Wang) * osd: do not skip promote for write-ordered reads (#9064, Samuel Just) * osd: fix agent early finish looping (David Zafman) * osd: fix ambigous encoding order for blacklisted clients (#9211, Sage Weil) * osd: fix bogus assert during OSD shutdown (Sage Weil) * osd: fix bug with long object names and rename (#8701, Sage Weil) * osd: fix cache flush corner case for snapshotted objects (#9054, Samuel Just) * osd: fix cache full -> not full requeueing (#8931, Sage Weil) * osd: fix clone deletion case (#8334, Sam Just) * osd: fix clone vs cache_evict bug (#8629 Sage Weil) * osd: fix connection reconnect race (Greg Farnum) * osd: fix crash from duplicate backfill reservation (#8863 Sage Weil) * osd: fix dead peer connection checks (#9295 Greg Farnum, Sage Weil) * osd: fix discard of old/obsolete subop replies (#9259, Samuel Just) * osd: fix discard of peer messages from previous intervals (Greg Farnum) * osd: fix dump of open fds on EMFILE (Sage Weil) * osd: fix dumps (Joao Eduardo Luis) * osd: fix erasure-code lib initialization (Loic Dachary) * osd: fix extent normalization (Adam Crume) * osd: fix filestore removal corner case (#8332, Sam Just) * osd: fix flush vs OpContext (Samuel Just) * osd: fix gating of messages from old OSD instances (Greg Farnum) * osd: fix hang waiting for osdmap (#8338, Greg Farnum) * osd: fix interval check corner case during peering (#8104, Sam Just) * osd: fix ISA erasure alignment (Loic Dachary, Andreas-Joachim Peters) * osd: fix journal dump (Ma Jianpeng) * osd: fix journal-less operation (Sage Weil) * osd: fix keyvaluestore scrub (#8589 Haomai Wang) * osd: fix keyvaluestore upgrade (Haomai Wang) * osd: fix loopback msgr issue (Ma Jianpeng) * osd: fix LSB release parsing (Danny Al-Gaaf) * osd: fix MarkMeDown and other shutdown races (Sage Weil) * osd: fix memstore bugs with collection_move_rename, lock ordering (Sage Weil) * osd: fix min_read_recency_for_promote default on upgrade (Zhiqiang Wang) * osd: fix mon feature bit requirements bug and resulting log spam (Sage Weil) * osd: fix mount/remount sync race (#9144 Sage Weil) * osd: fix PG object listing/ordering bug (Guang Yang) * osd: fix PG stat errors with tiering (#9082, Sage Weil) * osd: fix purged_snap initialization on backfill (Sage Weil, Samuel Just, Dan van der Ster, Florian Haas) * osd: fix race condition on object deletion (#9480 Somnath Roy) * osd: fix recovery chunk size usage during EC recovery (Ma Jianpeng) * osd: fix recovery reservation deadlock for EC pools (Samuel Just) * osd: fix removal of old xattrs when overwriting chained xattrs (Ma Jianpeng) * osd: fix requesting queueing on PG split (Samuel Just) * osd: fix scrub vs cache bugs (Samuel Just) * osd: fix snap object writeback from cache tier (#9054 Samuel Just) * osd: fix trim of hitsets (Sage Weil) * osd: force new xattrs into leveldb if fs returns E2BIG (#7779, Sage Weil) * osd: implement alignment on chunk sizes (Loic Dachary) * osd: improved backfill priorities (Sage Weil) * osd: improve journal shutdown (Ma Jianpeng, Mark Kirkwood) * osd: improve locking for KeyValueStore (Haomai Wang) * osd: improve locking in OpTracker (Pavan Rallabhandi, Somnath Roy) * osd: improve prioritization of recovery of degraded over misplaced objects (Sage Weil) * osd: improve tiering agent arithmetic (Zhiqiang Wang, Sage Weil, Samuel Just) * osd: include backend information in metadata reported to mon (Sage Weil) * osd: locking, sharding, caching improvements in FileStore's FDCache (Somnath Roy, Greg Farnum) * osd: lttng tracepoints for filestore (Noah Watkins) * osd: make blacklist encoding deterministic (#9211 Sage Weil) * osd: make tiering behave if hit_sets aren't enabled (Sage Weil) * osd: many important bug fixes (Samuel Just) * osd: many many core fixes (Samuel Just) * osd: many many important fixes (#8231 #8315 #9113 #9179 #9293 #9294 #9326 #9453 #9481 #9482 #9497 #9574 Samuel Just) * osd: mark pools with incomplete clones (Sage Weil) * osd: misc erasure code plugin fixes (Loic Dachary) * osd: misc locking fixes for fast dispatch (Samuel Just, Ma Jianpeng) * osd, mon: add rocksdb support (Xinxin Shu, Sage Weil) * osd, mon: config sanity checks on start (Sage Weil, Joao Eduardo Luis) * osd, mon: distinguish between "misplaced" and "degraded" objects in cluster health and PG state reporting (Sage Weil) * osd, msgr: fast-dispatch of OSD ops (Greg Farnum, Samuel Just) * osd, objecter: resend ops on last_force_op_resend barrier; fix cache overlay op ordering (Sage Weil) * osd: preload erasure plugins (#9153 Loic Dachary) * osd: prevent old rados clients from using tiered pools (#8714, Sage Weil) * osd: reduce OpTracker overhead (Somnath Roy) * osd: refactor some ErasureCode functionality into command parent class (Loic Dachary) * osd: remove obsolete classic scrub code (David Zafman) * osd: scrub PGs with invalid stats (Sage Weil) * osd: set configurable hard limits on object and xattr names (Sage Weil, Haomai Wang) * osd: set rollback_info_completed on create (#8625, Samuel Just) * osd: sharded threadpool to improve parallelism (Somnath Roy) * osd: shard OpTracker to improve performance (Somnath Roy) * osd: simple io prioritization for scrub (Sage Weil) * osd: simple scrub throttling (Sage Weil) * osd: simple snap trimmer throttle (Sage Weil) * osd: tests for bench command (Loic Dachary) * osd: trim old EC objects quickly; verify on scrub (Samuel Just) * osd: use FIEMAP to inform copy_range (Haomai Wang) * osd: use local time for tiering decisions (Zhiqiang Wang) * osd: use xfs hint less frequently (Ilya Dryomov) * osd: verify erasure plugin version on load (Loic Dachary) * osd: work around GCC 4.8 bug in journal code (Matt Benjamin) * pybind/rados: fix small timeouts (John Spray) * qa: xfstests updates (Ilya Dryomov) * rados: allow setxattr value to be read from stdin (Sage Weil) * rados bench: fix arg order (Kevin Dalley) * rados: drop gratuitous \n from getxattr command (Sage Weil) * rados: fix bench write arithmetic (Jiangheng) * rados: fix {read,write}_ops values for df output (Sage Weil) * rbd: add rbdmap pre- and post post- hooks, fix misc bugs (Dmitry Smirnov) * rbd-fuse: allow exposing single image (Stephen Taylor) * rbd-fuse: fix unlink (Josh Durgin) * rbd: improve option default behavior (Josh Durgin) * rbd: parallelize rbd import, export (Jason Dillaman) * rbd: rbd-replay utility to replay captured rbd workload traces (Adam Crume) * rbd: use write-back (not write-through) when caching is enabled (Jason Dillaman) * removed mkcephfs (deprecated since dumpling) * rest-api: fix help (Ailing Zhang) * rgw: add civetweb as default frontent on port 7490 (#9013 Yehuda Sadeh) * rgw: add --min-rewrite-stripe-size for object restriper (Yehuda Sadeh) * rgw: add powerdns hook for dynamic DNS for global clusters (Wido den Hollander) * rgw: add S3 bucket get location operation (Abhishek Lekshmanan) * rgw: allow : in S3 access key (Roman Haritonov) * rgw: automatically align writes to EC pool (#8442, Yehuda Sadeh) * rgw: bucket link uses instance id (Yehuda Sadeh) * rgw: cache bucket info (Yehuda Sadeh) * rgw: cache decoded user info (Yehuda Sadeh) * rgw: check entity permission for put_metadata (#8428, Yehuda Sadeh) * rgw: copy object data is target bucket is in a different pool (#9039, Yehuda Sadeh) * rgw: do not try to authenticate CORS preflight requests (#8718, Robert Hubbard, Yehuda Sadeh) * rgw: fix admin create user op (#8583 Ray Lv) * rgw: fix civetweb URL decoding (#8621, Yehuda Sadeh) * rgw: fix crash on swift CORS preflight request (#8586, Yehuda Sadeh) * rgw: fix log filename suffix (#9353 Alexandre Marangone) * rgw: fix memory leak following chunk read error (Yehuda Sadeh) * rgw: fix memory leaks (Andrey Kuznetsov) * rgw: fix multipart object attr regression (#8452, Yehuda Sadeh) * rgw: fix multipart upload (#8846, Silvain Munaut, Yehuda Sadeh) * rgw: fix radosgw-admin 'show log' command (#8553, Yehuda Sadeh) * rgw: fix removal of objects during object creation (Patrycja Szablowska, Yehuda Sadeh) * rgw: fix striping for copied objects (#9089, Yehuda Sadeh) * rgw: fix test for identify whether an object has a tail (#9226, Yehuda Sadeh) * rgw: fix URL decoding (#8702, Brian Rak) * rgw: fix URL escaping (Yehuda Sadeh) * rgw: fix usage (Abhishek Lekshmanan) * rgw: fix user manifest (Yehuda Sadeh) * rgw: fix when stripe size is not a multiple of chunk size (#8937, Yehuda Sadeh) * rgw: handle empty extra pool name (Yehuda Sadeh) * rgw: improve civetweb logging (Yehuda Sadeh) * rgw: improve delimited listing of bucket, misc fixes (Yehuda Sadeh) * rgw: improve -h (Abhishek Lekshmanan) * rgw: many fixes for civetweb (Yehuda Sadeh) * rgw: misc civetweb fixes (Yehuda Sadeh) * rgw: misc civetweb frontend fixes (Yehuda Sadeh) * rgw: object and bucket rewrite functions to allow restriping old objects (Yehuda Sadeh) * rgw: powerdns backend for global namespaces (Wido den Hollander) * rgw: prevent multiobject PUT race (Yehuda Sadeh) * rgw: send user manifest header (Yehuda Sadeh) * rgw: subuser creation fixes (#8587 Yehuda Sadeh) * rgw: use systemd-run from sysvinit script (JuanJose Galvez) * rpm: do not restart daemons on upgrade (Alfredo Deza) * rpm: misc packaging fixes for rhel7 (Sandon Van Ness) * rpm: split ceph-common from ceph (Sandon Van Ness) * systemd: initial systemd config files (Federico Simoncelli) * systemd: wrap started daemons in new systemd environment (Sage Weil, Dan Mick) * sysvinit: add support for non-default cluster names (Alfredo Deza) * sysvinit: less sensitive to failures (Sage Weil) * test_librbd_fsx: test krbd as well as librbd (Ilya Dryomov) * unit test improvements (Loic Dachary) * upstart: increase max open files limit (Sage Weil) * vstart.sh: fix/improve rgw support (Luis Pabon, Abhishek Lekshmanan) v0.86 ===== This is a release candidate for Giant, which will hopefully be out in another week or two. We did a feature freeze about a month ago and since then have been doing only stabilization and bug fixing (and a handful on low-risk enhancements). A fair bit of new functionality went into the final sprint, but it's baked for quite a while now and we're feeling pretty good about it. Major items include: * librados locking refactor to improve scaling and client performance * local recovery code (LRC) erasure code plugin to trade some additional storage overhead for improved recovery performance * LTTNG tracing framework, with initial tracepoints in librados, librbd, and the OSD FileStore backend * separate monitor audit log for all administrative commands * asynchronos monitor transaction commits to reduce the impact on monitor read requests while processing updates * low-level tool for working with individual OSD data stores for debugging, recovery, and testing * many MDS improvements (bug fixes, health reporting) There are still a handful of known bugs in this release, but nothing severe enough to prevent a release. By and large we are pretty pleased with the stability and expect the final Giant release to be quite reliable. Please try this out on your non-production clusters for a preview Notable Changes --------------- * buffer: improve rebuild_page_aligned (Ma Jianpeng) * build: fix CentOS 5 (Gerben Meijer) * build: fix build on alpha (Michael Cree, Dmitry Smirnov) * build: fix yasm check for x32 (Daniel Schepler, Sage Weil) * ceph-disk: add Scientific Linux support (Dan van der Ster) * ceph-fuse, libcephfs: fix crash in trim_caps (John Spray) * ceph-fuse, libcephfs: improve cap trimming (John Spray) * ceph-fuse, libcephfs: virtual xattrs for rstat (Yan, Zheng) * ceph.conf: update sample (Sebastien Han) * ceph.spec: many fixes (Erik Logtenberg, Boris Ranto, Dan Mick, Sandon Van Ness) * ceph_objectstore_tool: vastly improved and extended tool for working offline with OSD data stores (David Zafman) * common: add config diff admin socket command (Joao Eduardo Luis) * common: add rwlock assertion checks (Yehuda Sadeh) * crush: clean up CrushWrapper interface (Xioaxi Chen) * crush: make ruleset ids unique (Xiaoxi Chen, Loic Dachary) * doc: improve manual install docs (Francois Lafont) * doc: misc updates (John Wilkins, Loic Dachary, David Moreau Simard, Wido den Hollander. Volker Voigt, Alfredo Deza, Stephen Jahl, Dan van der Ster) * global: write pid file even when running in foreground (Alexandre Oliva) * hadoop: improve tests (Huamin Chen, Greg Farnum, John Spray) * journaler: fix locking (Zheng, Yan) * librados, osd: return ETIMEDOUT on failed notify (Sage Weil) * librados: fix crash on read op timeout (#9362 Matthias Kiefer, Sage Weil) * librados: fix shutdown race (#9130 Sage Weil) * librados: fix watch reregistration on acting set change (#9220 Samuel Just) * librados: fix watch/notify test (#7934 David Zafman) * librados: give Objecter fine-grained locks (Yehuda Sadeh, Sage Weil, John Spray) * librados: lttng tracepoitns (Adam Crume) * librados: pybind: fix reads when \0 is present (#9547 Mohammad Salehe) * librbd: enforce cache size on read requests (Jason Dillaman) * librbd: handle blacklisting during shutdown (#9105 John Spray) * librbd: lttng tracepoints (Adam Crume) * lttng: tracing infrastructure (Noah Watkins, Adam Crume) * mailmap: updates (Loic Dachary, Abhishek Lekshmanan, M Ranga Swami Reddy) * many many coverity fixes, cleanups (Danny Al-Gaaf) * mds: adapt to new Objecter locking, give types to all Contexts (John Spray) * mds: add internal health checks (John Spray) * mds: avoid tight mon reconnect loop (#9428 Sage Weil) * mds: fix crash killing sessions (#9173 John Spray) * mds: fix ctime updates (#9514 Greg Farnum) * mds: fix replay locking (Yan, Zheng) * mds: fix standby-replay cache trimming (#8648 Zheng, Yan) * mds: give perfcounters meaningful names (Sage Weil) * mds: improve health reporting to monitor (John Spray) * mds: improve journal locking (Zheng, Yan) * mds: make max file recoveries tunable (Sage Weil) * mds: prioritize file recovery when appropriate (Sage Weil) * mds: refactor beacon, improve reliability (John Spray) * mds: restart on EBLACKLISTED (John Spray) * mds: track RECALL progress, report failure (#9284 John Spray) * mds: update segment references during journal write (John Spray, Greg Farnum) * mds: use meaningful names for clients (John Spray) * mds: warn clients which aren't revoking caps (Zheng, Yan, John Spray) * mon: add 'osd reweight-by-pg' command (Sage Weil, Guang Yang) * mon: add audit log for all admin commands (Joao Eduardo Luis) * mon: add cluster fingerprint (Sage Weil) * mon: avoid creating unnecessary rule on pool create (#9304 Loic Dachary) * mon: do not spam log (Aanchal Agrawal, Sage Weil) * mon: fix 'osd perf' reported latency (#9269 Samuel Just) * mon: fix double-free of old MOSDBoot (Sage Weil) * mon: fix op write latency perfcounter (#9217 Xinxin Shu) * mon: fix store check on startup (Joao Eduardo Luis) * mon: make paxos transaction commits asynchronous (Sage Weil) * mon: preload erasure plugins (#9153 Loic Dachary) * mon: prevent cache pools from being used directly by CephFS (#9435 John Spray) * mon: use user-provided ruleset for replicated pool (Xiaoxi Chen) * mon: verify available disk space on startup (#9502 Joao Eduardo Luis) * mon: verify erasure plugin version on load (Loic Dachary) * msgr: fix logged address (Yongyue Sun) * osd: account for hit_set_archive bytes (Sage Weil) * osd: add ISA erasure plugin table cache (Andreas-Joachim Peters) * osd: add ability to prehash filestore directories (Guang Yang) * osd: add feature bit for erasure plugins (Loic Dachary) * osd: add local recovery code (LRC) erasure plugin (Loic Dachary) * osd: cap hit_set size (#9339 Samuel Just) * osd: clear FDCache on unlink (#8914 Loic Dachary) * osd: do not evict blocked objects (#9285 Zhiqiang Wang) * osd: fix ISA erasure alignment (Loic Dachary, Andreas-Joachim Peters) * osd: fix clone vs cache_evict bug (#8629 Sage Weil) * osd: fix crash from duplicate backfill reservation (#8863 Sage Weil) * osd: fix dead peer connection checks (#9295 Greg Farnum, Sage Weil) * osd: fix keyvaluestore scrub (#8589 Haomai Wang) * osd: fix keyvaluestore upgrade (Haomai Wang) * osd: fix min_read_recency_for_promote default on upgrade (Zhiqiang Wang) * osd: fix mount/remount sync race (#9144 Sage Weil) * osd: fix purged_snap initialization on backfill (Sage Weil, Samuel Just, Dan van der Ster, Florian Haas) * osd: fix race condition on object deletion (#9480 Somnath Roy) * osd: fix snap object writeback from cache tier (#9054 Samuel Just) * osd: improve journal shutdown (Ma Jianpeng, Mark Kirkwood) * osd: improve locking in OpTracker (Pavan Rallabhandi, Somnath Roy) * osd: improve tiering agent arithmetic (Zhiqiang Wang, Sage Weil, Samuel Just) * osd: lttng tracepoints for filestore (Noah Watkins) * osd: make blacklist encoding deterministic (#9211 Sage Weil) * osd: many many important fixes (#8231 #8315 #9113 #9179 #9293 #9294 #9326 #9453 #9481 #9482 #9497 #9574 Samuel Just) * osd: misc erasure code plugin fixes (Loic Dachary) * osd: preload erasure plugins (#9153 Loic Dachary) * osd: shard OpTracker to improve performance (Somnath Roy) * osd: use local time for tiering decisions (Zhiqiang Wang) * osd: verify erasure plugin version on load (Loic Dachary) * rados: fix bench write arithmetic (Jiangheng) * rbd: parallelize rbd import, export (Jason Dillaman) * rbd: rbd-replay utility to replay captured rbd workload traces (Adam Crume) * rbd: use write-back (not write-through) when caching is enabled (Jason Dillaman) * rgw: add S3 bucket get location operation (Abhishek Lekshmanan) * rgw: add civetweb as default frontent on port 7490 (#9013 Yehuda Sadeh) * rgw: allow : in S3 access key (Roman Haritonov) * rgw: fix admin create user op (#8583 Ray Lv) * rgw: fix log filename suffix (#9353 Alexandre Marangone) * rgw: fix usage (Abhishek Lekshmanan) * rgw: many fixes for civetweb (Yehuda Sadeh) * rgw: subuser creation fixes (#8587 Yehuda Sadeh) * rgw: use systemd-run from sysvinit script (JuanJose Galvez) * unit test improvements (Loic Dachary) * vstart.sh: fix/improve rgw support (Luis Pabon, Abhishek Lekshmanan) v0.85 ===== This is the second-to-last development release before Giant that contains new functionality. The big items to land during this cycle are the messenger refactoring from Matt Benjmain that lays some groundwork for RDMA support, a performance improvement series from SanDisk that improves performance on SSDs, lots of improvements to our new standalone civetweb-based RGW frontend, and a new 'osd blocked-by' mon command that allows admins to easily identify which OSDs are blocking peering progress. The other big change is that the OSDs and Monitors now distinguish between "misplaced" and "degraded" objects: the latter means there are fewer copies than we'd like, while the former simply means the are not stored in the locations where we want them to be. Also of note is a change to librbd that enables client-side caching by default. This is coupled with another option that makes the cache write-through until a "flush" operations is observed: this implies that the librbd user (usually a VM guest OS) supports barriers and flush and that it is safe for the cache to switch into writeback mode without compromising data safety or integrity. It has long been recommended practice that these options be enabled (e.g., in OpenStack environments) but until now it has not been the default. We have frozen the tree for the looming Giant release, and the next development release will be a release candidate with a final batch of new functionality. Upgrading --------- * The client-side caching for librbd is now enabled by default (rbd cache = true). A safety option (rbd cache writethrough until flush = true) is also enabled so that writeback caching is not used until the library observes a 'flush' command, indicating that the librbd users is passing that operation through from the guest VM. This avoids potential data loss when used with older versions of qemu that do not support flush. leveldb_write_buffer_size = 32*1024*1024 = 33554432 // 32MB leveldb_cache_size = 512*1024*1204 = 536870912 // 512MB leveldb_block_size = 64*1024 = 65536 // 64KB leveldb_compression = false leveldb_log = "" OSDs will still maintain the following osd-specific defaults: leveldb_log = "" * The 'rados getxattr ...' command used to add a gratuitous newline to the attr value; it now does not. Notable Changes --------------- * ceph-disk: do not inadvertantly create directories (Owne Synge) * ceph-disk: fix dmcrypt support (Sage Weil) * ceph-disk: linter cleanup, logging improvements (Alfredo Deza) * ceph-disk: show information about dmcrypt in 'ceph-disk list' output (Sage Weil) * ceph-disk: use partition type UUIDs and blkid (Sage Weil) * ceph: fix for non-default cluster names (#8944, Dan Mick) * doc: document new upstream wireshark dissector (Kevin Cox) * doc: many install doc updates (John Wilkins) * librados: fix lock leaks in error paths (#9022, Paval Rallabhandi) * librados: fix pool existence check (#8835, Pavan Rallabhandi) * librbd: enable caching by default (Sage Weil) * librbd: fix crash using clone of flattened image (#8845, Josh Durgin) * librbd: store and retrieve snapshot metadata based on id (Josh Durgin) * mailmap: many updates (Loic Dachary) * mds: add min/max UID for snapshot creation/deletion (#9029, Wido den Hollander) * misc build errors/warnings for Fedora 20 (Boris Ranto) * mon: add 'osd blocked-by' command to easily see which OSDs are blocking peering progress (Sage Weil) * mon: add perfcounters for paxos operations (Sage Weil) * mon: create default EC profile if needed (Loic Dachary) * mon: fix crash on loopback messages and paxos timeouts (#9062, Sage Weil) * mon: fix divide by zero when pg_num is adjusted before OSDs are added (#9101, Sage Weil) * mon: fix occasional memory leak after session reset (#9176, Sage Weil) * mon: fix ruleset/ruleid bugs (#9044, Loic Dachary) * mon: make usage dumps in terms of bytes, not kB (Sage Weil) * mon: prevent implicit destruction of OSDs with 'osd setmaxosd ...' (#8865, Anand Bhat) * mon: verify all quorum members are contiguous at end of Paxos round (#9053, Sage Weil) * msgr: refactor to cleanly separate SimpleMessenger implemenetation, move toward Connection-based calls (Matt Benjamin, Sage Wei) * objectstore: clean up KeyValueDB interface for key/value backends (Sage Weil) * osd: add local_mtime for use by cache agent (Zhiqiang Wang) * osd: add superblock for KeyValueStore backend (Haomai Wang) * osd: add support for Intel ISA-L erasure code library (Andreas-Joachim Peters) * osd: do not skip promote for write-ordered reads (#9064, Samuel Just) * osd: fix ambigous encoding order for blacklisted clients (#9211, Sage Weil) * osd: fix cache flush corner case for snapshotted objects (#9054, Samuel Just) * osd: fix discard of old/obsolete subop replies (#9259, Samuel Just) * osd: fix discard of peer messages from previous intervals (Greg Farnum) * osd: fix dump of open fds on EMFILE (Sage Weil) * osd: fix journal dump (Ma Jianpeng) * osd: fix mon feature bit requirements bug and resulting log spam (Sage Weil) * osd: fix recovery chunk size usage during EC recovery (Ma Jianpeng) * osd: fix recovery reservation deadlock for EC pools (Samuel Just) * osd: fix removal of old xattrs when overwriting chained xattrs (Ma Jianpeng) * osd: fix requesting queueing on PG split (Samuel Just) * osd: force new xattrs into leveldb if fs returns E2BIG (#7779, Sage Weil) * osd: implement alignment on chunk sizes (Loic Dachary) * osd: improve prioritization of recovery of degraded over misplaced objects (Sage Weil) * osd: locking, sharding, caching improvements in FileStore's FDCache (Somnath Roy, Greg Farnum) * osd: many important bug fixes (Samuel Just) * osd, mon: add rocksdb support (Xinxin Shu, Sage Weil) * osd, mon: distinguish between "misplaced" and "degraded" objects in cluster health and PG state reporting (Sage Weil) * osd: refactor some ErasureCode functionality into command parent class (Loic Dachary) * osd: set rollback_info_completed on create (#8625, Samuel Just) * rados: allow setxattr value to be read from stdin (Sage Weil) * rados: drop gratuitous \n from getxattr command (Sage Weil) * rgw: add --min-rewrite-stripe-size for object restriper (Yehuda Sadeh) * rgw: add powerdns hook for dynamic DNS for global clusters (Wido den Hollander) * rgw: copy object data is target bucket is in a different pool (#9039, Yehuda Sadeh) * rgw: do not try to authenticate CORS preflight requests (#8718, Robert Hubbard, Yehuda Sadeh) * rgw: fix civetweb URL decoding (#8621, Yehuda Sadeh) * rgw: fix removal of objects during object creation (Patrycja Szablowska, Yehuda Sadeh) * rgw: fix striping for copied objects (#9089, Yehuda Sadeh) * rgw: fix test for identify whether an object has a tail (#9226, Yehuda Sadeh) * rgw: fix when stripe size is not a multiple of chunk size (#8937, Yehuda Sadeh) * rgw: improve civetweb logging (Yehuda Sadeh) * rgw: misc civetweb frontend fixes (Yehuda Sadeh) * sysvinit: add support for non-default cluster names (Alfredo Deza) v0.84 ===== The next Ceph development release is here! This release contains several meaty items, including some MDS improvements for journaling, the ability to remove the CephFS file system (and name it), several mon cleanups with tiered pools, several OSD performance branches, a new "read forward" RADOS caching mode, a prototype Kinetic OSD backend, and various radosgw improvements (especially with the new standalone civetweb frontend). And there are a zillion OSD bug fixes. Things are looking pretty good for the Giant release that is coming up in the next month. Upgrading --------- * The ``*_kb perf`` counters on the monitor have been removed. These are replaced with a new set of ``*_bytes`` counters (e.g., ``cluster_osd_kb`` is replaced by ``cluster_osd_bytes``). * The ``rd_kb`` and ``wr_kb`` fields in the JSON dumps for pool stats (accessed via the ``ceph df detail -f json-pretty`` and related commands) have been replaced with corresponding ``*_bytes`` fields. Similarly, the ``total_space``, ``total_used``, and ``total_avail`` fields are replaced with ``total_bytes``, ``total_used_bytes``, and ``total_avail_bytes`` fields. * The ``rados df --format=json`` output ``read_bytes`` and ``write_bytes`` fields were incorrectly reporting ops; this is now fixed. * The ``rados df --format=json`` output previously included ``read_kb`` and ``write_kb`` fields; these have been removed. Please use ``read_bytes`` and ``write_bytes`` instead (and divide by 1024 if appropriate). Notable Changes --------------- * ceph-conf: flush log on exit (Sage Weil) * ceph-dencoder: refactor build a bit to limit dependencies (Sage Weil, Dan Mick) * ceph.spec: split out ceph-common package, other fixes (Sandon Van Ness) * ceph_test_librbd_fsx: fix RNG, make deterministic (Ilya Dryomov) * cephtool: refactor and improve CLI tests (Joao Eduardo Luis) * client: improved MDS session dumps (John Spray) * common: fix dup log messages (#9080, Sage Weil) * crush: include new tunables in dump (Sage Weil) * crush: only require rule features if the rule is used (#8963, Sage Weil) * crushtool: send output to stdout, not stderr (Wido den Hollander) * fix i386 builds (Sage Weil) * fix struct vs class inconsistencies (Thorsten Behrens) * hadoop: update hadoop tests for Hadoop 2.0 (Haumin Chen) * librbd, ceph-fuse: reduce cache flush overhead (Haomai Wang) * librbd: fix error path when opening image (#8912, Josh Durgin) * mds: add file system name, enabled flag (John Spray) * mds: boot refactor, cleanup (John Spray) * mds: fix journal conversion with standby-replay (John Spray) * mds: separate inode recovery queue (John Spray) * mds: session ls, evict commands (John Spray) * mds: submit log events in async thread (Yan, Zheng) * mds: use client-provided timestamp for user-visible file metadata (Yan, Zheng) * mds: validate journal header on load and save (John Spray) * misc build fixes for OS X (John Spray) * misc integer size cleanups (Kevin Cox) * mon: add get-quota commands (Joao Eduardo Luis) * mon: do not create file system by default (John Spray) * mon: fix 'ceph df' output for available space (Xiaoxi Chen) * mon: fix bug when no auth keys are present (#8851, Joao Eduardo Luis) * mon: fix compat version for MForward (Joao Eduardo Luis) * mon: restrict some pool properties to tiered pools (Joao Eduardo Luis) * msgr: misc locking fixes for fast dispatch (#8891, Sage Weil) * osd: add 'dump_reservations' admin socket command (Sage Weil) * osd: add READFORWARD caching mode (Luis Pabon) * osd: add header cache for KeyValueStore (Haomai Wang) * osd: add prototype KineticStore based on Seagate Kinetic (Josh Durgin) * osd: allow map cache size to be adjusted at runtime (Sage Weil) * osd: avoid refcounting overhead by passing a few things by ref (Somnath Roy) * osd: avoid sharing PG info that is not durable (Samuel Just) * osd: clear slow request latency info on osd up/down (Sage Weil) * osd: fix PG object listing/ordering bug (Guang Yang) * osd: fix PG stat errors with tiering (#9082, Sage Weil) * osd: fix bug with long object names and rename (#8701, Sage Weil) * osd: fix cache full -> not full requeueing (#8931, Sage Weil) * osd: fix gating of messages from old OSD instances (Greg Farnum) * osd: fix memstore bugs with collection_move_rename, lock ordering (Sage Weil) * osd: improve locking for KeyValueStore (Haomai Wang) * osd: make tiering behave if hit_sets aren't enabled (Sage Weil) * osd: mark pools with incomplete clones (Sage Weil) * osd: misc locking fixes for fast dispatch (Samuel Just, Ma Jianpeng) * osd: prevent old rados clients from using tiered pools (#8714, Sage Weil) * osd: reduce OpTracker overhead (Somnath Roy) * osd: set configurable hard limits on object and xattr names (Sage Weil, Haomai Wang) * osd: trim old EC objects quickly; verify on scrub (Samuel Just) * osd: work around GCC 4.8 bug in journal code (Matt Benjamin) * rados bench: fix arg order (Kevin Dalley) * rados: fix {read,write}_ops values for df output (Sage Weil) * rbd: add rbdmap pre- and post post- hooks, fix misc bugs (Dmitry Smirnov) * rbd: improve option default behavior (Josh Durgin) * rgw: automatically align writes to EC pool (#8442, Yehuda Sadeh) * rgw: fix crash on swift CORS preflight request (#8586, Yehuda Sadeh) * rgw: fix memory leaks (Andrey Kuznetsov) * rgw: fix multipart upload (#8846, Silvain Munaut, Yehuda Sadeh) * rgw: improve -h (Abhishek Lekshmanan) * rgw: improve delimited listing of bucket, misc fixes (Yehuda Sadeh) * rgw: misc civetweb fixes (Yehuda Sadeh) * rgw: powerdns backend for global namespaces (Wido den Hollander) * systemd: initial systemd config files (Federico Simoncelli) v0.83 ===== Another Ceph development release! This has been a longer cycle, so there has been quite a bit of bug fixing and stabilization in this round. There is also a bunch of packaging fixes for RPM distros (RHEL/CentOS, Fedora, and SUSE) and for systemd. We've also added a new librados-striper library from Sebastien Ponce that provides a generic striping API for applications to code to. Upgrading --------- * The experimental keyvaluestore-dev OSD backend had an on-disk format change that prevents existing OSD data from being upgraded. This affects developers and testers only. * mon-specific and osd-specific leveldb options have been removed. From this point onward users should use the `leveldb_*` generic options and add the options in the appropriate sections of their configuration files. Monitors will still maintain the following monitor-specific defaults: leveldb_write_buffer_size = 32*1024*1024 = 33554432 // 32MB leveldb_cache_size = 512*1024*1204 = 536870912 // 512MB leveldb_block_size = 64*1024 = 65536 // 64KB leveldb_compression = false leveldb_log = "" OSDs will still maintain the following osd-specific defaults: leveldb_log = "" Notable Changes --------------- * ceph-disk: fix dmcrypt support (Stephen Taylor) * cephtool: fix help (Yilong Zhao) * cephtool: test cleanup (Joao Eduardo Luis) * doc: librados example fixes (Kevin Dalley) * doc: many doc updates (John Wilkins) * doc: update erasure docs (Loic Dachary, Venky Shankar) * filestore: disable use of XFS hint (buggy on old kernels) (Samuel Just) * filestore: fix xattr spillout (Greg Farnum, Haomai Wang) * keyvaluestore: header cache (Haomai Wang) * librados_striper: striping library for librados (Sebastien Ponce) * libs3: update to latest (Danny Al-Gaaf) * log: fix derr level (Joao Eduardo Luis) * logrotate: fix osd log rotation on ubuntu (Sage Weil) * mds: fix xattr bug triggered by ACLs (Yan, Zheng) * misc memory leaks, cleanups, fixes (Danny Al-Gaaf, Sahid Ferdjaoui) * misc suse fixes (Danny Al-Gaaf) * misc word size fixes (Kevin Cox) * mon: drop mon- and osd- specific leveldb options (Joao Eduardo Luis) * mon: ec pool profile fixes (Loic Dachary) * mon: fix health down messages (Sage Weil) * mon: fix quorum feature check (#8738, Greg Farnum) * mon: 'osd crush reweight-subtree ...' (Sage Weil) * mon, osd: relax client EC support requirements (Sage Weil) * mon: some instrumentation (Sage Weil) * objecter: flag operations that are redirected by caching (Sage Weil) * osd: clean up shard_id_t, shard_t (Loic Dachary) * osd: fix connection reconnect race (Greg Farnum) * osd: fix dumps (Joao Eduardo Luis) * osd: fix erasure-code lib initialization (Loic Dachary) * osd: fix extent normalization (Adam Crume) * osd: fix loopback msgr issue (Ma Jianpeng) * osd: fix LSB release parsing (Danny Al-Gaaf) * osd: improved backfill priorities (Sage Weil) * osd: many many core fixes (Samuel Just) * osd, mon: config sanity checks on start (Sage Weil, Joao Eduardo Luis) * osd: sharded threadpool to improve parallelism (Somnath Roy) * osd: simple io prioritization for scrub (Sage Weil) * osd: simple scrub throttling (Sage Weil) * osd: tests for bench command (Loic Dachary) * osd: use xfs hint less frequently (Ilya Dryomov) * pybind/rados: fix small timeouts (John Spray) * qa: xfstests updates (Ilya Dryomov) * rgw: cache bucket info (Yehuda Sadeh) * rgw: cache decoded user info (Yehuda Sadeh) * rgw: fix multipart object attr regression (#8452, Yehuda Sadeh) * rgw: fix radosgw-admin 'show log' command (#8553, Yehuda Sadeh) * rgw: fix URL decoding (#8702, Brian Rak) * rgw: handle empty extra pool name (Yehuda Sadeh) * rpm: do not restart daemons on upgrade (Alfredo Deza) * rpm: misc packaging fixes for rhel7 (Sandon Van Ness) * rpm: split ceph-common from ceph (Sandon Van Ness) * systemd: wrap started daemons in new systemd environment (Sage Weil, Dan Mick) * sysvinit: less sensitive to failures (Sage Weil) * upstart: increase max open files limit (Sage Weil) v0.82 ===== This is the second post-firefly development release. It includes a range of bug fixes and some usability improvements. There are some MDS debugging and diagnostic tools, an improved 'ceph df', and some OSD backend refactoring and cleanup. Notable Changes --------------- * ceph-brag: add tox tests (Alfredo Deza) * common: perfcounters now use atomics and go faster (Sage Weil) * doc: CRUSH updates (John Wilkins) * doc: osd primary affinity (John Wilkins) * doc: pool quotas (John Wilkins) * doc: pre-flight doc improvements (Kevin Dalley) * doc: switch to an unencumbered font (Ross Turk) * doc: update openstack docs (Josh Durgin) * fix hppa arch build (Dmitry Smirnov) * init-ceph: continue starting other daemons on crush or mount failure (#8343, Sage Weil) * keyvaluestore: fix hint crash (#8381, Haomai Wang) * libcephfs-java: build against older JNI headers (Greg Farnum) * librados: fix rados_pool_list bounds checks (Sage Weil) * mds: cephfs-journal-tool (John Spray) * mds: improve Journaler on-disk format (John Spray) * mds, libcephfs: use client timestamp for mtime/ctime (Sage Weil) * mds: misc encoding improvements (John Spray) * mds: misc fixes for multi-mds (Yan, Zheng) * mds: OPTracker integration, dump_ops_in_flight (Greg Farnum) * misc cleanup (Christophe Courtaut) * mon: fix default replication pool ruleset choice (#8373, John Spray) * mon: fix set cache_target_full_ratio (#8440, Geoffrey Hartz) * mon: include per-pool 'max avail' in df output (Sage Weil) * mon: prevent EC pools from being used with cephfs (Joao Eduardo Luis) * mon: restore original weight when auto-marked out OSDs restart (Sage Weil) * mon: use msg header tid for MMonGetVersionReply (Ilya Dryomov) * osd: fix bogus assert during OSD shutdown (Sage Weil) * osd: fix clone deletion case (#8334, Sam Just) * osd: fix filestore removal corner case (#8332, Sam Just) * osd: fix hang waiting for osdmap (#8338, Greg Farnum) * osd: fix interval check corner case during peering (#8104, Sam Just) * osd: fix journal-less operation (Sage Weil) * osd: include backend information in metadata reported to mon (Sage Weil) * rest-api: fix help (Ailing Zhang) * rgw: check entity permission for put_metadata (#8428, Yehuda Sadeh) v0.81 ===== This is the first development release since Firefly. It includes a lot of work that we delayed merging while stabilizing things. Lots of new functionality, as well as several fixes that are baking a bit before getting backported. Upgrading --------- * CephFS support for the legacy anchor table has finally been removed. Users with file systems created before firefly should ensure that inodes with multiple hard links are modified *prior* to the upgrade to ensure that the backtraces are written properly. For example:: sudo find /mnt/cephfs -type f -links +1 -exec touch \{\} \; * Disallow nonsensical 'tier cache-mode' transitions. From this point onward, 'writeback' can only transition to 'forward' and 'forward' can transition to 1) 'writeback' if there are dirty objects, or 2) any if there are no dirty objects. Notable Changes --------------- * bash completion improvements (Wido den Hollander) * brag: fixes, improvements (Loic Dachary) * ceph-disk: handle corrupt volumes (Stuart Longlang) * ceph-disk: partprobe as needed (Eric Eastman) * ceph-fuse, libcephfs: asok hooks for handling session resets, timeouts (Yan, Zheng) * ceph-fuse, libcephfs: improve traceless reply handling (Sage Weil) * clang build fixes (John Spray, Danny Al-Gaaf) * config: support G, M, K, etc. suffixes (Joao Eduardo Luis) * coverity cleanups (Danny Al-Gaaf) * doc: cache tiering (John Wilkins) * doc: keystone integration docs (John Wilkins) * doc: updated simple configuration guides (John Wilkins) * libcephfs-java: fix gcj-jdk build (Dmitry Smirnov) * librbd: check error code on cache invalidate (Josh Durgin) * librbd: new libkrbd library for kernel map/unmap/showmapped (Ilya Dryomov) * Makefile: fix out of source builds (Stefan Eilemann) * mds: multi-mds fixes (Yan, Zheng) * mds: remove legacy anchor table (Yan, Zheng) * mds: remove legacy discover ino (Yan, Zheng) * monclient: fix hang (Sage Weil) * mon: prevent nonsensical cache-mode transitions (Joao Eduardo Luis) * msgr: avoid big lock when sending (most) messages (Greg Farnum) * osd: bound osdmap epoch skew between PGs (Sage Weil) * osd: cache tier flushing fixes for snapped objects (Samuel Just) * osd: fix agent early finish looping (David Zafman) * osd: fix flush vs OpContext (Samuel Just) * osd: fix MarkMeDown and other shutdown races (Sage Weil) * osd: fix scrub vs cache bugs (Samuel Just) * osd: fix trim of hitsets (Sage Weil) * osd, msgr: fast-dispatch of OSD ops (Greg Farnum, Samuel Just) * osd, objecter: resend ops on last_force_op_resend barrier; fix cache overlay op ordering (Sage Weil) * osd: remove obsolete classic scrub code (David Zafman) * osd: scrub PGs with invalid stats (Sage Weil) * osd: simple snap trimmer throttle (Sage Weil) * osd: use FIEMAP to inform copy_range (Haomai Wang) * rbd-fuse: allow exposing single image (Stephen Taylor) * rbd-fuse: fix unlink (Josh Durgin) * removed mkcephfs (deprecated since dumpling) * rgw: bucket link uses instance id (Yehuda Sadeh) * rgw: fix memory leak following chunk read error (Yehuda Sadeh) * rgw: fix URL escaping (Yehuda Sadeh) * rgw: fix user manifest (Yehuda Sadeh) * rgw: object and bucket rewrite functions to allow restriping old objects (Yehuda Sadeh) * rgw: prevent multiobject PUT race (Yehuda Sadeh) * rgw: send user manifest header (Yehuda Sadeh) * test_librbd_fsx: test krbd as well as librbd (Ilya Dryomov) v0.80.11 Firefly ================ This is a bugfix release for Firefly. This Firefly 0.80.x is nearing its planned end of life in January 2016 it may also be the last. We recommend that all Firefly users upgrade. For more detailed information, see :download:`the complete changelog `. Notable Changes --------------- * build/ops: /etc/init.d/radosgw restart does not work correctly (`issue#11140 `_, `pr#5831 `_, Dmitry Yatsushkevich) * build/ops: Fix -Wno-format and -Werror=format-security options clash (`issue#13417 `_, `pr#6207 `_, Boris Ranto) * build/ops: ceph-common needs python-argparse on older distros, but doesn't require it (`issue#12034 `_, `pr#5217 `_, Nathan Cutler) * build/ops: ceph.spec.in running fdupes unnecessarily (`issue#12301 `_, `pr#5224 `_, Nathan Cutler) * build/ops: ceph.spec.in: 50-rbd.rules conditional is wrong (`issue#12166 `_, `pr#5225 `_, Nathan Cutler) * build/ops: ceph.spec.in: useless %py_requires breaks SLE11-SP3 build (`issue#12351 `_, `pr#5394 `_, Nathan Cutler) * build/ops: fedora21 has junit, not junit4 (`issue#10728 `_, `pr#6203 `_, Ken Dreyer, Loic Dachary) * build/ops: upstart: configuration is too generous on restarts (`issue#11798 `_, `pr#5992 `_, Sage Weil) * common: Client admin socket leaks file descriptors (`issue#11535 `_, `pr#4633 `_, Jon Bernard) * common: FileStore calls syncfs(2) even it is not supported (`issue#12512 `_, `pr#5529 `_, Danny Al-Gaaf, Kefu Chai, Jianpeng Ma) * common: HeartBeat: include types (`issue#13088 `_, `pr#6038 `_, Sage Weil) * common: Malformed JSON command output when non-ASCII strings are present (`issue#7387 `_, `pr#4635 `_, Kefu Chai, Tim Serong) * common: Memory leak in Mutex.cc, pthread_mutexattr_init without pthread_mutexattr_destroy (`issue#11762 `_, `pr#5403 `_, Ketor Meng) * common: Thread:pthread_attr_destroy(thread_attr) when done with it (`issue#12570 `_, `pr#6325 `_, Piotr Dałek, Zheng Qiankun) * common: ThreadPool add/remove work queue methods not thread safe (`issue#12662 `_, `pr#5991 `_, Jason Dillaman) * common: buffer: critical bufferlist::zero bug (`issue#12252 `_, `pr#5388 `_, Haomai Wang) * common: log: take mutex while opening fd (`issue#12465 `_, `pr#5406 `_, Samuel Just) * common: recursive lock of md_config_t (0) (`issue#12614 `_, `pr#5814 `_, Josh Durgin) * crush: take crashes due to invalid arg (`issue#11602 `_, `pr#4769 `_, Sage Weil) * doc: backport v0.80.10 release notes to firefly (`issue#11090 `_, `pr#5307 `_, Loic Dachary, Sage Weil) * doc: update docs to point to download.ceph.com (`issue#13162 `_, `pr#5993 `_, Alfredo Deza) * fs: MDSMonitor: handle MDSBeacon messages properly (`issue#11590 `_, `pr#5199 `_, Kefu Chai) * fs: client nonce collision due to unshared pid namespaces (`issue#13032 `_, `pr#6087 `_, Josh Durgin, Sage Weil) * librbd: Objectcacher setting max object counts too low (`issue#7385 `_, `pr#4639 `_, Jason Dillaman) * librbd: aio calls may block (`issue#11056 `_, `pr#4854 `_, Haomai Wang, Sage Weil, Jason Dillaman) * librbd: internal.cc: 1967: FAILED assert(watchers.size() == 1) (`issue#12176 `_, `pr#5171 `_, Jason Dillaman) * mon: Clock skew causes missing summary and confuses Calamari (`issue#11877 `_, `pr#4867 `_, Thorsten Behrens) * mon: EC pools are not allowed as cache pools, disallow in the mon (`issue#11650 `_, `pr#5389 `_, Samuel Just) * mon: Make it more difficult to delete pools in firefly (`issue#11800 `_, `pr#4788 `_, Sage Weil) * mon: MonitorDBStore: get_next_key() only if prefix matches (`issue#11786 `_, `pr#5360 `_, Joao Eduardo Luis) * mon: PaxosService: call post_refresh() instead of post_paxos_update() (`issue#11470 `_, `pr#5358 `_, Joao Eduardo Luis) * mon: add a cache layer over MonitorDBStore (`issue#12638 `_, `pr#5698 `_, Kefu Chai) * mon: adding exsting pool as tier with --force-nonempty clobbers removed_snaps (`issue#11493 `_, `pr#5236 `_, Sage Weil, Samuel Just) * mon: ceph fails to compile with boost 1.58 (`issue#11576 `_, `pr#5129 `_, Kefu Chai) * mon: does not check for IO errors on every transaction (`issue#13089 `_, `pr#6091 `_, Sage Weil) * mon: get pools health'info have error (`issue#12402 `_, `pr#5410 `_, renhwztetecs) * mon: increase globalid default for firefly (`issue#13255 `_, `pr#6010 `_, Sage Weil) * mon: pgmonitor: wrong at/near target max“ reporting (`issue#12401 `_, `pr#5409 `_, huangjun) * mon: register_new_pgs() should check ruleno instead of its index (`issue#12210 `_, `pr#5404 `_, Xinze Chi) * mon: scrub error (osdmap encoding mismatch?) upgrading from 0.80 to ~0.80.2 (`issue#8815 `_, `issue#8674 `_, `issue#9064 `_, `pr#5200 `_, Sage Weil, Zhiqiang Wang, Samuel Just) * mon: the output is wrong when runing ceph osd reweight (`issue#12251 `_, `pr#5408 `_, Joao Eduardo Luis) * objecter: can get stuck in redirect loop if osdmap epoch == last_force_op_resend (`issue#11026 `_, `pr#4597 `_, Jianpeng Ma, Sage Weil) * objecter: pg listing can deadlock when throttling is in use (`issue#9008 `_, `pr#5043 `_, Guang Yang) * objecter: resend linger ops on split (`issue#9806 `_, `pr#5062 `_, Josh Durgin, Samuel Just) * osd: Cleanup boost optionals for boost 1.56 (`issue#9983 `_, `pr#5039 `_, William A. Kennington III) * osd: LibRadosTwoPools[EC]PP.PromoteSnap failure (`issue#10052 `_, `pr#5050 `_, Sage Weil) * osd: Mutex Assert from PipeConnection::try_get_pipe (`issue#12437 `_, `pr#5815 `_, David Zafman) * osd: PG stuck with remapped (`issue#9614 `_, `pr#5044 `_, Guang Yang) * osd: PG::handle_advance_map: on_pool_change after handling the map change (`issue#12809 `_, `pr#5988 `_, Samuel Just) * osd: PGLog: split divergent priors as well (`issue#11069 `_, `pr#4631 `_, Samuel Just) * osd: PGLog::proc_replica_log: correctly handle case where entries between olog.head and log.tail were split out (`issue#11358 `_, `pr#5287 `_, Samuel Just) * osd: WBThrottle::clear_object: signal on cond when we reduce throttle values (`issue#12223 `_, `pr#5822 `_, Samuel Just) * osd: cache full mode still skips young objects (`issue#10006 `_, `pr#5051 `_, Xinze Chi, Zhiqiang Wang) * osd: crash creating/deleting pools (`issue#12429 `_, `pr#5526 `_, John Spray) * osd: explicitly specify OSD features in MOSDBoot (`issue#10911 `_, `pr#4960 `_, Sage Weil) * osd: is_new_interval() fixes (`issue#11771 `_, `issue#10399 `_, `pr#5726 `_, Samuel Just, Jason Dillaman) * osd: make the all osd/filestore thread pool suicide timeouts separately configurable (`issue#11439 `_, `pr#5823 `_, Samuel Just) * osd: object creation by write cannot use an offset on an erasure coded pool (`issue#11507 `_, `pr#4632 `_, Jianpeng Ma, Loic Dachary) * osd: os/FileJournal: Fix journal write fail, align for direct io (`issue#12943 `_, `pr#5619 `_, Xie Rui) * osd: osd/PGLog.cc: 732: FAILED assert(log.log.size() == log_keys_debug.size()) (`issue#12652 `_, `pr#5820 `_, Sage Weil) * osd: read on chunk-aligned xattr not handled (`issue#12309 `_, `pr#5235 `_, Sage Weil) * rgw: Change variable length array of std::strings (not legal in C++) to std::vector (`issue#12467 `_, `pr#4583 `_, Daniel J. Hofmann) * rgw: Civetweb RGW appears to report full size of object as downloaded when only partially downloaded (`issue#11851 `_, `pr#5234 `_, Yehuda Sadeh) * rgw: Keystone PKI token expiration is not enforced (`issue#11367 `_, `pr#4765 `_, Anton Aksola) * rgw: Object copy bug (`issue#11639 `_, `pr#4762 `_, Javier M. Mellid) * rgw: RGW returns requested bucket name raw in "Bucket" response header (`issue#11860 `_, `issue#12537 `_, `pr#5730 `_, Yehuda Sadeh, Wido den Hollander) * rgw: Swift API: response for PUT on /container does not contain the mandatory Content-Length header when FCGI is used (`issue#11036 `_, `pr#5170 `_, Radoslaw Zarzynski) * rgw: content length parsing calls strtol() instead of strtoll() (`issue#10701 `_, `pr#5997 `_, Yehuda Sadeh) * rgw: delete bucket does not remove .bucket.meta file (`issue#11149 `_, `pr#4641 `_, Orit Wasserman) * rgw: doesn't return 'x-timestamp' in header which is used by 'View Details' of OpenStack (`issue#8911 `_, `pr#4584 `_, Yehuda Sadeh) * rgw: init some manifest fields when handling explicit objs (`issue#11455 `_, `pr#5729 `_, Yehuda Sadeh) * rgw: logfile does not get chowned properly (`issue#12073 `_, `pr#5233 `_, Thorsten Behrens) * rgw: logrotate.conf calls service with wrong init script name (`issue#12043 `_, `pr#5390 `_, wuxingyi) * rgw: quota not respected in POST object (`issue#11323 `_, `pr#4642 `_, Sergey Arkhipov) * rgw: swift smoke test fails on TestAccountUTF8 (`issue#11091 `_, `issue#11438 `_, `issue#12939 `_, `issue#12157 `_, `issue#12158 `_, `issue#12363 `_, `pr#5532 `_, Radoslaw Zarzynski, Orit Wasserman, Robin H. Johnson) * rgw: use correct objv_tracker for bucket instance (`issue#11416 `_, `pr#4535 `_, Yehuda Sadeh) * tests: ceph-fuse crash in test_client_recovery (`issue#12673 `_, `pr#5813 `_, Loic Dachary) * tests: kernel_untar_build fails on EL7 (`issue#11758 `_, `pr#6000 `_, Greg Farnum) * tests: qemu workunit refers to apt-mirror.front.sepia.ceph.com (`issue#13420 `_, `pr#6328 `_, Yuan Zhou, Sage Weil) * tools: src/ceph-disk : disk zap sgdisk invocation (`issue#11143 `_, `pr#4636 `_, Thorsten Behrens, Owen Synge) * tools: ceph-disk: sometimes the journal symlink is not created (`issue#10146 `_, `pr#5541 `_, Dan van der Ster) * tools: ceph-disk: support NVMe device partitions (`issue#11612 `_, `pr#4771 `_, Ilja Slepnev) * tools: ceph-post-file fails on rhel7 (`issue#11836 `_, `pr#5037 `_, Joseph McDonald, Sage Weil) * tools: ceph_argparse_flag has no regular 3rd parameter (`issue#11543 `_, `pr#4582 `_, Thorsten Behrens) * tools: use a new disk as journal disk,ceph-disk prepare fail (`issue#10983 `_, `pr#4630 `_, Loic Dachary) v0.80.10 Firefly ================ This is a bugfix release for Firefly. We recommend that all Firefly users upgrade. For more detailed information, see :download:`the complete changelog `. Notable Changes --------------- * build/ops: ceph.spec.in: package mkcephfs on EL6 (`issue#11955 `_, `pr#4924 `_, Ken Dreyer) * build/ops: debian: ceph-test and rest-bench debug packages should require their respective binary packages (`issue#11673 `_, `pr#4766 `_, Ken Dreyer) * build/ops: run RGW as root (`issue#11453 `_, `pr#4638 `_, Ken Dreyer) * common: messages/MWatchNotify: include an error code in the message (`issue#9193 `_, `pr#3944 `_, Sage Weil) * common: Rados.shutdown() dies with Illegal instruction (core dumped) (`issue#10153 `_, `pr#3963 `_, Federico Simoncelli) * common: SimpleMessenger: allow RESETSESSION whenever we forget an endpoint (`issue#10080 `_, `pr#3915 `_, Greg Farnum) * common: WorkQueue: make wait timeout on empty queue configurable (`issue#10817 `_, `pr#3941 `_, Samuel Just) * crush: set_choose_tries = 100 for erasure code rulesets (`issue#10353 `_, `pr#3824 `_, Loic Dachary) * doc: backport ceph-disk man page to Firefly (`issue#10724 `_, `pr#3936 `_, Nilamdyuti Goswami) * doc: Fix ceph command manpage to match ceph -h (`issue#10676 `_, `pr#3996 `_, David Zafman) * fs: mount.ceph: avoid spurious error message (`issue#10351 `_, `pr#3927 `_, Yan, Zheng) * librados: Fix memory leak in python rados bindings (`issue#10723 `_, `pr#3935 `_, Josh Durgin) * librados: fix resources leakage in RadosClient::connect() (`issue#10425 `_, `pr#3828 `_, Radoslaw Zarzynski) * librados: Translate operation flags from C APIs (`issue#10497 `_, `pr#3930 `_, Matt Richards) * librbd: acquire cache_lock before refreshing parent (`issue#5488 `_, `pr#4206 `_, Jason Dillaman) * librbd: snap_remove should ignore -ENOENT errors (`issue#11113 `_, `pr#4245 `_, Jason Dillaman) * mds: fix assertion caused by system clock backwards (`issue#11053 `_, `pr#3970 `_, Yan, Zheng) * mon: ignore osd failures from before up_from (`issue#10762 `_, `pr#3937 `_, Sage Weil) * mon: MonCap: take EntityName instead when expanding profiles (`issue#10844 `_, `pr#3942 `_, Joao Eduardo Luis) * mon: Monitor: fix timecheck rounds period (`issue#10546 `_, `pr#3932 `_, Joao Eduardo Luis) * mon: OSDMonitor: do not trust small values in osd epoch cache (`issue#10787 `_, `pr#3823 `_, Sage Weil) * mon: OSDMonitor: fallback to json-pretty in case of invalid formatter (`issue#9538 `_, `pr#4475 `_, Loic Dachary) * mon: PGMonitor: several stats output error fixes (`issue#10257 `_, `pr#3826 `_, Joao Eduardo Luis) * objecter: fix map skipping (`issue#9986 `_, `pr#3952 `_, Ding Dinghua) * osd: cache tiering: fix the atime logic of the eviction (`issue#9915 `_, `pr#3949 `_, Zhiqiang Wang) * osd: cancel_pull: requeue waiters (`issue#11244 `_, `pr#4415 `_, Samuel Just) * osd: check that source OSD is valid for MOSDRepScrub (`issue#9555 `_, `pr#3947 `_, Sage Weil) * osd: DBObjectMap: lock header_lock on sync() (`issue#9891 `_, `pr#3948 `_, Samuel Just) * osd: do not ignore deleted pgs on startup (`issue#10617 `_, `pr#3933 `_, Sage Weil) * osd: ENOENT on clone (`issue#11199 `_, `pr#4385 `_, Samuel Just) * osd: erasure-code-profile set races with erasure-code-profile rm (`issue#11144 `_, `pr#4383 `_, Loic Dachary) * osd: FAILED assert(soid < scrubber.start || soid >= scrubber.end) (`issue#11156 `_, `pr#4185 `_, Samuel Just) * osd: FileJournal: fix journalq population in do_read_entry() (`issue#6003 `_, `pr#3960 `_, Samuel Just) * osd: fix negative degraded objects during backfilling (`issue#7737 `_, `pr#4021 `_, Guang Yang) * osd: get the currently atime of the object in cache pool for eviction (`issue#9985 `_, `pr#3950 `_, Sage Weil) * osd: load_pgs: we need to handle the case where an upgrade from earlier versions which ignored non-existent pgs resurrects a pg with a prehistoric osdmap (`issue#11429 `_, `pr#4556 `_, Samuel Just) * osd: ObjectStore: Don't use largest_data_off to calc data_align. (`issue#10014 `_, `pr#3954 `_, Jianpeng Ma) * osd: osd_types: op_queue_age_hist and fs_perf_stat should be in osd_stat_t::o... (`issue#10259 `_, `pr#3827 `_, Samuel Just) * osd: PG::actingset should be used when checking the number of acting OSDs for... (`issue#11454 `_, `pr#4453 `_, Guang Yang) * osd: PG::all_unfound_are_queried_or_lost for non-existent osds (`issue#10976 `_, `pr#4416 `_, Mykola Golub) * osd: PG: always clear_primary_state (`issue#10059 `_, `pr#3955 `_, Samuel Just) * osd: PGLog.h: 279: FAILED assert(log.log.size() == log_keys_debug.size()) (`issue#10718 `_, `pr#4382 `_, Samuel Just) * osd: PGLog: include rollback_info_trimmed_to in (read|write)_log (`issue#10157 `_, `pr#3964 `_, Samuel Just) * osd: pg stuck stale after create with activation delay (`issue#11197 `_, `pr#4384 `_, Samuel Just) * osd: ReplicatedPG: fail a non-blocking flush if the object is being scrubbed (`issue#8011 `_, `pr#3943 `_, Samuel Just) * osd: ReplicatedPG::on_change: clean up callbacks_for_degraded_object (`issue#8753 `_, `pr#3940 `_, Samuel Just) * osd: ReplicatedPG::scan_range: an object can disappear between the list and t... (`issue#10150 `_, `pr#3962 `_, Samuel Just) * osd: requeue blocked op before flush it was blocked on (`issue#10512 `_, `pr#3931 `_, Sage Weil) * rgw: check for timestamp for s3 keystone auth (`issue#10062 `_, `pr#3958 `_, Abhishek Lekshmanan) * rgw: civetweb should use unique request id (`issue#11720 `_, `pr#4780 `_, Orit Wasserman) * rgw: don't allow negative / invalid content length (`issue#11890 `_, `pr#4829 `_, Yehuda Sadeh) * rgw: fail s3 POST auth if keystone not configured (`issue#10698 `_, `pr#3966 `_, Yehuda Sadeh) * rgw: flush xml header on get acl request (`issue#10106 `_, `pr#3961 `_, Yehuda Sadeh) * rgw: generate new tag for object when setting object attrs (`issue#11256 `_, `pr#4571 `_, Yehuda Sadeh) * rgw: generate the "Date" HTTP header for civetweb. (`issue#11871,11891 `_, `pr#4851 `_, Radoslaw Zarzynski) * rgw: keystone token cache does not work correctly (`issue#11125 `_, `pr#4414 `_, Yehuda Sadeh) * rgw: merge manifests correctly when there's prefix override (`issue#11622 `_, `pr#4697 `_, Yehuda Sadeh) * rgw: send appropriate op to cancel bucket index pending operation (`issue#10770 `_, `pr#3938 `_, Yehuda Sadeh) * rgw: shouldn't need to disable rgw_socket_path if frontend is configured (`issue#11160 `_, `pr#4275 `_, Yehuda Sadeh) * rgw: Swift API. Dump container's custom metadata. (`issue#10665 `_, `pr#3934 `_, Dmytro Iurchenko) * rgw: Swift API. Support for X-Remove-Container-Meta-{key} header. (`issue#10475 `_, `pr#3929 `_, Dmytro Iurchenko) * rgw: use correct objv_tracker for bucket instance (`issue#11416 `_, `pr#4379 `_, Yehuda Sadeh) * tests: force checkout of submodules (`issue#11157 `_, `pr#4079 `_, Loic Dachary) * tools: Backport ceph-objectstore-tool changes to firefly (`issue#12327 `_, `pr#3866 `_, David Zafman) * tools: ceph-objectstore-tool: Output only unsupported features when incomatible (`issue#11176 `_, `pr#4126 `_, David Zafman) * tools: ceph-objectstore-tool: Use exit status 11 for incompatible import attemp... (`issue#11139 `_, `pr#4129 `_, David Zafman) * tools: Fix do_autogen.sh so that -L is allowed (`issue#11303 `_, `pr#4247 `_, Alfredo Deza) v0.80.9 Firefly =============== This is a bugfix release for firefly. It fixes a performance regression in librbd, an important CRUSH misbehavior (see below), and several RGW bugs. We have also backported support for flock/fcntl locks to ceph-fuse and libcephfs. We recommend that all Firefly users upgrade. For more detailed information, see :download:`the complete changelog `. Adjusting CRUSH maps -------------------- * This point release fixes several issues with CRUSH that trigger excessive data migration when adjusting OSD weights. These are most obvious when a very small weight change (e.g., a change from 0 to .01) triggers a large amount of movement, but the same set of bugs can also lead to excessive (though less noticeable) movement in other cases. However, because the bug may already have affected your cluster, fixing it may trigger movement *back* to the more correct location. For this reason, you must manually opt-in to the fixed behavior. In order to set the new tunable to correct the behavior:: ceph osd crush set-tunable straw_calc_version 1 Note that this change will have no immediate effect. However, from this point forward, any 'straw' bucket in your CRUSH map that is adjusted will get non-buggy internal weights, and that transition may trigger some rebalancing. You can estimate how much rebalancing will eventually be necessary on your cluster with:: ceph osd getcrushmap -o /tmp/cm crushtool -i /tmp/cm --num-rep 3 --test --show-mappings > /tmp/a 2>&1 crushtool -i /tmp/cm --set-straw-calc-version 1 -o /tmp/cm2 crushtool -i /tmp/cm2 --reweight -o /tmp/cm2 crushtool -i /tmp/cm2 --num-rep 3 --test --show-mappings > /tmp/b 2>&1 wc -l /tmp/a # num total mappings diff -u /tmp/a /tmp/b | grep -c ^+ # num changed mappings Divide the number of changed lines by the total number of lines in /tmp/a. We've found that most clusters are under 10%. You can force all of this rebalancing to happen at once with:: ceph osd crush reweight-all Otherwise, it will happen at some unknown point in the future when CRUSH weights are next adjusted. Notable Changes --------------- * ceph-fuse: flock, fcntl lock support (Yan, Zheng, Greg Farnum) * crush: fix straw bucket weight calculation, add straw_calc_version tunable (#10095 Sage Weil) * crush: fix tree bucket (Rongzu Zhu) * crush: fix underflow of tree weights (Loic Dachary, Sage Weil) * crushtool: add --reweight (Sage Weil) * librbd: complete pending operations before losing image (#10299 Jason Dillaman) * librbd: fix read caching performance regression (#9854 Jason Dillaman) * librbd: gracefully handle deleted/renamed pools (#10270 Jason Dillaman) * mon: fix dump of chooseleaf_vary_r tunable (Sage Weil) * osd: fix PG ref leak in snaptrimmer on peering (#10421 Kefu Chai) * osd: handle no-op write with snapshot (#10262 Sage Weil) * radosgw-admin: create subuser when creating user (#10103 Yehuda Sadeh) * rgw: change multipart uplaod id magic (#10271 Georgio Dimitrakakis, Yehuda Sadeh) * rgw: don't overwrite bucket/object owner when setting ACLs (#10978 Yehuda Sadeh) * rgw: enable IPv6 for embedded civetweb (#10965 Yehuda Sadeh) * rgw: fix partial swift GET (#10553 Yehuda Sadeh) * rgw: fix quota disable (#9907 Dong Lei) * rgw: index swift keys appropriately (#10471 Hemant Burman, Yehuda Sadeh) * rgw: make setattrs update bucket index (#5595 Yehuda Sadeh) * rgw: pass civetweb configurables (#10907 Yehuda Sadeh) * rgw: remove swift user manifest (DLO) hash calculation (#9973 Yehuda Sadeh) * rgw: return correct len for 0-len objects (#9877 Yehuda Sadeh) * rgw: S3 object copy content-type fix (#9478 Yehuda Sadeh) * rgw: send ETag on S3 object copy (#9479 Yehuda Sadeh) * rgw: send HTTP status reason explicitly in fastcgi (Yehuda Sadeh) * rgw: set ulimit -n from sysvinit (el6) init script (#9587 Sage Weil) * rgw: update swift subuser permission masks when authenticating (#9918 Yehuda Sadeh) * rgw: URL decode query params correctly (#10271 Georgio Dimitrakakis, Yehuda Sadeh) * rgw: use attrs when reading object attrs (#10307 Yehuda Sadeh) * rgw: use \r\n for http headers (#9254 Benedikt Fraunhofer, Yehuda Sadeh) v0.80.8 Firefly =============== This is a long-awaited bugfix release for firefly. It has several imporant (but relatively rare) OSD peering fixes, performance issues when snapshots are trimmed, several RGW fixes, a paxos corner case fix, and some packaging updates. We recommend that all users for v0.80.x firefly upgrade when it is convenient to do so. For more detailed information, see :download:`the complete changelog `. Notable Changes --------------- * build: remove stack-execute bit from assembled code sections (#10114 Dan Mick) * ceph-disk: fix dmcrypt key permissions (#9785 Loic Dachary) * ceph-disk: fix keyring location (#9653 Loic Dachary) * ceph-disk: make partition checks more robust (#9721 #9665 Loic Dachary) * ceph: cleanly shut down librados context on shutdown (#8797 Dan Mick) * common: add $cctid config metavariable (#6228 Adam Crume) * crush: align rule and ruleset ids (#9675 Xiaoxi Chen) * crush: fix negative weight bug during create_or_move_item (#9998 Pawel Sadowski) * crush: fix potential buffer overflow in erasure rules (#9492 Johnu George) * debian: fix python-ceph -> ceph file movement (Sage Weil) * libcephfs,ceph-fuse: fix flush tid wraparound bug (#9869 Greg Farnum, Yan, Zheng) * libcephfs: close fd befure umount (#10415 Yan, Zheng) * librados: fix crash from C API when read timeout is enabled (#9582 Sage Weil) * librados: handle reply race with pool deletion (#10372 Sage Weil) * librbd: cap memory utilization for read requests (Jason Dillaman) * librbd: do not close a closed parent image on failure (#10030 Jason Dillaman) * librbd: fix diff tests (#10002 Josh Durgin) * librbd: protect list_children from invalid pools (#10123 Jason Dillaman) * make check improvemens (Loic Dachary) * mds: fix ctime updates (#9514 Greg Farnum) * mds: fix journal import tool (#10025 John Spray) * mds: fix rare NULL deref in cap flush handler (Greg Farnum) * mds: handle unknown lock messages (Yan, Zheng) * mds: store backtrace for straydir (Yan, Zheng) * mon: abort startup if disk is full (#9502 Joao Eduardo Luis) * mon: add paxos instrumentation (Sage Weil) * mon: fix double-free in rare OSD startup path (Sage Weil) * mon: fix osdmap trimming (#9987 Sage Weil) * mon: fix paxos corner cases (#9301 #9053 Sage Weil) * osd: cancel callback on blacklisted watchers (#8315 Samuel Just) * osd: cleanly abort set-alloc-hint operations during upgrade (#9419 David Zafman) * osd: clear rollback PG metadata on PG deletion (#9293 Samuel Just) * osd: do not abort deep scrub if hinfo is missing (#10018 Loic Dachary) * osd: erasure-code regression tests (Loic Dachary) * osd: fix distro metadata reporting for SUSE (#8654 Danny Al-Gaaf) * osd: fix full OSD checks during backfill (#9574 Samuel Just) * osd: fix ioprio parsing (#9677 Loic Dachary) * osd: fix journal direct-io shutdown (#9073 Mark Kirkwood, Ma Jianpeng, Somnath Roy) * osd: fix journal dump (Ma Jianpeng) * osd: fix occasional stall during peering or activation (Sage Weil) * osd: fix past_interval display bug (#9752 Loic Dachary) * osd: fix rare crash triggered by admin socket dump_ops_in_filght (#9916 Dong Lei) * osd: fix snap trimming performance issues (#9487 #9113 Samuel Just, Sage Weil, Dan van der Ster, Florian Haas) * osd: fix snapdir handling on cache eviction (#8629 Sage Weil) * osd: handle map gaps in map advance code (Sage Weil) * osd: handle undefined CRUSH results in interval check (#9718 Samuel Just) * osd: include shard in JSON dump of ghobject (#10063 Loic Dachary) * osd: make backfill reservation denial handling more robust (#9626 Samuel Just) * osd: make misdirected op checks handle EC + primary affinity (#9835 Samuel Just, Sage Weil) * osd: mount XFS with inode64 by default (Sage Weil) * osd: other misc bugs (#9821 #9875 Samuel Just) * rgw: add .log to default log path (#9353 Alexandre Marangone) * rgw: clean up fcgi request context (#10194 Yehuda Sadeh) * rgw: convet header underscores to dashes (#9206 Yehuda Sadeh) * rgw: copy object data if copy target is in different pool (#9039 Yehuda Sadeh) * rgw: don't try to authenticate CORS peflight request (#8718 Robert Hubbard, Yehuda Sadeh) * rgw: fix civetweb URL decoding (#8621 Yehuda Sadeh) * rgw: fix hash calculation during PUT (Yehuda Sadeh) * rgw: fix misc bugs (#9089 #9201 Yehuda Sadeh) * rgw: fix object tail test (#9226 Sylvain Munaut, Yehuda Sadeh) * rgw: make sysvinit script run rgw under systemd context as needed (#10125 Loic Dachary) * rgw: separate civetweb log from rgw log (Yehuda Sadeh) * rgw: set length for keystone token validations (#7796 Mark Kirkwood, Yehuda Sadeh) * rgw: subuser creation fixes (#8587 Yehuda Sadeh) * rpm: misc packaging improvements (Sandon Van Ness, Dan Mick, Erik Logthenberg, Boris Ranto) * rpm: use standard udev rules for CentOS7/RHEL7 (#9747 Loic Dachary) v0.80.7 Firefly =============== This release fixes a few critical issues with v0.80.6, particularly with clusters running mixed versions. We recommend that all v0.80.x Firefly users upgrade to this release. For more detailed information, see :download:`the complete changelog `. Notable Changes --------------- * osd: fix invalid memory reference in log trimming (#9731 Samuel Just) * osd: fix use-after-free in cache tiering code (#7588 Sage Weil) * osd: remove bad backfill assertion for mixed-version clusters (#9696 Samuel Just) v0.80.6 Firefly =============== This is a major bugfix release for firefly, fixing a range of issues in the OSD and monitor, particularly with cache tiering. There are also important fixes in librados, with the watch/notify mechanism used by librbd, and in radosgw. A few pieces of new functionality of been backported, including improved 'ceph df' output (view amount of writeable space per pool), support for non-default cluster names when using sysvinit or systemd, and improved (and fixed) support for dmcrypt. We recommend that all v0.80.x Firefly users upgrade to this release. For more detailed information, see :download:`the complete changelog `. Notable Changes --------------- * build: fix atomic64_t on i386 (#8969 Sage Weil) * build: fix build on alpha (Michael Cree, Dmitry Smirnov) * build: fix build on hppa (Dmitry Smirnov) * build: fix yasm detection on x32 arch (Sage Weil) * ceph-disk: fix 'list' function with dmcrypt (Sage Weil) * ceph-disk: fix dmcrypt support (Alfredo Deza) * ceph: allow non-default cluster to be specified (#8944) * common: fix dup log messages to mon (#9080 Sage Weil) * global: write pid file when -f is used (systemd, upstart) (Alexandre Oliva) * librados: fix crash when read timeout is enabled (#9362 Matthias Kiefer, Sage Weil) * librados: fix lock leaks in error paths (#9022 Pavan Rallabhandi) * librados: fix watch resend on PG acting set change (#9220 Samuel Just) * librados: python: fix aio_read handling with \0 (Mohammad Salehe) * librbd: add interface to invalidate cached data (Josh Durgin) * librbd: fix crash when using clone of flattened image (#8845 Josh Durgin) * librbd: fix error path cleanup on open (#8912 Josh Durgin) * librbd: fix null pointer check (Danny Al-Gaaf) * librbd: limit dirty object count (Haomai Wang) * mds: fix rstats for root and mdsdir (Yan, Zheng) * mon: add 'get' command for new cache tier pool properties (Joao Eduardo Luis) * mon: add 'osd pool get-quota' (#8523 Joao Eduardo Luis) * mon: add cluster fingerprint (Sage Weil) * mon: disallow nonsensical cache-mode transitions (#8155 Joao Eduardo Luis) * mon: fix cache tier rounding error on i386 (Sage Weil) * mon: fix occasional memory leak (#9176 Sage Weil) * mon: fix reported latency for 'osd perf' (#9269 Samuel Just) * mon: include 'max avail' in 'ceph df' output (Sage Weil, Xioaxi Chen) * mon: persistently mark pools where scrub may find incomplete clones (#8882 Sage Weil) * mon: preload erasure plugins (Loic Dachary) * mon: prevent cache-specific settings on non-tier pools (#8696 Joao Eduardo Luis) * mon: reduce log spam (Aanchal Agrawal, Sage Weil) * mon: warn when cache pools have no hit_sets enabled (Sage Weil) * msgr: fix trivial memory leak (Sage Weil) * osd: automatically scrub PGs with invalid stats (#8147 Sage Weil) * osd: avoid sharing PG metadata that is not durable (Samuel Just) * osd: cap hit_set size (#9339 Samuel Just) * osd: create default erasure profile if needed (#8601 Loic Dachary) * osd: dump tid as JSON int (not string) where appropriate (Joao Eduardo Luis) * osd: encode blacklist in deterministic order (#9211 Sage Weil) * osd: fix behavior when cache tier has no hit_sets enabled (#8982 Sage Weil) * osd: fix cache tier flushing of snapshots (#9054 Samuel Just) * osd: fix cache tier op ordering when going from full to non-full (#8931 Sage Weil) * osd: fix crash on dup recovery reservation (#8863 Sage Weil) * osd: fix division by zero when pg_num adjusted with no OSDs (#9052 Sage Weil) * osd: fix hint crash in experimental keyvaluestore_dev backend (Hoamai Wang) * osd: fix leak in copyfrom cancellation (#8894 Samuel Just) * osd: fix locking for copyfrom finish (#8889 Sage Weil) * osd: fix long filename handling in backend (#8701 Sage Weil) * osd: fix min_size check with backfill (#9497 Samuel Just) * osd: fix mount/remount sync race (#9144 Sage Weil) * osd: fix object listing + erasure code bug (Guang Yang) * osd: fix race on reconnect to failed OSD (#8944 Greg Farnum) * osd: fix recovery reservation deadlock (Samuel Just) * osd: fix tiering agent arithmetic for negative values (#9082 Karan Singh) * osd: improve shutdown order (#9218 Sage Weil) * osd: improve subop discard logic (#9259 Samuel Just) * osd: introduce optional sleep, io priority for scrub and snap trim (Sage Weil) * osd: make scrub check for and remove stale erasure-coded objects (Samuel Just) * osd: misc fixes (#9481 #9482 #9179 Sameul Just) * osd: mix keyvaluestore_dev improvements (Haomai Wang) * osd: only require CRUSH features for rules that are used (#8963 Sage Weil) * osd: preload erasure plugins on startup (Loic Dachary) * osd: prevent PGs from falling behind when consuming OSDMaps (#7576 Sage Weil) * osd: prevent old clients from using tiered pools (#8714 Sage Weil) * osd: set min_size on erasure pools to data chunk count (Sage Weil) * osd: trim old erasure-coded objects more aggressively (Samuel Just) * rados: enforce erasure code alignment (Lluis Pamies-Juarez) * rgw: align object stripes with erasure pool alignment (#8442 Yehuda Sadeh) * rgw: don't send error body on HEAD for civetweb (#8539 Yehuda Sadeh) * rgw: fix crash in CORS preflight request (Yehuda Sadeh) * rgw: fix decoding of + in URL (#8702 Brian Rak) * rgw: fix object removal on object create (#8972 Patrycja Szabowska, Yehuda Sadeh) * systemd: use systemd-run when starting radosgw (JuanJose Galvez) * sysvinit: support non-default cluster name (Alfredo Deza) v0.80.5 Firefly =============== This release fixes a few important bugs in the radosgw and fixes several packaging and environment issues, including OSD log rotation, systemd environments, and daemon restarts on upgrade. We recommend that all v0.80.x Firefly users upgrade, particularly if they are using upstart, systemd, or radosgw. Notable Changes --------------- * ceph-dencoder: do not needlessly link to librgw, librados, etc. (Sage Weil) * do not needlessly link binaries to leveldb (Sage Weil) * mon: fix mon crash when no auth keys are present (#8851, Joao Eduardo Luis) * osd: fix cleanup (and avoid occasional crash) during shutdown (#7981, Sage Weil) * osd: fix log rotation under upstart (Sage Weil) * rgw: fix multipart upload when object has irregular size (#8846, Yehuda Sadeh, Sylvain Munaut) * rgw: improve bucket listing S3 compatibility (#8858, Yehuda Sadeh) * rgw: improve delimited bucket listing (Yehuda Sadeh) * rpm: do not restart daemons on upgrade (#8849, Alfredo Deza) For more detailed information, see :download:`the complete changelog `. v0.80.4 Firefly =============== This Firefly point release fixes an potential data corruption problem when ceph-osd daemons run on top of XFS and service Firefly librbd clients. A recently added allocation hint that RBD utilizes triggers an XFS bug on some kernels (Linux 3.2, and likely others) that leads to data corruption and deep-scrub errors (and inconsistent PGs). This release avoids the situation by disabling the allocation hint until we can validate which kernels are affected and/or are known to be safe to use the hint on. We recommend that all v0.80.x Firefly users urgently upgrade, especially if they are using RBD. Notable Changes --------------- * osd: disable XFS extsize hint by default (#8830, Samuel Just) * rgw: fix extra data pool default name (Yehuda Sadeh) For more detailed information, see :download:`the complete changelog `. v0.80.3 Firefly =============== This is the third Firefly point release. It includes a single fix for a radosgw regression that was discovered in v0.80.2 right after it was released. We recommand that all v0.80.x Firefly users upgrade. Notable Changes --------------- * radosgw: fix regression in manifest decoding (#8804, Sage Weil) For more detailed information, see :download:`the complete changelog `. v0.80.2 Firefly =============== This is the second Firefly point release. It contains a range of important fixes, including several bugs in the OSD cache tiering, some compatibility checks that affect upgrade situations, several radosgw bugs, and an irritating and unnecessary feature bit check that prevents older clients from communicating with a cluster with any erasure coded pools. One someone large change in this point release is that the ceph RPM package is separated into a ceph and ceph-common package, similar to Debian. The ceph-common package contains just the client libraries without any of the server-side daemons. We recommend that all v0.80.x Firefly users skip this release and use v0.80.3. Notable Changes --------------- * ceph-disk: better debug logging (Alfredo Deza) * ceph-disk: fix preparation of OSDs with dmcrypt (#6700, Stephen F Taylor) * ceph-disk: partprobe on prepare to fix dm-crypt (#6966, Eric Eastman) * do not require ERASURE_CODE feature from clients (#8556, Sage Weil) * libcephfs-java: build with older JNI headers (Greg Farnum) * libcephfs-java: fix build with gcj-jdk (Dmitry Smirnov) * librados: fix osd op tid for redirected ops (#7588, Samuel Just) * librados: fix rados_pool_list buffer bounds checks (#8447, Sage Weil) * librados: resend ops when pool overlay changes (#8305, Sage Weil) * librbd, ceph-fuse: reduce CPU overhead for clean object check in cache (Haomai Wang) * mon: allow deletion of cephfs pools (John Spray) * mon: fix default pool ruleset choice (#8373, John Spray) * mon: fix health summary for mon low disk warning (Sage Weil) * mon: fix 'osd pool set cache_target_full_ratio' (Geoffrey Hartz) * mon: fix quorum feature check (Greg Farnum) * mon: fix request forwarding in mixed firefly+dumpling clusters 9#8727, Joao Eduardo Luis) * mon: fix rule vs ruleset check in 'osd pool set ... crush_ruleset' command (John Spray) * mon: make osd 'down' count accurate (Sage Weil) * mon: set 'next commit' in primary-affinity reply (Ilya Dryomov) * mon: verify CRUSH features are supported by all mons (#8738, Greg Farnum) * msgr: fix sequence negotiation during connection reset (Guang Yang) * osd: block scrub on blocked objects (#8011, Samuel Just) * osd: call XFS hint ioctl less often (#8241, Ilya Dryomov) * osd: copy xattr spill out marker on clone (Haomai Wang) * osd: fix flush of snapped objects (#8334, Samuel Just) * osd: fix hashindex restart of merge operation (#8332, Samuel Just) * osd: fix osdmap subscription bug causing startup hang (Greg Farnum) * osd: fix potential null deref (#8328, Sage Weil) * osd: fix shutdown race (#8319, Sage Weil) * osd: handle 'none' in CRUSH results properly during peering (#8507, Samuel Just) * osd: set no spill out marker on new objects (Greg Farnum) * osd: skip op ordering debug checks on tiered pools (#8380, Sage Weil) * rados: enforce 'put' alignment (Lluis Pamies-Juarez) * rest-api: fix for 'rx' commands (Ailing Zhang) * rgw: calc user manifest etag and fix check (#8169, #8436, Yehuda Sadeh) * rgw: fetch attrs on multipart completion (#8452, Yehuda Sadeh, Sylvain Munaut) * rgw: fix buffer overflow for long instance ids (#8608, Yehuda Sadeh) * rgw: fix entity permission check on metadata put (#8428, Yehuda Sadeh) * rgw: fix multipart retry race (#8269, Yehuda Sadeh) * rpm: split ceph into ceph and ceph-common RPMs (Sandon Van Ness, Dan Mick) * sysvinit: continue startin daemons after failure doing mount (#8554, Sage Weil) For more detailed information, see :download:`the complete changelog `. v0.80.1 Firefly =============== This first Firefly point release fixes a few bugs, the most visible being a problem that prevents scrub from completing in some cases. Notable Changes --------------- * osd: revert incomplete scrub fix (Samuel Just) * rgw: fix stripe calculation for manifest objects (Yehuda Sadeh) * rgw: improve handling, memory usage for abort reads (Yehuda Sadeh) * rgw: send Swift user manifest HTTP header (Yehuda Sadeh) * libcephfs, ceph-fuse: expose MDS session state via admin socket (Yan, Zheng) * osd: add simple throttle for snap trimming (Sage Weil) * monclient: fix possible hang from ill-timed monitor connection failure (Sage Weil) * osd: fix trimming of past HitSets (Sage Weil) * osd: fix whiteouts for non-writeback cache modes (Sage Weil) * osd: prevent divide by zero in tiering agent (David Zafman) * osd: prevent busy loop when tiering agent can do no work (David Zafman) For more detailed information, see :download:`the complete changelog `. v0.80 Firefly ============= This release will form the basis for our long-term supported release Firefly, v0.80.x. The big new features are support for erasure coding and cache tiering, although a broad range of other features, fixes, and improvements have been made across the code base. Highlights include: * *Erasure coding*: support for a broad range of erasure codes for lower storage overhead and better data durability. * *Cache tiering*: support for creating 'cache pools' that store hot, recently accessed objects with automatic demotion of colder data to a base tier. Typically the cache pool is backed by faster storage devices like SSDs. * *Primary affinity*: Ceph now has the ability to skew selection of OSDs as the "primary" copy, which allows the read workload to be cheaply skewed away from parts of the cluster without migrating any data. * *Key/value OSD backend* (experimental): An alternative storage backend for Ceph OSD processes that puts all data in a key/value database like leveldb. This provides better performance for workloads dominated by key/value operations (like radosgw bucket indices). * *Standalone radosgw* (experimental): The radosgw process can now run in a standalone mode without an apache (or similar) web server or fastcgi. This simplifies deployment and can improve performance. We expect to maintain a series of stable releases based on v0.80 Firefly for as much as a year. In the meantime, development of Ceph continues with the next release, Giant, which will feature work on the CephFS distributed file system, more alternative storage backends (like RocksDB and f2fs), RDMA support, support for pyramid erasure codes, and additional functionality in the block device (RBD) like copy-on-read and multisite mirroring. Upgrade Sequencing ------------------ * If your existing cluster is running a version older than v0.67 Dumpling, please first upgrade to the latest Dumpling release before upgrading to v0.80 Firefly. Please refer to the `Dumpling upgrade`_ documentation. * We recommand adding the following to the [mon] section of your ceph.conf prior to upgrade:: mon warn on legacy crush tunables = false This will prevent health warnings due to the use of legacy CRUSH placement. Although it is possible to rebalance existing data across your cluster (see the upgrade notes below), we do not normally recommend it for production environments as a large amount of data will move and there is a significant performance impact from the rebalancing. * Upgrade daemons in the following order: #. Monitors #. OSDs #. MDSs and/or radosgw If the ceph-mds daemon is restarted first, it will wait until all OSDs have been upgraded before finishing its startup sequence. If the ceph-mon daemons are not restarted prior to the ceph-osd daemons, they will not correctly register their new capabilities with the cluster and new features may not be usable until they are restarted a second time. * Upgrade radosgw daemons together. There is a subtle change in behavior for multipart uploads that prevents a multipart request that was initiated with a new radosgw from being completed by an old radosgw. Upgrading from v0.79 -------------------- * OSDMap's json-formatted dump changed for keys 'full' and 'nearfull'. What was previously being outputted as 'true' or 'false' strings are now being outputted 'true' and 'false' booleans according to json syntax. * HEALTH_WARN on 'mon osd down out interval == 0'. Having this option set to zero on the leader acts much like having the 'noout' flag set. This warning will only be reported if the monitor getting the 'health' or 'status' request has this option set to zero. * Monitor 'auth' commands now require the mon 'x' capability. This matches dumpling v0.67.x and earlier, but differs from emperor v0.72.x. * A librados WATCH operation on a non-existent object now returns ENOENT; previously it did not. * Librados interface change: As there are no partial writes, the rados_write() and rados_append() operations now return 0 on success like rados_write_full() always has. This includes the C++ interface equivalents and AIO return values for the aio variants. * The radosgw init script (sysvinit) how requires that the 'host = ...' line in ceph.conf, if present, match the short hostname (the output of 'hostname -s'), not the fully qualified hostname or the (occasionally non-short) output of 'hostname'. Failure to adjust this when upgrading from emperor or dumpling may prevent the radosgw daemon from starting. Upgrading from v0.72 Emperor ---------------------------- * See notes above. * The 'ceph -s' or 'ceph status' command's 'num_in_osds' field in the JSON and XML output has been changed from a string to an int. * The recently added 'ceph mds set allow_new_snaps' command's syntax has changed slightly; it is now 'ceph mds set allow_new_snaps true'. The 'unset' command has been removed; instead, set the value to 'false'. * The syntax for allowing snapshots is now 'mds set allow_new_snaps ' instead of 'mds allow_new_snaps'. * 'rbd ls' on a pool which never held rbd images now exits with code 0. It outputs nothing in plain format, or an empty list in non-plain format. This is consistent with the behavior for a pool which used to hold images, but contains none. Scripts relying on this behavior should be updated. * The MDS requires a new OSD operation TMAP2OMAP, added in this release. When upgrading, be sure to upgrade and restart the ceph-osd daemons before the ceph-mds daemon. The MDS will refuse to start if any up OSDs do not support the new feature. * The 'ceph mds set_max_mds N' command is now deprecated in favor of 'ceph mds set max_mds N'. * The 'osd pool create ...' syntax has changed for erasure pools. * The default CRUSH rules and layouts are now using the 'bobtail' tunables and defaults. Upgaded clusters using the old values will now present with a health WARN state. This can be disabled by adding 'mon warn on legacy crush tunables = false' to ceph.conf and restarting the monitors. Alternatively, you can switch to the new tunables with 'ceph osd crush tunables firefly,' but keep in mind that this will involve moving a *significant* portion of the data already stored in the cluster and in a large cluster may take several days to complete. We do not recommend adjusting tunables on a production cluster. * We now default to the 'bobtail' CRUSH tunable values that are first supported by Ceph clients in bobtail (v0.56) and Linux kernel version v3.9. If you plan to access a newly created Ceph cluster with an older kernel client, you should use 'ceph osd crush tunables legacy' to switch back to the legacy behavior. Note that making that change will likely result in some data movement in the system, so adjust the setting before populating the new cluster with data. * We now set the HASHPSPOOL flag on newly created pools (and new clusters) by default. Support for this flag first appeared in v0.64; v0.67 Dumpling is the first major release that supports it. It is first supported by the Linux kernel version v3.9. If you plan to access a newly created Ceph cluster with an older kernel or clients (e.g, librados, librbd) from a pre-dumpling Ceph release, you should add 'osd pool default flag hashpspool = false' to the '[global]' section of your 'ceph.conf' prior to creating your monitors (e.g., after 'ceph-deploy new' but before 'ceph-deploy mon create ...'). * The configuration option 'osd pool default crush rule' is deprecated and replaced with 'osd pool default crush replicated ruleset'. 'osd pool default crush rule' takes precedence for backward compatibility and a deprecation warning is displayed when it is used. * As part of fix for #6796, 'ceph osd pool set ' now receives as an integer instead of a string. This affects how 'hashpspool' flag is set/unset: instead of 'true' or 'false', it now must be '0' or '1'. * The behavior of the CRUSH 'indep' choose mode has been changed. No ceph cluster should have been using this behavior unless someone has manually extracted a crush map, modified a CRUSH rule to replace 'firstn' with 'indep', recompiled, and reinjected the new map into the cluster. If the 'indep' mode is currently in use on a cluster, the rule should be modified to use 'firstn' instead, and the administrator should wait until any data movement completes before upgrading. * The 'osd dump' command now dumps pool snaps as an array instead of an object. Upgrading from v0.67 Dumpling ----------------------------- * See notes above. * ceph-fuse and radosgw now use the same default values for the admin socket and log file paths that the other daemons (ceph-osd, ceph-mon, etc.) do. If you run these daemons as non-root, you may need to adjust your ceph.conf to disable these options or to adjust the permissions on /var/run/ceph and /var/log/ceph. * The MDS now disallows snapshots by default as they are not considered stable. The command 'ceph mds set allow_snaps' will enable them. * For clusters that were created before v0.44 (pre-argonaut, Spring 2012) and store radosgw data, the auto-upgrade from TMAP to OMAP objects has been disabled. Before upgrading, make sure that any buckets created on pre-argonaut releases have been modified (e.g., by PUTing and then DELETEing an object from each bucket). Any cluster created with argonaut (v0.48) or a later release or not using radosgw never relied on the automatic conversion and is not affected by this change. * Any direct users of the 'tmap' portion of the librados API should be aware that the automatic tmap -> omap conversion functionality has been removed. * Most output that used K or KB (e.g., for kilobyte) now uses a lower-case k to match the official SI convention. Any scripts that parse output and check for an upper-case K will need to be modified. * librados::Rados::pool_create_async() and librados::Rados::pool_delete_async() don't drop a reference to the completion object on error, caller needs to take care of that. This has never really worked correctly and we were leaking an object * 'ceph osd crush set ' no longer adds the osd to the specified location, as that's a job for 'ceph osd crush add'. It will however continue to work just the same as long as the osd already exists in the crush map. * The OSD now enforces that class write methods cannot both mutate an object and return data. The rbd.assign_bid method, the lone offender, has been removed. This breaks compatibility with pre-bobtail librbd clients by preventing them from creating new images. * librados now returns on commit instead of ack for synchronous calls. This is a bit safer in the case where both OSDs and the client crash, and is probably how it should have been acting from the beginning. Users are unlikely to notice but it could result in lower performance in some circumstances. Those who care should switch to using the async interfaces, which let you specify safety semantics precisely. * The C++ librados AioComplete::get_version() method was incorrectly returning an int (usually 32-bits). To avoid breaking library compatibility, a get_version64() method is added that returns the full-width value. The old method is deprecated and will be removed in a future release. Users of the C++ librados API that make use of the get_version() method should modify their code to avoid getting a value that is truncated from 64 to to 32 bits. Notable changes since v0.79 --------------------------- * ceph-fuse, libcephfs: fix several caching bugs (Yan, Zheng) * ceph-fuse: trim inodes in response to mds memory pressure (Yan, Zheng) * librados: fix inconsistencies in API error values (David Zafman) * librados: fix watch operations with cache pools (Sage Weil) * librados: new snap rollback operation (David Zafman) * mds: fix respawn (John Spray) * mds: misc bugs (Yan, Zheng) * mds: misc multi-mds fixes (Yan, Zheng) * mds: use shared_ptr for requests (Greg Farnum) * mon: fix peer feature checks (Sage Weil) * mon: require 'x' mon caps for auth operations (Joao Luis) * mon: shutdown when removed from mon cluster (Joao Luis) * msgr: fix locking bug in authentication (Josh Durgin) * osd: fix bug in journal replay/restart (Sage Weil) * osd: many many many bug fixes with cache tiering (Samuel Just) * osd: track omap and hit_set objects in pg stats (Samuel Just) * osd: warn if agent cannot enable due to invalid (post-split) stats (Sage Weil) * rados bench: track metadata for multiple runs separately (Guang Yang) * rgw: fixed subuser modify (Yehuda Sadeh) * rpm: fix redhat-lsb dependency (Sage Weil, Alfredo Deza) Notable changes since v0.72 Emperor ----------------------------------- * buffer: some zero-copy groundwork (Josh Durgin) * build: misc improvements (Ken Dreyer) * ceph-conf: stop creating bogus log files (Josh Durgin, Sage Weil) * ceph-crush-location: new hook for setting CRUSH location of osd daemons on start) * ceph-disk: avoid fd0 (Loic Dachary) * ceph-disk: generalize path names, add tests (Loic Dachary) * ceph-disk: misc improvements for puppet (Loic Dachary) * ceph-disk: several bug fixes (Loic Dachary) * ceph-fuse: fix race for sync reads (Sage Weil) * ceph-fuse, libcephfs: fix several caching bugs (Yan, Zheng) * ceph-fuse: trim inodes in response to mds memory pressure (Yan, Zheng) * ceph-kvstore-tool: expanded command set and capabilities (Joao Eduardo Luis) * ceph.spec: fix build dependency (Loic Dachary) * common: bloom filter improvements (Sage Weil) * common: check preexisting admin socket for active daemon before removing (Loic Dachary) * common: fix aligned buffer allocation (Loic Dachary) * common: fix authentication on big-endian architectures (Dan Mick) * common: fix config variable substitution (Loic Dachary) * common: portability changes to support libc++ (Noah Watkins) * common: switch to unordered_map from hash_map (Noah Watkins) * config: recursive metavariable expansion (Loic Dachary) * crush: default to bobtail tunables (Sage Weil) * crush: fix off-by-one error in recent refactor (Sage Weil) * crush: many additional tests (Loic Dachary) * crush: misc fixes, cleanups (Loic Dachary) * crush: new rule steps to adjust retry attempts (Sage Weil) * crush, osd: s/rep/replicated/ for less confusion (Loic Dachary) * crush: refactor descend_once behavior; support set_choose*_tries for replicated rules (Sage Weil) * crush: usability and test improvements (Loic Dachary) * debian: change directory ownership between ceph and ceph-common (Sage Weil) * debian: integrate misc fixes from downstream packaging (James Page) * doc: big update to install docs (John Wilkins) * doc: many many install doc improvements (John Wilkins) * doc: many many updates (John Wilkins) * doc: misc fixes (David Moreau Simard, Kun Huang) * erasure-code: improve buffer alignment (Loic Dachary) * erasure-code: rewrite region-xor using vector operations (Andreas Peters) * init: fix startup ordering/timeout problem with OSDs (Dmitry Smirnov) * libcephfs: fix resource leak (Zheng Yan) * librados: add C API coverage for atomic write operations (Christian Marie) * librados: fix inconsistencies in API error values (David Zafman) * librados: fix throttle leak (and eventual deadlock) (Josh Durgin) * librados: fix watch operations with cache pools (Sage Weil) * librados: new snap rollback operation (David Zafman) * librados, osd: new TMAP2OMAP operation (Yan, Zheng) * librados: read directly into user buffer (Rutger ter Borg) * librbd: fix use-after-free aio completion bug #5426 (Josh Durgin) * librbd: localize/distribute parent reads (Sage Weil) * librbd: skip zeroes/holes when copying sparse images (Josh Durgin) * mailmap: affiliation updates (Loic Dachary) * mailmap updates (Loic Dachary) * many portability improvements (Noah Watkins) * many unit test improvements (Loic Dachary) * mds: always store backtrace in default pool (Yan, Zheng) * mds: cope with MDS failure during creation (John Spray) * mds: fix cap migration behavior (Yan, Zheng) * mds: fix client session flushing (Yan, Zheng) * mds: fix crash from client sleep/resume (Zheng Yan) * mds: fix many many multi-mds bugs (Yan, Zheng) * mds: fix readdir end check (Zheng Yan) * mds: fix Resetter locking (Alexandre Oliva) * mds: fix respawn (John Spray) * mds: inline data support (Li Wang, Yunchuan Wen) * mds: misc bugs (Yan, Zheng) * mds: misc fixes for directory fragments (Zheng Yan) * mds: misc fixes for larger directories (Zheng Yan) * mds: misc fixes for multiple MDSs (Zheng Yan) * mds: misc multi-mds fixes (Yan, Zheng) * mds: remove .ceph directory (John Spray) * mds: store directories in omap instead of tmap (Yan, Zheng) * mds: update old-format backtraces opportunistically (Zheng Yan) * mds: use shared_ptr for requests (Greg Farnum) * misc cleanups from coverity (Xing Lin) * misc coverity fixes, cleanups (Danny Al-Gaaf) * misc coverity fixes (Xing Lin, Li Wang, Danny Al-Gaaf) * misc portability fixes (Noah Watkins, Alan Somers) * misc portability fixes (Noah Watkins, Christophe Courtaut, Alan Somers, huanjun) * misc portability work (Noah Watkins) * mon: add erasure profiles and improve erasure pool creation (Loic Dachary) * mon: add 'mon getmap EPOCH' (Joao Eduardo Luis) * mon: allow adjustment of cephfs max file size via 'ceph mds set max_file_size' (Sage Weil) * mon: allow debug quorum_{enter,exit} commands via admin socket * mon: 'ceph osd pg-temp ...' and primary-temp commands (Ilya Dryomov) * mon: change mds allow_new_snaps syntax to be more consistent (Sage Weil) * mon: clean up initial crush rule creation (Loic Dachary) * mon: collect misc metadata about osd (os, kernel, etc.), new 'osd metadata' command (Sage Weil) * mon: do not create erasure rules by default (Sage Weil) * mon: do not generate spurious MDSMaps in certain cases (Sage Weil) * mon: do not use keyring if auth = none (Loic Dachary) * mon: fix peer feature checks (Sage Weil) * mon: fix pg_temp leaks (Joao Eduardo Luis) * mon: fix pool count in 'ceph -s' output (Sage Weil) * mon: handle more whitespace (newline, tab) in mon capabilities (Sage Weil) * mon: improve (replicate or erasure) pool creation UX (Loic Dachary) * mon: infrastructure to handle mixed-version mon cluster and cli/rest API (Greg Farnum) * mon: MForward tests (Loic Dachary) * mon: mkfs now idempotent (Loic Dachary) * mon: only seed new osdmaps to current OSDs (Sage Weil) * mon, osd: create erasure style crush rules (Loic Dachary, Sage Weil) * mon: 'osd crush show-tunables' (Sage Weil) * mon: 'osd dump' dumps pool snaps as array, not object (Dan Mick) * mon, osd: new 'erasure' pool type (still not fully supported) * mon: persist quorum features to disk (Greg Farnum) * mon: prevent extreme changes in pool pg_num (Greg Farnum) * mon: require 'x' mon caps for auth operations (Joao Luis) * mon: shutdown when removed from mon cluster (Joao Luis) * mon: take 'osd pool set ...' value as an int, not string (Joao Eduardo Luis) * mon: track osd features in OSDMap (Joao Luis, David Zafman) * mon: trim MDSMaps (Joao Eduardo Luis) * mon: warn if crush has non-optimal tunables (Sage Weil) * mount.ceph: add -n for autofs support (Steve Stock) * msgr: fix locking bug in authentication (Josh Durgin) * msgr: fix messenger restart race (Xihui He) * msgr: improve connection error detection between clients and monitors (Greg Farnum, Sage Weil) * osd: add/fix CPU feature detection for jerasure (Loic Dachary) * osd: add HitSet tracking for read ops (Sage Weil, Greg Farnum) * osd: avoid touching leveldb for some xattrs (Haomai Wang, Sage Weil) * osd: backfill to multiple targets (David Zafman) * osd: backfill to osds not in acting set (David Zafman) * osd: cache pool support for snapshots (Sage Weil) * osd: client IO path changes for EC (Samuel Just) * osd: default to 3x replication * osd: do not include backfill targets in acting set (David Zafman) * osd: enable new hashpspool layout by default (Sage Weil) * osd: erasure plugin benchmarking tool (Loic Dachary) * osd: fix and cleanup misc backfill issues (David Zafman) * osd: fix bug in journal replay/restart (Sage Weil) * osd: fix copy-get omap bug (Sage Weil) * osd: fix linux kernel version detection (Ilya Dryomov) * osd: fix memstore segv (Haomai Wang) * osd: fix object_info_t encoding bug from emperor (Sam Just) * osd: fix omap_clear operation to not zap xattrs (Sam Just, Yan, Zheng) * osd: fix several bugs with tier infrastructure * osd: fix throttle thread (Haomai Wang) * osd: fix XFS detection (Greg Farnum, Sushma Gurram) * osd: generalize scrubbing infrastructure to allow EC (David Zafman) * osd: handle more whitespace (newline, tab) in osd capabilities (Sage Weil) * osd: ignore num_objects_dirty on scrub for old pools (Sage Weil) * osd: improved scrub checks on clones (Sage Weil, Sam Just) * osd: improve locking in fd lookup cache (Samuel Just, Greg Farnum) * osd: include more info in pg query result (Sage Weil) * osd, librados: fix full cluster handling (Josh Durgin) * osd: many erasure fixes (Sam Just) * osd: many many many bug fixes with cache tiering (Samuel Just) * osd: move to jerasure2 library (Loic Dachary) * osd: new 'chassis' type in default crush hierarchy (Sage Weil) * osd: new keyvaluestore-dev backend based on leveldb (Haomai Wang) * osd: new OSDMap encoding (Greg Farnum) * osd: new tests for erasure pools (David Zafman) * osd: preliminary cache pool support (no snaps) (Greg Farnum, Sage Weil) * osd: reduce scrub lock contention (Guang Yang) * osd: requery unfound on stray notify (#6909) (Samuel Just) * osd: some PGBackend infrastructure (Samuel Just) * osd: support for new 'memstore' (memory-backed) backend (Sage Weil) * osd: track erasure compatibility (David Zafman) * osd: track omap and hit_set objects in pg stats (Samuel Just) * osd: warn if agent cannot enable due to invalid (post-split) stats (Sage Weil) * rados: add 'crush location', smart replica selection/balancing (Sage Weil) * rados bench: track metadata for multiple runs separately (Guang Yang) * rados: some performance optimizations (Yehuda Sadeh) * rados tool: fix listomapvals (Josh Durgin) * rbd: add 'rbdmap' init script for mapping rbd images on book (Adam Twardowski) * rbd: add rbdmap support for upstart (Laurent Barbe) * rbd: expose kernel rbd client options via 'rbd map' (Ilya Dryomov) * rbd: fix bench-write command (Hoamai Wang) * rbd: make 'rbd list' return empty list and success on empty pool (Josh Durgin) * rbd: prevent deletion of images with watchers (Ilya Dryomov) * rbd: support for 4096 mapped devices, up from ~250 (Ilya Dryomov) * rest-api: do not fail when no OSDs yet exist (Dan Mick) * rgw: add 'status' command to sysvinit script (David Moreau Simard) * rgw: allow multiple frontends (Yehuda Sadeh) * rgw: allow use of an erasure data pool (Yehuda Sadeh) * rgw: convert bucket info to new format on demand (Yehuda Sadeh) * rgw: fixed subuser modify (Yehuda Sadeh) * rgw: fix error setting empty owner on ACLs (Yehuda Sadeh) * rgw: fix fastcgi deadlock (do not return data from librados callback) (Yehuda Sadeh) * rgw: fix many-part multipart uploads (Yehuda Sadeh) * rgw: fix misc CORS bugs (Robin H. Johnson) * rgw: fix object placement read op (Yehuda Sadeh) * rgw: fix reading bucket policy (#6940) * rgw: fix read_user_buckets 'max' behavior (Yehuda Sadeh) * rgw: fix several CORS bugs (Robin H. Johnson) * rgw: fix use-after-free when releasing completion handle (Yehuda Sadeh) * rgw: improve swift temp URL support (Yehuda Sadeh) * rgw: make multi-object delete idempotent (Yehuda Sadeh) * rgw: optionally defer to bucket ACLs instead of object ACLs (Liam Monahan) * rgw: prototype mongoose frontend (Yehuda Sadeh) * rgw: several doc fixes (Alexandre Marangone) * rgw: support for password (instead of admin token) for keystone authentication (Christophe Courtaut) * rgw: switch from mongoose to civetweb (Yehuda Sadeh) * rgw: user quotas (Yehuda Sadeh) * rpm: fix redhat-lsb dependency (Sage Weil, Alfredo Deza) * specfile: fix RPM build on RHEL6 (Ken Dreyer, Derek Yarnell) * specfile: ship libdir/ceph (Key Dreyer) * sysvinit, upstart: prevent both init systems from starting the same daemons (Josh Durgin) Notable changes since v0.67 Dumpling ------------------------------------ * build cleanly under clang (Christophe Courtaut) * build: Makefile refactor (Roald J. van Loon) * build: fix [/usr]/sbin locations (Alan Somers) * ceph-disk: fix journal preallocation * ceph-fuse, radosgw: enable admin socket and logging by default * ceph-fuse: fix problem with readahead vs truncate race (Yan, Zheng) * ceph-fuse: trim deleted inodes from cache (Yan, Zheng) * ceph-fuse: use newer fuse api (Jianpeng Ma) * ceph-kvstore-tool: new tool for working with leveldb (copy, crc) (Joao Luis) * ceph-post-file: new command to easily share logs or other files with ceph devs * ceph: improve parsing of CEPH_ARGS (Benoit Knecht) * ceph: make -h behave when monitors are down * ceph: parse CEPH_ARGS env variable * common: bloom_filter improvements, cleanups * common: cache crc32c values where possible * common: correct SI is kB not KB (Dan Mick) * common: fix looping on BSD (Alan Somers) * common: migrate SharedPtrRegistry to use boost::shared_ptr<> (Loic Dachary) * common: misc portability fixes (Noah Watkins) * crc32c: fix optimized crc32c code (it now detects arch support properly) * crc32c: improved intel-optimized crc32c support (~8x faster on my laptop!) * crush: fix name caching * doc: erasure coding design notes (Loic Dachary) * hadoop: removed old version of shim to avoid confusing users (Noah Watkins) * librados, mon: ability to query/ping out-of-quorum monitor status (Joao Luis) * librados: fix async aio completion wakeup * librados: fix installed header #includes (Dan Mick) * librados: get_version64() method for C++ API * librados: hello_world example (Greg Farnum) * librados: sync calls now return on commit (instead of ack) (Greg Farnum) * librbd python bindings: fix parent image name limit (Josh Durgin) * librbd, ceph-fuse: avoid some sources of ceph-fuse, rbd cache stalls * mds: avoid leaking objects when deleting truncated files (Yan, Zheng) * mds: fix F_GETLK (Yan, Zheng) * mds: fix LOOKUPSNAP bug * mds: fix heap profiler commands (Joao Luis) * mds: fix locking deadlock (David Disseldorp) * mds: fix many bugs with stray (unlinked) inodes (Yan, Zheng) * mds: fix many directory fragmentation bugs (Yan, Zheng) * mds: fix mds rejoin with legacy parent backpointer xattrs (Alexandre Oliva) * mds: fix rare restart/failure race during fs creation * mds: fix standby-replay when we fall behind (Yan, Zheng) * mds: fix stray directory purging (Yan, Zheng) * mds: notify clients about deleted files (so they can release from their cache) (Yan, Zheng) * mds: several bug fixes with clustered mds (Yan, Zheng) * mon, osd: improve osdmap trimming logic (Samuel Just) * mon, osd: initial CLI for configuring tiering * mon: a few 'ceph mon add' races fixed (command is now idempotent) (Joao Luis) * mon: allow (un)setting HASHPSPOOL flag on existing pools (Joao Luis) * mon: allow cap strings with . to be unquoted * mon: allow logging level of cluster log (/var/log/ceph/ceph.log) to be adjusted * mon: avoid rewriting full osdmaps on restart (Joao Luis) * mon: continue to discover peer addr info during election phase * mon: disallow CephFS snapshots until 'ceph mds set allow_new_snaps' (Greg Farnum) * mon: do not expose uncommitted state from 'osd crush {add,set} ...' (Joao Luis) * mon: fix 'ceph osd crush reweight ...' (Joao Luis) * mon: fix 'osd crush move ...' command for buckets (Joao Luis) * mon: fix byte counts (off by factor of 4) (Dan Mick, Joao Luis) * mon: fix paxos corner case * mon: kv properties for pools to support EC (Loic Dachary) * mon: make 'osd pool rename' idempotent (Joao Luis) * mon: modify 'auth add' semantics to make a bit more sense (Joao Luis) * mon: new 'osd perf' command to dump recent performance information (Samuel Just) * mon: new and improved 'ceph -s' or 'ceph status' command (more info, easier to read) * mon: some auth check cleanups (Joao Luis) * mon: track per-pool stats (Joao Luis) * mon: warn about pools with bad pg_num * mon: warn when mon data stores grow very large (Joao Luis) * monc: fix small memory leak * new wireshark patches pulled into the tree (Kevin Jones) * objecter, librados: redirect requests based on cache tier config * objecter: fix possible hang when cluster is unpaused (Josh Durgin) * osd, librados: add new COPY_FROM rados operation * osd, librados: add new COPY_GET rados operations (used by COPY_FROM) * osd: 'osd recover clone overlap limit' option to limit cloning during recovery (Samuel Just) * osd: COPY_GET on-wire encoding improvements (Greg Farnum) * osd: add 'osd heartbeat min healthy ratio' configurable (was hard-coded at 33%) * osd: add option to disable pg log debug code (which burns CPU) * osd: allow cap strings with . to be unquoted * osd: automatically detect proper xattr limits (David Zafman) * osd: avoid extra copy in erasure coding reference implementation (Loic Dachary) * osd: basic cache pool redirects (Greg Farnum) * osd: basic whiteout, dirty flag support (not yet used) * osd: bloom_filter encodability, fixes, cleanups (Loic Dachary, Sage Weil) * osd: clean up and generalize copy-from code (Greg Farnum) * osd: cls_hello OSD class example * osd: erasure coding doc updates (Loic Dachary) * osd: erasure coding plugin infrastructure, tests (Loic Dachary) * osd: experiemental support for ZFS (zfsonlinux.org) (Yan, Zheng) * osd: fix RWORDER flags * osd: fix exponential backoff of slow request warnings (Loic Dachary) * osd: fix handling of racing read vs write (Samuel Just) * osd: fix version value returned by various operations (Greg Farnum) * osd: generalized temp object infrastructure * osd: ghobject_t infrastructure for EC (David Zafman) * osd: improvements for compatset support and storage (David Zafman) * osd: infrastructure to copy objects from other OSDs * osd: instrument peering states (David Zafman) * osd: misc copy-from improvements * osd: opportunistic crc checking on stored data (off by default) * osd: properly enforce RD/WR flags for rados classes * osd: reduce blocking on backing fs (Samuel Just) * osd: refactor recovery using PGBackend (Samuel Just) * osd: remove old magical tmap->omap conversion * osd: remove old pg log on upgrade (Samuel Just) * osd: revert xattr size limit (fixes large rgw uploads) * osd: use fdatasync(2) instead of fsync(2) to improve performance (Sam Just) * pybind: fix blacklisting nonce (Loic Dachary) * radosgw-agent: multi-region replication/DR * rgw: complete in-progress requests before shutting down * rgw: default log level is now more reasonable (Yehuda Sadeh) * rgw: fix S3 auth with response-* query string params (Sylvain Munaut, Yehuda Sadeh) * rgw: fix a few minor memory leaks (Yehuda Sadeh) * rgw: fix acl group check (Yehuda Sadeh) * rgw: fix inefficient use of std::list::size() (Yehuda Sadeh) * rgw: fix major CPU utilization bug with internal caching (Yehuda Sadeh, Mark Nelson) * rgw: fix ordering of write operations (preventing data loss on crash) (Yehuda Sadeh) * rgw: fix ordering of writes for mulitpart upload (Yehuda Sadeh) * rgw: fix various CORS bugs (Yehuda Sadeh) * rgw: fix/improve swift COPY support (Yehuda Sadeh) * rgw: improve help output (Christophe Courtaut) * rgw: misc fixes to support DR (Josh Durgin, Yehuda Sadeh) * rgw: per-bucket quota (Yehuda Sadeh) * rgw: validate S3 tokens against keystone (Roald J. van Loon) * rgw: wildcard support for keystone roles (Christophe Courtaut) * rpm: fix junit dependencies (Alan Grosskurth) * sysvinit radosgw: fix status return code (Danny Al-Gaaf) * sysvinit rbdmap: fix error 'service rbdmap stop' (Laurent Barbe) * sysvinit: add condrestart command (Dan van der Ster) * sysvinit: fix shutdown order (mons last) (Alfredo Deza) v0.79 ===== This release is intended to serve as a release candidate for firefly, which will hopefully be v0.80. No changes are being made to the code base at this point except those that fix bugs. Please test this release if you intend to make use of the new erasure-coded pools or cache tiers in firefly. This release fixes a range of bugs found in v0.78 and streamlines the user experience when creating erasure-coded pools. There is also a raft of fixes for the MDS (multi-mds, directory fragmentation, and large directories). The main notable new piece of functionality is a small change to allow radosgw to use an erasure-coded pool for object data. Upgrading --------- * Erasure pools created with v0.78 will no longer function with v0.79. You will need to delete the old pool and create a new one. * A bug was fixed in the authentication handshake with big-endian architectures that prevent authentication between big- and little-endian machines in the same cluster. If you have a cluster that consists entirely of big-endian machines, you will need to upgrade all daemons and clients and restart. * The 'ceph.file.layout' and 'ceph.dir.layout' extended attributes are no longer included in the listxattr(2) results to prevent problems with 'cp -a' and similar tools. * Monitor 'auth' read-only commands now expect the user to have 'rx' caps. This is the same behavior that was present in dumpling, but in emperor and more recent development releases the 'r' cap was sufficient. The affected commands are:: ceph auth export ceph auth get ceph auth get-key ceph auth print-key ceph auth list Notable Changes --------------- * ceph-conf: stop creating bogus log files (Josh Durgin, Sage Weil) * common: fix authentication on big-endian architectures (Dan Mick) * debian: change directory ownership between ceph and ceph-common (Sage Weil) * init: fix startup ordering/timeout problem with OSDs (Dmitry Smirnov) * librbd: skip zeroes/holes when copying sparse images (Josh Durgin) * mds: cope with MDS failure during creation (John Spray) * mds: fix crash from client sleep/resume (Zheng Yan) * mds: misc fixes for directory fragments (Zheng Yan) * mds: misc fixes for larger directories (Zheng Yan) * mds: misc fixes for multiple MDSs (Zheng Yan) * mds: remove .ceph directory (John Spray) * misc coverity fixes, cleanups (Danny Al-Gaaf) * mon: add erasure profiles and improve erasure pool creation (Loic Dachary) * mon: 'ceph osd pg-temp ...' and primary-temp commands (Ilya Dryomov) * mon: fix pool count in 'ceph -s' output (Sage Weil) * msgr: improve connection error detection between clients and monitors (Greg Farnum, Sage Weil) * osd: add/fix CPU feature detection for jerasure (Loic Dachary) * osd: improved scrub checks on clones (Sage Weil, Sam Just) * osd: many erasure fixes (Sam Just) * osd: move to jerasure2 library (Loic Dachary) * osd: new tests for erasure pools (David Zafman) * osd: reduce scrub lock contention (Guang Yang) * rgw: allow use of an erasure data pool (Yehuda Sadeh) v0.78 ===== This development release includes two key features: erasure coding and cache tiering. A huge amount of code was merged for this release and several additional weeks were spent stabilizing the code base, and it is now in a state where it is ready to be tested by a broader user base. This is *not* the firefly release. Firefly will be delayed for at least another sprint so that we can get some operational experience with the new code and do some additional testing before committing to long term support. .. note:: Please note that while it is possible to create and test erasure coded pools in this release, the pools will not be usable when you upgrade to v0.79 as the OSDMap encoding will subtlely change. Please do not populate your test pools with important data that can't be reloaded. Upgrading --------- * Upgrade daemons in the following order: #. Monitors #. OSDs #. MDSs and/or radosgw If the ceph-mds daemon is restarted first, it will wait until all OSDs have been upgraded before finishing its startup sequence. If the ceph-mon daemons are not restarted prior to the ceph-osd daemons, they will not correctly register their new capabilities with the cluster and new features may not be usable until they are restarted a second time. * Upgrade radosgw daemons together. There is a subtle change in behavior for multipart uploads that prevents a multipart request that was initiated with a new radosgw from being completed by an old radosgw. * CephFS recently added support for a new 'backtrace' attribute on file data objects that is used for lookup by inode number (i.e., NFS reexport and hard links), and will later be used by fsck repair. This replaces the existing anchor table mechanism that is used for hard link resolution. In order to completely phase that out, any inode that has an outdated backtrace attribute will get updated when the inode itself is modified. This will result in some extra workload after a legacy CephFS file system is upgraded. * The per-op return code in librados' ObjectWriteOperation interface is now filled in. * The librados cmpxattr operation now handles xattrs containing null bytes as data rather than null-terminated strings. * Compound operations in librados that create and then delete the same object are now explicitly disallowed (they fail with -EINVAL). * The default leveldb cache size for the ceph-osd daemon has been increased from 4 MB to 128 MB. This will increase the memory footprint of that process but tends to increase performance of omap (key/value) objects (used for CephFS and the radosgw). If memory in your deployment is tight, you can preserve the old behavio by adding:: leveldb write buffer size = 0 leveldb cache size = 0 to your ceph.conf to get back the (leveldb) defaults. Notable Changes --------------- * ceph-brag: new client and server tools (Sebastien Han, Babu Shanmugam) * ceph-disk: use partx on RHEL or CentOS instead of partprobe (Alfredo Deza) * ceph: fix combination of 'tell' and interactive mode (Joao Eduardo Luis) * ceph-fuse: fix bugs with inline data and multiple MDSs (Zheng Yan) * client: fix getcwd() to use new LOOKUPPARENT operation (Zheng Yan) * common: fall back to json-pretty for admin socket (Loic Dachary) * common: fix 'config dump' debug prefix (Danny Al-Gaaf) * common: misc coverity fixes (Danny Al-Gaaf) * common: throtller, shared_cache performance improvements, TrackedOp (Greg Farnum, Samuel Just) * crush: fix JSON schema for dump (John Spray) * crush: misc cleanups, tests (Loic Dachary) * crush: new vary_r tunable (Sage Weil) * crush: prevent invalid buckets of type 0 (Sage Weil) * keyvaluestore: add perfcounters, misc bug fixes (Haomai Wang) * keyvaluestore: portability improvements (Noah Watkins) * libcephfs: API changes to better support NFS reexport via Ganesha (Matt Benjamin, Adam Emerson, Andrey Kuznetsov, Casey Bodley, David Zafman) * librados: API documentation improvements (John Wilkins, Josh Durgin) * librados: fix object enumeration bugs; allow iterator assignment (Josh Durgin) * librados: streamline tests (Josh Durgin) * librados: support for atomic read and omap operations for C API (Josh Durgin) * librados: support for osd and mon command timeouts (Josh Durgin) * librbd: pass allocation hints to OSD (Ilya Dryomov) * logrotate: fix bug that prevented rotation for some daemons (Loic Dachary) * mds: avoid duplicated discovers during recovery (Zheng Yan) * mds: fix file lock owner checks (Zheng Yan) * mds: fix LOOKUPPARENT, new LOOKUPNAME ops for reliable NFS reexport (Zheng Yan) * mds: fix xattr handling on setxattr (Zheng Yan) * mds: fix xattrs in getattr replies (Sage Weil) * mds: force backtrace updates for old inodes on update (Zheng Yan) * mds: several multi-mds and dirfrag bug fixes (Zheng Yan) * mon: encode erasure stripe width in pool metadata (Loic Dachary) * mon: erasure code crush rule creation (Loic Dachary) * mon: erasure code plugin support (Loic Dachary) * mon: fix bugs in initial post-mkfs quorum creation (Sage Weil) * mon: fix error output to terminal during startup (Joao Eduardo Luis) * mon: fix legacy CRUSH tunables warning (Sage Weil) * mon: fix osd_epochs lower bound tracking for map trimming (Sage Weil) * mon: fix OSDMap encoding features (Sage Weil, Aaron Ten Clay) * mon: fix 'pg dump' JSON output (John Spray) * mon: include dirty stats in 'ceph df detail' (Sage Weil) * mon: list quorum member names in quorum order (Sage Weil) * mon: prevent addition of non-empty cache tier (Sage Weil) * mon: prevent deletion of CephFS pools (John Spray) * mon: warn when cache tier approaches 'full' (Sage Weil) * osd: allocation hint, with XFS support (Ilya Dryomov) * osd: erasure coded pool support (Samuel Just) * osd: fix bug causing slow/stalled recovery (#7706) (Samuel Just) * osd: fix bugs in log merging (Samuel Just) * osd: fix/clarify end-of-object handling on read (Loic Dachary) * osd: fix impolite mon session backoff, reconnect behavior (Greg Farnum) * osd: fix SnapContext cache id bug (Samuel Just) * osd: increase default leveldb cache size and write buffer (Sage Weil, Dmitry Smirnov) * osd: limit size of 'osd bench ...' arguments (Joao Eduardo Luis) * osdmaptool: new --test-map-pgs mode (Sage Weil, Ilya Dryomov) * osd, mon: add primary-affinity to adjust selection of primaries (Sage Weil) * osd: new 'status' admin socket command (Sage Weil) * osd: simple tiering agent (Sage Weil) * osd: store checksums for erasure coded object stripes (Samuel Just) * osd: tests for objectstore backends (Haomai Wang) * osd: various refactoring and bug fixes (Samuel Just, David Zafman) * rados: add 'set-alloc-hint' command (Ilya Dryomov) * rbd-fuse: fix enumerate_images overflow, memory leak (Ilya Dryomov) * rbdmap: fix upstart script (Stephan Renatus) * rgw: avoid logging system events to usage log (Yehuda Sadeh) * rgw: fix Swift range reponse (Yehuda Sadeh) * rgw: improve scalability for manifest objects (Yehuda Sadeh) * rgw: misc fixes for multipart objects, policies (Yehuda Sadeh) * rgw: support non-standard MultipartUpload command (Yehuda Sadeh) v0.77 ===== This is the final development release before the Firefly feature freeze. The main items in this release include some additional refactoring work in the OSD IO path (include some locking improvements), per-user quotas for the radosgw, a switch to civetweb from mongoose for the prototype radosgw standalone mode, and a prototype leveldb-based backend for the OSD. The C librados API also got support for atomic write operations (read side transactions will appear in v0.78). Upgrading --------- * The 'ceph -s' or 'ceph status' command's 'num_in_osds' field in the JSON and XML output has been changed from a string to an int. * The recently added 'ceph mds set allow_new_snaps' command's syntax has changed slightly; it is now 'ceph mds set allow_new_snaps true'. The 'unset' command has been removed; instead, set the value to 'false'. * The syntax for allowing snapshots is now 'mds set allow_new_snaps ' instead of 'mds allow_new_snaps'. Notable Changes --------------- * osd: client IO path changes for EC (Samuel Just) * common: portability changes to support libc++ (Noah Watkins) * common: switch to unordered_map from hash_map (Noah Watkins) * rgw: switch from mongoose to civetweb (Yehuda Sadeh) * osd: improve locking in fd lookup cache (Samuel Just, Greg Farnum) * doc: many many updates (John Wilkins) * rgw: user quotas (Yehuda Sadeh) * mon: persist quorum features to disk (Greg Farnum) * mon: MForward tests (Loic Dachary) * mds: inline data support (Li Wang, Yunchuan Wen) * rgw: fix many-part multipart uploads (Yehuda Sadeh) * osd: new keyvaluestore-dev backend based on leveldb (Haomai Wang) * rbd: prevent deletion of images with watchers (Ilya Dryomov) * osd: avoid touching leveldb for some xattrs (Haomai Wang, Sage Weil) * mailmap: affiliation updates (Loic Dachary) * osd: new OSDMap encoding (Greg Farnum) * osd: generalize scrubbing infrastructure to allow EC (David Zafman) * rgw: several doc fixes (Alexandre Marangone) * librados: add C API coverage for atomic write operations (Christian Marie) * rgw: improve swift temp URL support (Yehuda Sadeh) * rest-api: do not fail when no OSDs yet exist (Dan Mick) * common: check preexisting admin socket for active daemon before removing (Loic Dachary) * osd: handle more whitespace (newline, tab) in osd capabilities (Sage Weil) * mon: handle more whitespace (newline, tab) in mon capabilities (Sage Weil) * rgw: make multi-object delete idempotent (Yehuda Sadeh) * crush: fix off-by-one error in recent refactor (Sage Weil) * rgw: fix read_user_buckets 'max' behavior (Yehuda Sadeh) * mon: change mds allow_new_snaps syntax to be more consistent (Sage Weil) v0.76 ===== This release includes another batch of updates for firefly functionality. Most notably, the cache pool infrastructure now support snapshots, the OSD backfill functionality has been generalized to include multiple targets (necessary for the coming erasure pools), and there were performance improvements to the erasure code plugin on capable processors. The MDS now properly utilizes (and seamlessly migrates to) the OSD key/value interface (aka omap) for storing directory objects. There continue to be many other fixes and improvements for usability and code portability across the tree. Upgrading --------- * 'rbd ls' on a pool which never held rbd images now exits with code 0. It outputs nothing in plain format, or an empty list in non-plain format. This is consistent with the behavior for a pool which used to hold images, but contains none. Scripts relying on this behavior should be updated. * The MDS requires a new OSD operation TMAP2OMAP, added in this release. When upgrading, be sure to upgrade and restart the ceph-osd daemons before the ceph-mds daemon. The MDS will refuse to start if any up OSDs do not support the new feature. * The 'ceph mds set_max_mds N' command is now deprecated in favor of 'ceph mds set max_mds N'. Notable Changes --------------- * build: misc improvements (Ken Dreyer) * ceph-disk: generalize path names, add tests (Loic Dachary) * ceph-disk: misc improvements for puppet (Loic Dachary) * ceph-disk: several bug fixes (Loic Dachary) * ceph-fuse: fix race for sync reads (Sage Weil) * config: recursive metavariable expansion (Loic Dachary) * crush: usability and test improvements (Loic Dachary) * doc: misc fixes (David Moreau Simard, Kun Huang) * erasure-code: improve buffer alignment (Loic Dachary) * erasure-code: rewrite region-xor using vector operations (Andreas Peters) * librados, osd: new TMAP2OMAP operation (Yan, Zheng) * mailmap updates (Loic Dachary) * many portability improvements (Noah Watkins) * many unit test improvements (Loic Dachary) * mds: always store backtrace in default pool (Yan, Zheng) * mds: store directories in omap instead of tmap (Yan, Zheng) * mon: allow adjustment of cephfs max file size via 'ceph mds set max_file_size' (Sage Weil) * mon: do not create erasure rules by default (Sage Weil) * mon: do not generate spurious MDSMaps in certain cases (Sage Weil) * mon: do not use keyring if auth = none (Loic Dachary) * mon: fix pg_temp leaks (Joao Eduardo Luis) * osd: backfill to multiple targets (David Zafman) * osd: cache pool support for snapshots (Sage Weil) * osd: fix and cleanup misc backfill issues (David Zafman) * osd: fix omap_clear operation to not zap xattrs (Sam Just, Yan, Zheng) * osd: ignore num_objects_dirty on scrub for old pools (Sage Weil) * osd: include more info in pg query result (Sage Weil) * osd: track erasure compatibility (David Zafman) * rbd: make 'rbd list' return empty list and success on empty pool (Josh Durgin) * rgw: fix object placement read op (Yehuda Sadeh) * rgw: fix several CORS bugs (Robin H. Johnson) * specfile: fix RPM build on RHEL6 (Ken Dreyer, Derek Yarnell) * specfile: ship libdir/ceph (Key Dreyer) v0.75 ===== This is a big release, with lots of infrastructure going in for firefly. The big items include a prototype standalone frontend for radosgw (which does not require apache or fastcgi), tracking for read activity on the osds (to inform tiering decisions), preliminary cache pool support (no snapshots yet), and lots of bug fixes and other work across the tree to get ready for the next batch of erasure coding patches. For comparison, here are the diff stats for the last few versions:: v0.75 291 files changed, 82713 insertions(+), 33495 deletions(-) v0.74 192 files changed, 17980 insertions(+), 1062 deletions(-) v0.73 148 files changed, 4464 insertions(+), 2129 deletions(-) Upgrading --------- - The 'osd pool create ...' syntax has changed for erasure pools. - The default CRUSH rules and layouts are now using the latest and greatest tunables and defaults. Clusters using the old values will now present with a health WARN state. This can be disabled by adding 'mon warn on legacy crush tunables = false' to ceph.conf. Notable Changes --------------- * common: bloom filter improvements (Sage Weil) * common: fix config variable substitution (Loic Dachary) * crush, osd: s/rep/replicated/ for less confusion (Loic Dachary) * crush: refactor descend_once behavior; support set_choose*_tries for replicated rules (Sage Weil) * librados: fix throttle leak (and eventual deadlock) (Josh Durgin) * librados: read directly into user buffer (Rutger ter Borg) * librbd: fix use-after-free aio completion bug #5426 (Josh Durgin) * librbd: localize/distribute parent reads (Sage Weil) * mds: fix Resetter locking (Alexandre Oliva) * mds: fix cap migration behavior (Yan, Zheng) * mds: fix client session flushing (Yan, Zheng) * mds: fix many many multi-mds bugs (Yan, Zheng) * misc portability work (Noah Watkins) * mon, osd: create erasure style crush rules (Loic Dachary, Sage Weil) * mon: 'osd crush show-tunables' (Sage Weil) * mon: clean up initial crush rule creation (Loic Dachary) * mon: improve (replicate or erasure) pool creation UX (Loic Dachary) * mon: infrastructure to handle mixed-version mon cluster and cli/rest API (Greg Farnum) * mon: mkfs now idempotent (Loic Dachary) * mon: only seed new osdmaps to current OSDs (Sage Weil) * mon: track osd features in OSDMap (Joao Luis, David Zafman) * mon: warn if crush has non-optimal tunables (Sage Weil) * mount.ceph: add -n for autofs support (Steve Stock) * msgr: fix messenger restart race (Xihui He) * osd, librados: fix full cluster handling (Josh Durgin) * osd: add HitSet tracking for read ops (Sage Weil, Greg Farnum) * osd: backfill to osds not in acting set (David Zafman) * osd: enable new hashpspool layout by default (Sage Weil) * osd: erasure plugin benchmarking tool (Loic Dachary) * osd: fix XFS detection (Greg Farnum, Sushma Gurram) * osd: fix copy-get omap bug (Sage Weil) * osd: fix linux kernel version detection (Ilya Dryomov) * osd: fix memstore segv (Haomai Wang) * osd: fix several bugs with tier infrastructure * osd: fix throttle thread (Haomai Wang) * osd: preliminary cache pool support (no snaps) (Greg Farnum, Sage Weil) * rados tool: fix listomapvals (Josh Durgin) * rados: add 'crush location', smart replica selection/balancing (Sage Weil) * rados: some performance optimizations (Yehuda Sadeh) * rbd: add rbdmap support for upstart (Laurent Barbe) * rbd: expose kernel rbd client options via 'rbd map' (Ilya Dryomov) * rbd: fix bench-write command (Hoamai Wang) * rbd: support for 4096 mapped devices, up from ~250 (Ilya Dryomov) * rgw: allow multiple frontends (Yehuda Sadeh) * rgw: convert bucket info to new format on demand (Yehuda Sadeh) * rgw: fix misc CORS bugs (Robin H. Johnson) * rgw: prototype mongoose frontend (Yehuda Sadeh) v0.74 ===== This release includes a few substantial pieces for Firefly, including a long-overdue switch to 3x replication by default and a switch to the "new" CRUSH tunables by default (supported since bobtail). There is also a fix for a long-standing radosgw bug (stalled GET) that has already been backported to emperor and dumpling. Upgrading --------- * We now default to the 'bobtail' CRUSH tunable values that are first supported by Ceph clients in bobtail (v0.56) and Linux kernel version v3.9. If you plan to access a newly created Ceph cluster with an older kernel client, you should use 'ceph osd crush tunables legacy' to switch back to the legacy behavior. Note that making that change will likely result in some data movement in the system, so adjust the setting before populating the new cluster with data. * We now set the HASHPSPOOL flag on newly created pools (and new clusters) by default. Support for this flag first appeared in v0.64; v0.67 Dumpling is the first major release that supports it. It is first supported by the Linux kernel version v3.9. If you plan to access a newly created Ceph cluster with an older kernel or clients (e.g, librados, librbd) from a pre-dumpling Ceph release, you should add 'osd pool default flag hashpspool = false' to the '[global]' section of your 'ceph.conf' prior to creating your monitors (e.g., after 'ceph-deploy new' but before 'ceph-deploy mon create ...'). * The configuration option 'osd pool default crush rule' is deprecated and replaced with 'osd pool default crush replicated ruleset'. 'osd pool default crush rule' takes precedence for backward compatibility and a deprecation warning is displayed when it is used. Notable Changes --------------- * buffer: some zero-copy groundwork (Josh Durgin) * ceph-disk: avoid fd0 (Loic Dachary) * crush: default to bobtail tunables (Sage Weil) * crush: many additional tests (Loic Dachary) * crush: misc fixes, cleanups (Loic Dachary) * crush: new rule steps to adjust retry attempts (Sage Weil) * debian: integrate misc fixes from downstream packaging (James Page) * doc: big update to install docs (John Wilkins) * libcephfs: fix resource leak (Zheng Yan) * misc coverity fixes (Xing Lin, Li Wang, Danny Al-Gaaf) * misc portability fixes (Noah Watkins, Alan Somers) * mon, osd: new 'erasure' pool type (still not fully supported) * mon: add 'mon getmap EPOCH' (Joao Eduardo Luis) * mon: collect misc metadata about osd (os, kernel, etc.), new 'osd metadata' command (Sage Weil) * osd: default to 3x replication * osd: do not include backfill targets in acting set (David Zafman) * osd: new 'chassis' type in default crush hierarchy (Sage Weil) * osd: requery unfound on stray notify (#6909) (Samuel Just) * osd: some PGBackend infrastructure (Samuel Just) * osd: support for new 'memstore' (memory-backed) backend (Sage Weil) * rgw: fix fastcgi deadlock (do not return data from librados callback) (Yehuda Sadeh) * rgw: fix reading bucket policy (#6940) * rgw: fix use-after-free when releasing completion handle (Yehuda Sadeh) v0.73 ===== This release, the first development release after emperor, includes many bug fixes and a few additional pieces of functionality. The first batch of larger changes will be landing in the next version, v0.74. Upgrading --------- - As part of fix for #6796, 'ceph osd pool set ' now receives as an integer instead of a string. This affects how 'hashpspool' flag is set/unset: instead of 'true' or 'false', it now must be '0' or '1'. - The behavior of the CRUSH 'indep' choose mode has been changed. No ceph cluster should have been using this behavior unless someone has manually extracted a crush map, modified a CRUSH rule to replace 'firstn' with 'indep', recompiled, and reinjected the new map into the cluster. If the 'indep' mode is currently in use on a cluster, the rule should be modified to use 'firstn' instead, and the administrator should wait until any data movement completes before upgrading. - The 'osd dump' command now dumps pool snaps as an array instead of an object. - The radosgw init script (sysvinit) how requires that the 'host = ...' line in ceph.conf, if present, match the short hostname (the output of 'hostname -s'), not the fully qualified hostname or the (occasionally non-short) output of 'hostname'. Failure to adjust this when upgrading from emperor or dumpling may prevent the radosgw daemon from starting. Notable Changes --------------- * ceph-crush-location: new hook for setting CRUSH location of osd daemons on start * ceph-kvstore-tool: expanded command set and capabilities (Joao Eduardo Luis) * ceph.spec: fix build dependency (Loic Dachary) * common: fix aligned buffer allocation (Loic Dachary) * doc: many many install doc improvements (John Wilkins) * mds: fix readdir end check (Zheng Yan) * mds: update old-format backtraces opportunistically (Zheng Yan) * misc cleanups from coverity (Xing Lin) * misc portability fixes (Noah Watkins, Christophe Courtaut, Alan Somers, huanjun) * mon: 'osd dump' dumps pool snaps as array, not object (Dan Mick) * mon: allow debug quorum_{enter,exit} commands via admin socket * mon: prevent extreme changes in pool pg_num (Greg Farnum) * mon: take 'osd pool set ...' value as an int, not string (Joao Eduardo Luis) * mon: trim MDSMaps (Joao Eduardo Luis) * osd: fix object_info_t encoding bug from emperor (Sam Just) * rbd: add 'rbdmap' init script for mapping rbd images on book (Adam Twardowski) * rgw: add 'status' command to sysvinit script (David Moreau Simard) * rgw: fix error setting empty owner on ACLs (Yehuda Sadeh) * rgw: optionally defer to bucket ACLs instead of object ACLs (Liam Monahan) * rgw: support for password (instead of admin token) for keystone authentication (Christophe Courtaut) * sysvinit, upstart: prevent both init systems from starting the same daemons (Josh Durgin) v0.72.3 Emperor (pending release) ================================= Upgrading --------- * Monitor 'auth' read-only commands now expect the user to have 'rx' caps. This is the same behavior that was present in dumpling, but in emperor and more recent development releases the 'r' cap was sufficient. Note that this backported security fix will break mon keys that are using the following commands but do not have the 'x' bit in the mon capability:: ceph auth export ceph auth get ceph auth get-key ceph auth print-key ceph auth list v0.72.2 Emperor =============== This is the second bugfix release for the v0.72.x Emperor series. We have fixed a hang in radosgw, and fixed (again) a problem with monitor CLI compatiblity with mixed version monitors. (In the future this will no longer be a problem.) Upgrading --------- * The JSON schema for the 'osd pool set ...' command changed slightly. Please avoid issuing this particular command via the CLI while there is a mix of v0.72.1 and v0.72.2 monitor daemons running. * As part of fix for #6796, 'ceph osd pool set ' now receives as an integer instead of a string. This affects how 'hashpspool' flag is set/unset: instead of 'true' or 'false', it now must be '0' or '1'. Changes ------- * mon: 'osd pool set ...' syntax change * osd: added test for missing on-disk HEAD object * osd: fix osd bench block size argument * rgw: fix hang on large object GET * rgw: fix rare use-after-free * rgw: various DR bug fixes * rgw: do not return error on empty owner when setting ACL * sysvinit, upstart: prevent starting daemons using both init systems For more detailed information, see :download:`the complete changelog `. v0.72.1 Emperor =============== Important Note -------------- When you are upgrading from Dumpling to Emperor, do not run any of the "ceph osd pool set" commands while your monitors are running separate versions. Doing so could result in inadvertently changing cluster configuration settings that exhaust compute resources in your OSDs. Changes ------- * osd: fix upgrade bug #6761 * ceph_filestore_tool: introduced tool to repair errors caused by #6761 This release addresses issue #6761. Upgrading to Emperor can cause reads to begin returning ENFILE (too many open files). v0.72.1 fixes that upgrade issue and adds a tool ceph_filestore_tool to repair osd stores affected by this bug. To repair a cluster affected by this bug: #. Upgrade all osd machines to v0.72.1 #. Install the ceph-test package on each osd machine to get ceph_filestore_tool #. Stop all osd processes #. To see all lost objects, run the following on each osd with the osd stopped and the osd data directory mounted:: ceph_filestore_tool --list-lost-objects=true --filestore-path= --journal-path= #. To fix all lost objects, run the following on each osd with the osd stopped and the osd data directory mounted:: ceph_filestore_tool --fix-lost-objects=true --list-lost-objects=true --filestore-path= --journal-path= #. Once lost objects have been repaired on each osd, you can restart the cluster. Note, the ceph_filestore_tool performs a scan of all objects on the osd and may take some time. v0.72 Emperor ============= This is the fifth major release of Ceph, the fourth since adopting a 3-month development cycle. This release brings several new features, including multi-datacenter replication for the radosgw, improved usability, and lands a lot of incremental performance and internal refactoring work to support upcoming features in Firefly. Important Note -------------- When you are upgrading from Dumpling to Emperor, do not run any of the "ceph osd pool set" commands while your monitors are running separate versions. Doing so could result in inadvertently changing cluster configuration settings that exhaust compute resources in your OSDs. Highlights ---------- * common: improved crc32c performance * librados: new example client and class code * mds: many bug fixes and stability improvements * mon: health warnings when pool pg_num values are not reasonable * mon: per-pool performance stats * osd, librados: new object copy primitives * osd: improved interaction with backend file system to reduce latency * osd: much internal refactoring to support ongoing erasure coding and tiering support * rgw: bucket quotas * rgw: improved CORS support * rgw: performance improvements * rgw: validate S3 tokens against Keystone Coincident with core Ceph, the Emperor release also brings: * radosgw-agent: support for multi-datacenter replication for disaster recovery * tgt: improved support for iSCSI via upstream tgt Packages for both are available on ceph.com. Upgrade sequencing ------------------ There are no specific upgrade restrictions on the order or sequence of upgrading from 0.67.x Dumpling. However, you cannot run any of the "ceph osd pool set" commands while your monitors are running separate versions. Doing so could result in inadvertently changing cluster configuration settings and exhausting compute resources in your OSDs. It is also possible to do a rolling upgrade from 0.61.x Cuttlefish, but there are ordering restrictions. (This is the same set of restrictions for Cuttlefish to Dumpling.) #. Upgrade ceph-common on all nodes that will use the command line 'ceph' utility. #. Upgrade all monitors (upgrade ceph package, restart ceph-mon daemons). This can happen one daemon or host at a time. Note that because cuttlefish and dumpling monitors can't talk to each other, all monitors should be upgraded in relatively short succession to minimize the risk that an a untimely failure will reduce availability. #. Upgrade all osds (upgrade ceph package, restart ceph-osd daemons). This can happen one daemon or host at a time. #. Upgrade radosgw (upgrade radosgw package, restart radosgw daemons). Upgrading from v0.71 -------------------- * ceph-fuse and radosgw now use the same default values for the admin socket and log file paths that the other daemons (ceph-osd, ceph-mon, etc.) do. If you run these daemons as non-root, you may need to adjust your ceph.conf to disable these options or to adjust the permissions on /var/run/ceph and /var/log/ceph. Upgrading from v0.67 Dumpling ----------------------------- * ceph-fuse and radosgw now use the same default values for the admin socket and log file paths that the other daemons (ceph-osd, ceph-mon, etc.) do. If you run these daemons as non-root, you may need to adjust your ceph.conf to disable these options or to adjust the permissions on /var/run/ceph and /var/log/ceph. * The MDS now disallows snapshots by default as they are not considered stable. The command 'ceph mds set allow_snaps' will enable them. * For clusters that were created before v0.44 (pre-argonaut, Spring 2012) and store radosgw data, the auto-upgrade from TMAP to OMAP objects has been disabled. Before upgrading, make sure that any buckets created on pre-argonaut releases have been modified (e.g., by PUTing and then DELETEing an object from each bucket). Any cluster created with argonaut (v0.48) or a later release or not using radosgw never relied on the automatic conversion and is not affected by this change. * Any direct users of the 'tmap' portion of the librados API should be aware that the automatic tmap -> omap conversion functionality has been removed. * Most output that used K or KB (e.g., for kilobyte) now uses a lower-case k to match the official SI convention. Any scripts that parse output and check for an upper-case K will need to be modified. * librados::Rados::pool_create_async() and librados::Rados::pool_delete_async() don't drop a reference to the completion object on error, caller needs to take care of that. This has never really worked correctly and we were leaking an object * 'ceph osd crush set ' no longer adds the osd to the specified location, as that's a job for 'ceph osd crush add'. It will however continue to work just the same as long as the osd already exists in the crush map. * The OSD now enforces that class write methods cannot both mutate an object and return data. The rbd.assign_bid method, the lone offender, has been removed. This breaks compatibility with pre-bobtail librbd clients by preventing them from creating new images. * librados now returns on commit instead of ack for synchronous calls. This is a bit safer in the case where both OSDs and the client crash, and is probably how it should have been acting from the beginning. Users are unlikely to notice but it could result in lower performance in some circumstances. Those who care should switch to using the async interfaces, which let you specify safety semantics precisely. * The C++ librados AioComplete::get_version() method was incorrectly returning an int (usually 32-bits). To avoid breaking library compatibility, a get_version64() method is added that returns the full-width value. The old method is deprecated and will be removed in a future release. Users of the C++ librados API that make use of the get_version() method should modify their code to avoid getting a value that is truncated from 64 to to 32 bits. Notable Changes since v0.71 --------------------------- * build: fix [/usr]/sbin locations (Alan Somers) * ceph-fuse, radosgw: enable admin socket and logging by default * ceph: make -h behave when monitors are down * common: cache crc32c values where possible * common: fix looping on BSD (Alan Somers) * librados, mon: ability to query/ping out-of-quorum monitor status (Joao Luis) * librbd python bindings: fix parent image name limit (Josh Durgin) * mds: avoid leaking objects when deleting truncated files (Yan, Zheng) * mds: fix F_GETLK (Yan, Zheng) * mds: fix many bugs with stray (unlinked) inodes (Yan, Zheng) * mds: fix many directory fragmentation bugs (Yan, Zheng) * mon: allow (un)setting HASHPSPOOL flag on existing pools (Joao Luis) * mon: make 'osd pool rename' idempotent (Joao Luis) * osd: COPY_GET on-wire encoding improvements (Greg Farnum) * osd: bloom_filter encodability, fixes, cleanups (Loic Dachary, Sage Weil) * osd: fix handling of racing read vs write (Samuel Just) * osd: reduce blocking on backing fs (Samuel Just) * radosgw-agent: multi-region replication/DR * rgw: fix/improve swift COPY support (Yehuda Sadeh) * rgw: misc fixes to support DR (Josh Durgin, Yehuda Sadeh) * rgw: per-bucket quota (Yehuda Sadeh) * rpm: fix junit dependencies (Alan Grosskurth) Notable Changes since v0.67 Dumpling ------------------------------------ * build cleanly under clang (Christophe Courtaut) * build: Makefile refactor (Roald J. van Loon) * build: fix [/usr]/sbin locations (Alan Somers) * ceph-disk: fix journal preallocation * ceph-fuse, radosgw: enable admin socket and logging by default * ceph-fuse: fix problem with readahead vs truncate race (Yan, Zheng) * ceph-fuse: trim deleted inodes from cache (Yan, Zheng) * ceph-fuse: use newer fuse api (Jianpeng Ma) * ceph-kvstore-tool: new tool for working with leveldb (copy, crc) (Joao Luis) * ceph-post-file: new command to easily share logs or other files with ceph devs * ceph: improve parsing of CEPH_ARGS (Benoit Knecht) * ceph: make -h behave when monitors are down * ceph: parse CEPH_ARGS env variable * common: bloom_filter improvements, cleanups * common: cache crc32c values where possible * common: correct SI is kB not KB (Dan Mick) * common: fix looping on BSD (Alan Somers) * common: migrate SharedPtrRegistry to use boost::shared_ptr<> (Loic Dachary) * common: misc portability fixes (Noah Watkins) * crc32c: fix optimized crc32c code (it now detects arch support properly) * crc32c: improved intel-optimized crc32c support (~8x faster on my laptop!) * crush: fix name caching * doc: erasure coding design notes (Loic Dachary) * hadoop: removed old version of shim to avoid confusing users (Noah Watkins) * librados, mon: ability to query/ping out-of-quorum monitor status (Joao Luis) * librados: fix async aio completion wakeup * librados: fix installed header #includes (Dan Mick) * librados: get_version64() method for C++ API * librados: hello_world example (Greg Farnum) * librados: sync calls now return on commit (instead of ack) (Greg Farnum) * librbd python bindings: fix parent image name limit (Josh Durgin) * librbd, ceph-fuse: avoid some sources of ceph-fuse, rbd cache stalls * mds: avoid leaking objects when deleting truncated files (Yan, Zheng) * mds: fix F_GETLK (Yan, Zheng) * mds: fix LOOKUPSNAP bug * mds: fix heap profiler commands (Joao Luis) * mds: fix locking deadlock (David Disseldorp) * mds: fix many bugs with stray (unlinked) inodes (Yan, Zheng) * mds: fix many directory fragmentation bugs (Yan, Zheng) * mds: fix mds rejoin with legacy parent backpointer xattrs (Alexandre Oliva) * mds: fix rare restart/failure race during fs creation * mds: fix standby-replay when we fall behind (Yan, Zheng) * mds: fix stray directory purging (Yan, Zheng) * mds: notify clients about deleted files (so they can release from their cache) (Yan, Zheng) * mds: several bug fixes with clustered mds (Yan, Zheng) * mon, osd: improve osdmap trimming logic (Samuel Just) * mon, osd: initial CLI for configuring tiering * mon: a few 'ceph mon add' races fixed (command is now idempotent) (Joao Luis) * mon: allow (un)setting HASHPSPOOL flag on existing pools (Joao Luis) * mon: allow cap strings with . to be unquoted * mon: allow logging level of cluster log (/var/log/ceph/ceph.log) to be adjusted * mon: avoid rewriting full osdmaps on restart (Joao Luis) * mon: continue to discover peer addr info during election phase * mon: disallow CephFS snapshots until 'ceph mds set allow_new_snaps' (Greg Farnum) * mon: do not expose uncommitted state from 'osd crush {add,set} ...' (Joao Luis) * mon: fix 'ceph osd crush reweight ...' (Joao Luis) * mon: fix 'osd crush move ...' command for buckets (Joao Luis) * mon: fix byte counts (off by factor of 4) (Dan Mick, Joao Luis) * mon: fix paxos corner case * mon: kv properties for pools to support EC (Loic Dachary) * mon: make 'osd pool rename' idempotent (Joao Luis) * mon: modify 'auth add' semantics to make a bit more sense (Joao Luis) * mon: new 'osd perf' command to dump recent performance information (Samuel Just) * mon: new and improved 'ceph -s' or 'ceph status' command (more info, easier to read) * mon: some auth check cleanups (Joao Luis) * mon: track per-pool stats (Joao Luis) * mon: warn about pools with bad pg_num * mon: warn when mon data stores grow very large (Joao Luis) * monc: fix small memory leak * new wireshark patches pulled into the tree (Kevin Jones) * objecter, librados: redirect requests based on cache tier config * objecter: fix possible hang when cluster is unpaused (Josh Durgin) * osd, librados: add new COPY_FROM rados operation * osd, librados: add new COPY_GET rados operations (used by COPY_FROM) * osd: 'osd recover clone overlap limit' option to limit cloning during recovery (Samuel Just) * osd: COPY_GET on-wire encoding improvements (Greg Farnum) * osd: add 'osd heartbeat min healthy ratio' configurable (was hard-coded at 33%) * osd: add option to disable pg log debug code (which burns CPU) * osd: allow cap strings with . to be unquoted * osd: automatically detect proper xattr limits (David Zafman) * osd: avoid extra copy in erasure coding reference implementation (Loic Dachary) * osd: basic cache pool redirects (Greg Farnum) * osd: basic whiteout, dirty flag support (not yet used) * osd: bloom_filter encodability, fixes, cleanups (Loic Dachary, Sage Weil) * osd: clean up and generalize copy-from code (Greg Farnum) * osd: cls_hello OSD class example * osd: erasure coding doc updates (Loic Dachary) * osd: erasure coding plugin infrastructure, tests (Loic Dachary) * osd: experiemental support for ZFS (zfsonlinux.org) (Yan, Zheng) * osd: fix RWORDER flags * osd: fix exponential backoff of slow request warnings (Loic Dachary) * osd: fix handling of racing read vs write (Samuel Just) * osd: fix version value returned by various operations (Greg Farnum) * osd: generalized temp object infrastructure * osd: ghobject_t infrastructure for EC (David Zafman) * osd: improvements for compatset support and storage (David Zafman) * osd: infrastructure to copy objects from other OSDs * osd: instrument peering states (David Zafman) * osd: misc copy-from improvements * osd: opportunistic crc checking on stored data (off by default) * osd: properly enforce RD/WR flags for rados classes * osd: reduce blocking on backing fs (Samuel Just) * osd: refactor recovery using PGBackend (Samuel Just) * osd: remove old magical tmap->omap conversion * osd: remove old pg log on upgrade (Samuel Just) * osd: revert xattr size limit (fixes large rgw uploads) * osd: use fdatasync(2) instead of fsync(2) to improve performance (Sam Just) * pybind: fix blacklisting nonce (Loic Dachary) * radosgw-agent: multi-region replication/DR * rgw: complete in-progress requests before shutting down * rgw: default log level is now more reasonable (Yehuda Sadeh) * rgw: fix S3 auth with response-* query string params (Sylvain Munaut, Yehuda Sadeh) * rgw: fix a few minor memory leaks (Yehuda Sadeh) * rgw: fix acl group check (Yehuda Sadeh) * rgw: fix inefficient use of std::list::size() (Yehuda Sadeh) * rgw: fix major CPU utilization bug with internal caching (Yehuda Sadeh, Mark Nelson) * rgw: fix ordering of write operations (preventing data loss on crash) (Yehuda Sadeh) * rgw: fix ordering of writes for mulitpart upload (Yehuda Sadeh) * rgw: fix various CORS bugs (Yehuda Sadeh) * rgw: fix/improve swift COPY support (Yehuda Sadeh) * rgw: improve help output (Christophe Courtaut) * rgw: misc fixes to support DR (Josh Durgin, Yehuda Sadeh) * rgw: per-bucket quota (Yehuda Sadeh) * rgw: validate S3 tokens against keystone (Roald J. van Loon) * rgw: wildcard support for keystone roles (Christophe Courtaut) * rpm: fix junit dependencies (Alan Grosskurth) * sysvinit radosgw: fix status return code (Danny Al-Gaaf) * sysvinit rbdmap: fix error 'service rbdmap stop' (Laurent Barbe) * sysvinit: add condrestart command (Dan van der Ster) * sysvinit: fix shutdown order (mons last) (Alfredo Deza) v0.71 ===== This development release includes a significant amount of new code and refactoring, as well as a lot of preliminary functionality that will be needed for erasure coding and tiering support. There are also several significant patch sets improving this with the MDS. Upgrading --------- * The MDS now disallows snapshots by default as they are not considered stable. The command 'ceph mds set allow_snaps' will enable them. * For clusters that were created before v0.44 (pre-argonaut, Spring 2012) and store radosgw data, the auto-upgrade from TMAP to OMAP objects has been disabled. Before upgrading, make sure that any buckets created on pre-argonaut releases have been modified (e.g., by PUTing and then DELETEing an object from each bucket). Any cluster created with argonaut (v0.48) or a later release or not using radosgw never relied on the automatic conversion and is not affected by this change. * Any direct users of the 'tmap' portion of the librados API should be aware that the automatic tmap -> omap conversion functionality has been removed. * Most output that used K or KB (e.g., for kilobyte) now uses a lower-case k to match the official SI convention. Any scripts that parse output and check for an upper-case K will need to be modified. Notable Changes --------------- * build: Makefile refactor (Roald J. van Loon) * ceph-disk: fix journal preallocation * ceph-fuse: trim deleted inodes from cache (Yan, Zheng) * ceph-fuse: use newer fuse api (Jianpeng Ma) * ceph-kvstore-tool: new tool for working with leveldb (copy, crc) (Joao Luis) * common: bloom_filter improvements, cleanups * common: correct SI is kB not KB (Dan Mick) * common: misc portability fixes (Noah Watkins) * hadoop: removed old version of shim to avoid confusing users (Noah Watkins) * librados: fix installed header #includes (Dan Mick) * librbd, ceph-fuse: avoid some sources of ceph-fuse, rbd cache stalls * mds: fix LOOKUPSNAP bug * mds: fix standby-replay when we fall behind (Yan, Zheng) * mds: fix stray directory purging (Yan, Zheng) * mon: disallow CephFS snapshots until 'ceph mds set allow_new_snaps' (Greg Farnum) * mon, osd: improve osdmap trimming logic (Samuel Just) * mon: kv properties for pools to support EC (Loic Dachary) * mon: some auth check cleanups (Joao Luis) * mon: track per-pool stats (Joao Luis) * mon: warn about pools with bad pg_num * osd: automatically detect proper xattr limits (David Zafman) * osd: avoid extra copy in erasure coding reference implementation (Loic Dachary) * osd: basic cache pool redirects (Greg Farnum) * osd: basic whiteout, dirty flag support (not yet used) * osd: clean up and generalize copy-from code (Greg Farnum) * osd: erasure coding doc updates (Loic Dachary) * osd: erasure coding plugin infrastructure, tests (Loic Dachary) * osd: fix RWORDER flags * osd: fix exponential backoff of slow request warnings (Loic Dachary) * osd: generalized temp object infrastructure * osd: ghobject_t infrastructure for EC (David Zafman) * osd: improvements for compatset support and storage (David Zafman) * osd: misc copy-from improvements * osd: opportunistic crc checking on stored data (off by default) * osd: refactor recovery using PGBackend (Samuel Just) * osd: remove old magical tmap->omap conversion * pybind: fix blacklisting nonce (Loic Dachary) * rgw: default log level is now more reasonable (Yehuda Sadeh) * rgw: fix acl group check (Yehuda Sadeh) * sysvinit: fix shutdown order (mons last) (Alfredo Deza) v0.70 ===== Upgrading --------- * librados::Rados::pool_create_async() and librados::Rados::pool_delete_async() don't drop a reference to the completion object on error, caller needs to take care of that. This has never really worked correctly and we were leaking an object * 'ceph osd crush set ' no longer adds the osd to the specified location, as that's a job for 'ceph osd crush add'. It will however continue to work just the same as long as the osd already exists in the crush map. Notable Changes --------------- * mon: a few 'ceph mon add' races fixed (command is now idempotent) (Joao Luis) * crush: fix name caching * rgw: fix a few minor memory leaks (Yehuda Sadeh) * ceph: improve parsing of CEPH_ARGS (Benoit Knecht) * mon: avoid rewriting full osdmaps on restart (Joao Luis) * crc32c: fix optimized crc32c code (it now detects arch support properly) * mon: fix 'ceph osd crush reweight ...' (Joao Luis) * osd: revert xattr size limit (fixes large rgw uploads) * mds: fix heap profiler commands (Joao Luis) * rgw: fix inefficient use of std::list::size() (Yehuda Sadeh) v0.69 ===== Upgrading --------- * The sysvinit /etc/init.d/ceph script will, by default, update the CRUSH location of an OSD when it starts. Previously, if the monitors were not available, this command would hang indefinitely. Now, that step will time out after 10 seconds and the ceph-osd daemon will not be started. * Users of the librados C++ API should replace users of get_version() with get_version64() as the old method only returns a 32-bit value for a 64-bit field. The existing 32-bit get_version() method is now deprecated. * The OSDs are now more picky that request payload match their declared size. A write operation across N bytes that includes M bytes of data will now be rejected. No known clients do this, but the because the server-side behavior has changed it is possible that an application misusing the interface may now get errors. * The OSD now enforces that class write methods cannot both mutate an object and return data. The rbd.assign_bid method, the lone offender, has been removed. This breaks compatibility with pre-bobtail librbd clients by preventing them from creating new images. * librados now returns on commit instead of ack for synchronous calls. This is a bit safer in the case where both OSDs and the client crash, and is probably how it should have been acting from the beginning. Users are unlikely to notice but it could result in lower performance in some circumstances. Those who care should switch to using the async interfaces, which let you specify safety semantics precisely. * The C++ librados AioComplete::get_version() method was incorrectly returning an int (usually 32-bits). To avoid breaking library compatibility, a get_version64() method is added that returns the full-width value. The old method is deprecated and will be removed in a future release. Users of the C++ librados API that make use of the get_version() method should modify their code to avoid getting a value that is truncated from 64 to to 32 bits. Notable Changes --------------- * build cleanly under clang (Christophe Courtaut) * common: migrate SharedPtrRegistry to use boost::shared_ptr<> (Loic Dachary) * doc: erasure coding design notes (Loic Dachary) * improved intel-optimized crc32c support (~8x faster on my laptop!) * librados: get_version64() method for C++ API * mds: fix locking deadlock (David Disseldorp) * mon, osd: initial CLI for configuring tiering * mon: allow cap strings with . to be unquoted * mon: continue to discover peer addr info during election phase * mon: fix 'osd crush move ...' command for buckets (Joao Luis) * mon: warn when mon data stores grow very large (Joao Luis) * objecter, librados: redirect requests based on cache tier config * osd, librados: add new COPY_FROM rados operation * osd, librados: add new COPY_GET rados operations (used by COPY_FROM) * osd: add 'osd heartbeat min healthy ratio' configurable (was hard-coded at 33%) * osd: add option to disable pg log debug code (which burns CPU) * osd: allow cap strings with . to be unquoted * osd: fix version value returned by various operations (Greg Farnum) * osd: infrastructure to copy objects from other OSDs * osd: use fdatasync(2) instead of fsync(2) to improve performance (Sam Just) * rgw: fix major CPU utilization bug with internal caching (Yehuda Sadeh, Mark Nelson) * rgw: fix ordering of write operations (preventing data loss on crash) (Yehuda Sadeh) * rgw: fix ordering of writes for mulitpart upload (Yehuda Sadeh) * rgw: fix various CORS bugs (Yehuda Sadeh) * rgw: improve help output (Christophe Courtaut) * rgw: validate S3 tokens against keystone (Roald J. van Loon) * rgw: wildcard support for keystone roles (Christophe Courtaut) * sysvinit radosgw: fix status return code (Danny Al-Gaaf) * sysvinit rbdmap: fix error 'service rbdmap stop' (Laurent Barbe) v0.68 ===== Upgrading --------- * 'ceph osd crush set ' no longer adds the osd to the specified location, as that's a job for 'ceph osd crush add'. It will however continue to work just the same as long as the osd already exists in the crush map. * The OSD now enforces that class write methods cannot both mutate an object and return data. The rbd.assign_bid method, the lone offender, has been removed. This breaks compatibility with pre-bobtail librbd clients by preventing them from creating new images. * librados now returns on commit instead of ack for synchronous calls. This is a bit safer in the case where both OSDs and the client crash, and is probably how it should have been acting from the beginning. Users are unlikely to notice but it could result in lower performance in some circumstances. Those who care should switch to using the async interfaces, which let you specify safety semantics precisely. * The C++ librados AioComplete::get_version() method was incorrectly returning an int (usually 32-bits). To avoid breaking library compatibility, a get_version64() method is added that returns the full-width value. The old method is deprecated and will be removed in a future release. Users of the C++ librados API that make use of the get_version() method should modify their code to avoid getting a value that is truncated from 64 to to 32 bits. Notable Changes --------------- * ceph-fuse: fix problem with readahead vs truncate race (Yan, Zheng) * ceph-post-file: new command to easily share logs or other files with ceph devs * ceph: parse CEPH_ARGS env variable * librados: fix async aio completion wakeup * librados: hello_world example (Greg Farnum) * librados: sync calls now return on commit (instead of ack) (Greg Farnum) * mds: fix mds rejoin with legacy parent backpointer xattrs (Alexandre Oliva) * mds: fix rare restart/failure race during fs creation * mds: notify clients about deleted files (so they can release from their cache) (Yan, Zheng) * mds: several bug fixes with clustered mds (Yan, Zheng) * mon: allow logging level of cluster log (/var/log/ceph/ceph.log) to be adjusted * mon: do not expose uncommitted state from 'osd crush {add,set} ...' (Joao Luis) * mon: fix byte counts (off by factor of 4) (Dan Mick, Joao Luis) * mon: fix paxos corner case * mon: modify 'auth add' semantics to make a bit more sense (Joao Luis) * mon: new 'osd perf' command to dump recent performance information (Samuel Just) * mon: new and improved 'ceph -s' or 'ceph status' command (more info, easier to read) * monc: fix small memory leak * new wireshark patches pulled into the tree (Kevin Jones) * objecter: fix possible hang when cluster is unpaused (Josh Durgin) * osd: 'osd recover clone overlap limit' option to limit cloning during recovery (Samuel Just) * osd: cls_hello OSD class example * osd: experiemental support for ZFS (zfsonlinux.org) (Yan, Zheng) * osd: instrument peering states (David Zafman) * osd: properly enforce RD/WR flags for rados classes * osd: remove old pg log on upgrade (Samuel Just) * rgw: complete in-progress requests before shutting down * rgw: fix S3 auth with response-* query string params (Sylvain Munaut, Yehuda Sadeh) * sysvinit: add condrestart command (Dan van der Ster) v0.67.12 "Dumpling" (draft) =========================== This stable update for Dumpling fixes a few longstanding issues with backfill in the OSD that can lead to stalled IOs. There is also a fix for memory utilization for reads in librbd when caching is enabled, and then several other small fixes across the rest of the system. Dumpling users who have encountered IO stalls during backfill and who do not expect to upgrade to Firefly soon should upgrade. Everyone else should upgrade to Firefly already. This is likely to be the last stable release for the 0.67.x Dumpling series. Notable Changes --------------- * buffer: fix buffer rebuild alignment corner case (#6614 #6003 Loic Dachary, Samuel Just) * ceph-disk: reprobe partitions after zap (#9665 #9721 Loic Dachary) * ceph-disk: use partx instead of partprobe when appropriate (Loic Dachary) * common: add $cctid meta variable (#6228 Adam Crume) * crush: fix get_full_location_ordered (Sage Weil) * crush: pick ruleset id that matches rule_id (#9675 Xiaoxi Chen) * libcephfs: fix tid wrap bug (#9869 Greg Farnum) * libcephfs: get osd location on -1 should return EINVAL (Sage Weil) * librados: fix race condition with C API and op timeouts (#9582 Sage Weil) * librbd: constrain max number of in-flight read requests (#9854 Jason Dillaman) * librbd: enforce cache size on read requests (Jason Dillaman) * librbd: fix invalid close in image open failure path (#10030 Jason Dillaman) * librbd: fix read hang on sparse files (Jason Dillaman) * librbd: gracefully handle deleted/renamed pools (#10270 #10122 Jason Dillaman) * librbd: protect list_children from invalid child pool ioctxs (#10123 Jason Dillaman) * mds: fix ctime updates from clients without dirty caps (#9514 Greg Farnum) * mds: fix rare NULL dereference in cap update path (Greg Farnum) * mds: fix assertion caused by system clock backwards (#11053 Yan, Zheng) * mds: store backtrace on straydir (Yan, Zheng) * osd: fix journal committed_thru update after replay (#6756 Samuel Just) * osd: fix memory leak, busy loop on snap trim (#9113 Samuel Just) * osd: fix misc peering, recovery bugs (#10168 Samuel Just) * osd: fix purged_snap field on backfill start (#9487 Sage Weil, Samuel Just) * osd: handle no-op write with snapshot corner case (#10262 Sage Weil, Loic Dachary) * osd: respect RWORDERED rados flag (Sage Weil) * osd: several backfill fixes and refactors (Samuel Just, David Zafman) * rgw: send http status reason explicitly in fastcgi (Yehuda Sadeh) v0.67.11 "Dumpling" =================== This stable update for Dumpling fixes several important bugs that affect a small set of users. We recommend that all Dumpling users upgrade at their convenience. If none of these issues are affecting your deployment there is no urgency. Notable Changes --------------- * common: fix sending dup cluster log items (#9080 Sage Weil) * doc: several doc updates (Alfredo Deza) * libcephfs-java: fix build against older JNI headesr (Greg Farnum) * librados: fix crash in op timeout path (#9362 Matthias Kiefer, Sage Weil) * librbd: fix crash using clone of flattened image (#8845 Josh Durgin) * librbd: fix error path cleanup when failing to open image (#8912 Josh Durgin) * mon: fix crash when adjusting pg_num before any OSDs are added (#9052 Sage Weil) * mon: reduce log noise from paxos (Aanchal Agrawal, Sage Weil) * osd: allow scrub and snap trim thread pool IO priority to be adjusted (Sage Weil) * osd: fix mount/remount sync race (#9144 Sage Weil) v0.67.10 "Dumpling" =================== This stable update release for Dumpling includes primarily fixes for RGW, including several issues with bucket listings and a potential data corruption problem when multiple multi-part uploads race. There is also some throttling capability added in the OSD for scrub that can mitigate the performance impact on production clusters. We recommend that all Dumpling users upgrade at their convenience. Notable Changes --------------- * ceph-disk: partprobe befoere settle, fixing dm-crypt (#6966, Eric Eastman) * librbd: add invalidate cache interface (Josh Durgin) * librbd: close image if remove_child fails (Ilya Dryomov) * librbd: fix potential null pointer dereference (Danny Al-Gaaf) * librbd: improve writeback checks, performance (Haomai Wang) * librbd: skip zeroes when copying image (#6257, Josh Durgin) * mon: fix rule(set) check on 'ceph pool set ... crush_ruleset ...' (#8599, John Spray) * mon: shut down if mon is removed from cluster (#6789, Joao Eduardo Luis) * osd: fix filestore perf reports to mon (Sage Weil) * osd: force any new or updated xattr into leveldb if E2BIG from XFS (#7779, Sage Weil) * osd: lock snapdir object during write to fix race with backfill (Samuel Just) * osd: option sleep during scrub (Sage Weil) * osd: set io priority on scrub and snap trim threads (Sage Weil) * osd: 'status' admin socket command (Sage Weil) * rbd: tolerate missing NULL terminator on block_name_prefix (#7577, Dan Mick) * rgw: calculate user manifest (#8169, Yehuda Sadeh) * rgw: fix abort on chunk read error, avoid using extra memory (#8289, Yehuda Sadeh) * rgw: fix buffer overflow on bucket instance id (#8608, Yehuda Sadeh) * rgw: fix crash in swift CORS preflight request (#8586, Yehuda Sadeh) * rgw: fix implicit removal of old objects on object creation (#8972, Patrycja Szablowska, Yehuda Sadeh) * rgw: fix MaxKeys in bucket listing (Yehuda Sadeh) * rgw: fix race with multiple updates to a single multipart object (#8269, Yehuda Sadeh) * rgw: improve bucket listing with delimiter (Yehuda Sadeh) * rgw: include NextMarker in bucket listing (#8858, Yehuda Sadeh) * rgw: return error early on non-existent bucket (#7064, Yehuda Sadeh) * rgw: set truncation flag correctly in bucket listing (Yehuda Sadeh) * sysvinit: continue starting daemons after pre-mount error (#8554, Sage Weil) For more detailed information, see :download:`the complete changelog `. v0.67.9 "Dumpling" ================== This Dumpling point release fixes several minor bugs. The most prevalent in the field is one that occasionally prevents OSDs from starting on recently created clusters. We recommend that all Dumpling users upgrade at their convenience. Notable Changes --------------- * ceph-fuse, libcephfs: client admin socket command to kick and inspect MDS sessions (#8021, Zheng Yan) * monclient: fix failure detection during mon handshake (#8278, Sage Weil) * mon: set tid on no-op PGStatsAck messages (#8280, Sage Weil) * msgr: fix a rare bug with connection negotiation between OSDs (Guang Yang) * osd: allow snap trim throttling with simple delay (#6278, Sage Weil) * osd: check for splitting when processing recover/backfill reservations (#6565, Samuel Just) * osd: fix backfill position tracking (#8162, Samuel Just) * osd: fix bug in backfill stats (Samuel Just) * osd: fix bug preventing OSD startup for infant clusters (#8162, Greg Farnum) * osd: fix rare PG resurrection race causing an incomplete PG (#7740, Samuel Just) * osd: only complete replicas count toward min_size (#7805, Samuel Just) * rgw: allow setting ACLs with empty owner (#6892, Yehuda Sadeh) * rgw: send user manifest header field (#8170, Yehuda Sadeh) For more detailed information, see :download:`the complete changelog `. v0.67.8 "Dumpling" ================== This Dumpling point release fixes several non-critical issues since v0.67.7. The most notable bug fixes are an auth fix in librbd (observed as an occasional crash from KVM), an improvement in the network failure detection with the monitor, and several hard to hit OSD crashes or hangs. We recommend that all users upgrade at their convenience. Upgrading --------- * The 'rbd ls' function now returns success and returns an empty when a pool does not store any rbd images. Previously it would return an ENOENT error. * Ceph will now issue a health warning if the 'mon osd down out interval' config option is set to zero. This warning can be disabled by adding 'mon warn on osd down out interval zero = false' to ceph.conf. Notable Changes --------------- * all: improve keepalive detection of failed monitor connections (#7888, Sage Weil) * ceph-fuse, libcephfs: pin inodes during readahead, fixing rare crash (#7867, Sage Weil) * librbd: make cache writeback a bit less aggressive (Sage Weil) * librbd: make symlink for qemu to detect librbd in RPM (#7293, Josh Durgin) * mon: allow 'hashpspool' pool flag to be set and unset (Loic Dachary) * mon: commit paxos state only after entire quorum acks, fixing rare race where prior round state is readable (#7736, Sage Weil) * mon: make elections and timeouts a bit more robust (#7212, Sage Weil) * mon: prevent extreme pool split operations (Greg Farnum) * mon: wait for quorum for get_version requests to close rare pool creation race (#7997, Sage Weil) * mon: warn on 'mon osd down out interval = 0' (#7784, Joao Luis) * msgr: fix byte-order for auth challenge, fixing auth errors on big-endian clients (#7977, Dan Mick) * msgr: fix occasional crash in authentication code (usually triggered by librbd) (#6840, Josh Durgin) * msgr: fix rebind() race (#6992, Xihui He) * osd: avoid timeouts during slow PG deletion (#6528, Samuel Just) * osd: fix bug in pool listing during recovery (#6633, Samuel Just) * osd: fix queue limits, fixing recovery stalls (#7706, Samuel Just) * osd: fix rare peering crashes (#6722, #6910, Samuel Just) * osd: fix rare recovery hang (#6681, Samuel Just) * osd: improve error handling on journal errors (#7738, Sage Weil) * osd: reduce load on the monitor from OSDMap subscriptions (Greg Farnum) * osd: rery GetLog on peer osd startup, fixing some rare peering stalls (#6909, Samuel Just) * osd: reset journal state on remount to fix occasional crash on OSD startup (#8019, Sage Weil) * osd: share maps with peers more aggressively (Greg Farnum) * rbd: make it harder to delete an rbd image that is currently in use (#7076, Ilya Drymov) * rgw: deny writes to secondary zone by non-system users (#6678, Yehuda Sadeh) * rgw: do'nt log system requests in usage log (#6889, Yehuda Sadeh) * rgw: fix bucket recreation (#6951, Yehuda Sadeh) * rgw: fix Swift range response (#7099, Julien Calvet, Yehuda Sadeh) * rgw: fix URL escaping (#8202, Yehuda Sadeh) * rgw: fix whitespace trimming in http headers (#7543, Yehuda Sadeh) * rgw: make multi-object deletion idempotent (#7346, Yehuda Sadeh) For more detailed information, see :download:`the complete changelog `. v0.67.7 "Dumpling" ================== This Dumpling point release fixes a few critical issues in v0.67.6. All v0.67.6 users are urgently encouraged to upgrade. We also recommend that all v0.67.5 (or older) users upgrade. Upgrading --------- * Once you have upgraded a radosgw instance or OSD to v0.67.7, you should not downgrade to a previous version. Notable Changes --------------- * ceph-disk: additional unit tests * librbd: revert caching behavior change in v0.67.6 * osd: fix problem reading xattrs due to incomplete backport in v0.67.6 * radosgw-admin: fix reading object policy For more detailed information, see :download:`the complete changelog `. v0.67.6 "Dumpling" ================== .. note: This release contains a librbd bug that is fixed in v0.67.7. Please upgrade to v0.67.7 and do not use v0.67.6. This Dumpling point release contains a number of important fixed for the OSD, monitor, and radosgw. Most significantly, a change that forces large object attributes to spill over into leveldb has been backported that can prevent objects and the cluster from being damaged by large attributes (which can be induced via the radosgw). There is also a set of fixes that improves data safety and RADOS semantics when the cluster becomes full and then non-full. We recommend that all 0.67.x Dumpling users skip this release and upgrade to v0.67.7. Upgrading --------- * The OSD has long contained a feature that allows large xattrs to spill over into the leveldb backing store in situations where not all local file systems are able to store them reliably. This option is now enabled unconditionally in order to avoid rare cases where storing large xattrs renders the object unreadable. This is known to be triggered by very large multipart objects, but could be caused by other workloads as well. Although there is some small risk that performance for certain workloads will degrade, it is more important that data be retrievable. Note that newer versions of Ceph (e.g., firefly) do some additional work to avoid the potential performance regression in this case, but that is current considered too complex for backport to the Dumpling stable series. * It is very dangerous to downgrade from v0.67.6 to a prior version of Dumpling. If the old version does not have 'filestore xattr use omap = true' it may not be able to read all xattrs for an object and can cause undefined behavior. Notable changes --------------- * ceph-disk: misc bug fixes, particularly on RHEL (Loic Dachary, Alfredo Deza, various) * ceph-fuse, libcephfs: fix crash from read over certain sparseness patterns (Sage Weil) * ceph-fuse, libcephfs: fix integer overflow for sync reads racing with appends (Sage Weil) * ceph.spec: fix udev rule when building RPM under RHEL (Derek Yarnell) * common: fix crash from bad format from admin socket (Loic Dachary) * librados: add optional timeouts (Josh Durgin) * librados: do not leak budget when resending localized or redirected ops (Josh Durgin) * librados, osd: fix and improve full cluster handling (Josh Durgin) * librbd: fix use-after-free when updating perfcounters during image close (Josh Durgin) * librbd: remove limit on objects in cache (Josh Durgin) * mon: avoid on-disk full OSDMap corruption from pg_temp removal (Sage Weil) * mon: avoid stray pg_temp entries from pool deletion race (Joao Eduardo Luis) * mon: do not generate spurious MDSMaps from laggy daemons (Joao Eduardo Luis) * mon: fix error code from 'osd rm|down|out|in ...' commands (Loic Dachary) * mon: include all health items in summary output (John Spray) * osd: fix occasional race/crash during startup (Sage Weil) * osd: ignore stray OSDMap messages during init (Sage Weil) * osd: unconditionally let xattrs overflow into leveldb (David Zafman) * rados: fix a few error checks for the CLI (Josh Durgin) * rgw: convert legacy bucket info objects on demand (Yehuda Sadeh) * rgw: fix bug causing system users to lose privileges (Yehuda Sadeh) * rgw: fix CORS bugs related to headers and case sensitivity (Robin H. Johnson) * rgw: fix multipart object listing (Yehuda Sadeh) * rgw: fix racing object creations (Yehuda Sadeh) * rgw: fix racing object put and delete (Yehuda Sadeh) * rgw: fix S3 auth when using response-* query string params (Sylvain Munaut) * rgw: use correct secret key for POST authentication (Robin H. Johnson) For more detailed information, see :download:`the complete changelog `. v0.67.5 "Dumpling" ================== This release includes a few critical bug fixes for the radosgw, including a fix for hanging operations on large objects. There are also several bug fixes for radosgw multi-site replications, and a few backported features. Also, notably, the 'osd perf' command (which dumps recent performance information about active OSDs) has been backported. We recommend that all 0.67.x Dumpling users upgrade. Notable changes --------------- * ceph-fuse: fix crash in caching code * mds: fix looping in populate_mydir() * mds: fix standby-replay race * mon: accept 'osd pool set ...' as string * mon: backport: 'osd perf' command to dump recent OSD performance stats * osd: add feature compat check for upcoming object sharding * osd: fix osd bench block size argument * rbd.py: increase parent name size limit * rgw: backport: allow wildcard in supported keystone roles * rgw: backport: improve swift COPY behavior * rgw: backport: log and open admin socket by default * rgw: backport: validate S3 tokens against keystone * rgw: fix bucket removal * rgw: fix client error code for chunked PUT failure * rgw: fix hang on large object GET * rgw: fix rare use-after-free * rgw: various DR bug fixes * sysvinit, upstart: prevent starting daemons using both init systems For more detailed information, see :download:`the complete changelog `. v0.67.4 "Dumpling" ================== This point release fixes an important performance issue with radosgw, keystone authentication token caching, and CORS. All users (especially those of rgw) are encouraged to upgrade. Notable changes --------------- * crush: fix invalidation of cached names * crushtool: do not crash on non-unique bucket ids * mds: be more careful when decoding LogEvents * mds: fix heap check debugging commands * mon: avoid rebuilding old full osdmaps * mon: fix 'ceph crush move ...' * mon: fix 'ceph osd crush reweight ...' * mon: fix writeout of full osdmaps during trim * mon: limit size of transactions * mon: prevent both unmanaged and pool snaps * osd: disable xattr size limit (prevents upload of large rgw objects) * osd: fix recovery op throttling * osd: fix throttling of log messages for very slow requests * rgw: drain pending requests before completing write * rgw: fix CORS * rgw: fix inefficient list::size() usage * rgw: fix keystone token expiration * rgw: fix minor memory leaks * rgw: fix null termination of buffer For more detailed information, see :download:`the complete changelog `. v0.67.3 "Dumpling" ================== This point release fixes a few important performance regressions with the OSD (both with CPU and disk utilization), as well as several other important but less common problems. We recommend that all production users upgrade. Notable Changes --------------- * ceph-disk: partprobe after creation journal partition * ceph-disk: specify fs type when mounting * ceph-post-file: new utility to help share logs and other files with ceph developers * libcephfs: fix truncate vs readahead race (crash) * mds: fix flock/fcntl lock deadlock * mds: fix rejoin loop when encountering pre-dumpling backpointers * mon: allow name and addr discovery during election stage * mon: always refresh after Paxos store_state (fixes recovery corner case) * mon: fix off-by-4x bug with osd byte counts * osd: add and disable 'pg log keys debug' by default * osd: add option to disable throttling * osd: avoid leveldb iterators for pg log append and trim * osd: fix readdir_r invocations * osd: use fdatasync instead of sync * radosgw: fix sysvinit script return status * rbd: relicense as LGPL2 * rgw: flush pending data on multipart upload * rgw: recheck object name during S3 POST * rgw: reorder init/startup * rpm: fix debuginfo package build For more detailed information, see :download:`the complete changelog `. v0.67.2 "Dumpling" ================== This is an imporant point release for Dumpling. Most notably, it fixes a problem when upgrading directly from v0.56.x Bobtail to v0.67.x Dumpling (without stopping at v0.61.x Cuttlefish along the way). It also fixes a problem with the CLI parsing of the CEPH_ARGS environment variable, high CPU utilization by the ceph-osd daemons, and cleans up the radosgw shutdown sequence. Notable Changes --------------- * objecter: resend linger requests when cluster goes from full to non-full * ceph: parse CEPH_ARGS environment variable * librados: fix small memory leak * osd: remove old log objects on upgrade (fixes bobtail -> dumpling jump) * osd: disable PGLog::check() via config option (fixes CPU burn) * rgw: drain requests on shutdown * rgw: misc memory leaks on shutdown For more detailed information, see :download:`the complete changelog `. v0.67.1 "Dumpling" ================== This is a minor point release for Dumpling that fixes problems with OpenStack and librbd hangs when caching is disabled. Notable changes --------------- * librados, librbd: fix constructor for python bindings with certain usages (in particular, that used by OpenStack) * librados, librbd: fix aio_flush wakeup when cache is disabled * librados: fix locking for aio completion refcounting * fixes 'ceph --admin-daemon ...' command error code on error * fixes 'ceph daemon ... config set ...' command for boolean config options. For more detailed information, see :download:`the complete changelog `. v0.67 "Dumpling" ================ This is the fourth major release of Ceph, code-named "Dumpling." The headline features for this release include: * Multi-site support for radosgw. This includes the ability to set up separate "regions" in the same or different Ceph clusters that share a single S3/Swift bucket/container namespace. * RESTful API endpoint for Ceph cluster administration. ceph-rest-api, a wrapper around ceph_rest_api.py, can be used to start up a test single-threaded HTTP server that provides access to cluster information and administration in very similar ways to the ceph commandline tool. ceph_rest_api.py can be used as a WSGI application for deployment in a more-capable web server. See ceph-rest-api.8 for more. * Object namespaces in librados. .. _Dumpling upgrade: Upgrade Sequencing ------------------ It is possible to do a rolling upgrade from Cuttlefish to Dumpling. #. Upgrade ceph-common on all nodes that will use the command line 'ceph' utility. #. Upgrade all monitors (upgrade ceph package, restart ceph-mon daemons). This can happen one daemon or host at a time. Note that because cuttlefish and dumpling monitors can't talk to each other, all monitors should be upgraded in relatively short succession to minimize the risk that an a untimely failure will reduce availability. #. Upgrade all osds (upgrade ceph package, restart ceph-osd daemons). This can happen one daemon or host at a time. #. Upgrade radosgw (upgrade radosgw package, restart radosgw daemons). Upgrading from v0.66 -------------------- * There is monitor internal protocol change, which means that v0.67 ceph-mon daemons cannot talk to v0.66 or older daemons. We recommend upgrading all monitors at once (or in relatively quick succession) to minimize the possibility of downtime. * The output of 'ceph status --format=json' or 'ceph -s --format=json' has changed to return status information in a more structured and usable format. * The 'ceph pg dump_stuck [threshold]' command used to require a --threshold or -t prefix to the threshold argument, but now does not. * Many more ceph commands now output formatted information; select with '--format=', where can be 'json', 'json-pretty', 'xml', or 'xml-pretty'. * The 'ceph pg ...' commands (like 'ceph pg query') are deprecated in favor of 'ceph tell ...'. This makes the distinction between 'ceph pg ' and 'ceph pg ' less awkward by making it clearer that the 'tell' commands are talking to the OSD serving the placement group, not the monitor. * The 'ceph --admin-daemon ' used to accept the command and arguments as either a single string or as separate arguments. It will now only accept the command spread across multiple arguments. This means that any script which does something like:: ceph --admin-daemon /var/run/ceph/ceph-osd.0.asok 'config set debug_ms 1' needs to remove the quotes. Also, note that the above can now be shortened to:: ceph daemon osd.0 config set debug_ms 1 * The radosgw caps were inconsistently documented to be either 'mon = allow r' or 'mon = allow rw'. The 'mon = allow rw' is required for radosgw to create its own pools. All documentation has been updated accordingly. * The radosgw copy object operation may return extra progress info during the operation. At this point it will only happen when doing cross zone copy operations. The S3 response will now return extra field under the container. The Swift response will now send the progress as a json array. * In v0.66 and v0.65 the HASHPSPOOL pool flag was enabled by default for new pools, but has been disabled again until Linux kernel client support reaches more distributions and users. * ceph-osd now requires a max file descriptor limit (e.g., ``ulimit -n ...``) of at least filestore_wbthrottle_(xfs|btrfs)_inodes_hard_limit (5000 by default) in order to accomodate the new write back throttle system. On Ubuntu, upstart now sets the fd limit to 32k. On other platforms, the sysvinit script will set it to 32k by default (still overrideable via max_open_files). If this field has been customized in ceph.conf it should likely be adjusted upwards. Upgrading from v0.61 "Cuttlefish" --------------------------------- In addition to the above notes about upgrading from v0.66: * There has been a huge revamp of the 'ceph' command-line interface implementation. The ``ceph-common`` client library needs to be upgrade before ``ceph-mon`` is restarted in order to avoid problems using the CLI (the old ``ceph`` client utility cannot talk to the new ``ceph-mon``). * The CLI is now very careful about sending the 'status' one-liner output to stderr and command output to stdout. Scripts relying on output should take care. * The 'ceph osd tell ...' and 'ceph mon tell ...' commands are no longer supported. Any callers should use:: ceph tell osd. ... ceph tell mon. ... The 'ceph mds tell ...' command is still there, but will soon also transition to 'ceph tell mds. ...' * The 'ceph osd crush add ...' command used to take one of two forms:: ceph osd crush add 123 osd.123 ceph osd crush add osd.123 This is because the id and crush name are redundant. Now only the simple form is supported, where the osd name/id can either be a bare id (integer) or name (osd.):: ceph osd crush add osd.123 ceph osd crush add 123 * There is now a maximum RADOS object size, configurable via 'osd max object size', defaulting to 100 GB. Note that this has no effect on RBD, CephFS, or radosgw, which all stripe over objects. If you are using librados and storing objects larger than that, you will need to adjust 'osd max object size', and should consider using smaller objects instead. * The 'osd min down {reporters|reports}' config options have been renamed to 'mon osd min down {reporters|reports}', and the documentation has been updated to reflect that these options apply to the monitors (who process failure reports) and not OSDs. If you have adjusted these settings, please update your ``ceph.conf`` accordingly. Notable changes since v0.66 --------------------------- * mon: sync improvements (performance and robustness) * mon: many bug fixes (paxos and services) * mon: fixed bugs in recovery and io rate reporting (negative/large values) * mon: collect metadata on osd performance * mon: generate health warnings from slow or stuck requests * mon: expanded --format= support for monitor commands * mon: scrub function for verifying data integrity * mon, osd: fix old osdmap trimming logic * mon: enable leveldb caching by default * mon: more efficient storage of PG metadata * ceph-rest-api: RESTful endpoint for administer cluster (mirrors CLI) * rgw: multi-region support * rgw: infrastructure to support georeplication of bucket and user metadata * rgw: infrastructure to support georeplication of bucket data * rgw: COPY object support between regions * rbd: /etc/ceph/rbdmap file for mapping rbd images on startup * osd: many bug fixes * osd: limit number of incremental osdmaps sent to peers (could cause osds to be wrongly marked down) * osd: more efficient small object recovery * osd, librados: support for object namespaces * osd: automatically enable xattrs on leveldb as necessary * mds: fix bug in LOOKUPINO (used by nfs reexport) * mds: fix O_TRUNC locking * msgr: fixed race condition in inter-osd network communication * msgr: fixed various memory leaks related to network sessions * ceph-disk: fixes for unusual device names, partition detection * hypertable: fixes for hypertable CephBroker bindings * use SSE4.2 crc32c instruction if present Notable changes since v0.61 "Cuttlefish" ---------------------------------------- * add 'config get' admin socket command * ceph-conf: --show-config-value now reflects daemon defaults * ceph-disk: add '[un]suppress-active DEV' command * ceph-disk: avoid mounting over an existing osd in /var/lib/ceph/osd/* * ceph-disk: fixes for unusual device names, partition detection * ceph-disk: improved handling of odd device names * ceph-disk: many fixes for RHEL/CentOS, Fedora, wheezy * ceph-disk: simpler, more robust locking * ceph-fuse, libcephfs: fix a few caps revocation bugs * ceph-fuse, libcephfs: fix read zeroing at EOF * ceph-fuse, libcephfs: fix request refcounting bug (hang on shutdown) * ceph-fuse, libcephfs: fix truncatation bug on >4MB files (Yan, Zheng) * ceph-fuse, libcephfs: fix for cap release/hang * ceph-fuse: add ioctl support * ceph-fuse: fixed long-standing O_NOATIME vs O_LAZY bug * ceph-rest-api: RESTful endpoint for administer cluster (mirrors CLI) * ceph, librados: fix resending of commands on mon reconnect * daemons: create /var/run/ceph as needed * debian wheezy: fix udev rules * debian, specfile: packaging cleanups * debian: fix upstart behavior with upgrades * debian: rgw: stop daemon on uninstall * debian: stop daemons on uninstall; fix dependencies * hypertable: fixes for hypertable CephBroker bindings * librados python binding cleanups * librados python: fix xattrs > 4KB (Josh Durgin) * librados: configurable max object size (default 100 GB) * librados: new calls to administer the cluster * librbd: ability to read from local replicas * librbd: locking tests (Josh Durgin) * librbd: make default options/features for newly created images (e.g., via qemu-img) configurable * librbd: parallelize delete, rollback, flatten, copy, resize * many many fixes from static code analysis (Danny Al-Gaaf) * mds: fix O_TRUNC locking * mds: fix bug in LOOKUPINO (used by nfs reexport) * mds: fix rare hang after client restart * mds: fix several bugs (Yan, Zheng) * mds: many backpointer improvements (Yan, Zheng) * mds: many fixes for mds clustering * mds: misc stability fixes (Yan, Zheng, Greg Farnum) * mds: new robust open-by-ino support (Yan, Zheng) * mds: support robust lookup by ino number (good for NFS) (Yan, Zheng) * mon, ceph: huge revamp of CLI and internal admin API. (Dan Mick) * mon, osd: fix old osdmap trimming logic * mon, osd: many memory leaks fixed * mon: better trim/compaction behavior * mon: collect metadata on osd performance * mon: enable leveldb caching by default * mon: expanded --format= support for monitor commands * mon: fix election timeout * mon: fix leveldb compression, trimming * mon: fix start fork behavior * mon: fix units in 'ceph df' output * mon: fix validation of mds ids from CLI commands * mon: fixed bugs in recovery and io rate reporting (negative/large values) * mon: generate health warnings from slow or stuck requests * mon: many bug fixes (paxos and services, sync) * mon: many stability fixes (Joao Luis) * mon: more efficient storage of PG metadata * mon: new --extract-monmap to aid disaster recovery * mon: new capability syntax * mon: scrub function for verifying data integrity * mon: simplify PaxosService vs Paxos interaction, fix readable/writeable checks * mon: sync improvements (performance and robustness) * mon: tuning, performance improvements * msgr: fix various memory leaks * msgr: fixed race condition in inter-osd network communication * msgr: fixed various memory leaks related to network sessions * osd, librados: support for object namespaces * osd, mon: optionally dump leveldb transactions to a log * osd: automatically enable xattrs on leveldb as necessary * osd: avoid osd flapping from asymmetric network failure * osd: break blacklisted client watches (David Zafman) * osd: close narrow journal race * osd: do not use fadvise(DONTNEED) on XFS (data corruption on power cycle) * osd: fix for an op ordering bug * osd: fix handling for split after upgrade from bobtail * osd: fix incorrect mark-down of osds * osd: fix internal heartbeart timeouts when scrubbing very large objects * osd: fix memory/network inefficiency during deep scrub * osd: fixed problem with front-side heartbeats and mixed clusters (David Zafman) * osd: limit number of incremental osdmaps sent to peers (could cause osds to be wrongly marked down) * osd: many bug fixes * osd: monitor both front and back interfaces * osd: more efficient small object recovery * osd: new writeback throttling (for less bursty write performance) (Sam Just) * osd: pg log (re)writes are now vastly more efficient (faster peering) (Sam Just) * osd: ping/heartbeat on public and private interfaces * osd: prioritize recovery for degraded PGs * osd: re-use partially deleted PG contents when present (Sam Just) * osd: recovery and peering performance improvements * osd: resurrect partially deleted PGs * osd: verify both front and back network are working before rejoining cluster * rados: clonedata command for cli * radosgw-admin: create keys for new users by default * rbd: /etc/ceph/rbdmap file for mapping rbd images on startup * rgw: COPY object support between regions * rgw: fix CORS bugs * rgw: fix locking issue, user operation mask, * rgw: fix radosgw-admin buckets list (Yehuda Sadeh) * rgw: fix usage log scanning for large, untrimmed logs * rgw: handle deep uri resources * rgw: infrastructure to support georeplication of bucket and user metadata * rgw: infrastructure to support georeplication of bucket data * rgw: multi-region support * sysvinit: fix enumeration of local daemons * sysvinit: fix osd crush weight calculation when using -a * sysvinit: handle symlinks in /var/lib/ceph/osd/* * use SSE4.2 crc32c instruction if present v0.66 ===== Upgrading --------- * There is now a configurable maximum rados object size, defaulting to 100 GB. If you are using librados and storing objects larger than that, you will need to adjust 'osd max object size', and should consider using smaller objects instead. Notable changes --------------- * osd: pg log (re)writes are now vastly more efficient (faster peering) (Sam Just) * osd: fixed problem with front-side heartbeats and mixed clusters (David Zafman) * mon: tuning, performance improvements * mon: simplify PaxosService vs Paxos interaction, fix readable/writeable checks * rgw: fix radosgw-admin buckets list (Yehuda Sadeh) * mds: support robust lookup by ino number (good for NFS) (Yan, Zheng) * mds: fix several bugs (Yan, Zheng) * ceph-fuse, libcephfs: fix truncatation bug on >4MB files (Yan, Zheng) * ceph/librados: fix resending of commands on mon reconnect * librados python: fix xattrs > 4KB (Josh Durgin) * librados: configurable max object size (default 100 GB) * msgr: fix various memory leaks * ceph-fuse: fixed long-standing O_NOATIME vs O_LAZY bug * ceph-fuse, libcephfs: fix request refcounting bug (hang on shutdown) * ceph-fuse, libcephfs: fix read zeroing at EOF * ceph-conf: --show-config-value now reflects daemon defaults * ceph-disk: simpler, more robust locking * ceph-disk: avoid mounting over an existing osd in /var/lib/ceph/osd/* * sysvinit: handle symlinks in /var/lib/ceph/osd/* v0.65 ===== Upgrading --------- * Huge revamp of the 'ceph' command-line interface implementation. The ``ceph-common`` client library needs to be upgrade before ``ceph-mon`` is restarted in order to avoid problems using the CLI (the old ``ceph`` client utility cannot talk to the new ``ceph-mon``). * The CLI is now very careful about sending the 'status' one-liner output to stderr and command output to stdout. Scripts relying on output should take care. * The 'ceph osd tell ...' and 'ceph mon tell ...' commands are no longer supported. Any callers should use:: ceph tell osd. ... ceph tell mon. ... The 'ceph mds tell ...' command is still there, but will soon also transition to 'ceph tell mds. ...' * The 'ceph osd crush add ...' command used to take one of two forms:: ceph osd crush add 123 osd.123 ceph osd crush add osd.123 This is because the id and crush name are redundant. Now only the simple form is supported, where the osd name/id can either be a bare id (integer) or name (osd.):: ceph osd crush add osd.123 ceph osd crush add 123 * There is now a maximum RADOS object size, configurable via 'osd max object size', defaulting to 100 GB. Note that this has no effect on RBD, CephFS, or radosgw, which all stripe over objects. Notable changes --------------- * mon, ceph: huge revamp of CLI and internal admin API. (Dan Mick) * mon: new capability syntax * osd: do not use fadvise(DONTNEED) on XFS (data corruption on power cycle) * osd: recovery and peering performance improvements * osd: new writeback throttling (for less bursty write performance) (Sam Just) * osd: ping/heartbeat on public and private interfaces * osd: avoid osd flapping from asymmetric network failure * osd: re-use partially deleted PG contents when present (Sam Just) * osd: break blacklisted client watches (David Zafman) * mon: many stability fixes (Joao Luis) * mon, osd: many memory leaks fixed * mds: misc stability fixes (Yan, Zheng, Greg Farnum) * mds: many backpointer improvements (Yan, Zheng) * mds: new robust open-by-ino support (Yan, Zheng) * ceph-fuse, libcephfs: fix a few caps revocation bugs * librados: new calls to administer the cluster * librbd: locking tests (Josh Durgin) * ceph-disk: improved handling of odd device names * ceph-disk: many fixes for RHEL/CentOS, Fedora, wheezy * many many fixes from static code analysis (Danny Al-Gaaf) * daemons: create /var/run/ceph as needed v0.64 ===== Upgrading --------- * New pools now have the HASHPSPOOL flag set by default to provide better distribution over OSDs. Support for this feature was introduced in v0.59 and Linux kernel version v3.9. If you wish to access the cluster from an older kernel, set the 'osd pool default flag hashpspool = false' option in your ceph.conf prior to creating the cluster or creating new pools. Note that the presense of any pool in the cluster with the flag enabled will make the OSD require support from all clients. Notable changes --------------- * osd: monitor both front and back interfaces * osd: verify both front and back network are working before rejoining cluster * osd: fix memory/network inefficiency during deep scrub * osd: fix incorrect mark-down of osds * mon: fix start fork behavior * mon: fix election timeout * mon: better trim/compaction behavior * mon: fix units in 'ceph df' output * mon, osd: misc memory leaks * librbd: make default options/features for newly created images (e.g., via qemu-img) configurable * mds: many fixes for mds clustering * mds: fix rare hang after client restart * ceph-fuse: add ioctl support * ceph-fuse/libcephfs: fix for cap release/hang * rgw: handle deep uri resources * rgw: fix CORS bugs * ceph-disk: add '[un]suppress-active DEV' command * debian: rgw: stop daemon on uninstall * debian: fix upstart behavior with upgrades v0.63 ===== Upgrading --------- * The 'osd min down {reporters|reports}' config options have been renamed to 'mon osd min down {reporters|reports}', and the documentation has been updated to reflect that these options apply to the monitors (who process failure reports) and not OSDs. If you have adjusted these settings, please update your ``ceph.conf`` accordingly. Notable Changes --------------- * librbd: parallelize delete, rollback, flatten, copy, resize * librbd: ability to read from local replicas * osd: resurrect partially deleted PGs * osd: prioritize recovery for degraded PGs * osd: fix internal heartbeart timeouts when scrubbing very large objects * osd: close narrow journal race * rgw: fix usage log scanning for large, untrimmed logs * rgw: fix locking issue, user operation mask, * initscript: fix osd crush weight calculation when using -a * initscript: fix enumeration of local daemons * mon: several fixes to paxos, sync * mon: new --extract-monmap to aid disaster recovery * mon: fix leveldb compression, trimming * add 'config get' admin socket command * rados: clonedata command for cli * debian: stop daemons on uninstall; fix dependencies * debian wheezy: fix udev rules * many many small fixes from coverity scan v0.62 ===== Notable Changes --------------- * mon: fix validation of mds ids from CLI commands * osd: fix for an op ordering bug * osd, mon: optionally dump leveldb transactions to a log * osd: fix handling for split after upgrade from bobtail * debian, specfile: packaging cleanups * radosgw-admin: create keys for new users by default * librados python binding cleanups * misc code cleanups v0.61.9 "Cuttlefish" ==================== This point release resolves several low to medium-impact bugs across the code base, and fixes a performance problem (CPU utilization) with radosgw. We recommend that all production cuttlefish users upgrade. Notable Changes --------------- * ceph, ceph-authtool: fix help (Danny Al-Gaaf) * ceph-disk: partprobe after creating journal partition * ceph-disk: specific fs type when mounting (Alfredo Deza) * ceph-fuse: fix bug when compiled against old versions * ceph-fuse: fix use-after-free in caching code (Yan, Zheng) * ceph-fuse: misc caching bugs * ceph.spec: remove incorrect mod_fcgi dependency (Gary Lowell) * crush: fix name caching * librbd: fix bug when unpausing cluster (Josh Durgin) * mds: fix LAZYIO lock hang * mds: fix bug in file size recovery (after client crash) * mon: fix paxos recovery corner case * osd: fix exponential backoff for slow request warnings (Loic Dachary) * osd: fix readdir_r usage * osd: fix startup for long-stopped OSDs * rgw: avoid std::list::size() to avoid wasting CPU cycles (Yehuda Sadeh) * rgw: drain pending requests during write (fixes data safety issue) (Yehuda Sadeh) * rgw: fix authenticated users group ACL check (Yehuda Sadeh) * rgw: fix bug in POST (Yehuda Sadeh) * rgw: fix sysvinit script 'status' command, return value (Danny Al-Gaaf) * rgw: reduce default log level (Yehuda Sadeh) For more detailed information, see :download:`the complete changelog `. v0.61.8 "Cuttlefish" ==================== This release includes a number of important issues, including rare race conditions in the OSD, a few monitor bugs, and fixes for RBD flush behavior. We recommend that production users upgrade at their convenience. Notable Changes --------------- * librados: fix async aio completion wakeup * librados: fix aio completion locking * librados: fix rare deadlock during shutdown * osd: fix race when queueing recovery operations * osd: fix possible race during recovery * osd: optionally preload rados classes on startup (disabled by default) * osd: fix journal replay corner condition * osd: limit size of peering work queue batch (to speed up peering) * mon: fix paxos recovery corner case * mon: fix rare hang when monmap updates during an election * mon: make 'osd pool mksnap ...' avoid exposing uncommitted state * mon: make 'osd pool rmsnap ...' not racy, avoid exposing uncommitted state * mon: fix bug during mon cluster expansion * rgw: fix crash during multi delete operation * msgr: fix race conditions during osd network reinitialization * ceph-disk: apply mount options when remounting For more detailed information, see :download:`the complete changelog `. v0.61.7 "Cuttlefish" ==================== This release fixes another regression preventing monitors to start after undergoing certain upgrade sequences, as well as some corner cases with Paxos and support for unusual device names in ceph-disk/ceph-deploy. Notable Changes --------------- * mon: fix regression in latest full osdmap retrieval * mon: fix a long-standing bug in a paxos corner case * ceph-disk: improved support for unusual device names (e.g., /dev/cciss/c0d0) For more detailed information, see :download:`the complete changelog `. v0.61.6 "Cuttlefish" ==================== This release fixes a regression in v0.61.5 that could prevent monitors from restarting. This affects any cluster that was upgraded from a previous version of Ceph (and not freshly created with v0.61.5). All users are strongly recommended to upgrade. Notable Changes --------------- * mon: record latest full osdmap * mon: work around previous bug in which latest full osdmap is not recorded * mon: avoid scrub while updating For more detailed information, see :download:`the complete changelog `. v0.61.5 "Cuttlefish" ==================== This release most improves stability of the monitor and fixes a few bugs with the ceph-disk utility (used by ceph-deploy). We recommand that all v0.61.x users upgrade. Upgrading --------- * This release fixes a 32-bit vs 64-bit arithmetic bug with the feature bits. An unfortunate consequence of the fix is that 0.61.4 (or earlier) ceph-mon daemons can't form a quorum with 0.61.5 (or later) monitors. To avoid the possibility of service disruption, we recommend you upgrade all monitors at once. Notable Changes --------------- * mon: misc sync improvements (faster, more reliable, better tuning) * mon: enable leveldb cache by default (big performance improvement) * mon: new scrub feature (primarily for diagnostic, testing purposes) * mon: fix occasional leveldb assertion on startup * mon: prevent reads until initial state is committed * mon: improved logic for trimming old osdmaps * mon: fix pick_addresses bug when expanding mon cluster * mon: several small paxos fixes, improvements * mon: fix bug osdmap trim behavior * osd: fix several bugs with PG stat reporting * osd: limit number of maps shared with peers (which could cause domino failures) * rgw: fix radosgw-admin buckets list (for all buckets) * mds: fix occasional client failure to reconnect * mds: fix bad list traversal after unlink * mds: fix underwater dentry cleanup (occasional crash after mds restart) * libcephfs, ceph-fuse: fix occasional hangs on umount * libcephfs, ceph-fuse: fix old bug with O_LAZY vs O_NOATIME confusion * ceph-disk: more robust journal device detection on RHEL/CentOS * ceph-disk: better, simpler locking * ceph-disk: do not inadvertantely mount over existing osd mounts * ceph-disk: better handling for unusual device names * sysvinit, upstart: handle symlinks in /var/lib/ceph/* For more detailed information, see :download:`the complete changelog `. v0.61.4 "Cuttlefish" ==================== This release resolves a possible data corruption on power-cycle when using XFS, a few outstanding problems with monitor sync, several problems with ceph-disk and ceph-deploy operation, and a problem with OSD memory usage during scrub. Upgrading --------- * No issues. Notable Changes --------------- * mon: fix daemon exit behavior when error is encountered on startup * mon: more robust sync behavior * osd: do not use sync_file_range(2), posix_fadvise(...DONTNEED) (can cause data corruption on power loss on XFS) * osd: avoid unnecessary log rewrite (improves peering speed) * osd: fix scrub efficiency bug (problematic on old clusters) * rgw: fix listing objects that start with underscore * rgw: fix deep URI resource, CORS bugs * librados python binding: fix truncate on 32-bit architectures * ceph-disk: fix udev rules * rpm: install sysvinit script on package install * ceph-disk: fix OSD start on machine reboot on Debian wheezy * ceph-disk: activate OSD when journal device appears second * ceph-disk: fix various bugs on RHEL/CentOS 6.3 * ceph-disk: add 'zap' command * ceph-disk: add '[un]suppress-activate' command for preparing spare disks * upstart: start on runlevel [2345] (instead of after the first network interface starts) * ceph-fuse, libcephfs: handle mds session reset during session open * ceph-fuse, libcephfs: fix two capability revocation bugs * ceph-fuse: fix thread creation on startup * all daemons: create /var/run/ceph directory on startup if missing For more detailed information, see :download:`the complete changelog `. v0.61.3 "Cuttlefish" ==================== This release resolves a number of problems with the monitors and leveldb that users have been seeing. Please upgrade. Upgrading --------- * There is one known problem with mon upgrades from bobtail. If the ceph-mon conversion on startup is aborted or fails for some reason, we do not correctly error out, but instead continue with (in certain cases) odd results. Please be careful if you have to restart the mons during the upgrade. A 0.61.4 release with a fix will be out shortly. * In the meantime, for current cuttlefish users, v0.61.3 is safe to use. Notable Changes --------------- * mon: paxos state trimming fix (resolves runaway disk usage) * mon: finer-grained compaction on trim * mon: discard messages from disconnected clients (lowers load) * mon: leveldb compaction and other stats available via admin socket * mon: async compaction (lower overhead) * mon: fix bug incorrectly marking osds down with insufficient failure reports * osd: fixed small bug in pg request map * osd: avoid rewriting pg info on every osdmap * osd: avoid internal heartbeta timeouts when scrubbing very large objects * osd: fix narrow race with journal replay * mon: fixed narrow pg split race * rgw: fix leaked space when copying object * rgw: fix iteration over large/untrimmed usage logs * rgw: fix locking issue with ops log socket * rgw: require matching version of librados * librbd: make image creation defaults configurable (e.g., create format 2 images via qemu-img) * fix units in 'ceph df' output * debian: fix prerm/postinst hooks to start/stop daemons appropriately * upstart: allow uppercase daemons names (and thus hostnames) * sysvinit: fix enumeration of local daemons by type * sysvinit: fix osd weight calcuation when using -a * fix build on unsigned char platforms (e.g., arm) For more detailed information, see :download:`the complete changelog `. v0.61.2 "Cuttlefish" ==================== This release disables a monitor debug log that consumes disk space and fixes a bug when upgrade some monitors from bobtail to cuttlefish. Notable Changes --------------- * mon: fix conversion of stores with duplicated GV values * mon: disable 'mon debug dump transactions' by default For more detailed information, see :download:`the complete changelog `. v0.61.1 "Cuttlefish" ==================== This release fixes a problem when upgrading a bobtail cluster that had snapshots to cuttlefish. Notable Changes --------------- * osd: handle upgrade when legacy snap collections are present; repair from previous failed restart * ceph-create-keys: fix race with ceph-mon startup (which broke 'ceph-deploy gatherkeys ...') * ceph-create-keys: gracefully handle bad response from ceph-osd * sysvinit: do not assume default osd_data when automatically weighting OSD * osd: avoid crash from ill-behaved classes using getomapvals * debian: fix squeeze dependency * mon: debug options to log or dump leveldb transactions For more detailed information, see :download:`the complete changelog `. v0.61 "Cuttlefish" ================== Upgrading from v0.60 -------------------- * The ceph-deploy tool is now the preferred method of provisioning new clusters. For existing clusters created via mkcephfs that would like to transition to the new tool, there is a migration path, documented at `Transitioning to ceph-deploy`_. * The sysvinit script (/etc/init.d/ceph) will now verify (and, if necessary, update) the OSD's position in the CRUSH map on startup. (The upstart script has always worked this way.) By default, this ensures that the OSD is under a 'host' with a name that matches the hostname (``hostname -s``). Legacy clusters create with mkcephfs do this by default, so this should not cause any problems, but legacy clusters with customized CRUSH maps with an alternate structure should set ``osd crush update on start = false``. * radosgw-admin now uses the term zone instead of cluster to describe each instance of the radosgw data store (and corresponding collection of radosgw daemons). The usage for the radosgw-admin command and the 'rgw zone root pool' config options have changed accordingly. * rbd progress indicators now go to standard error instead of standard out. (You can disable progress with --no-progress.) * The 'rbd resize ...' command now requires the --allow-shrink option when resizing to a smaller size. Expanding images to a larger size is unchanged. * Please review the changes going back to 0.56.4 if you are upgrading all the way from bobtail. * The old 'ceph stop_cluster' command has been removed. * The sysvinit script now uses the ceph.conf file on the remote host when starting remote daemons via the '-a' option. Note that if '-a' is used in conjunction with '-c path', the path must also be present on the remote host (it is not copied to a temporary file, as it was previously). Upgrading from v0.56.4 "Bobtail" -------------------------------- Please see `Upgrading from Bobtail to Cuttlefish`_ for details. .. _Upgrading from Bobtail to Cuttlefish: ../install/upgrading-ceph/#upgrading-from-bobtail-to-cuttlefish * The ceph-deploy tool is now the preferred method of provisioning new clusters. For existing clusters created via mkcephfs that would like to transition to the new tool, there is a migration path, documented at `Transitioning to ceph-deploy`_. .. _Transitioning to ceph-deploy: ../rados/deployment/ceph-deploy-transition * The sysvinit script (/etc/init.d/ceph) will now verify (and, if necessary, update) the OSD's position in the CRUSH map on startup. (The upstart script has always worked this way.) By default, this ensures that the OSD is under a 'host' with a name that matches the hostname (``hostname -s``). Legacy clusters create with mkcephfs do this by default, so this should not cause any problems, but legacy clusters with customized CRUSH maps with an alternate structure should set ``osd crush update on start = false``. * radosgw-admin now uses the term zone instead of cluster to describe each instance of the radosgw data store (and corresponding collection of radosgw daemons). The usage for the radosgw-admin command and the 'rgw zone root pool' config optoins have changed accordingly. * rbd progress indicators now go to standard error instead of standard out. (You can disable progress with --no-progress.) * The 'rbd resize ...' command now requires the --allow-shrink option when resizing to a smaller size. Expanding images to a larger size is unchanged. * Please review the changes going back to 0.56.4 if you are upgrading all the way from bobtail. * The old 'ceph stop_cluster' command has been removed. * The sysvinit script now uses the ceph.conf file on the remote host when starting remote daemons via the '-a' option. Note that if '-a' is used in conjuction with '-c path', the path must also be present on the remote host (it is not copied to a temporary file, as it was previously). * The monitor is using a completely new storage strategy and intra-cluster protocol. This means that cuttlefish and bobtail monitors do not talk to each other. When you upgrade each one, it will convert its local data store to the new format. Once you upgrade a majority, the quorum will be formed using the new protocol and the old monitors will be blocked out until they too get upgraded. For this reason, we recommend not running a mixed-version cluster for very long. * ceph-mon now requires the creation of its data directory prior to --mkfs, similarly to what happens on ceph-osd. This directory is no longer automatically created, and custom scripts should be adjusted to reflect just that. * The monitor now enforces that MDS names be unique. If you have multiple daemons start with with the same id (e.g., ``mds.a``) the second one will implicitly mark the first as failed. This makes things less confusing and makes a daemon restart faster (we no longer wait for the stopped daemon to time out) but existing multi-mds configurations may need to be adjusted accordingly to give daemons unique names. * The 'ceph osd pool delete ' and 'rados rmpool ' now have safety interlocks with loud warnings that make you confirm pool removal. Any scripts curenty rely on these functions zapping data without confirmation need to be adjusted accordingly. Notable Changes from v0.60 -------------------------- * rbd: incremental backups * rbd: only set STRIPINGV2 feature if striping parameters are incompatible with old versions * rbd: require --allow-shrink for resizing images down * librbd: many bug fixes * rgw: management REST API * rgw: fix object corruption on COPY to self * rgw: new sysvinit script for rpm-based systems * rgw: allow buckets with '_' * rgw: CORS support * mon: many fixes * mon: improved trimming behavior * mon: fix data conversion/upgrade problem (from bobtail) * mon: ability to tune leveldb * mon: config-keys service to store arbitrary data on monitor * mon: 'osd crush add|link|unlink|add-bucket ...' commands * mon: trigger leveldb compaction on trim * osd: per-rados pool quotas (objects, bytes) * osd: tool to export, import, and delete PGs from an individual OSD data store * osd: notify mon on clean shutdown to avoid IO stall * osd: improved detection of corrupted journals * osd: ability to tune leveldb * osd: improve client request throttling * osd, librados: fixes to the LIST_SNAPS operation * osd: improvements to scrub error repair * osd: better prevention of wedging OSDs with ENOSPC * osd: many small fixes * mds: fix xattr handling on root inode * mds: fixed bugs in journal replay * mds: many fixes * librados: clean up snapshot constant definitions * libcephfs: calls to query CRUSH topology (used by Hadoop) * ceph-fuse, libcephfs: misc fixes to mds session management * ceph-fuse: disabled cache invalidation (again) due to potential deadlock with kernel * sysvinit: try to start all daemons despite early failures * ceph-disk: new 'list' command * ceph-disk: hotplug fixes for RHEL/CentOS * ceph-disk: fix creation of OSD data partitions on >2TB disks * osd: fix udev rules for RHEL/CentOS systems * fix daemon logging during initial startup Notable changes from v0.56 "Bobtail" ------------------------------------ * always use installed system leveldb (Gary Lowell) * auth: ability to require new cephx signatures on messages (still off by default) * buffer unit testing (Loic Dachary) * ceph tool: some CLI interface cleanups * ceph-disk: improve multicluster support, error handling (Sage Weil) * ceph-disk: support for dm-crypt (Alexandre Marangone) * ceph-disk: support for sysvinit, directories or partitions (not full disks) * ceph-disk: fix mkfs args on old distros (Alexandre Marangone) * ceph-disk: fix creation of OSD data partitions on >2TB disks * ceph-disk: hotplug fixes for RHEL/CentOS * ceph-disk: new 'list' command * ceph-fuse, libcephfs: misc fixes to mds session management * ceph-fuse: disabled cache invalidation (again) due to potential deadlock with kernel * ceph-fuse: enable kernel cache invalidation (Sam Lang) * ceph-fuse: fix statfs(2) reporting * ceph-fuse: session handling cleanup, bug fixes (Sage Weil) * crush: ability to create, remove rules via CLI * crush: update weights for all instances of an item, not just the first (Sage Weil) * fix daemon logging during initial startup * fixed log rotation (Gary Lowell) * init-ceph, mkcephfs: close a few security holes with -a (Sage Weil) * libcephfs: calls to query CRUSH topology (used by Hadoop) * libcephfs: many fixes, cleanups with the Java bindings * libcephfs: new topo API requests for Hadoop (Noah Watkins) * librados: clean up snapshot constant definitions * librados: fix linger bugs (Josh Durgin) * librbd: fixed flatten deadlock (Josh Durgin) * librbd: fixed some locking issues with flatten (Josh Durgin) * librbd: many bug fixes * librbd: optionally wait for flush before enabling writeback (Josh Durgin) * many many cleanups (Danny Al-Gaaf) * mds, ceph-fuse: fix bugs with replayed requests after MDS restart (Sage Weil) * mds, ceph-fuse: manage layouts via xattrs * mds: allow xattrs on root * mds: fast failover between MDSs (enforce unique mds names) * mds: fix xattr handling on root inode * mds: fixed bugs in journal replay * mds: improve session cleanup (Sage Weil) * mds: many fixes (Yan Zheng) * mds: misc bug fixes with clustered MDSs and failure recovery * mds: misc bug fixes with readdir * mds: new encoding for all data types (to allow forward/backward compatbility) (Greg Farnum) * mds: store and update backpointers/traces on directory, file objects (Sam Lang) * mon: 'osd crush add|link|unlink|add-bucket ...' commands * mon: ability to tune leveldb * mon: approximate recovery, IO workload stats * mon: avoid marking entire CRUSH subtrees out (e.g., if an entire rack goes offline) * mon: config-keys service to store arbitrary data on monitor * mon: easy adjustment of crush tunables via 'ceph osd crush tunables ...' * mon: easy creation of crush rules vai 'ceph osd rule ...' * mon: fix data conversion/upgrade problem (from bobtail) * mon: improved trimming behavior * mon: many fixes * mon: new 'ceph df [detail]' command * mon: new checks for identifying and reporting clock drift * mon: rearchitected to utilize single instance of paxos and a key/value store (Joao Luis) * mon: safety check for pool deletion * mon: shut down safely if disk approaches full (Joao Luis) * mon: trigger leveldb compaction on trim * msgr: fix comparison of IPv6 addresses (fixes monitor bringup via ceph-deploy, chef) * msgr: fixed race in connection reset * msgr: optionally tune TCP buffer size to avoid throughput collapse (Jim Schutt) * much code cleanup and optimization (Danny Al-Gaaf) * osd, librados: ability to list watchers (David Zafman) * osd, librados: fixes to the LIST_SNAPS operation * osd, librados: new listsnaps command (David Zafman) * osd: a few journaling bug fixes * osd: ability to tune leveldb * osd: add 'noscrub', 'nodeepscrub' osdmap flags (David Zafman) * osd: better prevention of wedging OSDs with ENOSPC * osd: ceph-filestore-dump tool for debugging * osd: connection handling bug fixes * osd: deep-scrub omap keys/values * osd: default to libaio for the journal (some performance boost) * osd: fix hang in 'journal aio = true' mode (Sage Weil) * osd: fix pg log trimming (avoids memory bloat on degraded clusters) * osd: fix udev rules for RHEL/CentOS systems * osd: fixed bug in journal checksums (Sam Just) * osd: improved client request throttling * osd: improved handling when disk fills up (David Zafman) * osd: improved journal corruption detection (Sam Just) * osd: improved detection of corrupted journals * osd: improvements to scrub error repair * osd: make tracking of object snapshot metadata more efficient (Sam Just) * osd: many small fixes * osd: misc fixes to PG split (Sam Just) * osd: move pg info, log into leveldb (== better performance) (David Zafman) * osd: notify mon on clean shutdown to avoid IO stall * osd: per-rados pool quotas (objects, bytes) * osd: refactored watch/notify infrastructure (fixes protocol, removes many bugs) (Sam Just) * osd: support for improved hashing of PGs across OSDs via HASHPSPOOL pool flag and feature * osd: tool to export, import, and delete PGs from an individual OSD data store * osd: trim log more aggressively, avoid appearance of leak memory * osd: validate snap collections on startup * osd: verify snap collections on startup (Sam Just) * radosgw: ACL grants in headers (Caleb Miles) * radosgw: ability to listen to fastcgi via a port (Guilhem Lettron) * radosgw: fix object copy onto self (Yehuda Sadeh) * radosgw: misc fixes * rbd-fuse: new tool, package * rbd: avoid FIEMAP when importing from file (it can be buggy) * rbd: incremental backups * rbd: only set STRIPINGV2 feature if striping parameters are incompatible with old versions * rbd: require --allow-shrink for resizing images down * rbd: udevadm settle on map/unmap to avoid various races (Dan Mick) * rbd: wait for udev to settle in strategic places (avoid spurious errors, failures) * rgw: CORS support * rgw: allow buckets with '_' * rgw: fix Content-Length on 32-bit machines (Jan Harkes) * rgw: fix log rotation * rgw: fix object corruption on COPY to self * rgw: fixed >4MB range requests (Jan Harkes) * rgw: new sysvinit script for rpm-based systems * rpm/deb: do not remove /var/lib/ceph on purge (v0.59 was the only release to do so) * sysvinit: try to start all daemons despite early failures * upstart: automatically set osd weight based on df (Guilhem Lettron) * use less memory for logging by default v0.60 ===== Upgrading --------- * Please note that the recently added librados 'list_snaps' function call is in a state of flux and is changing slightly in v0.61. You are advised not to make use of it in v0.59 or v0.60. Notable Changes --------------- * osd: make tracking of object snapshot metadata more efficient (Sam Just) * osd: misc fixes to PG split (Sam Just) * osd: improve journal corruption detection (Sam Just) * osd: improve handling when disk fills up (David Zafman) * osd: add 'noscrub', 'nodeepscrub' osdmap flags (David Zafman) * osd: fix hang in 'journal aio = true' mode (Sage Weil) * ceph-disk-prepare: fix mkfs args on old distros (Alexandre Marangone) * ceph-disk-activate: improve multicluster support, error handling (Sage Weil) * librbd: optionally wait for flush before enabling writeback (Josh Durgin) * crush: update weights for all instances of an item, not just the first (Sage Weil) * mon: shut down safely if disk approaches full (Joao Luis) * rgw: fix Content-Length on 32-bit machines (Jan Harkes) * mds: store and update backpointers/traces on directory, file objects (Sam Lang) * mds: improve session cleanup (Sage Weil) * mds, ceph-fuse: fix bugs with replayed requests after MDS restart (Sage Weil) * ceph-fuse: enable kernel cache invalidation (Sam Lang) * libcephfs: new topo API requests for Hadoop (Noah Watkins) * ceph-fuse: session handling cleanup, bug fixes (Sage Weil) * much code cleanup and optimization (Danny Al-Gaaf) * use less memory for logging by default * upstart: automatically set osd weight based on df (Guilhem Lettron) * init-ceph, mkcephfs: close a few security holes with -a (Sage Weil) * rpm/deb: do not remove /var/lib/ceph on purge (v0.59 was the only release to do so) v0.59 ===== Upgrading --------- * The monitor is using a completely new storage strategy and intra-cluster protocol. This means that v0.59 and pre-v0.59 monitors do not talk to each other. When you upgrade each one, it will convert its local data store to the new format. Once you upgrade a majority, the quorum will be formed using the new protocol and the old monitors will be blocked out until they too get upgraded. For this reason, we recommend not running a mixed-version cluster for very long. * ceph-mon now requires the creation of its data directory prior to --mkfs, similarly to what happens on ceph-osd. This directory is no longer automatically created, and custom scripts should be adjusted to reflect just that. Notable Changes --------------- * mon: rearchitected to utilize single instance of paxos and a key/value store (Joao Luis) * mon: new 'ceph df [detail]' command * osd: support for improved hashing of PGs across OSDs via HASHPSPOOL pool flag and feature * osd: refactored watch/notify infrastructure (fixes protocol, removes many bugs) (Sam Just) * osd, librados: ability to list watchers (David Zafman) * osd, librados: new listsnaps command (David Zafman) * osd: trim log more aggressively, avoid appearance of leak memory * osd: misc split fixes * osd: a few journaling bug fixes * osd: connection handling bug fixes * rbd: avoid FIEMAP when importing from file (it can be buggy) * librados: fix linger bugs (Josh Durgin) * librbd: fixed flatten deadlock (Josh Durgin) * rgw: fixed >4MB range requests (Jan Harkes) * rgw: fix log rotation * mds: allow xattrs on root * ceph-fuse: fix statfs(2) reporting * msgr: optionally tune TCP buffer size to avoid throughput collapse (Jim Schutt) * consume less memory for logging by default * always use system leveldb (Gary Lowell) v0.58 ===== Upgrading --------- * The monitor now enforces that MDS names be unique. If you have multiple daemons start with with the same id (e.g., ``mds.a``) the second one will implicitly mark the first as failed. This makes things less confusing and makes a daemon restart faster (we no longer wait for the stopped daemon to time out) but existing multi-mds configurations may need to be adjusted accordingly to give daemons unique names. Notable Changes --------------- * librbd: fixed some locking issues with flatten (Josh Durgin) * rbd: udevadm settle on map/unmap to avoid various races (Dan Mick) * osd: move pg info, log into leveldb (== better performance) (David Zafman) * osd: fix pg log trimming (avoids memory bloat on degraded clusters) * osd: fixed bug in journal checksums (Sam Just) * osd: verify snap collections on startup (Sam Just) * ceph-disk-prepare/activate: support for dm-crypt (Alexandre Marangone) * ceph-disk-prepare/activate: support for sysvinit, directories or partitions (not full disks) * msgr: fixed race in connection reset * msgr: fix comparison of IPv6 addresses (fixes monitor bringup via ceph-deploy, chef) * radosgw: fix object copy onto self (Yehuda Sadeh) * radosgw: ACL grants in headers (Caleb Miles) * radosgw: ability to listen to fastcgi via a port (Guilhem Lettron) * mds: new encoding for all data types (to allow forward/backward compatbility) (Greg Farnum) * mds: fast failover between MDSs (enforce unique mds names) * crush: ability to create, remove rules via CLI * many many cleanups (Danny Al-Gaaf) * buffer unit testing (Loic Dachary) * fixed log rotation (Gary Lowell) v0.57 ===== This development release has a lot of additional functionality accumulated over the last couple months. Most of the bug fixes (with the notable exception of the MDS related work) has already been backported to v0.56.x, and is not mentioned here. Upgrading --------- * The 'ceph osd pool delete ' and 'rados rmpool ' now have safety interlocks with loud warnings that make you confirm pool removal. Any scripts curenty rely on these functions zapping data without confirmation need to be adjusted accordingly. Notable Changes --------------- * osd: default to libaio for the journal (some performance boost) * osd: validate snap collections on startup * osd: ceph-filestore-dump tool for debugging * osd: deep-scrub omap keys/values * ceph tool: some CLI interface cleanups * mon: easy adjustment of crush tunables via 'ceph osd crush tunables ...' * mon: easy creation of crush rules vai 'ceph osd rule ...' * mon: approximate recovery, IO workload stats * mon: avoid marking entire CRUSH subtrees out (e.g., if an entire rack goes offline) * mon: safety check for pool deletion * mon: new checks for identifying and reporting clock drift * radosgw: misc fixes * rbd: wait for udev to settle in strategic places (avoid spurious errors, failures) * rbd-fuse: new tool, package * mds, ceph-fuse: manage layouts via xattrs * mds: misc bug fixes with clustered MDSs and failure recovery * mds: misc bug fixes with readdir * libcephfs: many fixes, cleanups with the Java bindings * auth: ability to require new cephx signatures on messages (still off by default) v0.56.7 "bobtail" ================= This bobtail update fixes a range of radosgw bugs (including an easily triggered crash from multi-delete), a possible data corruption issue with power failure on XFS, and several OSD problems, including a memory "leak" that will affect aged clusters. Notable changes --------------- * ceph-fuse: create finisher flags after fork() * debian: fix prerm/postinst hooks; do not restart daemons on upgrade * librados: fix async aio completion wakeup (manifests as rbd hang) * librados: fix hang when osd becomes full and then not full * librados: fix locking for aio completion refcounting * librbd python bindings: fix stripe_unit, stripe_count * librbd: make image creation default configurable * mon: fix validation of mds ids in mon commands * osd: avoid excessive disk updates during peering * osd: avoid excessive memory usage on scrub * osd: avoid heartbeat failure/suicide when scrubbing * osd: misc minor bug fixes * osd: use fdatasync instead of sync_file_range (may avoid xfs power-loss corruption) * rgw: escape prefix correctly when listing objects * rgw: fix copy attrs * rgw: fix crash on multi delete * rgw: fix locking/crash when using ops log socket * rgw: fix usage logging * rgw: handle deep uri resources For more detailed information, see :download:`the complete changelog `. v0.56.6 "bobtail" ================= Notable changes --------------- * rgw: fix garbage collection * rpm: fix package dependencies For more detailed information, see :download:`the complete changelog `. v0.56.5 "bobtail" ================= Upgrading --------- * ceph-disk[-prepare,-activate] behavior has changed in various ways. There should not be any compatibility issues, but chef users should be aware. Notable changes --------------- * mon: fix recording of quorum feature set (important for argonaut -> bobtail -> cuttlefish mon upgrades) * osd: minor peering bug fixes * osd: fix a few bugs when pools are renamed * osd: fix occasionally corrupted pg stats * osd: fix behavior when broken v0.56[.0] clients connect * rbd: avoid FIEMAP ioctl on import (it is broken on some kernels) * librbd: fixes for several request/reply ordering bugs * librbd: only set STRIPINGV2 feature on new images when needed * librbd: new async flush method to resolve qemu hangs (requires QEMU update as well) * librbd: a few fixes to flatten * ceph-disk: support for dm-crypt * ceph-disk: many backports to allow bobtail deployments with ceph-deploy, chef * sysvinit: do not stop starting daemons on first failure * udev: fixed rules for redhat-based distros * build fixes for raring For more detailed information, see :download:`the complete changelog `. v0.56.4 "bobtail" ================= Upgrading --------- * There is a fix in the syntax for the output of 'ceph osd tree --format=json'. * The MDS disk format has changed from prior releases *and* from v0.57. In particular, upgrades to v0.56.4 are safe, but you cannot move from v0.56.4 to v0.57 if you are using the MDS for CephFS; you must upgrade directly to v0.58 (or later) instead. Notable changes --------------- * mon: fix bug in bringup with IPv6 * reduce default memory utilization by internal logging (all daemons) * rgw: fix for bucket removal * rgw: reopen logs after log rotation * rgw: fix multipat upload listing * rgw: don't copy object when copied onto self * osd: fix caps parsing for pools with - or _ * osd: allow pg log trimming when degraded, scrubbing, recoverying (reducing memory consumption) * osd: fix potential deadlock when 'journal aio = true' * osd: various fixes for collection creation/removal, rename, temp collections * osd: various fixes for PG split * osd: deep-scrub omap key/value data * osd: fix rare bug in journal replay * osd: misc fixes for snapshot tracking * osd: fix leak in recovery reservations on pool deletion * osd: fix bug in connection management * osd: fix for op ordering when rebalancing * ceph-fuse: report file system size with correct units * mds: get and set directory layout policies via virtual xattrs * mds: on-disk format revision (see upgrading note above) * mkcephfs, init-ceph: close potential security issues with predictable filenames For more detailed information, see :download:`the complete changelog `. v0.56.3 "bobtail" ================= This release has several bug fixes surrounding OSD stability. Most significantly, an issue with OSDs being unresponsive shortly after startup (and occasionally crashing due to an internal heartbeat check) is resolved. Please upgrade. Upgrading --------- * A bug was fixed in which the OSDMap epoch for PGs without any IO requests was not recorded. If there are pools in the cluster that are completely idle (for example, the ``data`` and ``metadata`` pools normally used by CephFS), and a large number of OSDMap epochs have elapsed since the ``ceph-osd`` daemon was last restarted, those maps will get reprocessed when the daemon restarts. This process can take a while if there are a lot of maps. A workaround is to 'touch' any idle pools with IO prior to restarting the daemons after packages are upgraded:: rados bench 10 write -t 1 -b 4096 -p {POOLNAME} This will typically generate enough IO to touch every PG in the pool without generating significant cluster load, and also cleans up any temporary objects it creates. Notable changes --------------- * osd: flush peering work queue prior to start * osd: persist osdmap epoch for idle PGs * osd: fix and simplify connection handling for heartbeats * osd: avoid crash on invalid admin command * mon: fix rare races with monitor elections and commands * mon: enforce that OSD reweights be between 0 and 1 (NOTE: not CRUSH weights) * mon: approximate client, recovery bandwidth logging * radosgw: fixed some XML formatting to conform to Swift API inconsistency * radosgw: fix usage accounting bug; add repair tool * radosgw: make fallback URI configurable (necessary on some web servers) * librbd: fix handling for interrupted 'unprotect' operations * mds, ceph-fuse: allow file and directory layouts to be modified via virtual xattrs For more detailed information, see :download:`the complete changelog `. v0.56.2 "bobtail" ================= This release has a wide range of bug fixes, stability improvements, and some performance improvements. Please upgrade. Upgrading --------- * The meaning of the 'osd scrub min interval' and 'osd scrub max interval' has changed slightly. The min interval used to be meaningless, while the max interval would only trigger a scrub if the load was sufficiently low. Now, the min interval option works the way the old max interval did (it will trigger a scrub after this amount of time if the load is low), while the max interval will force a scrub regardless of load. The default options have been adjusted accordingly. If you have customized these in ceph.conf, please review their values when upgrading. * CRUSH maps that are generated by default when calling ``ceph-mon --mkfs`` directly now distribute replicas across hosts instead of across OSDs. Any provisioning tools that are being used by Ceph may be affected, although probably for the better, as distributing across hosts is a much more commonly sought behavior. If you use ``mkcephfs`` to create the cluster, the default CRUSH rule is still inferred by the number of hosts and/or racks in the initial ceph.conf. Notable changes --------------- * osd: snapshot trimming fixes * osd: scrub snapshot metadata * osd: fix osdmap trimming * osd: misc peering fixes * osd: stop heartbeating with peers if internal threads are stuck/hung * osd: PG removal is friendlier to other workloads * osd: fix recovery start delay (was causing very slow recovery) * osd: fix scheduling of explicitly requested scrubs * osd: fix scrub interval config options * osd: improve recovery vs client io tuning * osd: improve 'slow request' warning detail for better diagnosis * osd: default CRUSH map now distributes across hosts, not OSDs * osd: fix crash on 32-bit hosts triggered by librbd clients * librbd: fix error handling when talking to older OSDs * mon: fix a few rare crashes * ceph command: ability to easily adjust CRUSH tunables * radosgw: object copy does not copy source ACLs * rados command: fix omap command usage * sysvinit script: set ulimit -n properly on remote hosts * msgr: fix narrow race with message queuing * fixed compilation on some old distros (e.g., RHEL 5.x) For more detailed information, see :download:`the complete changelog `. v0.56.1 "bobtail" ================= This release has two critical fixes. Please upgrade. Upgrading --------- * There is a protocol compatibility problem between v0.56 and any other version that is now fixed. If your radosgw or RBD clients are running v0.56, they will need to be upgraded too. If they are running a version prior to v0.56, they can be left as is. Notable changes --------------- * osd: fix commit sequence for XFS, ext4 (or any other non-btrfs) to prevent data loss on power cycle or kernel panic * osd: fix compatibility for CALL operation * osd: process old osdmaps prior to joining cluster (fixes slow startup) * osd: fix a couple of recovery-related crashes * osd: fix large io requests when journal is in (non-default) aio mode * log: fix possible deadlock in logging code For more detailed information, see :download:`the complete changelog `. v0.56 "bobtail" =============== Bobtail is the second stable release of Ceph, named in honor of the `Bobtail Squid`: http://en.wikipedia.org/wiki/Bobtail_squid. Key features since v0.48 "argonaut" ----------------------------------- * Object Storage Daemon (OSD): improved threading, small-io performance, and performance during recovery * Object Storage Daemon (OSD): regular "deep" scrubbing of all stored data to detect latent disk errors * RADOS Block Device (RBD): support for copy-on-write clones of images. * RADOS Block Device (RBD): better client-side caching. * RADOS Block Device (RBD): advisory image locking * Rados Gateway (RGW): support for efficient usage logging/scraping (for billing purposes) * Rados Gateway (RGW): expanded S3 and Swift API coverage (e.g., POST, multi-object delete) * Rados Gateway (RGW): improved striping for large objects * Rados Gateway (RGW): OpenStack Keystone integration * RPM packages for Fedora, RHEL/CentOS, OpenSUSE, and SLES * mkcephfs: support for automatically formatting and mounting XFS and ext4 (in addition to btrfs) Upgrading --------- Please refer to the document `Upgrading from Argonaut to Bobtail`_ for details. .. _Upgrading from Argonaut to Bobtail: ../install/upgrading-ceph/#upgrading-from-argonaut-to-bobtail * Cephx authentication is now enabled by default (since v0.55). Upgrading a cluster without adjusting the Ceph configuration will likely prevent the system from starting up on its own. We recommend first modifying the configuration to indicate that authentication is disabled, and only then upgrading to the latest version.:: auth client required = none auth service required = none auth cluster required = none * Ceph daemons can be upgraded one-by-one while the cluster is online and in service. * The ``ceph-osd`` daemons must be upgraded and restarted *before* any ``radosgw`` daemons are restarted, as they depend on some new ceph-osd functionality. (The ``ceph-mon``, ``ceph-osd``, and ``ceph-mds`` daemons can be upgraded and restarted in any order.) * Once each individual daemon has been upgraded and restarted, it cannot be downgraded. * The cluster of ``ceph-mon`` daemons will migrate to a new internal on-wire protocol once all daemons in the quorum have been upgraded. Upgrading only a majority of the nodes (e.g., two out of three) may expose the cluster to a situation where a single additional failure may compromise availability (because the non-upgraded daemon cannot participate in the new protocol). We recommend not waiting for an extended period of time between ``ceph-mon`` upgrades. * The ops log and usage log for radosgw are now off by default. If you need these logs (e.g., for billing purposes), you must enable them explicitly. For logging of all operations to objects in the ``.log`` pool (see ``radosgw-admin log ...``):: rgw enable ops log = true For usage logging of aggregated bandwidth usage (see ``radosgw-admin usage ...``):: rgw enable usage log = true * You should not create or use "format 2" RBD images until after all ``ceph-osd`` daemons have been upgraded. Note that "format 1" is still the default. You can use the new ``ceph osd ls`` and ``ceph tell osd.N version`` commands to doublecheck your cluster. ``ceph osd ls`` will give a list of all OSD IDs that are part of the cluster, and you can use that to write a simple shell loop to display all the OSD version strings: :: for i in $(ceph osd ls); do ceph tell osd.${i} version done Compatibility changes --------------------- * The 'ceph osd create []' command now rejects an argument that is not a UUID. (Previously it would take take an optional integer OSD id.) This correct syntax has been 'ceph osd create []' since v0.47, but the older calling convention was being silently ignored. * The CRUSH map root nodes now have type ``root`` instead of type ``pool``. This avoids confusion with RADOS pools, which are not directly related. Any scripts or tools that use the ``ceph osd crush ...`` commands may need to be adjusted accordingly. * The ``ceph osd pool create `` command now requires the ``pgnum`` argument. Previously this was optional, and would default to 8, which was almost never a good number. * Degraded mode (when there fewer than the desired number of replicas) is now more configurable on a per-pool basis, with the min_size parameter. By default, with min_size 0, this allows I/O to objects with N - floor(N/2) replicas, where N is the total number of expected copies. Argonaut behavior was equivalent to having min_size = 1, so I/O would always be possible if any completely up to date copy remained. min_size = 1 could result in lower overall availability in certain cases, such as flapping network partitions. * The sysvinit start/stop script now defaults to adjusting the max open files ulimit to 16384. On most systems the default is 1024, so this is an increase and won't break anything. If some system has a higher initial value, however, this change will lower the limit. The value can be adjusted explicitly by adding an entry to the ``ceph.conf`` file in the appropriate section. For example:: [global] max open files = 32768 * 'rbd lock list' and 'rbd showmapped' no longer use tabs as separators in their output. * There is configurable limit on the number of PGs when creating a new pool, to prevent a user from accidentally specifying a ridiculous number for pg_num. It can be adjusted via the 'mon max pool pg num' option on the monitor, and defaults to 65536 (the current max supported by the Linux kernel client). * The osd capabilities associated with a rados user have changed syntax since 0.48 argonaut. The new format is mostly backwards compatible, but there are two backwards-incompatible changes: * specifying a list of pools in one grant, i.e. 'allow r pool=foo,bar' is now done in separate grants, i.e. 'allow r pool=foo, allow r pool=bar'. * restricting pool access by pool owner ('allow r uid=foo') is removed. This feature was not very useful and unused in practice. The new format is documented in the ceph-authtool man page. * 'rbd cp' and 'rbd rename' use rbd as the default destination pool, regardless of what pool the source image is in. Previously they would default to the same pool as the source image. * 'rbd export' no longer prints a message for each object written. It just reports percent complete like other long-lasting operations. * 'ceph osd tree' now uses 4 decimal places for weight so output is nicer for humans * Several monitor operations are now idempotent: * ceph osd pool create * ceph osd pool delete * ceph osd pool mksnap * ceph osd rm * ceph pg revert Notable changes --------------- * auth: enable cephx by default * auth: expanded authentication settings for greater flexibility * auth: sign messages when using cephx * build fixes for Fedora 18, CentOS/RHEL 6 * ceph: new 'osd ls' and 'osd tell version' commands * ceph-debugpack: misc improvements * ceph-disk-prepare: creates and labels GPT partitions * ceph-disk-prepare: support for external journals, default mount/mkfs options, etc. * ceph-fuse/libcephfs: many misc fixes, admin socket debugging * ceph-fuse: fix handling for .. in root directory * ceph-fuse: many fixes (including memory leaks, hangs) * ceph-fuse: mount helper (mount.fuse.ceph) for use with /etc/fstab * ceph.spec: misc packaging fixes * common: thread pool sizes can now be adjusted at runtime * config: $pid is now available as a metavariable * crush: default root of tree type is now 'root' instead of 'pool' (to avoid confusiong wrt rados pools) * crush: fixed retry behavior with chooseleaf via tunable * crush: tunables documented; feature bit now present and enforced * libcephfs: java wrapper * librados: several bug fixes (rare races, locking errors) * librados: some locking fixes * librados: watch/notify fixes, misc memory leaks * librbd: a few fixes to 'discard' support * librbd: fine-grained striping feature * librbd: fixed memory leaks * librbd: fully functional and documented image cloning * librbd: image (advisory) locking * librbd: improved caching (of object non-existence) * librbd: 'flatten' command to sever clone parent relationship * librbd: 'protect'/'unprotect' commands to prevent clone parent from being deleted * librbd: clip requests past end-of-image. * librbd: fixes an issue with some windows guests running in qemu (remove floating point usage) * log: fix in-memory buffering behavior (to only write log messages on crash) * mds: fix ino release on abort session close, relative getattr path, mds shutdown, other misc items * mds: misc fixes * mkcephfs: fix for default keyring, osd data/journal locations * mkcephfs: support for formatting xfs, ext4 (as well as btrfs) * init: support for automatically mounting xfs and ext4 osd data directories * mon, radosgw, ceph-fuse: fixed memory leaks * mon: improved ENOSPC, fs error checking * mon: less-destructive ceph-mon --mkfs behavior * mon: misc fixes * mon: more informative info about stuck PGs in 'health detail' * mon: information about recovery and backfill in 'pg query' * mon: new 'osd crush create-or-move ...' command * mon: new 'osd crush move ...' command lets you rearrange your CRUSH hierarchy * mon: optionally dump 'osd tree' in json * mon: configurable cap on maximum osd number (mon max osd) * mon: many bug fixes (various races causing ceph-mon crashes) * mon: new on-disk metadata to facilitate future mon changes (post-bobtail) * mon: election bug fixes * mon: throttle client messages (limit memory consumption) * mon: throttle osd flapping based on osd history (limits osdmap ΄thrashing' on overloaded or unhappy clusters) * mon: 'report' command for dumping detailed cluster status (e.g., for use when reporting bugs) * mon: osdmap flags like noup, noin now cause a health warning * msgr: improved failure handling code * msgr: many bug fixes * osd, mon: honor new 'nobackfill' and 'norecover' osdmap flags * osd, mon: use feature bits to lock out clients lacking CRUSH tunables when they are in use * osd: backfill reservation framework (to avoid flooding new osds with backfill data) * osd: backfill target reservations (improve performance during recovery) * osd: better tracking of recent slow operations * osd: capability grammar improvements, bug fixes * osd: client vs recovery io prioritization * osd: crush performance improvements * osd: default journal size to 5 GB * osd: experimental support for PG "splitting" (pg_num adjustment for existing pools) * osd: fix memory leak on certain error paths * osd: fixed detection of EIO errors from fs on read * osd: major refactor of PG peering and threading * osd: many bug fixes * osd: more/better dump info about in-progress operations * osd: new caps structure (see compatibility notes) * osd: new 'deep scrub' will compare object content across replicas (once per week by default) * osd: new 'lock' rados class for generic object locking * osd: optional 'min' pg size * osd: recovery reservations * osd: scrub efficiency improvement * osd: several out of order reply bug fixes * osd: several rare peering cases fixed * osd: some performance improvements related to request queuing * osd: use entire device if journal is a block device * osd: use syncfs(2) when kernel supports it, even if glibc does not * osd: various fixes for out-of-order op replies * rados: ability to copy, rename pools * rados: bench command now cleans up after itself * rados: 'cppool' command to copy rados pools * rados: 'rm' now accepts a list of objects to be removed * radosgw: POST support * radosgw: REST API for managing usage stats * radosgw: fix bug in bucket stat updates * radosgw: fix copy-object vs attributes * radosgw: fix range header for large objects, ETag quoting, GMT dates, other compatibility fixes * radosgw: improved garbage collection framework * radosgw: many small fixes, cleanups * radosgw: openstack keystone integration * radosgw: stripe large (non-multipart) objects * radosgw: support for multi-object deletes * radosgw: support for swift manifest objects * radosgw: vanity bucket dns names * radosgw: various API compatibility fixes * rbd: import from stdin, export to stdout * rbd: new 'ls -l' option to view images with metadata * rbd: use generic id and keyring options for 'rbd map' * rbd: don't issue usage on errors * udev: fix symlink creation for rbd images containing partitions * upstart: job files for all daemon types (not enabled by default) * wireshark: ceph protocol dissector patch updated v0.54 ===== Upgrading --------- * The osd capabilities associated with a rados user have changed syntax since 0.48 argonaut. The new format is mostly backwards compatible, but there are two backwards-incompatible changes: * specifying a list of pools in one grant, i.e. 'allow r pool=foo,bar' is now done in separate grants, i.e. 'allow r pool=foo, allow r pool=bar'. * restricting pool access by pool owner ('allow r uid=foo') is removed. This feature was not very useful and unused in practice. The new format is documented in the ceph-authtool man page. * Bug fixes to the new osd capability format parsing properly validate the allowed operations. If an existing rados user gets permissions errors after upgrading, its capabilities were probably misconfigured. See the ceph-authtool man page for details on osd capabilities. * 'rbd lock list' and 'rbd showmapped' no longer use tabs as separators in their output. v0.48.3 "argonaut" ================== This release contains a critical fix that can prevent data loss or corruption after a power loss or kernel panic event. Please upgrade immediately. Upgrading --------- * If you are using the undocumented ``ceph-disk-prepare`` and ``ceph-disk-activate`` tools, they have several new features and some additional functionality. Please review the changes in behavior carefully before upgrading. * The .deb packages now require xfsprogs. Notable changes --------------- * filestore: fix op_seq write order (fixes journal replay after power loss) * osd: fix occasional indefinitely hung "slow" request * osd: fix encoding for pool_snap_info_t when talking to pre-v0.48 clients * osd: fix heartbeat check * osd: reduce log noise about rbd watch * log: fixes for deadlocks in the internal logging code * log: make log buffer size adjustable * init script: fix for 'ceph status' across machines * radosgw: fix swift error handling * radosgw: fix swift authentication concurrency bug * radosgw: don't cache large objects * radosgw: fix some memory leaks * radosgw: fix timezone conversion on read * radosgw: relax date format restrictions * radosgw: fix multipart overwrite * radosgw: stop processing requests on client disconnect * radosgw: avoid adding port to url that already has a port * radosgw: fix copy to not override ETAG * common: make parsing of ip address lists more forgiving * common: fix admin socket compatibility with old protocol (for collectd plugin) * mon: drop dup commands on paxos reset * mds: fix loner selection for multiclient workloads * mds: fix compat bit checks * ceph-fuse: fix segfault on startup when keyring is missing * ceph-authtool: fix usage * ceph-disk-activate: misc backports * ceph-disk-prepare: misc backports * debian: depend on xfsprogs (we use xfs by default) * rpm: build rpms, some related Makefile changes For more detailed information, see :download:`the complete changelog `. v0.48.2 "argonaut" ================== Upgrading --------- * The default search path for keyring files now includes /etc/ceph/ceph.$name.keyring. If such files are present on your cluster, be aware that by default they may now be used. * There are several changes to the upstart init files. These have not been previously documented or recommended. Any existing users should review the changes before upgrading. * The ceph-disk-prepare and ceph-disk-active scripts have been updated significantly. These have not been previously documented or recommended. Any existing users should review the changes before upgrading. Notable changes --------------- * mkcephfs: fix keyring generation for mds, osd when default paths are used * radosgw: fix bug causing occasional corruption of per-bucket stats * radosgw: workaround to avoid previously corrupted stats from going negative * radosgw: fix bug in usage stats reporting on busy buckets * radosgw: fix Content-Range: header for objects bigger than 2 GB. * rbd: avoid leaving watch acting when command line tool errors out (avoids 30s delay on subsequent operations) * rbd: friendlier use of --pool/--image options for import (old calling convention still works) * librbd: fix rare snapshot creation race (could "lose" a snap when creation is concurrent) * librbd: fix discard handling when spanning holes * librbd: fix memory leak on discard when caching is enabled * objecter: misc fixes for op reordering * objecter: fix for rare startup-time deadlock waiting for osdmap * ceph: fix usage * mon: reduce log noise about "check_sub" * ceph-disk-activate: misc fixes, improvements * ceph-disk-prepare: partition and format osd disks automatically * upstart: start everyone on a reboot * upstart: always update the osd crush location on start if specified in the config * config: add /etc/ceph/ceph.$name.keyring to default keyring search path * ceph.spec: don't package crush headers For more detailed information, see :download:`the complete changelog `. v0.48.1 "argonaut" ================== Upgrading --------- * The radosgw usage trim function was effectively broken in v0.48. Earlier it would remove more usage data than what was requested. This is fixed in v0.48.1, but the fix is incompatible. The v0.48 radosgw-admin tool cannot be used to initiate the trimming; please use the v0.48.1 version. * v0.48.1 now explicitly indicates support for the CRUSH_TUNABLES feature. No other version of Ceph requires this, yet, but future versions will when the tunables are adjusted from their historical defaults. * There are no other compatibility changes between v0.48.1 and v0.48. Notable changes --------------- * mkcephfs: use default 'keyring', 'osd data', 'osd journal' paths when not specified in conf * msgr: various fixes to socket error handling * osd: reduce scrub overhead * osd: misc peering fixes (past_interval sharing, pgs stuck in 'peering' states) * osd: fail on EIO in read path (do not silently ignore read errors from failing disks) * osd: avoid internal heartbeat errors by breaking some large transactions into pieces * osd: fix osdmap catch-up during startup (catch up and then add daemon to osdmap) * osd: fix spurious 'misdirected op' messages * osd: report scrub status via 'pg ... query' * rbd: fix race when watch registrations are resent * rbd: fix rbd image id assignment scheme (new image data objects have slightly different names) * rbd: fix perf stats for cache hit rate * rbd tool: fix off-by-one in key name (crash when empty key specified) * rbd: more robust udev rules * rados tool: copy object, pool commands * radosgw: fix in usage stats trimming * radosgw: misc API compatibility fixes (date strings, ETag quoting, swift headers, etc.) * ceph-fuse: fix locking in read/write paths * mon: fix rare race corrupting on-disk data * config: fix admin socket 'config set' command * log: fix in-memory log event gathering * debian: remove crush headers, include librados-config * rpm: add ceph-disk-{activate, prepare} For more detailed information, see :download:`the complete changelog `. v0.48 "argonaut" ================ Upgrading --------- * This release includes a disk format upgrade. Each ceph-osd daemon, upon startup, will migrate its locally stored data to the new format. This process can take a while (for large object counts, even hours), especially on non-btrfs file systems. * To keep the cluster available while the upgrade is in progress, we recommend you upgrade a storage node or rack at a time, and wait for the cluster to recover each time. To prevent the cluster from moving data around in response to the OSD daemons being down for minutes or hours, you may want to:: ceph osd set noout This will prevent the cluster from marking down OSDs as "out" and re-replicating the data elsewhere. If you do this, be sure to clear the flag when the upgrade is complete:: ceph osd unset noout * There is a encoding format change internal to the monitor cluster. The monitor daemons are careful to switch to the new format only when all members of the quorum support it. However, that means that a partial quorum with new code may move to the new format, and a recovering monitor running old code will be unable to join (it will crash). If this occurs, simply upgrading the remaining monitor will resolve the problem. * The ceph tool's -s and -w commands from previous versions are incompatible with this version. Upgrade your client tools at the same time you upgrade the monitors if you rely on those commands. * It is not possible to downgrade from v0.48 to a previous version. Notable changes --------------- * osd: stability improvements * osd: capability model simplification * osd: simpler/safer --mkfs (no longer removes all files; safe to re-run on active osd) * osd: potentially buggy FIEMAP behavior disabled by default * rbd: caching improvements * rbd: improved instrumentation * rbd: bug fixes * radosgw: new, scalable usage logging infrastructure * radosgw: per-user bucket limits * mon: streamlined process for setting up authentication keys * mon: stability improvements * mon: log message throttling * doc: improved documentation (ceph, rbd, radosgw, chef, etc.) * config: new default locations for daemon keyrings * config: arbitrary variable substitutions * improved 'admin socket' daemon admin interface (ceph --admin-daemon ...) * chef: support for multiple monitor clusters * upstart: basic support for monitors, mds, radosgw; osd support still a work in progress. The new default keyring locations mean that when enabling authentication (``auth supported = cephx``), keyring locations do not need to be specified if the keyring file is located inside the daemon's data directory (``/var/lib/ceph/$type/ceph-$id`` by default). There is also a lot of librbd code in this release that is laying the groundwork for the upcoming layering functionality, but is not actually used. Likewise, the upstart support is still incomplete and not recommended; we will backport that functionality later if it turns out to be non-disruptive.