Ceph is a distributed object, block, and file storage platform
Go to file
Xiubo Li 5ade254c15 mds: switch mds_lock to fair mutex
The implementations of the Mutex (e.g. std::mutex in C++) do not
guarantee fairness, they do not guarantee that the lock will be
acquired by threads in the order that they called the lock().

In most case this works well, but in corner case in the Finisher
thread in mds daemon, which may call more than one complete()s
once the mdlog flushing succeeds, after the mdlog flushing is done
it will call the queued complete callbacks and the Finisher thread
could always successfully acquire the mds_lock in successive
complete callbakcs even there may have other threads already being
stuck waiting the mds_lock. This will make the other threads starve
and if they are client's requests, it will cause several or even
tens of seconds long delay for user's operations.

This will switch the mds_lock to fair mutex and it could make sure
that the all the mds_lock waiters are in FIFO order and the Finisher
thread won't hold the mds_lock that long.

At the same time, if the finisher thread has many completes needed
to run the fair mutex could guarantee that the finisher won't be
scheduled out due to fair mutex unlock() if no any other mds_lock
waiter queued.

Fixes: https://tracker.ceph.com/issues/51722
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2021-08-25 09:42:51 +08:00
.github .github/labeler: add rook label to PRs related to Rook 2021-08-20 09:50:37 -04:00
admin admin/build-doc: rebuild venv if md5 does not match 2021-08-05 09:08:56 +08:00
bin
ceph-erasure-code-corpus@2d7d78b9cc
ceph-menv
ceph-object-corpus@038c72b5ac
cmake/modules common/options: validate see-also 2021-08-24 22:22:37 +08:00
debian Merge pull request #42194 from rhcs-dashboard/add-grafonnet-grafana 2021-08-11 18:11:59 +02:00
doc Merge pull request #42910 from zdover23/doc-2021-08-25-doc-dev-config-verb-agreement-typo 2021-08-24 16:27:32 -07:00
etc
examples
fusetrace
keys
man
mirroring
monitoring cmake: exclude "grafonnet-lib" target from "all" 2021-08-20 22:50:42 +08:00
qa Merge PR #42371 into master 2021-08-23 20:02:31 -04:00
selinux
share
src mds: switch mds_lock to fair mutex 2021-08-25 09:42:51 +08:00
sudoers.d
systemd
udev
.gitattributes
.githubmap .githubmap: update mail address 2021-08-17 09:57:25 +05:30
.gitignore
.gitmodule_mirrors
.gitmodules .gitmodules: remove thrift submodule 2021-08-21 12:10:19 +05:30
.mailmap .githubmap: update mail address 2021-08-17 09:57:25 +05:30
.organizationmap .githubmap: update mail address 2021-08-17 09:57:25 +05:30
.peoplemap .githubmap: update mail address 2021-08-17 09:57:25 +05:30
.readthedocs.yml .readthedocs.yml: use ditaa instead of plantweb 2021-08-02 01:00:35 +08:00
AUTHORS
ceph.spec.in Merge pull request #42051 from melissa-kun-li/asyncssh 2021-08-23 14:29:01 +02:00
CMakeLists.txt rgw/dbstore: Fix DBstore build conflicts 2021-08-05 14:27:19 +05:30
CodingStyle
CONTRIBUTING.rst
COPYING
COPYING-GPL2
COPYING-LGPL2.1
COPYING-LGPL3
do_cmake.sh do_cmake:sh: do not set BOOST_J 2021-08-11 17:34:23 +08:00
do_freebsd.sh
doc_deps.deb.txt admin/build-doc: s/virtualenv/python3 -m venv/ 2021-07-31 22:34:05 +08:00
install-deps.sh install-deps.sh: s/virtualenv/python -m venv/ 2021-07-31 21:16:56 +08:00
make-debs.sh
make-dist
make-srpm.sh
mingw_conf.sh
PendingReleaseNotes rgw: default auth_client_required=cephx 2021-08-09 11:59:54 -04:00
pom.xml
README.aix ceph.spec.in: drop gdbm from build deps 2021-08-18 01:04:42 +08:00
README.FreeBSD
README.md
README.solaris
README.windows.rst
run-make-check.sh Merge pull request #42842 from ideepika/wip-werror-testing 2021-08-20 12:00:49 +05:30
SECURITY.md
SubmittingPatches-backports.rst
SubmittingPatches-kernel.rst
SubmittingPatches.rst
win32_build.sh
win32_deps_build.sh win32_deps_build.sh: only clone the tip of required tag 2021-07-29 12:13:26 +08:00

Ceph - a scalable distributed storage system

Please see http://ceph.com/ for current info.

Contributing Code

Most of Ceph is dual licensed under the LGPL version 2.1 or 3.0. Some miscellaneous code is under a BSD-style license or is public domain. The documentation is licensed under Creative Commons Attribution Share Alike 3.0 (CC-BY-SA-3.0). There are a handful of headers included here that are licensed under the GPL. Please see the file COPYING for a full inventory of licenses by file.

Code contributions must include a valid "Signed-off-by" acknowledging the license for the modified or contributed file. Please see the file SubmittingPatches.rst for details on what that means and on how to generate and submit patches.

We do not require assignment of copyright to contribute code; code is contributed under the terms of the applicable license.

Checking out the source

You can clone from github with

git clone git@github.com:ceph/ceph

or, if you are not a github user,

git clone git://github.com/ceph/ceph

Ceph contains many git submodules that need to be checked out with

git submodule update --init --recursive

Build Prerequisites

The list of Debian or RPM packages dependencies can be installed with:

./install-deps.sh

Building Ceph

Note that these instructions are meant for developers who are compiling the code for development and testing. To build binaries suitable for installation we recommend you build deb or rpm packages or refer to the ceph.spec.in or debian/rules to see which configuration options are specified for production builds.

Build instructions:

./do_cmake.sh
cd build
ninja

(do_cmake.sh now defaults to creating a debug build of ceph that can be up to 5x slower with some workloads. Please pass "-DCMAKE_BUILD_TYPE=RelWithDebInfo" to do_cmake.sh to create a non-debug release.

The number of jobs used by ninja is derived from the number of CPU cores of the building host if unspecified. Use the -j option to limit the job number if the build jobs are running out of memory. On average, each job takes around 2.5GiB memory.)

This assumes you make your build dir a subdirectory of the ceph.git checkout. If you put it elsewhere, just point CEPH_GIT_DIR to the correct path to the checkout. Any additional CMake args can be specified by setting ARGS before invoking do_cmake. See cmake options for more details. Eg.

ARGS="-DCMAKE_C_COMPILER=gcc-7" ./do_cmake.sh

To build only certain targets use:

ninja [target name]

To install:

ninja install

CMake Options

If you run the cmake command by hand, there are many options you can set with "-D". For example, the option to build the RADOS Gateway is defaulted to ON. To build without the RADOS Gateway:

cmake -DWITH_RADOSGW=OFF [path to top-level ceph directory]

Another example below is building with debugging and alternate locations for a couple of external dependencies:

cmake -DLEVELDB_PREFIX="/opt/hyperleveldb" \
-DCMAKE_INSTALL_PREFIX=/opt/ceph -DCMAKE_C_FLAGS="-Og -g3 -gdwarf-4" \
..

To view an exhaustive list of -D options, you can invoke cmake with:

cmake -LH

If you often pipe ninja to less and would like to maintain the diagnostic colors for errors and warnings (and if your compiler supports it), you can invoke cmake with:

cmake -DDIAGNOSTICS_COLOR=always ...

Then you'll get the diagnostic colors when you execute:

ninja | less -R

Other available values for 'DIAGNOSTICS_COLOR' are 'auto' (default) and 'never'.

Building a source tarball

To build a complete source tarball with everything needed to build from source and/or build a (deb or rpm) package, run

./make-dist

This will create a tarball like ceph-$version.tar.bz2 from git. (Ensure that any changes you want to include in your working directory are committed to git.)

Running a test cluster

To run a functional test cluster,

cd build
ninja vstart        # builds just enough to run vstart
../src/vstart.sh --debug --new -x --localhost --bluestore
./bin/ceph -s

Almost all of the usual commands are available in the bin/ directory. For example,

./bin/rados -p rbd bench 30 write
./bin/rbd create foo --size 1000

To shut down the test cluster,

../src/stop.sh

To start or stop individual daemons, the sysvinit script can be used:

./bin/init-ceph restart osd.0
./bin/init-ceph stop

Running unit tests

To build and run all tests (in parallel using all processors), use ctest:

cd build
ninja
ctest -j$(nproc)

(Note: Many targets built from src/test are not run using ctest. Targets starting with "unittest" are run in ninja check and thus can be run with ctest. Targets starting with "ceph_test" can not, and should be run by hand.)

When failures occur, look in build/Testing/Temporary for logs.

To build and run all tests and their dependencies without other unnecessary targets in Ceph:

cd build
ninja check -j$(nproc)

To run an individual test manually, run ctest with -R (regex matching):

ctest -R [regex matching test name(s)]

(Note: ctest does not build the test it's running or the dependencies needed to run it)

To run an individual test manually and see all the tests output, run ctest with the -V (verbose) flag:

ctest -V -R [regex matching test name(s)]

To run tests manually and run the jobs in parallel, run ctest with the -j flag:

ctest -j [number of jobs]

There are many other flags you can give ctest for better control over manual test execution. To view these options run:

man ctest

Building the Documentation

Prerequisites

The list of package dependencies for building the documentation can be found in doc_deps.deb.txt:

sudo apt-get install `cat doc_deps.deb.txt`

Building the Documentation

To build the documentation, ensure that you are in the top-level /ceph directory, and execute the build script. For example:

admin/build-doc