2016-11-23 22:05:37 +00:00
|
|
|
Using perf
|
|
|
|
==========
|
|
|
|
|
|
|
|
Top::
|
|
|
|
|
|
|
|
sudo perf top -p `pidof ceph-osd`
|
|
|
|
|
|
|
|
To capture some data with call graphs::
|
|
|
|
|
|
|
|
sudo perf record -p `pidof ceph-osd` -F 99 --call-graph dwarf -- sleep 60
|
|
|
|
|
|
|
|
To view by caller (where you can see what each top function calls)::
|
|
|
|
|
|
|
|
sudo perf report --call-graph caller
|
|
|
|
|
2016-12-05 11:10:08 +00:00
|
|
|
To view by callee (where you can see who calls each top function)::
|
2016-11-23 22:05:37 +00:00
|
|
|
|
|
|
|
sudo perf report --call-graph callee
|
|
|
|
|
2016-12-05 11:10:08 +00:00
|
|
|
:note: If the caller/callee views look the same you may be
|
2016-11-23 22:05:37 +00:00
|
|
|
suffering from a kernel bug; upgrade to 4.8 or later.
|
|
|
|
|
2017-08-30 04:14:37 +00:00
|
|
|
Common Issues
|
|
|
|
-------------
|
|
|
|
|
|
|
|
Ceph use `RelWithDebInfo` as its default `CMAKE_BUILD_TYPE`. Hence `-O2 -g` is
|
|
|
|
used to compile the tree in this case. And the `-O2` optimization level
|
|
|
|
enables `-fomit-frame-pointer` by default. But this prevents stack profilers
|
|
|
|
from accessing the complete stack information. So one can disable this option
|
|
|
|
when launching `cmake` ::
|
|
|
|
|
|
|
|
cmake -DCMAKE_CXX_FLAGS="-fno-omit-frame-pointer"
|
|
|
|
|
|
|
|
or when building the tree::
|
|
|
|
|
|
|
|
make CMAKE_CXX_FLAGS="-fno-omit-frame-pointer"
|
|
|
|
|
|
|
|
|
2016-11-23 22:05:37 +00:00
|
|
|
Flamegraphs
|
|
|
|
-----------
|
|
|
|
|
|
|
|
First, get things set up::
|
|
|
|
|
|
|
|
cd ~/src
|
|
|
|
git clone https://github.com/brendangregg/FlameGraph
|
|
|
|
|
|
|
|
Run ceph, then record some perf data::
|
|
|
|
|
|
|
|
sudo perf record -p `pidof ceph-osd` -F 99 --call-graph dwarf -- sleep 60
|
|
|
|
|
|
|
|
Then generate the flamegraph::
|
|
|
|
|
|
|
|
sudo perf script | ~/src/FlameGraph/stackcollapse-perf.pl > /tmp/folded
|
|
|
|
~/src/FlameGraph/flamegraph.pl /tmp/folded > /tmp/perf.svg
|
|
|
|
firefox /tmp/perf.svg
|