Corpus structure
================

ceph.git/ceph-object-corpus is a submodule::

  bin/                                  # misc scripts
  archive/$version/objects/$type/$hash  # a sample of encoded objects from a specific version

You can also mark known or deliberate incompatibilities between versions with::

  archive/$version/forward_incompat/$type

The presence of a file indicates that new versions of code cannot
decode old objects across that $version (this is normally the case).


How to generate an object corpus
================================

We can generate an object corpus for a particular version of ceph like so.

#. Check out a clean repo (best not to do this where you normally work)::

     git clone ceph.git
     cd ceph
     git submodule update --init --recursive

#. Build with the flag to dump objects to /tmp/foo::

     rm -rf /tmp/foo ; mkdir /tmp/foo
     ./do_autogen.sh -e /tmp/foo
     make

#. Start via vstart::

     cd src
     MON=3 OSD=3 MDS=3 RGW=1 ./vstart.sh -n -x

#. Use as much functionality of the cluster as you can, to exercise as
   many object encoder methods as possible::

     ./rados -p rbd bench 10 write -b 123
     ./ceph osd out 0
     ./init-ceph stop osd.1
     for f in ../qa/workunits/cls/*.sh ; do PATH=".:$PATH" $f ; done
     ../qa/workunits/rados/test.sh
     ./ceph_test_librbd
     ./ceph_test_libcephfs
     ./init-ceph restart mds.a

   Do some more stuff with rgw if you know how.

#. Stop::

     ./stop.sh

#. Import the corpus (this will take a few minutes)::

     test/encoding/import.sh /tmp/foo `./ceph-dencoder version` ../ceph-object-corpus/archive
     test/encoding/import-generated.sh ../ceph-object-corpus/archive

#. Prune it! There will be a bazillion copies of various objects, and we
   only want a representative sample::

     pushd ../ceph-object-corpus
     bin/prune-archive.sh
     popd

#. Verify the tests pass::

     make check-local

#. Commit it to the corpus repo and push::

     pushd ../ceph-object-corpus
     git checkout -b wip-new
     git add archive/`../src/ceph-dencoder version`
     git commit -m `../src/ceph-dencoder version`
     git remote add cc ceph.com:/git/ceph-object-corpus.git
     git push cc wip-new
     popd

#. Go test it out::

     cd my/regular/tree
     cd ceph-object-corpus
     git fetch origin
     git checkout wip-new
     cd ../src
     make check-local

#. If everything looks good, update the submodule master branch, and
   commit the submodule in ceph.git.
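
After importing and pruning, it can help to look over what actually landed in the archive before committing. The following is a minimal sketch that relies only on the directory layout described under "Corpus structure". The ``corpus_summary`` helper is hypothetical and is not a script shipped in ceph-object-corpus. For each type directory it prints the type name, the number of sample objects, and whether a ``forward_incompat`` marker exists for that type.

```shell
#!/bin/sh
# Hypothetical helper (not part of ceph-object-corpus): summarize one
# version of a corpus that follows the layout
#   $archive/$version/objects/$type/$hash
#   $archive/$version/forward_incompat/$type
# Output: one line per type: "<type> <object count> <forward_incompat|->"
corpus_summary() {
    archive=$1
    version=$2
    for dir in "$archive/$version/objects"/*/; do
        # If the glob matched nothing, $dir is the literal pattern; skip it.
        [ -d "$dir" ] || continue
        objtype=$(basename "$dir")
        # Count the sample objects stored for this type.
        count=$(find "$dir" -type f | wc -l | tr -d ' ')
        # A marker file named after the type flags a known incompatibility.
        if [ -e "$archive/$version/forward_incompat/$objtype" ]; then
            note="forward_incompat"
        else
            note="-"
        fi
        printf '%s %s %s\n' "$objtype" "$count" "$note"
    done
}
```

Assuming the function has been sourced into the current shell, it could be run from the ``src`` directory as ``corpus_summary ../ceph-object-corpus/archive `./ceph-dencoder version```. A type with an unexpectedly large count suggests the prune step has not been run yet.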