Publications
Ceph has grown out of the petabyte-scale storage research at the Storage Systems Research Center at the University of California, Santa Cruz. The project is funded primarily by a grant from the Lawrence Livermove, Sandia, and Los Alamos National Laboratories. A range of publications related to scalable storage systems have resulted.
The following publications are directly related to the current design of Ceph.
- Sage A. Weil, Andrew W. Leung, Scott A. Brandt, Carlos Maltzahn. RADOS: A Fast, Scalable, and Reliable Storage Service for Petabyte-scale Storage Clusters. Petascale Data Storage Workshop SC07, November, 2007. [ slides ]
- Sage Weil, Scott A. Brandt, Ethan L. Miller, Darrell D. E. Long, Carlos Maltzahn, Ceph: A Scalable, High-Performance Distributed File System, Proceedings of the 7th Conference on Operating Systems Design and Implementation (OSDI '06), November 2006.
- Sage Weil, Scott A. Brandt, Ethan L. Miller, Carlos Maltzahn, CRUSH: Controlled, Scalable, Decentralized Placement of Replicated Data, Proceedings of SC '06, November 2006.
- Sage Weil, Kristal Pollack, Scott A. Brandt, Ethan L. Miller, Dynamic Metadata Management for Petabyte-Scale File Systems, Proceedings of the 2004 ACM/IEEE Conference on Supercomputing (SC '04), November 2004.
- Qin Xin, Ethan L. Miller, Thomas Schwarz, Evaluation of Distributed Recovery in Large-Scale Storage Systems, Proceedings of the 13th IEEE International Symposium on High Performance Distributed Computing (HPDC 2004), June 2004, pages 172-181.
The following papers describe aspects of subsystems of Ceph that have not yet been fully designed or integrated, but soon will be.
- Andrew Leung, Ethan L. Miller, Scalable Security for Large, High Performance Storage Systems, Proceedings of the 2nd ACM Workshop on Storage Security and Survivability (StorageSS 2006), October 2006.
- Joel C. Wu, Scott A. Brandt, The Design and Implementation of AQuA: an Adaptive Quality of Service Aware Object-Based Storage Device, Proceedings of the 23rd IEEE / 14th NASA Goddard Conference on Mass Storage Systems and Technologies, May 2006, pages 209-218.
The following papers represent earlier research upon which Ceph's design is partially based.
- Christopher Olson, Ethan L. Miller, Secure Capabilities for a Petabyte-Scale Object-Based Distributed File System, Proceedings of the 2005 ACM Workshop on Storage Security and Survivability (StorageSS 2005), November 2005.
- Qin Xin, Thomas Schwarz, Ethan L. Miller, Disk Infant Mortality in Large Storage Systems, Proceedings of the 13th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS '05), September 2005.
- Joel C. Wu, Scott A. Brandt, Hierarchical Disk Sharing for Multimedia Systems and Servers, Proceedings of the 15th ACM International Workshop on Network and Operating Systems Support for Digital Audio and Video (NOSSDAV 2005), June 2005, pages 189-194.
- Qin Xin, Ethan L. Miller, Thomas Schwarz, Darrell D. E. Long, Impact of Failure on Interconnection Networks in Large Storage Systems, Proceedings of the 22nd IEEE / 13th NASA Goddard Conference on Mass Storage Systems and Technologies, April 2005.
- Feng Wang, Scott A. Brandt, Ethan L. Miller, Darrell D. E. Long, OBFS: A File System for Object-Based Storage Devices, Proceedings of the 21st IEEE / 12th NASA Goddard Conference on Mass Storage Systems and Technologies, April 2004, pages 283-300.
- Feng Wang, Qin Xin, Bo Hong, Scott A. Brandt, Ethan L. Miller, Darrell D. E. Long, Tyce T. Mclarty, File System Workload Analysis For Large Scientific Computing Applications, NASA/IEEE Conference on Mass Storage Systems and Technologies (MSST 2004), April 2004, pages 139?152.
- Andy Hospodor, Ethan L. Miller, Interconnection Architectures for Petabyte-Scale High-Performance Storage Systems, Proceedings of the 21st IEEE / 12th NASA Goddard Conference on Mass Storage Systems and Technologies, April 2004, pages 273-281.
- R. J. Honicky, Ethan L. Miller, Replication Under Scalable Hashing: A Family of Algorithms for Scalable Decentralized Data Distribution, Proceedings of the 18th International Parallel & Distributed Processing Symposium (IPDPS 2004), April 2004.
This is a partial selection. A complete list of publications for the project is available on the
SSRC Ceph project web site.