LOTS: Software Distributed Shared Memory Supporting a Large Object Space

                         

Software DSM provides good programmability for cluster computing, but its performance and limited shared memory space for large applications hinder its popularity. Due to the memory mapping strategy and the limited size of the process space, most current DSM systems cannot create more than 4GB of shared memory on 32-bit machines, regardless of the number of machines used.

We introduce LOTS (Large ObjecT Space), a DSM system whose main objective is to provide a large shared object space, with the use of the local disk of each machine as backing store. LOTS can provide more than 4GB of shared object space, with an upper bound being the space available in the local hard disk. To let the virtual address space store more objects, only a trace of control information for each object is needed to be resident in the virtual address space, while the actual object data are dynamically but lazily mapped into the virtual memory when being accessed. The whole process is done at runtime, without any compiler preprocessing.

To our knowledge, LOTS is the first pure runtime software DSM supporting a shared object space larger than the local process space. Our testing shows that LOTS can utilize all the free hard disk space available to support hundreds of gigabytes of shared objects with a small overhead (around 5-15% of the total execution time). The scope consistency memory model and a new mixed coherence protocol allow LOTS to achieve better scalability with respect to problem size and cluster size.

A detailed description of the LOTS project will be available soon.

Publications:

  • Benny W.L. Cheung, C.L. Wang, and F.C.M. Lau,  ``LOTS: A Software DSM Supporting Large Object Space,  2004 IEEE International Conference on Cluster Computing (Cluster2004), pp. 225-234, Sept. 20-23 2004, San Diego, California. (pdf) (PPT).

  • Benny W.L. Cheung, C.L. Wang, and F.C.M. Lau; `` Migrating-Home Protocol for Software Distributed Shared Memory,'' Journal of Information Science and Engineering, Vol. 18, No. 6, November 2002, pp. 929-957. (PAPER

  • Benny W.L. Cheung, C.L. Wang, Kai Hwang; ``A Migrating-Home Protocol for Implementing Scope Consistency Model on a Cluster of Workstations,’’ International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA’99), pp. 821-827, June 28 – July 1, 1999, Las Vegas, Nevada, USA. (Postscript)

  • Benny W.L. Cheung, C.L. Wang, Kai Hwang; ``JUMP-DP: A Software DSM System with Low-Latency Communication Support,” International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA2000), pp. 445-451, June 26 – 29, 2000, Las Vegas, Nevada, USA. (Postscript) (slide) .