Recoverable distributed shared memory

Kanthadai, Sundarrajan S

NOTE: This item is not available outside the Texas A&M University network. Texas A&M affiliated users who are off campus can access the item through NetID and password authentication or by using TAMU VPN. Non-affiliated individuals should request a copy through their local library's interlibrary loan service.

View/ Open

1996 Thesis K35.pdf (992.6Kb)

Date

1996

Author

Kanthadai, Sundarrajan S

Metadata

Show full item record

Abstract

Distributed Shared Memory (DSM) is a model for interprocess communication, implemented on top of message passing systems. In this model, processes running on separate hosts can access a shared, coherent memory address space, provided by the underlying DSM system, through the normal read and write operations. Thus, by avoiding the programming complexities of message passing, it has become a convenient model to work with. It is a natural extension of parallel programming on uniprocessors to distributed environments, As the number of processors in the system and the running time of applications executing on such a system increases, the likelihood of processor failure due to machine malfunction, power failure, user error, etc., increases. The benefits given by these systems can possibly be achieved only if the whole system behaves like a failure-free system. Many algorithms that have been proposed for implementing a reliable DSM, require the processes to take checkpoints whenever there is a data transfer, thus resulting in high overhead during failure-free execution. We propose a new recoverable DSM algorithm to tolerate multiple node failures and where the checkpointing interval can be tailored to balance the cost of checkpointing versus the savings in recovery obtained by taking checkpoints often. The technique uses independent checkpointing and keeps track of the dependencies by logging writes and some additional information about the occurrence of reads. Unlike previous recovery techniques, this one reduces both the message and the logging overheads.

URI

https://hdl.handle.net/1969.1/ETD-TAMU-1996-THESIS-K35

Description

Due to the character of the original source materials and the nature of batch digitization, quality control issues may be present in this document. Please report any quality issues you encounter to digital@library.tamu.edu, referencing the URI of the item.
Includes bibliographical references: p. 35-37.
Issued also on microfiche from Lange Micrographics.

Subject

computer science.
Major computer science.

Collections

Digitized Theses and Dissertations (1922–2004)

Citation

Kanthadai, Sundarrajan S (1996). Recoverable distributed shared memory. Master's thesis, Texas A&M University. Available electronically from https : / /hdl .handle .net /1969 .1 /ETD -TAMU -1996 -THESIS -K35.

This item and its contents are restricted. If this is your thesis or dissertation, you can make it open-access. This will allow all visitors to view the contents of the thesis.

Request Open Access