NOTE: This item is not available outside the Texas A&M University network. Texas A&M affiliated users who are off campus can access the item through NetID and password authentication or by using TAMU VPN. Non-affiliated individuals should request a copy through their local library's interlibrary loan service.
FTMP : a protocol for operating system fault tolerance in a fully distributed, loosely coupled environment
dc.contributor.advisor | Pooch, Udo W. | |
dc.contributor.advisor | Sheppard, Sallie V. | |
dc.creator | Safford, David Robert | |
dc.date.accessioned | 2024-02-09T20:47:58Z | |
dc.date.available | 2024-02-09T20:47:58Z | |
dc.date.issued | 1990 | |
dc.identifier.uri | https://hdl.handle.net/1969.1/DISSERTATIONS-1174770 | |
dc.description | Typescript (photocopy) | en |
dc.description | Vita | en |
dc.description | Major subject: Computer science | en |
dc.description.abstract | This research presents the design, implementation, and testing of FTMP (Fault Tolerant Monitor Protocol). FTMP provides distributed reliability services to application programs in a highly redundant, loosely coupled, distributed network. Design goals for this research included network and location transparency for the applications; topology and operating system independence; the ability to diagnose, contain, and recover from both hardware and software failures; and provision for future protocol extensions. The goal of topology independence is significant in that it prevents the use of existing network broadcast and routing services, while offering the use of much more fault-tolerant network designs, such as planar-2. The design approach combines the addition of new vertical operating system layers with full distribution and data replication for these new services. Vertical layering is used to place fault tolerance appropriately. While many prior efforts have centered only on hardware-based or application-based methods, this approach recognizes that fault tolerance is needed at all levels, including the hardware, device driver, operating system kernel, library, and application layers. In addition, two new layers are added between the application and the traditional operating system to provide the desired new distributed services. The specific design provides the necessary operating system extensions with one service daemon per node. This daemon implements FTMP in a fully distributed fashion. In addition, an interface library is provided that simplifies the application's use of the FTMP services, while also implementing some of the fault tolerance services on a per-application basis.
Together, the interface library and service daemon provide automatic detection and correction of hardware and software failures, location transparent communication through distributed named ports, automatic critical file remote replication, and distributed object handling. A version is implemented in a 4 by 4 planar-2 mesh of Sun processors running Sun UNIX. This implementation is evaluated for completeness, correct operation, and efficiency. The results demonstrate that FTMP provides powerful tools for highly reliable distributed applications. | en |
dc.format.extent | xi, 336 leaves | en |
dc.format.medium | electronic | en |
dc.format.mimetype | application/pdf | |
dc.language.iso | eng | |
dc.rights | This thesis was part of a retrospective digitization project authorized by the Texas A&M University Libraries. Copyright remains vested with the author(s). It is the user's responsibility to secure permission from the copyright holder(s) for re-use of the work beyond the provision of Fair Use. | en |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | |
dc.subject | Major computer science | en |
dc.subject.classification | 1990 Dissertation S128 | |
dc.subject.lcsh | Operating systems (Computers) | en |
dc.subject.lcsh | Fault-tolerant computing | en |
dc.subject.lcsh | Electronic data processing | en |
dc.subject.lcsh | Distributed processing | en |
dc.subject.lcsh | Computer network protocols | en |
dc.title | FTMP : a protocol for operating system fault tolerance in a fully distributed, loosely coupled environment | en |
dc.type | Thesis | en |
thesis.degree.discipline | Computer science | en |
thesis.degree.grantor | Texas A&M University | en |
thesis.degree.name | Doctor of Philosophy | en |
thesis.degree.name | Ph.D. | en |
thesis.degree.level | Doctoral | en |
dc.contributor.committeeMember | Friesen, Donald K. | |
dc.contributor.committeeMember | Koppa, Rodger | |
dc.contributor.committeeMember | McCormick, Bruce H. | |
dc.type.genre | dissertations | en |
dc.type.material | text | en |
dc.format.digitalOrigin | reformatted digital | en |
dc.publisher.digital | Texas A&M University. Libraries | |
dc.identifier.oclc | 24024097 | |
This item appears in the following Collection(s)
- Digitized Theses and Dissertations (1922–2004)
- Texas A&M University Theses and Dissertations (1922–2004)