Show simple item record

dc.contributor.advisorLoguinov, Dmitri
dc.creatorLi, Xiaoyong
dc.date.accessioned2016-07-08T15:14:19Z
dc.date.available2016-07-08T15:14:19Z
dc.date.created2016-05
dc.date.issued2016-05-09
dc.date.submittedMay 2016
dc.identifier.urihttps://hdl.handle.net/1969.1/157023
dc.description.abstractNowadays an increasing number of applications need to maintain local copies of remote data sources to provide services to their users. Because of the dynamic nature of the sources, an application has to synchronize its copies with remote sources constantly to provide reliable services. Instead of push-based synchronization, we focus on pull-based strategy because it doesn’t require source cooperation and has been widely adopted by existing systems. The scalability of the pull-based synchronization comes at the expense of increased inconsistency of the copied content. We model this system under non-Poisson update/refresh processes and obtain sample-path averages of various metrics of staleness cost, generalizing previous results and studying its statistical properties. Computing staleness requires knowledge of the inter-update distribution at the source, which can only be estimated through blind sampling – periodic downloads and comparison against previous copies. We show that all previous approaches are biased unless the observation rate tends to infinity or the update process is Poisson. To overcome these issues, we propose four new algorithms that achieve various levels of consistency, which depend on the amount of temporal information revealed by the source and capabilities of the download process. Then we focus on applying freshness to P2P replication systems. We extend our results to several more difficult algorithms – cascaded replication, cooperative caching, and redundant querying from the clients. Surprisingly, we discover that optimal cooperation involves just a single peer and that redundant querying can hurt the ability of the system to handle load (i.e., may lead to lower scalability).en
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.subjectdata synchronizationen
dc.subjectfreshnessen
dc.titleDistributed Synchronization Under Data Churnen
dc.typeThesisen
thesis.degree.departmentComputer Science and Engineeringen
thesis.degree.disciplineComputer Scienceen
thesis.degree.grantorTexas A & M Universityen
thesis.degree.nameDoctor of Philosophyen
thesis.degree.levelDoctoralen
dc.contributor.committeeMemberBettati, Riccardo
dc.contributor.committeeMemberCaverlee, Jame
dc.contributor.committeeMemberReddy, Narasimha
dc.type.materialtexten
dc.date.updated2016-07-08T15:14:19Z
local.etdauthor.orcid0000-0003-3574-3325


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record