Show simple item record

dc.contributor.advisorNi, Yang
dc.contributor.advisorKatzfuss, Matthias
dc.creatorKidd, Brian J.
dc.date.accessioned2023-05-26T18:00:52Z
dc.date.created2022-08
dc.date.issued2022-07-21
dc.date.submittedAugust 2022
dc.identifier.urihttps://hdl.handle.net/1969.1/197934
dc.description.abstractDirected graphs have widespread applicability, so there is continued need for both computational and theoretical improvements. We extend the standard methodology in three different ways, which constitute the main chapters of the thesis. In spatial settings, information about data collection location is often ignored. We apply nearest neighbor ideas to an underlying directed graph of the data for covariance estimation. By making the underlying graph sparse, estimation is scalable and more stable for a small number of observations. Novel priors to encourage stability are developed in the Bayesian paradigm. Our methods are highly scalable, flexible, and parallelizable. We conduct numerical comparisons and apply our methodology to climate-model output, enabling statistical emulation of an expensive physical model. Next, in many healthcare settings, maintaining data privacy becomes paramount. Without sharing the data itself, local models could be vastly different; by pooling even model information, the overall model should be more unified and robust. This novel area of interest is called federated learning. We establish sparse regression and directed graphical modelling techniques for this task. Results show that our model is similar to modelling all of the data jointly but is faster and maintains the local privacy. We illustrate our method on both hospital COVID data and cancer genetics data. Finally, we relax the assumption of no cycles as many realistic systems have feedback loops. Although it is less computationally scalable when allowing cycles, we exploit various mathematical formulas to help improve computation. By allowing cycles, we are also able to use simpler priors and not require exact zeros as are necessary in acyclic modelling. We design a Bayesian method for estimating nonlinear directed graphs while allowing cycles under the assumption that the system is in equilibrium. We demonstrate the nonlinear modelling capacity with B-splines, but the only requirement is to have the gradient available in closed form. Overall, we advanced directed graph methodology in three ways with Bayesian methodology.
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.subjectdirected graphs
dc.subjectphylogenetic trees
dc.subjectfederated learning
dc.subjectclimate emulation
dc.titleDirected Graphs and Applications
dc.typeThesis
thesis.degree.departmentStatistics
thesis.degree.disciplineStatistics
thesis.degree.grantorTexas A&M University
thesis.degree.nameDoctor of Philosophy
thesis.degree.levelDoctoral
dc.contributor.committeeMemberBhattacharya, Anirban
dc.contributor.committeeMemberQian, Xiaoning
dc.type.materialtext
dc.date.updated2023-05-26T18:00:53Z
local.embargo.terms2024-08-01
local.embargo.lift2024-08-01
local.etdauthor.orcid0000-0002-9990-3159


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record