
dc.contributor.advisor: Zhang, Xianyang
dc.creator: Yi, Sangyoon
dc.date.accessioned: 2021-02-03T19:40:15Z
dc.date.available: 2022-08-01T06:52:17Z
dc.date.created: 2020-08
dc.date.issued: 2020-07-09
dc.date.submitted: August 2020
dc.identifier.uri: https://hdl.handle.net/1969.1/192358
dc.description.abstract: This dissertation consists of two independent studies on statistical inference in high-dimensional models. The first study considers the high-dimensional linear model, in which the number of predictors is greater than the sample size. The second study covers high-dimensional association tests in genomics, in which the number of features exceeds the sample size. In the first study, we develop a new method to estimate the projection direction in the debiased Lasso estimator. The basic idea is to decompose the overall bias into two terms corresponding to strong and weak signals, respectively. We propose to estimate the projection direction by balancing the squared biases associated with the strong and weak signals as well as the variance of the projection-based estimator. A standard quadratic programming solver can efficiently solve the resulting optimization problem. In theory, we show that the unknown set of strong signals can be consistently estimated and that the projection-based estimator is asymptotically normal under suitable assumptions. A slight modification of our procedure leads to an estimator with a potentially smaller order of bias compared to the original debiased Lasso. We further generalize our method to conduct inference for a sparse linear combination of the regression coefficients. Numerical studies demonstrate the advantage of the proposed approach in coverage accuracy over some existing alternatives. The second study presents a novel two-stage approach for more powerful confounder adjustment in large-scale multiple testing, striking a balance between the Type I error and power. Specifically, we use the unadjusted z-statistics to enrich signals in the first stage and then use the adjusted z-statistics to remove the false signals due to confounders in the second stage. We develop a new way of simultaneously choosing the cutoffs for the two stages, based on our estimates of the false rejections obtained with a nonparametric empirical Bayes approach. We show that our proposed method provides asymptotic false discovery rate control and delivers more power than the traditional one-stage approach. Promising finite-sample performance is demonstrated via simulations and a real data illustration in comparison with existing competitors. [en]
dc.format.mimetype: application/pdf
dc.language.iso: en
dc.subject: Confidence interval [en]
dc.subject: High-dimensional linear models [en]
dc.subject: Lasso [en]
dc.subject: Quadratic programming [en]
dc.subject: Benjamini-Hochberg procedure [en]
dc.subject: Confounding factor [en]
dc.subject: Empirical Bayes [en]
dc.subject: False discovery rate [en]
dc.subject: Multiple testing [en]
dc.title: Statistical Inference in High-Dimensional Models [en]
dc.type: Thesis [en]
thesis.degree.department: Statistics [en]
thesis.degree.discipline: Statistics [en]
thesis.degree.grantor: Texas A&M University [en]
thesis.degree.name: Doctor of Philosophy [en]
thesis.degree.level: Doctoral [en]
dc.contributor.committeeMember: Pourahmadi, Mohsen
dc.contributor.committeeMember: Gaynanova, Irina
dc.contributor.committeeMember: Chen, Yong
dc.type.material: text [en]
dc.date.updated: 2021-02-03T19:40:15Z
local.embargo.terms: 2022-08-01
local.etdauthor.orcid: 0000-0001-6892-577X
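
Illustrative note on the abstract: the two-stage screening described for the second study can be sketched roughly as follows. This is a minimal sketch and not the dissertation's procedure. The function name `two_stage_rejections`, the two-sided absolute-value rule, and the hand-picked cutoffs `t1` and `t2` are all assumptions made for illustration; the abstract states that the two cutoffs are chosen simultaneously from empirical Bayes estimates of the false rejections, and that selection step is not reproduced here.

```python
from typing import List, Sequence


def two_stage_rejections(
    z_unadjusted: Sequence[float],
    z_adjusted: Sequence[float],
    t1: float,
    t2: float,
) -> List[int]:
    """Return indices of features that pass both screening stages.

    Hypothetical helper: t1 and t2 are fixed inputs here, whereas the
    abstract describes choosing them jointly to control the FDR.
    """
    if len(z_unadjusted) != len(z_adjusted):
        raise ValueError("both statistic vectors must have the same length")
    # Stage 1: enrich signals using the unadjusted z-statistics.
    stage1 = [i for i, z in enumerate(z_unadjusted) if abs(z) >= t1]
    # Stage 2: drop stage-1 survivors whose adjusted z-statistic is small,
    # i.e. signals that may be explained by confounders.
    return [i for i in stage1 if abs(z_adjusted[i]) >= t2]


# Toy statistics, made up for illustration only.
z_u = [3.1, 0.4, -2.9, 2.5]
z_a = [2.8, 0.1, -0.3, 2.2]
selected = two_stage_rejections(z_u, z_a, t1=2.0, t2=2.0)
print(selected)  # [0, 3]
```

In this toy input, feature 2 passes the first stage but is removed in the second because its adjusted statistic is small, which is the kind of confounder-driven signal the second stage is meant to filter out.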

