Show simple item record

dc.contributor.advisorCarroll, Raymond
dc.contributor.advisorMallick, Bani
dc.creatorWang, Yanqing
dc.date.accessioned2015-02-05T21:23:10Z
dc.date.available2016-08-01T05:30:26Z
dc.date.created2014-08
dc.date.issued2014-08-15
dc.date.submittedAugust 2014
dc.identifier.urihttps://hdl.handle.net/1969.1/153663
dc.description.abstractMotivated by a logistic regression problem involving diet and cancer, we reconsider the problem of forming a confidence interval for the ratio of two location parameters. We develop a new methodology, which we call the Direct Integral Method for Ratios (DIMER). In simulations, we compare this method to many others, including Wald's method, Fieller's interval, Hayya's method, the nonparametric bootstrap and the parametric bootstrap. These simulations show that, generally, DIMER more closely achieves the nominal confidence level, and in those cases that the other methods achieve the nominal levels, DIMER generally has smaller confidence interval lengths. We also show that DIMER eliminates the probability of infinite length or enormous length confidence intervals, something that can occur in Fieller's interval. Furthermore, we study the real Healthy Eating Index-2005 (HEI-2005) data set from the NIH-AARP Study of Diet and Health, consider a weighted logistic regression model in which there are multiple subpopulations, and multiple diseases within each subpopulation. Based on this model, we present six different approaches to form the confidence intervals for the relative risks of different diseases in different subpopulations, including DIMER. The asymptotic distributions of the estimates for the log(relative risks) by the maximum likelihood and the nonparametric bootstrap method are provided. Next, the algorithms are presented to perform hypothesis tests and likelihood ratio tests to check there are significant differences between our proposed model and the other three logistic regression models or not. In addition, the adaptive lasso and an estimator with bounded constrains are described for variable selection and a novel algorithm to solve the nonlinear regression model with L1 norm penalty is proposed. The application of all those methods to the HEI-2005 data are illustrated. Additionally, we expand the linear function of nutrition components inside the logistic regression model to a nonlinear case. More than that, we consider there are some limitations from the knowledge of biology and nutrition and propose a logistic regression model involving I-spline basis functions and an algorithm to solve it. Application to the real HEI-200d data set and comparison to a logistic model with total HEI scores are also presented.en
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.subjectAdaptive Lassoen
dc.subjectDIMERen
dc.subjectDirect Integral Method for Ratiosen
dc.subjectHEI-2005en
dc.subjectI-splineen
dc.subjectRelative Risken
dc.subjectVariable Selection.en
dc.titleRelative Risks Analysis in Nutritional Epidemiologyen
dc.typeThesisen
thesis.degree.departmentStatisticsen
thesis.degree.disciplineStatisticsen
thesis.degree.grantorTexas A & M Universityen
thesis.degree.nameDoctor of Philosophyen
thesis.degree.levelDoctoralen
dc.contributor.committeeMemberBaladandayuthapani, Veera
dc.contributor.committeeMemberBraga-Neto, Ulisses
dc.type.materialtexten
dc.date.updated2015-02-05T21:23:10Z
local.embargo.terms2016-08-01
local.etdauthor.orcid0000-0003-2687-4323


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record