Show simple item record

dc.contributor.advisorHuang, Jianhua
dc.contributor.advisorHu, Jianhua
dc.creatorMaadooliat, Mehdi
dc.date.accessioned2012-10-19T15:28:31Z
dc.date.accessioned2012-10-22T18:03:28Z
dc.date.available2012-10-19T15:28:31Z
dc.date.available2012-10-22T18:03:28Z
dc.date.created2011-08
dc.date.issued2012-10-19
dc.date.submittedAugust 2011
dc.identifier.urihttps://hdl.handle.net/1969.1/ETD-TAMU-2011-08-9731
dc.description.abstractStorage and analysis of high-dimensional datasets are always challenging. Dimension reduction techniques are commonly used to reduce the complexity of the data and obtain the informative aspects of datasets. Principal Component Analysis (PCA) is one of the commonly used dimension reduction techniques. However, PCA does not work well when there are outliers or the data distribution is skewed. Gene expression index estimation is an important problem in bioinformatics. Some of the popular methods in this area are based on the PCA, and thus may not work well when there is non-Gaussian structure in the data. To address this issue, a likelihood based data transformation method with a computationally efficient algorithm is developed. Also, a new multivariate expression index is studied and the performance of the multivariate expression index is compared with the commonly used univariate expression index. As an extension of the gene expression index estimation problem, a general procedure that integrates data transformation with the PCA is developed. In particular, this general method can handle missing data and data with functional structure. It is well-known that the PCA can be obtained by the eigen decomposition of the sample covariance matrix. Another focus of this dissertation is to study the covariance (or correlation) structure under the non-Gaussian assumption. An important issue in modeling the covariance matrix is the positive definiteness constraint. The modified Cholesky decomposition of the inverse covariance matrix has been considered to address this issue in the literature. An alternative Cholesky decomposition of the covariance matrix is considered and used to construct an estimator of the covariance matrix under multivariate-t assumption. The advantage of this alternative Cholesky decomposition is the decoupling of the correlation and the variances.en
dc.format.mimetypeapplication/pdf
dc.language.isoen_US
dc.subjectCorrelation modellingen
dc.subjectFunctional PCAen
dc.subjectMissing dataen
dc.subjectPCAen
dc.subjectProfile likelihooden
dc.subjectTransformation modelen
dc.titleDimension Reduction and Covariance Structure for Multivariate Data, Beyond Gaussian Assumptionen
dc.typeThesisen
thesis.degree.departmentStatisticsen
thesis.degree.disciplineStatisticsen
thesis.degree.grantorTexas A&M Universityen
thesis.degree.nameDoctor of Philosophyen
thesis.degree.levelDoctoralen
dc.contributor.committeeMemberPourahmadi, Mohsen
dc.contributor.committeeMemberLiang, Faming
dc.contributor.committeeMemberLi, Peng
dc.type.genrethesisen
dc.type.materialtexten


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record