SCALABLE ALGORITHMS FOR HIGH DIMENSIONAL STRUCTURED DATA

Ren, Shaogang

dc.contributor.advisor	Qian, Xiaoning
dc.creator	Ren, Shaogang
dc.date.accessioned	2019-01-16T17:00:02Z
dc.date.available	2019-12-01T06:34:02Z
dc.date.created	2017-12
dc.date.issued	2017-12-11
dc.date.submitted	December 2017
dc.identifier.uri	https://hdl.handle.net/1969.1/173033
dc.description.abstract	Emerging technologies and digital devices provide us with increasingly large volumes of data, in both sample size and number of features. To realize the benefits of massive data sets, scalable statistical models and machine learning algorithms are increasingly important across research disciplines. For robust and accurate prediction, prior knowledge regarding dependency structures within data needs to be formulated appropriately in these models. At the same time, the scalability and computational complexity of existing algorithms may not meet the demands of analyzing massive high-dimensional data. This dissertation presents several novel methods to scale up sparse learning models to analyze massive data sets. We first present our novel safe active incremental feature (SAIF) selection algorithm for LASSO (least absolute shrinkage and selection operator), with a time complexity analysis showing its advantages over existing state-of-the-art methods. Because SAIF targets general convex loss functions, it can potentially be extended to many learning models and big-data applications, and we show how support vector machines (SVM) can be scaled up based on the idea of SAIF. Second, we propose screening methods for generalized LASSO (GL), which explicitly accounts for the dependency structure among features. We also propose a scalable feature selection method for non-parametric, non-linear models based on sparse structures and kernel methods. Theoretical analysis and experimental results in this dissertation show that model complexity can be significantly reduced under sparsity and structure assumptions.	en
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.subject	Sparse Learning	en
dc.subject	LASSO	en
dc.subject	Structured Sparse	en
dc.subject	Scalability	en
dc.subject	Big Data	en
dc.title	SCALABLE ALGORITHMS FOR HIGH DIMENSIONAL STRUCTURED DATA	en
dc.type	Thesis	en
thesis.degree.department	Electrical and Computer Engineering	en
thesis.degree.discipline	Computer Engineering	en
thesis.degree.grantor	Texas A&M University	en
thesis.degree.name	Doctor of Philosophy	en
thesis.degree.level	Doctoral	en
dc.contributor.committeeMember	Dougherty, Edward
dc.contributor.committeeMember	Huang, Jianhua
dc.contributor.committeeMember	Li, Peng
dc.contributor.committeeMember	Shakkottai, Srinivas
dc.type.material	text	en
dc.date.updated	2019-01-16T17:00:03Z
local.embargo.terms	2019-12-01
local.etdauthor.orcid	0000-0003-2352-3288

Files in this item

Name: REN-DISSERTATION-2017.pdf
Size: 3.019Mb
Format: PDF

This item appears in the following Collection(s)

Electronic Theses, Dissertations, and Records of Study (2002– )
Texas A&M University Theses, Dissertations, and Records of Study (2002– )
