Ranking, Labeling, and Summarizing Short Text in Social Media

Khabiri, Elham

dc.contributor.advisor	Caverlee, James
dc.creator	Khabiri, Elham
dc.date.accessioned	2013-10-03T14:41:36Z
dc.date.available	2013-10-03T14:41:36Z
dc.date.created	2013-05
dc.date.issued	2013-04-18
dc.date.submitted	May 2013
dc.identifier.uri	https://hdl.handle.net/1969.1/149337
dc.description.abstract	One of the key features driving the growth and success of the Social Web is large-scale participation through user-contributed content – often through short text in social media. Unlike traditional long-form documents – e.g., Web pages, blog posts – these short text resources are typically quite brief (on the order of hundreds of characters), are often of a personal nature (reflecting opinions and reactions of users), and are being generated at an explosive rate. Coupled with this explosion of short text in social media is the need for new methods to organize, monitor, and distill relevant information from these large-scale social systems, even in the face of the inherent “messiness” of short text, considering the wide variability in quality, style, and substance of short text generated by a legion of Social Web participants. Hence, this dissertation seeks to develop new algorithms and methods to ensure the continued growth of the Social Web by enhancing how users engage with short text in social media. Concretely, this dissertation takes a three-fold approach: First, this dissertation develops a learning-based algorithm to automatically rank short text comments associated with a Social Web object (e.g., Web document, image, video) based on the expressed preferences of the community itself, so that low-quality short text may be filtered and user attention may be focused on highly-ranked short text. Second, this dissertation organizes short text through labeling, via a graph-based framework for automatically assigning relevant labels to short text. In this way, meaningful semantic descriptors may be assigned to short text for improved classification, browsing, and visualization. Third, this dissertation presents a cluster-based summarization approach for extracting high-quality viewpoints expressed in a collection of short text, while maintaining diverse viewpoints. By summarizing short text, users may quickly assess the aggregate viewpoints expressed in a collection of short text, without the need to scan each of possibly thousands of short text items.	en
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.subject	social media	en
dc.subject	short text	en
dc.subject	rank	en
dc.subject	label	en
dc.subject	summarize	en
dc.title	Ranking, Labeling, and Summarizing Short Text in Social Media	en
dc.type	Thesis	en
thesis.degree.department	Computer Science and Engineering	en
thesis.degree.discipline	Computer Science	en
thesis.degree.grantor	Texas A&M University	en
thesis.degree.name	Doctor of Philosophy	en
thesis.degree.level	Doctoral	en
dc.contributor.committeeMember	Shipman, Frank
dc.contributor.committeeMember	Gutierrez Osuna, Ricardo
dc.contributor.committeeMember	Burkart, Patrick
dc.type.material	text	en
dc.date.updated	2013-10-03T14:41:36Z

Files in this item

Name: KHABIRI-DISSERTATION-2013.pdf
Size: 2.538Mb
Format: PDF

This item appears in the following Collection(s)

Electronic Theses, Dissertations, and Records of Study (2002– )
Texas A&M University Theses, Dissertations, and Records of Study (2002– )