Show simple item record

dc.contributor.advisorCaverlee, James
dc.creatorKhabiri, Elham
dc.date.accessioned2013-10-03T14:41:36Z
dc.date.available2013-10-03T14:41:36Z
dc.date.created2013-05
dc.date.issued2013-04-18
dc.date.submittedMay 2013
dc.identifier.urihttps://hdl.handle.net/1969.1/149337
dc.description.abstractOne of the key features driving the growth and success of the Social Web is large-scale participation through user-contributed content – often through short text in social media. Unlike traditional long-form documents – e.g., Web pages, blog posts – these short text resources are typically quite brief (on the order of 100s of characters), often of a personal nature (reflecting opinions and reactions of users), and being generated at an explosive rate. Coupled with this explosion of short text in social media is the need for new methods to organize, monitor, and distill relevant information from these large-scale social systems, even in the face of the inherent “messiness” of short text, considering the wide variability in quality, style, and substance of short text generated by a legion of Social Web participants. Hence, this dissertation seeks to develop new algorithms and methods to ensure the continued growth of the Social Web by enhancing how users engage with short text in social media. Concretely, this dissertation takes a three-fold approach: First, this dissertation develops a learning-based algorithm to automatically rank short text comments associated with a Social Web object (e.g., Web document, image, video) based on the expressed preferences of the community itself, so that low-quality short text may be filtered and user attention may be focused on highly-ranked short text. Second, this dissertation organizes short text through labeling, via a graph- based framework for automatically assigning relevant labels to short text. In this way meaningful semantic descriptors may be assigned to short text for improved classification, browsing, and visualization. Third, this dissertation presents a cluster-based summarization approach for extracting high-quality viewpoints expressed in a collection of short text, while maintaining diverse viewpoints. By summarizing short text, user attention may quickly assess the aggregate viewpoints expressed in a collection of short text, without the need to scan each of possibly thousands of short text items.en
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.subjectsocial mediaen
dc.subjectshort texten
dc.subjectranken
dc.subjectlabelen
dc.subjectsummarizeen
dc.titleRanking, Labeling, and Summarizing Short Text in Social Mediaen
dc.typeThesisen
thesis.degree.departmentComputer Science and Engineeringen
thesis.degree.disciplineComputer Scienceen
thesis.degree.grantorTexas A&M Universityen
thesis.degree.nameDoctor of Philosophyen
thesis.degree.levelDoctoralen
dc.contributor.committeeMemberShipman, Frank
dc.contributor.committeeMemberGutierrez Osuna, Ricardo
dc.contributor.committeeMemberBurkart, Patrick
dc.type.materialtexten
dc.date.updated2013-10-03T14:41:36Z


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record