Show simple item record

dc.contributor.advisorJiang, Anxiao
dc.creatorSunder, Gowrishankar
dc.date.accessioned2015-09-21T17:00:37Z
dc.date.available2015-09-21T17:00:37Z
dc.date.created2015-05
dc.date.issued2015-05-07
dc.date.submittedMay 2015
dc.identifier.urihttps://hdl.handle.net/1969.1/155125
dc.description.abstractError Correction has applications in a variety of domains given the prevalence of errors of various kinds and the need to programmatically correct them as accurately as possible. For example, error correction is used in portable mobile devices to fix typographical errors while taking input from the keypads. It can also be useful in lower level applications – to fix errors in storage media or to fix network transmission errors. The precision and the influence of such techniques can vary based on requirements and the capabilities of the correction technique but they essentially form a part of the application for its effective functioning. The research primarily focuses on various techniques to provide error correction given the location of the erroneous token. The errors are essentially Erasures which are missing bits in a stream of binary data, the locations of which are known. The basic idea behind these techniques lies in building up contextual information from an error-free training corpora and using these models, provide alternative suggestions which could replace the erroneous tokens. We look into two models - the topic-based LDA (Latent Dirichlet Allocation) model and the N-Gram model. We also propose an efficient mechanism to process such errors which offers exponential speed-ups. Using these models, we are able to achieve up to 5% improvement in accuracy as compared to a standard word distribution model using minimal domain knowledge.en
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.subjectError Correctionen
dc.subjectNatural Language Processingen
dc.subjectLDAen
dc.subjectTopic Modelen
dc.subjectn-gramen
dc.titleError Correction Using Probabilistic Language Modelsen
dc.typeThesisen
thesis.degree.departmentComputer Science and Engineeringen
thesis.degree.disciplineComputer Scienceen
thesis.degree.grantorTexas A & M Universityen
thesis.degree.nameMaster of Scienceen
thesis.degree.levelMastersen
dc.contributor.committeeMemberLiu, Tie
dc.contributor.committeeMemberChoe, Yoonsuck
dc.type.materialtexten
dc.date.updated2015-09-21T17:00:37Z
local.etdauthor.orcid0000-0002-7329-5407


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record