Show simple item record

dc.creatorYao, Yuan
dc.date.accessioned2018-05-23T15:32:53Z
dc.date.available2018-05-23T15:32:53Z
dc.date.created2019-05
dc.date.submittedMay 2019
dc.identifier.urihttps://hdl.handle.net/1969.1/166469
dc.description.abstractIn the age of fast growing technology, massive storage, and cluster computing, efficient big-data processing algorithms are in high demand. MapReduce is one of the programming models that enables massive-scale cluster technology around the world. Despite significant public efforts, the open-source implementation of MapReduce – Apache Hadoop – is cumbersome, complex, and inefficient. The purpose of this research is to improve the performance of Hadoop, specifically its sorting component, by developing a single-pass, streambased multithreaded bucket sort. Our new set of algorithms has the potential to influence the future of data-centric computing.en
dc.format.mimetypeapplication/pdf
dc.subjectstorageen
dc.subjectcluster computingen
dc.subjectMapReduceen
dc.subjectalgorithmsen
dc.subjectsingle-passen
dc.subjectstreambased multithread sorten
dc.titleVirtual Memory Streaming and Sorting in MapReduce Applicationsen
dc.typeThesisen
thesis.degree.departmentComputer Science & Engineeringen
thesis.degree.disciplineComputer Scienceen
thesis.degree.grantorUndergraduate Research Scholars Programen
thesis.degree.nameBSen
thesis.degree.levelUndergraduateen
dc.contributor.committeeMemberLoguinov, Dmitri
dc.type.materialtexten
dc.date.updated2018-05-23T15:32:54Z


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record