Show simple item record

dc.creator    Sacco, Jacob
dc.date.accessioned    2019-06-10T16:18:07Z
dc.date.available    2019-06-10T16:18:07Z
dc.date.created    2019-05
dc.date.submitted    May 2019
dc.identifier.uri    https://hdl.handle.net/1969.1/175464
dc.description.abstract    As machine learning is applied to ever more ambitious tasks, higher performance is required to train and evaluate neural nets in reasonable amounts of time. To this end, many hardware accelerators for machine learning have been made, ranging from ASICs to CUDA code that runs on a conventional GPU. GPU- and FPGA-based accelerators have seen more success than ASICs due to the ease with which the design can be tweaked or revised, but they still suffer from latency resulting from the interface between the processor and the accelerator (generally PCIe). The purpose of this paper is to build a hardware accelerator on Intel’s Heterogeneous Architecture Research Platform, which includes a Xeon processor and an Arria 10 FPGA on the same mainboard that share access to common memory. This should significantly reduce latency and increase throughput. This accelerator is expected to at least match the performance of a typical machine learning library implementation on a GPU, and will hopefully significantly exceed it.    en
dc.format.mimetype    application/pdf
dc.subject    Machine Learning    en
dc.subject    HARP    en
dc.subject    Verilog    en
dc.subject    Hardware Accelerator    en
dc.title    Building a Better Machine Learning Hardware Accelerator with HARP    en
dc.type    Thesis    en
thesis.degree.department    Electrical & Computer Engineering    en
thesis.degree.discipline    Computer Engineering-Electrical Engineering Track    en
thesis.degree.grantor    Undergraduate Research Scholars Program    en
thesis.degree.name    BS    en
thesis.degree.level    Undergraduate    en
dc.contributor.committeeMember    Khatri, Sunil
dc.type.material    text    en
dc.date.updated    2019-06-10T16:18:07Z

