Show simple item record

dc.contributor.advisorKalafatis, Stavros
dc.creatorGurushanthappa, Poornima Bevakal
dc.date.accessioned2020-09-10T15:16:34Z
dc.date.available2021-12-01T08:43:49Z
dc.date.created2019-12
dc.date.issued2019-10-24
dc.date.submittedDecember 2019
dc.identifier.urihttps://hdl.handle.net/1969.1/189051
dc.description.abstractMachine Learning involves analysing large sets of training data to make predictions and decisions to achieve a specific objective. This data-intensive computation places enormous demand on the underlying hardware. To improve overall performance and make predictions quickly, extremely powerful specialized hardware has been built to make the data processing faster. Although this has yielded improved results, they are not economical. They often require significant investment in additional infrastructure and collaboration among the various hardware components. On the other hand, CPUs are cost-effective, and easily accessible for a fraction of the cost. Unfortunately, very little work has been done to identify the configuration bottlenecks in CPUs and improve their overall performance to meet the demands of Machine Learning. This thesis aims to identify the system parameters which can be tweaked to achieve a boost in CPU performance for Machine Learning algorithms. Leveraging the Gem5 system simulator, a series of experiments were conducted varying the hardware configurations to observe the overall system performance. Analysis of the simulation results showed that CPU and system operating frequency, the L2 cache size and Indirect branch predictor can significantly affect the system performance. We strongly believe our system simulation results can further help in optimizing the performance of CPUs for machine learning workloads.en
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.subjectArtificial Intelligenceen
dc.subjectMachine Learningen
dc.subjectCPUen
dc.subjectConvolutional Neural Networken
dc.subjectMNISTen
dc.subjectCaffeen
dc.subjectTensorFlow.en
dc.titlePerformance Analysis of Artificial Intelligence Workloadsen
dc.typeThesisen
thesis.degree.departmentElectrical and Computer Engineeringen
thesis.degree.disciplineComputer Engineeringen
thesis.degree.grantorTexas A&M Universityen
thesis.degree.nameMaster of Scienceen
thesis.degree.levelMastersen
dc.contributor.committeeMemberGratz, Paul
dc.contributor.committeeMemberTyagi, Aakash
dc.type.materialtexten
dc.date.updated2020-09-10T15:16:35Z
local.embargo.terms2021-12-01
local.etdauthor.orcid0000-0002-1613-8464


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record