Show simple item record

dc.contributor.advisor: Gratz, Paul V
dc.creator: Albarakat, Laith Mohammad
dc.date.accessioned: 2018-02-05T21:10:56Z
dc.date.available: 2018-02-05T21:10:56Z
dc.date.created: 2017-08
dc.date.issued: 2017-07-28
dc.date.submitted: August 2017
dc.identifier.uri: https://hdl.handle.net/1969.1/165781
dc.description.abstract: To take advantage of the processing power of Chip Multiprocessors, applications must be divided into semi-independent processes that can run concurrently on multiple cores within a system. Programmers must therefore insert thread synchronization semantics (i.e., locks, barriers, and condition variables) to synchronize data access between processes. In practice, threads spend a long time waiting to acquire the lock of a critical section. In addition, a processor has to stall execution while waiting for load accesses to complete. Furthermore, there are often independent instructions, including load instructions beyond the synchronization semantics, that could be executed in parallel while a thread waits on those semantics. The convenience of cache memories comes with extra cost in Chip Multiprocessors: cache coherence mechanisms address the memory consistency problem, but they add considerable overhead to memory accesses. An aggressive prefetcher on each core of a Chip Multiprocessor can significantly degrade system performance when running multi-threaded applications. This degradation results from prefetch-demand interference: when a prefetcher in one core pulls shared data from a producing core before that data has been written, the cache block transitions back and forth between the cores, producing useless prefetches that saturate memory bandwidth and substantially increase the latency to critical shared data. We present a hardware prefetcher that enables large performance improvements from prefetching in Chip Multiprocessors by significantly reducing prefetch-demand interference. Furthermore, it utilizes the time that a thread spends waiting on synchronization semantics to run ahead of the critical section, speculating and prefetching the data of independent load instructions beyond the synchronization semantics.
dc.format.mimetype: application/pdf
dc.language.iso: en
dc.subject: prefetcher
dc.title: Multithreading Aware Hardware Prefetching for Chip Multiprocessors
dc.type: Thesis
thesis.degree.department: Electrical and Computer Engineering
thesis.degree.discipline: Computer Engineering
thesis.degree.grantor: Texas A&M University
thesis.degree.name: Master of Science
thesis.degree.level: Masters
dc.contributor.committeeMember: Hou, I-Hong
dc.contributor.committeeMember: Jimenez, Daniel A
dc.type.material: text
dc.date.updated: 2018-02-05T21:10:57Z
local.etdauthor.orcid: 0000-0001-9644-6565


