DGIST Scholar: SAFE: Sharing-Aware Prefetching for Efficient GPU Memory Management With Unified Virtual Memory

Detail View

ETC 1. Journal Articles

SAFE: Sharing-Aware Prefetching for Efficient GPU Memory Management With Unified Virtual Memory

Citations

WEB OF SCIENCE

Citations

SCOPUS

Metadata Downloads

XML

Excel

DC Field	Value	Language
dc.contributor.author	Shin, Hyunkyun	-
dc.contributor.author	Bang, Seongtae	-
dc.contributor.author	Park, Hyungwon	-
dc.contributor.author	Kim, Daehoon	-
dc.date.accessioned	2025-07-03T20:10:11Z	-
dc.date.available	2025-07-03T20:10:11Z	-
dc.date.created	2025-07-03	-
dc.date.issued	2025-01	-
dc.identifier.issn	1556-6056	-
dc.identifier.uri	https://scholar.dgist.ac.kr/handle/20.500.11750/58611	-
dc.description.abstract	As the demand for GPU memory from applications such as machine learning continues to grow exponentially, maximizing GPU memory capacity has become increasingly important. Unified Virtual Memory (UVM), which combines host and GPU memory into a unified address space, allows GPUs to utilize more memory than their physical capacity. However, this advantage comes at the cost of significant overheads when accessing host memory. Although existing prefetching techniques help alleviate these overheads, they still encounter challenges when dealing with irregular workloads and dynamic mixed workloads. In this paper, we demonstrate that the regularity of workloads is strongly correlated with the sharing status of UVM memory blocks among the Streaming Multiprocessors (SMs) of GPUs, which in turn impacts the effectiveness of prefetching. In addition, we propose the Sharing Aware preFEtching technique, SAFE, which dynamically adjusts prefetching strategies based on the sharing status of the accessed memory blocks. SAFE efficiently tracks the sharing status of the memory blocks by leveraging unified TLBs (uTLBs) and enforces tailored prefetching configurations for each block. This approach requires no hardware modifications and incurs negligible performance overhead. Our evaluation shows that SAFE achieves up to a 6.5x performance improvement over UVM default prefetcher for workloads with predominantly irregular memory access patterns, with an average improvement of 3.6x. © 2025 IEEE	-
dc.language	English	-
dc.publisher	Institute of Electrical and Electronics Engineers	-
dc.title	SAFE: Sharing-Aware Prefetching for Efficient GPU Memory Management With Unified Virtual Memory	-
dc.type	Article	-
dc.identifier.doi	10.1109/LCA.2025.3553143	-
dc.identifier.wosid	001515500800001	-
dc.identifier.scopusid	2-s2.0-105009384729	-
dc.identifier.bibliographicCitation	Shin, Hyunkyun. (2025-01). SAFE: Sharing-Aware Prefetching for Efficient GPU Memory Management With Unified Virtual Memory. IEEE Computer Architecture Letters, 24(1), 117–120. doi: 10.1109/LCA.2025.3553143	-
dc.description.isOpenAccess	FALSE	-
dc.subject.keywordAuthor	unified TLB	-
dc.subject.keywordAuthor	Unified virtual memory	-
dc.subject.keywordAuthor	graphics processing unit	-
dc.subject.keywordAuthor	prefetcher	-
dc.citation.endPage	120	-
dc.citation.number	1	-
dc.citation.startPage	117	-
dc.citation.title	IEEE Computer Architecture Letters	-
dc.citation.volume	24	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalWebOfScienceCategory	Computer Science, Hardware & Architecture	-
dc.type.docType	Article	-

Show Simple Item Record

File Downloads

There are no files associated with this item.

Detail View

SAFE: Sharing-Aware Prefetching for Efficient GPU Memory Management With Unified Virtual Memory

File Downloads

공유

Total Views & Downloads