Detail View

SAFE: Sharing-Aware Prefetching for Efficient GPU Memory Management With Unified Virtual Memory
Citations

WEB OF SCIENCE

Citations

SCOPUS

Metadata Downloads

DC Field Value Language
dc.contributor.author Shin, Hyunkyun -
dc.contributor.author Bang, Seongtae -
dc.contributor.author Park, Hyungwon -
dc.contributor.author Kim, Daehoon -
dc.date.accessioned 2025-07-03T20:10:11Z -
dc.date.available 2025-07-03T20:10:11Z -
dc.date.created 2025-07-03 -
dc.date.issued 2025-01 -
dc.identifier.issn 1556-6056 -
dc.identifier.uri https://scholar.dgist.ac.kr/handle/20.500.11750/58611 -
dc.description.abstract As the demand for GPU memory from applications such as machine learning continues to grow exponentially, maximizing GPU memory capacity has become increasingly important. Unified Virtual Memory (UVM), which combines host and GPU memory into a unified address space, allows GPUs to utilize more memory than their physical capacity. However, this advantage comes at the cost of significant overheads when accessing host memory. Although existing prefetching techniques help alleviate these overheads, they still encounter challenges when dealing with irregular workloads and dynamic mixed workloads. In this paper, we demonstrate that the regularity of workloads is strongly correlated with the sharing status of UVM memory blocks among the Streaming Multiprocessors (SMs) of GPUs, which in turn impacts the effectiveness of prefetching. In addition, we propose the Sharing Aware preFEtching technique, SAFE, which dynamically adjusts prefetching strategies based on the sharing status of the accessed memory blocks. SAFE efficiently tracks the sharing status of the memory blocks by leveraging unified TLBs (uTLBs) and enforces tailored prefetching configurations for each block. This approach requires no hardware modifications and incurs negligible performance overhead. Our evaluation shows that SAFE achieves up to a 6.5x performance improvement over UVM default prefetcher for workloads with predominantly irregular memory access patterns, with an average improvement of 3.6x. © 2025 IEEE -
dc.language English -
dc.publisher Institute of Electrical and Electronics Engineers -
dc.title SAFE: Sharing-Aware Prefetching for Efficient GPU Memory Management With Unified Virtual Memory -
dc.type Article -
dc.identifier.doi 10.1109/LCA.2025.3553143 -
dc.identifier.wosid 001515500800001 -
dc.identifier.scopusid 2-s2.0-105009384729 -
dc.identifier.bibliographicCitation Shin, Hyunkyun. (2025-01). SAFE: Sharing-Aware Prefetching for Efficient GPU Memory Management With Unified Virtual Memory. IEEE Computer Architecture Letters, 24(1), 117–120. doi: 10.1109/LCA.2025.3553143 -
dc.description.isOpenAccess FALSE -
dc.subject.keywordAuthor unified TLB -
dc.subject.keywordAuthor Unified virtual memory -
dc.subject.keywordAuthor graphics processing unit -
dc.subject.keywordAuthor prefetcher -
dc.citation.endPage 120 -
dc.citation.number 1 -
dc.citation.startPage 117 -
dc.citation.title IEEE Computer Architecture Letters -
dc.citation.volume 24 -
dc.description.journalRegisteredClass scie -
dc.description.journalRegisteredClass scopus -
dc.relation.journalResearchArea Computer Science -
dc.relation.journalWebOfScienceCategory Computer Science, Hardware & Architecture -
dc.type.docType Article -
Show Simple Item Record

File Downloads

  • There are no files associated with this item.

공유

qrcode
공유하기

Total Views & Downloads