DGIST Scholar: A GPU-Based Complex Graph Query Processing System Based on Grid Partitioning for Trillion-Scale Graphs

Detail View

Department of Electrical Engineering and Computer Science Theses Ph.D.

A GPU-Based Complex Graph Query Processing System Based on Grid Partitioning for Trillion-Scale Graphs

Seyeon Oh

Citations

WEB OF SCIENCE

Citations

SCOPUS

Metadata Downloads

XML

Excel

DC Field	Value	Language
dc.contributor.advisor	좌훈승	-
dc.contributor.author	Seyeon Oh	-
dc.date.accessioned	2026-01-23T11:00:07Z	-
dc.date.available	2026-01-24T06:00:43Z	-
dc.date.issued	2025	-
dc.identifier.uri	https://scholar.dgist.ac.kr/handle/20.500.11750/59791	-
dc.identifier.uri	http://dgist.dcollection.net/common/orgView/200000892195	-
dc.description	GPU, Graph, Multi-hop, Out-of-Memory, Scheduling, Buffer Management	-
dc.description.abstract	Graphs are continually growing in size, and processing complex queries, such as multi-hop pattern queries, on them is becoming increasingly important. Although GPUs have received significant attention recently, there is still a notable shortage of efficient GPU-based out-of-memory methods for handling these queries. Three key issues arise when processing multi-hop queries on large-scale graphs using GPUs: the need for an efficient graph format, effective scheduling of accesses to graph partitions on storage, and dynamic buffer management on both the host and GPUs. To address these issues, we first propose the High-density Graph Representation Format (HGF), a GPU-friendly and space-efficient graph storage format that enables scalable graph partitioning and efficient access patterns on both CPUs and GPUs. Based on HGF, we present GFlux, an efficient GPU-based out-of-memory multi-hop query processing framework. GFlux introduces GTask, a novel abstraction that represents the data access and dependency structure of complex multi-hop queries, allowing them to be executed efficiently and in parallel on GPUs. Additionally, we design a GPU-based multi-hop query execution mechanism and a dynamic buffer management strategy, both built on GTasks. Through extensive experiments, we have demonstrated that GFlux significantly improves both the speed and scalability compared to existing state-of-the-art methods.\|그래프 데이터의 규모가 지속적으로 증가함에 따라, 멀티홉 (multi-hop) 패턴 질의와 같은 복잡한 질의를 효율적으로 처리하는 기술의 중요성이 커지고 있다. 최근 GPU 활용이 활발해졌지만, 이러한 질의를 위한 GPU 기반 out-of-memory 처리 방식은 여전히 부족하다. 대규모 그래프에서 멀티홉 질의를 GPU로 처리하려면, 효율적인 그래프 저장 형식, 저장장치 기반의 파티션 접근 스케줄링, 그리고 호스트와 GPU 간의 동적 버퍼 관리가 필수적이다. 이러한 문제를 해결하기 위해, GPU 친화적 (GPU-aware)이면서 공간 효율성을 갖춘 그래프 저장 형식인 High-density Graph Representation Format (HGF)를 제안하였다. HGF는 CPU와 GPU 모두에서 확장 가능한 그래프 파티셔닝과 효율적인 접근 패턴을 지원한다. 이를 기반으로, GPU 기반 out-of-memory 멀티홉 질의 처리 프레임워크인 GFlux를 제안하였다. GFlux는 복잡한 멀티홉 질의의 데이터 접근과 실행 의존성을 추상화하는 GTask 모델을 도입하고, 이를 활용하여 GPU 상에서 질의를 병렬적이고 효율적으로 처리한다. 또한, GTask 기반의 멀티홉 질의 실행 방식과 동적 버퍼 관리 기법을 함께 제안하여 전체 처리 성능을 향상시킨다. 다양한 실험을 통해, GFlux는 기존 state-of-the-art 기법들과 비교하여 처리 속도뿐만 아니라 확장성과 공간 효율성 측면에서도 우수한 성능을 입증하였다.	-
dc.description.tableofcontents	List of Contents ii List of Tables iv List of Figures v 1. Introduction 1 1.1 Motivation and objectives 1 1.2 Main Contributions 4 1.3 Structure of thesis 4 2. Background 7 2.1 Graph Partitioning and Formats 7 2.1.1 Partitioning 7 2.1.2 Format 8 2.2 Multi-hop Pattern Query 10 3. High-density Graph Representation 12 3.1 Hierarchical Grid Partitioning 12 3.2 Flip24: GPU-aware Graph Format 15 4. Multi-hop Pattern Query Processing 21 4.1 Finding a single pattern instance 21 4.2 Finding all pattern instances 24 5. GPU-based Out-of-Memory Graph Processing 28 5.1 GTask: unit of query processing task 28 5.1.1 PR 29 5.1.2 TC 29 5.1.3 Example of TC 31 5.2 Buffer management 33 5.3 Architecture of GFlux 36 5.3.1 GTask scheduling layer 36 5.3.2 GTask execution layer 38 5.4 Generality of GFlux 38 5.4.1 CC and SSSP 38 5.4.2 Quadrangle Counting (QC) 39 6. Implementation Details 42 6.1 Asynchronous Task Processing 42 6.2 Multi-GPU Scheduling 43 6.3 Aligned Buddy Memory Allocation 44 6.4 On-the-Fly PTR Unfolding 47 6.5 Degree Data Encoding 48 7. Experimental Evaluation 50 7.1 Experimental Setup 50 7.1.1 Datasets 50 7.1.2 Methods compared 50 7.1.3 H/W and S/W setting 51 7.2 Evaluation of 3-byte addressing 52 7.3 Evaluation of space efficiency of HGF 53 7.4 Evaluation of n-hop graph query processing 54 7.5 Evaluation of various queries 57 7.6 Decomposed execution time 58 7.7 Evaluation of scalability 59 7.8 Effectiveness of buffer management and scheduling 60 8. Related work 62 9. Conclusions 63	-
dc.format.extent	73	-
dc.language	eng	-
dc.publisher	DGIST	-
dc.title	A GPU-Based Complex Graph Query Processing System Based on Grid Partitioning for Trillion-Scale Graphs	-
dc.type	Thesis	-
dc.identifier.doi	10.22677/THESIS.200000892195	-
dc.description.degree	Doctor	-
dc.contributor.department	Department of Electrical Engineering and Computer Science	-
dc.contributor.coadvisor	Min-Soo Kim	-
dc.date.awarded	2025-08-01	-
dc.publisher.location	Daegu	-
dc.description.database	dCollection	-
dc.citation	XT.ID 오54 202508	-
dc.date.accepted	2025-07-21	-
dc.contributor.alternativeDepartment	전기전자컴퓨터공학과	-
dc.subject.keyword	GPU, Graph, Multi-hop, Out-of-Memory, Scheduling, Buffer Management	-
dc.contributor.affiliatedAuthor	Seyeon Oh	-
dc.contributor.affiliatedAuthor	Hoon Sung Chwa	-
dc.contributor.affiliatedAuthor	Min-Soo Kim	-
dc.contributor.alternativeName	오세연	-
dc.contributor.alternativeName	Hoon Sung Chwa	-
dc.contributor.alternativeName	김민수	-

Show Simple Item Record

File Downloads

There are no files associated with this item.

Detail View

A GPU-Based Complex Graph Query Processing System Based on Grid Partitioning for Trillion-Scale Graphs

File Downloads

공유

Total Views & Downloads