DGIST Scholar: TurboGraph: A Fast Parallel Graph Engine Handling Billion-scale Graphs in a Single PC

Detail View

Department of Electrical Engineering and Computer Science InfoLab 2. Conference Papers

TurboGraph: A Fast Parallel Graph Engine Handling Billion-scale Graphs in a Single PC

Citations

WEB OF SCIENCE

Citations

SCOPUS

Metadata Downloads

XML

Excel

Title: TurboGraph: A Fast Parallel Graph Engine Handling Billion-scale Graphs in a Single PC

Issued Date: 2013-08-12

Citation: Wook-Shin Han. (2013-08-12). TurboGraph: A Fast Parallel Graph Engine Handling Billion-scale Graphs in a Single PC. ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 77–85. doi: 10.1145/2487575.2487581

Type: Conference Paper

ISBN: 9781450321747

Abstract: Graphs are used to model many real objects such as social networks and web graphs. Many real applications in various fields require efficient and effective management of large-scale graph structured data. Although distributed graph engines such as GBase and Pregel handle billion-scale graphs, the user needs to be skilled at managing and tuning a distributed system in a cluster, which is a nontrivial job for the ordinary user. Furthermore, these distributed systems need many machines in a cluster in order to provide reasonable performance. In order to address this problem, a disk-based parallel graph engine called GraphChi, has been recently proposed. Although GraphChi significantly outperforms all representative (disk-based) distributed graph engines, we observe that GraphChi still has serious performance problems for many important types of graph queries due to 1) limited parallelism and 2) separate steps for I/O processing and CPU processing. In this paper, we propose a general, disk-based graph engine called TurboGraph to process billion-scale graphs very efficiently by using modern hardware on a single PC. TurboGraph is the first truly parallel graph engine that exploits 1) full parallelism including multicore parallelism and FlashSSD IO parallelism and 2) full overlap of CPU processing and I/O processing as much as possible. Specifically, we propose a novel parallel execution model, called pin-Andslide. TurboGraph also provides engine-level operators such as BFS which are implemented under the pin-And-slide model. Extensive experimental results with large real datasets show that Turbo- Graph consistently and significantly outperforms GraphChi by up to four orders of magnitude Our implementation of TurboGraph is available at http://wshan.net/turbograph as executable files. Copyright © 2013 ACM.
더보기

URI: http://hdl.handle.net/20.500.11750/6104

DOI: 10.1145/2487575.2487581

Publisher: ACM Special Interest Group on Knowledge Discovery and Data Mining (SIGKDD)

Show Full Item Record

File Downloads

There are no files associated with this item.

Detail View

TurboGraph: A Fast Parallel Graph Engine Handling Billion-scale Graphs in a Single PC

File Downloads

공유

Total Views & Downloads