
Applied Science Building
8888 University Drive
Simon Fraser University
Burnaby, BC V5A 1S6
Email: qilin_hu [at] sfu.ca
I am a visiting Ph.D. student in Engineering Science at the Simon Fraser University, advised by Prof. Zhenman Fang. My research interests include: High-Performance Computing, Parallel Computing, Memory Systems, and Hardware/Software Codesign. I am also a Ph.D. student in Computer Science and Technology at the Hunan University, under the supervision of Prof. Chubo Liu and co-supervision of Prof. Keqin Li. During this period, my major work focused on memory systems modeling and cache behavior simulation. I received my M.S. from HNU, under the supervision of Prof. Kenli Li on parallel algorithm design.
Currently, I am a heterogeneous computing systems researcher and focus on hardware-accelerated implementation of fully homomorphic encryption.
| J1 |
CBANA: A Lightweight, Efficient, and Flexible Cache Behavior Analysis Framework TC
Cache miss analysis has become one of the most important things to improve the execution performance of a program. Generally, the approaches for analyzing cache misses can be categorized into dynamic analysis and static analysis. The former collects sampling statistics during program execution but is limited to specialized hardware support and incurs expensive execution overhead. The latter avoids the limitations but faces two challenges: inaccurate execution path prediction and inefficient analysis resulted by the explosion of the program state graph. To overcome these challenges, we propose CBANA, an LLVM- and process address space-based lightweight, efficient, and flexible cache behavior analysis framework. CBANA significantly improves the prediction accuracy of the execution path with awareness of inputs. To improve analysis efficiency and utilize the program preprocessing, CBANA refactors loop structures to reduce search space and dynamically splices intermediate results to reduce unnecessary or redundant computations. CBANA also supports configurable hardware parameter settings, and decouples the module of cache replacement policy from other modules. Thus, its flexibility is established. We evaluate CBANA by using the popular open benchmark PolyBench, graph workloads, and our synthetic workloads with good and poor data locality. Compared with the popular dynamic cache analysis tools Perf and Valgrind, the cache miss gap is less than 3.79% and 2.74% respectively with over ten thousand data accesses for the synthetic workloads, and the time reduction is up to 92.38% and 97.51% for the multiple-path workloads. Compared with the popular static cache analysis tool Heptane, CBANA achieves a time reduction of 97.71% while ensuring accuracy at the same time.
@ARTICLE{10564807,
author={Hu, Qilin and Ding, Yan and Liu, Chubo and Li, Keqin and Li, Kenli and Zomaya, Albert Y.},
journal={IEEE Transactions on Computers},
title={CBANA: A Lightweight, Efficient, and Flexible Cache Behavior Analysis Framework},
year={2024},
volume={73},
number={9},
pages={2262-2274},
keywords={Static analysis;Hardware;Accuracy;Performance analysis;Explosions;Analytical models;Data models;Cache behavior modeling;path analysis;path selection;static analysis},
doi={10.1109/TC.2024.3416747}}
|