PARDA (IPDPS 12) 论文阅读
PARDA: A Fast Parallel Reuse Distance Analysis Algorithm
SHARDS 基于 PARDA 实现,我比较好奇对 MRC generation 并行计算该怎么做,并且我不知道这个并行优化后的结果和 OSCA 哪个更好。
Abstract
parallel algorithm to compute accurate reuse distances
Introduction
Background: Reuse Distance
Sequential Reuse Distance Analysis
data:image/s3,"s3://crabby-images/24260/24260eb60acd26b099975be517cad681db0a1843" alt=""
PARDA algorithm: np steps where each process processes its chunk of the trace and then sends its local infinities to its left neighbor.
data:image/s3,"s3://crabby-images/c3151/c3151b27d097f203dbf0ea2b996fd1c9d8152330" alt=""
data:image/s3,"s3://crabby-images/4f707/4f7070daa41bef3af0e738262fbe03f48097b075" alt=""
复杂度分析暂略