Intel Advisor
Overview
CPU / Memory Roofline Modeling
Guess: In comment, the data is from L1 traffic
- Diagram show the info about L1,L2,L3 DRAM bandwidth and theoretical compute bound
- Roofline Arithmetic Intensity: $$ AI = \frac{Performance \times SelfTime}{SelfMemoryTraffic} $$
- Each node corresponds one to one with the Topdown stack function call int the bottom of diagram.
- deep color and big size means more
Self Time
occupation
参考文献
[^1]: oneapi user guide