Quantization
- cache simulation to see the behavior, miss rate high??
- modifying data frequently, impact the performance.
- what has been called before during and after this routine, cache data, (later...)
- try to simulate in Cachegrind... to check the miss for each lines... and each source code.
Memory access pattern
- similar to the cats image. (pcb_large), memory behaves very similar, show relative low cache miss rate
- sim_num_cycles, print the number of warp around.
Work
- cachegrind to analyze the fusion... check the cache miss before and after fusion (the line inspecting feature in cachegrind)
- loop bound graph in cblk for other color images
- 3-D plot in mpfast, separate reads and writes count, and showing the intensity
- vectorization for quantization? on 32 bit quad core processor
- visual cache behavior
No comments:
Post a Comment