Friday, October 22, 2010

Skype call

  1. resovled the segmetation faults in the p-cblk()
  2. the col_grp() function, use the Intel vector instructions
  3. use ASM directive to insert the assebly code in the c source code
  4. operating 64 bits (Quard word) instead of 32 bits at a time
  5. compilation: #gcc -O2 -o testcopy testcopy.c vectorcopy.s
  6. loop unrolling, instead of MOV, ADD(4), we do 4 MOV, and ADD(16)
  7. gcc could do loop unroolling with certain argument when compile, look up that
  8. Tasks
  9. - split_col_grp() mainly data copying, try to enhance
  10. - try gcc unrolling on jasper, and profile/record the execution time
  11. - try vector instructions (ASM directive)
  12. - publis a paper by the end of year

No comments:

Post a Comment