cpu-cache

Strange behaviour during multiprocess calls to numpy conjugate

Strange behaviour during multiprocess calls to numpy conjugate Question: The attached script evaluates the numpy.conjugate routine for varying numbers of parallel processes on differently sized matrices and records the corresponding run times. The matrix shape only varies in it’s first dimension (from 1,64,64 to 256,64,64). Conjugation calls are always made on 1,64,64 sub matrices to …

Total answers: 2

Why is linear read-shuffled write not faster than shuffled read-linear write?

Why is linear read-shuffled write not faster than shuffled read-linear write? Question: I’m currently trying to get a better understanding of memory/cache related performance issues. I read somewhere that memory locality is more important for reading than for writing, because in the former case the CPU has to actually wait for the data whereas in …

Total answers: 5