Unlimited Speculative Execution

A Macrotask-level Unlimited Speculative Execution on Multiprocessors (18/19)

The following description is based on the paper presented at 1995 ACM International Conference on Supercomputing head in Spain on July 3rd to 7th,1995.

If you want to have the copy of the paper, please let me know.

The left hand side graph shows the speedup against the TASK_SIZE. The distributed control always archives good performance in comparison with the centralized control. Moreover, the proposed scheme can perform nearly the maximum of the theoretical speedup when the TASK_SIZE is enough to be large.
Since the average execution time of macrotasks at second hierarchy level from the bottom is between 150us to 300us, the software implementation of the proposed scheme is able to be adopted at the second hierarchy level.
The anothergraph shows the speedup against the number of running MT. The proposed scheme always achieve good performance in comparison with the centralized control. The speedup ratio increases when the number of running MT is less than 40, because the speculation depth increases in proportion to the number of iterations. After 40, the speculation depth is balanced with the overhead of the macrotask control. Then it holds same speedup.
As for the centralized control, the speedup decreases because the overhead to control the MT increases in proportion to the number of macrotasks.
These results show that the proposed scheme has small macrotask control overhead in comparison with the centralized control.