|
Syllabus
|
|
Week |
Date (2010) |
Topic |
Tasks |
|
1a |
27-Sep |
Administration |
|
|
1b |
29-Sep |
Introduction & Background |
|
|
2a |
4-Oct |
Understanding Concurrency |
|
|
2b |
6-Oct |
Convergence of Parallel Architectures |
|
|
3a |
11-Oct |
Convergence of Parallel Architectures |
|
|
3b |
13-Oct |
Parallel Programming |
|
|
4a |
18-Oct |
Parallel Programming |
|
|
4b |
20-Oct |
Workload-driven Performance Evaluation |
|
|
5a |
21-Oct (Fri) |
Parallel Programming, Workload-driven Performance Evaluation |
|
|
5b |
25-Oct |
Time, Ordering, and Memory Consistency |
|
|
6a |
1-Nov |
Time, Ordering, and Memory Consistency |
|
|
6b |
8-Nov |
Memory consistency models |
|
|
7a |
10-Nov |
Memory consistency models |
|
|
7b |
11-Nov
(Fri) |
Snoop-Based Shared Memory Multiprocessors |
|
|
8a |
15-Nov |
Distributed Shared Memory Multiprocessors |
|
|
8b |
18-Nov
(fri) |
Distributed Shared Memory Multiprocessors |
|
|
9a |
22-Nov |
Distributed Shared Memory Multiprocessors |
|
|
9b |
24-Nov
|
Software Distributed Shared Memory |
|
|
10a |
29-Nov |
Software Distributed Shared Memory |
|
|
10b |
1-Dec |
Message Passing Multiprocessors |
|
|
11a |
6-Dec |
Distributed Shared Memory Multiprocessors |
|
|
11b |
8-Dec |
Message Passing Multiprocessors |
|
|
12a |
13-Dec |
Message Passing Multiprocessors |
|
|
12b |
15-Dec |
Message Passing Multiprocessors |
|
|
13a |
20-Dec |
Cell BE |
|
|
13b
|
22-Dec
|
Anton
|
|
|
14a |
22-Dec |
Project Presentations |
|
Further References
Anton, a special-purpose machine for molecular dynamics simulation. D.E. Shaw et al. Proceedings of the 34th annual international symposium on Computer architecture. June 2007. San Diego, California, USA.
A tightly-coupled processor-network interface. Dana S. Henry and Christopher F. Joerg. Proceedings of the 5th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS). October, 12-15, 1992. Boston, Massachusetts, USA.
DDM--A Cache-Only Memory Architecture. E. Hagersten, A. Landin, and S. Haridi. IEEE Computer, 25, 9 (September 1992), 44-54.
Synchronization and Communication in the T3E Multiprocessor. S. Scott. Proceedings of the Seventh International Conference on Architectural Support for Programming Languages and Operating Systems. October 1996, pp 26-36.
Effects of Communication Latency, Overhead, and Bandwidth in a Cluster Architecture. ISCA24, Denver, Co, June, 1997. Richard P. Martin, Amin M. Vahdat, David E. Culler, Thomas E. Anderson.
Autonet: a High-speed, Self-configuring Local Area Network with Point-to-point Links. Michael D. Schroeder and Andrew D. Birrell and Michael Burrows and Hal Murray and Roger M. Needham and Thomas L. Rodeheffer and Edwin H. Satterthwaite and Charles P. Thacker. Technical Report, Digital Equipment Corporation, Systems Research Centre, Number 59, p. 42 pages, 30 April 1990.
Deadlock-Free Message Routing in Multiprocessor Interconnection Networks. William J. Dally Charles L. Seitz. IEEE trans. on comp. Vol. C-36. No. 5, pp. 547-553, May 1987.
User-Space Communication: A Quantitative Study. Soichiro Araki, Angelos Bilas, Cezary Dubnicki, Jan Edler, Koichi Konishi and James Philbin. Supercomputing, November 1998.
K. LI and P. HUDAK, "Memory Coherence in Shared Virtual Memory Systems", ACM Trans. on Computer Systems, 7, 4 (November 1989), 321-359.
System Area Network Mapping. Brent Chun, Alan Mainwaring, Saul Schleimer, Daniel Wilkerson. SPAA'97 , Newport, Rhode Island , June 1997.
TreadMarks: Distributed Shared Memory on Standard Workstations and Operating Systems. Pete Keleher, Alan L. Cox, Sandhya Dwarkadas and Willy Zwaenepoel. In The 1994 Winter USENIX Conference.
D. LENOSKI, J. LAUDON, K. GHARACHORLOO, W. WEBER, A. GUPTA, J. HENNESSY, M. HOROWITZ and M. LAM, "The Stanford DASH Multiprocessor", IEEE Computer, 25, 3 (March 1992), 63-79.
Algorithms for Scalable Synchronization on Shared-Memory Multiprocessors. J. M. Mellor-Crummey and M. L. Scott. ACM Trans. on Computer Systems, February 1991.
L. M. CENSIER and P. FEAUTRIER, "A New Solution to Coherence Problems in Multicache Systems", IEEE Transactions on Computers, C-27, 12 (December 1978), 1112-1118.
How to Make a Multiprocessor That Correctly Executes Multiprocess Programs. Leslie Lamport. IEEE Trans. on Computers, Vol. C-28, Number 9, pp. 690-691, September 1979.
J. E. THORNTON "Parallel Operation in the Control Data 6600," Fall Joint Computers Conference, vol. 26, pp. 33-40, 1961.
G. MOORE, "Cramming More Components onto Integrated Circuits", Electronics, p114-117, April 1965.
G. M. AMDAHL, "Validity of the Single-Processor Approach to Achieving Large Scale Computing Capabilities", AFIPS Conference Proceedings, (April 1967), 483-485.