The class scribe notes are currently restricted to students taking the course. The notes will be made public at some point, probably once the semester is over. Clickable lecture numbers have notes posted.
Lecture | Date | Topic (notes) | Reading | Comments |
1 | 1/20 | Introduction and Policies | | |
2 | 1/22 | Users and Systems | | |
3 | 1/27 | GPP binary compatibility | J. Dehnert et al., “The Transmeta Code Morphing™ Software: using speculation, recovery, and adaptive retranslation to address real-life challenges,” CGO’03 | |
4 | 1/29 | Nanometer design | X. Liang et al., “ReVIVaL: A Variation-Tolerant Architecture Using Voltage Interpolation and Variable Latency,” ISCA 2008 | Comments |
5 | 2/3 | GPP source compatibility | S. Swanson et al., “WaveScalar,” MICRO 2003 | |
6 | 2/5 | GPP source compatibility | S. Swanson et al., “WaveScalar,” MICRO 2003 | |
7 | 2/10 | GPP virtualization | K. Adams and O. Agesen, “A comparison of software and hardware techniques for x86 virtualization”, ASPLOS-XII | comments |
8 | 2/12 | CMP Resource Sharing | O. Mutlu and T. Moscibroda, “Stall-Time Fair Memory Access Scheduling for Chip Multiprocessors,” MICRO 2007 | comments |
9 | 2/17 | Defect Tolerance and Yield | TWO PAPERS:
1 Y. Zorian and D. Gizopoulos, “Design for Yield and Reliability”, IEEE Design and Test, Vol. 21, No. 3, May 2004
2 W. Culbertson et al., “Defect Tolerance on the Teramac Custom Computer”, IEEE Symposium on FPGAs for Custom Computing Machines, 1997 | |
10 | 2/19 | Reliability and Power | D. Ernst et al., “Razor: A Low-Power Pipeline Based on Circuit-Level Timing Speculation”, MICRO-36, 2003 | |
11 | 2/24 | Quiz 1 | | |
12 | 2/26 | Power Reduction I | J. Li et al., “The thrifty barrier: energy-aware synchronization in shared-memory multiprocessors”, HPCA 2004 | comments |
13 | 3/3 | Power Reduction II | C. Isci et al., “An Analysis of Efficient Multi-Core Global Power Management Policies: Maximizing Performance for a Given Power Budget,” MICRO 2006 | |
14 | 3/5 | Simulation | J. Gibson et al., “FLASH vs. (Simulated) FLASH: Closing the Simulation Loop,” ASPLOS 2000 | |
15 | 3/10 | Compiling for Parallelism | A. Lim and M. Lam, “Maximizing Parallelism and Minimizing Synchronization with Affine Transforms”, Symposium on Principles of Programming Languages, January 1997. | Hard Paper! |
16 | 3/12 | IBM BlueGene/L Supercomputer | Nicer figures and description but no results: A. Gara et al., “An Overview of the BlueGene/L System Architecture”, IBM Journal of R&D, Vol. 49 No. 2/3, 2005.
K. Davis et al., “A Performance and Scalability Analysis of the BlueGene/L Architecture”, SC’04. | |
17 | 3/24 | Parallel Languages I (Titanium) | K. Datta et al., “Titanium Performance and Potential: an NPB Experimental Study”, LCPC 2005.
J. Su and K. Yelick, “Automatic Support for Irregular Computations in a High-Level Language”, IPDPS 2005. | |
18 | 3/26 | HW Active Messages | M. Noakes et al., “The J-Machine Multicomputer: An Architectural Evaluation”, ISCA 20, 1993.
PLEASE ALSO READ THIS OVERVIEW WITH PICTURES: W. J. Dally et al., “The J-Machine: A Retrospective”, 1998. | |
19 | 3/31 | Interconnection Networks | Lecture by Prof. Chiou | no reading |
20 | 4/2 | Lightweight Threads | D. E. Culler et al., “Fine-Grain Parallelism with Minimal Hardware Support: A Compiler-Controlled Threaded Abstract Machine”, ASPLOS IV, 1991.
BACKGROUND ON ID: K. R. Traub, “A Compiler for the MIT Tagged-Token Dataflow Architecture”, MIT Master's Thesis, 1986 (pages 13-21 only). | |
21 | 4/7 | Stream Processors | S. Rixner et al., “A Bandwidth-Efficient Architecture for Media Processing”, MICRO-31 1998. | more reading |
22 | 4/9 | Supercomputing with Streams | Mattan Erez, Online Lecture on Merrimac, 2006
READ: M. Erez et al., “Executing Irregular Applications on Stream Architectures”, ICS’07. | |
23 | 4/14 | Auto-Parallelization for CMPs | M. Bridges et al., “Revisiting the Sequential Programming Model for Multi-Core”, MICRO-40, 2007 | |
24 | 4/16 | Thread-Level Speculation | L. Hammond et al., “Data Speculation Support for Chip Multiprocessors”, ASPLOS-VIII, 1998 | comments |
25 | 4/21 | Generalized Streaming | K. Fatahalian et al., “Programming the Memory Hierarchy”, SC’06 | |
26 | 4/23 | Reliability Tradeoffs | M. Erez et al., “Fault Tolerance Techniques for the Merrimac Streaming Supercomputer”, SC’05 | |
27 | 4/28 | Quiz 2 | | |