








Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
History, Challenges, and Opportunities. Lessons of last 50 years of Computer Architecture. 1. Software advances can inspire architecture.
Typology: Schemes and Mind Maps
1 / 14
This page cannot be seen from the preview
Don't miss anything!
Full Turing Lecture: https://www.acm.org/hennessy-patterson-turing-lecture (^) 1
2
▪ Instruction set architecture (ISA) ▪ I/O system and Secondary Storage: magnetic tapes, drums and disks ▪ Assemblers, compilers, libraries,... ▪ Market niche: business, scientific, real time, ...
▪ Processor designs split between datapath , where numbers are stored and arithmetic operations computed, and control , which sequences operations on datapath ▪ Biggest challenge for computer designers was getting control correct
▪ Maurice Wilkes invented the idea of microprogramming to design the control unit of a processor*
▪ Logic expensive vs. ROM or RAM ▪ ROM cheaper and faster than RAM ▪ Control design now programming
Condition?
Control
Main Memory
Address Data
Control Lines
Datapath
PC Inst. Reg.Registers ALU
Instruction
Busy?
4
Microprogramming in IBM 360
Model M30 M40 M50 M Datapath width 8 bits 16 bits 32 bits 64 bits Microcode size 4k x 50 4k x 52 2.75k x 85 2.75k x 87 Clock cycle time (ROM) 750 ns 625 ns 500 ns 200 ns Main memory cycle time 1500 ns 2500 ns 2000 ns 750 ns Price (1964 $) $192,000 $216,000 $460,000 $1,080, Price (2018 $) $1,560,000 $1,760,000 $3,720,000 $8,720,
Fred Brooks, Jr. 5
IC Technology, Microcode, and CISC
6
Microprocessor Evolution
▪ Rapid progress in 1970s, fueled by advances in MOS technology, imitated minicomputers and mainframe ISAs ▪ “Microprocessor Wars”: compete by adding instructions (easy for microcode), justified given assembly language programming ▪ Intel iAPX 432: Most ambitious 1970s micro, started in 1975 ▪ 32-bit capability-based, object-oriented architecture, custom OS written in Ada ▪ Severe performance, complexity (multiple chips), and usability problems; announced 1981 ▪ Intel 8086 (1978, 8MHz, 29,000 transistors) ▪ “Stopgap” 16-bit processor, 52 weeks to new chip ▪ ISA architected in 3 weeks (10 person weeks) assembly-compatible with 8 bit 8080 ▪ IBM PC 1981 picks Intel 8088 for 8-bit bus (and Motorola 68000 was late)
7
▪ Estimated PC sales: 250,
▪ Actual PC sales: 100,000,000 ⇒ 8086 “overnight” success
▪ Binary compatibility of PC software ⇒ bright future for 8086
Analyzing Microcoded Machines 1980s
▪ HW/SW interface rises from assembly to HLL programming ▪ Compilers now source of measurements ▪ John Cocke group at IBM ▪ Worked on a simple pipelined processor, 801 minicomputer (ECL server), and advanced compilers inside IBM ▪ Ported their compiler to IBM 370, only used simple register-register and load/store instructions (similar to 801) ▪ Up to 3X faster than existing compilers that used full 370 ISA! ▪ Emer and Clark at DEC in early 1980s* ▪ Found VAX 11/780 average clock cycles per instruction (CPI) = 10! ▪ Found 20% of VAX ISA ⇒ 60% of microcode, but only 0.2% of execution time!
8
John Cocke
Moore’s Law Slowdown in Intel Processors
13
Moore, Gordon E. "No exponential is forever: but ‘Forever’ can be delayed!" Solid-State Circuits Conference, 2003.
Technology & Power: Dennard Scaling
Power consumption based on models in Esmaeilzadeh [2011]. 14
Energy scaling for fixed task is better, since more and faster transistors
Power consumption based on models in “Dark Silicon and the End of Multicore Scaling,” Hadi Esmaelizadeh, ISCA, 2011
End of Growth of Single Program Speed?
15
End of the Line? 2X / 20 yrs (3%/yr)
RISC 2X / 1. yrs (52%/yr)
CISC 2X / 3.5 yrs (22%/yr)
End of Dennard Scaling ⇒ Multicore 2X / 3. yrs (23%/yr)
Am- dahl’s Law ⇒ 2X / 6 yrs ( 12%/yr )
Based on SPECintCPU. Source: John Hennessy and David Patterson, Computer Architecture: A Quantitative Approach, 6/e. 2018
Current Security Challenge
Looks Bad!
What Opportunities Left? (Part I)
▪ SW-centric
▪ HW-centric
▪ Combination:
18
What’s the Opportunity?
19
from: “There’s Plenty of Room at the Top,” Leiserson, et. al., Science , to appear.
50X
7X
20X
9X
63,000X
What Opportunities Left?
▪ Only performance path left is Domain Specific
Architectures (DSAs)
▪ Achieve higher efficiency by tailoring the
architecture to characteristics of the domain
▪ Not one application, but a domain of
applications
▪ Different from strict ASIC since still runs
software 20
Perf/Watt TPU vs CPU & GPU
25
See MLPerf.org (“SPEC for ML”) ● Benchmark suite being developed by 23 companies and 7 universities ● 1 st^ Results Public 12/12/
Moore’s Law performance doubles every 18 months
From “AI and Compute.” Dario Amodei and Danny Hernandez, May 16, 2018
Since 2012,
AI training state of the art compute
demand 10X per year!
(Moore’s Law “only” 10X in 5 years)
From “AI and Compute.” Dario Amodei and Danny Hernandez, May 16, 2018
Training: TPUv2 (5/2017), TPUv3 (5/2018)
Ying, C., Kumar, S., Chen, D., Wang, T. and Cheng, Y., December 2018. Image Classification at Supercomputer Scale. arXiv preprint arXiv:1811..
29
30
31
300 mm (12 inch) wafer
215 x 215 mm (8.5 x 8.5 inch) “chip”
32
What Opportunities Left? (Part II)
● Software advances can inspire
architecture innovations
● Why open source compilers and
operating systems but not ISAs?
37
NVDLA: An Open DSA and Implementation
Security and Open Architecture
What Opportunities Left? (Part III)
39
Agile Hardware Development Methodology
C++
FPGA
ASIC Flow
Tape-in
Tape-out
Big Chip Tape-out
40
Lee, Y., Waterman, A., Cook, H., Zimmer, B., Keller, B., Puggelli, A., ... & Chiu, P. F. (2016). “An agile approach to building RISC-V microprocessors.” IEEE Micro , 36 (2), 8-20.
41
42
Quantum Computing to the Rescue?
Quantum Computing to the Rescue?
*Mark Horowitz (Chair, NAE, Stanford, EE), Alán Aspuru-Guzik (U. Toronto, Chemistry), David Awschalom (NAE & NAS, U. Chicago, Physics), Robert Blakley (Citigroup), Dan Boneh (NAE, Stanford, CS), Susan Coppersmith (NAS, U. Wisconsin, Physics), Jungsang Kim (Duke, Physics & CS), John Martinis (UCSB & Google), Margaret Martonosi (Princeton, CS), Michele Mosca (U. Waterloo, Math & Physics), William Oliver (MIT, Physics), Krysta Svore (Microsoft), Umesh Vazirani (NAE, Berkeley, CS), National Academies, Washington D.C. https://www.nap.edu/catalog/25196/quantum-computing-progress-and-prospects
4949
Specifications
5050
Specifications
Designs
5151
Specifications
Designs
Designs
5252
Specifications
Designs
Based on Licensed or Closed Designs
Based on Closed Designs
5353
Specifications
Designs
Based on Free & Open, Licensed, Closed Designs Based on Licensed or Closed Designs
Based on Closed Designs
OURS Pygmy microprocessor
multicore architecture
OURS (睿思芯科) energy-efficient RISC-V AI Chip for IoT