Hardware and Architecture
The machinery that powers many of our interactions today — Web search, social networking, email, online video, shopping, game playing — is made of the smallest and the most massive computers. The smallest part is your smartphone, a machine that is over ten times faster than the iconic Cray-1 supercomputer. The capabilities of these remarkable mobile devices are amplified by orders of magnitude through their connection to Web services running on building-sized computing systems that we call Warehouse-scale computers (WSCs).
Google’s engineers and researchers have been pioneering both WSC and mobile hardware technology with the goal of providing Google programmers and our Cloud developers with a unique computing infrastructure in terms of scale, cost-efficiency, energy-efficiency, resiliency and speed. The tight collaboration among software, hardware, mechanical, electrical, environmental, thermal and civil engineers result in some of the most impressive and efficient computers in the world.
57 Publications
-
Eric Brewer, Lawrence Ying, Lawrence Greenfield, Robert Cypher, Theodore T'so
Google (2016), pp. 1-16
-
The 2015 Top Picks In Computer Architecture (Guest Editor's Introduction)
Daniel J. Sorin, Milo M. K. Martin
IEEE Micro, vol. May/June (2016)
-
Web Browser Workload Characterization for Power Management on HMP Platforms
Benedikt Meurer, Daniel Clifford, Hannes Payer, Nadja Peters, Samarjit Chakraborty, Sangyoung Park
Proceedings of the Eighth IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS '12) (2016)
-
Can Traditional Programming Bridge the Ninja Performance Gap for Parallel Computing Applications?
Nadathur Satish, Changkyu Kim, Jatin Chhugani, Hideki Saito, Rakesh Krishnaiyer, Mikhail Smelyanskiy, Milind Girkar, Pradeep Dubey
Communications of the ACM, vol. 58 (2015), pp. 77-86
-
Full-Chip Simulations, Keys to Success
SNUG Silicon Valley 2015 Proceedings, Silicon Valley
-
Heracles: Improving Resource Efficiency at Scale
David Lo, Liqun Cheng, Rama Govindaraju, Parthasarathy Ranganathan, Christos Kozyrakis
Proceedings of the 42th Annual International Symposium on Computer Architecture (2015)
-
Optimal trajectory control for parallel single phase H-bridge inverters
David K. Fork, Seungil You, Ross Koningstein
Decision and Control (CDC), 2015 IEEE 54th Annual Conference on, IEEE, pp. 1983 - 1990
-
Author Retrospective for A NUCA Substrate for Flexible CMP Cache Sharing
Jaehyuk Huh, Changkyu Kim, Hazim Shafi, Lixin Zhang, Doug Burger, Stephen W. Keckler
ICS 25th Anniversary Volume, ACM SIGARCH (2014)
-
Author Retrospective for Cooperative Cache Partitioning for Chip Multiprocessors
Jichuan Chang, Gurindar S Sohi
ICS 25th Anniversary Volume, 2014, ACM SIGARCH
-
Dynamically Detecting and Tolerating IF-Condition Data Races
Shanxiang Qi, Abdullah A. Muzahid, Wonsun Ahn, Josep Torrellas
International Symposium on High Performance Computer Architecture (HPCA), IEEE (2014)
-
HaPPy: Hyperthread-aware Power Profiling Dynamically
Yan Zhai, Xiao Zhang, Stephane Eranian, Lingjia Tang, Jason Mars
USENIX Annual Technical Conference 2014
-
Low-Overhead Network-on-Chip Support for Location-Oblivious Task Placement
Gwangsun Kim, Lee, M.M.-J., John Kim, Dennis Abts, Michael R. Marty
IEEE Transactions on Computers, vol. Volume 63, Issue 6 (2014), pp. 1487 - 1500
-
Near-Data Processing: Insights from a MICRO-46 Workshop
Rajeev Balasubramonian, Jichuan Chang, Troy Manning, Jaime H. Moreno, Richard Murphy, Ravi Nair, Steven Swanson
IEEE Micro (Special Issue on Big Data), vol. 34 (2014), pp. 36-43
-
Profiling a warehouse-scale computer
Svilen Kanev, Juan Darago, Kim Hazelwood, Parthasarathy Ranganathan, Tipp Moseley, Gu-Yeon Wei, David Brooks
ISCA '15 Proceedings of the 42nd Annual International Symposium on Computer Architecture, ACM (2014), pp. 158-169
-
Security Vulnerability in Processor-Interconnect Router Design
WonJun Song, John Kim, Jae W. Lee, Dennis Abts
Proceedings of the 2014 ACM SIGSAC Conference on Computer and Communications Security, ACM, New York, NY, pp. 358-368
-
IEEE Pervasive Computing, vol. 13-03 (2014), pp. 76-79
-
Towards Energy Proportionality for Large-Scale Latency-Critical Workloads
David Lo, Liqun Cheng, Rama Govindaraju, Luiz André Barroso, Christos Kozyrakis
Proceedings of the 41th Annual International Symposium on Computer Architecture, ACM (2014)
-
A Green Display for the Internet
Ken Foo, Bill Hamburgen, Jim Zhuang
Renewable Energy and the Environment, Optical Society of America (2013)
-
Concurrency-aware compiler optimizations for hardware description languages
Harikumar Somakumar
ACM Transactions on Design Automation of Electronic Systems (TODAES), vol. Volume 18, Issue 1 (2013), 10:1-10:16
-
Optimizing Google's Warehouse Scale Computers: The NUMA Experience
Lingjia Tang, Jason Mars, Xiao Zhang, Robert Hagmann, Robert Hundt, Eric Tune
The 19th IEEE International Symposium on High Performance Computer Architecture (2013)
-
Luiz André Barroso, Jimmy Clidaras, Urs Hölzle
Morgan & Claypool Publishers (2013)
-
Whare-Map: Heterogeneity in “Homogeneous” Warehouse-Scale Computers
Jason Mars, Lingjia Tang, Robert Hundt
Proceedings of the 2013 ACM/IEEE International Symposium on Computer Architecture (ISCA), IEEE (to appear)
-
Honggang Sheng, Fred Wang, C.W. Tipton
IEEE Transactions on Power Electronics, vol. 27 (2012), pp. 685-697
-
ADEL: An automatic detector of energy leaks for smartphone applications
Lide Zhang, M. S. Gordon, Robert P. Dick, Z. Morley Mao, Peter Dinda, Lei Yang
Proceeding of International Conference on Hardware/Software Codesign and System Synthesis, (2012) (to appear)
-
Accelerator Compiler for the VENICE Vector Processor
Zhiduo Liu, Aaron Severance, Guy G.F. Lemieux, Satnam Singh
FPGA, ACM (2012)
-
Managing Distributed UPS Energy for Effective Power Capping in Data Centers
Vasileios Kontorinis, Liuyi Eric Zhang, Baris Aksanli, Jack Sampson, Houman Homayoun, Eddie Pettis, Dean M. Tullsen, Tajana Simunic Rosing
International Symposium on Computer Architecture (2012), pp. 488-499
-
Resource-bounded multicore emulation using Beefarm
Oriol Arcas, Nehir Sonmez, Gokhan Sayilar, Satnam Singh, Osman S. Unsal, Adrian Cristal, Ibrahim Hur, Mateo Valero
Microprocessors and Microsystems (2012)
-
Runtime adaptation: a case for reactive code alignment
Michelle McDaniel, Kim Hazelwood
Proceedings of the 2nd International Workshop on Adaptive Self-Tuning Computing Systems for the Exaflop Era, ACM, New York, NY, USA (2012), pp. 1-11
-
Bubble-Up: Increasing Utilization In Modern Warehouse Scale Computers Via Sensible Co-Locations
Jason Mars, Linjia Tang, Robert Hundt, Kevin Skadron, Mary Lou Souffa
Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture, 2011, IEEE, New York, NY, USA
-
Dynamic cache contention detection in multi-threaded applications
Qin Zhao, David Koh, Syed Raza, Derek Bruening, Weng-Fai Wong
VEE 2011; Proceedings of the 7th ACM SIGPLAN/SIGOPS International conference on virtual execution environments, ACM, New York, NY, pp. 27-37
-
FAWN: a fast array of wimpy nodes: technical perspective
Communications of the ACM, vol. 54 (2011), pp. 100-100
-
Heterogeneity in “Homogeneous” Warehouse-Scale Computers: A Performance Opportunity
Jason Mars, Lingjia Tang, Robert Hundt
IEEE Computer Architecture Letters (CAL), vol. Vol. 10 No. 2 (2011), pp. 29-32
-
High Performance Datacenter Networks: Architectures, Algorithms, and Opportunities
Dennis Abts, John Kim
Morgan & Claypool, San Rafael, California (2011)
-
Power Management of Online Data-Intensive Services
David Meisner, Christopher M. Sadler, Luiz André Barroso, Wolf-Dietrich Weber, Thomas F. Wenisch
Proceedings of the 38th ACM International Symposium on Computer Architecture (2011)
-
Simultaneous Technology Mapping and Placement for Delay Minimization
Yifang Liu, Rupesh S. Shelar, Jiang Hu
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, vol. 30 (2011), pp. 416-426
-
The Cray XT4 and Seastar 3-D Torus Interconnect
Encyclopedia of Parallel Computing, Springer (2011)
-
The Future of Computing Performance: Game Over or Next Level?
Samuel H. Fuller, Luiz André Barroso, Robert P. Colwell, William J. Dally, Dan Dobberpuhl, Pradeep Dubey, Mark D. Hill, Mark Horowitz, David Kirk, Monica Lam, Kathryn S. McKinley, Charles Moore, Katherine Yelick
The National Academies Press (2011), pp. 200
-
The Impact of Memory Subsystem Resource Sharing on Datacenter Applications
Lingjia Tang, Jason Mars, Neil Vachharajani, Robert Hundt, Mary-Lou Soffa
ISCA, ACM (2011)
-
Lide Zhang, Birjodh Tiwana, Zhiyun Qian, Zhaoguang Wang, Robert P. Dick, Z. Morley Mao, Lei Yang
Proceeding of Internation Conference on Hardware/Software Codesign and System Synthesis (2010), pp. 105-114
-
Brawny cores still beat wimpy cores, most of the time
IEEE MICRO (2010)
-
Efficient Topologies for Large-Scale Cluster Networks
John Kim, William J. Dally, Dennis Abts
2010 Conference on OFC/NFOEC, IEEE, pp. 1-3
-
Energy Proportional Datacenter Networks
Dennis Abts, Mike Marty, Philip Wells, Peter Klausler, Hong Liu
Proceedings of the International Symposium on Computer Architecture, ACM (2010), pp. 338-347
-
Probabilistic Distance-based Arbitration: Providing Equality of Service for Many-core CMPs
Michael M. Lee, John Kim, Dennis Abts, Michael Marty, Jae W. Lee
MICRO43: Proceedings of the 43rd Annual International Symposium on Microarchitecture, IEEE/ACM (2010)
-
Scalable Thread Scheduling and Global Power Management for Heterogeneous Many-Core Architectures
Jonathan A. Winter, David H. Albonesi, Christine A. Shoemaker
Proceedings of the Nineteenth International Conference on Parallel Architectures and Compilation Techniques (PACT), Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY, 10121-0701 (2010), pp. 29-39
-
Warehouse Scale Computing - A keynote address to SIGMOD'10
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data (2010)
-
Achieving Predictable Performance through Better Memory Controller Placement in Many-Core CMPs
Dennis Abts, Natalie Engright Jerger, John Kim, Dan Gibson, Mikko Lipasti
Proceedings of the International Symposium on Computer Architecture, ACM (2009)
-
Dynamic Heterogeneity and the Need for Multicore Virtualization
Philip M Wells, Koushik Chakraborty, Gurindar S Sohi
ACM SIGOPS Operating Systems Review, vol. 43 (2009), pp. 5-14
-
The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines
Luiz André Barroso, Urs Hölzle
Morgan & Claypool Publishers (2009)
-
Amdahl's Law in the Multicore Era
Mark D. Hill, Michael R. Marty
IEEE Computer, vol. 41 (2008), pp. 33-38
-
Incrementally Parallelizing Database Transactions with Thread-Level Speculation
Christopher B. Colohan, Anastassia Ailamaki, J. Gregory Steffan, Todd C. Mowry
ACM Transactions on Computer Systems (TOCS), vol. 26 (2008)
-
Technology-Driven, Highly-Scalable Dragonfly Topology
John Kim, William J. Dally, Steve Scott, Dennis Abts
Proceedings of the 35th International Symposium on Computer Architecture, IEEE Computer Society, Washington, DC USA (2008), pp. 77-88
-
All Watts Considered
Keynote address, International Symposium on Low Power Electronics and Design, ACM, Portland, OR (2007)
-
Power Provisioning for a Warehouse-sized Computer
Xiaobo Fan, Wolf-Dietrich Weber, Luiz André Barroso
The 34th ACM International Symposium on Computer Architecture (2007)
-
The Case for Energy-Proportional Computing
Luiz André Barroso, Urs Hölzle
IEEE Computer, vol. 40 (2007)
-
High-efficiency power supplies for home computers and servers
Urs Hölzle, Bill Weihl
Google (2006), pp. 1-3
-
The Price of Performance: An Economic Case for Chip Multiprocessing
ACM Queue, vol. 3 (2005), pp. 48-53
-
A Model for Battery Lifetime Analysis for Organizing Applications on a Pocket Computer
Daler N. Rakhmatov, Sarma B. K. Vrudhula, Deborah A. Wallach
IEEE Transactions on Very Large Scale Integration Systems, vol. 11 (2003), pp. 1019-1030
