Distributed Systems and Parallel Computing

95 Publications

  •    

    CPI2: CPU performance isolation for shared compute clusters

    Xiao Zhang, Eric Tune, Robert Hagmann, Rohit Jnagal, Vrigo Gokhale, John Wilkes

    ACM SIGOPS European Conference on Computer Systems (2013)

  •   

    Ensuring Connectivity via Data Plane Mechanisms

    Junda Liu

    10th USENIX Symposium on Networked Systems Design and Implementation (2013)

  •    

    Minimizing Weighted Flowtime on Capacitated Machines

    Kyle Fox, Madhukar Korupolu

    ACM Symposium on Discrete Algorithms (SODA) (2013) (to appear)

  •    

    On the k-atomicity-verification problem

    Wojciech Golab, Jeremy Hurwitz, Xiaozhou Li

    The 33rd International Conference on Distributed Computing Systems, IEEE (2013) (to appear)

  •    

    The Tail at Scale

    Jeffrey Dean, Luiz André Barroso

    Communications of the ACM, vol. 56 (2013), pp. 74-80

  •    

    Verifying Cloud Services: Present and Future

    Sara Bouchenak, Gregory Chockler, Hana Chockler, Gabriela Gheorghe, Nuno Santos, Alexander Shraer

    Operating Systems Review (2013) (to appear)

  •    

    A Guided Tour of Datacenter Networking

    Dennis Abts, Bob Felderman

    Communications of the ACM - ACM Queue, vol. 55, number 6 (2012), pp. 44-51

  •   

    An approach to Distributed Virtual Environment performance modeling: Addressing system complexity and user behavior

    H. Lally Singh, Denis Gracanin

    Proceedings of the 2012 IEEE Virtual Reality, IEEE Computer Society, Washington, DC, USA, pp. 71-72

  •   

    CloudRAMSort: fast and efficient large-scale distributed RAM sort on shared-nothing cluster

    Changkyu Kim, Jongsoo Park, Nadathur Satish, Hongrae Lee, Pradeep Dubey, Jatin Chhugani

    Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, ACM, New York, NY, USA, pp. 841-850

  •   

    DIPLOMA: Consistent and Coherent Shared Memory over Mobile Phones

    Niket Agarwal

    30th IEEE International Conference on Computer Design (2012)

  •    

    F1 - The Fault-Tolerant Distributed RDBMS Supporting Google's Ad Business

    Jeff Shute, Mircea Oancea, Stephan Ellner, Ben Handy, Eric Rollins, Bart Samwel, Radek Vingralek, Chad Whipkey, Xin Chen, Beat Jegerlehner, Kyle Littlefield, Phoenix Tong

    SIGMOD (2012)

  •  

    Finding Connected Components in Map-reduce in Logarithmic Rounds

    Vibhor Rastogi, Ashwin Machanavajjhala, Laukik Chitnis, Anish Das Sarma

    ICDE, IEE (2012) (to appear)

  •   

    JANUS: exploiting parallelism via hindsight

    Omer Tripp, Roman Manevich, John Field, Mooly Sagiv

    Proceedings of the 33rd ACM SIGPLAN conference on Programming Language Design and Implementation, ACM, New York, NY, USA (2012), pp. 145-156

  •    

    Optimistic Scheduling with Geographically Replicated Services in the Cloud Environment (COLOR)

    Wenbo Zhu, C. Murray Woodside

    Cluster, Cloud and Grid Computing (CCGrid), 2012 12th IEEE/ACM International Symposium on, IEEE CONFERENCE PUBLICATIONS, pp. 735-740

  •   

    Orchestrating the deployment of computations in the cloud with conductor

    Alexander Wieder, Pramod Bhatotia, Ansley Post, Rodrigo Rodrigues

    Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation, USENIX Association, Berkeley, CA, USA (2012), pp. 27-27

  •  

    Overlapping clusters for distributed computation

    Reid Andersen, David Gleich, Vahab Mirrokni

    ACM Conference on Web Search and Data Mining (WSDM) (2012)

  •    

    Processing a Trillion Cells per Mouse Click

    Alex Hall, Olaf Bachmann, Robert Buessow, Silviu-Ionut Ganceanu, Marc Nunkesser

    PVLDB, vol. 5 (2012), pp. 1436-1446

  •    

    Projecting Disk Usage Based on Historical Trends in a Cloud Environment

    Murray Stokely, Amaan Mehrabian, Christoph Albrecht, Francois Labelle, Arif Merchant

    ScienceCloud 2012 Proceedings of the 3rd International Workshop on Scientific Cloud Computing, ACM, pp. 63-70

  •    

    Recursion in Scalable Protocols via Distributed Data Flows

    Krzysztof Ostrowski

    Languages for Distributed Algorithms (2012) (to appear)

  •    

    Resource-bounded multicore emulation using Beefarm

    Oriol Arcas, Nehir Sonmez, Gokhan Sayilar, Satnam Singh, Osman S. Unsal, Adrian Cristal, Ibrahim Hur, Mateo Valero

    Microprocessors and Microsystems (2012)

  •   

    Spanner: Google's Globally-Distributed Database

    James C. Corbett, Jeffrey Dean, Michael Epstein, Andrew Fikes, Christopher Frost, JJ Furman, Sanjay Ghemawat, Andrey Gubarev, Christopher Heiser, Peter Hochschild, Wilson Hsieh, Sebastian Kanthak, Eugene Kogan, Hongyi Li, Alexander Lloyd, Sergey Melnik, David Mwaura, David Nagle, Sean Quinlan, Rajesh Rao, Lindsay Rolig, Dale Woodford, Yasushi Saito, Christopher Taylor, Michal Szymaniak, Ruth Wang

    OSDI (2012) (to appear)

  •    

    Trickle: Rate Limiting YouTube Video Streaming

    Monia Ghobadi, Yuchung Cheng, Ankur Jain, Matt Mathis

    Proceedings of the USENIX Annual Technical Conference (2012), pp. 6

  •    

    Uncertainty in Aggregate Estimates from Sampled Distributed Traces

    Nate Coehlo, Arif Merchant, Murray Stokely

    2012 Workshop on Managing Systems Automatically and Dynamically, USENIX (to appear)

  •    

    Upper and Lower Bounds on the Cost of a Map-Reduce Computation

    Foto Afrati, Anish Das Sarma, Semih Salihoglu, Jeffrey Ullman

    Arxiv (2012)

  •    

    Vision Paper: Towards an Understanding of the Limits of Map-Reduce Computation

    Foto Afrati, Anish Das Sarma, Semih Salihoglu, Jeffrey Ullman

    CloudFutures Workshop (2012)

  •  

    A Tight Unconditional Lower Bound on Distributed Random Walk Computation

    Danupon Nanongkai, Atish Das Sarma, Gopal Pandurangan

    ACM Symposium on Principles of Distributed Computing (PODC) (2011)

  •    

    Characterizing Task Usage Shapes in Google Compute Clusters

    Qi Zhang, Joseph Hellerstein, Raouf Boutaba

    Proceedings of the 5th International Workshop on Large Scale Distributed Systems and Middleware (2011)

  •   

    Design and Implementation of FAITH, an Experimental System to Intercept and Manipulate Online Social Informatics

    Ruaylong Lee, Roozbeh Nia, Jason Hsu, Karl N. Levitt, Jeff Rowe, S. Felix Wu, Shaozhi Ye

    International Conference on Advances in Social Networks Analysis and Mining, IEEE (2011), pp. 195-202

  •    

    Diagnosing Latency in Multi-Tier Black-Box Services

    Krzysztof Ostrowski, Gideon Mann, Mark Sandler

    5th Workshop on Large Scale Distributed Systems and Middleware (LADIS 2011) (to appear)

  •    

    Exploiting Service Usage Information for Optimizing Server Resource Management

    Alexander Totok, Vijay Karamcheti

    ACM Transactions on Internet Technology (TOIT), vol. 11 (2011), pp. 1-26

  •   

    FAWN: a fast array of wimpy nodes: technical perspective

    Luiz André Barroso

    Communications of the ACM, vol. 54 (2011), pp. 100-100

  •    

    HTAF: Hybrid Testing Automation Framework to Leverage Local and Global Computing Resources

    Keun Soo Yim, David Hreczany, Ravishankar K. Iyer

    Lecture Notes in Computer Science, vol. 6784 (2011), pp. 479-494

  •    

    Megastore: Providing Scalable, Highly Available Storage for Interactive Services

    Jason Baker, Chris Bond, James C. Corbett, JJ Furman, Andrey Khorlin, James Larson, Jean-Michel Leon, Yawei Li, Alexander Lloyd, Vadim Yushprakh

    Proceedings of the Conference on Innovative Data system Research (CIDR) (2011), pp. 223-234

  •    

    Modeling and Synthesizing Task Placement Constraints in Google Compute Clusters

    Victor Chudnovsky, Rasekh Rifaat, Joseph Hellerstein, Bikash Sharma, Chita Das

    Symposium on Cloud Computing, ACM (2011)

  •  

    Modeling the Parallel Execution of Black-Box Services

    Gideon Mann, Mark Sandler, Darja Krushevskaja, Sudipto Guha, Eyal Even-Dar

    HotCloud, Usenix (2011)

  •   

    PowerNap: An Energy Efficient MAC Layer for Random Routing in Wireless Sensor Networks

    Onur Soysal, Sami Ayyorgun, Murat Demirbas

    IEEE SECON 2011

  •    

    Tenzing A SQL Implementation On The MapReduce Framework

    Biswapesh Chattopadhyay, Liang Lin, Weiran Liu, Sagar Mittal, Prathyusha Aragonda, Vera Lychagina, Younghee Kwon, Michael Wong

    Proceedings of VLDB, VLDB Endowment (2011), pp. 1318-1327

  •    

    The Emerging Optical Data Center

    Amin Vahdat, Hong Liu, Xiaoxue Zhao, Chris Johnson

    OFC 2011, OTuH2

  •    

    Thialfi: A Client Notification Service for Internet-Scale Applications

    Atul Adya, Gregory Cooper, Daniel Myers, Michael Piatek

    Proc. 23rd ACM Symposium on Operating Systems Principles (SOSP) (2011), pp. 129-142

  •    

    Warehouse-scale Computing: entering the teenage decade

    Luiz André Barroso

    Association for Computing Machinery (2011)

  •   

    Analyzing and enhancing the parallel sort operation on multithreaded architectures

    Layali K. Rashid, Wessam Hassanein, Moustafa A. Hammad

    The Journal of Supercomputing, vol. 53 (2010), pp. 293-312

  •  

    Anti-Omega: the weakest failure detector for set agreement

    Piotr Zielinski

    Distributed Computing, vol. 22 (2010), pp. 335-348

  •    

    Availability in Globally Distributed Storage Systems

    Daniel Ford, Francois Labelle, Florentina Popovici, Murray Stokely, Van-Anh Truong, Luiz Barroso, Carrie Grimes, Sean Quinlan

    Proceedings of the 9th USENIX Symposium on Operating Systems Design and Implementation, USENIX (2010)

  •    

    Dapper, a Large-Scale Distributed Systems Tracing Infrastructure

    Benjamin H. Sigelman, Luiz André Barroso, Mike Burrows, Pat Stephenson, Manoj Plakal, Donald Beaver, Saul Jaspan, Chandan Shanbhag

    Google, Inc. (2010)

  •    

    Datacenter-scale Computing

    Luiz André Barroso, Parthasarathy Ranganathan

    IEEE Micro, vol. 30 (2010), pp. 6-7

  •    

    Dremel: Interactive Analysis of Web-Scale Datasets

    Sergey Melnik, Andrey Gubarev, Jing Jing Long, Geoffrey Romer, Shiva Shivakumar, Matt Tolton, Theo Vassilakis

    Proc. of the 36th Int'l Conf on Very Large Data Bases (2010), pp. 330-339

  •  

    FlumeJava: Easy, Efficient Data-Parallel Pipelines

    Craig Chambers, Ashish Raniwala, Frances Perry, Stephen Adams, Robert Henry, Robert Bradshaw, Nathan

    ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI) (2010)

  •    

    Large-scale Incremental Processing Using Distributed Transactions and Notifications

    Daniel Peng, Frank Dabek

    Proceedings of the 9th USENIX Symposium on Operating Systems Design and Implementation, USENIX (2010)

  •    

    Mahout in Action

    Robin Anil, Sean Owen, Ted Dunning, Ellen Friedman

    Manning, Manning Publications Co. Sound View Ct. #3B Greenwich, CT 06830 (2010), pp. 350

  •   

    MapReduce: a flexible data processing tool

    Jeffrey Dean, Sanjay Ghemawat

    Commun. ACM, vol. 53 (2010), pp. 72-77

  •    

    Optimizing Utilization of Resource Pools in Web Application Servers

    Alexander Totok, Vijay Karamcheti

    Concurrency and Computation: Practice and Experience, vol. 22 (2010), pp. 2421-2444

  •   

    Warehouse Scale Computing - A keynote address to SIGMOD'10

    Luiz André Barroso

    Proceedings of the 2010 ACM SIGMOD International Conference on Management of data (2010)

  •    

    A unified format for traces of peer-to-peer systems

    Boxun Zhang, Alexandru Iosup, Pawel Garbacki, Johan Pouwelse

    LSAP '09: Proceedings of the 1st ACM workshop on Large-Scale system and application performance, ACM, New York, NY, USA (2009), pp. 27-34

  •    

    Causeway: a message-oriented distributed debugger

    Terry Stanley, Tyler Close, Mark S. Miller

    HP Labs (2009)

  •   

    Machine Learning-Based Prefetch Optimization for Data Center Applications

    Shih-wei Liao, Tzu-Han Hung, Donald Nguyen, Chinyen Chou, Chiaheng Tu, Hucheng Zhou

    Proceedings of Supercomputing (2009)

  •    

    MapReduce: The programming model and practice

    Jerry Zhao, Jelena Pjesivac-Grbovic

    SIGMETRICS (2009)

  •   

    Parallel algorithms for mining large-scale rich-media data

    Edward Y. Chang, Hongjie Bai, Kaihua Zhu

    MM '09: Proceedings of the seventeen ACM international conference on Multimedia, ACM, New York, NY, USA (2009), pp. 917-918

  •   

    Prefetch optimizations on large-scale applications via parameter value prediction

    Shih-wei Liao, Tzu-Han Hung, Donald Nguyen, Hucheng Zhou, Chinyen Chou, Chiaheng Tu

    ICS (2009), pp. 519-520

  •   

    Pregel: A System for Large-Scale Graph Processing

    Grzegorz Malewicz, Matthew H. Austern, Aart J.C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, Grzegorz Czajkowski

    28th ACM Symposium on Principles of Distributed Computing (2009), pp. 6-6

  •    

    Using a Market Economy to Provision Compute Resources Across Planet-wide Clusters

    Murray Stokely, Jim Winget, Ed Keyes, Carrie Grimes, Benjamin Yolken

    Proceedings for the International Parallel and Distributed Processing Symposium 2009, IEEE, pp. 1-8

  •  

    Anti-Omega: the weakest failure detector for set agreement

    Piotr Zielinski

    27th ACM Symposium on Principles of Distributed Computing (PODC 2008)

  •  

    Enhancing Community Authorization Services

    Kumar Abhishek, Kumar Kapil

    16th Euromicro International Conference on Parallel, Distributed and network-based Processing, IEEE Computer Society (2008) (to appear)

  •  

    Extending IC-Scheduling via the Sweep Algorithm

    Gennaro Cordasco, Grzegorz Malewicz, Arnold L. Rosenberg

    16th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (2008), pp. 366-373

  •   

    MapReduce: Simplified Data Processing on Large Clusters

    Jeffrey Dean, Sanjay Ghemawat

    Communications of the ACM, vol. 51, no. 1 (2008), pp. 107-113

  •    

    Parallel Spectral Clustering

    Yangqiu Song, Wen-Yen Chen, Hongjie Bai, Chih-Jen Lin, Edward Chang

    European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD), Springer (2008), pp. 374-389

  •   

    Physics Aware Programming Paradigm: Approach and Evaluation

    Salim Hariri, Yaser Jararweh, Yeliang Zhang, Talal Moukabary

    Proc. 6th International Workshop on Challenges of Large Applications in Distributed Environments, ACM, Boston (2008), pp. 1-6

  •   

    RaWMS - Random Walk based Lightweight Membership Service for Wireless Ad Hoc Networks

    Ziv Bar-Yossef, Roy Friedman, Gabi Kliot

    ACM Transactions on Computer Systems, vol. 26 (2008), pp. 1-66

  •    

    Age-based Packet Arbitration in Large k-ary n-cubes

    Dennis Abts, Deborah Weisser

    SC (2007)

  •   

    Applying IC-Scheduling Theory to Familiar Classes of Computations

    Gennaro Cordasco, Grzegorz Malewicz, Arnold L. Rosenberg

    Workshop on Large-Scale and Volatile Desktop Grids in conjunction with IPDPS'07 (2007), pp. 1-8

  •   

    Architect's dream or developer's nightmare?

    Gregor Hohpe

    Proc. 2007 inaugural international conference on distributed event-based systems, ACM, Toronto, pp. 188-188

  •   

    Distributed Programming with MapReduce

    Jeffrey Dean, Sanjay Ghemawat

    Beautiful Code, O'Reilly (2007), Chapter 23

  •    

    Engineering Reliability into Web Sites: Google SRE

    Alexander R. Perry

    Proceedings of LinuxWorld (2007)

  •   

    Large Language Models in Machine Translation

    Thorsten Brants, Ashok C. Popat, Peng Xu, Franz J. Och, Jeffrey Dean

    Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp. 858-867

  •   

    Let's Have a Conversation

    Gregor Hohpe

    IEEE Internet Computing, vol. 11, no. 3 (2007), pp. 78-81

  •   

    MRPSO: MapReduce Particle Swarm Optimization

    Andrew W. McNabb, Christopher K. Monson, Kevin D. Seppi

    Proceedings of the Genetic and Evolutionary Computation Conference (GECCO 2007), IEEE

  •   

    Parallel Approximate Matrix Factorization for Kernel Methods

    Kaihua Zhu, Hang Cui, Hongjie Bai, Jian Li, Zhihuan Qiu, Hao Wang, Hui Xu, Edward Y. Chang

    IEEE International Conference on Multimedia and Expo(ICME) (2007)

  •   

    Parallel PSO Using MapReduce

    Andrew W. McNabb, Christopher K. Monson, Kevin D. Seppi

    Proceedings of the IEEE Congress on Evolutionary Computation, IEEE Press (2007), pp. 7-14

  •    

    Parallelizing Support Vector Machines on Distributed Computers

    Edward Y. Chang, Kaihua Zhu, Hao Wang, Hongjie Bai, Jian Li, Zhihuan Qiu, Hang Cui

    Neural Information Processing Systems (NIPS) (2007)

  •    

    Paxos Made Live - An Engineering Perspective (2006 Invited Talk)

    Tushar Deepak Chandra, Robert Griesemer, Joshua Redstone

    Proceedings of the 26th Annual ACM Symposium on Principles of Distributed Computing, ACM press (2007)

  •   

    A Tool for Prioritizing DAGMan Jobs and Its Evaluation

    Grzegorz Malewicz, Ian Foster, Arnold Rosenberg, Michael Wilde

    Proceedings of the IEEE International Symposium on High-Performance Distributed Computing (HPDC06), Paris, France (2006), pp. 156-167

  •  

    An Autonomic Routing Framework for Sensor Networks

    Yu He, Cauligi S. Raghavendra, Steven Berson, Robert Braden

    Cluster Computing, Special Issue on Autonomic Computing (Kluwer Academic Pulishers), vol. 9 (2006), pp. 191-200

  •   

    An Experimental Study of the Skype Peer-to-Peer VoIP System

    Saikat Guha, Neil Daswani, Ravi Jain

    Proceedings of The 5th International Workshop on Peer-to-Peer Systems (IPTPS '06), Santa Barbara, CA (2006)

  •   

    Bigtable: A Distributed Storage System for Structured Data

    Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber

    7th USENIX Symposium on Operating Systems Design and Implementation (OSDI), {USENIX} (2006), pp. 205-218

  •   

    Data Management for Internet-Scale Single-Sign-On

    Sharon E. Perl, Margo Seltzer

    Proceedings of the 3rd Workshop on Real, Large Distributed Systems, Usenix (2006)

  •   

    Experiences with MapReduce, an abstraction for large-scale computation

    Jeffrey Dean

    Proc. 15th International Conference on Parallel Architectures and Compilation Techniques, ACM, Seattle, WA (2006), pp. 1

  •   

    Java Concurrency in Practice

    Brian Goetz, Tim Peierls, Joshua Bloch, Joseph Bowbeer, David Holmes, Doug Lea

    Addison-Wesley, Boston, MA (2006)

  •  

    Minimizing the Stretch when Scheduling Flows of Biological Requests

    Arnaud Legrand, Alan Su, Frédéric Vivien

    Proceedings of the 18th ACM Symposium on Parallelism in Algorithms and Architectures (2006)

  •   

    On Scheduling Expansive and Reductive Dags for Internet-Based Computing

    Gennaro Cordasco, Grzegorz Malewicz, Arnold L. Rosenberg

    26th IEEE International Conference on Distributed Computing Systems (2006), pp. 29

  •   

    Simple Efficient Load-Balancing Algorithms for Peer-to-Peer Systems

    David R. Karger, Matthias Ruhl

    Theory of Computing Systems, vol. 39, no. 6 (2006), pp. 787-804

  •   

    The Chubby lock service for loosely-coupled distributed systems

    Mike Burrows

    7th USENIX Symposium on Operating Systems Design and Implementation (OSDI), {USENIX} (2006)

  •   

    Decentralized algorithms using both local and random probes for P2P load balancing

    Krishnaram Kenthapadi, Gurmeet Singh Manku

    SPAA 2005 (17th ACM Symposium on Parallelism in Algorithms an Architectures), pp. 135-144

  •    

    Interpreting the Data: Parallel Analysis with Sawzall

    Rob Pike, Sean Dorward, Robert Griesemer, Sean Quinlan

    Scientific Programming Journal, vol. 13 (2005), pp. 277-298

  •   

    Papillon: Greedy Routing in Rings

    Ittai Abraham, Dahlia Malkhi, Gurmeet Singh Manku

    DISC (2005), pp. 514-515

  •   

    MapReduce: Simplified Data Processing on Large Clusters

    Jeffrey Dean, Sanjay Ghemawat

    OSDI'04: Sixth Symposium on Operating System Design and Implementation, San Francisco, CA (2004), pp. 137-150

  •    

    Web Search for a Planet: The Google Cluster Architecture

    Luiz Andre Barroso, Jeffrey Dean, Urs Hölzle

    IEEE Micro, vol. 23 (2003), pp. 22-28