Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing
Venue
VLDB (2014)
Publication Year
2014
Authors
Ashish Gupta, Fan Yang, Jason Govig, Adam Kirsch, Kelvin Chan, Kevin Lai, Shuo Wu, Sandeep Dhoot, Abhilash Kumar, Ankur Agiwal, Sanjay Bhansali, Mingsheng Hong, Jamie Cameron, Masood Siddiqi, David Jones, Jeff Shute, Andrey Gubarev, Shivakumar Venkataraman, Divyakant Agrawal
BibTeX
Abstract
                Mesa is a highly scalable analytic data warehousing system that stores critical
                measurement data related to Google's Internet advertising business. Mesa is
                designed to satisfy a complex and challenging set of user and systems requirements,
                including near real-time data ingestion and queryability, as well as high
                availability, reliability, fault tolerance, and scalability for large data and
                query volumes. Specifically, Mesa handles petabytes of data, processes millions of
                row updates per second, and serves billions of queries that fetch trillions of rows
                per day. Mesa is geo-replicated across multiple datacenters and provides consistent
                and repeatable query answers at low latency, even when an entire datacenter fails.
                This paper presents the Mesa system and reports the performance and scale that it
                achieves.
              
             
 