Steven Euijong Whang

I am a Research Scientist in the Structured Data Group of Google Research. Previously, I was a Ph.D. student and Postdoctoral Researcher at the Computer Science Department of Stanford University. My research interests include Databases, Data Mining, Big Data Analytics, Information Integration, Data Privacy, Crowdsourcing, Knowledge Bases, and Web Data Management.

Google Publications

Previous Publications

  •  

    Disinformation Techniques for Entity Resolution

    Steven Euijong Whang, Hector Garcia-Molina

    Proc. 22nd ACM Int'l Conf. on Information and Knowledge Management (CIKM) (2013)

  •  

    Incremental Entity Resolution on Rules and Data

    Steven Euijong Whang, Hector Garcia-Molina

    The VLDB Journal (2013)

  •  

    Joint Entity Resolution on Multiple Datasets

    Steven Euijong Whang, Hector Garcia-Molina

    The VLDB Journal (2013)

  •  

    Pay-As-You-Go Entity Resolution

    Steven Euijong Whang, David Marmaros, Hector Garcia-Molina

    IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 25 (2013), pp. 1111-1124

  •   

    Question Selection for Crowd Entity Resolution

    Steven Euijong Whang, Peter Lofgren, Hector Garcia-Molina

    Proc. 39th Int'l Conf. on Very Large Data Bases (PVLDB) (2013), pp. 349-360

  •  

    A Model for Quantifying Information Leakage

    Steven Euijong Whang, Hector Garcia-Molina

    Proc. 9th VLDB Workshop on Secure Data Management (SDM) (2012), pp. 25-44

  •  

    Data Analytics: Integration and Privacy

    Steven Euijong Whang

    Ph.D. Thesis (2012)

  •  

    Joint Entity Resolution

    Steven Euijong Whang, Hector Garcia-Molina

    Proc. 28th IEEE International Conference on Data Engineering (ICDE) (2012), pp. 294-305

  •  

    Developments in Generic Entity Resolution

    Steven Euijong Whang, Hector Garcia-Molina

    IEEE Data Engineering Bulletin (2011), pp. 51-59

  •  

    Managing Information Leakage

    Steven Euijong Whang, Hector Garcia-Molina

    Proc. 5th Biennial Conference on Innovative Data Systems Research (CIDR) (2011)

  •  

    Entity Resolution with Evolving Rules

    Steven Euijong Whang, Hector Garcia-Molina

    Proc. 36th Int'l Conf. on Very Large Data Bases (PVLDB) (2010), pp. 1326-1337

  •  

    Evaluating Entity Resolution Results

    David Menestrina, Steven Euijong Whang, Hector Garcia-Molina

    Proc. 36th Int'l Conf. on Very Large Data Bases (PVLDB) (2010), pp. 208-219

  •  

    Entity Resolution with Iterative Blocking

    Steven Euijong Whang, David Menestrina, Georgia Koutrika, Martin Theobald, Hector Garcia-Molina

    Proc. 2009 ACM SIGMOD Int'l Conf. on Management of Data (SIGMOD), pp. 219-232

  •  

    Generic Entity Resolution with Negative Rules

    Steven Euijong Whang, Omar Benjelloun, Hector Garcia-Molina

    The VLDB Journal, vol. 18 (2009), pp. 1261-1277

  •  

    Indexing Boolean Expressions

    Steven Euijong Whang, Chad Brower, Jayavel Shanmugasundaram, Sergei Vassilvitskii, Erik Vee, Ramana Yerneni, Hector Garcia-Molina

    Proc. 35th Int'l Conf. on Very Large Data Bases (PVLDB) (2009), pp. 37-48

  •  

    QuickStart: an Upfront Client-based Design Advisor for Parallel Data Warehouses

    Malu Castellanos, Ivo Jimenez, Neil Coddington, Steven Euijong Whang, Umeshwar Dayal

    In Proc. 25th Int'l Conf. on Data Engineering (ICDE) (2009), pp. 1543-1546

  •  

    Swoosh: A Generic Approach to Entity Resolution

    Omar Benjelloun, Hector Garcia-Molina, David Menestrina, Qi Su, Steven Euijong Whang, Jennifer Widom

    The VLDB Journal, vol. 18 (2009), pp. 255-276

  •  

    A Practitioner's Approach to Normalizing XQuery Expressions

    Ki-hoon Lee, Seo-young Kim, Steven Euijong Whang, Jae-gil Lee

    Proc. 11th Int'l Symposium on Database Systems for Advanced Applications (DASFAA) (2006), pp. 437-453