Steven Euijong Whang

I am a Research Scientist at Google Research currently working on data management infrastructure for large-scale machine learning systems.

Google Publications

Previous Publications

  •  

    Disinformation Techniques for Entity Resolution

    Steven Euijong Whang, Hector Garcia-Molina

    Proc. 22nd ACM Int'l Conf. on Information and Knowledge Management (CIKM) (2013)

  •  

    Incremental Entity Resolution on Rules and Data

    Steven Euijong Whang, Hector Garcia-Molina

    The VLDB Journal (2013)

  •  

    Joint Entity Resolution on Multiple Datasets

    Steven Euijong Whang, Hector Garcia-Molina

    The VLDB Journal (2013)

  •  

    Pay-As-You-Go Entity Resolution

    Steven Euijong Whang, David Marmaros, Hector Garcia-Molina

    IEEE Transactions on Knowledge and Data Engineering (TKDE), vol. 25 (2013), pp. 1111-1124

  •   

    Question Selection for Crowd Entity Resolution

    Steven Euijong Whang, Peter Lofgren, Hector Garcia-Molina

    Proc. 39th Int'l Conf. on Very Large Data Bases (PVLDB) (2013), pp. 349-360

  •  

    A Model for Quantifying Information Leakage

    Steven Euijong Whang, Hector Garcia-Molina

    Proc. 9th VLDB Workshop on Secure Data Management (SDM) (2012), pp. 25-44

  •  

    Data Analytics: Integration and Privacy

    Steven Euijong Whang

    Ph.D. Thesis (2012)

  •  

    Joint Entity Resolution

    Steven Euijong Whang, Hector Garcia-Molina

    Proc. 28th IEEE International Conference on Data Engineering (ICDE) (2012), pp. 294-305

  •  

    Developments in Generic Entity Resolution

    Steven Euijong Whang, Hector Garcia-Molina

    IEEE Data Engineering Bulletin (2011), pp. 51-59

  •  

    Managing Information Leakage

    Steven Euijong Whang, Hector Garcia-Molina

    Proc. 5th Biennial Conference on Innovative Data Systems Research (CIDR) (2011)

  •  

    Entity Resolution with Evolving Rules

    Steven Euijong Whang, Hector Garcia-Molina

    Proc. 36th Int'l Conf. on Very Large Data Bases (PVLDB) (2010), pp. 1326-1337

  •  

    Evaluating Entity Resolution Results

    David Menestrina, Steven Euijong Whang, Hector Garcia-Molina

    Proc. 36th Int'l Conf. on Very Large Data Bases (PVLDB) (2010), pp. 208-219

  •  

    Entity Resolution with Iterative Blocking

    Steven Euijong Whang, David Menestrina, Georgia Koutrika, Martin Theobald, Hector Garcia-Molina

    Proc. 2009 ACM SIGMOD Int'l Conf. on Management of Data (SIGMOD), pp. 219-232

  •  

    Generic Entity Resolution with Negative Rules

    Steven Euijong Whang, Omar Benjelloun, Hector Garcia-Molina

    The VLDB Journal, vol. 18 (2009), pp. 1261-1277

  •  

    Indexing Boolean Expressions

    Steven Euijong Whang, Chad Brower, Jayavel Shanmugasundaram, Sergei Vassilvitskii, Erik Vee, Ramana Yerneni, Hector Garcia-Molina

    Proc. 35th Int'l Conf. on Very Large Data Bases (PVLDB) (2009), pp. 37-48

  •  

    QuickStart: an Upfront Client-based Design Advisor for Parallel Data Warehouses

    Malu Castellanos, Ivo Jimenez, Neil Coddington, Steven Euijong Whang, Umeshwar Dayal

    In Proc. 25th Int'l Conf. on Data Engineering (ICDE) (2009), pp. 1543-1546

  •  

    Swoosh: A Generic Approach to Entity Resolution

    Omar Benjelloun, Hector Garcia-Molina, David Menestrina, Qi Su, Steven Euijong Whang, Jennifer Widom

    The VLDB Journal, vol. 18 (2009), pp. 255-276

  •  

    A Practitioner's Approach to Normalizing XQuery Expressions

    Ki-hoon Lee, Seo-young Kim, Steven Euijong Whang, Jae-gil Lee

    Proc. 11th Int'l Symposium on Database Systems for Advanced Applications (DASFAA) (2006), pp. 437-453