Jump to Content

Structured Data Meets the Web: A Few Observations

Alon Halevy
Shirley Cohen
Xin (Luna) Dong
Shawn R. Jeffery
David Ko
Cong Yu
Data Engineering Bulletin (2006)

Abstract

The World Wide Web is witnessing an increase in the amount of structured content -- vast heterogeneous collections of structured data are on the rise due to the Deep Web, annotation schemes like Flickr, and sites like Google Base. While this phenomenon is creating an opportunity for structured data management, dealing with heterogeneity on the web-scale presents many new challenges. In this paper we articulate challenges based on our experience with addressing them at Google, and offer some principles for addressing them in a general fashion.

Research Areas