Thresher: automating the unwrapping of semantic content from the World Wide Web, Andrew Hogue, David Karger, WWW '05: Proceedings of the 14th international conference on World Wide Web, 2005, pp. 86-95.