“Thresher: automating the unwrapping of semantic content from the World Wide Web”, Andrew Hogue, David Karger, WWW '05: Proceedings of the 14th international conference on World Wide Web, 2005, pp. 86-95. [doi.acm.org] [pdf] [search]