Ray Smith

Ray developed the Tesseract OCR engine at HPLabs Bristol for 10 years, followed by a 3 year term developing the text and line drawings pipelines for the HP PrecisionScan product in Greeley, Colorado. After spending a further 7 years developing a new architecture for the Omnipage OCR product for Caere/Scansoft/Nuance, Ray is now at Google, working on Tesseract again.

Google Publications

Previous Publications

  •   

    A simple and efficient skew detection algorithm via text row accumulation

    Ray Smith

    Proceedings 3rd ICDAR'95, IEEE (1995), pp. 1145-1148

  •   

    Computer processing of line images: a survey

    R. W. Smith

    Pattern Recogn., vol. 20 (1987), pp. 7-15