David Rybach

David Rybach is currently a Software Engineer at Google. His research focuses on decoding methods for automatic speech recognition and related topics. He received his PhD from RWTH Aachen University in 2014.

Research Areas

Speech Processing

Authored Publications

Google Publications

Other Publications

E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR

W. Ronny Huang

Shuo-yiin Chang

David Rybach

Tara N Sainath

Rohit Prabhavalkar

Cyril Allauzen

Cal Peyser

Zhiyun Lu

Interspeech 2022 (2022) (to appear)

Handling Compounding in Mobile Keyboard Input

Andreas Christian Kabel

Keith B. Hall

Tom Ouyang

David Rybach

Daan van Esch

Françoise Simone Beaufays

arXiv cs.CL (2022)

On Weight Interpolation of the Hybrid Autoregressive Transducer Model

Bhuvana Ramabhadran

Cyril Allauzen

David Rybach

Ehsan Variani

Michael D. Riley

Tongzhou Chen

Interspeech 2022, Interspeech 2022 (2022) (to appear)

An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling

Tara N Sainath

Yanzhang (Ryan) He

Arun Narayanan

Rami Botros

Ruoming Pang

David Johannes Rybach

Cyril Allauzen

Ehsan Variani

James Qin

Quoc-Nam Le-The

Alex Gruenstein

Anmol Gulati

Bo Li

Cal Peyser

Chung-Cheng Chiu

Diamantino A. Caseiro

Emmanuel Guzman

Ian Carmichael McGraw

Jiahui Yu

Michael D. Riley

Pat Rondon

Qiao Liang

Sepand Mavandadi

Shuo-yiin Chang

Trevor Deatrick Strohman

W. Ronny Huang

Wei Li

Yonghui Wu

Yu Zhang

Interspeech (2021) (to appear)

Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging

Rohit Prabhavalkar

Yanzhang (Ryan) He

David Johannes Rybach

Sean Campbell

Arun Narayanan

Trevor Deatrick Strohman

Tara N Sainath

ICASSP 2021, IEEE

Lookup-Table Recurrent Language Models for Long Tail Speech Recognition

W. Ronny Huang

Tara N Sainath

Cal Peyser

Shankar Kumar

David Johannes Rybach

Trevor Deatrick Strohman

Interspeech (2021) (to appear)

Hybrid Autoregressive Transducer (HAT)

Ehsan Variani

David Rybach

Cyril Allauzen

Michael Riley

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, Barcelona, Spain, pp. 6139-6143

Low Latency Speech Recognition using End-to-End Prefetching

Shuo-yiin Chang

Bo Li

David Johannes Rybach

Wei Li

Yanzhang (Ryan) He

Tara N Sainath

Trevor Deatrick Strohman

Interspeech 2020 (to appear)

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

Tara Sainath

Yanzhang (Ryan) He

Bo Li

Arun Narayanan

Ruoming Pang

Antoine Bruguier

Shuo-yiin Chang

Wei Li

Raziel Alvarez

Zhifeng Chen

Chung-Cheng Chiu

David Garcia

Alex Gruenstein

Kevin Hu

Minho Jin

Anjuli Kannan

Qiao Liang

Ian McGraw

Cal Peyser

Rohit Prabhavalkar

Golan Pundak

David Rybach

(June) Yuan Shangguan

Yash Sheth

Trevor Strohman

Mirkó Visontai

Yonghui Wu

Yu Zhang

Ding Zhao

ICASSP (2020)

STREAMING END-TO-END SPEECH RECOGNITION FOR MOBILE DEVICES

Yanzhang He

Tara Sainath

Rohit Prabhavalkar

Ian McGraw

Raziel Alvarez

Ding Zhao

David Rybach

Anjuli Kannan

Yonghui Wu

Ruoming Pang

Qiao Liang

Deepti Bhatia

Yuan Shangguan

Bo Li

Golan Pundak

Khe Chai Sim

Tom Bagby

Shuo-yiin Chang

Kanishka Rao

Alex Gruenstein

ICASSP (2019)

Two-Pass End-to-End Speech Recognition

Tara Sainath

Ruoming Pang

David Rybach

Yanzhang (Ryan) He

Rohit Prabhavalkar

Wei Li

Mirkó Visontai

Qiao Liang

Trevor Strohman

Yonghui Wu

Ian McGraw

Chung-Cheng Chiu

Interspeech (2019)

On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition

Kazuki Irie

Rohit Prabhavalkar

Anjuli Kannan

Antoine Bruguier

David Rybach

Patrick Nguyen

Interspeech (2019)

Contextual speech recognition in end-to-end neural network systems using beam search

Ian Williams

Anjuli Kannan

Petar Aleksic

David Rybach

Tara Sainath

Interspeech (2018)

No Need For A Lexicon? Evaluating The Value Of The Pronunciation Lexica In End-To-End Models

Tara Sainath

Rohit Prabhavalkar

Shankar Kumar

Seungji Lee

Anjuli Kannan

David Rybach

Vlad Schogol

Patrick Nguyen

Bo Li

Yonghui Wu

Zhifeng Chen

Chung-Cheng Chiu

ICASSP (2018)

On Lattice Generation for Large Vocabulary Speech Recognition

David Rybach

Johan Schalkwyk

Michael Riley

IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Okinawa, Japan (2017)

Transliterated mobile keyboard input via weighted finite-state transducers

Lars Hellsten

Brian Roark

Prasoon Goyal

Cyril Allauzen

Francoise Beaufays

Tom Ouyang

Michael Riley

David Rybach

Proceedings of the 13th International Conference on Finite State Methods and Natural Language Processing (FSMNLP) (2017)

Personalized Speech Recognition On Mobile Devices

Ian McGraw

Rohit Prabhavalkar

Raziel Alvarez

Montse Gonzalez Arenas

Kanishka Rao

David Rybach

Ouais Alsharif

Hasim Sak

Alexander Gruenstein

Françoise Beaufays

Carolina Parada

Proceedings of International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE (2016)

Bringing Contextual Information to Google Speech Recognition

Petar Aleksic

Mohammadreza Ghodsi

Assaf Michaely

Cyril Allauzen

Keith Hall

Brian Roark

David Rybach

Pedro Moreno

Interspeech 2015, International Speech Communications Association

Preview

Multitask learning and system combination for automatic speech recognition

Olivier Siohan

David Rybach

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)

Preview

Composition-based on-the-fly rescoring for salient n-gram biasing

Keith Hall

Eunjoon Cho

Cyril Allauzen

Francoise Beaufays

Noah Coccaro

Kaisuke Nakajima

Michael Riley

Brian Roark

David Rybach

Linda Zhang

Interspeech 2015, International Speech Communications Association

Preview

Context Dependent State Tying for Speech Recognition using Deep Neural Network Acoustic Models

M. Bacchiani

D. Rybach

Proceedings of the International Conference on Acoustics,Speech and Signal Processing (2014)

Preview

Direct construction of compact context-dependency transducers from data

David Rybach

Michael Riley

Chris Alberti

Computer Speech & Language, vol. 28 (2014), pp. 177-191

Direct Construction of Compact Context-Dependency Transducers From Data

David Rybach

Michael Riley

Interspeech 2010, ISCA

Lexical Prefix Tree and WFST: A Comparison of Two Dynamic Search Concepts for LVCSR

David Rybach

Hermann Ney

Ralf Schlüter

IEEE Transactions on Audio, Speech, and Language Processing, vol. 21 (2013), pp. 1295-307

Open Vocabulary Handwriting Recognition Using Combined Word-Level and Character-Level Language Models

Michal Kozielski

David Rybach

Stefan Hahn

Ralf Schlüter

Hermann Ney

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2013), pp. 8257-8261

RWTH OCR: A Large Vocabulary Optical Character Recognition System for Arabic Scripts

Philippe Dreuw

David Rybach

Georg Heigold

Hermann Ney

Guide to OCR for Arabic Scripts, Springer (2012), pp. 215-254

Silence is Golden: Modeling Non-speech Events in WFST-based Dynamic Network Decoders

David Rybach

Ralf Schlüter

Hermann Ney

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2012), pp. 4205-4208

WFST Enabled Solutions to ASR Problems: Beyond HMM Decoding

Björn Hoffmeister

Georg Heigold

David Rybach

Ralf Schlüter

Hermann Ney

IEEE Transactions on Audio, Speech, and Language Processing, vol. 20 (2012), pp. 551-564

A Comparative Analysis of Dynamic Network Decoding

David Rybach

Ralf Schlüter

Hermann Ney

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2011), pp. 5184-5187

The RWTH Aachen University Open Source Speech Recognition System

David Rybach

Christian Gollan

Georg Heigold

Björn Hoffmeister

Jonas Lööf

Ralf Schlüter

Hermann Ney

Interspeech (2009), pp. 2111-2114

Investigations on Convex Optimization Using Log-Linear HMMs for Digit String Recognition

Georg Heigold

David Rybach

Ralf Schlüter

Hermann Ney

Interspeech (2009), pp. 216-219

Audio Segmentation for Speech Recognition using Segment Features

David Rybach

Christian Gollan

Ralf Schlüter

Hermann Ney

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2009), pp. 4197-4200

Writer Adaptive Training and Writing Variant Model Refinement for Offline Arabic Handwriting Recognition

Philippe Dreuw

David Rybach

Christian Gollan

Hermann Ney

International Conference on Document Analysis and Recognition (ICDAR) (2009), pp. 21-25

Spoken Language Processing Techniques for Sign Language Recognition and Translation.

Philippe Dreuw

Daniel Stein

Thomas Deselaers

David Rybach

Morteza Zahedi

Jan Bungeroth

Hermann Ney

Technology and Dissability, vol. 20 (2008), pp. 121-133

Advances in Arabic Broadcast News Transcription at RWTH

David Rybach

Stefan Hahn

Christian Gollan

Ralf Schluter

Hermann Ney

IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (2007), pp. 449-454

Speech Recognition Techniques for a Sign Language Recognition System

Philippe Dreuw

David Rybach

Thomas Deselaers

Morteza Zahedi

Hermann Ney

Interspeech (2007), pp. 2513-2516

Search on Google Scholar

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

David Rybach

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

David Rybach

Research Areas

Filter by:

Year

Research Area

Team

Join us

AI/ML Foundations  & Capabilities