Arun Narayanan

Authored Publications
    Speech enhancement (SE) is used as a frontend in speech applications including automatic speech recognition (ASR) and telecommunication. A difficulty in using the SE frontend is that the appropriate noise reduction level differs depending on the application and/or noise characteristics. In this study, we propose "signal-to-noise ratio improvement (SNRi) target training": the SE frontend is trained to output a signal whose SNRi is controlled by an auxiliary scalar input. In joint training with a backend, the target SNRi value is estimated by an auxiliary network. By training all networks to minimize the backend task loss, we can estimate the appropriate noise reduction level for each noisy input in a data-driven scheme. Our experiments show that SNRi target training enables control of the output SNRi. In addition, the proposed joint training reduces word error rate by a relative 4.0% and 5.7% compared to a Conformer-based standard ASR model and a conventional SE-ASR joint training model, respectively. Furthermore, by analyzing the predicted target SNRi, we observe that the jointly trained network automatically adjusts the target SNRi according to noise characteristics. Audio demos are available on our demo page [google.github.io/df-conformer/snri_target/].
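    The central idea above is conditioning an SE frontend on a scalar SNRi target. Below is a minimal PyTorch sketch of that conditioning, assuming a mask-based enhancer over spectral features; the layer sizes and the FiLM-style scale/shift conditioning are illustrative choices, not the paper's architecture.

```python
import torch
import torch.nn as nn

class SnriConditionedEnhancer(nn.Module):
    """Toy SE frontend whose output is conditioned on a scalar SNRi target."""

    def __init__(self, feat_dim=257, hidden=256):
        super().__init__()
        self.rnn = nn.LSTM(feat_dim, hidden, batch_first=True)
        # FiLM-style conditioning: the scalar target scales and shifts hidden units.
        self.film = nn.Linear(1, 2 * hidden)
        self.mask = nn.Linear(hidden, feat_dim)

    def forward(self, noisy_feats, snri_target):
        # noisy_feats: (batch, time, feat_dim); snri_target: (batch, 1) in dB.
        h, _ = self.rnn(noisy_feats)
        scale, shift = self.film(snri_target).chunk(2, dim=-1)
        h = h * (1.0 + scale.unsqueeze(1)) + shift.unsqueeze(1)
        mask = torch.sigmoid(self.mask(h))
        return mask * noisy_feats  # enhanced features


enhancer = SnriConditionedEnhancer()
feats = torch.randn(2, 100, 257)
target = torch.tensor([[5.0], [15.0]])  # desired SNR improvement per utterance, in dB
enhanced = enhancer(feats, target)
```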
    Previous research on deliberation networks has achieved excellent recognition quality. Attention-decoder-based deliberation models often work as rescorers to improve first-pass recognition results, and typically require the full first-pass hypothesis before second-pass deliberation can begin. In this work, we propose a streaming transducer-based deliberation model. The joint network of a transducer decoder usually takes inputs from the encoder and the prediction network. We propose attention over the first-pass text hypotheses as a third input to the joint network. The proposed transducer-based deliberation model streams naturally, making it more desirable for on-device applications. We also show that the model improves rare word recognition, with relative WER reductions ranging from 3.6% to 10.4% across a variety of test sets. Our model does not use any additional text data for training.
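    The architectural change described above is a joint network with a third input: an attention summary of the first-pass text hypotheses. A minimal PyTorch sketch under that assumption follows; the dimensions and the single-head dot-product attention are illustrative stand-ins.

```python
import torch
import torch.nn as nn

class DeliberationJointNetwork(nn.Module):
    """Transducer joint network with attention over first-pass hypotheses as a third input."""

    def __init__(self, enc_dim=512, pred_dim=640, hyp_dim=512, joint_dim=640, vocab=4096):
        super().__init__()
        self.query = nn.Linear(pred_dim, hyp_dim)
        self.proj = nn.Linear(enc_dim + pred_dim + hyp_dim, joint_dim)
        self.out = nn.Linear(joint_dim, vocab)

    def forward(self, enc, pred, hyp_enc):
        # enc: (B, D_enc) one encoder frame; pred: (B, D_pred) prediction-network output;
        # hyp_enc: (B, N, D_hyp) encoded first-pass hypothesis tokens.
        q = self.query(pred).unsqueeze(1)                      # (B, 1, D_hyp)
        att = torch.softmax(q @ hyp_enc.transpose(1, 2), -1)   # (B, 1, N)
        context = (att @ hyp_enc).squeeze(1)                   # (B, D_hyp)
        joint = torch.tanh(self.proj(torch.cat([enc, pred, context], dim=-1)))
        return self.out(joint)                                 # logits over the vocabulary


jn = DeliberationJointNetwork()
logits = jn(torch.randn(2, 512), torch.randn(2, 640), torch.randn(2, 10, 512))
```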
    Recent work has designed methods to demonstrate that model updates in ASR training can leak potentially sensitive attributes of the utterances used in computing the updates. In this work, we design the first method to demonstrate information leakage about training data from trained ASR models. We design Noise Masking, a fill-in-the-blank style method for extracting targeted parts of training data from trained ASR models. We demonstrate the success of Noise Masking by using it in four settings to extract names from the LibriSpeech dataset used to train a state-of-the-art Conformer model. In particular, we show that we are able to extract the correct names from masked training utterances with 11.8% accuracy, while the model outputs some name from the train set 55.2% of the time. Further, we show that even in a setting that uses synthetic audio and partial transcripts from the test set, our method achieves 2.5% correct name accuracy (47.7% any-name success rate). Lastly, we design Word Dropout, a data augmentation method that, when used in training along with Multistyle TRaining (MTR), provides utility comparable to the baseline while significantly mitigating extraction via Noise Masking across the four evaluated settings.
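    Noise Masking is described as a fill-in-the-blank attack: the audio span corresponding to a target word is replaced with noise, and the model is asked to transcribe the result. A minimal numpy sketch of that masking step, assuming the word's start/end times are known; the Gaussian noise and RMS level matching are illustrative choices, not the paper's exact recipe.

```python
import numpy as np

def noise_mask(waveform, start_s, end_s, sample_rate=16000, rng=None):
    """Replace the [start_s, end_s) span of a waveform with level-matched noise."""
    rng = rng or np.random.default_rng(0)
    masked = waveform.copy()
    lo, hi = int(start_s * sample_rate), int(end_s * sample_rate)
    rms = np.sqrt(np.mean(waveform ** 2) + 1e-12)
    masked[lo:hi] = rng.normal(0.0, rms, size=hi - lo)
    return masked


audio = np.random.default_rng(1).normal(0, 0.1, 16000 * 3)   # 3 s of placeholder audio
attacked = noise_mask(audio, start_s=1.2, end_s=1.8)          # mask the (hypothetical) name span
```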
    Improving Streaming ASR with Non-streaming Model Distillation on Unsupervised Data
    Chung-Cheng Chiu
    Liangliang Cao
    Ruoming Pang
    Thibault Doutre
    Wei Han
    Yu Zhang
    Zhiyun Lu
    ICASSP 2021 (to appear)
    Streaming end-to-end Automatic Speech Recognition (ASR) models are widely used on smart speakers and in on-device applications. Since these models are expected to transcribe speech with minimal latency, they are constrained to be causal with no future context, unlike their non-streaming counterparts. As a result, streaming models almost always perform worse than non-streaming models. We propose a novel and effective learning method that leverages a non-streaming ASR model as a teacher to generate transcripts on an arbitrarily large data set, which are then used to distill knowledge into streaming ASR models. This way, we are able to scale the training of streaming models to 3M hours of YouTube audio. Experiments show that our approach can significantly reduce the Word Error Rate (WER) of RNN-T models trained from YouTube data in four languages.
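    The recipe above is pseudo-labeling: a full-context teacher transcribes unlabeled audio, and the streaming student trains on those machine transcripts. A schematic sketch under that assumption; `teacher`, `student`, and their methods are hypothetical stand-ins for real model objects, not an actual API.

```python
def distill_on_unsupervised_audio(teacher, student, unlabeled_batches, optimizer):
    """Train a streaming student on transcripts produced by a non-streaming teacher."""
    for audio_batch in unlabeled_batches:
        # 1. Teacher (non-causal, full future context) produces pseudo-labels offline.
        pseudo_transcripts = teacher.transcribe(audio_batch)
        # 2. Student (causal, streaming) is trained as if these were ground truth.
        loss = student.transducer_loss(audio_batch, pseudo_transcripts)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```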
    On-device end-to-end (E2E) models have shown improvements over a conventional model on Search test sets in both quality, as measured by Word Error Rate (WER), and latency, measured by the time the result is finalized after the user stops speaking. However, the E2E model is trained on a small fraction of audio-text pairs compared to the 100 billion text utterances that a conventional language model (LM) is trained with. Thus, E2E models perform poorly on rare words and phrases. In this paper, building upon the two-pass streaming Cascaded Encoder E2E model, we explore using a Hybrid Autoregressive Transducer (HAT) factorization to better integrate an on-device neural LM trained on text-only data. Furthermore, to improve decoder latency we introduce a non-recurrent embedding decoder, in place of the typical LSTM decoder, into the Cascaded Encoder model. Overall, we present a streaming on-device model that incorporates an external neural LM and outperforms the conventional model in both search and rare-word quality, as well as latency, while being 318X smaller.
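    One concrete change mentioned above is replacing the LSTM prediction network with a non-recurrent embedding decoder that looks only at the last few labels. A minimal PyTorch sketch of such a decoder, assuming a fixed history of N previous word pieces; the vocabulary size, context length, and projection sizes are illustrative.

```python
import torch
import torch.nn as nn

class EmbeddingDecoder(nn.Module):
    """Non-recurrent prediction network: embeds the last N labels and projects them."""

    def __init__(self, vocab=4096, embed_dim=320, context=2, out_dim=640):
        super().__init__()
        self.context = context
        self.embed = nn.Embedding(vocab, embed_dim)
        self.proj = nn.Linear(context * embed_dim, out_dim)

    def forward(self, last_labels):
        # last_labels: (B, context) most recent word-piece IDs, oldest first.
        e = self.embed(last_labels).flatten(1)   # (B, context * embed_dim)
        return torch.relu(self.proj(e))          # (B, out_dim)


dec = EmbeddingDecoder()
pred_out = dec(torch.randint(0, 4096, (2, 2)))
```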
    Streaming automatic speech recognition (ASR) aims to output each hypothesized word as quickly and accurately as possible. However, reducing latency while retaining accuracy is highly challenging. Existing approaches, including Early and Late Penalties [li2020towards] and Constrained Alignment [sainath2020emitting], penalize emission delay by manipulating per-token or per-frame RNN-T output logits. While successful in reducing latency, these approaches lead to significant accuracy degradation. In this work, we propose a sequence-level emission regularization technique, named FastEmit, that applies emission latency regularization directly on the transducer forward-backward probabilities. We demonstrate that FastEmit is better suited to the sequence-level transducer [Graves12] training objective for streaming ASR networks. We apply FastEmit to various end-to-end (E2E) ASR networks, including RNN-Transducer [Ryan19], Transformer-Transducer [zhang2020transformer], ConvNet-Transducer [han2020contextnet], and Conformer-Transducer [gulati2020conformer], and achieve 150-300 ms latency reduction over previous work without accuracy degradation on a Voice Search test set. FastEmit also improves streaming ASR accuracy from 4.4%/8.9% to 3.1%/7.5% WER, while reducing 90th percentile latency from 210 ms to only 30 ms on LibriSpeech.
    In this paper, we introduce a streaming keyphrase detection system that can be easily customized to accurately detect any phrase composed of words from a large vocabulary. The system is implemented with an end-to-end trained automatic speech recognition (ASR) model and a text-independent speaker verification model. To address the challenge of detecting these keyphrases under various noisy conditions, a speaker separation model is added to the feature frontend of the speaker verification model, and an adaptive noise cancellation (ANC) algorithm is included to exploit cross-microphone noise coherence. Our experiments show that the text-independent speaker verification model largely reduces the false triggering rate of the keyphrase detection, while the speaker separation model and adaptive noise cancellation largely reduce false rejections.
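    The system above gates keyphrase triggers on both the recognized text and the enrolled speaker's identity, with separation cleaning up the speaker-verification input. A schematic of that gating logic follows; `separate`, `asr`, and `speaker_score` are hypothetical callables standing in for the real separation, ASR, and speaker-verification components, and the threshold is illustrative.

```python
def detect_keyphrase(mixture, keyphrases, enrolled_embedding,
                     separate, asr, speaker_score, threshold=0.7):
    """Fire only if the recognized text contains a keyphrase AND the speaker matches."""
    target_audio = separate(mixture)                      # speaker separation frontend
    hypothesis = asr(mixture).lower()                     # streaming ASR transcript
    text_hit = any(phrase in hypothesis for phrase in keyphrases)
    speaker_hit = speaker_score(target_audio, enrolled_embedding) >= threshold
    return text_hit and speaker_hit
```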
    End-to-end models that condition the output sequence on all previously predicted labels have emerged as popular alternatives to conventional systems for automatic speech recognition (ASR). Since distinct label histories correspond to distinct model states, such models are decoded using an approximate beam search which produces a tree of hypotheses. In this work, we study the influence of the amount of label context on the model's accuracy, and its impact on the efficiency of the decoding process. We find that we can limit the context of the recurrent neural network transducer (RNN-T) during training to just four previous word-piece labels without degrading word error rate (WER) relative to the full-context baseline. Limiting context also provides opportunities to improve decoding efficiency by removing redundant paths from the active beam and instead retaining them in the final lattice. This path-merging scheme can also be applied when decoding the baseline full-context model through an approximation. Overall, we find that the proposed path-merging scheme is extremely effective, allowing us to improve oracle WERs by up to 36% over the baseline, while simultaneously reducing the number of model evaluations by up to 5.3% without any degradation in WER, or up to 15.7% when lattice rescoring is applied.
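    With the label context limited to the last few word pieces, two beam hypotheses whose recent histories match lead the model to the same state, so they can be merged and the discarded path kept for the lattice. A minimal sketch of that merge, assuming each hypothesis is a `(tokens, log_prob)` pair; the context length of four follows the abstract, everything else is illustrative.

```python
from collections import defaultdict

def merge_beam(hypotheses, context=4):
    """Merge hypotheses that share their last `context` word-piece labels."""
    merged = {}
    lattice_arcs = defaultdict(list)   # discarded paths, retained for the output lattice
    for tokens, log_prob in hypotheses:
        state = tuple(tokens[-context:])          # model state is the recent history only
        if state not in merged or log_prob > merged[state][1]:
            if state in merged:
                lattice_arcs[state].append(merged[state])
            merged[state] = (tokens, log_prob)
        else:
            lattice_arcs[state].append((tokens, log_prob))
    return list(merged.values()), lattice_arcs


beam = [(["the", "cat", "sat", "on", "a"], -3.2),
        (["a", "cat", "sat", "on", "a"], -3.9)]   # same last-4 context, so they merge
survivors, lattice = merge_beam(beam)
```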
    Thus far, end-to-end (E2E) models have not been shown to outperform state-of-the-art conventional models with respect to both quality, i.e., word error rate (WER), and latency, i.e., the time the hypothesis is finalized after the user stops speaking. In this paper, we develop a first-pass Recurrent Neural Network Transducer (RNN-T) model and a second-pass Listen, Attend, Spell (LAS) rescorer that surpass a conventional model in both quality and latency. On the quality side, we incorporate a large number of utterances across varied domains to increase acoustic diversity and the vocabulary seen by the model. We also train with accented English speech to make the model more robust to different pronunciations. In addition, given the increased amount of training data, we explore a varied learning rate schedule. On the latency front, we explore using the end-of-sentence decision emitted by the RNN-T model to close the microphone, and also introduce various optimizations to improve the speed of LAS rescoring. Overall, we find that RNN-T+LAS offers a better WER and latency tradeoff compared to a conventional model. For example, at the same latency, RNN-T+LAS obtains an 8% relative improvement in WER, while being more than 400 times smaller in model size.
    Conventional spoken language understanding systems consist of two main components: an automatic speech recognition module that converts audio to text, and a natural language understanding module that transforms the resulting text (or top-N hypotheses) into a set of intents and arguments. These modules are typically optimized independently. In this paper, we formulate audio-to-semantic understanding as a sequence-to-sequence problem. We propose and compare various encoder-decoder based approaches that optimize both modules jointly, in an end-to-end manner. We evaluate these methods on a real-world task. Our results show that having an intermediate text representation while jointly optimizing the full system improves prediction accuracy.
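    The finding above is that keeping an intermediate text representation helps even when optimizing end-to-end. One way to realize this in a single sequence-to-sequence model is to make the target sequence the transcript followed by a serialized semantic frame; the sketch below shows such a target under that assumption, with the delimiter and slot tokens being illustrative, not the paper's format.

```python
def build_slu_target(transcript, intent, arguments):
    """Serialize transcript + semantics into one target sequence for a seq2seq model."""
    semantic = [f"<intent:{intent}>"] + [f"<{slot}={value}>" for slot, value in arguments.items()]
    return transcript.split() + ["<sep>"] + semantic


target = build_slu_target(
    "set an alarm for seven am",
    intent="create_alarm",
    arguments={"time": "7:00 AM"},
)
# ['set', 'an', 'alarm', 'for', 'seven', 'am', '<sep>', '<intent:create_alarm>', '<time=7:00 AM>']
```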
    In this paper, we present an algorithm which introduces phase perturbation to the training database when training phase-sensitive deep neural network models. Traditional features such as log-mel or cepstral features do not carry any phase-relevant information, whereas more recent features such as raw-waveform or complex spectral features do. Phase-sensitive features have the advantage of being able to detect differences in time of arrival across different microphone channels or frequency bands. However, compared to magnitude-based features, phase information is more sensitive to various kinds of distortion, such as variations in microphone characteristics, reverberation, and so on. For traditional magnitude-based features, it is widely known that adding noise or reverberation, often called Multistyle TRaining (MTR), improves robustness. In a similar spirit, we propose an algorithm which introduces spectral and phase distortion to make the deep learning model more robust against phase distortion. We call these approaches Spectral-Distortion TRaining (SDTR) and Phase-Distortion TRaining (PDTR), respectively. In our experiments using a training set consisting of 22 million utterances, this approach has proved quite successful in reducing Word Error Rates on test sets recorded with real microphones on Google Home.
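    PDTR, as described above, perturbs the phase of the training data so that phase-sensitive models become robust to phase distortion. A minimal numpy sketch that perturbs a complex spectrum with a random, frequency-dependent phase offset; the perturbation statistics and shapes are illustrative, not the paper's recipe.

```python
import numpy as np

def perturb_phase(complex_spectrum, max_phase_rad=0.5, rng=None):
    """Apply a random, frequency-dependent phase offset to a (frames, bins) complex spectrum."""
    rng = rng or np.random.default_rng(0)
    num_bins = complex_spectrum.shape[-1]
    # One random offset per frequency bin, shared across frames (mimics a channel effect).
    phase_offset = rng.uniform(-max_phase_rad, max_phase_rad, size=num_bins)
    return complex_spectrum * np.exp(1j * phase_offset)


frames = np.random.default_rng(1).normal(size=(10, 512))   # 10 frames of placeholder audio
spec = np.fft.rfft(frames, axis=-1)                         # (10, 257) complex spectrum
perturbed = perturb_phase(spec)
```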
    Current state-of-the-art automatic speech recognition systems are trained to work in specific ‘domains’, defined by factors like application, sampling rate, and codec. When such recognizers are used in conditions that do not match the training domain, performance drops significantly. In this paper, we explore the idea of building a single domain-invariant model that works well for varied use cases. We do this by combining large-scale training data from multiple application domains. Our final system is trained on 162,000 hours of speech. Additionally, each utterance is artificially distorted during training to simulate effects like background noise, codec distortion, and varying sampling rates. Our results show that, even at such a scale, a model thus trained works almost as well as those fine-tuned to specific subsets: a single model can be trained to be robust to multiple application domains and to other variations like codecs and noise. Such models also generalize better to unseen conditions and allow for rapid adaptation to new domains: using as little as 10 hours of data to adapt a domain-invariant model to a new domain, we can match the performance of a domain-specific model trained from scratch using roughly 70 times as much data. We also highlight some of the limitations of such models and areas that need to be addressed in future work.
    Domain robustness is a challenging problem for automatic speech recognition (ASR). In this paper, we consider speech data collected for different applications as separate domains and investigate the robustness of acoustic models trained on multi-domain data on unseen domains. Specifically, we use the Factorized Hidden Layer (FHL) as a compact low-rank representation to adapt a multi-domain ASR system to unseen domains. Experimental results on two unseen domains show that FHL is a more effective adaptation method than selectively fine-tuning part of the network, without dramatically increasing the number of model parameters. Furthermore, we find that using singular value decomposition to initialize the low-rank bases of an FHL model leads to faster convergence and improved performance.
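    FHL adapts a layer by adding a low-rank, domain-dependent correction to its weight matrix. A minimal PyTorch sketch of that idea with bases B and A and a small per-domain amplitude vector d; the SVD initialization mentioned above would set B and A from the singular vectors of the pretrained weights. All sizes here are illustrative.

```python
import torch
import torch.nn as nn

class FactorizedHiddenLayer(nn.Module):
    """Linear layer with a low-rank, domain-dependent adaptation term: W + B diag(d) A."""

    def __init__(self, in_dim=512, out_dim=512, rank=16):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_dim, in_dim) * 0.01)
        self.bias = nn.Parameter(torch.zeros(out_dim))
        self.B = nn.Parameter(torch.randn(out_dim, rank) * 0.01)   # low-rank bases
        self.A = nn.Parameter(torch.randn(rank, in_dim) * 0.01)
        self.d = nn.Parameter(torch.zeros(rank))                   # per-domain amplitudes

    def forward(self, x):
        adapted = self.weight + self.B @ torch.diag(self.d) @ self.A
        return x @ adapted.t() + self.bias


layer = FactorizedHiddenLayer()
out = layer(torch.randn(4, 512))
# To adapt to an unseen domain, freeze everything except `d` (and optionally B and A).
```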
    In this paper, we describe how to efficiently implement an acoustic room simulator to generate large-scale simulated data for training deep neural networks. Even though the Google Room Simulator in [1] was shown to be quite effective in reducing Word Error Rates (WERs) for far-field applications by generating simulated far-field training sets, it requires a very large number of Fast Fourier Transforms (FFTs) of large size. The Room Simulator in [1] used approximately 80 percent of Central Processing Unit (CPU) usage in our CPU + Graphics Processing Unit (GPU) training architecture [2]. In this work, we implement efficient OverLap-Add (OLA) based filtering using the open-source FFTW3 library. Further, we investigate the effects of Room Impulse Response (RIR) lengths. Experimentally, we conclude that we can cut the tail portions of RIRs whose power falls more than 20 dB below the maximum power without sacrificing speech recognition accuracy. However, we observe that cutting the RIR tail beyond this threshold harms speech recognition accuracy on rerecorded test sets. Using these approaches, we were able to reduce the CPU usage of the room simulator to 9.69 percent in the CPU/GPU training architecture. Profiling results show that we obtain a 22.4 times speed-up on a single machine and a 37.3 times speed-up on Google's distributed training infrastructure.
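    One concrete finding above is that RIR tails more than 20 dB below the peak power can be discarded without hurting accuracy. A minimal numpy sketch of that truncation; the frame-energy smoothing is an illustrative way to locate the cut point, not the paper's exact procedure.

```python
import numpy as np

def truncate_rir_tail(rir, threshold_db=20.0, frame=64):
    """Cut the RIR after the last frame whose power is within `threshold_db` of the peak."""
    num_frames = len(rir) // frame
    frames = rir[: num_frames * frame].reshape(num_frames, frame)
    power_db = 10.0 * np.log10(np.mean(frames ** 2, axis=1) + 1e-12)
    keep = np.where(power_db >= power_db.max() - threshold_db)[0]
    cut = (keep[-1] + 1) * frame
    return rir[:cut]


# Exponentially decaying synthetic RIR; the truncated version is much shorter.
rir = np.exp(-np.arange(8000) / 800.0) * np.random.default_rng(0).normal(size=8000)
short_rir = truncate_rir_tail(rir)
```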
    This paper describes the technical and system-building advances made to the Google Home multichannel speech recognition system, which was launched in November 2016. Technical advances include an adaptive dereverberation frontend, the use of neural network models that do multichannel processing jointly with acoustic modeling, and Grid LSTMs to model frequency variations. On the system level, improvements include adapting the model using Google Home-specific data. We present results on a variety of multichannel test sets. The combination of technical and system advances results in a relative WER reduction of over 18% compared to the current production system.
    Multichannel ASR systems commonly separate speech enhancement, including localization, beamforming and postfiltering, from acoustic modeling. In this paper, we perform multichannel enhancement jointly with acoustic modeling in a deep neural network framework. Inspired by beamforming, which leverages differences in the fine time structure of the signal at different microphones to filter energy arriving from different directions, we explore modeling the raw time-domain waveform directly. We introduce a neural network architecture which performs multichannel filtering in the first layer of the network and show that this network learns to be robust to varying target speaker direction of arrival, performing as well as a model that is given oracle knowledge of the true target speaker direction. Next, we show how performance can be improved by factoring the first layer to separate the multichannel spatial filtering operation from a single-channel filterbank which computes a frequency decomposition. We also introduce an adaptive variant, which updates the spatial filter coefficients at each time frame based on the previous inputs. Finally, we demonstrate that these approaches can be implemented more efficiently in the frequency domain. Overall, we find that such multichannel neural networks give a relative word error rate improvement of more than 5% compared to a traditional beamforming-based multichannel ASR system and more than 10% compared to a single-channel waveform model.
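    The first-layer multichannel filtering described above can be pictured as a bank of FIR filters applied jointly across microphone channels on the raw waveform. A minimal PyTorch sketch using a single Conv1d whose input channels are the microphones; the filter count, filter length, hop, and nonlinearity are illustrative, not the paper's configuration.

```python
import torch
import torch.nn as nn

class MultichannelFilterLayer(nn.Module):
    """First layer: joint spatial + spectral FIR filtering of a multichannel raw waveform."""

    def __init__(self, num_mics=2, num_filters=128, filter_len=400, hop=160):
        super().__init__()
        # Each output filter sees all microphones at once, so it can exploit
        # inter-channel time differences (a beamforming-like operation).
        self.filters = nn.Conv1d(num_mics, num_filters, filter_len, stride=hop)

    def forward(self, waveforms):
        # waveforms: (batch, num_mics, samples)
        y = self.filters(waveforms)
        # Rectify and compress, analogous to a learned filterbank feature.
        return torch.log1p(torch.relu(y))   # (batch, num_filters, frames)


layer = MultichannelFilterLayer()
feats = layer(torch.randn(2, 2, 16000))    # two utterances, two microphones, 1 s at 16 kHz
```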
    Multichannel ASR systems commonly separate speech enhancement, including localization, beamforming and postfiltering, from acoustic modeling. In this chapter, we perform multi-channel enhancement jointly with acoustic modeling in a deep neural network framework. Inspired by beamforming, which leverages differences in the fine time structure of the signal at different microphones to filter energy arriving from different directions, we explore modeling the raw time-domain waveform directly. We introduce a neural network architecture which performs multichannel filtering in the first layer of the network and show that this network learns to be robust to varying target speaker direction of arrival, performing as well as a model that is given oracle knowledge of the true target speaker direction. Next, we show how performance can be improved by factoring the first layer to separate the multichannel spatial filtering operation from a single-channel filterbank which computes a frequency decomposition. We also introduce an adaptive variant, which updates the spatial filter coefficients at each time frame based on the previous inputs. Finally, we demonstrate that these approaches can be implemented more efficiently in the frequency domain. Overall, we find that such multichannel neural networks give a relative word error rate improvement of more than 5% compared to a traditional beamforming-based multichannel ASR system and more than 10% compared to a single-channel waveform model.
    We describe the structure and application of an acoustic room simulator to generate large-scale simulated data for training deep neural networks for far-field speech recognition. The system simulates millions of different room dimensions, a wide distribution of reverberation times and signal-to-noise ratios, and a range of microphone and sound source locations. We start with a relatively clean training set as the source and artificially create simulated data by randomly sampling a noise configuration for every new training example. As a result, the acoustic model is trained using examples that are virtually never repeated. We evaluate the performance of this approach based on room simulation using a factored complex Fast Fourier Transform (CFFT) acoustic model introduced in our earlier work, which uses CFFT layers and LSTM acoustic models for joint multichannel processing and acoustic modeling. Results show that the simulator-driven approach is quite effective in obtaining large improvements not only in simulated test conditions, but also in real/rerecorded conditions. This room simulation system has been employed in training acoustic models including the ones for the recently released Google Home.
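    The simulator above draws a fresh room and noise configuration for every training example. A minimal sketch of that sampling step; the parameter names and ranges are illustrative placeholders, not the production distributions.

```python
import random

def sample_room_configuration(rng=None):
    """Draw one simulated room/noise configuration for a single training example."""
    rng = rng or random.Random()
    return {
        "room_dims_m": [rng.uniform(3.0, 10.0), rng.uniform(3.0, 10.0), rng.uniform(2.4, 4.0)],
        "rt60_s": rng.uniform(0.1, 0.9),              # reverberation time
        "snr_db": rng.uniform(0.0, 30.0),             # speech-to-noise ratio
        "mic_position_m": [rng.uniform(0.5, 2.5) for _ in range(3)],
        "source_position_m": [rng.uniform(0.5, 2.5) for _ in range(3)],
    }


config = sample_room_configuration(random.Random(0))   # a new draw per training example
```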
    Recently, it was shown that the performance of supervised time-frequency masking based robust automatic speech recognition techniques can be improved by training them jointly with the acoustic model [1]. The system in [1], termed deep neural network based joint adaptive training, used fully-connected feed-forward deep neural networks for estimating time-frequency masks and for acoustic modeling; stacked log-mel spectra were used as features, and training minimized a cross-entropy loss. In this work, we extend such jointly trained systems in several ways. First, we use recurrent neural networks based on long short-term memory (LSTM) units, which allows the use of unstacked features and simplifies joint optimization. Next, we use a sequence-discriminative training criterion for optimizing parameters. Finally, we conduct experiments on large-scale data and show that joint adaptive training can provide gains over a strong baseline. Systematic evaluations on noisy voice-search data show relative improvements ranging from 2% at 15 dB to 5.4% at -5 dB over a sequence-discriminative, multi-condition trained LSTM acoustic model.
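    Joint adaptive training, as described above, couples a mask estimator with the acoustic model and trains both with a single recognition loss. A minimal PyTorch sketch of the coupled forward pass; applying the mask directly to log-mel features and using a frame-level cross-entropy loss are simplifications standing in for the paper's masking pipeline and sequence-discriminative criterion, and all sizes are illustrative.

```python
import torch
import torch.nn as nn

class JointMaskAndAcousticModel(nn.Module):
    """Mask-based enhancement and acoustic modeling trained with one loss."""

    def __init__(self, feat_dim=40, hidden=256, num_states=512):
        super().__init__()
        self.mask_lstm = nn.LSTM(feat_dim, hidden, batch_first=True)
        self.mask_out = nn.Linear(hidden, feat_dim)
        self.am_lstm = nn.LSTM(feat_dim, hidden, batch_first=True)
        self.am_out = nn.Linear(hidden, num_states)

    def forward(self, noisy_logmel):
        h, _ = self.mask_lstm(noisy_logmel)
        mask = torch.sigmoid(self.mask_out(h))   # time-frequency mask
        enhanced = mask * noisy_logmel           # enhancement in the feature domain
        a, _ = self.am_lstm(enhanced)
        return self.am_out(a)                    # per-frame state logits


model = JointMaskAndAcousticModel()
logits = model(torch.randn(2, 100, 40))
targets = torch.randint(0, 512, (2, 100))
loss = nn.functional.cross_entropy(logits.reshape(-1, 512), targets.reshape(-1))
loss.backward()   # gradients flow through both the acoustic model and the mask estimator
```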
    Improving robustness of deep neural network acoustic models via speech separation and joint adaptive training
    DeLiang Wang
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23 (2015), pp. 92-101
    Computational auditory scene analysis and robust automatic speech recognition
    Ph.D. Thesis, Ohio State University (2014)
    Analysis by synthesis feature estimation for robust automatic speech recognition using spectral masks
    Michael I Mandel
    Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE (2014), pp. 2528-2532
    Joint noise adaptive training for robust automatic speech recognition
    DeLiang Wang
    Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE (2014), pp. 2523-2527
    Investigation of speech separation as a front-end for noise robust speech recognition
    DeLiang Wang
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 22 (2014), pp. 826-835
    On training targets for supervised speech separation
    Yuxuan Wang
    DeLiang Wang
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 22 (2014), pp. 1849-1858
    Ideal ratio mask estimation using deep neural networks for robust speech recognition
    DeLiang Wang
    Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE (2013), pp. 7092-7096
    Coupling binary masking and robust ASR
    DeLiang Wang
    Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE (2013), pp. 6817-6821
    The role of binary mask patterns in automatic speech recognition in background noise
    DeLiang Wang
    Journal of the Acoustical Society of America, vol. 133 (2013), pp. 3083-3093
    A direct masking approach to robust ASR
    William Hartmann
    Eric Fosler-Lussier
    DeLiang Wang
    IEEE Transactions on Audio, Speech, and Language Processing, vol. 21 (2013), pp. 1993-2005
    Computational auditory scene analysis and automatic speech recognition
    DeLiang Wang
    Techniques for Noise Robustness in Automatic Speech Recognition, John Wiley & Sons (2012), pp. 433-462
    On the role of binary mask pattern in automatic speech recognition
    DeLiang Wang
    INTERSPEECH-2012, ISCA, pp. 1239-1242
    A CASA based system for long-term SNR estimation
    DeLiang Wang
    IEEE Transactions on Audio, Speech, and Language Processing, vol. 20 (2012), pp. 2518-2527
    On the use of ideal binary masks for improving phonetic classification
    DeLiang Wang
    Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE (2011), pp. 5212-5215
    Robust speech recognition using multiple prior models for speech reconstruction
    Xiaojia Zhao
    DeLiang Wang
    Eric Fosler-Lussier
    Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), IEEE (2011), pp. 4800-4803
    Robust speech recognition from binary masks
    DeLiang Wang
    Journal of the Acoustical Society of America, vol. 128 (2010), pp. EL217-EL222