Cyril Allauzen

Co-Authors
-
Alexander Gruenstein
-
Assaf Hurwitz-Michaely
-
Boulos Harb
-
Brian Roark
-
Carolina Parada
-
Ciprian Chelba
-
Corinna Cortes
-
David Rybach
-
Françoise Beaufays
-
Hasim Sak
-
Jeffrey Sorensen
-
Keith B. Hall
-
Leif Johnson
-
Martin Jansche
-
Mehryar Mohri
-
Michael Riley
-
Mohammadreza Ghodsi
-
Pedro J. Moreno
-
Peng Xu
-
Petar Aleksic
-
Richard Sproat
-
Shankar Kumar
-
Tom Ouyang
-
Wolfgang Macherey
-
Yun-hsuan Sung
Cyril is an author of the OpenFst Library, the OpenKernel Library and the GRM Library.
Google Publications
-
Transliterated mobile keyboard input via weighted finite-state transducers
Lars Hellsten, Brian Roark, Prasoon Goyal, Cyril Allauzen, Françoise Beaufays, Tom Ouyang, Michael Riley, David Rybach
Proceedings of the 13th International Conference on Finite State Methods and Natural Language Processing (FSMNLP) (2017)
-
Distributed representation and estimation of WFST-based n-gram models
Cyril Allauzen, Michael Riley, Brian Roark
Proceedings of the ACL Workshop on Statistical NLP and Weighted Automata (StatFSM) (2016), pp. 32-41
-
Bringing Contextual Information to Google Speech Recognition
Petar Aleksic, Mohammadreza Ghodsi, Assaf Michaely, Cyril Allauzen, Keith Hall, Brian Roark, David Rybach, Pedro Moreno
Interspeech 2015, International Speech Communication Association
-
Composition-based on-the-fly rescoring for salient n-gram biasing
Keith Hall, Eunjoon Cho, Cyril Allauzen, Françoise Beaufays, Noah Coccaro, Kaisuke Nakajima, Michael Riley, Brian Roark, David Rybach, Linda Zhang
Interspeech 2015, International Speech Communication Association
-
Improved recognition of contact names in voice commands
Petar Aleksic, Cyril Allauzen, David Elson, Aleks Kracun, Diego Melendo Casado, Pedro J. Moreno
ICASSP 2015
-
Rapid Vocabulary Addition to Context-Dependent Decoder Graphs
Interspeech 2015
-
Encoding Linear Models As Weighted Finite-State Transducers
Ke Wu, Cyril Allauzen, Keith Hall, Michael Riley, Brian Roark
Interspeech 2014, ISCA, pp. 1258-1262
-
Pushdown automata in statistical machine translation
Cyril Allauzen, Bill Byrne, Adrià de Gispert, Gonzalo Iglesias, Michael Riley
Computational Linguistics, vol. 40 (2014), pp. 687-723
-
Language Model Verbalization for Automatic Speech Recognition
Hasim Sak, Françoise Beaufays, Kaisuke Nakajima, Cyril Allauzen
Proc ICASSP, IEEE (2013)
-
Mixture of mixture n-gram language models
Hasim Sak, Cyril Allauzen, Kaisuke Nakajima, Françoise Beaufays
ASRU (2013), pp. 31-36
-
Pre-Initialized Composition for Large-Vocabulary Speech Recognition
Interspeech 2013, pp. 666-670
-
Smoothed marginal distribution constraints for language modeling
Brian Roark, Cyril Allauzen, Michael Riley
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL) (2013), pp. 43-52
-
Written-Domain Language Modeling for Automatic Speech Recognition
Hasim Sak, Yun-hsuan Sung, Françoise Beaufays, Cyril Allauzen
Interspeech (2013)
-
A Pushdown Transducer Extension for the OpenFst Library
CIAA, Springer (2012), pp. 66-77
-
Language Modeling for Automatic Speech Recognition Meets the Web: Google Search by Voice
Ciprian Chelba, Johan Schalkwyk, Boulos Harb, Carolina Parada, Cyril Allauzen, Leif Johnson, Michael Riley, Peng Xu, Preethi Jyothi, Thorsten Brants, Vida Ha, Will Neveitt
University of Toronto (2012)
-
The OpenGrm Open-Source Finite-State Grammar Software Libraries
Brian Roark, Richard Sproat, Cyril Allauzen, Michael Riley, Jeffrey Sorensen, Terry Tai
ACL (System Demonstrations) (2012), pp. 61-66
-
Voice Query Refinement
Cyril Allauzen, Edward Benson, Ciprian Chelba, Michael Riley, Johan Schalkwyk
Interspeech (2012)
-
A Dual Coordinate Descent Algorithm for SVMs Combined with Rational Kernels
Cyril Allauzen, Corinna Cortes, Mehryar Mohri
International Journal of Foundations of Computer Science, vol. 22 (2011), pp. 1761-1779
-
A Filter-based Algorithm for Efficient Composition of Finite-State Transducers
Cyril Allauzen, Michael Riley, Johan Schalkwyk
International Journal of Foundations of Computer Science, vol. 22 (2011), pp. 1781-1795
-
Bayesian Language Model Interpolation for Mobile Speech Input
Interspeech 2011, pp. 1429-1432
-
General Algorithms for Testing the Ambiguity of Finite Automata and the Double-Tape Ambiguity of Finite-State Transducers
Cyril Allauzen, Mehryar Mohri, Ashish Rastogi
International Journal of Foundations of Computer Science, vol. 22 (2011), pp. 883-904
-
Hierarchical Phrase-Based Translation Representations
Gonzalo Iglesias, Cyril Allauzen, William Byrne, Adrià de Gispert, Michael Riley
Proceedings of EMNLP 2011
-
Language Modeling for Automatic Speech Recognition Meets the Web: Google Search by Voice
Ciprian Chelba, Johan Schalkwyk, Boulos Harb, Carolina Parada, Cyril Allauzen, Michael Riley, Peng Xu, Thorsten Brants, Vida Ha, Will Neveitt
OGI/OHSU Seminar Series, Portland, Oregon, USA (2011)
-
Unary Data Structures for Language Models
Jeffrey Sorensen, Cyril Allauzen
Interspeech 2011, International Speech Communication Association, pp. 1425-1428
-
Expected Sequence Similarity Maximization
Cyril Allauzen, Shankar Kumar, Wolfgang Macherey, Mehryar Mohri, Michael Riley
NAACL HLT (2010)
-
Filters for Efficient Composition of Weighted Finite-State Transducers
Cyril Allauzen, Michael Riley, Johan Schalkwyk
CIAA (2010), pp. 28-38
-
Large-Scale Training of SVMs with Automata Kernels
Cyril Allauzen, Corinna Cortes, Mehryar Mohri
CIAA (2010), pp. 17-27
-
On-Demand Language Model Interpolation for Mobile Speech Input
Brandon Ballinger, Cyril Allauzen, Alexander Gruenstein, Johan Schalkwyk
Interspeech (2010), pp. 1812-1815
-
SVM Optimization for Lattice Kernels
Cyril Allauzen, Corinna Cortes, Mehryar Mohri
Mining and Learning with Graphs (2010)
-
A Generalized Composition Algorithm for Weighted Finite-State Transducers
Cyril Allauzen, Michael Riley, Johan Schalkwyk
Interspeech 2009
-
N-Way Composition of Weighted Finite-State Transducers
International Journal of Foundations of Computer Science, vol. 20 (2009), pp. 613-627
-
Michael Riley, Cyril Allauzen, Martin Jansche
Proceedings of the North American Chapter of the Association for Computational Linguistics -- Human Language Technologies (NAACL HLT) 2009 conference, Tutorials
-
3-Way Composition of Weighted Finite-State Transducers
Proceedings of the 13th International Conference on Implementation and Application of Automata (CIAA 2008), Springer-Verlag, Heidelberg, Germany, San Francisco, California, pp. 262-273
-
General Algorithms for Testing the Ambiguity of Finite Automata
Cyril Allauzen, Mehryar Mohri, Ashish Rastogi
DLT 2008, LNCS 5257, Springer, pp. 108-120
-
General Algorithms for Testing the Ambiguity of Finite Automata
Cyril Allauzen, Mehryar Mohri, Ashish Rastogi
Proceedings of the Twelfth International Conference on Developments in Language Theory (DLT 2008), Springer, Heidelberg, Germany, Kyoto, Japan
-
Linear-Space Computation of the Edit-Distance between a String and a Finite Automaton
London Algorithmics 2008: Theory and Practice, College Publications (to appear)
-
Sequence Kernels for Predicting Protein Essentiality
Cyril Allauzen, Mehryar Mohri, Ameet Talwalkar
Proceedings of ICML 2008
Previous Publications
-
OpenFst: a General and Efficient Weighted Finite-State Transducer Library
Cyril Allauzen, Michael Riley, Johan Schalkwyk, Wojciech Skut, Mehryar Mohri
Proceedings of the 12th International Conference on Implementation and Application of Automata (CIAA 2007), Springer-Verlag, Heidelberg, Germany, Prague, Czech Republic
-
A Unified Construction of the Glushkov, Follow, and Antimirov Automata
Proceedings of the 31st International Symposium on Mathematical Foundations of Computer Science (MFCS 2006), Springer-Verlag, Heidelberg, Germany, Stará Lesná, Slovakia, pp. 110-121
-
A Unified Construction of the Glushkov, Follow, and Antimirov Automata
MFCS (2006), pp. 110-121
-
A General Weighted Grammar Library
Cyril Allauzen, Mehryar Mohri, Brian Roark
Ninth International Conference on Automata (CIAA 2004), Kingston, Canada, July 22-24, 2004, Springer-Verlag, Berlin-NY (2005)
-
The Design Principles and Algorithms of a Weighted Grammar Library
Cyril Allauzen, Mehryar Mohri, Brian Roark
International Journal of Foundations of Computer Science, vol. 16 (2005)
-
The design principles and algorithms of a weighted grammar library
Cyril Allauzen, Mehryar Mohri, Brian Roark
Int. J. Found. Comput. Sci., vol. 16 (2005), pp. 403-421
-
A General Weighted Grammar Library
Cyril Allauzen, Mehryar Mohri, Brian Roark
Proceedings of the Ninth International Conference on Automata (CIAA 2004), Kingston, Ontario, Canada
-
A General Weighted Grammar Library
Cyril Allauzen, Mehryar Mohri, Brian Roark
CIAA (2004), pp. 23-34
-
A Generalized Construction of Integrated Speech Recognition Transducers
Cyril Allauzen, Mehryar Mohri, Brian Roark, Michael Riley
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), Montreal, Canada
-
An Optimal Pre-Determinization Algorithm for Weighted Transducers
Theoretical Computer Science, vol. 328 (2004)
-
An optimal pre-determinization algorithm for weighted transducers
Theor. Comput. Sci., vol. 328 (2004), pp. 3-18
-
General Indexation of Weighted Automata -- Application to Spoken Utterance Retrieval
Cyril Allauzen, Mehryar Mohri, Murat Saraclar
Proceedings of the annual meeting of the Human Language Technology conference and North American Chapter of the Association for Computational Linguistics (HLT/NAACL 2004), Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval, Boston, Massachusetts
-
Statistical Modeling for Unit Selection in Speech Synthesis
Cyril Allauzen, Mehryar Mohri, Michael Riley
42nd Meeting of the Association for Computational Linguistics (ACL 2004), Proceedings of the Conference, Barcelona, Spain
-
Statistical Modeling for Unit Selection in Speech Synthesis
Cyril Allauzen, Mehryar Mohri, Michael Riley
42nd Meeting of the Association for Computational Linguistics (ACL 2004), Proceedings of the Conference, Barcelona, Spain
-
p-Subsequentiable Transducers
Seventh International Conference on Automata (CIAA 2002), Tours, France, Springer, Berlin-NY (2003), pp. 24-34
-
An Efficient Pre-Determinization Algorithm
Eighth International Conference on Automata (CIAA 2003), Santa Barbara, CA, Springer, Berlin-NY, pp. 83-95
-
An Efficient Pre-determinization Algorithm
CIAA (2003)
-
Efficient Algorithms for Testing the Twins Property
Journal of Automata, Languages and Combinatorics, vol. 8 (2003)
-
Finitely Subsequential Transducers
Int. J. Found. Comput. Sci., vol. 14 (2003)
-
Finitely Subsequential Transducers
International Journal of Foundations of Computer Science, vol. 14 (2003), pp. 983-994
-
Generalized Algorithms for Constructing Statistical Language Models
Cyril Allauzen, Mehryar Mohri, Brian Roark
ACL (2003)
-
Generalized Algorithms for Constructing Statistical Language Models
Cyril Allauzen, Mehryar Mohri, Brian Roark
41st Meeting of the Association for Computational Linguistics (ACL 2003), Proceedings of the Conference, Sapporo, Japan
-
Generalized Optimization Algorithm for Speech Recognition Transducers
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Hong Kong
-
Generalized optimization algorithm for speech recognition transducers
ICASSP (1) (2003), pp. 352-355
-
p-Subsequentiable Transducers
Proceedings of the Seventh International Conference on Automata (CIAA 2002), Tours, France
-
On the Determinizability of Weighted Automata and Transducers
Proceedings of the workshop Weighted Automata: Theory and Applications (WATA), Dresden, Germany (2002)
-
p-Subsequentiable Transducers
CIAA (2002), pp. 24-34